Optimized Retrieval-Augmented Generation (RAG) System for Enterprise Knowledge Bases
Python · FAISS · LlamaIndex · HuggingFace · Sentence-Transformers · Cross-Encoder Reranking · Streamlit
Highlights
- Built an end-to-end RAG pipeline over 6,000+ documents, enabling grounded enterprise QA with citation support.
- Conducted embedding and retrieval ablations (MiniLM, BGE, E5, chunking, depth), improving answer accuracy by 10%.
- Reduced hallucination by 15% via hybrid retrieval and cross-encoder reranking while maintaining 1.6s p95 latency; implemented evaluation framework measuring Recall@k, MRR, and grounded accuracy.
Tech stack
PythonFAISSLlamaIndexHuggingFaceSentence-TransformersCross-Encoder RerankingStreamlit
Next steps
Want this page to look even stronger? Add: architecture diagram, metrics (accuracy/latency), screenshots, and links. Update data/profile.ts to include your GitHub/demo URLs.