Full-stack Retrieval-Augmented Generation (RAG) platform with hybrid search, reranking, OCR ingestion, repository indexing, speech transcription, and evaluation pipelines.
Oracle is a production-focused AI knowledge assistant designed to provide accurate, grounded answers from uploaded documents and GitHub repositories.
Instead of relying on generic LLM responses, Oracle retrieves relevant context from your actual data using a sophisticated retrieval pipeline before generating answers.
It combines:
- π Document ingestion
- π GitHub repository indexing
- π Hybrid semantic + keyword retrieval
- π§ Context reranking
- π€ Speech transcription
- π RAG evaluation metrics
- β‘ Real-time chat interface
This makes Oracle ideal for:
- Developer documentation assistants
- Repository Q&A systems
- Internal enterprise knowledge assistants
- Research document querying
- Technical codebase exploration
Oracle can ingest and understand content from multiple sources:
β
PDF documents
β
GitHub repositories
β
OCR-extracted scanned content
β
Audio transcription workflows
Unlike basic chatbot wrappers, Oracle uses a robust retrieval architecture:
- Structure-aware chunking
- Semantic vector retrieval
- BM25 keyword retrieval
- Reciprocal Rank Fusion (RRF)
- Cohere reranking
- Grounded answer generation
- Automated RAG evaluation
This improves factual accuracy and reduces hallucinations.
Index entire repositories and ask questions like:
βHow does authentication work?β
βWhere is the API routing implemented?β
βWhich file handles embeddings?β
βExplain the database architecture.β
Oracle parses repository trees, extracts meaningful code context, and enables intelligent codebase querying.
Supports richer input workflows:
- Audio transcription using AssemblyAI
- OCR extraction from scanned PDFs
- Voice-driven knowledge querying
Oracle includes evaluation pipelines to assess response quality using:
- Faithfulness
- Answer relevance
- Context precision
This makes the system engineering-focusedβnot just UI-focused.
User Query
β
Frontend Chat UI
β
Backend API Layer
β
Document / GitHub Ingestion
β
Text Parsing + Structure-Aware Chunking
β
Embedding Generation (Voyage AI)
β
Pinecone Vector Storage
β
Hybrid Retrieval (Vector + BM25)
β
Reciprocal Rank Fusion
β
Cohere Reranking
β
Groq Answer Generation
β
RAG Evaluation
β
Grounded Response to User
Documents and repositories are parsed into clean structured text.
Supported ingestion:
- PDFs
- GitHub repositories
- OCR-extracted content
- speech transcripts
Instead of naive fixed-size chunking, Oracle uses structure-aware segmentation:
- heading boundaries
- function blocks
- class boundaries
- file segmentation
- table-aware chunking
This preserves semantic meaning during retrieval.
Chunks are converted into dense semantic vectors using:
Voyage AI embeddings
Each chunk is enriched with source metadata for grounded retrieval.
Oracle combines:
Dense vector similarity from Pinecone.
BM25 lexical retrieval.
Then merges both via:
Reciprocal Rank Fusion (RRF)
This significantly improves recall.
Retrieved candidates are reranked using:
Cohere Rerank API
This ensures only the most relevant context reaches the LLM.
Final context is passed to:
Groq LLM inference
with strict grounding instructions to reduce hallucination.
Generated responses are scored on:
- faithfulness
- relevance
- precision
to monitor answer quality.
- React
- Vite
- TypeScript
- Node.js
- Express
- TypeScript
- Pinecone
- Voyage AI
- Cohere
- Groq
- BM25
- Reciprocal Rank Fusion
- GitHub API
- Clerk
- Supabase
- OCR services
- AssemblyAI
- Vercel
backend/
β£ src/
β β£ rag/ β ingestion, chunking, retrieval, reranking, evaluation
β β£ routes/ β REST API endpoints
β β£ services/ β GitHub, OCR, auth, storage integrations
β β lib/ β provider wrappers and shared utilities
frontend/
β£ src/
β β£ pages/ β main application screens
β β£ components/ β reusable UI components
β β£ hooks/ β frontend custom hooks
β β lib/ β API utilities
cd backend
cp .env.example .env
npm install
npm run build
npm run devcd frontend
npm install
npm run devOracle supports deployment across modern platforms.
See:
- Vercel
- Render
- Railway
Detailed deployment instructions:
DEPLOYMENT.mdOracle demonstrates practical software engineering + applied AI concepts:
β
Production-grade RAG architecture
β
Retrieval engineering
β
Hybrid search systems
β
LLM grounding strategies
β
AI evaluation pipelines
β
GitHub repository intelligence
β
Full-stack TypeScript architecture
β
Real-world deployment workflows
Tushar Panwar
B.Tech @ NIT Jalandhar
Full-Stack Developer | AI Engineering Enthusiast