Agentic-RAG

Universal Agentic Retrieval-Augmented Generation system that ingests PDFs, builds both vector and knowledge-graph indexes, and answers questions with tool-driven reasoning and citations.

Why This Project

Most RAG systems rely on vector search alone. This system combines vector similarity search with Neo4j knowledge graph traversal, enabling multi-hop reasoning and citation-backed responses that single-index RAG cannot achieve.

Features

LlamaParse → structured Markdown parsing (tables preserved)
Hierarchical chunking with rich metadata
Dual storage: Qdrant (vector) + Neo4j (graph)
LangGraph ReAct agent with vector_search and graph_search tools
FastAPI backend + Streamlit UI
Optional Celery + Redis async ingestion

Tech Stack

Layer	Technology
PDF Parsing	LlamaParse
Embeddings	HuggingFace bge-small-v1.5
Vector DB	Qdrant
Graph DB	Neo4j
Agent	LangGraph ReAct
LLM	Gemini 2.5 Pro / OpenAI / Anthropic
Backend	FastAPI
UI	Streamlit
Async Queue	Celery + Redis

Architecture

graph TD
    classDef user fill:#2d3436,stroke:#74b9ff,stroke-width:2px,color:#dfe6e9
    classDef system fill:#0984e3,stroke:#74b9ff,stroke-width:2px,color:#ffffff
    classDef database fill:#6c5ce7,stroke:#a29bfe,stroke-width:2px,color:#ffffff
    classDef llm fill:#d63031,stroke:#ff7675,stroke-width:2px,color:#ffffff

    User[User]:::user -->|Uploads PDF| RawFolder[data/raw/]
    User -->|Asks Question| UI[Streamlit UI]
    UI <-->|API Calls| API[FastAPI Backend]

    subgraph Ingestion Pipeline [Offline Ingestion Process]
        RawFolder -->|Read Document| LlamaParse[LlamaParse Parser]
        LlamaParse -->|Markdown Text| ProcessedFolder[data/processed/]
        LlamaParse -->|Raw Chunks| Chunker[Hierarchical Chunker]
    end

    subgraph Dual-Database Storage
        Chunker -->|Step 1: Embed Text| Embedding[HuggingFace bge-small-v1.5]
        Embedding -->|384-Dim Vectors| Qdrant[(Qdrant Vector DB)]:::database
        Chunker -->|Step 2: Extract Ontology| GraphExtractor[LLM Graph Extractor]:::llm
        GraphExtractor -->|Entities & Relationships| Neo4j[(Neo4j Knowledge Graph)]:::database
    end

    subgraph Retrieval & Orchestration
        API -->|Query| Agent[Universal ReAct Agent]:::system
        Agent -->|LLM Reasoning| LLM[Foundation Model]:::llm
        Agent -->|Checks semantic similarity| VectorTool[Vector Search Tool]
        Agent -->|Checks multi-hop connections| GraphTool[Graph Search Tool]
        VectorTool -->|Searches| Qdrant
        GraphTool -->|Generates Cypher| Neo4j
    end

    Qdrant -->|Returns top chunks| Agent
    Neo4j -->|Returns connected edges| Agent
    Agent -->|Synthesizes final answer with citations| API

Quickstart

1. Install dependencies

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

2. Configure environment

cp .env.example .env

Update .env with your own credentials. Minimum required for a full run:

LLAMA_CLOUD_API_KEY (PDF parsing)
GEMINI_API_KEY (default agent + graph extraction)
(or OPENAI_API_KEY / ANTHROPIC_API_KEY if you switch providers)
GRAPH_EXTRACTOR_PROVIDER and GRAPH_EXTRACTOR_MODEL_ID (ingestion graph extraction LLM)
RAG_AGENT_PROVIDER and RAG_AGENT_MODEL_ID (retrieval agent LLM)
QDRANT_URL (defaults to http://localhost:6333)
NEO4J_URI, NEO4J_USERNAME, NEO4J_PASSWORD, NEO4J_DATABASE

3. Start dependencies

docker-compose up -d

This brings up Qdrant and Redis. Run Neo4j separately (AuraDB or local install) and point the NEO4J_* env vars to it.

4. Ingest documents

Place PDFs in data/raw/, then run:

python ingest_all.py

The pipeline parses PDFs, chunks them, extracts a knowledge graph, ingests into Qdrant + Neo4j, and moves processed files to data/processed/.

5. Run the API

python main.py

FastAPI serves:

GET / → health message
POST /chat → agent response

6. Run the UI

streamlit run ui/app.py

The Streamlit app calls http://127.0.0.1:8000/chat.

Async ingestion (optional)

If you want background ingestion with Celery:

celery -A worker.app worker --pool=solo --loglevel=info
python submit_ingestion_jobs.py

The solo pool avoids macOS fork crashes from ML libraries used during parsing and embedding.

Project structure

ingestion/        LlamaParse, chunking, graph extraction
retrieval/        LangGraph agent + tools
storage/          Qdrant and Neo4j managers
ui/               Streamlit frontend
main.py           FastAPI backend
ingest_all.py     End-to-end ingestion pipeline
worker.py         Celery worker for async ingestion

Tests (manual scripts)

python test_ingestion.py
python test_storage.py
python test_agent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agentic-RAG

Why This Project

Features

Tech Stack

Architecture

Quickstart

1. Install dependencies

2. Configure environment

3. Start dependencies

4. Ingest documents

5. Run the API

6. Run the UI

Async ingestion (optional)

Project structure

Tests (manual scripts)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
data		data
ingestion		ingestion
models		models
retrieval		retrieval
storage		storage
ui		ui
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.py		config.py
docker-compose.yml		docker-compose.yml
ingest_all.py		ingest_all.py
main.py		main.py
requirements.txt		requirements.txt
submit_ingestion_jobs.py		submit_ingestion_jobs.py
test_agent.py		test_agent.py
test_config.py		test_config.py
test_ingest_api.py		test_ingest_api.py
test_ingestion.py		test_ingestion.py
test_storage.py		test_storage.py
test_ui_upload.py		test_ui_upload.py
test_worker_config.py		test_worker_config.py
worker.py		worker.py

Folders and files

Latest commit

History

Repository files navigation

Agentic-RAG

Why This Project

Features

Tech Stack

Architecture

Quickstart

1. Install dependencies

2. Configure environment

3. Start dependencies

4. Ingest documents

5. Run the API

6. Run the UI

Async ingestion (optional)

Project structure

Tests (manual scripts)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages