This repository contains code samples and infrastructure setups for implementing various components of a Retrieval-Augmented Generation (RAG) system. The project is modularized into several distinct services, each addressing a specific part of the RAG pipeline.
The repository is organized by the major steps of the RAG workflow:
1.Launch_vector_DB_container_instance: Scripts for launching the Qdrant vector database container instance.2.Create_vector_DB_client: Contains the Java client for interacting with the vector database (Qdrant).5.Push_embeddings_to_vector_DB: Handles the process of pushing embeddings into the vector database.6.Search_similar_points: Contains the client logic for performing similarity searches on the vector store.
3.1.Creating_embeddings_generator_using_ollama: Setup for using Ollama to generate embeddings.3.Create_embeddings_generator: Scripts and setup for running the embedding generator service.4.Create_embeddings_generator_client: Client implementation for consuming the embedding generation service.
7.Launch_llama_index: Setup for using LlamaIndex, including a local LLM declaration example.7.Launch_PostgreSQL_container: Setup for the PostgreSQL database container, including a Java service client.8.Re-Ranker: Implementation for a re-ranking service (e.g., using CrossEncoderService).
9.Launch_langflow_container: Scripts for launching the LangFlow UI.10.Launch_Docling_container: Scripts for launching Docling container for document parsing.
The typical RAG workflow in this project follows these steps: