# Self-Hosted End-to-End RAG Applications

> Search results for `end-to-end RAG app you can deploy yourself` on awesome-repositories.com. 117 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/end-to-end-rag-app-you-can-deploy-yourself

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/end-to-end-rag-app-you-can-deploy-yourself).**

## Results

- [agentscope-ai/agentscope](https://awesome-repositories.com/repository/agentscope-ai-agentscope.md) (26,895 ⭐) — Agentscope is a comprehensive toolkit for developing and orchestrating autonomous multi-agent systems. It provides a unified framework for building agents that can reason, execute tools, and manage memory, enabling the creation of complex, collaborative workflows where multiple specialized agents interact to solve multi-step objectives.

The platform distinguishes itself through a robust orchestration engine that supports both sequential and concurrent agent pipelines. It utilizes a centralized event bus for real-time telemetry, allowing developers to track agent reasoning, tool usage, and sys
- [docker/genai-stack](https://awesome-repositories.com/repository/docker-genai-stack.md) (5,333 ⭐) — This project is a containerized development stack and application framework for building retrieval-augmented generation systems. It provides a dockerized AI sandbox that integrates local model runtimes, knowledge graphs, and vector stores to enable the creation of contextual chatbots.

The stack is distinguished by its graph-based vector store, which combines structured knowledge graphs with vector indices for both semantic and structural data retrieval. It allows for local model hosting with CPU or GPU acceleration, enabling generative tasks without reliance on external cloud APIs.

The frame
- [flowiseai/flowise](https://awesome-repositories.com/repository/flowiseai-flowise.md) (53,641 ⭐) — Flowise is a low-code platform designed for building and deploying complex language model workflows through a visual, node-based interface. It functions as an orchestrator for autonomous multi-agent systems, allowing users to construct conversational pipelines by connecting language models, memory stores, and external tools on a drag-and-drop canvas.

The platform distinguishes itself through its support for sophisticated agentic patterns, including supervisor-worker delegation and iterative reasoning strategies. Users can design directed acyclic graphs to manage conditional branching, state p
- [elysiajs/elysia](https://awesome-repositories.com/repository/elysiajs-elysia.md) (18,531 ⭐) — Elysia is a high-performance TypeScript web framework designed for building type-safe backend services. It provides a modular, plugin-based architecture that allows developers to compose server logic, middleware, and validation schemas into scalable application instances. By leveraging native web standards, the framework ensures portability across diverse JavaScript runtimes, including Node.js, Deno, and various edge computing environments.

The framework distinguishes itself through its focus on end-to-end type safety, automatically synchronizing request and response definitions between the s
- [mayooear/gpt4-pdf-chatbot-langchain](https://awesome-repositories.com/repository/mayooear-gpt4-pdf-chatbot-langchain.md) (16,542 ⭐) — This project is a framework for building custom AI chatbots capable of PDF document analysis. It implements Retrieval Augmented Generation to connect a large language model to private document data.

The system utilizes graph-based agent orchestration to control conversation flow and decision logic. It maintains context across interactions through thread-based state management and delivers AI responses to the user interface via real-time streaming.

The project covers PDF document ingestion through chunk-based processing and vector-store retrieval. It includes mechanisms for query-based data r
- [facebookresearch/end-to-end-negotiator](https://awesome-repositories.com/repository/facebookresearch-end-to-end-negotiator.md) (1,398 ⭐) — Deal or No Deal? End-to-End Learning for Negotiation Dialogues
- [cfernandezlab/cfl-end-to-end-layout-recovery-from-360-images](https://awesome-repositories.com/repository/cfernandezlab-cfl-end-to-end-layout-recovery-from-360-images.md) (109 ⭐) — Tensorflow implementation of our end-to-end model to recover 3D layouts of indoor scenes.
- [grobidorg/grobid](https://awesome-repositories.com/repository/grobidorg-grobid.md) (4,954 ⭐) — Grobid is a machine learning system designed to transform academic and scientific PDF publications into structured XML. It functions as a PDF to XML parser and scholarly metadata extractor, identifying and normalizing titles, authors, affiliations, and bibliographic references from research papers.

The system utilizes a deep learning document segmenter to divide raw PDFs into functional regions and employs a bibliographic reference resolver to match citations against external registries for metadata enrichment and DOI resolution. It supports a full machine learning model training pipeline, al
- [cactus-compute/cactus](https://awesome-repositories.com/repository/cactus-compute-cactus.md) (5,363 ⭐) — Cactus is an on-device AI inference engine designed for executing large language models, vision models, and speech-to-text systems on mobile and wearable hardware. It provides a programmable tensor computation graph for defining sequences of matrix operations and activation functions, alongside a local retrieval augmented generation framework that grounds model responses using local text files.

The project features a multiplatform SDK with language bindings for integrating AI capabilities into mobile applications and a model conversion system that transforms external model formats for optimiz
- [datahub-project/datahub](https://awesome-repositories.com/repository/datahub-project-datahub.md) (12,141 ⭐) — DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations.

The platform distinguishes itself through its focus on grounding artificial intelligence and autono
- [stangirard/quivr](https://awesome-repositories.com/repository/stangirard-quivr.md) (39,167 ⭐) — Quivr is a framework for building retrieval-augmented generation pipelines that connect large language models to custom knowledge bases. It serves as a generative AI integration layer that abstracts the process of transforming diverse document sources into searchable context for AI responses.

The project orchestrates the end-to-end flow between document ingestion, vector storage management, and model provider interfaces. It features a vector-store-agnostic retrieval system and a modular API layer that allows for flexible switching between different generative model providers.

The system cove
- [kalyanmurapaka45/end-to-end-image-scraping](https://awesome-repositories.com/repository/kalyanmurapaka45-end-to-end-image-scraping.md) (14 ⭐) — The "Image Scraper" is a Flask web application that allows users to search for images on Google and download them directly to their local machines. The project leverages web scraping techniques to fetch the image URLs from Google search results and then download the images to a specified directory.
- [kalyanmurapaka45/end-to-end-movie-recommendation-system](https://awesome-repositories.com/repository/kalyanmurapaka45-end-to-end-movie-recommendation-system.md) (16 ⭐) — Movie recommendation systems are designed to provide personalized movie suggestions to users, enhancing their entertainment experience by helping them discover movies tailored to their preferences. This project showcases the development of a Movie Recommendation System using various…
- [apache/flink](https://awesome-repositories.com/repository/apache-flink.md) (26,086 ⭐) — Apache Flink is a distributed processing engine designed for both high-throughput, low-latency data streams and finite batch workloads. It functions as a stateful stream processor and a SQL stream processing engine, providing a unified runtime to execute relational queries and event-based transformations.

The system is distinguished by its ability to manage persistent operator state to ensure exactly-once processing guarantees and consistency during failures. It features specialized capabilities for complex event processing to detect temporal patterns and handles out-of-order events using eve
- [datawhalechina/all-in-rag](https://awesome-repositories.com/repository/datawhalechina-all-in-rag.md) (3,989 ⭐) — This project is a retrieval augmented generation framework designed to build pipelines that connect unstructured data and knowledge graphs with large language models. It functions as a vector database orchestrator for indexing text and multimodal content, as well as a system for translating natural language queries into structured database commands.

The framework integrates a hybrid retrieval engine that combines dense vector search with sparse keyword matching to increase the precision of retrieved contexts. It further enhances reasoning and relationship mapping through a graph-augmented ret
- [datawhalechina/prompt-engineering-for-developers](https://awesome-repositories.com/repository/datawhalechina-prompt-engineering-for-developers.md) (24,267 ⭐) — This project is a technical curriculum and development guide focused on large language model prompt engineering, fine-tuning, and the creation of retrieval augmented generation applications. It serves as a comprehensive resource for developers to master crafting precise instructions and textual patterns to improve the quality and predictability of model outputs.

The material covers the end-to-end workflow of adapting open-source models to specific datasets and integrating language models with vector databases to generate responses based on private information. It also provides a systematic ap
- [kalyanmurapaka45/end-to-end-airbnb-price-prediction](https://awesome-repositories.com/repository/kalyanmurapaka45-end-to-end-airbnb-price-prediction.md) (64 ⭐) — In today's fast-paced world, the way we travel and seek accommodations has undergone a remarkable transformation, thanks to platforms like Airbnb. This dynamic marketplace has empowered property owners and travellers, offering a diverse range of lodging options. However, one enduring challenge…
- [abraunegg/onedrive](https://awesome-repositories.com/repository/abraunegg-onedrive.md) (12,577 ⭐) — This project is a command-line synchronization client for OneDrive and SharePoint libraries on Linux. It functions as a synchronization engine that aligns local filesystems with cloud storage through bidirectional, unidirectional, or download-only workflows.

The client supports headless authentication for servers without web browsers and can be deployed as a background service or within a containerized environment. It enables the management of multiple distinct cloud accounts on a single system and integrates with shared SharePoint sites and document libraries.

The synchronization engine inc
- [kalyanm45/end-to-end-chest-disease-classification](https://awesome-repositories.com/repository/kalyanm45-end-to-end-chest-disease-classification.md) (14 ⭐) — By Hema Kalyan Murapaka
- [arialdomartini/back-end-developer-interview-questions](https://awesome-repositories.com/repository/arialdomartini-back-end-developer-interview-questions.md) (16,574 ⭐) — This project is an open-source knowledge repository that serves as a comprehensive technical interview question bank for backend engineering roles. It provides a structured resource for hiring managers and candidates to evaluate proficiency in software design, architectural patterns, and core engineering principles through a curated collection of discussion topics and coding challenges.

The repository functions as a programming paradigm evaluation tool, enabling the assessment of a candidate's understanding of object-oriented, functional, and procedural techniques. It distinguishes itself by
- [infiniflow/ragflow](https://awesome-repositories.com/repository/infiniflow-ragflow.md) (82,922 ⭐) — This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasoning workflows. By integrating document intelligence with advanced retrieval pipelines, the platform enables the creation of grounded, verifiable responses supported by traceable citations.

The platform distinguishes itself through deep document understanding and sophisticated know
- [esperco/front-end-challenge](https://awesome-repositories.com/repository/esperco-front-end-challenge.md) (12 ⭐) — Front End Developer Challenge
- [dataelement/bisheng](https://awesome-repositories.com/repository/dataelement-bisheng.md) (11,455 ⭐) — Bisheng is an enterprise AI framework and LLM DevOps platform designed to manage the full lifecycle of large language models. It provides a unified system for dataset curation, supervised fine-tuning, model versioning, and performance evaluation.

The platform features a visual workflow orchestrator for building retrieval-augmented generation pipelines and complex task sequences using flowcharts with conditional logic and human intervention points. It also includes an AI agent framework that uses a specialized guidance language to embed domain expertise and professional business logic into aut
- [zepl/front-end-challenge](https://awesome-repositories.com/repository/zepl-front-end-challenge.md) (10 ⭐) — Front-end engineer challenge
- [cinnamon/kotaemon](https://awesome-repositories.com/repository/cinnamon-kotaemon.md) (25,139 ⭐) — Kotaemon is an orchestration framework designed for building modular, agentic workflows that integrate document processing, retrieval-augmented generation, and multi-step reasoning. It provides a comprehensive platform for developing document-based question answering systems, allowing users to chain language models, prompt templates, and external tools into complex, automated pipelines.

The system distinguishes itself through a highly modular architecture that emphasizes component-based composition and schema-driven data exchange. It supports autonomous agents capable of decomposing complex q
- [baconjs/bacon.js](https://awesome-repositories.com/repository/baconjs-bacon-js.md) (6,458 ⭐) — Bacon.js is a JavaScript functional reactive programming library used for coordinating complex asynchronous data flows. It functions as an observable event stream framework and an asynchronous data flow orchestrator, allowing developers to model events as declarative streams and properties.

The library distinguishes itself through its ability to manage reactive state and synchronize timing across multiple sources. It provides specialized mechanisms for atomic state synchronization to prevent glitches in derived properties and offers advanced coordination strategies such as asynchronous stream
- [nvidia/generativeaiexamples](https://awesome-repositories.com/repository/nvidia-generativeaiexamples.md) (3,802 ⭐) — This project is a library of reference implementations and blueprints for deploying large language models and generative AI workflows. It provides a collection of practical examples designed to guide the deployment of generative systems.

The repository features architectural patterns for autonomous agentic workflows that utilize reasoning and tool integration to execute multi-step tasks. It also includes frameworks and templates for building retrieval-augmented generation pipelines that connect language models to vector databases and external data sources.

The codebase covers several functio
- [h5bp/front-end-developer-interview-questions](https://awesome-repositories.com/repository/h5bp-front-end-developer-interview-questions.md) (60,886 ⭐) — This project is a curated frontend interview question bank and technical assessment guide. It serves as a web development interview resource for assessing candidates on frontend development, web accessibility, and browser performance.

The collection provides a standardized set of questions to evaluate a developer's knowledge of HTML, CSS, JavaScript, and networking. It is designed to assist in the developer hiring process, engineering team recruiting, and personal technical interview preparation.

The content is organized as a flat-file knowledge base using markdown-based storage and topic-ba
- [loktra/front-end-engineer](https://awesome-repositories.com/repository/loktra-front-end-engineer.md) (19 ⭐) — Front End Engineer Programming Challenge
- [apiki/front-end-challenge](https://awesome-repositories.com/repository/apiki-front-end-challenge.md) (271 ⭐) — Desafio para candidatos a front-end.
- [imclumsypanda/langchain-chatglm](https://awesome-repositories.com/repository/imclumsypanda-langchain-chatglm.md) (38,183 ⭐) — This project is a LangChain-based framework for building retrieval-augmented generation systems, autonomous agents, and multimodal chatbots. It functions as an open-source orchestrator that connects local inference engines and online APIs to manage various large language model deployments.

The system distinguishes itself by providing specialized interfaces for local knowledge bases, allowing the loading and vectorization of private documents to create context-aware assistants. It also supports multimodal capabilities, enabling the processing of both text and image inputs through vision-capabl
- [darcyclarke/front-end-developer-interview-questions](https://awesome-repositories.com/repository/darcyclarke-front-end-developer-interview-questions.md) (60,886 ⭐) — This project is a curated repository of specialized technical questions and assessment guides used to evaluate proficiency in core web technologies. It serves as a question bank and assessment guide for testing knowledge of browser APIs, CSS, JavaScript, and HTTP protocols.

The repository provides a technical skill evaluation framework consisting of open-ended prompts. These are used for front-end candidate evaluation, standardizing technical hiring workflows, and facilitating interview preparation for web developers.

The content is organized via a category-driven information architecture an
- [personare/front-end-challenge](https://awesome-repositories.com/repository/personare-front-end-challenge.md) (134 ⭐) — Desafio para vaga de front-end do @Personare
- [ageron/handson-ml2](https://awesome-repositories.com/repository/ageron-handson-ml2.md) (29,938 ⭐) — This project provides a collection of practical machine learning code examples, including implementations for supervised, unsupervised, and reinforcement learning algorithms. It features deep learning model implementations for convolutional, recurrent, and generative architectures, alongside specific examples of reinforcement learning agents that maximize rewards in simulated environments.

The repository includes dedicated data preprocessing pipelines for sanitization, feature scaling, and dimensionality reduction. It also provides implementations for a wide range of specific models, such as
- [stangirard/quiver](https://awesome-repositories.com/repository/stangirard-quiver.md) (39,167 ⭐) — Quiver is a framework for integrating retrieval augmented generation into applications. It provides a generative AI integration layer that connects large language models with vector stores to produce context-aware responses based on custom data.

The project features a knowledge base pipeline that parses diverse file types into searchable embeddings and a vector database orchestrator to manage data across different storage implementations. It utilizes a provider-agnostic model interface, allowing users to switch between various external AI providers or local models through a single unified sys
- [volcengine/minecontext](https://awesome-repositories.com/repository/volcengine-minecontext.md) (4,960 ⭐) — MineContext is a context management system designed to collect, store, and retrieve multimodal data to build targeted context windows for large language models. It functions as an orchestration tool and retrieval augmented generation framework that utilizes a local vector data store to index documents and enable similarity searches.

The system differentiates itself through a multimodal context collector that gathers information from screen captures, files, and version control systems. It provides mechanisms for proactive information retrieval, extracting summaries and activity records from ca
- [nicejade/front-end-tutorial](https://awesome-repositories.com/repository/nicejade-front-end-tutorial.md) (1,732 ⭐) — :panda_face: 最全的资源教程-前端涉及的所有知识体系，并在 Nice Front-end Tutorial 持续更新；
- [ageron/handson-ml](https://awesome-repositories.com/repository/ageron-handson-ml.md) (25,608 ⭐) — This is a machine learning educational repository consisting of a collection of notebooks and code examples. It provides practical implementations of diverse machine learning algorithms and workflows, ranging from traditional scientific computing to deep learning.

The project features specific implementations of Scikit-Learn models, such as decision trees, random forests, and support vector machines, as well as TensorFlow examples for building neural networks, convolutional layers, and recurrent architectures. It also includes tutorials on reinforcement learning development and the creation o
- [ionic-team/capacitor](https://awesome-repositories.com/repository/ionic-team-capacitor.md) (15,054 ⭐) — Capacitor is a cross-platform mobile framework that enables developers to build native applications using web technologies. It functions as a hybrid app container, wrapping web assets within a native runtime that provides a standardized bridge to device hardware and system-level services. By exposing native functionality through a plugin-based architecture, it allows web applications to access platform-specific features while maintaining a consistent interface across mobile and desktop environments.

The project distinguishes itself by maintaining native project files as source assets, allowin
- [tglide/to-you](https://awesome-repositories.com/repository/tglide-to-you.md) (2 ⭐) — Another to-do app for you, where completing tasks lead to great rewards.
- [datawhalechina/llm-cookbook](https://awesome-repositories.com/repository/datawhalechina-llm-cookbook.md) (24,263 ⭐) — This repository is a comprehensive set of tutorials and examples for building software powered by large language models. It serves as an application development guide and a prompt engineering framework, providing instructional content for integrating model logic with user interfaces and external data sources.

The project provides technical walkthroughs for specialized workflows, including the implementation of retrieval augmented generation using vector databases and semantic search. It includes guidance on adapting pre-trained model weights through fine-tuning with private datasets and the o
- [h2oai/h2ogpt](https://awesome-repositories.com/repository/h2oai-h2ogpt.md) (12,016 ⭐) — h2oGPT is a self-hosted platform designed for running large language models and executing retrieval-augmented generation workflows locally. It provides a comprehensive web interface that allows users to index private document collections into searchable databases, enabling context-aware question answering and summarization without exposing sensitive data to external services.

The platform distinguishes itself by offering a modular architecture that supports both local model execution and connections to external inference servers. It facilitates the development of autonomous agents capable of
- [amarofashion/front-end-challenge](https://awesome-repositories.com/repository/amarofashion-front-end-challenge.md) (103 ⭐) — Challenge for those seeking a role as a front-end developer @amarofashion
- [answerdotai/nbdev](https://awesome-repositories.com/repository/answerdotai-nbdev.md) (5,300 ⭐) — This project is a comprehensive framework for literate programming that enables developers to build production-ready Python libraries entirely within Jupyter Notebooks. By treating notebooks as the primary source of truth, it integrates code, documentation, and testing into a unified development pipeline that exports directly to standard Python modules.

The framework distinguishes itself through specialized tooling designed to overcome the inherent challenges of using notebooks in professional software engineering. It includes custom Git hooks and merge drivers that sanitize volatile notebook
- [contabilizei/front-end-teste](https://awesome-repositories.com/repository/contabilizei-front-end-teste.md) (51 ⭐) — Desafio prático para desenvolvedores front-end que querem entrar para nosso time de ninjas :)
- [timescale/pgai](https://awesome-repositories.com/repository/timescale-pgai.md) (5,802 ⭐) — pgai is a PostgreSQL AI toolkit and framework designed to integrate large language models and vector embeddings directly into a database. It serves as a bridge for executing machine learning model requests and performing text-to-SQL translations within standard database queries.

The project provides an automated vector embedding pipeline that handles the loading, parsing, and chunking of text from tables and unstructured documents. This system utilizes a background worker to synchronize embeddings automatically as source data changes and includes specialized tools for building retrieval-augme
- [chainlit/chainlit](https://awesome-repositories.com/repository/chainlit-chainlit.md) (12,213 ⭐) — Chainlit is a Python framework designed for building and deploying interactive, stateful conversational AI interfaces. It provides a backend-driven platform that connects language models and agent frameworks to a web-based chat frontend, managing the complexities of session state, message history, and real-time communication.

The framework distinguishes itself by offering a component-based UI builder that allows developers to inject interactive widgets, rich media, and data visualizations directly into the chat stream. It supports the visualization of complex agent workflows, enabling users t
- [fish-shell/fish-shell](https://awesome-repositories.com/repository/fish-shell-fish-shell.md) (33,687 ⭐) — This project is an interactive command-line shell designed to provide a user-friendly terminal environment for system interaction and task automation. It functions as both an interactive interface for developers and a scripting runtime, featuring a clean, consistent syntax that simplifies command execution and process management.

The shell distinguishes itself through a focus on discoverability and real-time feedback. It includes a predictive suggestion engine that offers command completions and history-based hints as you type, alongside a dedicated parser that provides immediate visual feedb
- [folhasp/front-end-test](https://awesome-repositories.com/repository/folhasp-front-end-test.md) (227 ⭐) — Teste para candidatos à vaga de desenvolvedor Front-end na Folha de S.Paulo
- [packtpublishing/llm-engineers-handbook](https://awesome-repositories.com/repository/packtpublishing-llm-engineers-handbook.md) (4,774 ⭐) — This project is an educational resource and engineering guide for building, deploying, and optimizing large language model applications and production pipelines. It serves as a blueprint for cloud AI infrastructure, providing a framework for orchestrating inference endpoints, data warehouses, and scalable production environments.

The repository provides specific implementation patterns for retrieval augmented generation to ground model responses in external data. It includes a training workflow for crawling, structuring, and processing datasets to facilitate model fine-tuning, alongside an ev
