RAG and chat-with-docs

Explore frameworks and tools for building retrieval-augmented generation systems that enable conversational interaction with documents.

aishwaryanr/awesome-generative-ai-guide
24,755 stars
This project is a community-driven knowledge repository and technical learning resource focused on the field of generative artificial intelligence. It serves as a centralized hub for developers and practitioners to access curated research, tutorials, and foundational concepts necessary for building and deploying modern artificial intelligence applications. The platform distinguishes itself through a collaborative, distributed contribution model that aggregates diverse learning materials into a structured, searchable knowledge base. It covers a wide range of specialized topics, including retri
HTML, Awesome List, Generative AI Skill Paths, Large Language Model Tutorials
nomic-ai/gpt4all
77,375 stars
GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a comprehensive ecosystem for managing the entire model lifecycle, including discovery, downloading, and configuration of local weights. What distinguishes the platform is its integrated retrieval-augmented generation engine, which allows users to index local documents into semantic vect
C++, C++ Inference Backends, Language Model Orchestration, Local AI Inference
datawhalechina/prompt-engineering-for-developers
24,267 stars
This project is a technical curriculum and development guide focused on large language model prompt engineering, fine-tuning, and the creation of retrieval augmented generation applications. It serves as a comprehensive resource for developers to master crafting precise instructions and textual patterns to improve the quality and predictability of model outputs. The material covers the end-to-end workflow of adapting open-source models to specific datasets and integrating language models with vector databases to generate responses based on private information. It also provides a systematic ap
Jupyter Notebook, Prompt Engineering, Prompt Engineering Guides, Example-Based Prompting
QuivrHQ/quivr
39,165 stars
Quivr is a retrieval-augmented generation platform designed to transform raw documents into searchable knowledge bases. It functions as a centralized environment where users can ingest files, index them into vector databases, and interact with language models to receive contextually relevant, data-backed responses. The platform distinguishes itself through an agentic workflow orchestrator that sequences retrieval tasks, tool execution, and model interactions to resolve complex, multi-step queries. This engine is entirely configuration-driven, allowing users to define document ingestion, chunk
Python, Retrieval Augmented Generation Systems, Agentic Orchestrators, Agentic Workflow Orchestrators
datawhalechina/llm-cookbook
24,263 stars
This repository is a comprehensive set of tutorials and examples for building software powered by large language models. It serves as an application development guide and a prompt engineering framework, providing instructional content for integrating model logic with user interfaces and external data sources. The project provides technical walkthroughs for specialized workflows, including the implementation of retrieval augmented generation using vector databases and semantic search. It includes guidance on adapting pre-trained model weights through fine-tuning with private datasets and the o
Jupyter Notebook, Application Development Guides, LLM Application Development, Agentic Tool Orchestration
zylon-ai/private-gpt
57,278 stars
This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to provide context-aware responses for chat and completion requests. The system distinguishes itself through a database-agnostic abstraction layer that supports various storage backends, ranging from local disk storage to enterprise-grade vector databases. It offers flexible deployment
Python, Context-Aware Chat Interfaces, Local Inference Engines, Privacy-First AI Backends
liaokongVFX/LangChain-Chinese-Getting-Started-Guide
9,039 stars
This project is a collection of tutorials and guides for building large language model applications using the LangChain framework, written in Chinese. It serves as a learning resource for developing software that integrates language models with memory and chain-based logic. The resource provides specific walkthroughs for implementing retrieval augmented generation systems using vector stores and document loaders. It includes guides on creating autonomous agents that dynamically select and execute external tools, as well as tutorials for translating plain text queries into executable database
LLM Application Frameworks, Retrieval-Augmented Generation, Application Development Guides
run-llama/llama_index
50,306 stars
LlamaIndex is a comprehensive development framework designed to connect private or external data sources to large language models. It functions as a data-centric toolkit that enables the construction of retrieval-augmented generation systems, allowing developers to build applications that provide context-aware answers based on specific organizational information. The project distinguishes itself through a robust agentic orchestration engine that supports the creation of autonomous agents capable of multi-step reasoning, memory management, and complex tool execution. Beyond simple retrieval, i
Python, Retrieval-Augmented Generation Frameworks, Agentic Frameworks, Agentic Orchestration Frameworks
cloudwego/eino
9,675 stars
Eino is an AI agent development kit and LLM application framework designed for building autonomous agents and orchestrating complex language model workflows. It serves as a multi-agent orchestration engine and workflow orchestrator, providing a graph-based execution model to route data between models, tools, and retrievers. The framework distinguishes itself through a robust set of multi-agent coordination patterns, including supervisor-led management, sequential flows, and autonomous reasoning loops like ReAct. It features advanced agent execution controls such as active turn preemption, che
Go, Agent Workflow Orchestrations, Agentic Workflow Orchestration, AI Agent Orchestrators
anthropics/claude-cookbooks
45,835 stars
This repository serves as a comprehensive library of architectural blueprints and code examples for integrating large language models into software applications. It functions as a developer learning resource, providing structured tutorials and implementation patterns that demonstrate how to build intelligent features using advanced prompting and data processing techniques. The collection distinguishes itself by focusing on complex reasoning and data-grounding workflows. It provides practical guidance on implementing retrieval-augmented generation pipelines, which connect language models to pr
Jupyter Notebook, Generative AI Integration Patterns, Reasoning Strategies, Retrieval-Augmented Generation
hwchase17/langchainjs
17,822 stars
LangChainJS is an AI agent orchestrator and application framework designed for building autonomous systems that use large language models to plan and execute tasks. It serves as an integration library that connects language models with tools, memory, and external data sources to create context-aware logic and complex workflows. The project provides a provider-agnostic interface and model provider abstraction, allowing applications to switch between different language model providers without rewriting core logic. It includes a toolkit for retrieval augmented generation, utilizing retrievers to
TypeScript, Autonomous Agent Frameworks, Agentic Workflow Orchestration, AI Agent Orchestrators
Mintplex-Labs/anything-llm
61,663 stars
This platform serves as a comprehensive environment for managing private language models, document knowledge bases, and automated agent workflows within secure local infrastructure. It functions as a document-aware workspace that enables users to ingest diverse file formats into searchable repositories, ensuring that all data processing and model inference remain within private, local environments to maintain data sovereignty. The system distinguishes itself through a modular agentic engine that allows for the definition of custom skills and external tool execution. By utilizing a multi-model
JavaScript, Agentic Workflow Engines, AI Agent Orchestrators, Document-Aware AI Workspaces
JoshuaC215/agent-service-toolkit
4,082 stars
This is a framework for building and deploying customizable AI agent services using a standardized reference architecture. It provides the core infrastructure necessary to host multiple agents within a single service, supporting interactive chat interfaces and real-time response streaming. The project distinguishes itself with a human-in-the-loop mechanism that allows agent execution to be paused for manual approval or intervention. It also features path-based routing to direct requests to specific agents and a multi-stage content moderation system to filter outputs through safety guardrails.
Python, Agentic LLM Frameworks, AI Agent Frameworks, Agent Memory Systems
microsoft/graphrag
33,792 stars
GraphRAG is a data processing pipeline and retrieval engine designed to transform unstructured text into interconnected knowledge graphs. By utilizing language models to extract entities and relationships, it builds structured representations of information that enable context-aware retrieval for downstream applications. The system distinguishes itself through hierarchical graph clustering and large-scale data synthesis, which organize massive document corpora into multi-level structures. This approach allows for both vector-based semantic searches and graph-based traversals, providing a comp
Python, Graph-Based Retrieval Augmentation, Graph-Based Retrieval Engines, Context-Aware Retrieval
tmc/langchaingo
9,416 stars
langchaingo is an LLM application framework for Go designed for building language model-powered applications and autonomous agents. It serves as an orchestration library and tool integration framework that allows developers to link prompt sequences and model calls into complex, multi-step workflows. The project provides a toolkit for implementing retrieval-augmented generation pipelines by processing unstructured documents and retrieving relevant context via vector search. It includes a dedicated integration layer for indexing high-dimensional embeddings and performing similarity searches acr
Go, Autonomous Agents, Language Model Integrations, Agent Memory Stores
infiniflow/ragflow
82,922 stars
This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasoning workflows. By integrating document intelligence with advanced retrieval pipelines, the platform enables the creation of grounded, verifiable responses supported by traceable citations. The platform distinguishes itself through deep document understanding and sophisticated know
Python, Autonomous Agents, Chat Assistants, Grounded Answer Generation
alibaba/spring-ai-alibaba
8,415 stars
This project is a Java-based framework integration that provides an AI agent runtime, a graph-based AI workflow engine, and an LLM orchestration framework for Spring applications. It enables the development of stateful autonomous agents and the implementation of retrieval-augmented generation systems using document processing and vector databases. The framework distinguishes itself through a graph-based workflow runtime for designing complex AI pipelines with conditional routing and persistent state. It supports multi-agent orchestration via service-discovery coordination and provides human-i
Java, AI Integration Frameworks, AI Workflow Engines, AI Workflow Orchestration
HKUDS/LightRAG
36,651 stars
LightRAG is a graph-based retrieval framework designed to build retrieval-augmented generation pipelines. It structures unstructured text into knowledge graphs, enabling multi-hop reasoning and complex query synthesis across large document collections. By integrating dense vector embeddings with structured knowledge graphs, the system facilitates both similarity-based and relationship-aware information retrieval. The framework distinguishes itself through a dual-level retrieval strategy that combines low-level keyword matching with high-level semantic graph traversal to capture both specific
Python, Knowledge Graph Retrieval Systems, Retrieval Augmented Generation Pipelines, Graph Reasoning Systems
langchain4j/langchain4j
12,346 stars
LangChain4j is a framework and library for building applications powered by large language models on the JVM. It provides a unified API for developing AI agents, implementing retrieval augmented generation, and integrating generative AI capabilities into professional software built with frameworks like Spring Boot or Quarkus. The project enables the creation of autonomous agents that can reason through tasks, manage memory, and execute external tools to achieve specific goals. It differentiates itself through a unified model interface that allows developers to switch between multiple model pr
Java, JVM LLM Applications, Provider-Agnostic Model Interfaces, Agentic Reasoning Loops
patchy631/ai-engineering-hub
35,826 stars
This project serves as an educational resource and technical guide for building production-ready intelligent systems. It provides a collection of hands-on tutorials, blueprints, and documentation focused on the development of applications powered by large language models, autonomous agentic workflows, and retrieval-augmented generation. The repository distinguishes itself by offering structured implementations for multi-agent orchestration and standardized communication protocols. It enables developers to integrate external tools and data sources into their systems, ensuring interoperability
Jupyter Notebook, Agentic Orchestration Patterns, Agentic Workflow Orchestration, AI Engineering Tutorials
mayooear/gpt4-pdf-chatbot-langchain
16,542 stars
This project is a framework for building custom AI chatbots capable of PDF document analysis. It implements Retrieval Augmented Generation to connect a large language model to private document data. The system utilizes graph-based agent orchestration to control conversation flow and decision logic. It maintains context across interactions through thread-based state management and delivers AI responses to the user interface via real-time streaming. The project covers PDF document ingestion through chunk-based processing and vector-store retrieval. It includes mechanisms for query-based data r
TypeScript, Knowledge Base Retrieval, Conversation State Management, Conversation State Persistence
ChatGPTNextWeb/NextChat
88,256 stars
NextChat is a self-hosted web application that provides a unified interface for interacting with multiple large language models. It functions as a conversational platform where users can manage and switch between diverse AI providers through configurable API backends, maintaining full control over their data and infrastructure. The platform features a persistent session layer designed to handle long-running dialogues by managing message history and context. It distinguishes itself through a structured prompt engineering environment that allows for the development and application of templates
TypeScript, Conversational State Managers, Language Model Interaction Patterns, LLM Chat Interfaces
gventuri/pandas-ai
23,587 stars
Pandas AI is a data analysis library and natural language interface that uses large language models to perform conversational querying on structured datasets. It functions as a retrieval-augmented generation framework designed to translate plain text questions into executable code for extracting insights from dataframes and structured files. The system includes a dedicated sandbox execution environment that runs AI-generated analysis code within an isolated container to prevent security risks and system compromise. It employs a natural language translation layer and contextual retrieval to ma
Python, Natural Language Code Generators, Conversational Libraries, Intent Translation Layers
CherryHQ/cherry-studio
47,419 stars
Cherry Studio is a cross-platform desktop application that serves as a centralized workspace for managing and interacting with multiple artificial intelligence models. It functions as a local-first orchestrator, prioritizing user privacy by storing all conversation history and knowledge bases directly on your device. By providing a unified interface for both cloud-based and local AI services, the platform simplifies API key management and allows for consistent model interaction across different operating systems. The application distinguishes itself through a robust retrieval-augmented genera
TypeScript, Model Orchestrators, Retrieval Augmented Generation Pipelines, Autonomous Agent Frameworks
dataelement/bisheng
11,455 stars
Bisheng is an enterprise AI framework and LLM DevOps platform designed to manage the full lifecycle of large language models. It provides a unified system for dataset curation, supervised fine-tuning, model versioning, and performance evaluation. The platform features a visual workflow orchestrator for building retrieval-augmented generation pipelines and complex task sequences using flowcharts with conditional logic and human intervention points. It also includes an AI agent framework that uses a specialized guidance language to embed domain expertise and professional business logic into aut
TypeScript, LLM Lifecycle Management, LLM Operations Platforms, Agent Steering Languages
chroma-core/chroma
26,198 stars
Chroma is a specialized vector database designed to index and retrieve high-dimensional data representations for semantic similarity search. It functions as a comprehensive platform for information retrieval, enabling the storage and management of unstructured documents alongside structured metadata. By mapping data into numerical representations, the system facilitates rapid similarity lookups across large datasets. The platform distinguishes itself through a hybrid search infrastructure that combines dense vector embeddings with sparse keyword and regular expression matching to balance sema
Rust, Vector Databases, Hybrid Search Engines, Vector Search
bragai/bRAG-langchain
4,028 stars
bRAG-langchain is a framework for building retrieval augmented generation pipelines using LangChain to connect documents with language models. It functions as a vector store orchestrator that manages document indexing and retrieval strategies to improve context accuracy. The system implements an advanced retrieval pipeline featuring a semantic query router that directs natural language inputs to specific data sources or prompts. It includes a metadata filtering engine that translates natural language queries into structured schemas to narrow search results. The project covers hybrid search o
Jupyter Notebook, RAG Pipelines, Hybrid Search Systems, Metadata Filtering
TabbyML/tabby
33,605 stars
Tabby is a self-hosted AI coding assistant designed to provide real-time code completion and interactive chat capabilities within development environments. By functioning as a private server application, it allows teams to maintain control over their infrastructure and data while integrating intelligent code generation directly into their existing workflows. The platform distinguishes itself through its repository-aware knowledge retrieval and multi-model orchestration. It indexes local and remote source code repositories and technical documentation into a searchable vector-based knowledge gr
Rust, Code Generation Assistants, Self-Hosted Coding Assistants, Conversational Coding Assistants
h2oai/h2ogpt
12,016 stars
h2oGPT is a self-hosted platform designed for running large language models and executing retrieval-augmented generation workflows locally. It provides a comprehensive web interface that allows users to index private document collections into searchable databases, enabling context-aware question answering and summarization without exposing sensitive data to external services. The platform distinguishes itself by offering a modular architecture that supports both local model execution and connections to external inference servers. It facilitates the development of autonomous agents capable of
Python, Generative AI Dashboards, Local Language Model Hosting, Local Model Execution
dair-ai/Prompt-Engineering-Guide
75,678 stars
This project is a comprehensive educational resource and technical guide focused on the development, optimization, and application of large language models. It provides a structured curriculum for mastering prompt engineering, ranging from foundational principles of instruction design to advanced techniques for improving model reasoning, accuracy, and reliability. The guide distinguishes itself by offering deep technical insights into agentic workflows and autonomous system design. It covers the implementation of multi-step reasoning chains, tool integration through function calling, and stat
MDX, Agentic Orchestration, Agentic Orchestration Frameworks, Prompt Engineering
imClumsyPanda/langchain-ChatGLM
38,183 stars
This project is a LangChain-based framework for building retrieval-augmented generation systems, autonomous agents, and multimodal chatbots. It functions as an open-source orchestrator that connects local inference engines and online APIs to manage various large language model deployments. The system distinguishes itself by providing specialized interfaces for local knowledge bases, allowing the loading and vectorization of private documents to create context-aware assistants. It also supports multimodal capabilities, enabling the processing of both text and image inputs through vision-capabl
Python, Agent Orchestration Frameworks, LLM Orchestrators, Agentic LLM Frameworks
Shubhamsaboo/awesome-llm-apps
114,725 stars
This repository serves as a comprehensive collection of resources, templates, and starter code for building artificial intelligence applications. It provides a centralized hub for developers to access practical implementations of common workflows, including retrieval-augmented generation pipelines and autonomous agent loops, alongside educational materials designed to support rapid prototyping and experimentation. The project distinguishes itself by offering a dual focus on technical implementation and critical analysis. It provides a library of lightweight, single-file agents and tutorials f
Python, AI Application Templates, Awesome List, RAG Pipelines
StanGirard/quiver
39,167 stars
Quiver is a framework for integrating retrieval augmented generation into applications. It provides a generative AI integration layer that connects large language models with vector stores to produce context-aware responses based on custom data. The project features a knowledge base pipeline that parses diverse file types into searchable embeddings and a vector database orchestrator to manage data across different storage implementations. It utilizes a provider-agnostic model interface, allowing users to switch between various external AI providers or local models through a single unified sys
Python, RAG Frameworks, Generative AI Integration Layers, LLM Provider Integrations
labring/FastGPT
27,132 stars
FastGPT is a comprehensive platform for building, deploying, and managing context-aware artificial intelligence applications. It provides a unified environment that integrates custom data sources with language models, utilizing a retrieval-augmented generation engine to ground responses in accurate, domain-specific information. The system is designed for enterprise-scale use, featuring multi-tenant architecture, administrative controls, and secure authentication protocols including OAuth 2.0 and custom single sign-on integration. The platform distinguishes itself through a visual, node-based
TypeScript, AI Application Platforms, Generative Answer Engines, Retrieval-Augmented Generation Frameworks
datawhalechina/all-in-rag
3,989 stars
This project is a retrieval augmented generation framework designed to build pipelines that connect unstructured data and knowledge graphs with large language models. It functions as a vector database orchestrator for indexing text and multimodal content, as well as a system for translating natural language queries into structured database commands. The framework integrates a hybrid retrieval engine that combines dense vector search with sparse keyword matching to increase the precision of retrieved contexts. It further enhances reasoning and relationship mapping through a graph-augmented ret
Python, RAG Pipelines, Document and Unstructured Extraction, Graph Knowledge Indexing
chatboxai/chatbox
40,499 stars
Chatbox is a cross-platform desktop application that provides a unified interface for interacting with a wide range of artificial intelligence models. It functions as a model-agnostic client, allowing users to connect to various third-party AI providers or execute open-source models directly on their own hardware. By centralizing these diverse services into a single workspace, the application enables users to manage multiple chat sessions, adjust model parameters, and switch between different AI backends with ease. The project distinguishes itself through a local-first architecture that prior
TypeScript, AI Orchestration Platforms, Local Model Runtimes, Model Provider Integrations
Cinnamon/kotaemon
25,139 stars
Kotaemon is an orchestration framework designed for building modular, agentic workflows that integrate document processing, retrieval-augmented generation, and multi-step reasoning. It provides a comprehensive platform for developing document-based question answering systems, allowing users to chain language models, prompt templates, and external tools into complex, automated pipelines. The system distinguishes itself through a highly modular architecture that emphasizes component-based composition and schema-driven data exchange. It supports autonomous agents capable of decomposing complex q
Python, Agentic Reasoning Frameworks, Conversational Retrieval, Grounded Answer Generation
fighting41love/funNLP
81,299 stars
This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentation to assist developers in navigating the evolving artificial intelligence ecosystem. The repository distinguishes itself by acting as an aggregator for AI model evaluation and benchmarking. It provides access to tools that enable the simultaneous comparison of multiple conversa
Python, Awesome List, Language Model Integrations, Version-Controlled Knowledge Bases
agentscope-ai/agentscope
26,895 stars
Agentscope is a comprehensive toolkit for developing and orchestrating autonomous multi-agent systems. It provides a unified framework for building agents that can reason, execute tools, and manage memory, enabling the creation of complex, collaborative workflows where multiple specialized agents interact to solve multi-step objectives. The platform distinguishes itself through a robust orchestration engine that supports both sequential and concurrent agent pipelines. It utilizes a centralized event bus for real-time telemetry, allowing developers to track agent reasoning, tool usage, and sys
Python, Agentic LLM Frameworks, Multi-Agent Orchestration Frameworks, Agent Task Routers
huggingface/transformers
161,630 stars
Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering specialized architectures for both text and vision processing. The framework includes tools for managing the entire model lifecycle, from data preprocessing and tokenization to distributed training and inference. The library features extensive support for model optimization and
Python, API Frameworks, Byte Pair Encodings, Hybrid
datawhalechina/llm-universe
13,269 stars
llm-universe is a structured learning resource and technical guide focused on the development of large language model applications. It serves as a curriculum for mastering model orchestration, the creation of autonomous conversational agents, and the implementation of retrieval-augmented generation systems. The project provides detailed instructions on connecting model APIs with memory and tools to create execution chains. It specifically covers the construction of retrieval pipelines, including the process of cleaning raw documents, generating embeddings, and integrating vector databases to
Jupyter Notebook, LLM Application Development Curricula, LLM Tutorials, Retrieval-Augmented Generation
xtekky/gpt4free
66,335 stars
This project provides a unified interface for interacting with a wide range of artificial intelligence services, acting as a central orchestration layer for text and image generation. It standardizes access to diverse AI backends, allowing developers to integrate multiple language and vision models through a single, consistent programming interface. By abstracting provider-specific protocols and authentication requirements, the tool simplifies the development of applications that rely on external AI services. The platform distinguishes itself through a resilient request routing architecture d
Python, AI Request Routers, Conversation Management, Failover Strategies
meta-llama/llama-cookbook
18,375 stars
This project is a collection of implementation guides, recipes, and developer resources for building applications with Llama models. It serves as a comprehensive kit for developing autonomous agents, establishing retrieval-augmented generation systems, and executing model fine-tuning. The resource provides specific patterns for multimodal workflows that process text, images, and audio. It includes specialized guidance on adapting pre-trained model weights for targeted tasks and implementing tool-calling orchestration to connect models with external APIs and functions. The codebase covers a b
Jupyter Notebook, Agentic LLM Frameworks, AI Agent Development, External Tool Integration
mlabonne/llm-course
80,178 stars
This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as well as the practical implementation of supervised instruction fine-tuning and preference-based model alignment. The repository distinguishes itself by providing a deep dive into advanced model composition and optimization techniques. It details methodologies for weight-space mode
AI Research Repositories, Awesome List, Fine-Tuning Strategies
NVIDIA/GenerativeAIExamples
3,802 stars
This project is a library of reference implementations and blueprints for deploying large language models and generative AI workflows. It provides a collection of practical examples designed to guide the deployment of generative systems. The repository features architectural patterns for autonomous agentic workflows that utilize reasoning and tool integration to execute multi-step tasks. It also includes frameworks and templates for building retrieval-augmented generation pipelines that connect language models to vector databases and external data sources. The codebase covers several functio
Jupyter Notebook, LLM and Transformer Examples, Agentic Workflow Automations, Agentic Workflows
khoj-ai/khoj
35,163 stars
Khoj is a self-hosted artificial intelligence platform designed for personal knowledge management and semantic information retrieval. It functions as a private assistant that indexes your local documents, notes, and external workspaces, allowing you to interact with your data through natural language queries and conversational chat. By maintaining a local-first architecture, the system ensures that your information remains under your control while providing context-aware responses grounded in your personal knowledge base. The platform distinguishes itself through a modular, cross-platform int
Python, Personal AI Assistants, Personal Knowledge Management Systems, Semantic Search Engines
PacktPublishing/LLM-Engineers-Handbook
4,774 stars
This project is an educational resource and engineering guide for building, deploying, and optimizing large language model applications and production pipelines. It serves as a blueprint for cloud AI infrastructure, providing a framework for orchestrating inference endpoints, data warehouses, and scalable production environments. The repository provides specific implementation patterns for retrieval augmented generation to ground model responses in external data. It includes a training workflow for crawling, structuring, and processing datasets to facilitate model fine-tuning, alongside an ev
Python, LLM Engineering Guides, Retrieval-Augmented Generation, Cloud Orchestration
assafelovic/gpt-researcher
27,739 stars
GPT Researcher is an autonomous agent framework designed to automate the process of gathering, synthesizing, and documenting information from diverse web and local sources. It functions as a research-oriented execution environment that orchestrates specialized agents to perform complex, multi-branch research tasks, transforming raw data into structured, factual, and cited reports. The project distinguishes itself through a graph-based orchestration layer that manages state transitions and information flow between specialized agents. It employs recursive tree-search execution to explore comple
Python, Agent Orchestration Frameworks, Agent Orchestration Systems, Agentic Services