The visitor is looking for software frameworks or middleware designed to intercept, validate, and sanitize inputs and outputs of Large Language Models to ensure safety and reliability.

portkey-ai/gateway is the closest match — This project is a comprehensive AI gateway that provides the requested guardrails, including PII redaction, prompt injection protection, and a policy engine, while acting as a model-agnostic middleware layer for LLM interactions.. Other strong matches: protectai/llm-guard, dottxt-ai/outlines, microsoft/guidance, instructor-ai/instructor.

Why does portkey-ai/gateway match “a library for guardrails on LLM outputs”?

This project is a comprehensive AI gateway that provides the requested guardrails, including PII redaction, prompt injection protection, and a policy engine, while acting as a model-agnostic middleware layer for LLM interactions.

Why does protectai/llm-guard match “a library for guardrails on LLM outputs”?

LLM Guard is a comprehensive security firewall and guardrail framework that provides modular input/output sanitization, PII redaction, and prompt injection protection, making it a complete solution for securing LLM interactions.

Why does dottxt-ai/outlines match “a library for guardrails on LLM outputs”?

This library provides robust output validation and structural enforcement for LLMs by constraining token sampling, though it focuses more on schema compliance than on broader safety features like PII redaction or prompt injection protection.

Why does microsoft/guidance match “a library for guardrails on LLM outputs”?

This framework provides structured output enforcement and token-level constraints to steer model generation, serving as a powerful tool for validating and controlling LLM outputs even though it focuses more on orchestration than on broad security-specific guardrails like PII redaction.

Why does instructor-ai/instructor match “a library for guardrails on LLM outputs”?

Instructor is a validation library that ensures LLM outputs conform to structured schemas, serving as a specialized tool for output validation and reliability even though it lacks broader features like PII redaction or a general-purpose policy engine.

LLM Output Validation Guardrails

Frameworks and libraries for filtering, sanitizing, and validating structured content generated by large language models.

Find the best repos with AI.We'll search the best matching repositories with AI.

portkey-ai/gateway
Portkey-AI/gateway
12,091View on GitHub
This project is an artificial intelligence gateway that functions as a centralized middleware layer for managing, securing, and observing interactions with language, vision, and audio models. It provides a unified interface that standardizes requests across multiple providers, enabling teams to integrate AI capabilities into their applications through a consistent set of tools and protocols. The gateway distinguishes itself through its comprehensive infrastructure governance and traffic management capabilities. It allows for policy-driven routing, automated failover, and load balancing across different model providers to ensure high availability. Furthermore, it incorporates real-time security guardrails, sensitive data redaction, and virtual credential management, which abstracts provider-specific keys to facilitate secure access control and usage attribution across organizational units. Beyond its core proxying functions, the platform offers extensive observability and operational tools. It captures detailed telemetry, including performance metrics, request tracing, and cost analytics, while providing a centralized repository for prompt versioning and template management. The system also supports semantic response caching to reduce latency and operational costs, alongside features for auditing, feedback collection, and fine-tuning model outputs. The software is designed for deployment within private networks or cloud environments, ensuring full data ownership and compliance with internal security requirements.
This project is a comprehensive AI gateway that provides the requested guardrails, including PII redaction, prompt injection protection, and a policy engine, while acting as a model-agnostic middleware layer for LLM interactions.
TypeScriptLLM MiddlewareModel Safety FiltersData Redaction Tools
View on GitHub12,091
protectai/llm-guard
protectai/llm-guard
2,561View on GitHub
LLM Guard is a security firewall and guardrail framework designed to scan and sanitize inputs and outputs for large language models. It functions as a proxy gateway and security layer to block prompt injections, toxicity, and sensitive data leakage while ensuring that model interactions remain compliant with organizational policies. The system distinguishes itself through a modular scanner pipeline that utilizes local model orchestration to eliminate external network dependencies. It supports real-time security filtering via streaming chunk analysis and implements a fail-fast execution model to reduce latency by terminating the pipeline immediately upon detecting a security violation. The project covers a broad range of capability areas, including prompt security, output moderation, and personally identifiable information redaction. It provides tools for detecting adversarial attacks, validating output consistency and relevance, and preventing data leakage in retrieval augmented generation workflows. Additionally, it includes resource management features to prevent denial-of-service attacks through token limitation. The security interface can be deployed as a containerized Docker image and exposes its scanning capabilities over HTTP for integration with external services.
LLM Guard is a comprehensive security firewall and guardrail framework that provides modular input/output sanitization, PII redaction, and prompt injection protection, making it a complete solution for securing LLM interactions.
PythonLLM Prompt Injection PreventionPrompt Injection Detectors
View on GitHub2,561
dottxt-ai/outlines
dottxt-ai/outlines
13,446View on GitHub
Outlines is a library designed to ensure machine-readable output from generative models by applying programmatic constraints during the token sampling process. It functions as a toolkit for forcing large language models to generate text that strictly adheres to JSON schemas, regular expressions, and formal grammars, enabling the integration of model responses into existing software systems. The library distinguishes itself by integrating formal language rules directly into the sampling loop. It achieves this by converting regular expressions into deterministic finite automata and utilizing logit-based token masking to restrict the model's next-token probability distribution. By tracking the state of a formal grammar and filtering the vocabulary through a prefix tree, the system ensures that every generated sequence conforms to a predefined structural specification. Beyond core generation, the framework provides capabilities for auditing schema compliance to verify data structures against defined rules. This approach supports the design of predictable pipelines where model outputs are guaranteed to be consistent and formatted for downstream parsing. The library is available as a Python package for integration into generative AI workflows.
This library provides robust output validation and structural enforcement for LLMs by constraining token sampling, though it focuses more on schema compliance than on broader safety features like PII redaction or prompt injection protection.
PythonOutput Constraint EnginesModel Output Formatting
View on GitHub13,446
microsoft/guidance
microsoft/guidance
21,502View on GitHub
Guidance is a control framework and generation orchestrator for large language models. It provides a programming layer to steer model outputs through structured templates, schema enforcement, and logical flow management. The framework distinguishes itself by interleaving model generation with local code execution, enabling the use of loops and conditional branching within a single session. It employs grammar-based token constraints and regular expressions to force models to sample only from tokens that satisfy a specific structural format, ensuring strict adherence to predefined data models. The project further optimizes performance by injecting known tokens directly into the output stream to bypass unnecessary model computation. It also includes a mock execution layer for debugging generation logic and validating prompts without calling external APIs.
This framework provides structured output enforcement and token-level constraints to steer model generation, serving as a powerful tool for validating and controlling LLM outputs even though it focuses more on orchestration than on broad security-specific guardrails like PII redaction.
Jupyter NotebookOutput Constraint EnginesStructured Output Enforcements
View on GitHub21,502
instructor-ai/instructor
instructor-ai/instructor
13,181View on GitHub
Instructor is a schema enforcement and validation library designed to transform language model outputs into structured, type-safe data formats. It functions as a validation layer that uses Pydantic to ensure model responses conform to specific data models, acting as a tool for forcing large language models to return data in predefined schemas. The project differentiates itself through a recursive error-feedback loop that automatically retries requests when structural errors occur, passing validation failure messages back to the model to guide corrections. It also includes a streaming parser capable of processing partial fragments of structured objects in real time as they are generated. The library covers broad capabilities for structured data extraction, including the parsing of complex hierarchical information and nested structures into machine-readable formats. It utilizes prompt injection to translate type definitions into schema instructions and provides a type-safe wrapper interface to map raw responses directly into typed objects.
Instructor is a validation library that ensures LLM outputs conform to structured schemas, serving as a specialized tool for output validation and reliability even though it lacks broader features like PII redaction or a general-purpose policy engine.
PythonStructured Output EnforcementsModel Output Formatting
View on GitHub13,181
guardrails-ai/guardrails
guardrails-ai/guardrails
7,033View on GitHub
Guardrails is a Python SDK that wraps calls to large language models with configurable validation pipelines, corrective actions, and structured output generation. It provides a unified API layer that connects to over 100 language models, applying consistent validation, streaming, and error-handling across providers. The framework validates and corrects model responses against safety and quality rules, detecting and mitigating risks in both inputs and outputs using pre-built and custom validators. The project distinguishes itself through a validator-pipeline architecture that sequentially applies reusable validation rules and can automatically retry prompts or fix outputs when checks fail. It supports real-time streaming validation that applies guardrails incrementally as tokens arrive, and generates validated JSON or structured data from free-form model responses using user-defined schemas and function calling. Guardrails also offers an OpenAI-compatible server and a Flask-based REST API server for remote validation, along with LangChain integration that converts guardrail validators into runnable objects for chains and agents. The framework includes an observability layer that logs every model interaction, validator result, and performance metric for export to monitoring and debugging platforms. It supports custom model adapters for unsupported LLM APIs, user-defined validation rules, and declarative configuration files that specify validators and violation responses. The system handles concurrent LLM interactions with async support and parallelization for efficient real-time processing.
This framework provides a comprehensive suite for intercepting and validating LLM inputs and outputs, featuring a robust policy engine, PII redaction capabilities, and model-agnostic integration that directly addresses your requirements for safety and reliability.
PythonLLM Output Constraints
View on GitHub7,033
567-labs/instructor
567-labs/instructor
13,176View on GitHub
Instructor is a framework designed for structured data extraction, validation, and language model integration. It functions as a library that transforms unstructured text into validated, type-safe objects by leveraging schema definitions and model-specific tool-calling capabilities. By acting as a validation middleware, the project ensures that language model outputs strictly conform to defined data structures. The library distinguishes itself through a robust validation-based retry loop that automatically re-submits failed responses with error feedback to iteratively correct schema compliance. It provides a provider-agnostic client abstraction that normalizes diverse model interfaces into a unified execution layer, while its schema-driven prompt synthesis automatically generates model instructions by introspecting class definitions and field annotations. Additionally, the framework supports polymorphic schema mapping for complex data structures and enables incremental stream processing to yield validated objects in real-time as they are generated. Beyond its core extraction capabilities, the project offers a comprehensive suite of tools for managing the full lifecycle of model interactions. This includes support for asynchronous execution, multimodal data processing, and extensive observability features such as token usage tracking and event-driven lifecycle hooks. Developers can also utilize built-in mechanisms for caching, safety management, and automated error recovery to maintain reliable production workflows. The library is distributed as a Python package and provides a unified interface that extends existing client objects without requiring modifications to their original source code.
This framework functions as a validation middleware that ensures LLM outputs conform to structured schemas, providing essential reliability and safety features like PII redaction and automated error recovery.
PythonStructured Output EnforcementsStructured Output Parsers
View on GitHub13,176
berriai/litellm
BerriAI/litellm
50,579View on GitHub
LiteLLM is a unified gateway and proxy server designed to centralize access to over one hundred language model providers. It provides a standardized API interface that abstracts vendor-specific schemas, allowing developers to interact with diverse models through a single, consistent format. By acting as a central traffic management layer, it enables organizations to route, secure, and govern model interactions across multiple deployments. The platform distinguishes itself through its policy-driven architecture, which uses configuration-based routing to manage traffic distribution, load balancing, and automatic fallbacks without requiring code changes. It incorporates a robust security and compliance layer that enforces content moderation, secret redaction, and fine-grained access control. Additionally, it supports complex operational requirements such as semantic routing, rule-based complexity scoring, and persistent virtual key management for multi-tenant environments. Beyond core routing, the project provides comprehensive governance and observability tools to monitor usage, track spending, and log request metadata across teams. It includes an integrated software development kit for tool calling and agent orchestration, alongside support for advanced features like response caching, batch processing, and structured output configuration. The system is designed for enterprise-wide deployment, offering features for audit logging, single sign-on integration, and granular cost reporting.
LiteLLM acts as a centralized gateway that includes built-in content validation, PII redaction, and policy-driven security controls, making it a robust middleware solution for governing and securing LLM interactions.
PythonModel Safety FiltersData Redaction Tools
View on GitHub50,579
nirdiamant/agents-towards-production
NirDiamant/agents-towards-production
17,375View on GitHub
This project is a comprehensive framework for developing, orchestrating, and deploying autonomous agents. It provides a structured environment for building agents that utilize reasoning loops to perform multi-step tasks, manage state through graph-based workflows, and interact with external tools. By mapping unstructured model outputs into typed schemas, the framework ensures reliable integration with downstream application logic. The platform distinguishes itself through a focus on production-grade reliability and security. It incorporates hybrid memory systems that combine vector embeddings with structured knowledge graphs to maintain long-term context. To ensure operational safety, the framework includes built-in guardrails that intercept and validate inputs and outputs, mitigating risks such as injection attacks and enforcing strict security policies during agent execution. The system covers the entire agent lifecycle, including intelligent web scraping, retrieval-augmented generation, and containerized serverless deployment. It provides tools for monitoring agent performance, evaluating behavioral reliability, and managing complex multi-agent interactions. Developers can package these applications into portable container images for scalable execution, with built-in support for dynamic resource management and performance optimization in high-traffic environments. The repository is structured as a collection of Jupyter Notebooks that demonstrate the implementation of these agentic patterns and infrastructure components.
This framework provides built-in guardrails for intercepting and validating LLM inputs and outputs, though it is primarily structured as a comprehensive agent orchestration platform rather than a standalone validation middleware.
Jupyter NotebookStructured Output EnforcementsStructured Output Parsers
View on GitHub17,375
vllm-project/semantic-router
vllm-project/semantic-router
3,205View on GitHub
This framework provides semantic routing and PII detection capabilities that function as a middleware layer for intercepting and directing LLM traffic, aligning with the core requirements for safety and reliability validation.
GoPrompt Injection DetectorsJailbreak Detectors
View on GitHub3,205
the-pocket/pocketflow
The-Pocket/PocketFlow
10,046View on GitHub
PocketFlow is a graph-based framework for designing and executing large language model operations and reasoning patterns. It serves as an orchestrator for building goal-oriented autonomous agents, multi-agent systems, and retrieval-augmented generation pipelines. The system is distinguished by its ability to coordinate autonomous AI agents that use shared memory and tools to solve complex goals, supported by a structured output engine that enforces schema-consistent responses. It utilizes graph-based workflow orchestration to manage sequences of model operations and supports supervisor-based coordination for task delegation and self-correction. The platform covers a broad range of capabilities, including asynchronous task runtimes, hierarchical workflow nesting, and map-reduce parallel execution for large-scale data processing. It integrates vector database management for semantic retrieval and includes observability tools such as execution stack tracing and workflow hierarchy visualization. Reliability is managed through automatic retry logic and response guardrails.
PocketFlow is an agentic workflow orchestrator that includes built-in response and output guardrails, making it a capable tool for managing LLM reliability and safety within complex pipelines.
PythonStructured Output Enforcements
View on GitHub10,046
nvidia-nemo/nemo
NVIDIA-NeMo/NeMo
17,389View on GitHub
NeMo is a comprehensive framework designed for the development, training, and deployment of large-scale conversational and generative artificial intelligence models. It provides an integrated platform for building multimodal systems, encompassing speech processing, language modeling, and reinforcement learning alignment. The framework is built to handle the entire lifecycle of AI development, from data curation and model pretraining to production-ready service deployment. The platform distinguishes itself through advanced distributed training capabilities, including tensor and pipeline parallelism, which allow for the execution of models that exceed the memory capacity of individual hardware devices. It incorporates specialized architectures such as mixture-of-experts to optimize computational efficiency and includes a programmable guardrails system to enforce safety policies and topical boundaries on model outputs. Additionally, the framework supports retrieval-augmented generation to ground model responses in external knowledge bases, reducing hallucinations and improving factual accuracy. Beyond core training and inference, the framework offers extensive tools for audio signal processing, speech-to-text transcription, and text-to-speech
This is a comprehensive AI development framework that includes a dedicated guardrails system for enforcing safety policies and output validation, though its primary focus is on the broader lifecycle of model training and deployment rather than being a standalone guardrail middleware.
PythonSafety and Alignment Frameworks
View on GitHub17,389
pydantic/pydantic-ai
pydantic/pydantic-ai
17,791View on GitHub
PydanticAI is a Python framework designed for building production-grade autonomous agents. It provides a unified interface for interacting with diverse language models, enabling developers to construct agents that perform complex tasks through structured data validation, tool execution, and multi-turn conversation management. The library centers on type-safe schema enforcement, ensuring that model inputs and outputs remain consistent and reliable throughout the agent's lifecycle. The framework distinguishes itself through a robust architecture that emphasizes modularity and testability. It utilizes a dependency injection container to manage shared resources and state, allowing for context-aware workflow execution without the need for complex class inheritance. Agents are composed declaratively, bundling instructions, tools, and lifecycle hooks into reusable units. Furthermore, the system includes a state-machine orchestrator that manages asynchronous workflows, enabling developers to define clear transitions and persist progress across execution cycles. Beyond core orchestration, the project offers a comprehensive suite of tools for production environments. This includes deep observability through OpenTelemetry integration, systematic performance evaluation, and security guardrails that support human-in-the-loop approval for sensitive actions. The framework also provides advanced traffic management, such as concurrency controls and usage limits, to maintain system stability and manage operational costs during agent execution.
PydanticAI is an agent-building framework that includes built-in security guardrails and structured data validation, though its primary focus is on agent orchestration rather than serving as a standalone, model-agnostic validation middleware.
PythonStructured Output Enforcements
View on GitHub17,791
openai/openai-agents-python
openai/openai-agents-python
27,191View on GitHub
This project is a Python framework for building autonomous, event-driven agent systems. It provides a unified runtime for orchestrating multi-agent workflows, managing persistent conversation state, and executing code within secure, isolated sandbox environments. The framework is designed to handle complex task delegation, allowing agents to invoke other agents as tools while maintaining context across multi-turn interactions. The framework distinguishes itself through its deep integration with the Model Context Protocol, enabling agents to connect to external data sources and remote services using standardized communication protocols. It features a robust middleware-based guardrail system that intercepts inputs, outputs, and tool calls to enforce safety and quality constraints. Additionally, the platform includes specialized infrastructure for real-time voice AI development, supporting bidirectional streaming of audio and text with automatic interruption handling and low-latency session management. Beyond its core orchestration capabilities, the project provides comprehensive tools for observability, including distributed tracing and lifecycle event monitoring. It supports flexible tool integration through automatic schema generation from code signatures, as well as human-in-the-loop controls that allow for manual approval of agent actions. The system is designed to be extensible, with pluggable storage backends for session persistence and configurable execution environments that range from local processes to containerized workspaces.
This framework provides a robust middleware-based guardrail system that intercepts inputs, outputs, and tool calls, making it a capable tool for enforcing safety and reliability in LLM-driven agent workflows.
PythonStructured Output EnforcementsModel Output Formatting
View on GitHub27,191
raga-ai-hub/ragaai-catalyst
raga-ai-hub/RagaAI-Catalyst
16,150View on GitHub
RagaAI-Catalyst is a suite of software implementation tools providing an SDK, dashboard, and platform for monitoring, debugging, red-teaming, and evaluating agentic AI workflows. It serves as an observability framework for tracing the execution paths of large language models and multi-agent systems. The project distinguishes itself through a security suite for automated red-teaming and vulnerability scanning to detect biases, alongside a centralized prompt registry that decouples templates from application code. It further provides an evaluation platform that combines synthetic data generation with custom metric frameworks to quantify model accuracy and reliability. The system covers broad operational domains including agent behavioral observability, prompt lifecycle management, and the application of output guardrails to block undesirable content. Its monitoring capabilities include trace-based execution graphing, timeline-based event sequencing, and diagnostic tools for analyzing multi-agent interaction flows. The core functionality is delivered via a Python library for recording tool calls and decision-making processes.
This framework provides a comprehensive suite for LLM observability, red-teaming, and output guardrails, making it a suitable tool for managing the safety and reliability of AI workflows.
PythonAgent Execution TracingAgent ObservabilityAdversarial Red Teaming Toolkits
View on GitHub16,150
comet-ml/comet-llm
comet-ml/comet-llm
19,673View on GitHub
Comet LLM is an observability platform and evaluation framework designed for large language model applications and agentic workflows. It functions as a system for tracing, monitoring, and debugging execution flows while providing tools for prompt optimization and the enforcement of AI safety guardrails. The platform distinguishes itself through a combination of model-based scoring and heuristic metrics to quantify output quality and detect hallucinations. It includes a dedicated prompt and agent optimizer with an interactive playground for refining templates and tool configurations. For retrieval-augmented generation, it provides specific monitoring and evaluation tools to identify bottlenecks in document retrieval and synthesis. Broad capabilities cover production monitoring via token usage and feedback dashboards, detailed execution tracing through span recording, and automated performance evaluations integrated into continuous delivery pipelines. The system also implements safety profiles to constrain model outputs and ensure compliant behavior. The platform can be deployed via cloud-hosted workspaces or self-hosted on Kubernetes using Helm charts.
Comet LLM provides a comprehensive observability and evaluation framework that includes integrated safety profiles and runtime interceptors to constrain model outputs, making it a suitable tool for managing LLM reliability and guardrails.
PythonLLM ObservabilityAgent Debugging ToolsAgentic Workflow Debuggers
View on GitHub19,673
archestra-ai/archestra
archestra-ai/archestra
3,570View on GitHub
Archestra is a platform for enterprise AI agent deployment and Model Context Protocol orchestration. It provides a centralized system for configuring specialized agents with specific system prompts and toolsets, and managing the deployment of Model Context Protocol servers that provide large language models with external tools and data sources. The system features an AI agent gateway that exposes configured agents as networked services for external clients and integrated development environments. It incorporates a security suite that provides deterministic guardrails to prevent prompt injection and data exfiltration, alongside application sandboxing and network restrictions for protocol servers. The platform includes a retrieval-augmented generation system with hybrid vector-text search for knowledge base management. It covers broader operational capabilities including role-based access control, enterprise identity integration, distributed telemetry export via OpenTelemetry, and operational cost management through budget limits. The infrastructure is designed for self-hosting within private networks using Kubernetes, with deployment supported via Helm charts and Terraform providers.
Archestra is an enterprise AI agent platform that includes a dedicated security suite for prompt injection prevention and data exfiltration guardrails, serving as a robust middleware layer for managing and securing LLM interactions.
TypeScriptAI Agent DeploymentsMCP Server ManagementAgent Gateways
View on GitHub3,570

LLM Output Validation Guardrails

Portkey-AI/gateway

protectai/llm-guard

dottxt-ai/outlines

microsoft/guidance

instructor-ai/instructor

guardrails-ai/guardrails

567-labs/instructor

BerriAI/litellm

NirDiamant/agents-towards-production

vllm-project/semantic-router

The-Pocket/PocketFlow

NVIDIA-NeMo/NeMo

pydantic/pydantic-ai

openai/openai-agents-python

raga-ai-hub/RagaAI-Catalyst

comet-ml/comet-llm

archestra-ai/archestra