37 repository-uri
Systems that enable language models to execute external tools and API functions to extend their operational capabilities.
Distinct from Tool Use and Function Calling: Closest candidate [f0_mt1] is focused on education/techniques rather than a functional software implementation of tool calling.
Explore 37 awesome GitHub repositories matching artificial intelligence & ml · Function Calling Interfaces. Refine with filters or upvote what's useful.
Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with external APIs. The project provides a quantized version of the model to reduce GPU memory usage and supports the development of autonomous agents that can execute code and perform functions to complete complex goals. The system covers a wide range of capabilities including model fine-tuning throug
Implements a system that interprets model output as structured calls to trigger specific external functions.
CodeQwen1.5 is a large language model designed for generating, completing, and analyzing code. It functions as an AI code generator capable of writing programming logic across hundreds of different languages. The model is distinguished by its long-context capabilities, allowing it to process up to one million tokens to reason across entire software repositories. It also operates as a function calling model, utilizing specialized formats to execute complex coding tasks and browser-based automation. The system supports intelligent code completion through fill-in-the-middle capabilities, which
Implements interfaces that enable the model to execute external tools and API functions for coding tasks.
Mistral Inference is a library for running Mistral large language models on a GPU, generating text from prompts with token streaming. It loads pretrained model weights from local disk or a remote registry into GPU memory, then produces output tokens one by one for real-time display in interactive applications. The library supports multimodal prompts that accept image URLs alongside text, enabling visual description and reasoning. It includes content safety guardrails that scan generated text against predefined policies to block or flag policy violations. For structured interactions, it provid
Formats prompts with tool definitions so the model outputs structured function calls.
Acest proiect este o bibliotecă și un framework de inferență pentru modele de limbaj mari, conceput pentru a rula modele pentru generarea de text, rezolvarea problemelor și asistența în programare. Include un framework multimodal pentru procesarea intrărilor combinate de imagine și text și o implementare de tip tool-use care permite execuția funcțiilor externe bazată pe raționamentul modelului. Sistemul dispune de un motor de inferență GPU distribuit care distribuie sarcinile de lucru ale modelelor mari pe mai multe procesoare grafice pentru a crește viteza de procesare și a îndeplini cerințele de memorie. De asemenea, oferă implementarea containerizată a modelelor prin imagini pre-pachetate și dependențe pentru servirea motoarelor de inferență în medii izolate. Biblioteca acoperă o gamă de capabilități, inclusiv analiza intrărilor multimodale, integrarea apelurilor de funcții și completarea codului (fill-in-the-middle) pentru prezicerea segmentelor de cod lipsă. De asemenea, suportă chat-ul interactiv cu modelul printr-o interfață în linie de comandă pentru menținerea sesiunilor conversaționale.
Provides interfaces that enable language models to execute external tools and API functions.
WasmEdge is an extensible WebAssembly runtime that executes WebAssembly bytecode in a secure sandbox for cloud, edge, and embedded applications. It functions as a multi-language compiler, compiling applications written in Rust, JavaScript, Go, and Python into WebAssembly bytecode for sandboxed execution, and as a server-side JavaScript runtime that runs JavaScript programs with ES6 modules, NPM packages, and Node.js-compatible APIs. The runtime also serves as an AI inference runtime, executing AI models from JavaScript using WASI-NN plug-ins for inference tasks on personal devices and edge har
Calls WebAssembly functions annotated with bindgen to pass complex parameters automatically from Go code.
This project is a terminal-based command line interface client and agent orchestrator for interacting with multiple large language model providers. It functions as an OpenAI API client and a local API gateway that exposes chat completions and embeddings through an HTTP server. The system distinguishes itself by providing a retrieval-augmented generation tool for indexing local files and URLs into a vector database to provide custom document context. It allows for the creation of specialized AI agents that combine custom system prompts with tool calling and external function execution. The to
Trigger external tools and organize them into aliases for faster activation.
Spring AI is an application framework for Java that provides a portable, fluent API for integrating AI models, tools, and vector stores into applications. It wraps multiple AI providers behind a common interface, allowing developers to switch between chat, embedding, image, and speech models without changing application code. The framework includes a chainable chat client API similar to WebClient or RestClient, supports both synchronous and streaming interactions, and offers structured output conversion that transforms unstructured AI responses into strongly-typed Java objects. The framework
Uses an advisor to handle the full tool-calling lifecycle automatically without manual intervention.
This project is a Java-based framework integration that provides an AI agent runtime, a graph-based AI workflow engine, and an LLM orchestration framework for Spring applications. It enables the development of stateful autonomous agents and the implementation of retrieval-augmented generation systems using document processing and vector databases. The framework distinguishes itself through a graph-based workflow runtime for designing complex AI pipelines with conditional routing and persistent state. It supports multi-agent orchestration via service-discovery coordination and provides human-i
Enables language models to execute external tools and API functions to extend operational capabilities.
Wasm3 este un interpretor WebAssembly conceput pentru integrarea în runtime-uri embedded. Acesta permite executarea logicii binare portabile pe microcontrolere și hardware cu resurse limitate, oferind suport pentru module care utilizează WebAssembly System Interface pentru a interacționa cu resursele sistemului. Runtime-ul folosește interpretarea bytecode bazată pe registre și dispatch-ul direct-threaded pentru a mapa registrele virtuale la registrele fizice ale CPU-ului. Include o interfață host-call pentru a mapa importurile la funcții C și utilizează validarea statică a modulelor pentru a se asigura că bytecode-ul respectă regulile specificațiilor înainte de execuție. Proiectul oferă gestionarea resurselor prin limite de alocare a memoriei liniare și contorizarea consumului de instrucțiuni (gas metering) pentru a preveni buclele infinite și epuizarea sistemului. Observabilitatea este gestionată prin tracing-ul execuției, monitorizarea fluxului și profilarea operațiunilor interpretorului. Pentru dezvoltare și asigurarea calității, runtime-ul include un REPL interactiv și suportă fuzzing ghidat de acoperire.
Maps WebAssembly imports to C functions to allow guest modules to interact with the host system.
ERNIE is a development toolkit for training, fine-tuning, and deploying large language models built on the PaddlePaddle deep learning platform. It provides a comprehensive suite of core components, including an inference server for vision and language models, a training and fine-tuning toolkit, and a framework for building retrieval-augmented generation systems using private knowledge bases. The project features multimodal AI models capable of reasoning across text, images, and video to perform complex visual understanding and information extraction. It distinguishes itself through specialize
Trains models to recognize and execute external tool calls through specialized function call training methodologies.
GLM-4 is an open weights large language model designed as a multimodal chat system. It functions as a reasoning-focused and multilingual model capable of processing and generating responses across text and visual data types. The model is distinguished by its function-calling capabilities, allowing it to interface with external tools and APIs to execute tasks and retrieve real-time information. It is optimized for complex logical reasoning, mathematical problem solving, and deep research involving long-form content generation. Broad capabilities include multilingual text generation, the creat
Maps model outputs to structured API specifications that trigger external code execution for data retrieval.
GLM-4 is a large language model and fine-tuning framework designed for human-like text production, complex reasoning, and multilingual conversation. It functions as a multimodal system capable of processing high-resolution visual content and as a long-context model designed to analyze documents with a context window of up to one million tokens. The project differentiates itself through a function calling interface that enables AI agent development by connecting the model to external APIs and real-time web browsing. It includes specialized capabilities for generating functional programming cod
Implements a function calling interface that enables the model to execute external tools and APIs.
Guardrails is a Python SDK that wraps calls to large language models with configurable validation pipelines, corrective actions, and structured output generation. It provides a unified API layer that connects to over 100 language models, applying consistent validation, streaming, and error-handling across providers. The framework validates and corrects model responses against safety and quality rules, detecting and mitigating risks in both inputs and outputs using pre-built and custom validators. The project distinguishes itself through a validator-pipeline architecture that sequentially appl
Passes structured tool definitions to the model and processes returned function calls within the guardrails pipeline.
Gopher-lua is a complete implementation of the Lua language and its standard libraries written natively in Go. It serves as an embedded scripting engine and virtual machine that allows Go applications to execute Lua scripts and exchange data between the host and the script environment. The project provides a bytecode compiler to transform source code into a binary format for faster execution. It enables deep integration between the two languages by allowing the registration of native Go functions to be called from scripts, and the invocation of script functions directly from Go. The engine c
Provides an interface for Lua scripts to trigger and execute native Go functions.
llm-zoomcamp is a comprehensive educational program and course for building real-life AI systems using large language models. It serves as a structured curriculum and implementation guide for developing AI applications and retrieval techniques. The project provides instructional material on building retrieval augmented generation pipelines to ground model responses in custom knowledge bases. It includes training on vector database implementation, semantic search, and the use of function calling to create autonomous agentic workflows. The curriculum covers a broad range of system development
Teaches how to implement systems that enable language models to execute external tools and API functions.
Detours is a library for intercepting Win32 API calls and redirecting function calls at runtime on Windows, enabling binary-level instrumentation without requiring access to the original source code. It functions as an API hooking library and binary instrumentation toolkit, allowing developers to monitor or modify the behavior of compiled Windows binaries by hooking into their function execution paths. The project achieves this through detour-based function interception, where the first few instructions of a target function are replaced with a jump to a user-supplied detour function, while pr
Routes calls from one function to a custom replacement, enabling instrumentation or extension of existing APIs.
Registers Python functions that the language model can invoke during a conversation to fetch data or perform actions.
Invokes user-defined functions to fetch data, perform actions, or interact with external systems.
Acest proiect este un ghid cuprinzător pentru scrierea configurațiilor și plugin-urilor Neovim folosind limbajul de programare Lua. Servește drept manual pentru utilizarea API-ului nativ Lua din Neovim pentru a gestiona bufferele, ferestrele și opțiunile editorului. Ghidul se concentrează pe interoperabilitatea dintre Lua și Vimscript, oferind instrucțiuni despre cum să execuți Vimscript din Lua și să apelezi funcții Lua din interiorul Vimscript. De asemenea, oferă un framework pentru dezvoltarea de plugin-uri, acoperind organizarea codului în module externe și crearea de comenzi și mapări de taste personalizate. Documentația acoperă capabilități mai largi, inclusiv automatizarea fluxului de lucru al editorului, manipularea variabilelor interne și gestionarea opțiunilor editorului. Include, de asemenea, instrucțiuni pentru încărcarea fișierelor externe și modificarea liniilor din buffer.
Explains how to call Vimscript functions from Lua and handle the automatic conversion of data types.
The official PyTorch implementation of Google's Gemma models
Generates structured function-call arguments from natural language prompts for agentic workflows.