37 مستودعات
Systems that enable language models to execute external tools and API functions to extend their operational capabilities.
Distinct from Tool Use and Function Calling: Closest candidate [f0_mt1] is focused on education/techniques rather than a functional software implementation of tool calling.
Explore 37 awesome GitHub repositories matching artificial intelligence & ml · Function Calling Interfaces. Refine with filters or upvote what's useful.
Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with external APIs. The project provides a quantized version of the model to reduce GPU memory usage and supports the development of autonomous agents that can execute code and perform functions to complete complex goals. The system covers a wide range of capabilities including model fine-tuning throug
Implements a system that interprets model output as structured calls to trigger specific external functions.
CodeQwen1.5 is a large language model designed for generating, completing, and analyzing code. It functions as an AI code generator capable of writing programming logic across hundreds of different languages. The model is distinguished by its long-context capabilities, allowing it to process up to one million tokens to reason across entire software repositories. It also operates as a function calling model, utilizing specialized formats to execute complex coding tasks and browser-based automation. The system supports intelligent code completion through fill-in-the-middle capabilities, which
Implements interfaces that enable the model to execute external tools and API functions for coding tasks.
Mistral Inference is a library for running Mistral large language models on a GPU, generating text from prompts with token streaming. It loads pretrained model weights from local disk or a remote registry into GPU memory, then produces output tokens one by one for real-time display in interactive applications. The library supports multimodal prompts that accept image URLs alongside text, enabling visual description and reasoning. It includes content safety guardrails that scan generated text against predefined policies to block or flag policy violations. For structured interactions, it provid
Formats prompts with tool definitions so the model outputs structured function calls.
هذا المشروع عبارة عن مكتبة وإطار عمل لاستنتاج النماذج اللغوية الكبيرة مصمم لتشغيل النماذج لتوليد النصوص، وحل المشكلات، والمساعدة في البرمجة. يتضمن إطار عمل متعدد الوسائط لمعالجة مدخلات الصور والنصوص المدمجة وتنفيذ استخدام الأدوات الذي يتيح تنفيذ دوال خارجية بناءً على منطق النموذج. يتميز النظام بمحرك استنتاج GPU موزع يوزع أعباء عمل النماذج الكبيرة عبر معالجات رسومية متعددة لزيادة سرعة المعالجة وتلبية متطلبات الذاكرة. كما يوفر نشر النماذج بالحاويات من خلال صور وتبعيات معبأة مسبقاً لتقديم محركات الاستنتاج في بيئات معزولة. تغطي المكتبة مجموعة من القدرات بما في ذلك تحليل المدخلات متعددة الوسائط، وتكامل استدعاء الدوال، وإكمال الكود في المنتصف للتنبؤ بأجزاء الكود المفقودة. كما تدعم الدردشة التفاعلية مع النموذج عبر واجهة سطر أوامر للحفاظ على جلسات المحادثة.
Provides interfaces that enable language models to execute external tools and API functions.
WasmEdge is an extensible WebAssembly runtime that executes WebAssembly bytecode in a secure sandbox for cloud, edge, and embedded applications. It functions as a multi-language compiler, compiling applications written in Rust, JavaScript, Go, and Python into WebAssembly bytecode for sandboxed execution, and as a server-side JavaScript runtime that runs JavaScript programs with ES6 modules, NPM packages, and Node.js-compatible APIs. The runtime also serves as an AI inference runtime, executing AI models from JavaScript using WASI-NN plug-ins for inference tasks on personal devices and edge har
Calls WebAssembly functions annotated with bindgen to pass complex parameters automatically from Go code.
This project is a terminal-based command line interface client and agent orchestrator for interacting with multiple large language model providers. It functions as an OpenAI API client and a local API gateway that exposes chat completions and embeddings through an HTTP server. The system distinguishes itself by providing a retrieval-augmented generation tool for indexing local files and URLs into a vector database to provide custom document context. It allows for the creation of specialized AI agents that combine custom system prompts with tool calling and external function execution. The to
Trigger external tools and organize them into aliases for faster activation.
Spring AI is an application framework for Java that provides a portable, fluent API for integrating AI models, tools, and vector stores into applications. It wraps multiple AI providers behind a common interface, allowing developers to switch between chat, embedding, image, and speech models without changing application code. The framework includes a chainable chat client API similar to WebClient or RestClient, supports both synchronous and streaming interactions, and offers structured output conversion that transforms unstructured AI responses into strongly-typed Java objects. The framework
Uses an advisor to handle the full tool-calling lifecycle automatically without manual intervention.
This project is a Java-based framework integration that provides an AI agent runtime, a graph-based AI workflow engine, and an LLM orchestration framework for Spring applications. It enables the development of stateful autonomous agents and the implementation of retrieval-augmented generation systems using document processing and vector databases. The framework distinguishes itself through a graph-based workflow runtime for designing complex AI pipelines with conditional routing and persistent state. It supports multi-agent orchestration via service-discovery coordination and provides human-i
Enables language models to execute external tools and API functions to extend operational capabilities.
Wasm3 هو مترجم WebAssembly مصمم لتكامل وقت التشغيل المدمج. يتيح تنفيذ المنطق الثنائي المحمول على المتحكمات الدقيقة والأجهزة محدودة الموارد، ويوفر دعماً للوحدات التي تستخدم واجهة نظام WebAssembly للتفاعل مع موارد النظام. يستخدم وقت التشغيل ترجمة بايت كود قائمة على السجلات وإرسالاً مباشراً (direct-threaded dispatch) لربط السجلات الافتراضية بسجلات CPU الفيزيائية. يتضمن واجهة استدعاء مضيف لربط الاستيرادات بدوال C ويستخدم التحقق من الوحدات الثابت لضمان اتباع البايت كود لقواعد المواصفات قبل التنفيذ. يوفر المشروع إدارة الموارد من خلال حدود تخصيص الذاكرة الخطية وقياس الغاز على مستوى التعليمات لمنع الحلقات اللانهائية واستنفاد النظام. تتم معالجة الملاحظة عبر تتبع التنفيذ، ومراقبة التدفق، وتوصيف عمليات المترجم. للتطوير وضمان الجودة، يتضمن وقت التشغيل REPL تفاعلياً ويدعم التمويه الموجه بالتغطية (coverage-guided fuzzing).
Maps WebAssembly imports to C functions to allow guest modules to interact with the host system.
ERNIE is a development toolkit for training, fine-tuning, and deploying large language models built on the PaddlePaddle deep learning platform. It provides a comprehensive suite of core components, including an inference server for vision and language models, a training and fine-tuning toolkit, and a framework for building retrieval-augmented generation systems using private knowledge bases. The project features multimodal AI models capable of reasoning across text, images, and video to perform complex visual understanding and information extraction. It distinguishes itself through specialize
Trains models to recognize and execute external tool calls through specialized function call training methodologies.
GLM-4 is an open weights large language model designed as a multimodal chat system. It functions as a reasoning-focused and multilingual model capable of processing and generating responses across text and visual data types. The model is distinguished by its function-calling capabilities, allowing it to interface with external tools and APIs to execute tasks and retrieve real-time information. It is optimized for complex logical reasoning, mathematical problem solving, and deep research involving long-form content generation. Broad capabilities include multilingual text generation, the creat
Maps model outputs to structured API specifications that trigger external code execution for data retrieval.
GLM-4 is a large language model and fine-tuning framework designed for human-like text production, complex reasoning, and multilingual conversation. It functions as a multimodal system capable of processing high-resolution visual content and as a long-context model designed to analyze documents with a context window of up to one million tokens. The project differentiates itself through a function calling interface that enables AI agent development by connecting the model to external APIs and real-time web browsing. It includes specialized capabilities for generating functional programming cod
Implements a function calling interface that enables the model to execute external tools and APIs.
Guardrails is a Python SDK that wraps calls to large language models with configurable validation pipelines, corrective actions, and structured output generation. It provides a unified API layer that connects to over 100 language models, applying consistent validation, streaming, and error-handling across providers. The framework validates and corrects model responses against safety and quality rules, detecting and mitigating risks in both inputs and outputs using pre-built and custom validators. The project distinguishes itself through a validator-pipeline architecture that sequentially appl
Passes structured tool definitions to the model and processes returned function calls within the guardrails pipeline.
Gopher-lua is a complete implementation of the Lua language and its standard libraries written natively in Go. It serves as an embedded scripting engine and virtual machine that allows Go applications to execute Lua scripts and exchange data between the host and the script environment. The project provides a bytecode compiler to transform source code into a binary format for faster execution. It enables deep integration between the two languages by allowing the registration of native Go functions to be called from scripts, and the invocation of script functions directly from Go. The engine c
Provides an interface for Lua scripts to trigger and execute native Go functions.
llm-zoomcamp is a comprehensive educational program and course for building real-life AI systems using large language models. It serves as a structured curriculum and implementation guide for developing AI applications and retrieval techniques. The project provides instructional material on building retrieval augmented generation pipelines to ground model responses in custom knowledge bases. It includes training on vector database implementation, semantic search, and the use of function calling to create autonomous agentic workflows. The curriculum covers a broad range of system development
Teaches how to implement systems that enable language models to execute external tools and API functions.
Detours is a library for intercepting Win32 API calls and redirecting function calls at runtime on Windows, enabling binary-level instrumentation without requiring access to the original source code. It functions as an API hooking library and binary instrumentation toolkit, allowing developers to monitor or modify the behavior of compiled Windows binaries by hooking into their function execution paths. The project achieves this through detour-based function interception, where the first few instructions of a target function are replaced with a jump to a user-supplied detour function, while pr
Routes calls from one function to a custom replacement, enabling instrumentation or extension of existing APIs.
Registers Python functions that the language model can invoke during a conversation to fetch data or perform actions.
Invokes user-defined functions to fetch data, perform actions, or interact with external systems.
هذا المشروع هو دليل شامل لكتابة إعدادات وإضافات Neovim باستخدام لغة البرمجة Lua. يعمل ككتيب لاستخدام واجهة برمجة تطبيقات Lua الأصلية في Neovim لإدارة المخازن المؤقتة (buffers)، والنوافذ، وخيارات المحرر. يركز الدليل على التوافق بين Lua و Vimscript، ويقدم تعليمات حول كيفية تنفيذ Vimscript من Lua واستدعاء دوال Lua من داخل Vimscript. كما يوفر إطار عمل لتطوير الإضافات، ويغطي تنظيم الكود في وحدات خارجية وإنشاء أوامر مخصصة وتعيينات مفاتيح. تغطي الوثائق قدرات أوسع تشمل أتمتة سير عمل المحرر، ومعالجة المتغيرات الداخلية، وإدارة خيارات المحرر. كما تتضمن تعليمات لاستدعاء ملفات خارجية وتعديل أسطر المخزن المؤقت.
Explains how to call Vimscript functions from Lua and handle the automatic conversion of data types.
The official PyTorch implementation of Google's Gemma models
Generates structured function-call arguments from natural language prompts for agentic workflows.