Cactus

Cactus - run AI models on mobile devices

Features

Local AI Inference - Executes large language and vision models directly on mobile and wearable hardware using optimized kernels.
On-Device Inference Engines - Serves as an on-device AI inference engine for executing large language, vision, and speech models on mobile and wearable hardware.
Multimodal Input Processing - Performs inference on image and sound data to enable visual understanding and speech-to-text capabilities.
Chat Completion Services - Produces natural language conversational responses based on chat history and configurable generation options.
Retrieval-Augmented Generation - Grounds model responses using locally stored text documents and directories to provide context-aware generation.
RAG Document Retrieval - Retrieves relevant snippets from local text files to provide grounded context for LLM responses.
Inference Optimization Kernels - Utilizes native kernels tuned for low-latency, energy-efficient mathematical operations on mobile hardware.
Local RAG Implementations - Provides a local retrieval-augmented generation framework that grounds model responses in local text files without cloud access.
Local Inference Engines - Provides an optimized runtime for executing large language models and vision models locally on consumer mobile hardware.
On-Device Speech-to-Text SDKs - Provides on-device speech-to-text transcription using locally executed models on mobile and wearable hardware.
RAG Frameworks - Provides a framework for building local retrieval-augmented generation systems that ground responses in the contents of local directories.
Speech Transcription - Provides local on-device speech-to-text transcription services with low-latency execution.
Vector Embeddings - Generates numerical vector representations of text, visual, and speech inputs for similarity search and retrieval.
AI Integration SDKs - Ships a multiplatform SDK with language bindings for integrating local AI capabilities into mobile applications.
Mobile Framework Integrations - Offers native software kits to integrate AI capabilities into handheld and wearable operating systems.
Tensor Computation Graphs - Allows defining sequences of tensor operations and activation functions as computational graphs for local execution.
Graph-Based Execution Engines - Executes mathematical workflows as a sequence of tensor operations and activation functions via directed acyclic graphs.
Language Bindings - Provides multiplatform software development kits and language bindings to connect the core engine to external applications.
AI Integration Tools - Connects local AI models to external system functions and tools to perform actions based on model outputs.
Model Request Routing - Redirects inference requests to cloud providers when local hardware capacity is insufficient.
Cross-Framework Model Conversion - Transforms external model formats into representations optimized for mobile and wearable hardware.
Function Calling Interfaces - Parses model outputs into structured function calls to interact with external system tools.
Hybrid Local-Remote AI Routing - Routes AI workloads between local on-device execution and cloud-based providers based on hardware capacity.
Local Speech-to-Text - Includes a low-latency on-device transcription system for converting audio input into text.
On-Device Speech Recognizers - Performs local speech-to-text transcription and voice activity detection on handheld and wearable devices.
Voice Activity Detection - Identifies periods of human speech within audio streams to trigger transcription and downstream processing.
Mobile Model Format Converters - Transforms external model formats into optimized representations compatible with local mobile and wearable hardware execution.
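
The hybrid local-remote routing described in the list above can be illustrated with a minimal sketch. It is not Cactus's actual API: the `Device` and `Request` types, the `route` function, and the memory-based capacity check are all hypothetical names and assumptions chosen only to show the idea of falling back to a cloud provider when local hardware capacity is insufficient.

```python
from dataclasses import dataclass


@dataclass
class Device:
    """Hypothetical snapshot of local hardware capacity (assumption)."""
    free_memory_mb: int


@dataclass
class Request:
    """Hypothetical inference request with an estimated memory footprint."""
    prompt: str
    required_memory_mb: int


def route(request: Request, device: Device) -> str:
    """Return "local" when the device can hold the workload, else "cloud"."""
    if request.required_memory_mb <= device.free_memory_mb:
        return "local"
    return "cloud"
```

A real router would likely weigh more than memory (battery, latency, connectivity); this sketch uses a single threshold to keep the decision easy to follow.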

Open-source alternatives to Cactus

Similar open-source projects, ranked by how many features they share with Cactus.

RunanywhereAI/runanywhere-sdks
8,781 stars
This project is an on-device AI SDK providing a framework for running large language models, vision models, and speech models locally. It serves as an orchestration layer for local LLM execution, ensuring data privacy and offline availability by utilizing hardware acceleration on the device. The SDK is distinguished by its comprehensive voice and multimodal capabilities, including a coordinated voice pipeline for activity detection, speech-to-text, and text-to-speech synthesis. It also provides a dedicated implementation kit for local retrieval-augmented generation and tools for processing co
C++, android, apple-intelligence, cpp
langroid/langroid
3,894 stars
Langroid is a multi-agent orchestration framework and tool integration suite designed for building complex AI applications. It serves as a multi-modal integration layer that connects diverse local and remote language models with an agentic retrieval-augmented generation system. The project distinguishes itself through a collaborative message-exchange paradigm, allowing specialized agents to delegate tasks hierarchically and coordinate via structured communication. It features an advanced state management system for conversational AI, including the ability to rewind and prune conversation hist
Python, agents, ai, chatgpt
imClumsyPanda/langchain-ChatGLM
38,183 stars
This project is a LangChain-based framework for building retrieval-augmented generation systems, autonomous agents, and multimodal chatbots. It functions as an open-source orchestrator that connects local inference engines and online APIs to manage various large language model deployments. The system distinguishes itself by providing specialized interfaces for local knowledge bases, allowing the loading and vectorization of private documents to create context-aware assistants. It also supports multimodal capabilities, enabling the processing of both text and image inputs through vision-capabl
Python
pipecat-ai/pipecat
12,846 stars
Pipecat is a framework and software development kit for building real-time multimodal AI agents and speech-to-speech systems. It utilizes a frame-based data pipeline to route audio, video, and text through a modular sequence of processors, enabling the orchestration of low-latency conversational AI. The project is distinguished by its ability to coordinate complex multimodal services, including speech-to-text, language models, and text-to-speech, within a single pipeline. It features semantic voice activity detection for natural turn-taking, state-machine conversation flows for dialogue manag
Python, ai, chatbot-framework, chatbots

See all 30 alternatives to Cactus

cactus-compute/cactus