3 repos
Explore 3 GitHub repositories matching Artificial Intelligence & ML · Inference Runtimes.
GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. Using an optimized C++ inference backend, it enables private, offline AI interactions with no dependence on external cloud services. The project provides a compreh…
Transforms local data into searchable vector collections to provide context-aware, private knowledge retrieval for language models.
Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on…
Optimizes the loading and execution of transformer-based neural networks on standard computing hardware.
This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov…
Transforms local data into searchable collections to enable context-aware responses from both local and cloud-based models.