© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Embedding Pipelines · Awesome GitHub Repositories

1 repo


Modular systems for generating vector representations from data using external machine learning models.

Distinguishing note: Focuses on the decoupling of embedding generation from storage.
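
To make the decoupling concrete, here is a minimal Python sketch, not taken from any listed repository. It assumes the storage layer only needs a callable that maps texts to vectors. The names `EmbeddingFunction`, `toy_char_embedder`, and `InMemoryVectorStore` are hypothetical and chosen for illustration; a real pipeline would swap the toy embedder for a call to an external model.

```python
import math
from typing import Callable, Sequence

Vector = list[float]
# Assumption: any callable mapping a batch of texts to vectors can act as the embedding stage.
EmbeddingFunction = Callable[[Sequence[str]], list[Vector]]


def toy_char_embedder(texts: Sequence[str], dim: int = 8) -> list[Vector]:
    """Hypothetical stand-in for an external model: bucketed character counts, L2-normalized."""
    vectors = []
    for text in texts:
        vec = [0.0] * dim
        for ch in text.lower():
            vec[ord(ch) % dim] += 1.0
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors


class InMemoryVectorStore:
    """Illustrative storage layer that knows nothing about how vectors are produced."""

    def __init__(self, embed: EmbeddingFunction) -> None:
        self._embed = embed
        self._ids: list[str] = []
        self._vectors: list[Vector] = []

    def add(self, ids: Sequence[str], documents: Sequence[str]) -> None:
        # Vector generation is delegated to the injected function.
        self._ids.extend(ids)
        self._vectors.extend(self._embed(documents))

    def query(self, text: str, k: int = 1) -> list[str]:
        # Rank stored vectors by dot product with the query vector.
        (q,) = self._embed([text])
        scored = sorted(
            zip(self._ids, self._vectors),
            key=lambda pair: -sum(a * b for a, b in zip(q, pair[1])),
        )
        return [doc_id for doc_id, _ in scored[:k]]


store = InMemoryVectorStore(embed=toy_char_embedder)
store.add(ids=["doc-a", "doc-b"], documents=["aaaa", "bbbb"])
nearest = store.query("aaa", k=1)
```

Because the store depends only on the callable's shape, changing the model means passing a different function to the constructor; the storage and query code stay the same.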

Explore 1 awesome GitHub repository matching data & databases · Embedding Pipelines. Refine with filters or upvote what's useful.


Awesome Embedding Pipelines GitHub Repositories

    chroma-core/chroma

    26,198 · View on GitHub↗

    Chroma is a specialized vector database designed to index and retrieve high-dimensional data representations for semantic similarity search. It functions as a comprehensive platform for information retrieval, enabling the storage and management of unstructured documents alongside structured metadata. By mapping data into numerical representations, the system facilitates rapid similarity lookups across large datasets. The platform distinguishes itself through a hybrid search infrastructure that combines dense vector embeddings with sparse keyword and regular expression matching to balance sema

    Decouples the vector generation process from the storage layer to support diverse third-party machine learning models.

    Rust · ai · database · document-retrieval