awesome-repositories.com


Tokenizers

Utilities that convert raw text into numerical sequences or sub-word units for machine learning models.
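As a rough illustration of what "converting raw text into numerical sequences" can look like, here is a minimal character-level sketch. It is not taken from any of the repositories listed on this page; the function names `build_vocab`, `encode`, and `decode` are hypothetical, and real tokenizers typically work on sub-word units rather than single characters.

```python
# Minimal character-level tokenizer sketch (illustrative only).
# All names here are hypothetical and not taken from any listed repository.

def build_vocab(text):
    """Map each distinct character to an integer ID, in sorted order."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}   # string -> int
    itos = {i: ch for ch, i in stoi.items()}       # int -> string
    return stoi, itos

def encode(text, stoi):
    """Convert text into a list of integer IDs (raises KeyError on unseen characters)."""
    return [stoi[ch] for ch in text]

def decode(ids, itos):
    """Convert a list of integer IDs back into text."""
    return "".join(itos[i] for i in ids)

stoi, itos = build_vocab("hello world")
ids = encode("hello", stoi)
print(ids)                # [3, 2, 4, 4, 5]
print(decode(ids, itos))  # hello
```

Sub-word schemes such as byte-pair encoding follow the same encode/decode contract but build the vocabulary from frequently occurring character sequences instead of single characters.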

Three GitHub repositories listed under Artificial Intelligence & ML · Tokenizers.

Awesome Tokenizers GitHub Repositories

  • openai/codex

    61,152 · GitHub

    Codex is an automated programming tool and generative code assistant designed to interpret developer intent through a natural language interface. It functions as a machine learning model trained on public code repositories to provide intelligent code completion, suggestions, and refactoring within development environme…

    Rust
  • meilisearch/meilisearch

    55,992 · GitHub

    Meilisearch is a Rust-based search engine providing typo-tolerant full-text and vector-based semantic search with real-time conversational capabilities.

    Rust · ai · api · app-search
  • karpathy/nanoGPT

    53,461 · GitHub

    nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi…

    Python