awesome-repositories.com
© 2026 Bringes Technology SRL · VAT RO45896025 · hello@bringes.io
Language Tools · Awesome GitHub Repositories

9 repos

Utilities for managing linguistic data, including dictionaries and automated translation services for text processing.

Explore 9 awesome GitHub repositories matching Artificial Intelligence & ML · Language Tools.

Awesome Language Tools GitHub Repositories

  • public-apis/public-apis

    399,192 stars on GitHub

    This project is a comprehensive, community-driven directory of public service endpoints designed to facilitate the discovery and integration of external data sources. It serves as a centralized registry where developers can locate reliable third-party APIs to augment their applications with specialized functionality, r

    Python · api · apis · dataset
  • huggingface/transformers

    156,730 stars on GitHub

    Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering

    Python · audio · deep-learning · deepseek
  • jaywcjlove/awesome-mac

    99,007 stars on GitHub

    This project is a comprehensive, curated collection of software resources designed for the macOS ecosystem. It serves as a centralized directory for discovering applications across a wide range of functional domains, including professional development, system management, and personal productivity. The directory distin

    JavaScript · app · apple · application
  • Shubhamsaboo/awesome-llm-apps

    96,116 stars on GitHub

    This repository serves as a comprehensive collection of resources, templates, and starter code for building artificial intelligence applications. It provides a centralized hub for developers to access practical implementations of common workflows, including retrieval-augmented generation pipelines and autonomous agent

    Python · agents · llms · python
  • fighting41love/funNLP

    78,999 stars on GitHub

    This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati

    Python
  • netdata/netdata

    77,812 stars on GitHub

    Netdata is a distributed observability platform designed for real-time infrastructure monitoring and performance tracking. It functions as a high-frequency agent that collects system, container, and application metrics with per-second precision, providing both local visualization and centralized aggregation across comp

    C · ai · alerting · cncf
  • nomic-ai/gpt4all

    77,146 stars on GitHub

    GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a compreh

    C++ · ai-chat · llm-inference
  • tesseract-ocr/tesseract

    72,460 stars on GitHub

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    C++ · hacktoberfest · lstm · machine-learning
  • dair-ai/Prompt-Engineering-Guide

    70,526 stars on GitHub

    This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task

    MDX · agent · agents · ai-agents

Explore sub-tags

  • Dictionary APIs: Services for accessing linguistic definitions, character data, and translations.
  • Dictionary Management Utilities: Tools for configuring, customizing, and optimizing dictionary-based text recognition and tokenization processes.
  • Embeddings (2 sub-tags): Systems and pipelines for generating and managing vector representations of data for semantic search and analysis.
  • Natural Language Processing Resources (1 sub-tag): Curated datasets, lexicons, and linguistic tools designed to support natural language processing tasks.
  • Natural Language Querying: Interfaces for querying system data using natural language processing.
  • OCR Language Support: Identification and management of language-specific character sets and scripts for recognition engines.
  • Text Classification: Algorithms and techniques for assigning predefined categories to text documents or strings.
  • Tokenization Algorithms (2 sub-tags): Mathematical methods for breaking down text into smaller units like words, subwords, or characters.
  • Tokenization Interfaces (1 sub-tag): Programming abstractions that define how tokenization processes interact with larger machine learning pipelines.
  • Tokenization Utilities (1 sub-tag): Helper functions and scripts for managing, decoding, and processing tokenized data streams.
  • Transitional Phrase Patterns: Categorized analysis of common transitional phrases used in machine-generated content.
  • Translation Tools: Utilities for translating text between languages, including dictionary and lookup features.