# NLP, speech and embeddings

> Search results for `NLP, speech and embeddings` on awesome-repositories.com. 118 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/nlp-speech-and-embeddings

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/nlp-speech-and-embeddings).**

## Results

- [haifengl/smile](https://awesome-repositories.com/repository/haifengl-smile.md) (6,387 ⭐) — Smile is a comprehensive JVM machine learning library and statistical computing toolkit. It provides a suite of algorithms for classification, regression, and clustering, implemented natively for Java, Scala, and Kotlin. The project also functions as a deep learning framework, a natural language processing library, and an inference engine for large language models.

The library distinguishes itself through GPU acceleration via LibTorch bindings and support for the ONNX model interchange format. It includes specialized capabilities for large language model inference, featuring Byte-Pair Encodin
- [graykode/nlp-tutorial](https://awesome-repositories.com/repository/graykode-nlp-tutorial.md) (14,855 ⭐) — This repository serves as an educational resource for learning the foundational architectures of natural language processing through concise code implementations. It provides a structured collection of deep learning models designed to process and understand human language, focusing on the core mechanics of neural network sequence modeling and text analysis.

The project distinguishes itself by offering direct, hands-on implementations of complex architectures, including Transformers, attention mechanisms, and word embedding generation. By utilizing tensor-based computational graphs and gradien
- [open-speech/speech-aligner](https://awesome-repositories.com/repository/open-speech-speech-aligner.md) (410 ⭐) — speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its transcription.
- [facebookresearch/parlai](https://awesome-repositories.com/repository/facebookresearch-parlai.md) (10,625 ⭐) — ParlAI is a conversational AI research framework designed for training, evaluating, and sharing dialogue models using a unified interface for datasets and agents. It functions as a PyTorch-based training platform and a dialogue data collection system, providing a centralized model zoo for the distribution of versioned pretrained agents.

The project distinguishes itself through a knowledge-grounded retrieval system that combines dense and sparse indexing to ground responses in external information. It also provides a comprehensive infrastructure for gathering human-AI interaction data via inte
- [microsoft/nlp-recipes](https://awesome-repositories.com/repository/microsoft-nlp-recipes.md) (6,436 ⭐) — nlp-recipes is a collection of implementation guides and reference templates for applying natural language processing techniques to real-world tasks. It provides standardized workflows and code examples for developing NLP pipelines, from dataset preparation and model training to performance evaluation.

The project focuses on the practical application of transformer-based models, offering patterns for fine-tuning pretrained architectures for tasks such as text classification, named entity recognition, and question answering. It also includes a toolkit for model interpretability, allowing users
- [lancopku/embedding-poisoning](https://awesome-repositories.com/repository/lancopku-embedding-poisoning.md) (45 ⭐) — Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-HLT 2021).
- [tingsongyu/pytorch_tutorial](https://awesome-repositories.com/repository/tingsongyu-pytorch-tutorial.md) (8,018 ⭐) — This project is a comprehensive collection of educational examples and reference implementations for building vision and language models using PyTorch. It serves as a deep learning tutorial covering the end-to-end process of developing neural networks, from initial architecture definition to final production deployment.

The repository provides detailed guides on implementing a wide range of domain-specific models, including convolutional neural networks for object detection and segmentation, as well as transformer and recurrent architectures for natural language processing. It emphasizes gene
- [mastra-ai/mastra](https://awesome-repositories.com/repository/mastra-ai-mastra.md) (21,221 ⭐) — Mastra is an orchestration framework designed for building, deploying, and managing autonomous AI agents and multi-agent systems. It provides a comprehensive suite of primitives for creating resilient AI applications, including durable workflow orchestration, event-driven agent loops, and semantic memory management. By integrating these core components, the platform enables developers to build complex, multi-step processes that can reason about goals and execute tasks without manual intervention.

The framework distinguishes itself through its focus on observability and secure, isolated execut
- [yandexdataschool/nlp_course](https://awesome-repositories.com/repository/yandexdataschool-nlp-course.md) (10,591 ⭐) — YSDA course in Natural Language Processing
- [jonashaag/speech-enhancement](https://awesome-repositories.com/repository/jonashaag-speech-enhancement.md) (25 ⭐) — Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement
- [miteshputhranneu/speech-emotion-analyzer](https://awesome-repositories.com/repository/miteshputhranneu-speech-emotion-analyzer.md) (1,408 ⭐) — A neural network model that detects five different emotions in male and female speech from audio recordings. (Deep Learning, NLP, Python)
- [facebookresearch/fairseq](https://awesome-repositories.com/repository/facebookresearch-fairseq.md) (32,228 ⭐) — Fairseq is a PyTorch toolkit for sequence-to-sequence modeling, specializing in neural machine translation, automatic speech recognition, and large-scale language model training. It provides a framework for processing and aligning diverse data sources, including text, audio, and video, to support tasks such as speech-to-text conversion and multimodal sequence learning.

The project is distinguished by its distributed training capabilities, which utilize parameter sharding, mixed-precision training, and CPU offloading to handle models that exceed single-device memory. It also includes specializ
- [d2l-ai/d2l-en](https://awesome-repositories.com/repository/d2l-ai-d2l-en.md) (29,001 ⭐) — This project is an educational platform and research toolkit designed to teach deep learning through a combination of mathematical theory, visual diagrams, and executable code. It provides a comprehensive environment for building, training, and evaluating neural networks, grounding complex concepts in interactive computational notebooks that allow for hands-on experimentation.

The framework distinguishes itself by interleaving theoretical foundations—including linear algebra, calculus, and probability—with practical implementations across multiple industry-standard libraries. It supports flex
- [modular/modular](https://awesome-repositories.com/repository/modular-modular.md) (26,357 ⭐) — Modular is a unified machine learning development platform designed for building, compiling, and deploying high-performance neural network models. It provides a comprehensive execution engine that supports both local and production-grade inference, enabling developers to manage the entire model lifecycle from initial architecture definition to scalable, containerized service deployment.

The platform distinguishes itself through a hardware-agnostic runtime that abstracts diverse silicon architectures, allowing models to execute efficiently across varied compute environments. It includes a spec
- [oxford-cs-deepnlp-2017/lectures](https://awesome-repositories.com/repository/oxford-cs-deepnlp-2017-lectures.md) (15,854 ⭐) — This repository is a deep learning for natural language processing course and curriculum. It provides educational material and guides focused on neural network architectures used for processing natural language, speech signals, and text classification.

The content includes instructional tutorials on sequence modeling and neural language modeling, covering the implementation of n-gram and recurrent neural networks. It also provides a framework for studying word embeddings to map linguistic meanings into numerical representations.

The curriculum covers a broad range of capabilities, including
- [embedded-graphics/embedded-graphics](https://awesome-repositories.com/repository/embedded-graphics-embedded-graphics.md) (1,295 ⭐) — A no_std graphics library for embedded applications
- [embedded-boston/awesome-embedded-systems](https://awesome-repositories.com/repository/embedded-boston-awesome-embedded-systems.md) (981 ⭐) — A curated list of delightful Embedded Systems libraries, RTOSes, modules, references and more!
- [afshinea/stanford-cs-230-deep-learning](https://awesome-repositories.com/repository/afshinea-stanford-cs-230-deep-learning.md) (7,028 ⭐) — This repository collects illustrated single-page cheat sheets that compress the core topics of Stanford's CS 230 deep learning course into visual reference summaries. The collection covers convolutional neural networks, recurrent neural networks, and practical training techniques, pairing schematic diagrams with mathematical notation to bridge intuition and formal understanding.

The cheat sheets are organized by subject area and link related concepts across topics, such as connecting vanishing gradients to LSTM gates, to reinforce the full deep learning workflow. Practical training advice on
- [axa-group/nlp.js](https://awesome-repositories.com/repository/axa-group-nlp-js.md) (6,574 ⭐) — nlp.js is a JavaScript natural language processing library and development framework used to build natural language understanding engines. It provides a toolkit for creating local machine learning models for intent classification and acts as a multilingual text processor that detects languages and normalizes text across various dialects.

The framework distinguishes itself by supporting local execution on both servers and mobile devices, enabling chatbot functionality without an internet connection. It features a specialized system for conversational slot filling to collect mandatory informati
- [speech-io/bigcidian](https://awesome-repositories.com/repository/speech-io-bigcidian.md) (263 ⭐) — Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
- [nyandwi/machine_learning_complete](https://awesome-repositories.com/repository/nyandwi-machine-learning-complete.md) (4,983 ⭐) — This is an interactive notebook-based course that teaches machine learning from Python fundamentals through deep learning and natural language processing. It uses real datasets and multiple frameworks within a structured, hands-on curriculum that combines concise explanations with executable code cells, built-in datasets, and embedded exercise checkpoints. Learning progresses through data preparation and exploration, classical machine learning workflows, computer vision with convolutional neural networks, and natural language processing with deep learning, all delivered as a cohesive progressi
- [openclaw/openclaw](https://awesome-repositories.com/repository/openclaw-openclaw.md) (380,031 ⭐) — Openclaw is a platform for managing agent execution environments, providing the infrastructure to control agent lifecycles, session state, and workspace persistence. It features a centralized gateway that handles model loops, tool invocation, and streaming events, while supporting multi-agent routing and persistent memory management. The system is designed to normalize tool execution signatures and provide a standardized interface for cross-provider compatibility.

The platform includes extensive developer tooling, such as a command-line interface for workspace management, diagnostic logging,
- [elevenlabs/elevenlabs-python](https://awesome-repositories.com/repository/elevenlabs-elevenlabs-python.md) (2,873 ⭐) — This Python SDK provides a comprehensive toolkit for synthetic audio generation, voice cloning, and the development of conversational AI agents. It enables the creation of lifelike spoken audio from text, the replication of human voices through custom cloning, and the deployment of real-time voice agents capable of interacting with external large language models.

The library distinguishes itself through deep integration of conversational AI capabilities, including the design of agent personas and the execution of real-time actions via APIs. It supports professional-grade audio production thro
- [ljpzzz/machinelearning](https://awesome-repositories.com/repository/ljpzzz-machinelearning.md) (8,706 ⭐) — This project is a machine learning implementation library featuring a collection of code examples that implement supervised, unsupervised, and reinforcement learning algorithms from scratch. It provides a comprehensive set of toolkits for core machine learning components, including a natural language processing toolkit, a reinforcement learning framework, and suites for data dimensionality reduction and pattern mining.

The library includes specialized implementations for reinforcement learning, such as Q-Learning, Deep Q-Networks, and Actor-Critic agents. The natural language processing capab
- [rust-embedded-community/embedded-nal](https://awesome-repositories.com/repository/rust-embedded-community-embedded-nal.md) (202 ⭐) — An Embedded Network Abstraction Layer
- [hammerspoon/hammerspoon](https://awesome-repositories.com/repository/hammerspoon-hammerspoon.md) (14,497 ⭐) — Hammerspoon is a programmable automation engine for macOS that enables deep system-level control through a Lua scripting environment. By bridging high-level scripts with native Objective-C APIs, it allows users to interact with the operating system's accessibility tree, intercept hardware input streams, and manage the lifecycle of running applications.

The project distinguishes itself through an event-driven architecture that registers asynchronous hooks for system notifications and hardware events. This allows for real-time automation, such as remapping keyboard and mouse inputs, managing wi
- [piskvorky/gensim](https://awesome-repositories.com/repository/piskvorky-gensim.md) (16,361 ⭐) — Gensim is a natural language processing toolkit designed for large-scale text analysis and the training of semantic vector embeddings. It provides a framework for identifying latent thematic structures within document collections and calculating semantic similarity between text segments using unsupervised statistical algorithms.

The project is distinguished by its ability to handle datasets that exceed available system memory through incremental corpus streaming, which processes documents one at a time from disk. It utilizes sparse vector representations and dictionary-based token mapping to
- [rust-embedded-community/embedded-storage](https://awesome-repositories.com/repository/rust-embedded-community-embedded-storage.md) (95 ⭐) — An Embedded Storage Abstraction Layer
- [rare-technologies/gensim](https://awesome-repositories.com/repository/rare-technologies-gensim.md) (16,442 ⭐) — Gensim is an unsupervised natural language processing toolkit designed for topic modeling, word embedding training, and the processing of large-scale text corpora. It provides a framework for discovering latent themes and semantic structures in text without the need for labeled data.

The toolkit is distinguished by its ability to handle datasets that exceed system memory through iterator-based data streaming from disk. It also supports distributed model training, allowing complex modeling tasks to be executed across computer clusters.

The library covers a broad range of analysis capabilities
- [fermyon/spin](https://awesome-repositories.com/repository/fermyon-spin.md) (6,443 ⭐) — Spin is a WebAssembly serverless framework and development toolchain for building and running portable microservices. It functions as an event-driven orchestrator and runtime that executes WebAssembly components, allowing developers to map HTTP requests, Redis messages, and cron schedules to specific modules.

The project distinguishes itself by implementing a Wasm-based AI inference gateway, enabling components to perform model inference and generate text embeddings. It utilizes the WebAssembly Component Model and WASI for language-agnostic composition and portable host interfacing, while emp
- [rust-embedded/awesome-embedded-rust](https://awesome-repositories.com/repository/rust-embedded-awesome-embedded-rust.md) (7,927 ⭐) — Curated list of resources for Embedded and Low-level development in the Rust programming language
- [nvidia-nemo/nemo](https://awesome-repositories.com/repository/nvidia-nemo-nemo.md) (17,389 ⭐) — NeMo is a comprehensive framework designed for the development, training, and deployment of large-scale conversational and generative artificial intelligence models. It provides an integrated platform for building multimodal systems, encompassing speech processing, language modeling, and reinforcement learning alignment. The framework is built to handle the entire lifecycle of AI development, from data curation and model pretraining to production-ready service deployment.

The platform distinguishes itself through advanced distributed training capabilities, including tensor and pipeline parall
- [ashishpatel26/andrew-ng-notes](https://awesome-repositories.com/repository/ashishpatel26-andrew-ng-notes.md) (3,594 ⭐) — This project is a collection of structured study notes and notebooks serving as an educational resource for deep learning and neural network fundamentals. It provides a technical reference for implementing machine learning theory, covering everything from basic network design to the construction of advanced architectures.

The material specifically focuses on the implementation of convolutional neural networks for computer vision and sequence models for natural language processing. It includes detailed guidance on building object detection systems, face recognition, and speech transcription mo
- [vseloved/cl-nlp](https://awesome-repositories.com/repository/vseloved-cl-nlp.md) (236 ⭐) — Common Lisp NLP toolset
- [vbelz/speech-enhancement](https://awesome-repositories.com/repository/vbelz-speech-enhancement.md) (763 ⭐) — This project aims at building a speech enhancement system to attenuate environmental noise.
- [camel-ai/camel](https://awesome-repositories.com/repository/camel-ai-camel.md) (17,253 ⭐) — This project is a comprehensive framework for building and managing autonomous agent systems. It provides a unified architecture for orchestrating multi-agent societies, where specialized agents collaborate through roleplay to decompose and solve complex tasks. The system integrates language models with external environments, enabling agents to perform real-world actions through a standardized tool-calling abstraction layer.

The framework distinguishes itself through its focus on iterative reasoning and data reliability. It employs automated feedback loops to refine agent outputs and self-eva
- [d2l-ai/d2l-zh](https://awesome-repositories.com/repository/d2l-ai-d2l-zh.md) (78,493 ⭐) — This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners to master complex artificial intelligence concepts through hands-on experimentation.

The platform distinguishes itself by integrating technical explanations with executable Jupyter notebooks. This design allows readers to modify code and hyperparameters in real-time, facilitati
- [activepieces/activepieces](https://awesome-repositories.com/repository/activepieces-activepieces.md) (20,887 ⭐) — Activepieces is an open-source, self-hosted workflow automation platform designed to connect third-party applications through modular triggers and actions. It provides a low-code integration framework that allows users to build, manage, and execute complex business logic sequences within isolated, sandboxed environments.

The platform distinguishes itself through its focus on embeddability and enterprise-grade security. It features an embedded automation builder that can be integrated into external applications via iframes, supported by comprehensive identity and access management tools such a
- [boat-group/fancy-nlp](https://awesome-repositories.com/repository/boat-group-fancy-nlp.md) (284 ⭐) — NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.
- [danielmiessler/fabric](https://awesome-repositories.com/repository/danielmiessler-fabric.md) (42,408 ⭐) — Fabric is a command-line orchestrator designed to automate complex data processing and content generation tasks by chaining artificial intelligence models with modular prompt templates. It functions as a terminal-based tool that utilizes standard input and output streams, allowing users to pipe data directly into predefined reasoning strategies. By providing a model-agnostic abstraction layer, the system decouples execution logic from specific artificial intelligence vendors, normalizing requests and responses across different service providers.

The platform distinguishes itself through its p
- [outcastofmusic/quick-nlp](https://awesome-repositories.com/repository/outcastofmusic-quick-nlp.md) (283 ⭐) — PyTorch NLP library based on fastai
- [brightmart/text_classification](https://awesome-repositories.com/repository/brightmart-text-classification.md) (7,938 ⭐) — This project is a deep learning text classification framework and neural text analysis library. It provides tools for categorizing textual data, adapting large language models through fine-tuning, and treating classification tasks as sequence generation problems using transformer architectures.

The framework distinguishes itself through the implementation of ensemble learning, using boosting to combine predictions from multiple architectures to increase accuracy. It also includes a toolkit for fine-tuning pre-trained models via layer updates and the ability to restore model sessions for real-
- [edobashira/speech-language-processing](https://awesome-repositories.com/repository/edobashira-speech-language-processing.md) (2,229 ⭐) — A curated list of speech and natural language processing resources
- [anthropics/claude-code](https://awesome-repositories.com/repository/anthropics-claude-code.md) (132,728 ⭐) — Anthropic's terminal-native AI coding agent.
- [zalandoresearch/flair](https://awesome-repositories.com/repository/zalandoresearch-flair.md) (14,378 ⭐) — Flair is a natural language processing framework for training and applying models for sequence labeling and text classification. It provides a system for generating word embeddings and identifying semantic entities within text.

The framework includes a dedicated system for zero and few-shot learning, enabling text classification and entity extraction using minimal training examples by leveraging pre-trained knowledge.

Its capabilities cover named entity recognition, sentiment analysis, and the training of specialized models using custom datasets. It also includes tooling for the visual highl
- [berriai/litellm](https://awesome-repositories.com/repository/berriai-litellm.md) (50,579 ⭐) — LiteLLM is a unified gateway and proxy server designed to centralize access to over one hundred language model providers. It provides a standardized API interface that abstracts vendor-specific schemas, allowing developers to interact with diverse models through a single, consistent format. By acting as a central traffic management layer, it enables organizations to route, secure, and govern model interactions across multiple deployments.

The platform distinguishes itself through its policy-driven architecture, which uses configuration-based routing to manage traffic distribution, load balanc
- [stanfordnlp/glove](https://awesome-repositories.com/repository/stanfordnlp-glove.md) (7,224 ⭐) — GloVe is a distributed word representation system and a C implementation for training and using Global Vectors for word embeddings. It provides a word embedding training tool to learn numerical representations of words based on global co-occurrence statistics from a text corpus.

The project includes a pre-trained word vector library learned from large web datasets, allowing for the import of these representations to perform semantic analysis without local training. It enables word vector generation to identify semantic relationships, analogies, and nearest neighbors.

The system covers the fu
- [visualtext/nlp-engine](https://awesome-repositories.com/repository/visualtext-nlp-engine.md) (21 ⭐) — The first set of NLP++ tutorial videos is available, with more coming soon. NLP++ tutorial videos: http://tutorials.visualtext.org; analyzers used in the videos: https://github.com/VisualText/nlp-tutorials; VisualText tutorial videos:…
- [diasks2/ruby-nlp](https://awesome-repositories.com/repository/diasks2-ruby-nlp.md) (1,285 ⭐) — A collection of links to Ruby Natural Language Processing (NLP) libraries, tools and software
- [flairnlp/flair](https://awesome-repositories.com/repository/flairnlp-flair.md) (14,378 ⭐) — Flair is a transformer-based natural language processing framework used to build and train models for text classification and sequence tagging. It provides a specialized library for generating contextual text embeddings and performing linguistic analysis.

The framework includes dedicated tools for named entity recognition, including the identification of specialized biomedical entities across multiple languages. It further supports entity linking to map identified text mentions to unique entries within general or biomedical knowledge bases.

The project covers a broad range of language analys
