awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Architectures · Awesome GitHub Repositories

30 repos

Awesome GitHub RepositoriesArchitectures

Structural designs and mathematical patterns used to define the internal connectivity and data flow of neural networks.

Explore 30 awesome GitHub repositories matching artificial intelligence & ml · Architectures. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning
  4. Architectures

Awesome Architectures GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • AUTOMATIC1111/stable-diffusion-webui

    AUTOMATIC1111/stable-diffusion-webui

    160,701GitHubView on GitHub↗

    Stable Diffusion Web UI is a browser-based interface designed for managing text-to-image generation tasks. It provides a centralized dashboard for controlling generative processes, including native support for multi-stage model architectures to facilitate high-quality image refinement. The platform distinguishes itsel

    Powers the execution and management of complex generative media workflows through a graphical interface.

    Pythonaiai-artdeep-learning
  • huggingface/transformers

    huggingface/transformers

    156,730GitHubView on GitHub↗

    Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering

    Exposes a registry-based interface for implementing custom attention mechanisms or modifying existing model behaviors.

    Pythonaudiodeep-learningdeepseek
  • microsoft/generative-ai-for-beginners

    microsoft/generative-ai-for-beginners

    106,618GitHubView on GitHub↗

    This project is a comprehensive, open-source educational curriculum designed to guide developers through the mastery of generative artificial intelligence. It provides a structured learning path that covers foundational concepts, prompt engineering, and the practical application of large language models. The repository

    Focuses on utilizing smaller, efficient language models for practical deployment.

    Jupyter Notebookaiazurechatgpt
  • deepseek-ai/DeepSeek-V3

    deepseek-ai/DeepSeek-V3

    101,631GitHubView on GitHub↗

    DeepSeek-V3 is a large language model that provides comprehensive resources for model utilization, including technical specifications, pre-trained weights, and evaluation benchmarks. The project details the core transformer architecture, including parameter counts and multi-token prediction modules, while supporting na

    Standardized performance benchmarks and technical specifications allow for rigorous analysis of capabilities against industry-recognized metrics.

    Python
  • pytorch/pytorch

    pytorch/pytorch

    97,601GitHubView on GitHub↗

    PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe

    Organizes neural network architectures through modular base classes and container types for custom layer management.

    Pythonautograddeep-learninggpu
  • openai/whisper

    openai/whisper

    94,839GitHubView on GitHub↗

    This project is a speech recognition and translation engine that utilizes a sequence-to-sequence transformer architecture to convert audio into text. It is built upon a weakly supervised learning framework, which leverages large-scale, unlabelled audio-transcript data to create generalized speech representations capabl

    Maps variable-length audio input sequences to text output sequences using deep learning and byte-level tokenization.

    Python
  • rasbt/LLMs-from-scratch

    rasbt/LLMs-from-scratch

    85,529GitHubView on GitHub↗

    This repository serves as an educational framework for building large language models from the ground up. It provides a structured curriculum that guides learners through the end-to-end lifecycle of model development, including data processing, architecture design, and optimization. By focusing on low-level implementat

    Implements gradient-based optimization logic manually to clarify the mechanics of weight updates and loss minimization.

    Jupyter Notebookaiartificial-intelligencechatbot
  • tensorflow/models

    tensorflow/models

    77,684GitHubView on GitHub↗

    This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines fo

    Houses a centralized library of state-of-the-art deep learning architectures and verified reference implementations.

    Python
  • d2l-ai/d2l-zh

    d2l-ai/d2l-zh

    75,708GitHubView on GitHub↗

    This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners

    Models sequential dependencies in data through clear, code-based implementations of recurrent neural network structures.

    Pythonbookchinesecomputer-vision
  • mlabonne/llm-course

    mlabonne/llm-course

    75,340GitHubView on GitHub↗

    This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we

    Details the mechanics of stacked attention layers used to process sequences and capture long-range dependencies.

    courselarge-language-modelsllm
  • twitter/the-algorithm

    twitter/the-algorithm

    72,764GitHubView on GitHub↗

    The algorithm is a distributed recommendation engine pipeline designed to construct and serve personalized content timelines. It functions as a multi-stage orchestration layer that aggregates candidate content from diverse social graphs and high-dimensional embedding spaces, processing user interaction data to deliver

    Shares model architectures to predict multiple engagement signals simultaneously for optimized content relevance.

    Scala
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460GitHubView on GitHub↗

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    Models sequential dependencies in text across diverse scripts and languages using advanced neural network architectures.

    C++hacktoberfestlstmmachine-learning
  • lobehub/lobehub

    lobehub/lobehub

    72,403GitHubView on GitHub↗

    LobeHub is a comprehensive multi-agent orchestration platform designed for building, configuring, and deploying specialized AI agents. It provides a unified chat-based gateway that allows users to manage autonomous agent teams across web, desktop, and mobile environments. By utilizing a framework that supports persiste

    Coordinates autonomous agents to work in concert on complex, long-horizon objectives and organizational tasks.

    TypeScriptagentagent-collaborationagent-harness
  • CompVis/stable-diffusion

    CompVis/stable-diffusion

    72,380GitHubView on GitHub↗

    Stable Diffusion is a generative machine learning pipeline that synthesizes high-resolution visual content by performing iterative denoising within a compressed latent space. By mapping natural language embeddings into pixel outputs through conditioned probabilistic processes, the framework enables the generation of im

    Maps pixel data into compact latent spaces to facilitate the synthesis of new visual media.

    Jupyter Notebook
  • abi/screenshot-to-code

    abi/screenshot-to-code

    71,707GitHubView on GitHub↗

    This project is an artificial intelligence-powered frontend generator that translates visual design inputs into functional source code. It functions as a workflow engine that interprets graphical user interfaces, mapping layout structures and styling rules to structured markup and programming language syntax. The tool

    Processes visual design inputs through neural networks to interpret layout structures and translate them into functional source code.

    TypeScript
  • josephmisiti/awesome-machine-learning

    josephmisiti/awesome-machine-learning

    71,702GitHubView on GitHub↗

    This project is a comprehensive, community-driven directory of machine learning resources, software libraries, and educational materials. It serves as a centralized knowledge base for developers and researchers, organizing tools and frameworks by their primary programming language and technical domain to simplify disco

    Identifies frameworks dedicated to the design and deployment of networks that simulate biological spiking patterns.

    Python
  • openai/openai-cookbook

    openai/openai-cookbook

    71,532GitHubView on GitHub↗

    This project is a technical learning resource and developer knowledge base focused on the integration of large language models into software applications. It provides a structured collection of guides and code examples designed to teach developers how to implement intelligent features using proven patterns and best pra

    Implements architectural strategies like retrieval-augmented generation to connect language models with external data sources.

    Jupyter Notebookchatgptgpt-4openai
  • dair-ai/Prompt-Engineering-Guide

    dair-ai/Prompt-Engineering-Guide

    70,526GitHubView on GitHub↗

    This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task

    Summarizes technical architectures capable of transforming text or image inputs into sequential video frames.

    MDXagentagentsai-agents
  • scikit-learn/scikit-learn

    scikit-learn/scikit-learn

    65,178GitHubView on GitHub↗

    Scikit-learn is a machine learning library for predictive data analysis that provides a collection of algorithms for supervised and unsupervised learning. It functions as a comprehensive toolkit for data preprocessing, dimensionality reduction, and model selection, allowing users to classify data objects, predict conti

    Chains data transformation and model estimation steps into sequential, reproducible workflows using a unified interface.

    Pythondata-analysisdata-sciencemachine-learning
  • keras-team/keras

    keras-team/keras

    63,858GitHubView on GitHub↗

    Keras is a high-level deep learning framework designed for constructing and training neural networks through the composition of modular, functional layers. It serves as a comprehensive modeling toolkit that provides standardized procedures for defining, evaluating, and deploying complex architectures. By utilizing a di

    Composes neural networks using reusable, functional layers that perform specific mathematical transformations on input data.

    Pythondata-sciencedeep-learningjax
Prev12Next

Explore sub-tags

  • Computer Vision Segmentation Models2 sub-tagsDeep learning architectures for pixel-level object isolation.
  • Generative Media Models3 sub-tagsModels focused on the synthesis of new visual or creative content from various input modalities.
  • Instruction-Tuned Language Models7 sub-tagsLarge language models specifically fine-tuned to follow user instructions and engage in chat-based interactions.
  • Model Architecture and Evaluation
4 sub-tags
Technical frameworks, structural designs, and performance metrics used to analyze and categorize model capabilities.
  • Multi-Agent Orchestration Systems4 sub-tagsSystems that coordinate multiple autonomous agents to collaborate on complex tasks and long-horizon objectives.
  • Multimodal Perception Models3 sub-tagsModels designed to interpret and analyze visual data, charts, or cross-modal inputs alongside text.
  • Neural Network Components10 sub-tagsModular building blocks and custom layer definitions used to construct and customize neural network architectures.
  • Recurrent Neural NetworksNeural network architectures designed to model sequential dependencies in data.
  • Sequence Models1 sub-tagArchitectures designed for processing ordered data where temporal or sequential dependencies are critical.
  • Spiking Neural NetworksLibraries and frameworks for designing, training, and deploying artificial neural networks that mimic biological spiking behavior.
  • Transformer3 sub-tagsNeural network designs utilizing stacked attention layers to process sequences and capture long-range dependencies.