76 repos

Awesome Python GitHub repositories

We curate 76 open-source Python repositories. AI-ranked by relevance — refine with filters, or browse the highest-voted projects in the community.

We'll search the best matching repositories with AI.

openai/whisper
openai/whisper
94,839GitHubView on GitHub
This project is a speech recognition and translation engine that utilizes a sequence-to-sequence transformer architecture to convert audio into text. It is built upon a weakly supervised learning framework, which leverages large-scale, unlabelled audio-transcript data to create generalized speech representations capabl
Python
microsoft/markitdown
microsoft/markitdown
87,305GitHubView on GitHub
This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine
Pythonautogenautogen-extensionlangchain
django/django
django/django
86,891GitHubView on GitHub
Django is a full-stack web framework designed for rapid backend development. It provides an integrated environment for building data-driven applications by combining an object-relational mapping layer for database management with a modular request-response pipeline for handling HTTP traffic. The framework emphasizes se
Pythonappsdjangoframework
home-assistant/core
home-assistant/core
84,936GitHubView on GitHub
Home Assistant is a centralized home automation platform designed to orchestrate diverse internet-connected devices and services. It functions as a local-first control system that normalizes heterogeneous hardware protocols into a unified set of entities, attributes, and services. The core architecture relies on an eve
Pythonasynciohacktoberfesthome-automation
3b1b/manim
3b1b/manim
84,611GitHubView on GitHub
Manim is a Python-based computational geometry framework designed for programmatic video production. It functions as a mathematical animation engine, allowing users to generate high-fidelity visual content by scripting scene definitions rather than using traditional timeline-based editing software. The library is built
Python3b1b-videosanimationexplanatory-math-videos
bregman-arie/devops-exercises
bregman-arie/devops-exercises
81,169GitHubView on GitHub
This project is a comprehensive educational curriculum designed to build proficiency across modern infrastructure, cloud-native technologies, and systems administration. It functions as a reference library and interview preparation resource, offering a structured collection of conceptual questions, practical coding cha
Pythonansibleawsazure
hacksider/Deep-Live-Cam
hacksider/Deep-Live-Cam
79,568GitHubView on GitHub
Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a hi
Pythonaiai-deep-fakeai-face
fighting41love/funNLP
fighting41love/funNLP
78,999GitHubView on GitHub
This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati
Python
browser-use/browser-use
browser-use/browser-use
78,576GitHubView on GitHub
Browser-use is a framework for building autonomous agents that navigate, interact with, and extract data from web interfaces using natural language instructions. By acting as an orchestration layer between large language models and browser automation protocols, it enables the execution of complex, multi-step workflows
Pythonai-agentsai-toolsbrowser-automation
tensorflow/models
tensorflow/models
77,684GitHubView on GitHub
This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines fo
Python
d2l-ai/d2l-zh
d2l-ai/d2l-zh
75,708GitHubView on GitHub
This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners
Pythonbookchinesecomputer-vision
swisskyrepo/PayloadsAllTheThings
swisskyrepo/PayloadsAllTheThings
75,346GitHubView on GitHub
This project is a comprehensive, community-sourced knowledge base designed for security professionals and researchers. It functions as a centralized repository of offensive security techniques, providing a structured collection of exploit payloads, attack vectors, and methodologies for conducting vulnerability assessme
Pythonbountybugbountybypass
infiniflow/ragflow
infiniflow/ragflow
73,425GitHubView on GitHub
This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin
Pythonagentagenticagentic-ai
sherlock-project/sherlock
sherlock-project/sherlock
72,906GitHubView on GitHub
Sherlock is a command-line automation tool designed to orchestrate software build, execution, and deployment workflows. It functions as an ephemeral runtime orchestrator that executes applications directly from source code, bypassing the need for persistent system-wide installations or manual dependency management. By
Pythonclicticybersecurity
anthropics/skills
anthropics/skills
71,987GitHubView on GitHub
This project provides a standardized framework for extending the functional range of artificial intelligence agents through a registry of modular, declarative instructions. It enables agentic workflow automation by allowing developers to define task-specific behaviors and operational constraints that guide how agents i
Pythonagent-skills
josephmisiti/awesome-machine-learning
josephmisiti/awesome-machine-learning
71,702GitHubView on GitHub
This project is a comprehensive, community-driven directory of machine learning resources, software libraries, and educational materials. It serves as a centralized knowledge base for developers and researchers, organizing tools and frameworks by their primary programming language and technical domain to simplify disco
Python
python/cpython
python/cpython
71,643GitHubView on GitHub
CPython is the primary, community-maintained reference implementation of the Python programming language. It functions as a high-level, interpreted execution environment that compiles source code into platform-independent bytecode for processing by a stack-based virtual machine. The runtime manages memory through a com
Python
pallets/flask
pallets/flask
71,240GitHubView on GitHub
Flask is a micro web framework designed for building web services with a flexible, lightweight structure. It functions as a standard-compliant WSGI application server, providing the essential tools required to register URL routes, handle incoming HTTP requests, and construct responses. By utilizing a central applicatio
Pythonflaskjinjapallets
PaddlePaddle/PaddleOCR
PaddlePaddle/PaddleOCR
70,931GitHubView on GitHub
PaddleOCR is a comprehensive optical character recognition framework designed for detecting and transcribing text from images and documents into structured, machine-readable formats. It provides a modular computer vision pipeline that decouples image preprocessing, text detection, and character recognition into indepen
Pythonai4sciencechineseocrdocument-parsing
vllm-project/vllm
vllm-project/vllm
70,745GitHubView on GitHub
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen
Pythonamdblackwellcuda

76 repos

Awesome Python GitHub repositories

We curate 76 open-source Python repositories. AI-ranked by relevance — refine with filters, or browse the highest-voted projects in the community.

We'll search the best matching repositories with AI.

openai/whisper
openai/whisper
94,839GitHubView on GitHub
This project is a speech recognition and translation engine that utilizes a sequence-to-sequence transformer architecture to convert audio into text. It is built upon a weakly supervised learning framework, which leverages large-scale, unlabelled audio-transcript data to create generalized speech representations capabl
Python
microsoft/markitdown
microsoft/markitdown
87,305GitHubView on GitHub
This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine
Pythonautogenautogen-extensionlangchain
django/django
django/django
86,891GitHubView on GitHub
Django is a full-stack web framework designed for rapid backend development. It provides an integrated environment for building data-driven applications by combining an object-relational mapping layer for database management with a modular request-response pipeline for handling HTTP traffic. The framework emphasizes se
Pythonappsdjangoframework
home-assistant/core
home-assistant/core
84,936GitHubView on GitHub
Home Assistant is a centralized home automation platform designed to orchestrate diverse internet-connected devices and services. It functions as a local-first control system that normalizes heterogeneous hardware protocols into a unified set of entities, attributes, and services. The core architecture relies on an eve
Pythonasynciohacktoberfesthome-automation
3b1b/manim
3b1b/manim
84,611GitHubView on GitHub
Manim is a Python-based computational geometry framework designed for programmatic video production. It functions as a mathematical animation engine, allowing users to generate high-fidelity visual content by scripting scene definitions rather than using traditional timeline-based editing software. The library is built
Python3b1b-videosanimationexplanatory-math-videos
bregman-arie/devops-exercises
bregman-arie/devops-exercises
81,169GitHubView on GitHub
This project is a comprehensive educational curriculum designed to build proficiency across modern infrastructure, cloud-native technologies, and systems administration. It functions as a reference library and interview preparation resource, offering a structured collection of conceptual questions, practical coding cha
Pythonansibleawsazure
hacksider/Deep-Live-Cam
hacksider/Deep-Live-Cam
79,568GitHubView on GitHub
Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a hi
Pythonaiai-deep-fakeai-face
fighting41love/funNLP
fighting41love/funNLP
78,999GitHubView on GitHub
This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati
Python
browser-use/browser-use
browser-use/browser-use
78,576GitHubView on GitHub
Browser-use is a framework for building autonomous agents that navigate, interact with, and extract data from web interfaces using natural language instructions. By acting as an orchestration layer between large language models and browser automation protocols, it enables the execution of complex, multi-step workflows
Pythonai-agentsai-toolsbrowser-automation
tensorflow/models
tensorflow/models
77,684GitHubView on GitHub
This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines fo
Python
d2l-ai/d2l-zh
d2l-ai/d2l-zh
75,708GitHubView on GitHub
This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners
Pythonbookchinesecomputer-vision
swisskyrepo/PayloadsAllTheThings
swisskyrepo/PayloadsAllTheThings
75,346GitHubView on GitHub
This project is a comprehensive, community-sourced knowledge base designed for security professionals and researchers. It functions as a centralized repository of offensive security techniques, providing a structured collection of exploit payloads, attack vectors, and methodologies for conducting vulnerability assessme
Pythonbountybugbountybypass
infiniflow/ragflow
infiniflow/ragflow
73,425GitHubView on GitHub
This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin
Pythonagentagenticagentic-ai
sherlock-project/sherlock
sherlock-project/sherlock
72,906GitHubView on GitHub
Sherlock is a command-line automation tool designed to orchestrate software build, execution, and deployment workflows. It functions as an ephemeral runtime orchestrator that executes applications directly from source code, bypassing the need for persistent system-wide installations or manual dependency management. By
Pythonclicticybersecurity
anthropics/skills
anthropics/skills
71,987GitHubView on GitHub
This project provides a standardized framework for extending the functional range of artificial intelligence agents through a registry of modular, declarative instructions. It enables agentic workflow automation by allowing developers to define task-specific behaviors and operational constraints that guide how agents i
Pythonagent-skills
josephmisiti/awesome-machine-learning
josephmisiti/awesome-machine-learning
71,702GitHubView on GitHub
This project is a comprehensive, community-driven directory of machine learning resources, software libraries, and educational materials. It serves as a centralized knowledge base for developers and researchers, organizing tools and frameworks by their primary programming language and technical domain to simplify disco
Python
python/cpython
python/cpython
71,643GitHubView on GitHub
CPython is the primary, community-maintained reference implementation of the Python programming language. It functions as a high-level, interpreted execution environment that compiles source code into platform-independent bytecode for processing by a stack-based virtual machine. The runtime manages memory through a com
Python
pallets/flask
pallets/flask
71,240GitHubView on GitHub
Flask is a micro web framework designed for building web services with a flexible, lightweight structure. It functions as a standard-compliant WSGI application server, providing the essential tools required to register URL routes, handle incoming HTTP requests, and construct responses. By utilizing a central applicatio
Pythonflaskjinjapallets
PaddlePaddle/PaddleOCR
PaddlePaddle/PaddleOCR
70,931GitHubView on GitHub
PaddleOCR is a comprehensive optical character recognition framework designed for detecting and transcribing text from images and documents into structured, machine-readable formats. It provides a modular computer vision pipeline that decouples image preprocessing, text detection, and character recognition into indepen
Pythonai4sciencechineseocrdocument-parsing
vllm-project/vllm
vllm-project/vllm
70,745GitHubView on GitHub
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen
Pythonamdblackwellcuda

Awesome Python GitHub repositories

openai/whisper

microsoft/markitdown

django/django

home-assistant/core

3b1b/manim

bregman-arie/devops-exercises

hacksider/Deep-Live-Cam

fighting41love/funNLP

browser-use/browser-use

tensorflow/models

d2l-ai/d2l-zh

swisskyrepo/PayloadsAllTheThings

infiniflow/ragflow

sherlock-project/sherlock

anthropics/skills

josephmisiti/awesome-machine-learning

python/cpython

pallets/flask

PaddlePaddle/PaddleOCR

vllm-project/vllm

Awesome Python GitHub repositories

openai/whisper

microsoft/markitdown

django/django

home-assistant/core

3b1b/manim

bregman-arie/devops-exercises

hacksider/Deep-Live-Cam

fighting41love/funNLP

browser-use/browser-use

tensorflow/models

d2l-ai/d2l-zh

swisskyrepo/PayloadsAllTheThings

infiniflow/ragflow

sherlock-project/sherlock

anthropics/skills

josephmisiti/awesome-machine-learning

python/cpython

pallets/flask

PaddlePaddle/PaddleOCR

vllm-project/vllm