43 repos
Awesome GitHub repositories, curated.
A community-curated directory of interesting public GitHub repositories. Ask in plain English — AI ranks by relevance. Save what you find.
- tesseract-ocr/tesseract
Stars: 72,460
Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into desktop, mobile, or server-side applications. By utilizing long short-term memory networks, the engine provides robust text extraction across more than one hundred languages and dozens of scripts. The project distinguishes itself through a sophisticated document layout analysis framework that employs a hybrid approach to resolve complex structures like multi-column text and tables. It offers extensive configurability, allowing users to refine recognition accuracy through custom linguistic models, user-defined dictionaries, and specialized training pipelines. The engine supports the generation of various structured outputs, including searchable PDFs with hidden text layers, and provides hardware-accelerated math kernels to optimize inference performance. Beyond core recognition, the system includes comprehensive tooling for image pre-processing, page segmentation, and the management of modular language data. It provides C and C++ APIs alongside various language-specific wrappers, enabling integration into diverse software environments. The engine is available as pre-built binary packages or can be compiled from source using standard system compilers.
Topics: hacktoberfest, lstm, machine-learning
- lobehub/lobehub
Stars: 72,403
LobeHub is a comprehensive multi-agent orchestration platform designed for building, configuring, and deploying specialized AI agents. It provides a unified chat-based gateway that allows users to manage autonomous agent teams across web, desktop, and mobile environments. By utilizing a framework that supports persistent memory and granular tool integration, the platform enables the execution of complex, multi-step workflows and domain-specific tasks. The platform distinguishes itself through an interactive artifact renderer that injects dynamic, visual UI elements directly into the chat stream, transforming conversational outputs into functional content. It features an extensible ecosystem where users can discover and share community-driven agents and skills. Furthermore, the system supports collaborative workspaces where multiple agents can be organized into teams to scale intelligence and refine content through parallel task execution. Beyond its core orchestration capabilities, the project provides a robust suite of tools for self-hosting and infrastructure management. It supports containerized deployment through standardized configurations, allowing for secure, private instances that maintain data sovereignty. The platform integrates with external services through a common interface for data access and tool interaction, ensuring that agents remain adaptable and capable of handling diverse, multimodal requirements. The project is designed for self-hosted environments and includes comprehensive documentation for containerized setup, environment configuration, and security management.
Topics: agent, agent-collaboration, agent-harness
- CompVis/stable-diffusion
Stars: 72,380
Stable Diffusion is a generative machine learning pipeline that synthesizes high-resolution visual content by performing iterative denoising within a compressed latent space. By mapping natural language embeddings into pixel outputs through conditioned probabilistic processes, the framework enables the generation of images from text prompts and the transformation of existing visual inputs based on semantic instructions. The architecture utilizes a modular execution environment that decouples model loading, scheduler logic, and inference components to support diverse hardware configurations. It distinguishes itself through a symmetric encoder-decoder backbone that preserves spatial information during refinement, alongside integrated safety filters and invisible watermarking for generated outputs. The system provides a comprehensive suite of tools for latent space generative modeling, including capabilities for inpainting, outpainting, and style transfer. These functions are exposed through standardized interfaces, allowing for the integration of advanced diffusion-based inference into broader software workflows.
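The loop described above (start from noise in a small latent, refine it step by step under a text condition, then decode to pixels) can be illustrated with a toy sketch. This is not the repository's code or API: `encode_prompt`, `predict_noise`, `decode`, and `generate` are hypothetical names, and the "denoiser" is a hand-written stand-in for a trained network, chosen only so the control flow is visible.

```python
import random


def encode_prompt(prompt, dim=4):
    """Hypothetical text encoder: derives a fixed vector from the prompt string."""
    rng = random.Random(prompt)
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def predict_noise(latent, condition):
    """Toy denoiser (assumption): treats the offset from the condition as noise.
    A real model would be a trained network conditioned on the text embedding."""
    return [z - c for z, c in zip(latent, condition)]


def decode(latent, scale=2):
    """Toy decoder (assumption): expands each latent value into `scale` outputs."""
    return [z for z in latent for _ in range(scale)]


def generate(prompt, steps=50, step_size=0.2, seed=0):
    condition = encode_prompt(prompt)
    rng = random.Random(seed)
    # Start from pure Gaussian noise in the small latent space.
    latent = [rng.gauss(0.0, 1.0) for _ in condition]
    for _ in range(steps):
        # Each step removes a fraction of the predicted noise.
        noise = predict_noise(latent, condition)
        latent = [z - step_size * n for z, n in zip(latent, noise)]
    # Only the final latent is decoded into the larger output space.
    return decode(latent), condition


if __name__ == "__main__":
    pixels, _ = generate("a red cube")
    print(len(pixels), "output values")
```

The point of the sketch is the shape of the computation: all refinement happens on the small latent vector, and the more expensive decode step runs once at the end.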