14 repositories
Tools for handling multiple conversion tasks in a single request.
Distinguishing note: Focuses on batch endpoint optimization.
14 GitHub repositories matching web development · Batch Processing.
gstack is an AI agent framework and development workflow system designed to automate the software development lifecycle. It coordinates specialized AI personas to manage tasks across product design, engineering management, and quality assurance, transforming product intent into technical specifications and final releases. The project is distinguished by its deep integration of headless browser automation and semantic code memory. It utilizes a persistent Chromium daemon for web scraping and visual auditing, and implements a searchable knowledge base that logs architectural decisions and repos
Sends multiple browser instructions in a single request to eliminate round-trip latency for remote agents.
Pandoc is a universal document converter that translates content between a wide range of markup and binary formats. It functions by parsing input documents into a unified intermediate abstract syntax tree, which serves as the foundation for consistent manipulation and transformation across diverse output types. The system is distinguished by its modular reader-writer pipeline, which decouples input parsing from output generation to allow for granular control over document structure. Users can programmatically manipulate this intermediate tree through a robust filter system, supporting both ex
Processes multiple document snippets efficiently in a single batch request.
This project is an agentic framework designed to enable autonomous web navigation and browser automation. It functions as a controller that translates natural language instructions into deterministic browser actions, allowing agents to interact with websites, perform data extraction, and manage complex authentication flows. By leveraging accessibility trees and semantic element resolution, the framework mimics human-like navigation, moving beyond brittle DOM selectors to interact reliably with modern web interfaces. The framework distinguishes itself through its focus on secure, scalable exec
Processes multiple browser operations in a single invocation to support complex automation workflows.
The Gemini Cookbook is a comprehensive collection of implementation patterns, code samples, and development guides designed for building applications with Google Gemini models. It serves as a central resource for developers to integrate multimodal generative artificial intelligence into their software, providing the necessary frameworks to manage model interactions, stateful workflows, and structured data extraction. The repository distinguishes itself by offering specialized toolkits for autonomous agent orchestration, enabling the construction of agents that can execute code, browse the web
Handles high-volume, asynchronous tasks with specific capacity limits for concurrent jobs and token counts.
This project is a JavaScript code protection tool designed to transform source code into a version that is difficult for humans to read. Its primary purpose is to protect intellectual property and prevent reverse engineering by altering the original program logic. The tool employs several advanced techniques to hinder analysis, including control flow flattening and the injection of dead code. It can compile functions into custom bytecode executed by an embedded virtual machine and encrypt string literals to prevent static analysis of text. Additional capabilities include anti-debugging mecha
Processes multiple source files simultaneously while maintaining consistent identifier mapping through a shared cache.
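A shared cache of this kind can be illustrated with a minimal sketch. It is not the tool's implementation or API: `rename_identifiers`, the `names` set, and the `_0x....` naming scheme are hypothetical and only show why one cache passed across files keeps the mapping consistent.

```python
import re

def rename_identifiers(source: str, names: set, cache: dict) -> str:
    """Replace each identifier listed in `names` with a generated name.

    `cache` is shared across calls, so an identifier seen in one file
    gets the same replacement in every later file. (Hypothetical helper.)
    """
    def repl(match):
        ident = match.group(0)
        if ident not in names:
            return ident  # leave untracked identifiers alone
        if ident not in cache:
            # Assign the next sequential name on first sight.
            cache[ident] = f"_0x{len(cache):04x}"
        return cache[ident]

    return re.sub(r"[A-Za-z_]\w*", repl, source)

# One cache for the whole batch of files.
shared_cache = {}
tracked = {"add", "total"}
file_a = rename_identifiers("total = add(x)", tracked, shared_cache)
file_b = rename_identifiers("print(add(1))", tracked, shared_cache)
```

Because both calls receive the same `shared_cache`, `add` maps to the same generated name in `file_a` and `file_b`.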
VERT is a media conversion platform designed to transform images, audio, video, and documents into various formats. It functions as a batch file processor that allows users to apply consistent conversion settings and custom naming patterns to multiple assets simultaneously, bundling the final outputs into compressed archives for streamlined organization. The system distinguishes itself through a distributed architecture that routes heavy media transcoding tasks across local hardware or remote server infrastructure. This approach optimizes performance by balancing computational workloads, allo
Applies conversion settings to multiple files simultaneously and bundles outputs into compressed archives.
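The pattern of one setting applied to many files, followed by a single archive, can be sketched with the Python standard library. This is not VERT's code: `convert` is a placeholder transformation, and the `{stem}_converted.{ext}` naming pattern is an assumed example.

```python
import io
import zipfile

def convert(data: bytes) -> bytes:
    # Placeholder for a real conversion step (assumption for illustration).
    return data.upper()

def convert_and_bundle(files: dict, pattern: str = "{stem}_converted.{ext}",
                       ext: str = "txt") -> bytes:
    """Convert every file with the same settings and return one zip archive."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for name, data in files.items():
            stem = name.rsplit(".", 1)[0]  # drop the original extension
            archive.writestr(pattern.format(stem=stem, ext=ext), convert(data))
    return buffer.getvalue()

bundle = convert_and_bundle({"a.txt": b"hi", "b.txt": b"yo"})
bundled_names = zipfile.ZipFile(io.BytesIO(bundle)).namelist()
```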
Instructor is a framework designed for structured data extraction, validation, and language model integration. It functions as a library that transforms unstructured text into validated, type-safe objects by leveraging schema definitions and model-specific tool-calling capabilities. By acting as a validation middleware, the project ensures that language model outputs strictly conform to defined data structures. The library distinguishes itself through a robust validation-based retry loop that automatically re-submits failed responses with error feedback to iteratively correct schema complianc
Groups multiple data extraction tasks into a single operation to reduce costs and improve efficiency across supported language model providers.
Laravel-Excel is an integration library for importing and exporting spreadsheet data between Laravel applications and Excel or CSV files. It provides a suite of tools for bidirectional spreadsheet integration, including a system for reading workbooks and mapping data into database models. The library distinguishes itself through a background processing system that handles large imports and exports using chunking and job queues. It supports template-driven exports by converting HTML tables from view templates into spreadsheet cells. The toolset covers broad capabilities for large dataset proc
Offloads heavy import and export tasks to background workers to prevent request timeouts.
This project is a Python-based web scraping tool and command line image downloader designed to automate the retrieval of images from Google Images. It functions as an image dataset collector, allowing users to gather large sets of images for data analysis or research through a terminal interface or programmatic scripts. The tool features advanced search filtering to restrict results by file format, color, size, aspect ratio, and usage rights. It also supports reverse image search to find visually similar media based on a provided URL and offers search scope expansion to increase result volume
Iterates through lists of keywords to retrieve multiple image datasets in a single batch run.
Kreuzberg is a document extraction engine that converts PDFs, Office files, images, and over 90 other formats into clean, structured text and metadata. It is built around a compiled Rust core that can be used as a native library, a command-line tool, a REST API server, or a WebAssembly module for browser-based processing. The system is designed to run entirely on self-hosted infrastructure, with no data leaving the user's environment. What distinguishes Kreuzberg is its breadth of integration surfaces and its pipeline architecture. It exposes extraction capabilities through native bindings fo
Processes multiple byte arrays concurrently for text extraction with concurrency management.
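Bounded concurrency over a batch of byte arrays can be sketched generically with `asyncio`. This does not use Kreuzberg's bindings: `extract_text` is a hypothetical stand-in for real extraction, and the `limit` parameter is an assumed way to cap concurrent work.

```python
import asyncio

async def extract_text(data: bytes) -> str:
    # Hypothetical stand-in for a real extraction call.
    await asyncio.sleep(0)
    return data.decode("utf-8", errors="replace")

async def extract_many(blobs: list, limit: int = 2) -> list:
    """Extract text from every blob, running at most `limit` at a time."""
    semaphore = asyncio.Semaphore(limit)

    async def worker(blob: bytes) -> str:
        async with semaphore:  # blocks while `limit` extractions are running
            return await extract_text(blob)

    # gather returns results in the same order as the inputs.
    return await asyncio.gather(*(worker(blob) for blob in blobs))

def run_batch(blobs: list, limit: int = 2) -> list:
    return asyncio.run(extract_many(blobs, limit))

texts = run_batch([b"one", b"two", b"three"])
```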
fswatch is a cross-platform file change monitor that watches files and directories for modifications using the native event API of each operating system. It functions as a multi-backend file watcher library that selects from platform-specific backends such as inotify, kqueue, and FSEvents, with a polling fallback when no native API is available. The tool groups multiple file changes into a single event batch and triggers one command per batch, while also executing user-specified commands each time monitored files or directories are modified. The project applies include, exclude, event-type, a
Groups multiple file changes into a single event batch to reduce overhead for rebuilds or test reruns.
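The batching idea can be shown with a small, self-contained sketch: changes that arrive close together are merged into one batch, so a rebuild runs once per burst and not once per file. This is not fswatch's implementation; `batch_events` and the `window` parameter are hypothetical names.

```python
def batch_events(events: list, window: float = 0.5) -> list:
    """Group (timestamp, path) events into batches.

    An event joins the current batch when it arrives within `window`
    seconds of the previous event; otherwise it starts a new batch.
    """
    batches = []
    last_seen = None
    for timestamp, path in sorted(events):
        if last_seen is not None and timestamp - last_seen <= window:
            batches[-1].append(path)
        else:
            batches.append([path])
        last_seen = timestamp
    return batches

changes = [(0.0, "src/a.py"), (0.1, "src/b.py"), (2.0, "README.md")]
grouped = batch_events(changes)
```

Here the first two changes fall inside the same window and form one batch, so a hypothetical rebuild command would run twice for three file changes.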
ServiceStack is a high-performance .NET web framework designed for building type-safe APIs using strongly-typed request and response objects. It functions as a message-based API engine that decouples business logic from the transport layer, allowing services to be exposed via multiple protocols including HTTP, gRPC, and various message queue providers. The framework is distinguished by its type-safe API generator, which produces native client SDKs and data transfer objects from service metadata across multiple languages. It also includes a distributed service gateway for microservices orchest
Handles collections of request objects in a single execution to optimize database transactions and cache lookups.
AWS Powertools for Python is a utility framework designed for building production-ready Python functions on AWS Lambda. It provides a comprehensive suite of tools for observability, event parsing, routing, and idempotency management to streamline the development of serverless applications. The project distinguishes itself through specialized capabilities for event-driven architectures and AI agent orchestration. It enables the implementation of AI agents by exposing functions as tools via OpenAPI schemas and managing conversation states. Additionally, it features an idempotency library that p
Processes collections of requests in a single execution and returns results in the original order.
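Order-preserving batch execution can be sketched with the standard library. This is not the Powertools API: `handle` and `process_batch` are hypothetical names, and the example relies only on the documented behavior that `Executor.map` yields results in input order.

```python
from concurrent.futures import ThreadPoolExecutor

def handle(request: dict) -> dict:
    # Hypothetical per-request handler.
    return {"id": request["id"], "doubled": request["value"] * 2}

def process_batch(requests: list, max_workers: int = 4) -> list:
    """Run all requests in one call and return results in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() preserves input order even if workers finish out of order.
        return list(pool.map(handle, requests))

responses = process_batch([{"id": i, "value": i} for i in range(5)])
```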
immich-power-tools is a library management tool and metadata editor designed for Immich. It provides a set of utilities for performing bulk editing, organization, and curation of photos, people, and albums. The project features a gallery curator and a facial recognition cleanup system that merges duplicate person profiles. It includes an album suggestion engine to group assets and a natural language image search that translates human-language queries into structured filters. The toolset covers broad capability areas including digital photo analysis through an analytics dashboard, photo metad
Applies a single modification to all assets matching specific metadata criteria.