30 open-source projects similar to cloud-annotations/cloud-annotations, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Cloud Annotations alternative.
CVAT is an open-source computer vision annotation tool and visual dataset management platform. It provides a self-hosted interface for labeling images, videos, and 3D data to create datasets for vision AI models. The platform features AI-assisted data labeling to automate the creation of masks and bounding boxes, utilizing a plug-in system to connect external machine learning models. It includes a consensus-based quality assurance system that verifies label accuracy by comparing independent annotations. The system covers collaborative team management, project organization through task decomp
BBox-Label-Tool is a web-based utility designed for labeling image collections and defining spatial object boundaries to support supervised machine learning tasks. It provides an interface for drawing rectangular bounding boxes on images, allowing users to record coordinate data for object detection and visual recognition datasets. The tool operates entirely within the browser, utilizing local file processing to read images directly from the user's system without requiring data uploads. It maintains annotation records through browser-based storage, ensuring that spatial data persists across p
Doccano is a collaborative data labeling platform and machine learning dataset management system. It provides a web-based interface for teams to import raw text, mark datasets, and export structured annotations for model training. The project specifically supports text annotation for classification and named entity recognition tasks. It enables teams to coordinate multiple users on a single project to maintain consistent labeling guidelines and increase the speed of dataset creation. The system includes tools for data management and team coordination, providing the ability to import raw data
labelImg is a desktop image annotation tool and dataset preparation utility used to create labeled datasets for computer vision training. It provides a graphical interface for drawing bounding boxes around objects in images and assigning them class labels to build ground truth data for machine learning models. The software specifically supports the Pascal VOC XML annotation format, exporting image coordinates and class names into standard XML or text structures. It allows users to load predefined class lists from text files to standardize naming across an entire project. Beyond initial label
CVAT is an open-source, web-based platform designed for annotating images, videos, and 3D point clouds to create high-quality training datasets for machine learning. It functions as a containerized server that orchestrates the entire lifecycle of computer vision data, from initial task creation and manual labeling to quality assurance and final dataset export. The platform distinguishes itself through deep integration with machine learning models, allowing users to deploy custom AI models as serverless functions for automated object detection, tracking, and skeleton annotation. It supports co
X-AnyLabeling is an AI-assisted annotation platform and computer vision labeling tool. It provides an interface for annotating images and videos using polygons and rectangles to create training sets for machine learning models. The project distinguishes itself through the integration of external AI models via a plugin-based inference backend, allowing for automated generation of candidate labels and the execution of specialized tasks like pose estimation and object detection. It also functions as an optical character recognition tool for extracting text and layout information from document im
Doccano is a collaborative labeling platform and text annotation tool designed to create training data for machine learning. It provides a specialized interface for performing sequence labeling and text classification on natural language datasets. The system functions as a supervised learning dataset manager, allowing multiple users to coordinate within a shared workspace to label datasets for natural language processing tasks. It supports the preparation of raw text data for model training by converting unstructured documents into structured labeled examples. The platform includes capabilit
labelImg is a computer vision labeling tool and image bounding box annotator used to create training datasets for machine learning models. It functions as a desktop utility for drawing rectangular labels on images and saving object coordinates and class names in common machine learning formats. The tool is specifically designed to generate and edit PascalVOC formatted XML files and create image labels in the text-based format required by YOLO object detection pipelines. The software covers object detection annotation and training data preparation, including the ability to manage label catego
Colanode is a local-first collaboration platform designed for shared documents, chat, and databases. It provides a self-hosted suite for team collaboration and knowledge management, allowing users to maintain full control over their data and privacy on their own infrastructure. The platform is distinguished by a synchronization engine that uses WebSockets for real-time data broadcasting and a local-first approach to ensure work continues offline. It incorporates AI-powered retrieval through vector-based semantic search, enabling users to find information based on meaning across documents and
Vuefire is a Firebase integration library for Vue.js that synchronizes backend state and real-time data with reactive frontend components. It functions as a backend-as-a-service client and synchronization tool, bridging remote Firebase documents, collections, and authentication status with local application state. The library provides specialized wrappers for managing user authentication workflows and real-time database synchronization. It automates the process of linking remote data sources to reactive objects and arrays, ensuring the user interface updates instantly as backend data changes.
mimic-code is a clinical data analysis framework and toolset for processing deidentified electronic health records and intensive care unit data. It provides a healthcare SQL query library and a processing tool to transform raw health records into formats suitable for longitudinal analysis and machine learning. The project features a medical research notebook environment that integrates with cloud-hosted datasets, allowing for remote querying and analysis. It includes a DICOM imaging pipeline to retrieve chest radiographs and link medical imaging with structured clinical metadata. The framewo
Stream-Framework is a Python library for building scalable activity streams, news feeds, and notification systems. It functions as an activity stream engine that manages the distribution, storage, and retrieval of chronological event streams for large user bases. The framework utilizes a combination of Cassandra and Redis to provide a scalable feed architecture, employing in-memory caching for low-latency retrieval and distributed storage for high availability. It features an asynchronous fan-out mechanism to distribute activities to multiple follower feeds and a real-time synchronization lay
Horizon is a realtime API server and RethinkDB backend designed to push database changes instantly to front-end clients. It utilizes a WebSocket data streaming API to synchronize data between the database and user interfaces without requiring manual polling. The project integrates an OAuth identity manager for verifying user identities through third-party providers and a role-based access control system to define granular permissions for viewing or modifying database documents. It is delivered as a containerized backend framework, allowing the server and its dependencies to be deployed as a p
Stoatchat is a real-time chat server and messaging backend that utilizes WebSockets to handle instant communication, live event updates, and user presence tracking. The project incorporates an encrypted file storage system and cloud storage layer that applies AES encryption to binary data before upload. It also functions as an HTTP content proxy server to mask user IP addresses when fetching media from third-party services and as a push notification gateway to ensure reliable alert delivery via a background dispatch queue. The system includes role-based access control to manage user permissi
Realtime is a real-time data distribution and synchronization engine that enables applications to stream database changes and coordinate state between clients. It functions as a synchronization layer that monitors database write-ahead logs to provide change data capture and pushes updates to authorized clients via WebSockets. The project features a real-time presence server for tracking the online status of active users and a broadcast service for sending ephemeral messages without database persistence. It organizes communication through channel-based message routing and uses a structured JSO
This project is a toolkit and API designed for parsing, manipulating, and visualizing image annotations for computer vision tasks. It provides a programming interface to load and organize Common Objects in Context annotations, specifically for object detection, image segmentation, and keypoint estimation. The library includes tools for converting formatted JSON files into data structures that support the analysis of pixel-level masks and skeletal markers. It enables the visual verification of ground truth accuracy by rendering bounding boxes, segmentation masks, and keypoint markers directly
Sound-redux is a React-based audio player and SoundCloud API client designed for browsing and playing audio tracks. It functions as a Redux state manager to synchronize playback and track metadata across the application. The project integrates with the SoundCloud API to fetch and display music tracks and artist information. This enables the construction of a music streaming interface that connects a custom user interface to remote external services for public consumption. The application utilizes a Redux-based state management system and a component-based UI rendering model. It incorporates
Label Studio is a multi-modal data annotation platform designed to create and manage high-quality training datasets for machine learning. It functions as a self-hosted, containerized environment that supports secure, private deployments, including air-gapped configurations. The platform provides a centralized workspace for labeling diverse media types, such as images, text, audio, and time-series data, to support supervised and reinforcement learning workflows. The platform distinguishes itself through deep integration with machine learning backends, enabling active learning loops, automated
This project is a desktop screen capture and annotation utility designed for Linux environments. It provides an interactive graphical overlay that allows users to select specific screen regions, apply visual annotations such as shapes, text, and pixelation, and manage the resulting images through a configurable post-capture pipeline. The application distinguishes itself through deep system integration and automation capabilities. It operates as a persistent background daemon that monitors global hotkeys and supports inter-process communication via a system message bus, enabling users to trigg
HospitalRun Frontend is an offline-first progressive web application designed for hospital information system administration, enabling healthcare facilities to manage patient records, appointments, and clinical workflows through a web-based interface. The application stores patient data locally in the browser's IndexedDB database, allowing full functionality without a persistent internet connection, and synchronizes changes with the backend server via RESTful API calls when connectivity is restored. The system implements role-based access control routing that restricts navigation and feature
tui.image-editor is a JavaScript image manipulation library and web-based photo editor. It provides a browser-based interface for cropping, resizing, and applying filters to images using the HTML5 Canvas API. The project is distinguished by its role as a canvas-based annotation tool, allowing users to add text, shapes, and freehand drawings as graphic overlays. It offers extensive UI customization through theme configuration, interface text localization, and the ability to replace default icons with custom SVG files. The library covers geometry manipulation, visual filter application, and im
Pepperoni App Kit is a starter kit and project template for building cross-platform mobile applications for iOS and Android using React Native. It provides a pre-configured scaffold that establishes a modular architecture and shared codebase to bootstrap new projects. The kit integrates a predictable state management system with disk persistence for offline access and a controlled loop for coordinating asynchronous side effects and API interactions. It also includes built-in mechanisms for token-based session authentication and identity validation to maintain secure user logins. The project
Leaf is a machine learning framework and neural network architecture toolkit used for building, training, and deploying models. It functions as a hardware abstraction layer, mapping high-level computational graphs to low-level instructions across various CPU and GPU backends and operating systems. The system enables the design of flexible model structures through a modular architecture where reusable container layers encapsulate weights and mathematical operations. This allows for the composition of complex neural networks via nested components. The framework includes a data engineering pipe
mmaction2 is a PyTorch video understanding toolbox designed for training and evaluating deep learning models. It serves as a framework for action recognition, temporal localization, and spatio-temporal action detection, providing specialized tools for both pixel-based video analysis and skeleton-based action recognition. The project distinguishes itself through a modular architecture featuring registry-based component discovery and hierarchical, config-driven model assembly. It supports multi-modal feature fusion, integrating RGB frames, optical flow, and audio, and includes capabilities for
Realm is a mobile database: an alternative to SQLite & key-value stores
This project is a full-stack TypeScript framework and boilerplate designed for building subscription-based software services. It provides a pre-configured development environment that integrates a React frontend with a Node.js and Express server and a MongoDB data layer. The framework includes integrated modules for managing subscription billing, multi-tenant user management, and secure identity verification via social logins and passwordless authentication. It further enables real-time application synchronization through a WebSocket server for bidirectional data streaming between clients and
This project is a comprehensive instructional resource and tutorial guide for learning React and the broader frontend engineering ecosystem. It serves as a structured course covering the fundamentals of building user interfaces through component-based architecture. The project provides specialized guides on coordinating application data via Redux state management, integrating and mutating data using GraphQL and Relay, and building native mobile applications using a single codebase. It also includes a dedicated tutorial for verifying component behavior through unit testing, shallow rendering,
This project is an AI-powered screenshot manager and visual assistant designed for capturing screen content and processing it through large language models. It functions as an OCR translation application and screen annotation tool, allowing users to extract text from images and perform intelligent analysis of visual data. The software differentiates itself through an AI-driven OCR pipeline and the ability to convert screenshots into structured Markdown or HTML via layout-aware document transformation. It features a visual AI assistant capable of analyzing screen content and a prompt-engineere
VoTT is a computer vision annotation software and machine learning dataset preparation tool. It is a desktop application designed for drawing bounding boxes and assigning tags to objects in images and videos to create training datasets for object detection models. The application utilizes a cross-platform desktop interface to manage image and video assets. It features a local-first storage integration to handle large media assets directly from the host machine's file system and includes frame-rate controlled video sampling to extract specific images from video streams for labeling. The softw
Baserow is a self-hosted, no-code relational database platform built on PostgreSQL. It provides a spreadsheet-like interface for structuring and managing data without writing code, while exposing all database resources via a REST API to support headless architectures. The platform distinguishes itself by integrating large language models and embedding servers to power AI assistants and automated data generation. It further extends its utility as a no-code application builder, allowing users to create custom internal portals, dashboards, and business tools using visual logic and managed data.