30 open-source projects similar to instapy/instapy, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best InstaPy alternative.
Crawlee is a web scraping framework designed for building scalable, reliable, and distributed data extraction pipelines. It provides a unified interface for managing headless browser automation and lightweight HTTP requests, allowing developers to handle complex web navigation, dynamic content rendering, and large-scale data collection within a single, modular architecture. The project distinguishes itself through its resource-aware concurrency controller, which dynamically scales task execution based on real-time CPU and memory usage to prevent host machine exhaustion. It also features a rob
OpenFang is an operating system for LLM agents designed to orchestrate autonomous agents with built-in task scheduling, tool sandboxing, and multi-model routing. It provides a secure AI execution environment that integrates prompt injection scanning, cryptographic audit trails, and resource metering to ensure controlled processing. The platform distinguishes itself through a comprehensive security architecture, featuring fuel-metered tool sandboxing and an immutable activity audit trail based on cryptographic hash-chains. It implements high-assurance identity verification via signed manifests
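A hash-chained audit trail of the kind mentioned in the OpenFang entry can be illustrated with a minimal sketch. This is not OpenFang's code: the names `append_entry`, `verify_chain`, and `GENESIS`, and the choice of SHA-256 over a JSON payload, are assumptions made for illustration only.

```python
import hashlib
import json

# Hypothetical starting value for the first entry's "prev" field.
GENESIS = "0" * 64


def _digest(event, prev_hash):
    # Hash the event together with the previous entry's hash.
    payload = json.dumps({"event": event, "prev": prev_hash}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def append_entry(chain, event):
    """Append an event, linking it to the hash of the previous entry."""
    prev_hash = chain[-1]["hash"] if chain else GENESIS
    entry = {"event": event, "prev": prev_hash, "hash": _digest(event, prev_hash)}
    chain.append(entry)
    return entry


def verify_chain(chain):
    """Return True only if every entry still matches its recorded hash and link."""
    prev_hash = GENESIS
    for entry in chain:
        if entry["prev"] != prev_hash:
            return False
        if entry["hash"] != _digest(entry["event"], prev_hash):
            return False
        prev_hash = entry["hash"]
    return True
```

Because each hash covers the previous one, editing any earlier event invalidates every later entry, which is what makes such a log tamper-evident.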
igbot is an Instagram automation bot that handles routine social media interactions on Instagram, including following, unfollowing, liking, and commenting on posts and profiles. It also functions as a video metadata extractor, retrieving download URLs, view counts, and audio information from Instagram Reels and public short videos. To carry out these actions, the bot combines GraphQL API scraping with headless browser automation, mimicking human behavior through randomized action timing, session cookie persistence, and user agent rotation. A task queue with exponential backoff retry ensures r
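The exponential backoff retry mentioned in the igbot entry can be sketched as follows. This is an illustrative sketch, not igbot's implementation: the function name `retry_with_backoff`, its parameters, and the use of full jitter (a random delay between zero and the current ceiling) are all assumptions.

```python
import random
import time


def retry_with_backoff(task, max_retries=5, base=1.0, cap=60.0,
                       sleep=time.sleep, rng=None):
    """Call task() until it succeeds or max_retries retries are used up.

    The delay ceiling doubles after each failure (base, 2*base, 4*base, ...),
    is limited to cap, and the actual delay is drawn at random below it.
    """
    rng = rng or random.Random()
    for attempt in range(max_retries + 1):
        try:
            return task()
        except Exception:
            if attempt == max_retries:
                raise  # out of retries: surface the last error
            ceiling = min(cap, base * (2 ** attempt))
            sleep(rng.uniform(0, ceiling))
```

Passing `sleep` and `rng` as parameters keeps the sketch easy to test without real waiting; the random delay also loosely mirrors the randomized action timing the entry describes.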
AiToEarn is an artificial intelligence-driven social media management platform designed to centralize content orchestration, audience engagement, and performance analytics. It provides a unified workspace where users can generate, optimize, and schedule content across multiple social networks while automating interactions through intelligent language and media models. The platform distinguishes itself by integrating sentiment analysis to categorize audience engagement and identify purchase intent, allowing for personalized automated responses. It utilizes a credit-based accounting system to m
geckodriver is a browser automation driver and W3C WebDriver implementation. It functions as a proxy server that translates standardized WebDriver commands into internal instructions for web browsers based on the Gecko engine. The project enables the programmatic control of Gecko-based browsers to simulate user interactions and automate repetitive web tasks. It supports both standard browser automation and headless browser orchestration for workflows executing without a graphical user interface. The software is used for automated web testing to verify website functionality and user interface
This project is a Node.js framework designed for headless browser automation, enabling the creation of automated messaging clients. It functions by controlling a headless browser instance to programmatically interact with the messaging interface, allowing developers to simulate user sessions and manage complex chat workflows. The library distinguishes itself through its comprehensive session management and event-driven architecture. It supports persistent authentication by serializing session data to local or remote storage, ensuring that automated clients can maintain continuous connectivity
Dev-browser is a browser automation framework and headless browser controller that provides a sandboxed script runner for executing web tasks. It functions as a vision-based web automator and a specialized interface for large language models, enabling navigation of and interaction with web pages within isolated execution environments. The project distinguishes itself by converting complex web pages into simplified representations and coordinate-based maps, allowing AI agents to analyze layouts and perform actions based on pixel locations. It employs a mapping system that assigns unique identif
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
Consent-O-Matic is a browser extension and cookie consent automation tool designed to automatically interact with privacy notices and cookie banners. It utilizes a DOM interaction engine and a privacy preference manager to map user choices to automated actions on third-party consent management providers. The project features a custom rule engine that allows for the import of external rule lists from user-provided URLs to target specific website behaviors. It employs a targeting system that combines CSS selectors with text, style, and iframe filters to locate and interact with precise web elem
Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveraging headless browser orchestration, the system handles dynamic, JavaScript-heavy pages to ensure comprehensive data capture. The platform distinguishes itself through its focus on agentic workflows, providing a programmatic interface that allows autonomous agents to perform live
Skyvern is an autonomous web navigation agent and browser-based workflow orchestrator that uses large language models to execute multi-step tasks on websites. By translating natural language instructions into actionable browser commands, the framework enables the automation of complex user workflows, including data extraction and interface interaction, without manual intervention. The platform distinguishes itself through a focus on secure, self-hosted infrastructure and stealth-oriented execution. It utilizes containerized browser isolation to maintain consistent environments and employs pro
This project is an agentic framework designed to enable autonomous web navigation and browser automation. It functions as a controller that translates natural language instructions into deterministic browser actions, allowing agents to interact with websites, perform data extraction, and manage complex authentication flows. By leveraging accessibility trees and semantic element resolution, the framework mimics human-like navigation, moving beyond brittle DOM selectors to interact reliably with modern web interfaces. The framework distinguishes itself through its focus on secure, scalable exec
This project is a comprehensive suite of AI tools and frameworks, featuring an LLM multi-agent orchestrator, an autonomous agent runtime, and a stateful application framework. It provides the infrastructure to build and manage specialized AI agents capable of coordinating complex tasks through graph-based workflows and shared state. The system is distinguished by its implementation of the Model Context Protocol, allowing for standardized resource discovery and communication between AI clients and servers. It further includes an AI-powered documentation generator designed to analyze source cod
This project is an AI agent orchestration platform that provides a visual environment for building, testing, and deploying complex automation workflows. It functions as a low-code development interface where users can chain discrete functional blocks into dependency-aware pipelines to integrate artificial intelligence with external data and services. The platform supports the creation of intelligent conversational agents, automated business processes, and multi-service API orchestrations within a unified workspace. The platform distinguishes itself through its event-driven integration engine,
NetBird is a zero-trust networking platform that builds secure, encrypted peer-to-peer overlay networks using the WireGuard protocol. It functions as a software-defined perimeter, connecting distributed infrastructure across cloud environments and physical locations while hiding network resources from the public internet. By integrating with external identity providers, the platform enforces granular access control and identity-based segmentation for every user and device. The platform distinguishes itself through extensive automation and programmatic management capabilities. It provides a ce
Vue is a progressive, component-based JavaScript framework designed for building reactive user interfaces and single-page applications. It centers on a declarative template system that transforms HTML into efficient render functions, allowing developers to organize complex interfaces into isolated, reusable units that synchronize automatically with application state. The framework distinguishes itself through a dependency-tracking reactivity system that monitors data access during rendering to trigger precise updates. It provides a flexible architecture that supports both incremental adoption
Neko is a virtual desktop infrastructure platform that provides containerized browser isolation and remote desktop environments. It enables users to host secure, ephemeral browser instances that can be accessed and managed through a standard web browser, ensuring consistent execution across different host systems. The platform distinguishes itself through its collaborative capabilities, allowing multiple users to view and interact with a single shared browser session in real time. It synchronizes keyboard, mouse, and gamepad inputs from multiple participants while providing integrated tools f
Douyin-Bot is a Python-based automation tool designed for interacting with Douyin accounts through automated likes, follows, and comments. It functions as a computer vision social bot that uses face recognition and image analysis to filter profiles based on visual criteria. The project distinguishes itself by using aesthetic content filtering to trigger social actions only when a user meets a specified beauty threshold. To reduce the risk of account bans, it incorporates account safety management that mimics human behavior through randomized delay scheduling. The framework covers a broad ran
BiliBiliToolPro is an account automation tool for Bilibili designed to manage multiple profiles, claim rewards, and maintain session cookies via QR code authentication. It functions as a growth bot and reward collector that automates daily activities to increase account rank and experience points. The project is built as a containerized automation suite, allowing for scheduled task execution across Docker, Kubernetes, or other automation panels. It features multi-account profile isolation, which separates user credentials and session data to execute tasks independently for different accounts.
Playwright-cli is a command line interface for executing web tasks and automating browser interactions using the Playwright framework. It serves as a browser binary manager for downloading and installing specific browser engines and their required system dependencies, as well as a tool for running automated test suites across multiple engines to verify application behavior. The utility functions as a browser session controller, managing browser profiles and persistent storage states via the command line. It enables the execution of automation suites across different browser engines and config
nodriver is an asynchronous Chromium browser automation framework that provides headless control and web scraping capabilities. It functions as a Chrome DevTools Protocol client, allowing for granular engine control by attaching directly to the browser's debug port without the need for external driver binaries. The framework is specifically designed as an anti-bot detection bypass tool. It modifies browser fingerprints and protocol headers to evade automated security systems, handle security warnings, and bypass common obstacles like insecure connection alerts. The system covers a broad rang
php-webdriver is a browser automation library and PHP language binding for the Selenium WebDriver protocol. It serves as a web application testing tool that allows for the programmatic control of web browsers to simulate user interactions and navigate web pages. The project implements the WebDriver protocol to manage browser sessions and execute automated functional tests. It enables integration with Selenium servers to perform automated web testing and support headless browser workflows. The library provides capabilities for configuring browser properties and managing the lifecycle of remot
Puppeteer Sharp is a web browser automation library and a headless Chrome .NET API. It provides a type-safe C# interface for controlling headless browsers, functioning as a Chrome DevTools Protocol wrapper that translates .NET method calls into JSON-RPC messages. The project enables programmatic navigation of pages, interaction with elements, and the execution of JavaScript within a .NET environment. It serves as an end-to-end testing framework for simulating user workflows and verifying web application behavior. Additional capabilities include automated screenshot generation for visual regr
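The translation of method calls into protocol messages described in the Puppeteer Sharp entry can be illustrated with a small sketch. It is written in Python rather than C#, and it is not Puppeteer Sharp's code: the class `CdpMessageBuilder` is hypothetical. It assumes only the basic Chrome DevTools Protocol command shape of an `id`, a `method`, and a `params` object.

```python
import itertools
import json


class CdpMessageBuilder:
    """Hypothetical helper that serializes a call into a CDP-style command."""

    def __init__(self):
        # Each command carries a unique id so a reply can be matched to it.
        self._ids = itertools.count(1)

    def build(self, method, **params):
        message = {"id": next(self._ids), "method": method, "params": params}
        return json.dumps(message)


builder = CdpMessageBuilder()
# "Page.navigate" with a "url" parameter is a standard CDP command.
navigate = builder.build("Page.navigate", url="https://example.com")
```

A real client would send this string over the browser's WebSocket debugging endpoint and wait for the response carrying the same `id`; that transport layer is omitted here.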
This project is a TypeScript-based private API SDK and automation framework for Instagram. It serves as a server-side toolkit for Node.js that interfaces with undocumented API endpoints to programmatically manage profiles and automate account operations. The library focuses on mimicking authentic user behavior by simulating mobile application network traffic patterns and headers. It uses stateful session management to maintain authentication tokens and employs a sequence-based coordination model to execute complex workflows, such as multi-step media uploads. The framework covers several high-
This project is a Model Context Protocol server that connects large language models to the Xiaohongshu social media platform. It acts as a connector and API wrapper, enabling language models to programmatically search, read, and publish media and text. The system provides automation for content discovery and publishing, allowing for the creation of image and video posts with associated titles and descriptions. It also facilitates social engagement by managing the posting of comments and tracking engagement metrics for specific entries. The tool covers data retrieval for user profiles, post d
This project is a social media automation tool designed to publish videos and images across multiple social networks programmatically. It functions as a headless browser content publisher and a multi-platform posting API, allowing for automated social media posting and content distribution. The system utilizes browser automation to execute uploads and logins on platforms without official public APIs. It features a social media command-line manager for executing batch media uploads and managing account sessions, as well as a programmatic interface for triggering uploads and scheduling content
MoneyPrinterV2 is an integrated automation suite designed for affiliate marketing, lead acquisition, and scheduled social media distribution. It combines large language model driven content creation with workflows for acquiring sales leads and managing online presence. The system automates the production of short-form video content by generating scripts and visual assets. It further extends this capability through a social media scheduler and background job manager that publishes content to external networks. The platform also includes tools for automated lead generation, utilizing web scrap
This project is a community-curated directory of open-source software designed for deployment in private server environments and home labs. It serves as a comprehensive resource for discovering independent, self-hosted alternatives to mainstream cloud services, enabling users to maintain full data ownership and control over their digital infrastructure. The directory is structured through a hierarchical taxonomy that organizes a vast collection of applications into logical categories, ranging from media management and data analytics to private communication and team productivity tools. It dis
Automa is a browser-based automation platform that enables users to build, schedule, and execute repetitive web tasks through a visual, no-code interface. By operating as a browser extension, it provides a canvas-based environment where users construct workflows by connecting functional blocks to interact with web elements, manage browser state, and process data. The platform distinguishes itself through its deep integration with the browser environment, allowing for complex orchestration such as event-driven triggers, cross-origin request handling, and the ability to package workflows as sta