What are the best open-source alternatives to MediaCrawler?

30 open-source projects similar to nanmicoder/mediacrawler, ranked by shared features. Top picks: apify/crawlee, kovidgoyal/kitty, ultrafunkamsterdam/nodriver, sawyerhood/dev-browser, apify/crawlee-python, twintproject/twint, automaapp/automa, aria2/aria2, cli/cli, go-rod/rod.

Is apify/crawlee a good alternative to MediaCrawler?

Crawlee is a web scraping framework designed for building scalable, reliable, and distributed data extraction pipelines. It provides a unified interface for managing headless browser automation and lightweight HTTP requests, allowing developers to handle complex web navigation, dynamic content rend…

Is kovidgoyal/kitty a good alternative to MediaCrawler?

Kitty is a high-performance, GPU-accelerated terminal emulator designed to provide a consistent and extensible workspace across different operating systems. It leverages graphics hardware to render text, images, and complex layouts with low latency, while providing a robust environment for demandin…

Is ultrafunkamsterdam/nodriver a good alternative to MediaCrawler?

nodriver is an asynchronous Chromium browser automation framework that provides headless control and web scraping capabilities. It functions as a Chrome DevTools Protocol client, allowing for granular engine control by attaching directly to the browser's debug port without the need for external dri…

Is sawyerhood/dev-browser a good alternative to MediaCrawler?

Dev-browser is a browser automation framework and headless browser controller that provides a sandboxed script runner for executing web tasks. It functions as a vision-based web automator and a specialized interface for large language models, enabling the navigation and interaction of web pages wit…

Is apify/crawlee-python a good alternative to MediaCrawler?

Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is…

Is twintproject/twint a good alternative to MediaCrawler?

Twint is an open-source intelligence and data extraction framework designed to gather public social media information. It functions as a command-line utility that retrieves posts, user profiles, and follower lists directly from web interfaces, bypassing the need for official platform developer cred…

Is automaapp/automa a good alternative to MediaCrawler?

Automa is a browser-based automation platform that enables users to build, schedule, and execute repetitive web tasks through a visual, no-code interface. By operating as a browser extension, it provides a canvas-based environment where users construct workflows by connecting functional blocks to i…

Is aria2/aria2 a good alternative to MediaCrawler?

Aria2 is a multi-protocol command-line download manager designed to maximize bandwidth utilization by retrieving files from multiple sources and protocols simultaneously. It functions as an asynchronous, event-driven engine that handles complex download lifecycles, including peer-to-peer transfers…

Is cli/cli a good alternative to MediaCrawler?

GitHub CLI (gh) is a command-line interface that bridges local development workflows with GitHub's remote services. It functions as a terminal-based platform client, enabling users to manage repositories, issues, and pull requests directly from their command line through authenticated API interactions…

Is go-rod/rod a good alternative to MediaCrawler?

Rod is a Go library for browser automation and web scraping that drives Chromium-based browsers directly over the Chrome DevTools Protocol.

Open-source alternatives to MediaCrawler

30 open-source projects similar to nanmicoder/mediacrawler, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best MediaCrawler alternative.
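
The ranking rule described above (order projects by how many features they share with MediaCrawler) can be sketched in a few lines of Python. This is only an illustration, not the site's actual method: the function name `rank_by_shared_features`, the feature tags, and the `example/...` project names are all hypothetical.

```python
from typing import Dict, List, Set, Tuple


def rank_by_shared_features(
    target: Set[str], candidates: Dict[str, Set[str]]
) -> List[Tuple[str, int]]:
    """Return (project, shared_count) pairs, most shared features first."""
    scored = [(name, len(target & feats)) for name, feats in candidates.items()]
    # Sort by overlap descending; break ties by name so the order is stable.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical feature tags, invented for illustration only.
mediacrawler = {"web-scraping", "browser-automation", "social-media", "python"}
candidates = {
    "example/scraper-a": {"web-scraping", "browser-automation", "python"},
    "example/terminal-b": {"python"},
    "example/social-c": {"social-media", "web-scraping"},
}

ranking = rank_by_shared_features(mediacrawler, candidates)
for name, shared in ranking:
    print(f"{name}: {shared} shared")
```

A raw overlap count favors projects that carry many tags, which may be why loosely related tools can rank highly; dividing by the size of the union (Jaccard similarity) would be one way to correct for that.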

apify/crawlee
24,002 stars
Crawlee is a web scraping framework designed for building scalable, reliable, and distributed data extraction pipelines. It provides a unified interface for managing headless browser automation and lightweight HTTP requests, allowing developers to handle complex web navigation, dynamic content rendering, and large-scale data collection within a single, modular architecture. The project distinguishes itself through its resource-aware concurrency controller, which dynamically scales task execution based on real-time CPU and memory usage to prevent host machine exhaustion. It also features a rob…
TypeScript · apify, automation, crawler
kovidgoyal/kitty
33,462 stars
Kitty is a high-performance, GPU-accelerated terminal emulator designed to provide a consistent and extensible workspace across different operating systems. It leverages graphics hardware to render text, images, and complex layouts with low latency, while providing a robust environment for demanding command-line workflows. The project distinguishes itself through its integrated workspace management and programmable interface. It functions as a tiling window manager that organizes terminal windows, tabs, and layouts into persistent, keyboard-driven sessions. Users can automate complex workflow…
Python · c, go, golang
ultrafunkamsterdam/nodriver
3,578 stars
nodriver is an asynchronous Chromium browser automation framework that provides headless control and web scraping capabilities. It functions as a Chrome DevTools Protocol client, allowing for granular engine control by attaching directly to the browser's debug port without the need for external driver binaries. The framework is specifically designed as an anti-bot detection bypass tool. It modifies browser fingerprints and protocol headers to evade automated security systems, handle security warnings, and bypass common obstacles like insecure connection alerts. The system covers a broad rang…
Python

All 30 open-source alternatives to MediaCrawler

apify/crawlee

kovidgoyal/kitty

ultrafunkamsterdam/nodriver

SawyerHood/dev-browser

apify/crawlee-python

twintproject/twint

AutomaApp/automa

aria2/aria2

cli/cli

go-rod/rod

segmentio/nightmare

omkarcloud/botasaurus

autoscrape-labs/pydoll

browser-use/web-ui

qeeqbox/social-analyzer

lorien/web-scraping

instaloader/instaloader

lavague-ai/LaVague

Panniantong/Agent-Reach

Datalux/Osintgram

streamlink/streamlink

Evil0ctal/Douyin_TikTok_Download_API

g1879/DrissionPage

camel-ai/camel

subzeroid/instagrapi

free-nodes/clashfree

cloudflare/workerd

WECENG/ticket-purchase

DIYgod/RSSHub

garrytan/gstack