What are the best open-source alternatives to TorBot?

30 open-source projects similar to dedsecinside/torbot, ranked by shared features. Top picks: remitchell/python-scraping, rchipka/node-osmosis, s-rah/onionscan, six2dez/reconftw, megadose/onionsearch, awesome-selfhosted/awesome-selfhosted, aosabook/500lines, dovecoteescapee/byedpiandroid, lorien/web-scraping, bda-research/node-crawler.

Is remitchell/python-scraping a good alternative to TorBot?

This project is a Python web scraping library and automated data collection suite. It provides tools for extracting structured data from websites, implementing web crawlers to navigate site links, and parsing HTML DOM structures to isolate specific elements and attributes. The toolkit includes a p…

Is rchipka/node-osmosis a good alternative to TorBot?

This project is a Node.js web scraping framework designed to automate data extraction through a programmatic workflow of requests, parsing, and document interaction. It functions as a headless web crawler, an HTTP request manager, and a DOM parser and extractor. The framework distinguishes itself…

Is s-rah/onionscan a good alternative to TorBot?

OnionScan is a free and open source tool for investigating the Dark Web.

Is six2dez/reconftw a good alternative to TorBot?

reconftw is an attack surface management framework and reconnaissance workflow orchestrator designed to automate the discovery, mapping, and monitoring of external digital assets. It operates as a modular tool-chain pipeline that coordinates a sequence of security tools to perform intelligence gath…

Is megadose/onionsearch a good alternative to TorBot?

OnionSearch is a script that scrapes urls on different .onion search engines.

Is awesome-selfhosted/awesome-selfhosted a good alternative to TorBot?

This project is a community-curated directory of open-source software designed for deployment in private server environments and home labs. It serves as a comprehensive resource for discovering independent, self-hosted alternatives to mainstream cloud services, enabling users to maintain full data…

Is aosabook/500lines a good alternative to TorBot?

This project is a software engineering educational resource providing a collection of canonical system implementations. It serves as a library of computer science case studies and polyglot code examples designed to demonstrate architectural tradeoffs and design patterns through concise versions of…

Is dovecoteescapee/byedpiandroid a good alternative to TorBot?

ByeDPIAndroid is a deep packet inspection bypass tool for Android that functions as a local SOCKS5 proxy. It modifies TCP packets to evade network censorship and bypass regional internet restrictions on mobile devices. The project operates as a network traffic obfuscator and TCP packet fragmenter.…

Is lorien/web-scraping a good alternative to TorBot?

This project is a comprehensive resource directory for web data extraction, providing a curated collection of tools and libraries for parsing data, automating browsers, and managing network operations. It serves as a guide for extracting structured information from HTML, XML, JSON, and PDF formats.…

Is bda-research/node-crawler a good alternative to TorBot?

node-crawler is a programmable web crawler for Node.js that manages request queues and automates data extraction. It functions as a rate-limited HTTP client and a headless HTML parser, providing the infrastructure to visit large sets of URLs asynchronously while preventing duplicate processing thro…

Back to dedsecinside/torbot

Open-source alternatives to TorBot

30 open-source projects similar to dedsecinside/torbot, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best TorBot alternative.

remitchell/python-scraping
REMitchell/python-scraping
4,714View on GitHub
This project is a Python web scraping library and automated data collection suite. It provides tools for extracting structured data from websites, implementing web crawlers to navigate site links, and parsing HTML DOM structures to isolate specific elements and attributes. The toolkit includes a pipeline for processing unstructured text and cleaning raw web content to extract meaningful information. It also features capabilities for image data extraction and the integration of external APIs to retrieve structured data from remote endpoints. The system covers broad capability areas including
Jupyter Notebook
View on GitHub4,714
rchipka/node-osmosis
rchipka/node-osmosis
4,110View on GitHub
This project is a Node.js web scraping framework designed to automate data extraction through a programmatic workflow of requests, parsing, and document interaction. It functions as a headless web crawler, an HTTP request manager, and a DOM parser and extractor. The framework distinguishes itself by combining a JavaScript execution engine to interact with dynamic content and a hybrid selection system that utilizes both CSS and XPath selectors. It includes specialized middleware for proxy rotation and cookie-jar session management to maintain authenticated states and manage automated traffic.
JavaScript
View on GitHub4,110
s-rah/onionscan
s-rah/onionscan
3,251View on GitHub
OnionScan is a free and open source tool for investigating the Dark Web.
Go
View on GitHub3,251
six2dez/reconftw
six2dez/reconftw
7,226View on GitHub
reconftw is an attack surface management framework and reconnaissance workflow orchestrator designed to automate the discovery, mapping, and monitoring of external digital assets. It operates as a modular tool-chain pipeline that coordinates a sequence of security tools to perform intelligence gathering and vulnerability scanning. The project distinguishes itself through a cloud-native deployment model that parallelizes scanning workloads across a fleet of remote VPS instances to bypass local resource constraints. It utilizes container-based environment isolation to ensure consistent executio
Shellbug-bountybugbountybugbounty-tool
View on GitHub7,226

Open-source alternatives to TorBot

REMitchell/python-scraping

rchipka/node-osmosis

s-rah/onionscan

six2dez/reconftw

megadose/OnionSearch

awesome-selfhosted/awesome-selfhosted

aosabook/500lines

dovecoteescapee/ByeDPIAndroid

lorien/web-scraping

bda-research/node-crawler

FriendsOfPHP/Goutte

crawlab-team/crawlab

lining0806/PythonSpiderNotes

code4craft/webmagic

zlzforever/DotnetSpider

binux/pyspider

asciimoo/colly

apify/crawlee

shadowsocks/shadowsocks-libev

firecrawl/firecrawl

s0md3v/Photon

projectdiscovery/katana

projectdiscovery/subfinder

erebe/wstunnel

fw876/helloworld

klzgrad/naiveproxy

any4ai/AnyCrawl

mendableai/firecrawl-mcp-server

andeya/pholcus

algolia/docsearch