Twint is an open-source intelligence (OSINT) and data extraction tool designed to gather public social media information. It runs as a command-line utility that retrieves posts, user profiles, and follower lists directly from the platform's public web interface, so it needs no official developer credentials or API keys.
The tool stands out for automated, large-scale collection that can be scripted entirely from the terminal. It supports granular filtering by keyword, geographic location, time range, and account status, so researchers can build targeted datasets for sentiment analysis or network relationship mapping. It also tracks collection state so that interrupted sessions can resume where they stopped, and it can translate text in real time as posts are collected.
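The resume behaviour described above can be illustrated with a small checkpointing sketch. This is not Twint's actual code: the function collect, the callback fetch_page, the JSON state file layout, and the fake page data are all hypothetical names chosen for illustration. The idea is simply to persist the pagination cursor after each page so that a later run can pick up from the last saved position.

```python
import json
import os
import tempfile
from typing import Callable, List, Optional, Tuple

# A page is (posts, next_cursor); next_cursor is None on the last page.
Page = Tuple[List[dict], Optional[str]]


def collect(
    fetch_page: Callable[[Optional[str]], Page],
    state_path: str,
    max_pages: Optional[int] = None,
) -> List[dict]:
    """Collect posts page by page, checkpointing the cursor to state_path.

    Hypothetical sketch: fetch_page stands in for whatever retrieves one
    page of results; max_pages simulates an interrupted session.
    """
    cursor: Optional[str] = None
    if os.path.exists(state_path):
        # A previous run was interrupted: continue from its saved cursor.
        with open(state_path, encoding="utf-8") as fh:
            cursor = json.load(fh).get("cursor")

    collected: List[dict] = []
    pages = 0
    while max_pages is None or pages < max_pages:
        posts, next_cursor = fetch_page(cursor)
        collected.extend(posts)
        pages += 1
        if next_cursor is None:
            # Finished: the checkpoint is no longer needed.
            if os.path.exists(state_path):
                os.remove(state_path)
            break
        cursor = next_cursor
        with open(state_path, "w", encoding="utf-8") as fh:
            json.dump({"cursor": cursor}, fh)
    return collected


# Fake data source used only for the demonstration (an assumption).
FAKE_PAGES = {
    None: ([{"id": 1}, {"id": 2}], "p2"),
    "p2": ([{"id": 3}], "p3"),
    "p3": ([{"id": 4}], None),
}


def fake_fetch(cursor: Optional[str]) -> Page:
    return FAKE_PAGES[cursor]


if __name__ == "__main__":
    state = os.path.join(tempfile.mkdtemp(), "state.json")
    first = collect(fake_fetch, state, max_pages=1)   # "interrupted" run
    second = collect(fake_fetch, state)               # resumed run
    print([p["id"] for p in first])   # [1, 2]
    print([p["id"] for p in second])  # [3, 4]
```

Saving the cursor only after a page has been fully processed means a crash can cause a page to be fetched again but never skipped, which is usually the safer trade-off for research datasets.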
Beyond raw extraction, the project provides a modular output system that streams data into local files, databases, or external search engines. This design facilitates integration with visualization tools for generating network graphs and interactive dashboards, supporting long-term research workflows and investigative analysis.
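One way to picture a modular output system like the one described above is a set of interchangeable sinks that share a single write method. The sketch below is an assumption-laden illustration, not the project's implementation: the Sink protocol, CsvSink, SqliteSink, the stream function, the three-column schema, and the sample posts are all hypothetical. A search-engine sink would follow the same shape but is omitted here because it would need a running server.

```python
import csv
import io
import sqlite3
from typing import Iterable, Protocol, TextIO

FIELDS = ["id", "username", "text"]  # hypothetical minimal schema


class Sink(Protocol):
    """Anything that can accept one post at a time."""

    def write(self, post: dict) -> None: ...


class CsvSink:
    """Writes posts as CSV rows to an open text stream."""

    def __init__(self, stream: TextIO) -> None:
        # extrasaction="ignore" drops fields outside the schema.
        self._writer = csv.DictWriter(
            stream, fieldnames=FIELDS, extrasaction="ignore"
        )
        self._writer.writeheader()

    def write(self, post: dict) -> None:
        self._writer.writerow(post)


class SqliteSink:
    """Stores posts in a SQLite table, ignoring duplicate ids."""

    def __init__(self, conn: sqlite3.Connection) -> None:
        self._conn = conn
        conn.execute(
            "CREATE TABLE IF NOT EXISTS posts "
            "(id INTEGER PRIMARY KEY, username TEXT, text TEXT)"
        )

    def write(self, post: dict) -> None:
        self._conn.execute(
            "INSERT OR IGNORE INTO posts (id, username, text) VALUES (?, ?, ?)",
            (post["id"], post["username"], post["text"]),
        )
        self._conn.commit()


def stream(posts: Iterable[dict], sinks: Iterable[Sink]) -> int:
    """Send every post to every sink as it arrives; return the post count."""
    sinks = list(sinks)
    count = 0
    for post in posts:
        for sink in sinks:
            sink.write(post)
        count += 1
    return count


# Sample data invented for the demonstration.
SAMPLE = [
    {"id": 1, "username": "alice", "text": "hello"},
    {"id": 2, "username": "bob", "text": "world"},
]

if __name__ == "__main__":
    buffer = io.StringIO()
    db = sqlite3.connect(":memory:")
    stream(SAMPLE, [CsvSink(buffer), SqliteSink(db)])
    print(buffer.getvalue())
    print(db.execute("SELECT COUNT(*) FROM posts").fetchone()[0])
```

Because each sink only has to implement write, adding a new destination does not require touching the collection loop, which is the property that makes this kind of design easy to connect to visualization or dashboard tooling.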