# joeyism/linkedin_scraper

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/joeyism-linkedin-scraper).**

3,746 stars · 880 forks · Python · gpl-3.0

## Links

- GitHub: https://github.com/joeyism/linkedin_scraper
- awesome-repositories: https://awesome-repositories.com/repository/joeyism-linkedin-scraper.md

## Topics

`chrome` `company` `driver` `firefox` `linkedin` `linkedin-profile` `linkedin-scraper` `linkedin-url` `profile` `scraper` `scrapes-linkedin` `users`

## Description

This project is a LinkedIn data scraper and professional profile extractor designed to collect information from professional networking sites. It functions as a headless browser scraper that extracts professional profiles, company details, and job listings using automated browser sessions.

The tool includes a session manager that saves and loads authentication cookies to maintain persistent access to protected profiles. It employs configurable browser settings and user-agent mimicry to simulate human activity and bypass bot detection.

Data extraction capabilities cover person profiles, company overviews, social feed posts, and job listings filtered by keywords and location. The system also supports the retrieval of contact details, education, and work experience.

## Tags

### Development Tools & Productivity

- [Headless Browser Automation](https://awesome-repositories.com/f/development-tools-productivity/headless-browser-automation.md) — Uses a headless browser to simulate human interaction and render dynamic content for data extraction.
- [Profile Extractors](https://awesome-repositories.com/f/development-tools-productivity/professional-networking-groups/professional-networking-databases/profile-extractors.md) — Gathers work experience, education, and contact details from individual professional user pages.
- [Session Management](https://awesome-repositories.com/f/development-tools-productivity/web-scraping/session-management.md) — Manages authentication cookies and browser settings to access protected professional profiles.

### Web Development

- [Headless Browser Orchestrators](https://awesome-repositories.com/f/web-development/web-automation-scraping/browser-orchestration-systems/headless-browser-orchestrators.md) — Implements a headless browser system to simulate human interaction and extract dynamic professional profile data.
- [Browser Argument Configuration](https://awesome-repositories.com/f/web-development/browser-automation-engines/browser-argument-configuration.md) — Configures headless browser arguments, viewport size, and user agents to simulate human activity. ([source](https://github.com/joeyism/linkedin_scraper/blob/master/README.md))

### Part of an Awesome List

- [Job Market Data](https://awesome-repositories.com/f/awesome-lists/data/job-market-data.md) — Extracts job listings and requirements to facilitate analysis of hiring trends and industry demands.
- [Company Intelligence Utilities](https://awesome-repositories.com/f/awesome-lists/data/company-research-tools/company-intelligence-utilities.md) — Gathers business overviews and industry classifications to build professional company profiles. ([source](https://github.com/joeyism/linkedin_scraper/blob/master/README.md))
- [Organizational Data Extraction](https://awesome-repositories.com/f/awesome-lists/data/company-research-tools/company-intelligence-utilities/organizational-data-extraction.md) — Collects industry details, organizational size, and headquarters locations from company information pages. ([source](https://github.com/joeyism/linkedin_scraper#readme))

### Business & Productivity Software

- [Job Market Scraping](https://awesome-repositories.com/f/business-productivity-software/job-market-scraping.md) — Collects job requirements and application links using keyword and location filters. ([source](https://github.com/joeyism/linkedin_scraper#readme))
- [Professional Profile Scraping](https://awesome-repositories.com/f/business-productivity-software/professional-profile-scraping.md) — Gathers career data and company history from professional networking sites using automated browser tools. ([source](https://github.com/joeyism/linkedin_scraper/blob/master/requirements.txt))
- [Competitive Market Research](https://awesome-repositories.com/f/business-productivity-software/competitive-market-research.md) — Provides capabilities to extract company overviews and industry data for competitive analysis.
- [Professional Networking Automation](https://awesome-repositories.com/f/business-productivity-software/professional-networking-automation.md) — Automates the collection of posts and profile updates to monitor industry activity.

### Data & Databases

- [DOM-Based Extractions](https://awesome-repositories.com/f/data-databases/structured-data-extraction/dom-based-extractions.md) — Extracts professional data by querying the HTML structure of rendered web pages using JavaScript.
- [Contact Discovery](https://awesome-repositories.com/f/data-databases/contact-discovery.md) — Identifies and extracts contact information associated with professional user profiles. ([source](https://github.com/joeyism/linkedin_scraper/tree/master/samples))
- [Post & Comment Scraping](https://awesome-repositories.com/f/data-databases/data-scraping-tools/social-media-data-scraping/high-resolution-media-scraping/post-comment-scraping.md) — Fetches post content, timestamps, and engagement metrics from organization social feeds. ([source](https://github.com/joeyism/linkedin_scraper#readme))
- [Lead Enrichment](https://awesome-repositories.com/f/data-databases/lead-enrichment.md) — Extracts and augments lead profiles with contact information and career history from professional networking sites.
- [Profile Information Retrieval](https://awesome-repositories.com/f/data-databases/profile-information-retrieval.md) — Fetches specific personal and professional profile details using unique identifiers. ([source](https://github.com/joeyism/linkedin_scraper/tree/master/samples))

### Security & Cryptography

- [Session and Credential Management](https://awesome-repositories.com/f/security-cryptography/identity-access-management/authentication-strategies/session-and-credential-handling/session-credential-management.md) — Captures and stores user session tokens and credentials in files for persistence across multiple requests. ([source](https://github.com/joeyism/linkedin_scraper/blob/master/README.md))
- [Browser Session Authentication](https://awesome-repositories.com/f/security-cryptography/identity-access-management/authentication-strategies/session-and-credential-handling/session-credential-management/browser-session-authentication.md) — Utilizes browser cookies to authenticate automated requests to protected professional profiles. ([source](https://github.com/joeyism/linkedin_scraper#readme))
- [Session-Based Scrapers](https://awesome-repositories.com/f/security-cryptography/identity-access-management/authentication-strategies/session-and-credential-handling/session-credential-management/browser-session-authentication/session-based-scrapers.md) — Utilizes browser session cookies to scrape restricted profiles, company details, and job listings from LinkedIn.
- [Session-Cookie Persistences](https://awesome-repositories.com/f/security-cryptography/session-cookie-handlers/session-cookie-persistences.md) — Saves and reuses authentication cookies in local files to maintain persistent access across script runs.

### Software Engineering & Architecture

- [Web Session Management](https://awesome-repositories.com/f/software-engineering-architecture/user-state-management/session-coordination/web-session-management.md) — Coordinates the login process and maintains state to access restricted areas of the professional network. ([source](https://github.com/joeyism/linkedin_scraper/tree/master/samples))

### Networking & Communication

- [User Profile Retrieval](https://awesome-repositories.com/f/networking-communication/contact-management/user-profile-retrieval.md) — Retrieves detailed work experience, education, and skill sets from professional user profiles. ([source](https://github.com/joeyism/linkedin_scraper#readme))
- [User Agent Rotation](https://awesome-repositories.com/f/networking-communication/user-agent-rotation.md) — Cycles through user agent strings and viewport settings to mimic human behavior and avoid bot detection.

### User Interface & Experience

- [Search Filtering Logic](https://awesome-repositories.com/f/user-interface-experience/search-filtering-logic.md) — Narrows down job listings and professional profiles based on user-provided keywords and location criteria.
