A modern LinkedIn scraping library with CLI and MCP server
linkedin-spider
Effortless LinkedIn scraping with built-in anti-detection. Extract, export, and automate your LinkedIn data.
Features
- Search LinkedIn profiles with advanced filters (location, connection type, current company, position)
- Extract complete profile information (experience, education, skills, contact details)
- Get company details (industry, size, headquarters, specialties)
- Retrieve incoming and outgoing connection requests
- Send connection requests to profiles
- Get conversations list and detailed conversation history
- Built-in anti-detection and session management
Quick Start
Installation
Choose your preferred installation method:
Option 1: pip (Recommended for general use)
# For Python library only
pip install linkedin-spider
# For CLI usage (quotes keep shells such as zsh from expanding the brackets)
pip install "linkedin-spider[cli]"
# For MCP server usage
pip install "linkedin-spider[mcp]"
# For all features (CLI + MCP + library)
pip install "linkedin-spider[all]"
Option 2: Development setup with uv
# Clone the repo
git clone https://github.com/vertexcover-io/linkedin-spider
cd linkedin-spider
# Install with uv
uv sync
[!NOTE] Authentication Update: LinkedIn has strengthened its anti-bot mechanisms, which temporarily affects cookie-based authentication. We recommend the email/password authentication method for reliable access. We are actively working on restoring full cookie authentication support.
Different ways to use it
1. Python Library
Perfect for integration into your existing Python applications:
from linkedin_spider import LinkedinSpider, ScraperConfig
config = ScraperConfig(headless=True, page_load_timeout=30)
# Authenticate (use either email/password or cookie).
# Authentication usually happens only once; the session is saved in the Chrome profile
scraper = LinkedinSpider(
email="your_email@example.com",
password="your_password",
config=config
)
# Search for profiles
results = scraper.search_profiles("software engineer", max_results=10)
Output sample:
[
{
"name": "John Doe",
"title": "Senior Software Engineer at Google",
"location": "San Francisco, CA",
"profile_url": "https://linkedin.com/in/johndoe",
"connections": "500+"
},
{
"name": "Jane Smith",
"title": "Software Engineer at Microsoft",
"location": "Seattle, WA",
"profile_url": "https://linkedin.com/in/janesmith",
"connections": "200+"
}
]
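Because the search results are a plain list of dictionaries, they can be exported with the standard library alone. The following is a minimal sketch, assuming the field names shown in the sample output above; `results_to_csv` is a hypothetical helper and not part of linkedin-spider.

```python
import csv
import io

# Sample rows copied from the output above; real rows would come from search_profiles().
results = [
    {
        "name": "John Doe",
        "title": "Senior Software Engineer at Google",
        "location": "San Francisco, CA",
        "profile_url": "https://linkedin.com/in/johndoe",
        "connections": "500+",
    },
    {
        "name": "Jane Smith",
        "title": "Software Engineer at Microsoft",
        "location": "Seattle, WA",
        "profile_url": "https://linkedin.com/in/janesmith",
        "connections": "200+",
    },
]

FIELDS = ["name", "title", "location", "profile_url", "connections"]


def results_to_csv(rows, fields=FIELDS):
    """Serialize a list of result dicts to CSV text (hypothetical helper)."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, extrasaction="ignore")
    writer.writeheader()
    for row in rows:
        # Missing keys become empty cells instead of raising an error.
        writer.writerow({field: row.get(field, "") for field in fields})
    return buffer.getvalue()


csv_text = results_to_csv(results)
print(csv_text)
```

Values that contain commas, such as the location, are quoted automatically by the `csv` module.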
# Scrape individual profile
profile = scraper.scrape_profile("https://linkedin.com/in/someone")
Output sample:
{
"name": "John Doe",
"title": "Senior Software Engineer",
"location": "San Francisco, CA",
"about": "Passionate software engineer with 8+ years of experience...",
"experience": [
{
"title": "Senior Software Engineer",
"company": "Google",
"duration": "2021 - Present",
"description": "Leading backend development for search infrastructure..."
}
],
"education": [
{
"school": "Stanford University",
"degree": "BS Computer Science",
"years": "2013 - 2017"
}
],
"skills": ["Python", "Java", "Kubernetes", "AWS"]
}
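A scraped profile can be filtered with ordinary dictionary access. The sketch below assumes the structure of the sample output above (in particular that a current job's `duration` ends with "Present"); `current_positions` and `has_skills` are hypothetical helpers for illustration.

```python
# Trimmed copy of the sample profile above.
profile = {
    "name": "John Doe",
    "title": "Senior Software Engineer",
    "experience": [
        {
            "title": "Senior Software Engineer",
            "company": "Google",
            "duration": "2021 - Present",
        }
    ],
    "skills": ["Python", "Java", "Kubernetes", "AWS"],
}


def current_positions(profile):
    """Return experience entries whose duration ends with 'Present'."""
    return [
        job
        for job in profile.get("experience", [])
        if job.get("duration", "").strip().endswith("Present")
    ]


def has_skills(profile, required):
    """Case-insensitive check that every required skill is listed."""
    listed = {skill.lower() for skill in profile.get("skills", [])}
    return all(skill.lower() in listed for skill in required)


print(current_positions(profile))
print(has_skills(profile, ["python", "aws"]))
```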
# Scrape company information
company = scraper.scrape_company("https://linkedin.com/company/tech-corp")
Output sample:
{
"name": "TechCorp Inc",
"industry": "Software Development",
"company_size": "1,001-5,000 employees",
"headquarters": "San Francisco, CA",
"founded": "2010",
"specialties": ["Cloud Computing", "AI/ML", "Data Analytics"],
"description": "Leading technology company focused on enterprise solutions...",
"website": "https://techcorp.com",
"follower_count": "45,230"
}
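The company sample returns counts and sizes as formatted strings, so numeric comparisons need a conversion step. This is a rough sketch that assumes the string formats shown above ("45,230" and "1,001-5,000 employees"); `parse_count` and `parse_size_range` are hypothetical helpers, and other formats the site may produce are not handled.

```python
import re


def parse_count(text):
    """Convert a string such as '45,230' to an int."""
    return int(text.replace(",", "").strip())


def parse_size_range(text):
    """Extract (low, high) from '1,001-5,000 employees'; None if no range is present."""
    match = re.search(r"(\d[\d,]*)\s*-\s*(\d[\d,]*)", text)
    if match is None:
        return None
    return parse_count(match.group(1)), parse_count(match.group(2))


company = {"company_size": "1,001-5,000 employees", "follower_count": "45,230"}
followers = parse_count(company["follower_count"])
size_range = parse_size_range(company["company_size"])
print(followers, size_range)
```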
# Don't forget to clean up
scraper.close()
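Since the scraper exposes `close()`, the standard library's `contextlib.closing` can make sure cleanup runs even when a scrape raises. The sketch below uses a stand-in class, `FakeScraper` (hypothetical), so it runs without a browser; in real use a `LinkedinSpider` instance would take its place.

```python
from contextlib import closing


class FakeScraper:
    """Stand-in for LinkedinSpider so the sketch runs without a browser (hypothetical)."""

    def __init__(self):
        self.closed = False

    def scrape_profile(self, url):
        raise RuntimeError(f"simulated failure for {url}")

    def close(self):
        self.closed = True


scraper = FakeScraper()
error_message = None
try:
    # closing() calls scraper.close() on exit, even when the body raises.
    with closing(scraper) as active:
        active.scrape_profile("https://linkedin.com/in/someone")
except RuntimeError as exc:
    error_message = str(exc)

print(scraper.closed, error_message)
```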
For more examples, see the examples directory in the repository.
2. Command Line Interface
Great for quick data extraction and scripting:
# If installed via pip
linkedin-spider-cli search -q "product manager" -n 10 -o results.json
linkedin-spider-cli profile -u "https://linkedin.com/in/johndoe" -o profile.json
linkedin-spider-cli company -u "https://linkedin.com/company/openai" -o company.json
# If using development setup
uv run linkedin-spider-cli search -q "product manager" -n 10 -o results.json
uv run linkedin-spider-cli profile -u "https://linkedin.com/in/johndoe" -o profile.json
uv run linkedin-spider-cli company -u "https://linkedin.com/company/openai" -o company.json
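The files written with `-o` can be post-processed in Python. The sketch below assumes that `results.json` holds a JSON array shaped like the library's search output; that file layout is an assumption, not something confirmed here. `load_results` and `filter_by_location` are hypothetical helpers, and a temporary file stands in for the real CLI output.

```python
import json
import tempfile
from pathlib import Path


def load_results(path):
    """Load a JSON file and return its records as a list (hypothetical helper)."""
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    # Accept either a single object or a list of objects.
    return data if isinstance(data, list) else [data]


def filter_by_location(records, needle):
    """Keep records whose location contains the given text, ignoring case."""
    needle = needle.lower()
    return [r for r in records if needle in r.get("location", "").lower()]


# Demo: a temporary file stands in for results.json.
sample = [
    {"name": "John Doe", "location": "San Francisco, CA"},
    {"name": "Jane Smith", "location": "Seattle, WA"},
]
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "results.json"
    path.write_text(json.dumps(sample), encoding="utf-8")
    seattle = filter_by_location(load_results(path), "seattle")

print(seattle)
```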
3. MCP Server
Set up environment variables in a .env file:
# Authentication (choose one method)
LINKEDIN_EMAIL=your_email@example.com
LINKEDIN_PASSWORD=your_password
# OR
LINKEDIN_COOKIE=your_li_at_cookie_value
# Configuration
HEADLESS=true
# Transport (optional, defaults to stdio)
TRANSPORT=sse
HOST=127.0.0.1
PORT=8000
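A quick sanity check of the .env file before starting the server can catch missing credentials early. The following is only a sketch of such a check, not how the server itself reads its settings; `parse_env` and `check_settings` are hypothetical helpers, and the accepted transport names are taken from the `serve` commands in this README.

```python
def parse_env(text):
    """Parse KEY=VALUE lines, skipping blanks and # comments (hypothetical helper)."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def check_settings(values):
    """Return a list of problems found in the parsed settings."""
    problems = []
    has_login = bool(values.get("LINKEDIN_EMAIL")) and bool(values.get("LINKEDIN_PASSWORD"))
    has_cookie = bool(values.get("LINKEDIN_COOKIE"))
    if not (has_login or has_cookie):
        problems.append("set LINKEDIN_EMAIL and LINKEDIN_PASSWORD, or LINKEDIN_COOKIE")
    transport = values.get("TRANSPORT", "stdio")
    if transport not in {"stdio", "sse", "http"}:
        problems.append(f"unknown TRANSPORT: {transport}")
    port = values.get("PORT")
    if port is not None and not port.isdigit():
        problems.append(f"PORT must be a number: {port}")
    return problems


env_text = """
# Authentication
LINKEDIN_EMAIL=your_email@example.com
LINKEDIN_PASSWORD=your_password
HEADLESS=true
TRANSPORT=sse
PORT=8000
"""
settings = parse_env(env_text)
problems = check_settings(settings)
print(settings, problems)
```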
Start the MCP server:
# If installed via pip
# Show available transport options
linkedin-spider-mcp
# Start with specific transport
linkedin-spider-mcp serve sse
linkedin-spider-mcp serve http --host 0.0.0.0 --port 9000
linkedin-spider-mcp serve stdio
# Or use environment variables
TRANSPORT=sse linkedin-spider-mcp serve
# If using development setup
# Show available transport options
uv run linkedin-spider-mcp
# Start with specific transport
uv run linkedin-spider-mcp serve sse
uv run linkedin-spider-mcp serve http --host 0.0.0.0 --port 9000
uv run linkedin-spider-mcp serve stdio
# Or use environment variables
TRANSPORT=sse uv run linkedin-spider-mcp serve
Claude Code Integration
# Add to Claude Code
claude mcp add linkedin-spider --transport sse <server-url>
# Example server URL format: http://localhost:8000/sse (use the host and port your server listens on)
Claude Desktop Integration
Add to your Claude Desktop configuration file:
Windows: %APPDATA%\Claude\claude_desktop_config.json
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Linux: ~/.config/Claude/claude_desktop_config.json
Option 1: Docker (Recommended)
The Docker approach provides reliable, isolated execution with all dependencies included.
First, build the Docker image:
# Build the stdio server image
docker build -f Dockerfile.stdio -t linkedin-mcp-stdio .
Then add this to your Claude Desktop configuration:
{
"mcpServers": {
"linkedin-spider": {
"command": "docker",
"args": [
"run", "--rm", "-i",
"-e", "LINKEDIN_EMAIL=your_email@example.com",
"-e", "LINKEDIN_PASSWORD=your_password",
"-e", "HEADLESS=true",
"-e", "TRANSPORT=stdio",
"linkedin-mcp-stdio"
]
}
}
}
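If the configuration file already lists other MCP servers, the entry can be merged in programmatically rather than edited by hand. This is a sketch under the assumption that the file follows the `mcpServers` layout shown above; `docker_server_entry` and `merge_server` are hypothetical helpers, and reading or writing the actual file is left out.

```python
import json


def docker_server_entry(env, image="linkedin-mcp-stdio"):
    """Build an mcpServers entry that runs the image over stdio (hypothetical helper)."""
    args = ["run", "--rm", "-i"]
    for key, value in env.items():
        args += ["-e", f"{key}={value}"]
    args.append(image)
    return {"command": "docker", "args": args}


def merge_server(config, name, entry):
    """Return a copy of config with the server added, keeping other servers intact."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = entry
    merged["mcpServers"] = servers
    return merged


# An existing configuration with an unrelated, made-up server entry.
existing = {"mcpServers": {"other-tool": {"command": "other"}}}
entry = docker_server_entry(
    {
        "LINKEDIN_EMAIL": "your_email@example.com",
        "LINKEDIN_PASSWORD": "your_password",
        "HEADLESS": "true",
        "TRANSPORT": "stdio",
    }
)
updated = merge_server(existing, "linkedin-spider", entry)
print(json.dumps(updated, indent=2))
```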
Docker Development & Testing
For development and testing with Docker, you can use a single image with different transport configurations:
Build the Docker Image
# Build once for all transport types
docker build -t linkedin-mcp .
Run with Different Transports
SSE Server
docker run -p 8000:8000 -e TRANSPORT=sse --env-file .env linkedin-mcp
HTTP Server
docker run -p 8000:8000 -e TRANSPORT=http --env-file .env linkedin-mcp
STDIO Server
docker run --rm -i -e TRANSPORT=stdio --env-file .env linkedin-mcp
Authentication Methods
Method 1: LinkedIn Cookie
- Log in to LinkedIn in your browser
- Open Developer Tools (F12)
- Go to Application/Storage → Cookies → linkedin.com
- Copy the li_at cookie value
- Use it in your code:
scraper = LinkedinSpider(li_at_cookie="your_cookie_value")
Method 2: Email & Password (Recommended)
scraper = LinkedinSpider(
email="your_email@example.com",
password="your_password"
)
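To keep credentials out of source code, the constructor arguments can be picked from environment variables. The sketch below reuses the variable names from the MCP server section; having library code read them is an assumption made here for illustration, and `auth_kwargs` is a hypothetical helper. The keyword names (`email`, `password`, `li_at_cookie`) are the ones used in the examples above.

```python
import os


def auth_kwargs(environ=None):
    """Pick LinkedinSpider keyword arguments from environment-style settings.

    Prefers email/password (the recommended method), then the li_at cookie.
    """
    env = os.environ if environ is None else environ
    email, password = env.get("LINKEDIN_EMAIL"), env.get("LINKEDIN_PASSWORD")
    if email and password:
        return {"email": email, "password": password}
    cookie = env.get("LINKEDIN_COOKIE")
    if cookie:
        return {"li_at_cookie": cookie}
    raise ValueError("no LinkedIn credentials found")


# Real use would look like: LinkedinSpider(**auth_kwargs())
demo = auth_kwargs({"LINKEDIN_EMAIL": "your_email@example.com", "LINKEDIN_PASSWORD": "your_password"})
print(sorted(demo))
```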
Contributing
We welcome contributions! Please see CONTRIBUTING.md for guidelines.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Disclaimer
This tool is for personal use only. Please:
- Respect LinkedIn's Terms of Service
- Use reasonable rate limits
- Don't spam or harass users
- Be responsible with the data you collect
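One simple way to keep request rates reasonable is to pause for a random interval between consecutive scrapes. The following is a sketch of that idea; `paced` is a hypothetical helper, and the 2 to 5 second bounds are illustrative values rather than limits recommended by this project or by LinkedIn.

```python
import random
import time


def paced(items, min_delay=2.0, max_delay=5.0, sleep=time.sleep):
    """Yield items with a random pause between consecutive ones (hypothetical helper)."""
    for index, item in enumerate(items):
        if index > 0:
            sleep(random.uniform(min_delay, max_delay))
        yield item


# Demo with a recording "sleep" so nothing actually waits.
pauses = []
urls = [
    "https://linkedin.com/in/johndoe",
    "https://linkedin.com/in/janesmith",
]
visited = list(paced(urls, sleep=pauses.append))
print(visited, pauses)
```

In real use the loop body would call something like `scraper.scrape_profile(url)` for each yielded URL, with the default `time.sleep` left in place.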
Ready to extract LinkedIn data like a pro? Star this repo and start scraping!
File details
Details for the file linkedin_spider-0.1.2.tar.gz.
File metadata
- Download URL: linkedin_spider-0.1.2.tar.gz
- Upload date:
- Size: 137.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 8fe87f06a64dac1e7747c2a3df78eebd401aa0e8cb95f7fdd87fe5b39cfc31c3 |
| MD5 | 6bbabcb812befd39c27c6a823af8e868 |
| BLAKE2b-256 | fdb714bb0d752c405015d8fb9651cb506faca5ca22507fb2317b533d14db80fd |
File details
Details for the file linkedin_spider-0.1.2-py3-none-any.whl.
File metadata
- Download URL: linkedin_spider-0.1.2-py3-none-any.whl
- Upload date:
- Size: 51.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 31fef1d66178a5fe9dfe20ad570087599fbc41078b8f2cbc19995c8edb24a083 |
| MD5 | 210b3f9ecbcdc5ac4029cec79910bb24 |
| BLAKE2b-256 | 834346f240bef93bbd893167dd6a9e6e21de7a4bb8119548ea2264687caf99ba |