A powerful, asynchronous scraper for LinkedIn job postings without authentication.

LinkedIn Job Extractor (ljobx)

A fast, simple tool to scrape LinkedIn job postings without needing to log in. It can be used as a command-line script for automation or as an interactive web interface for easy scraping. It uses LinkedIn’s public APIs, supports flexible proxy configurations, and saves results to JSON or CSV.

The GitHub repository can be found at: tusharkr1918/ljobx

✨ Features

Dual Interface: Use it as a powerful command-line tool for scripting or as an interactive web UI.
Simplified UI Mode: Launch the web interface with a --basic flag for a cleaner, more streamlined experience that's perfect for less technical users.
No Login Needed: Scrapes public job postings anonymously.
Advanced Filtering: Filter by date, experience level, job type, and remote options.
Concurrent Scraping: Fetches multiple jobs at once with randomized delays.
Flexible Proxy Support: Load proxies from API providers (like Webshare) or directly from your own local text files.
Structured Output: Save results as clean, timestamped JSON or CSV files.
Latest Symlink: Automatically creates a _latest file pointing to the newest results.

📥 Installation

You can install the tool in two ways, depending on your needs.

Standard Install (CLI Only)

For just the command-line tool, a standard installation is all you need:

pip install ljobx

Full Install (CLI + Web Interface)

To use the interactive web UI, install the package with the [ui] extra. This will download the additional libraries needed, like Streamlit.

pip install 'ljobx[ui]'

(Note: The quotes keep shells such as zsh from treating the square brackets as a glob pattern.)

🚀 Usage

The tool can be run as a command-line script or as a web application.

Command-Line (CLI)

Provide a search query and a location. Use flags for more control.

# Basic search saving to CSV
ljobx "Software Engineer" "Remote" --to-csv

# Advanced search with multiple filters
ljobx "Senior Python Developer" "Noida, India" \
      --job-type "Full-time" \
      --date-posted "Past week" \
      --max-jobs 50 \
      --concurrency 2 \
      --delay 3 8
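
When the CLI is driven from another Python program, it can help to build the argument list programmatically instead of concatenating a shell string. The sketch below is only an illustration: `build_ljobx_command` and `run_ljobx` are hypothetical helper names, not part of ljobx, and they only use flags that appear in this README.

```python
import subprocess
from typing import List, Optional, Tuple


def build_ljobx_command(
    keywords: str,
    location: str,
    *,
    job_type: Optional[str] = None,
    date_posted: Optional[str] = None,
    max_jobs: Optional[int] = None,
    concurrency: Optional[int] = None,
    delay: Optional[Tuple[int, int]] = None,
    to_csv: bool = False,
) -> List[str]:
    """Build an argument list for the ljobx CLI (hypothetical helper)."""
    cmd = ["ljobx", keywords, location]
    if job_type is not None:
        cmd += ["--job-type", job_type]
    if date_posted is not None:
        cmd += ["--date-posted", date_posted]
    if max_jobs is not None:
        cmd += ["--max-jobs", str(max_jobs)]
    if concurrency is not None:
        cmd += ["--concurrency", str(concurrency)]
    if delay is not None:
        # --delay takes two values: MIN and MAX, in seconds.
        cmd += ["--delay", str(delay[0]), str(delay[1])]
    if to_csv:
        cmd.append("--to-csv")
    return cmd


def run_ljobx(cmd: List[str]) -> int:
    """Run the command without a shell and return its exit code."""
    return subprocess.run(cmd, check=False).returncode
```

Passing a list to `subprocess.run` avoids shell quoting problems with values such as "Noida, India". Calling `run_ljobx(build_ljobx_command("Software Engineer", "Remote", to_csv=True))` would be equivalent to the basic search shown above, assuming `ljobx` is on the PATH.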

Web Interface (UI)

If you performed a full installation, you can launch the Streamlit web interface.

# Launch the full-featured web UI
ljobx-ui

# Launch the simplified web UI with fewer options
ljobx-ui --basic

For finer control, override the basic UI's default concurrency and delay to match your proxy configuration.

# Launch the basic UI with higher concurrency for faster scraping
ljobx-ui --basic --concurrency 5 --delay 2 4

⚙️ CLI Options

Required Arguments:

keywords: The job title or skill to search for.
location: The geographical location (e.g., "Noida, India", "Remote").

Filtering Options:

--date-posted: Any time, Past month, Past week, Past 24 hours
--experience-level: Internship, Entry level, Associate, Mid-Senior level, etc.
--job-type: Full-time, Contract, Part-time, etc.
--remote: On-site, Remote, Hybrid

Scraper Settings:

--max-jobs: Max number of jobs to scrape (Default: 25).
--concurrency: Number of parallel requests (Default: 2).
--delay MIN MAX: Random delay range in seconds (Default: 3 8).
--to-csv: Save output as a CSV file instead of JSON.
--proxy-config FILE_OR_URL: Path or URL to a proxy YAML config.

💡 A Note on Performance: It's highly recommended to adjust --concurrency and --delay based on your proxy setup.

With many working proxies, you can be more aggressive for faster scraping (e.g., --concurrency 10 --delay 1 3).

With few or no proxies, you must be conservative to avoid getting blocked. It's safest to use the default values.
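
To get a feel for how these two settings trade off, a rough back-of-the-envelope estimate can help. The sketch below is not how ljobx schedules requests internally (that is not documented here); it assumes each parallel worker waits a uniformly random delay between MIN and MAX before every request and that the request time itself is negligible. `estimate_runtime_seconds` is a hypothetical helper name.

```python
import math


def estimate_runtime_seconds(
    max_jobs: int, concurrency: int, delay_min: float, delay_max: float
) -> float:
    """Rough expected run time under the simplifying assumptions above."""
    if concurrency < 1:
        raise ValueError("concurrency must be at least 1")
    if delay_min > delay_max:
        raise ValueError("delay_min must not exceed delay_max")
    # Each worker handles its share of jobs one after another.
    rounds = math.ceil(max_jobs / concurrency)
    # The mean of a uniform delay is the midpoint of its range.
    mean_delay = (delay_min + delay_max) / 2
    return rounds * mean_delay


# Defaults from this README: 25 jobs, concurrency 2, delay 3 to 8 seconds.
default_estimate = estimate_runtime_seconds(25, 2, 3, 8)
# The more aggressive settings suggested for large proxy pools.
aggressive_estimate = estimate_runtime_seconds(25, 10, 1, 3)
```

Under these assumptions the default settings come out at roughly 70 seconds for 25 jobs, and the aggressive settings at well under 10 seconds, which illustrates why the aggressive values are only advisable with many working proxies.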

🔌 Proxy Configuration

The tool offers a powerful and flexible proxy system configured via a YAML file.

Example config.yml:

# --- API Providers (Optional) ---
# Fetches proxies from a supported provider's API.
proxy_providers:
  - name: webshare
    config:
      api_key: "YOUR_WEBSHARE_API_KEY"
      page_size: 100 
      max_pages: 5

# --- Local Files (Optional) ---
# Loads proxies directly from your own text files.
proxies_files:
  - path: "/path/to/your/socks_proxies.txt"
    protocol: "socks5"
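
Before pointing the tool at a config file, a quick structural check can catch typos in key names. The sketch below works on an already parsed config (for example the dictionary a YAML parser returns) and only knows the keys shown in the example above. Which keys ljobx actually requires is an assumption here, and `check_proxy_config` is a hypothetical helper, not part of the package.

```python
from typing import Any, Dict, List


def check_proxy_config(cfg: Dict[str, Any]) -> List[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    problems: List[str] = []
    providers = cfg.get("proxy_providers") or []
    files = cfg.get("proxies_files") or []

    if not providers and not files:
        problems.append("no proxy_providers or proxies_files entries found")

    for i, provider in enumerate(providers):
        if not provider.get("name"):
            problems.append(f"proxy_providers[{i}]: missing 'name'")
        # Assumption: every provider needs an API key, as Webshare does above.
        if not (provider.get("config") or {}).get("api_key"):
            problems.append(f"proxy_providers[{i}]: missing 'config.api_key'")

    for i, entry in enumerate(files):
        if not entry.get("path"):
            problems.append(f"proxies_files[{i}]: missing 'path'")

    return problems


# The example config from above, written as the dictionary a parser would return.
example_config = {
    "proxy_providers": [
        {
            "name": "webshare",
            "config": {"api_key": "YOUR_WEBSHARE_API_KEY", "page_size": 100, "max_pages": 5},
        }
    ],
    "proxies_files": [
        {"path": "/path/to/your/socks_proxies.txt", "protocol": "socks5"}
    ],
}
example_problems = check_proxy_config(example_config)
```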

Server & Deployment Usage (Default Proxy Config)

When you run ljobx or ljobx-ui, it automatically looks for a file named proxy_config.yml in a default system location. This is the recommended method for servers and deployed applications.

Simply create your proxy_config.yml file in the correct directory, and the tool will use it automatically without needing the --proxy-config flag.

Linux/macOS: ~/.config/ljobx/proxy_config.yml
Windows: C:\Users\<YourUser>\AppData\Roaming\ljobx\proxy_config.yml

This method is ideal for services like Streamlit Community Cloud, as you can place your configuration file (using API providers, not local file paths) in the repository, and the deployed app will pick it up automatically.
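
A deployment script may want to know where the default file is expected before writing it. The sketch below reproduces the two locations listed above. It is an illustration, not ljobx's own lookup code; `default_proxy_config_path` is a hypothetical name, and falling back to `<home>\AppData\Roaming` when no APPDATA value is given is an assumption.

```python
from pathlib import PurePosixPath, PureWindowsPath
from typing import Optional


def default_proxy_config_path(
    platform: str, home: str, appdata: Optional[str] = None
) -> str:
    """Return the documented default location of proxy_config.yml.

    platform: a value in the style of sys.platform ("linux", "darwin", "win32").
    home:     the user's home directory.
    appdata:  the value of %APPDATA% on Windows, if known.
    """
    if platform.startswith("win"):
        if appdata:
            base = PureWindowsPath(appdata)
        else:
            base = PureWindowsPath(home) / "AppData" / "Roaming"
        return str(base / "ljobx" / "proxy_config.yml")
    # Linux and macOS share the same location according to this README.
    return str(PurePosixPath(home) / ".config" / "ljobx" / "proxy_config.yml")
```

On a real machine this could be called as `default_proxy_config_path(sys.platform, str(Path.home()), os.environ.get("APPDATA"))`, with `sys`, `os`, and `pathlib.Path` imported.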

Loading Proxies from Files (`proxies_files`)

This feature gives you full control over your proxy lists but is best for local use. The system follows these rules:

Default Protocol: If you specify a protocol (e.g., "socks5" or "https"), it will be automatically added to any proxy in that file that doesn't already have one.
No Default Protocol: If you only provide the path, the tool expects every line in the file to be a complete proxy URL (e.g., socks5://user:pass@ip:port).
Security Filter: The tool always ignores any proxy whose URL starts with the insecure http:// scheme.
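
The three rules above can be sketched as a small function that is handy for checking a proxy file before use. This is an approximation of the described behavior, not the tool's actual code: `normalize_proxies` is a hypothetical name, and skipping blank lines and lines without a scheme when no default protocol is given are assumptions.

```python
from typing import Iterable, List, Optional


def normalize_proxies(
    lines: Iterable[str], default_protocol: Optional[str] = None
) -> List[str]:
    """Apply the default-protocol and security-filter rules to raw proxy lines."""
    result: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # assumption: blank lines are skipped
        if "://" not in line:
            if default_protocol is None:
                # Without a default protocol every line must be a complete URL.
                continue
            line = f"{default_protocol}://{line}"
        if line.lower().startswith("http://"):
            continue  # security filter: plain http proxies are ignored
        result.append(line)
    return result


sample_lines = [
    "user:pass@10.0.0.1:1080",
    "socks5://10.0.0.2:1080",
    "http://10.0.0.3:8080",
    "",
    "https://10.0.0.4:443",
]
with_default = normalize_proxies(sample_lines, default_protocol="socks5")
without_default = normalize_proxies(sample_lines)
```

With `default_protocol="socks5"`, the first sample line gains a `socks5://` prefix and the `http://` entry is dropped; without a default, only the two complete non-http URLs remain.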

Command:

ljobx "Java Developer" "Delhi, India" --proxy-config "config.yml"

📂 Output & Data Fields

Results are saved as timestamped JSON or CSV files (e.g., keywords_YYYYMMDD_HHMMSS.json), with a _latest symlink for easy access.

The scraper extracts the following data for each job:

job_id
title
company
location
posted_date
applicants (if available)
salary_range (if available)
description
apply (URL and whether it's an "Easy Apply")
recruiter (Name, Title, and Profile URL, if available)
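
For downstream processing, a script can pick the newest timestamped JSON file and read the fields listed above. The sketch below makes two assumptions that this README does not confirm: that each JSON file holds a list of job objects, and that result files end in `_YYYYMMDD_HHMMSS.json`. The helper names `newest_results_file` and `summarize_jobs` are hypothetical.

```python
import json
import re
from pathlib import Path
from typing import Any, Dict, List, Optional

# Matches the documented timestamp suffix, e.g. keywords_20250926_120000.json
TIMESTAMP_RE = re.compile(r"_(\d{8}_\d{6})\.json$")


def newest_results_file(directory: Path) -> Optional[Path]:
    """Return the JSON results file with the latest timestamp in its name."""
    candidates = []
    for path in directory.glob("*.json"):
        match = TIMESTAMP_RE.search(path.name)
        if match:
            candidates.append((match.group(1), path))
    if not candidates:
        return None
    # YYYYMMDD_HHMMSS strings sort chronologically when compared as text.
    return max(candidates, key=lambda item: item[0])[1]


def summarize_jobs(jobs: List[Dict[str, Any]]) -> List[str]:
    """Build one short line per job from the title, company and location fields."""
    return [
        f"{job.get('title', '?')} at {job.get('company', '?')} ({job.get('location', '?')})"
        for job in jobs
    ]


def load_jobs(path: Path) -> List[Dict[str, Any]]:
    """Load a results file, assuming it contains a JSON list of job objects."""
    return json.loads(path.read_text(encoding="utf-8"))
```

Selecting by the timestamp in the file name avoids depending on the exact name of the `_latest` symlink, which is not spelled out here.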
