Skip to main content

A package to scrape and filter academic job listings

Project description

AcademicJobScraper

A Python package for scraping and filtering academic job listings from academicjobsonline.org.

Installation

pip install academicjobscraper

Usage

As a Python Module

from academicjobscraper import AcademicJobScraper

# Initialize the scraper with required keywords and optional file names
scraper = AcademicJobScraper(
    keywords=["machine learning", "deep learning", "AI"],  # Required
    links_file="job_links.csv",      # Optional (default: job_links.csv)
    data_file="jobs_data.json",      # Optional (default: jobs_data.json)
    results_file="relevant_jobs.csv"  # Optional (default: relevant_jobs.csv)
)

# Start scraping with a mother link
scraper.scrape("https://academicjobsonline.org/your-search-url")

Command Line Interface

# Basic usage with required parameters
academicjobscraper "https://academicjobsonline.org/your-search-url" "machine learning" "deep learning" "AI"

# With optional file name parameters
academicjobscraper "https://academicjobsonline.org/your-search-url" \
    "machine learning" "deep learning" "AI" \
    --links-file custom_links.csv \
    --data-file custom_data.json \
    --results-file custom_results.csv

Features

  • Scrapes job listings from academicjobsonline.org
  • Extracts detailed job information
  • Filters jobs based on provided keywords
  • Customizable output file names
  • Progress tracking during scraping

Output Files

The scraper generates three files:

  1. job_links.csv - Contains all scraped job URLs
  2. jobs_data.json - Contains detailed information for all jobs
  3. relevant_jobs.csv - Contains filtered jobs matching the keywords

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

academicjobscraper-0.1.0.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

academicjobscraper-0.1.0-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file academicjobscraper-0.1.0.tar.gz.

File metadata

  • Download URL: academicjobscraper-0.1.0.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for academicjobscraper-0.1.0.tar.gz
Algorithm Hash digest
SHA256 9b732ee2a7a5e7491ada730fef124e8c8107b3e9970691894bf90d23f21d2ee9
MD5 0ecd99d6573cb7daee613d7c2abd1f3d
BLAKE2b-256 df45caf0e08332018461031c163c8a45fec8f900948757f9d74d5722e5930267

See more details on using hashes here.

File details

Details for the file academicjobscraper-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for academicjobscraper-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ef8b5af53ec3bf5893686f7c8ef6ac07a3254b7503c3d30def6a6f6a0eaf9f5d
MD5 2b8fa331b4bce694452df6a878b69003
BLAKE2b-256 4a3a55d2c00aca464fe98197240b014956a80d1c34b2ea559a4801a2635db172

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page