A package to scrape and filter academic job listings
AcademicJobScraper
A Python package for scraping and filtering academic job listings from academicjobsonline.org.
Installation
pip install academicjobscraper
Usage
As a Python Module
from academicjobscraper import AcademicJobScraper
# Initialize the scraper with required keywords and optional file names
scraper = AcademicJobScraper(
    keywords=["machine learning", "deep learning", "AI"],  # Required
    links_file="job_links.csv",  # Optional (default: job_links.csv)
    data_file="jobs_data.json",  # Optional (default: jobs_data.json)
    results_file="relevant_jobs.csv",  # Optional (default: relevant_jobs.csv)
)
# Start scraping from the top-level search URL (the "mother link")
scraper.scrape("https://academicjobsonline.org/your-search-url")
Command Line Interface
# Basic usage with required parameters
academicjobscraper "https://academicjobsonline.org/your-search-url" "machine learning" "deep learning" "AI"
# With optional file name parameters
academicjobscraper "https://academicjobsonline.org/your-search-url" \
    "machine learning" "deep learning" "AI" \
    --links-file custom_links.csv \
    --data-file custom_data.json \
    --results-file custom_results.csv
Features
- Scrapes job listings from academicjobsonline.org
- Extracts detailed job information
- Filters jobs based on provided keywords
- Customizable output file names
- Progress tracking during scraping
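The package's actual matching rules are not described here, so the following is only a minimal sketch of what keyword filtering could look like, assuming a case-insensitive substring match. The function names `matches_keywords` and `filter_jobs` and the `title` and `description` fields are hypothetical and are not part of the package's documented API.

```python
from typing import Dict, Iterable, List


def matches_keywords(text: str, keywords: Iterable[str]) -> bool:
    """Return True if any keyword occurs in text (case-insensitive substring match)."""
    lowered = text.lower()
    return any(keyword.lower() in lowered for keyword in keywords)


def filter_jobs(jobs: List[Dict[str, str]], keywords: List[str]) -> List[Dict[str, str]]:
    """Keep jobs whose title or description mentions at least one keyword.

    The "title" and "description" keys are assumed for illustration only.
    """
    relevant = []
    for job in jobs:
        searchable = " ".join([job.get("title", ""), job.get("description", "")])
        if matches_keywords(searchable, keywords):
            relevant.append(job)
    return relevant
```

Under this assumption, a short keyword such as "AI" also matches inside longer words (for example "maintain"), so a word-boundary match may be preferable when precision matters.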
Output Files
The scraper generates three files:
- job_links.csv: contains all scraped job URLs
- jobs_data.json: contains detailed information for all jobs
- relevant_jobs.csv: contains filtered jobs matching the keywords
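Once a run has finished, the three files can be inspected with the standard library. The sketch below assumes that both CSV files start with a header row and that the JSON file holds a single top-level list or object; neither detail is confirmed by this README, and `summarize_outputs` is a hypothetical helper, not part of the package.

```python
import csv
import json
from pathlib import Path


def load_csv_rows(path):
    """Read a CSV file into a list of dicts (assumes the first row is a header)."""
    with Path(path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def load_json(path):
    """Read a JSON file and return its top-level value."""
    with Path(path).open(encoding="utf-8") as handle:
        return json.load(handle)


def summarize_outputs(
    links_file="job_links.csv",
    data_file="jobs_data.json",
    results_file="relevant_jobs.csv",
):
    """Count the entries in each output file (default names taken from the README)."""
    return {
        "links": len(load_csv_rows(links_file)),
        "jobs": len(load_json(data_file)),
        "relevant": len(load_csv_rows(results_file)),
    }
```

Calling `summarize_outputs()` in the directory where the scraper ran gives a quick check of how many scraped jobs passed the keyword filter.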
License
This project is licensed under the MIT License - see the LICENSE file for details.
Download files
File details
Details for the file academicjobscraper-0.1.0.tar.gz.
File metadata
- Download URL: academicjobscraper-0.1.0.tar.gz
- Upload date:
- Size: 4.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9b732ee2a7a5e7491ada730fef124e8c8107b3e9970691894bf90d23f21d2ee9 |
| MD5 | 0ecd99d6573cb7daee613d7c2abd1f3d |
| BLAKE2b-256 | df45caf0e08332018461031c163c8a45fec8f900948757f9d74d5722e5930267 |
File details
Details for the file academicjobscraper-0.1.0-py3-none-any.whl.
File metadata
- Download URL: academicjobscraper-0.1.0-py3-none-any.whl
- Upload date:
- Size: 5.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | ef8b5af53ec3bf5893686f7c8ef6ac07a3254b7503c3d30def6a6f6a0eaf9f5d |
| MD5 | 2b8fa331b4bce694452df6a878b69003 |
| BLAKE2b-256 | 4a3a55d2c00aca464fe98197240b014956a80d1c34b2ea559a4801a2635db172 |
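A downloaded file can be checked against the SHA256 digests listed in the file details using Python's `hashlib`. This is a minimal sketch: the helper names `sha256_of` and `verify_download` are illustrative, and the local path passed in is whatever location the file was saved to.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed for academicjobscraper-0.1.0.tar.gz in the file details
EXPECTED_SDIST_SHA256 = "9b732ee2a7a5e7491ada730fef124e8c8107b3e9970691894bf90d23f21d2ee9"


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `verify_download("academicjobscraper-0.1.0.tar.gz", EXPECTED_SDIST_SHA256)` should return True for an unmodified copy of the source distribution.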