Collect and standardize data on job posting platforms.
Project description
Jobtimize
Jobtimize
is a python package which collects, standardizes and completes information about job offers published on job search platforms.
The package is mainly based on scraping and text classification to fill in missing data.
Release | Usage | Development |
---|---|---|
What's new in the current version:
Dependencies
beautifulsoup4
jsonschema
lxml
pandas
Installation
Pypi
The safest way to install Jobtimize
is to go through pip
pip install Jobtimize
Conda
It is also possible to get the latest stable version with Anaconda Cloud
conda install -c lrakotoson jobtimize
Git
The installation with git allows to have the latest version. However it can have some bugs.
pip install git+https://github.com/HireCoffee/Jobtimize.git
How to use ?
As Jobtimize
is a package, in python you just have to import it.
The main function (for now) is Jobtimize.jobscrap
.
from jobtimize import scraper
df = jobscrap(["Data Scientist", "Data Analyst"],
["UK", "FR"]
)
df.head()
The df
object is a dataframe pandas, so it inherits all its methods.
Contributing 🤝
🎊 Firstly, thank you for giving your time to contribute to Jobtimize
. 🎊
If you have a new feature to submit, don't hesitate to open an issue (By checking "new feature" to make it easier to read) We can discuss it freely there.
Then you can make a "pull request" as explained in the contribution guidelines.
Same for all contributions, code improvement, documentation writing, translations... all ideas are welcome! Check out the guidelines to make it easier.
Jobtimize
gets better with contributions.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file Jobtimize-0.0.5a2.tar.gz
.
File metadata
- Download URL: Jobtimize-0.0.5a2.tar.gz
- Upload date:
- Size: 7.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4572b22eb4c1977cca22a67b3ee7b7e3ec0207fe9efa0d1c2bf7b7eff1bd96b9 |
|
MD5 | 343babe7c9f974b81500bf0de329373e |
|
BLAKE2b-256 | 2c7061a252687326ccb0258b87b16da762fd87d91157979b01d008acd1e13580 |
File details
Details for the file Jobtimize-0.0.5a2-py3-none-any.whl
.
File metadata
- Download URL: Jobtimize-0.0.5a2-py3-none-any.whl
- Upload date:
- Size: 10.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | dd78e20cdc65857efcd8029c69598e7d8124902bcf3cad7980d54113e1bf8908 |
|
MD5 | f3f4656f34a4942e04a1224121744234 |
|
BLAKE2b-256 | 453bd6109420f484edf872f6e1d830715e4f36ca6907babeba62cf00a3aecc97 |