A batch to analyse AI jobs in Vietnam
Project description
:poop: AIJobs collector :poop:
Batch app
This repo contains batch codes to collect data from many top job postings sites in Vietnam such as Indeed VN, VietnamWorks, TopCV, ...
We use Github Actions to collect the data automatically. Please note that, some websites in Vietnam have mechanisms to prevent scrappers like bots, therefore, we must keep retrying every 5 minutes.
Currently, the list of website we are collecting data from is as follows.
Website | URL | Batch from | Batch cron | Queries |
---|---|---|---|---|
TopCV | https://www.topcv.vn | 2023-08-19 | 59 12 * * * or manual |
ai engineer , computer vision , machine learning |
VietnamWorks | https://vietnamworks.com | 2023-08-19 | 59 12 * * * or manual |
ai engineer , computer vision , machine learning |
Indeed Vietnam | https://vn.indeed.com | 2023-08-19 | 59 12 * * * or manual |
ai engineer , computer vision , machine learning |
Online app
Besides the batch app which is setup in Github Actions to crawl data daily, we provide an online app to test the scenarios of data collected. We use MongoDB to store the data collections. To setup an environment for analysing data, see mongodb environment setup.
To run the online app:
$ python uninstall aijobs_batch
$ python setup.py install
$ aijobs_online --reload --workers 1 --host 0.0.0.0 --port 9000 --log_level info
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file AIJobs_Batch-1.0.0a1-py3.11-py3-any.whl
.
File metadata
- Download URL: AIJobs_Batch-1.0.0a1-py3.11-py3-any.whl
- Upload date:
- Size: 18.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | de1e716cc70244619741aa3d06cb5e33149ec240a7f7cd8989f9d8fffeb9e365 |
|
MD5 | e8fffd913f0ff898932a1900ede121c8 |
|
BLAKE2b-256 | a8363d01601048cba7d35370b43fc224a2916bd9df7e302046d85e15a9fc4525 |
File details
Details for the file AIJobs_Batch-1.0.0a1-py3.10-py3-any.whl
.
File metadata
- Download URL: AIJobs_Batch-1.0.0a1-py3.10-py3-any.whl
- Upload date:
- Size: 18.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 34a64432fe90272201a0c1396da901310dbfbce035ea38eeb25b2352fe4c4489 |
|
MD5 | 8c49ab4a9cf8738ef78b9139fdd98a50 |
|
BLAKE2b-256 | d4336ef6c699a4b377f9797c74b6877d3ffdfec39380c41b415e4f088d6ed83e |
File details
Details for the file AIJobs_Batch-1.0.0a1-py3.9-py3-any.whl
.
File metadata
- Download URL: AIJobs_Batch-1.0.0a1-py3.9-py3-any.whl
- Upload date:
- Size: 18.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1fbd638df6083740fc10324f10d1b8157f40f5d1b856b9afd2f259aab7d0bcfb |
|
MD5 | ad0622edf6b8331f2beeafc03ac17acd |
|
BLAKE2b-256 | 3c7a492a37fa03100c5e3a837bc18fb1a4e48e5c2b65dcf9a7f7249ccbe13a27 |
File details
Details for the file AIJobs_Batch-1.0.0a1-py3.8-py3-any.whl
.
File metadata
- Download URL: AIJobs_Batch-1.0.0a1-py3.8-py3-any.whl
- Upload date:
- Size: 18.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7e68ca436ab0ec7cd42d248b5e0a02477081b3124e8f742063e8f46b8a734da |
|
MD5 | 58f88a4b3b58ef93e7543aff12960496 |
|
BLAKE2b-256 | 2394ec6d1afbb69d9749e11dda0213f126b01efb597e012b310eab845d26ee03 |
File details
Details for the file AIJobs_Batch-1.0.0a1-py3.7-py3-any.whl
.
File metadata
- Download URL: AIJobs_Batch-1.0.0a1-py3.7-py3-any.whl
- Upload date:
- Size: 18.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7f3b1311390887325f2955d1ca4ad67da92ddd84b8ce423d5f89e61452559928 |
|
MD5 | cf44640a50dcf4cb7117a065bca06a19 |
|
BLAKE2b-256 | c3c44003b6842039c4e372d010331af9313058c924aa7bf82918ede7803a6863 |