A batch to analyse AI jobs in Vietnam

Project description

:poop: AIJobs collector :poop:

Batch app

This repo contains batch code to collect data from many of the top job-posting sites in Vietnam, such as Indeed VN, VietnamWorks, TopCV, ...

We use GitHub Actions to collect the data automatically. Note that some websites in Vietnam have mechanisms to block scrapers and other bots, so we must keep retrying every 5 minutes.
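The retry behaviour described above could be sketched roughly as follows. This is only an illustration, not the project's actual code: `fetch_with_retry`, `max_attempts`, and the assumption that a blocked request raises an exception are all hypothetical; only the 5-minute interval comes from the text.

```python
import time
from typing import Callable, Optional

# "every 5 minutes", taken from the description above
RETRY_INTERVAL_SECONDS = 5 * 60


def fetch_with_retry(
    fetch: Callable[[], str],
    max_attempts: int = 3,  # hypothetical cap, not stated in the README
    interval: float = RETRY_INTERVAL_SECONDS,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call fetch() until it succeeds or max_attempts is reached.

    Returns the fetched content, or None if every attempt failed.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fetch()
        except Exception as exc:  # assumption: a blocked request raises
            print(f"attempt {attempt} failed: {exc}")
            if attempt < max_attempts:
                sleep(interval)  # wait before the next try
    return None
```

Passing `sleep` as a parameter keeps the sketch easy to exercise without actually waiting five minutes between attempts.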

Currently, the list of websites we are collecting data from is as follows.

| Website | URL | Batch from | Batch cron | Queries |
|---|---|---|---|---|
| TopCV | https://www.topcv.vn | 2023-08-19 | 59 12 * * * or manual | ai engineer, computer vision, machine learning |
| VietnamWorks | https://vietnamworks.com | 2023-08-19 | 59 12 * * * or manual | ai engineer, computer vision, machine learning |
| Indeed Vietnam | https://vn.indeed.com | 2023-08-19 | 59 12 * * * or manual | ai engineer, computer vision, machine learning |
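As a reading aid for the cron column: GitHub Actions evaluates schedule expressions in UTC, so `59 12 * * *` would fire at 12:59 UTC each day. The sketch below converts that to Vietnam local time. It assumes the cron value listed above is used unchanged as the workflow schedule; the helper names are hypothetical and only simple daily expressions are handled.

```python
from datetime import date, datetime, time, timedelta, timezone

CRON = "59 12 * * *"  # value from the table above
VIETNAM = timezone(timedelta(hours=7))  # Vietnam is UTC+7 with no DST


def daily_run_time_utc(cron: str) -> time:
    """Extract the UTC time of day from a 'minute hour * * *' expression."""
    minute, hour, day, month, weekday = cron.split()
    if (day, month, weekday) != ("*", "*", "*"):
        raise ValueError("only simple daily expressions are handled here")
    return time(int(hour), int(minute), tzinfo=timezone.utc)


def run_in_vietnam(cron: str, on: date) -> datetime:
    """Return the scheduled run on the given UTC date, in Vietnam time."""
    utc_run = datetime.combine(on, daily_run_time_utc(cron))
    return utc_run.astimezone(VIETNAM)


if __name__ == "__main__":
    print(run_in_vietnam(CRON, date(2023, 8, 19)).isoformat())
```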

Online app

Besides the batch app, which is set up in GitHub Actions to crawl data daily, we provide an online app for testing scenarios on the collected data. We use MongoDB to store the data collections. To set up an environment for analysing data, see MongoDB environment setup.

To run the online app:

$ pip uninstall aijobs_batch
$ python setup.py install
$ aijobs_online --reload --workers 1 --host 0.0.0.0 --port 9000 --log_level info


Download files

Source Distributions

No source distribution files available for this release.

Built Distributions

AIJobs_Batch-1.0.0a1-py3.11-py3-any.whl (18.1 kB)

AIJobs_Batch-1.0.0a1-py3.10-py3-any.whl (18.1 kB)

AIJobs_Batch-1.0.0a1-py3.9-py3-any.whl (18.1 kB)

AIJobs_Batch-1.0.0a1-py3.8-py3-any.whl (18.1 kB)

AIJobs_Batch-1.0.0a1-py3.7-py3-any.whl (18.1 kB)

File details

Details for the file AIJobs_Batch-1.0.0a1-py3.11-py3-any.whl.

File hashes

Hashes for AIJobs_Batch-1.0.0a1-py3.11-py3-any.whl

| Algorithm | Hash digest |
|---|---|
| SHA256 | de1e716cc70244619741aa3d06cb5e33149ec240a7f7cd8989f9d8fffeb9e365 |
| MD5 | e8fffd913f0ff898932a1900ede121c8 |
| BLAKE2b-256 | a8363d01601048cba7d35370b43fc224a2916bd9df7e302046d85e15a9fc4525 |
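A downloaded wheel can be checked against the published SHA256 digest with the standard library alone. The following is a minimal sketch; the function names and the local file path in the usage comment are hypothetical, while the digest shown there is the one listed above for the py3.11 wheel.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path: Path, expected_hex: str) -> bool:
    """Compare a file's SHA256 with a published digest, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()


# Usage (assumes the wheel was downloaded to the current directory):
# matches_published(
#     Path("AIJobs_Batch-1.0.0a1-py3.11-py3-any.whl"),
#     "de1e716cc70244619741aa3d06cb5e33149ec240a7f7cd8989f9d8fffeb9e365",
# )
```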

File details

Details for the file AIJobs_Batch-1.0.0a1-py3.10-py3-any.whl.

File hashes

Hashes for AIJobs_Batch-1.0.0a1-py3.10-py3-any.whl

| Algorithm | Hash digest |
|---|---|
| SHA256 | 34a64432fe90272201a0c1396da901310dbfbce035ea38eeb25b2352fe4c4489 |
| MD5 | 8c49ab4a9cf8738ef78b9139fdd98a50 |
| BLAKE2b-256 | d4336ef6c699a4b377f9797c74b6877d3ffdfec39380c41b415e4f088d6ed83e |

File details

Details for the file AIJobs_Batch-1.0.0a1-py3.9-py3-any.whl.

File hashes

Hashes for AIJobs_Batch-1.0.0a1-py3.9-py3-any.whl

| Algorithm | Hash digest |
|---|---|
| SHA256 | 1fbd638df6083740fc10324f10d1b8157f40f5d1b856b9afd2f259aab7d0bcfb |
| MD5 | ad0622edf6b8331f2beeafc03ac17acd |
| BLAKE2b-256 | 3c7a492a37fa03100c5e3a837bc18fb1a4e48e5c2b65dcf9a7f7249ccbe13a27 |

File details

Details for the file AIJobs_Batch-1.0.0a1-py3.8-py3-any.whl.

File hashes

Hashes for AIJobs_Batch-1.0.0a1-py3.8-py3-any.whl

| Algorithm | Hash digest |
|---|---|
| SHA256 | a7e68ca436ab0ec7cd42d248b5e0a02477081b3124e8f742063e8f46b8a734da |
| MD5 | 58f88a4b3b58ef93e7543aff12960496 |
| BLAKE2b-256 | 2394ec6d1afbb69d9749e11dda0213f126b01efb597e012b310eab845d26ee03 |

File details

Details for the file AIJobs_Batch-1.0.0a1-py3.7-py3-any.whl.

File hashes

Hashes for AIJobs_Batch-1.0.0a1-py3.7-py3-any.whl

| Algorithm | Hash digest |
|---|---|
| SHA256 | 7f3b1311390887325f2955d1ca4ad67da92ddd84b8ce423d5f89e61452559928 |
| MD5 | cf44640a50dcf4cb7117a065bca06a19 |
| BLAKE2b-256 | c3c44003b6842039c4e372d010331af9313058c924aa7bf82918ede7803a6863 |
