Skip to main content

rmotr.com Group Project | Jobs Detector

Project description

# [pyp-w3] Jobs Detector

Today we will develop a command line tool which aims to parse certain websites looking for job statistics based on given keywords. In this very first version of the tool, we will only implement a parser for the HackerNews blog, which includes a monthly report of "Who is hiring?". Example: https://news.ycombinator.com/item?id=11814828

## Command usage

The command line tool must be accessible by calling `jobs_detector` command. A `hacker_news` subcommand must be also available as part of this implementation.

To see the whole list of optional and mandatory parameters, you can execute the command using the `--help` flag.

```bash
$ jobs_detector hacker_news --help
Options:
-i, --post-id TEXT [required]
-k, --keywords TEXT
-c, --combinations TEXT
--help Show this message and exit.
```

### Default arguments

To request jobs statistics using a default set of keywords, just call the `hacker_news` subcommand providing a valid HN post id (see the last part of the sample URL above), like this:

```bash
$ jobs_detector hacker_news -i 11814828
Total job posts: 888

Keywords:
Remote: 174 (19%)
Postgres: 81 (9%)
Python: 144 (16%)
Javascript: 118 (13%)
React: 133 (14%)
Pandas: 5 (0%)
```

### Keywords filtering

For statistics about a sub set of the default keywords, or even custom keywords out of the default set you can specify the `-k` or `--keywords` options, as a comma separated list of values.

```bash
$ jobs_detector hacker_news -i 11814828 -k python,django,ruby
Total job posts: 889

Keywords:
Python: 144 (16%)
Ruby: 80 (8%)
Django: 36 (4%)
```

### Combination stats

It's also possible to request statistics of certain combination of keywords. For example, how many offers are asking for "remote", "python", and "flask" at the same time?. To do that, use the `-c` or `-combinations` option.

```bash
$ jobs_detector hacker_news -i 11814828 -c remote-python-flask,remote-django
Total job posts: 888

Keywords:
Remote: 174 (19%)
Postgres: 81 (9%)
Python: 144 (16%)
Javascript: 118 (13%)
React: 133 (14%)
Pandas: 5 (0%)

Combinations:
Remote-Python-Flask: 2 (0%)
Remote-Django: 6 (0%)
```

Feel free to extend the functionality of this command by adding extra parameters or even more subcommands to parse different websites.

## Your command available in pypi

Finally, to wrap up this group work, you must make your command tool available in pypi (Python Package Index). Any person out there must be able to use the `pip` command and install a local version of your project. To do this, we will follow some naming conventions so we don't have conflicts between each other. This is the naming convention you must follow for your package: `rmotr-bX-cY-gZ-jobs-detector`, where `X`, `Y` and `Z` are the batch number, course number and group number respectively.

Anyone should be able to install the package by executing, for example: `pip install rmotr-b6-c1-g3-jobs-detector`.

Here you have a very detailed guide about how to upload things to pypi: https://hynek.me/articles/sharing-your-labor-of-love-pypi-quick-and-dirty/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rmotr-b9-c1-g5-jobs-detector-0.0.3.tar.gz (389.1 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file rmotr-b9-c1-g5-jobs-detector-0.0.3.tar.gz.

File metadata

File hashes

Hashes for rmotr-b9-c1-g5-jobs-detector-0.0.3.tar.gz
Algorithm Hash digest
SHA256 55649202306af580244b63b1925d7e08a476c59daaef6eaeda8c52a8aa463b8f
MD5 f60c66aca52a55f1c8012c0f8f7195af
BLAKE2b-256 2759b7fe35a369bbed961b367b2c03c90f39a1537627d158998b79fd7e1591e5

See more details on using hashes here.

File details

Details for the file rmotr_b9_c1_g5_jobs_detector-0.0.3-py2-none-any.whl.

File metadata

File hashes

Hashes for rmotr_b9_c1_g5_jobs_detector-0.0.3-py2-none-any.whl
Algorithm Hash digest
SHA256 1ce8edd2784d965119cff13e7cbad7dfb525dfa7341b5621ef4640a1820f9da2
MD5 aede58e2f1728f021beab28a9732af62
BLAKE2b-256 284fac49c864dcdca256020393b9f4f008dc881038e8b4f8cc07a70f943e012e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page