Fast extraction of job titles from strings
find_job_titles
Find Job Titles in Strings
Free software: MIT license
Python versions: 2.7, 3.4+
Features
Find any of 77k job titles in a given string
Text processing is extremely fast thanks to the “acora” library
Dictionary generation takes about 20 seconds upfront
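The acora library is based on the Aho-Corasick algorithm, which explains the trade-off in the list above: the automaton is built once from the full dictionary, and each string is then scanned in a single pass. The following is a minimal pure-Python sketch of that idea, not the code used by acora or find_job_titles; the names build_automaton and find_all are hypothetical and exist only for this illustration.

```python
from collections import deque


def build_automaton(keywords):
    """Build a trie with failure links (the one-off, up-front step)."""
    goto = [{}]      # goto[state][char] -> next state
    fail = [0]       # fail[state] -> state to fall back to on a mismatch
    output = [[]]    # output[state] -> keywords that end at this state
    for word in keywords:
        state = 0
        for ch in word:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                output.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        output[state].append(word)

    # Breadth-first pass: compute failure links level by level.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            fallback = fail[state]
            while fallback and ch not in goto[fallback]:
                fallback = fail[fallback]
            fail[nxt] = goto[fallback].get(ch, 0)
            # Inherit keywords that end at the fallback state (shorter suffixes).
            output[nxt] = output[nxt] + output[fail[nxt]]
    return goto, fail, output


def find_all(text, automaton):
    """Scan the text once and report every (keyword, start_position) pair."""
    goto, fail, output = automaton
    state = 0
    matches = []
    for end, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for word in output[state]:
            matches.append((word, end - len(word) + 1))
    return matches


# Tiny illustrative dictionary; the real package ships its own title list.
automaton = build_automaton(['Senior Vice President', 'Vice President', 'President'])
result = find_all('I am the Senior Vice President', automaton)
```

Building the automaton costs time proportional to the total length of all keywords, which is why a large dictionary takes a while to prepare. The scan itself then makes a single left-to-right pass over the input, however many titles the dictionary holds.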
Quickstart
Instantiate “Finder” and start extracting job titles:
>>> from find_job_titles import Finder
>>> finder = Finder()
>>> finder.findall('I am the Senior Vice President')
[('Senior Vice President', 9), ('Vice President', 16), ('President', 21)]
All matches are returned, including overlapping ones. Each match carries the position in the string at which it was found.
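Judging by the Quickstart output, each position appears to be the 0-based offset of the title's first character. The short sketch below checks that reading against the example result and shows one possible way to keep only the longest of several overlapping matches. The result list is copied from the output above rather than produced by the library, so the snippet runs without the package installed.

```python
text = 'I am the Senior Vice President'

# Result copied from the Quickstart example, not computed here.
matches = [('Senior Vice President', 9), ('Vice President', 16), ('President', 21)]

# Slicing the original string at each reported position recovers the title.
slices = [text[start:start + len(title)] for title, start in matches]

# One simple policy for overlapping results: keep the longest title.
longest = max(matches, key=lambda match: len(match[0]))
```

Here the three matches all end at the same character, so taking the longest one yields the most specific title.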
Alternatively use “finditer” for lazy consumption of matches:
>>> finder.finditer('I am the Senior Vice President')
<generator object ...>
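A generator produces matches only as they are requested, which is useful when just the first few results are needed. The sketch below illustrates that consumption pattern with a hypothetical stand-in generator, iter_title_matches, so that it runs without the package; it is not the library's implementation, and the order in which the real finditer yields matches may differ.

```python
from itertools import islice


def iter_title_matches(text, titles):
    """Hypothetical stand-in: lazily yield (title, position) pairs."""
    for title in titles:
        pos = text.find(title)
        while pos != -1:
            yield (title, pos)
            pos = text.find(title, pos + 1)


matches = iter_title_matches(
    'I am the Senior Vice President',
    ['Senior Vice President', 'Vice President', 'President'],
)

# Pull a single match; the default avoids StopIteration when nothing is found.
first = next(matches, None)

# Consume at most ten further matches instead of materialising everything.
rest = list(islice(matches, 10))
```

The same next and islice calls work on any generator, so they should apply unchanged to the object returned by finditer.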
Credits
This package was created with Cookiecutter and the fluquid/cookiecutter-pypackage project template.
History
0.2.0-dev (2017-08-18)
proper tracking of code with releases
0.1.0 (unreleased)
First release on PyPI.
Source Distribution
Hashes for find_job_titles-0.2.0.dev0.tar.gz
Algorithm | Hash digest
---|---
SHA256 | f6e36f818580894141958b0a737aea87168d7a49f8cc9042531627418041f59e
MD5 | 939231775d65509dfdb8de378caacba3
BLAKE2b-256 | a6aa7171371d8c2bd22e4260e91786af6c32b6f21543cdda5528a37bfec18501
Built Distribution
Hashes for find_job_titles-0.2.0.dev0-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 8746a8e1c33a519297c969cba00b3d07999687421361632b3623b4a817917b90
MD5 | e0a148c2480a7eae49d3630748890830
BLAKE2b-256 | 2c0c33dd40c653f343e1f0063f3d241302cc4c32abfe620e09091ee25fe5ebda