Skip to main content

Fast extraction of job titles from strings

Project description

find_job_titles Coverage Status

Find Job Titles in Strings

  • Free software: MIT license

  • Python versions: 2.7, 3.4+


  • Find any of 77k job titles in a given string

  • Text processing is extremely fast using “acora” library

  • Dictionary generation takes about 20 seconds upfront


Instantiate “Finder” and start extracting job titles:

>>> from find_job_titles import Finder
>>> finder.findall('I am the Senior Vice President')
[('Senior Vice President', 9),
 ('Vice President', 16),
 ('President', 21)]

All possible, overlapping matches are returned. Matches contain positional information of where the match was found.

Alternatively use “finditer” for lazy consumption of matches:

>>> finder.finditer('I am the Senior Vice President')]
<generator object ...>


This package was created with Cookiecutter and the fluquid/cookiecutter-pypackage project template.


0.5.0 (2017-08-22)

0.4.0 (2017-08-21)

  • updated title list with marketing execs

  • set non-dev version

0.3.0-dev (2017-08-18)

  • updated title list (- surnames, - blacklist, + added_roles)

0.2.0-dev (2017-08-18)

  • proper tracking of code with releases

0.1.0 (unreleased)

  • First release on PyPI.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

find_job_titles-0.5.0.tar.gz (394.8 kB view hashes)

Uploaded source

Built Distribution

find_job_titles-0.5.0-py2.py3-none-any.whl (381.9 kB view hashes)

Uploaded py2 py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page