Fast extraction of job titles from strings
find_job_titles
Find Job Titles in Strings
Free software: MIT license
Python versions: 2.7, 3.4+
Features
Find any of 77k job titles in a given string
Text processing is very fast thanks to the “acora” library, which implements Aho-Corasick multi-keyword search
Building the title dictionary takes about 20 seconds up front
Quickstart
Instantiate “Finder” and start extracting job titles:
>>> from find_job_titles import Finder
>>> finder = Finder()
>>> finder.findall('I am the Senior Vice President')
[('Senior Vice President', 9), ('Vice President', 16), ('President', 21)]
All possible matches are returned, including overlapping ones. Each match carries the position in the string at which it was found.
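Because overlapping matches are all reported, callers who want only the most specific title may need to filter the result. The following is a minimal sketch of one way to do that, not part of the package: `keep_longest` is a hypothetical helper, and it assumes each match is a `(text, start)` tuple in the shape shown above.

```python
def keep_longest(matches):
    """Drop every match that lies entirely inside a longer match.

    `matches` is assumed to be a list of (text, start) tuples.
    """
    # Convert to (start, end, text) spans so containment is easy to test.
    spans = [(start, start + len(text), text) for text, start in matches]
    kept = []
    for start, end, text in spans:
        contained = any(
            o_start <= start and end <= o_end and (o_end - o_start) > (end - start)
            for o_start, o_end, _ in spans
        )
        if not contained:
            kept.append((text, start))
    return kept


matches = [('Senior Vice President', 9), ('Vice President', 16), ('President', 21)]
longest = keep_longest(matches)
print(longest)  # [('Senior Vice President', 9)]
```

The containment check is quadratic in the number of matches, which is usually fine for the handful of matches found in a single string.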
Alternatively use “finditer” for lazy consumption of matches:
>>> finder.finditer('I am the Senior Vice President')
<generator object ...>
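Lazy consumption matters when only the first few matches are needed. The sketch below illustrates the pattern with a stand-in generator so that it runs without the package installed: `naive_finditer` and its three-title list are hypothetical, and the naive scan is only for illustration, not the Aho-Corasick search the package relies on.

```python
from itertools import islice


def naive_finditer(text, titles):
    """Hypothetical stand-in for finder.finditer: yields (title, position) lazily."""
    for pos in range(len(text)):
        for title in titles:
            if text.startswith(title, pos):
                yield (title, pos)


titles = ['Senior Vice President', 'Vice President', 'President']
matches = naive_finditer('I am the Senior Vice President', titles)

# Scanning stops as soon as the first match is produced.
first = next(matches, None)
# Pull at most two more matches from the same generator.
rest = list(islice(matches, 2))

print(first)  # ('Senior Vice President', 9)
print(rest)   # [('Vice President', 16), ('President', 21)]
```

The same `next(...)` and `islice(...)` calls should work on any generator, so they can be tried on the object returned by `finder.finditer`.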
Credits
This package was created with Cookiecutter and the fluquid/cookiecutter-pypackage project template.
History
0.1.0 (unreleased)
First release on PyPI.
Hashes for find_job_titles-0.1.0-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | a05bc56d25f3be1027d44aba96e219c30490f13bef152301d8b01bb9a226225b
MD5 | 4afc010938e604a81c78e03ac1e0d644
BLAKE2b-256 | 8471ee6832013dc7e08184f4f1e110e0b8079c0c8a7aedc0436880c1019bee2b