Skip to main content

No project description provided

Project description


mapling

Mapling finds things, such as place names, in texts. It returns a csv file with a row for each occurence. For each file, it creates an html page with the things highlighted. Just point mapling to a folder full of documents. Mapling uses textract to extract text from many types of files, including csv, doc, docx, pdf, html, txt and many others.

Usage: $ mapling texts/ --gazetteer=gazetteer/gazetteer.txt --model=de_core_news_sm --html To install a spaCy model: $ python -m spacy download de_core_news_sm

  • The first approach is to use a gazetteer. Mapling expects a txt file with a row for each place name. Add the --gazetter argument with the path to your file. This approach lets you search for specific terms (not just places) that appear in the text. $ mapling /dir/with/txt_files --gazetteer="/home/me/gazetter.txt"

  • The second approach uses a spaCy named entity recognition model. Add the --model argument with the name of an installed spaCy model. If your model is not installed or does not have an ner pipeline, you'll get instructions on how to fix that. This approach will return a large range of entities and places, more than you might list yourself. This is useful for establishing which places, people and organizations appear in a text. $ mapling /dir/with/txt_files --model=de_core_news_md

  • Finally, mapling can create visualizations. Add the --html argument $ mapling /dir/with/txt_files --model=de_core_news_md --html

To install:

pip install mapling

In the future, mapling will also work with the Word Historical Gazetteer to rectify, geocode and map your place names.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mapling-0.1.1.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

mapling-0.1.1-py3-none-any.whl (4.4 kB view details)

Uploaded Python 3

File details

Details for the file mapling-0.1.1.tar.gz.

File metadata

  • Download URL: mapling-0.1.1.tar.gz
  • Upload date:
  • Size: 4.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.7.7 Linux/5.3.0-46-generic

File hashes

Hashes for mapling-0.1.1.tar.gz
Algorithm Hash digest
SHA256 8e7386fd902d7645b2a413a571a1f65b02feca712f7d79ffed5c298c3801a261
MD5 76ef382a91439ee1796f653c36f69bc6
BLAKE2b-256 db230e22569854f3c52e171f1a6438895b75dae6db55da9adf81864ad0a0f52d

See more details on using hashes here.

File details

Details for the file mapling-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: mapling-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 4.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.0.5 CPython/3.7.7 Linux/5.3.0-46-generic

File hashes

Hashes for mapling-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e0b1acbd548618e45e9c8e77ccf11aab16cf1f3c7b9d3177da93ac97d4891f84
MD5 23d3a104ea01684b6856bdf086e33c9d
BLAKE2b-256 df1123e1e0631fe3f40cd8c0d56020b3281a29da2dc98d333677131dc4af93b1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page