Use MER inside Python

Use MER scripts inside Python.

(from the MER repository)

MER is a Named-Entity Recognition tool which, given any lexicon and any input text, returns the list of terms recognized in the text, including their exact locations (annotations).

Given an ontology (OWL file), MER can also link the entities to their classes.
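To make the idea of lexicon-based annotation concrete, here is a minimal sketch in plain Python. It is not how MER is implemented (MER relies on awk and grep scripts); the function `find_terms` and the two-entry `toy_lexicon` are hypothetical names used only for illustration, with the class URIs taken from the usage example further down.

```python
import re


def find_terms(text, lexicon):
    """Return [start, end, term, uri] for every lexicon term found in text."""
    annotations = []
    for term, uri in lexicon.items():
        # Word boundaries keep "fever" from matching inside "feverish".
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            annotations.append([match.start(), match.end(), match.group(0), uri])
    # Sort by start offset so annotations follow the order of the text.
    return sorted(annotations)


# Hypothetical toy lexicon: term -> ontology class URI.
toy_lexicon = {
    "fever": "http://purl.obolibrary.org/obo/HP_0001945",
    "headache": "http://purl.obolibrary.org/obo/HP_0002315",
}

annotations = find_terms("A high fever and a headache.", toy_lexicon)
for start, end, term, uri in annotations:
    print(start, end, term, uri)
```

Each annotation carries the character offsets of the match, the matched text, and the linked class, which mirrors the shape of the results shown under Basic Usage.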

More information about MER can be found in:

  • MER: a Shell Script and Annotation Server for Minimal Named Entity Recognition and Linking, F. Couto and A. Lamurias, Journal of Cheminformatics, 10:58, 2018 [https://doi.org/10.1186/s13321-018-0312-9]
  • MER: a Minimal Named-Entity Recognition Tagger and Annotation Server, F. Couto, L. Campos, and A. Lamurias, in BioCreative V.5 Challenge Evaluation, 2017 [https://www.researchgate.net/publication/316545534_MER_a_Minimal_Named-Entity_Recognition_Tagger_and_Annotation_Server]

Dependencies

awk

MER was developed and tested with GNU awk (gawk) and grep. It is not guaranteed to work with other awk implementations.

For example, to install GNU awk on Ubuntu:

sudo apt-get install gawk
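Before installing, it can help to confirm that the tools are on the PATH. The following is a small sketch using only the Python standard library; `missing_tools` is a hypothetical helper, and the assumption that the executables are named `gawk` and `grep` may not hold on every system (this page does not state which command name MER invokes).

```python
import shutil


def missing_tools(tools=("gawk", "grep")):
    """Return the names of the given executables that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


missing = missing_tools()
if missing:
    print("Not found on PATH:", ", ".join(missing))
else:
    print("gawk and grep are available")
```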

Installation

pip install merpy

or

python setup.py install

Basic Usage

>>> import merpy
>>> merpy.generate_lexicon("hp")
>>> document = 'Influenza, commonly known as "the flu", is an infectious disease caused by an influenza virus. Symptoms can be mild to severe. The most common symptoms include: a high fever, runny nose, sore throat, muscle pains, headache, coughing, and feeling tired'
>>> entities = merpy.get_entities(document, "hp")
>>> print(entities)
[['111', '115', 'mild', 'http://purl.obolibrary.org/obo/HP_0012825'], ['119', '125', 'severe', 'http://purl.obolibrary.org/obo/HP_0012828'], ['168', '173', 'fever', 'http://purl.obolibrary.org/obo/HP_0001945'], ['214', '222', 'headache', 'http://purl.obolibrary.org/obo/HP_0002315'], ['224', '232', 'coughing', 'http://purl.obolibrary.org/obo/HP_0012735'], ['246', '251', 'tired', 'http://purl.obolibrary.org/obo/HP_0012378'], ['175', '185', 'runny nose', 'http://purl.obolibrary.org/obo/HP_0031417']]
>>> lexicons = merpy.get_lexicons()
>>> merpy.show_lexicons()
lexicons preloaded:
['lexicon', 'go', 'cell_line_and_cell_type', 'chebi_lite', 'chemical', 'hp', 'disease', 'wordnet_nouns', 'hpo', 'radlex', 'doid', 'protein', 'hpomultilang', 'tissue_and_organ', 'mirna', 'subcellular_structure']

lexicons loaded ready to use:
['lexicon', 'doid', 'hp']

lexicons with linked concepts:
['doid', 'hp', 'go', 'chebi_lite', 'lexicon']
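The entity list printed above holds the offsets as strings and is not ordered by position (the 'runny nose' entry comes last). Below is a minimal sketch of tidying such a result for further processing. The rows are copied from the output above so the snippet runs without merpy installed; `tidy_entities` is a hypothetical helper, and it assumes every row has four fields, which may not be the case for lexicons without linked concepts.

```python
# Sample rows copied from the output above: [start, end, term, URI], all strings.
raw_entities = [
    ['111', '115', 'mild', 'http://purl.obolibrary.org/obo/HP_0012825'],
    ['168', '173', 'fever', 'http://purl.obolibrary.org/obo/HP_0001945'],
    ['214', '222', 'headache', 'http://purl.obolibrary.org/obo/HP_0002315'],
    ['175', '185', 'runny nose', 'http://purl.obolibrary.org/obo/HP_0031417'],
]


def tidy_entities(rows):
    """Convert rows to dicts with integer offsets, sorted by start position."""
    tidy = [
        {"start": int(start), "end": int(end), "term": term, "uri": uri}
        for start, end, term, uri in rows
    ]
    return sorted(tidy, key=lambda entity: entity["start"])


for entity in tidy_entities(raw_entities):
    print(entity["start"], entity["end"], entity["term"], entity["uri"])
```

Converting the offsets to integers before sorting matters: sorting the raw strings would compare them character by character rather than numerically.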
