drug-named-entity-recognition·PyPI

Finds drug names in a string

These details have not been verified by PyPI

Project links

Homepage

Project description

Drug named entity recognition

Developed by Fast Data Science, https://fastdatascience.com

Source code at https://github.com/fastdatascience/drug_named_entity_recognition

This is a lightweight Python library for finding drug names in a string.

Please note this library finds only high confidence drugs.

It also only finds the English names of these drugs. Names in other languages are not supported.

It also doesn’t find short code names of drugs, such as abbreviations commonly used in medicine, such as “Ceph” for “Cephradin” - as these are highly ambiguous.

Requirements

Python 3.9 and above

Installation

pip install drug-named-entity-recognition

Usage examples

You must first tokenise your input text using a tokeniser of your choice (NLTK, spaCy, etc).

You pass a list of strings to the find_drugs function.

Example 1

from drug_named_entity_recognition import find_drugs

find_drugs("i bought some Phenoxymethylpenicillin".split(" "))

outputs a list of tuples.

[({'name': 'Phenoxymethylpenicillin',
   'synonyms': {'Penicillin', 'Phenoxymethylpenicillin'},
   'nhs_url': 'https://www.nhs.uk/medicines/phenoxymethylpenicillin',
   'drugbank_id': 'DB00417'},
  3,
  3)]

You can ignore case with:

find_drugs("i bought some phenoxymethylpenicillin".split(" "), is_ignore_case=True)

Data sources

The main data source is from Drugbank, augmented by datasets from the NHS, MeSH, Medline Plus and Wikipedia.

Update the Drugbank dictionary

If you want to update the dictionary, you can use the data dump from Drugbank and replace the file drugbank vocabulary.csv:

Download the open data dump from https://go.drugbank.com/releases/latest#open-data

Update the Wikipedia dictionary

If you want to update the Wikipedia dictionary, download the dump from:

https://meta.wikimedia.org/wiki/Data_dump_torrents#English_Wikipedia

and run extract_drug_names_and_synonyms_from_wikipedia_dump.py

Update the MeSH dictionary

If you want to update the dictionary, download the open data dump from https://www.nlm.nih.gov/

and run extract_drug_names_and_synonyms_from_mesh_dump.py

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

2.0.9

Jul 15, 2025

2.0.8

Apr 13, 2025

2.0.7

Mar 28, 2025

2.0.6

Mar 28, 2025

2.0.5

Jan 24, 2025

2.0.4

Oct 14, 2024

2.0.1

Oct 10, 2024

2.0.0

Sep 6, 2024

1.0.11

Jun 21, 2024

1.0.10

Jun 21, 2024

1.0.9

Jun 21, 2024

1.0.8

Jun 20, 2024

1.0.3

Apr 14, 2024

1.0.2

Sep 27, 2023

1.0.1

Jul 7, 2023

0.5.2

Jun 20, 2024

This version

0.1

Jun 17, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

drug-named-entity-recognition-0.1.tar.gz (958.3 kB view details)

Uploaded Jun 17, 2022 Source

Built Distribution

drug_named_entity_recognition-0.1-py3-none-any.whl (962.7 kB view details)

Uploaded Jun 17, 2022 Python 3

File details

Details for the file drug-named-entity-recognition-0.1.tar.gz.

File metadata

Download URL: drug-named-entity-recognition-0.1.tar.gz
Upload date: Jun 17, 2022
Size: 958.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.7

File hashes

Hashes for drug-named-entity-recognition-0.1.tar.gz
Algorithm	Hash digest
SHA256	`f05f9cfcf236b3a980737fbc34deb3e94cecf9574122750bfbe9f2d8b20245b7`
MD5	`8b7c950ca70e1d17223fc80f052a0835`
BLAKE2b-256	`1e82780f461eef19de32598b01bd54b95dbdbd0cee34064dd2a609c5588f17bd`

See more details on using hashes here.

File details

Details for the file drug_named_entity_recognition-0.1-py3-none-any.whl.

File metadata

Download URL: drug_named_entity_recognition-0.1-py3-none-any.whl
Upload date: Jun 17, 2022
Size: 962.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.0 CPython/3.9.7

File hashes

Hashes for drug_named_entity_recognition-0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`da7037b26c7a5a0207634e3494e3c43b406aa04e4bc942afb9e04476e0ba5322`
MD5	`c2e22e5af8e429e3f7320f0efdc73e14`
BLAKE2b-256	`1c60e1685b0bd2a3be6c915537f59f4611927a51b53a299fcc85f845f34018bc`

See more details on using hashes here.

drug-named-entity-recognition 0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Drug named entity recognition

Requirements

Installation

Usage examples

Data sources

Update the Drugbank dictionary

Update the Wikipedia dictionary

Update the MeSH dictionary

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes