
The minimal SDF metadata parser

Project description



SDF files often carry useful metadata in the molecule title and in the record fields and values. However, reading a molecule through a chemistry toolkit (rdkit, OpenEye toolkits, etc.) can be slow because those libraries also construct the molecules. Modifying the metadata, or filtering and sorting on it, through such libraries can also introduce unintended, non-idempotent differences in the file, because each library takes its own opinionated approach to reading and writing it.
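To illustrate reading metadata without constructing a molecule, here is a minimal sketch in plain Python. It is not this package's API: the name `parse_record` is hypothetical, and the sketch assumes a V2000-style record whose data items follow the `M  END` line, with each value terminated by a blank line.

```python
import re

# Data header lines look like ">  <FIELD_NAME>"; the name sits in angle brackets.
_HEADER = re.compile(r"^>.*<([^>]+)>")


def parse_record(record_text):
    """Return (title, fields) for one SDF record without building a molecule.

    Hypothetical helper for illustration only.
    """
    lines = record_text.splitlines()
    title = lines[0] if lines else ""

    # Data items follow the molfile block, which ends at "M  END".
    # If that line is missing (malformed record), scan every line instead.
    start = 0
    for index, line in enumerate(lines):
        if line.rstrip() == "M  END":
            start = index + 1
            break

    fields = {}
    name = None
    for line in lines[start:]:
        if line == "$$$$":
            break  # end of record
        match = _HEADER.match(line)
        if match:
            name = match.group(1)
            fields[name] = []
        elif name is not None:
            if line == "":
                name = None  # a blank line ends the current data item
            else:
                fields[name].append(line)  # multi-line values keep every line
    return title, {key: "\n".join(value) for key, value in fields.items()}


record = """example title
  made-up program line

  0  0  0  0  0  0  0  0  0  0999 V2000
M  END
>  <ID>
mol-1

>  <NOTE>
first line
second line

$$$$
"""

title, fields = parse_record(record)
print(title)
print(fields)
```

Because only text is inspected, the atom and bond blocks are never interpreted, so a record with malformed chemistry still yields its title and fields.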

This library strives to handle SDF files even when they contain malformed chemistry or metadata. Much of the debugging of our files and data involves exactly such files, so having simple tools that interrogate a file without modifying it is crucial.
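One way to interrogate a file without modifying it is to treat every record as an opaque piece of text and only ever copy it verbatim. The sketch below is an assumption about how that could look, not this package's implementation; `iter_raw_records` and `filter_records` are hypothetical names. It uses the simplest possible rule, that a line consisting of `$$$$` ends a record, so a record whose title is itself `$$$$` would be split incorrectly.

```python
def iter_raw_records(text):
    """Yield each record as the exact text it occupies in the file."""
    current = []
    for line in text.splitlines(keepends=True):
        current.append(line)
        if line.rstrip("\r\n") == "$$$$":
            yield "".join(current)
            current = []
    if current:
        # Trailing text without a terminator is kept rather than dropped.
        yield "".join(current)


def filter_records(text, keep):
    """Concatenate, unchanged, the raw records for which keep(record) is true."""
    return "".join(record for record in iter_raw_records(text) if keep(record))


sdf_text = (
    "first\n\n\nM  END\n>  <ID>\na\n\n$$$$\n"
    "second\n\n\nM  END\n>  <ID>\nb\n\n$$$$\n"
)

# Splitting and re-joining reproduces the input character for character.
assert "".join(iter_raw_records(sdf_text)) == sdf_text

only_b = filter_records(sdf_text, lambda record: ">  <ID>\nb\n" in record)
print(only_b)
```

Since records are never parsed and re-serialised, line endings, whitespace and any malformed content pass through untouched.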

This package also tries to document the “canonical” ways in which the larger packages handle metadata. To wit, there are tests that monitor how, for example, rdkit deals with molecules that have multiline record values, or a “$$$$” molecule title.
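The “$$$$” title case is awkward because the same text also terminates a record. The sketch below shows one possible convention for a plain-text splitter: a `$$$$` line ends a record only if the record already has content, and is otherwise read as the title. This is an illustrative assumption, not a description of how rdkit or this package behaves, and `split_records` is a hypothetical name.

```python
def split_records(text):
    """Split SDF text into records, each a list of lines.

    Assumed convention: a "$$$$" line on the first line of a record is the
    molecule title; anywhere else it is the record terminator.
    """
    records = []
    current = []
    for line in text.splitlines():
        if line == "$$$$" and current:
            records.append(current)
            current = []
        else:
            current.append(line)
    if current:
        records.append(current)  # keep an unterminated final record
    return records


tricky = (
    "$$$$\n  made-up program line\n\nM  END\n$$$$\n"
    "second\n\n\nM  END\n$$$$\n"
)

records = split_records(tricky)
print(len(records))
print(records[0][0])
```

A splitter that ends a record at every `$$$$` line would instead see an empty first record here, which is the kind of divergence such tests are meant to surface.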

Download files


Source Distribution

sendoff-0.1.2.tar.gz (4.5 kB)

Uploaded Source

Built Distribution

sendoff-0.1.2-py3-none-any.whl (6.7 kB)

Uploaded Python 3
