Accelerate Molecular Biology Research with Machine Learning

MultiMolecule

License: AGPL v3

Introduction

Welcome to MultiMolecule (浦原), a foundational library designed to accelerate scientific research in molecular biology through machine learning. MultiMolecule provides a comprehensive yet flexible set of tools for researchers aiming to leverage AI with ease, focusing on biomolecular data (RNA, DNA, and protein).

Overview

MultiMolecule is built with flexibility and ease of use in mind. Its modular design allows you to utilize only the components you need, integrating seamlessly into your existing workflows without adding unnecessary complexity.

  • data: A smart Dataset (multimolecule.data.Dataset) that automatically infers tasks, including their level (sequence, token, contact) and type (classification, regression). It also provides multi-task datasets and samplers to facilitate multi-task learning without additional configuration.
  • datasets: A collection of widely-used biomolecular datasets.
  • modules: Modular neural network building blocks, including embeddings, heads, and criterions for constructing custom models.
  • models: Implementations of state-of-the-art pre-trained models in molecular biology.
  • tokenisers: Tokenizers to convert DNA, RNA, protein and other sequences to one-hot encodings.
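
To make the idea of one-hot encoding in the last bullet concrete, here is a minimal, self-contained sketch in plain Python. It does not use MultiMolecule's tokenizer API: the function name `one_hot_encode`, the four-letter `DNA_ALPHABET`, and the choice to map unknown symbols to an all-zero row are all assumptions made for illustration.

```python
# Illustrative only: a hand-rolled one-hot encoder, NOT MultiMolecule's tokenizer API.
# Assumed alphabet; a real tokenizer would also handle special and ambiguous symbols.
DNA_ALPHABET = "ACGT"


def one_hot_encode(sequence: str, alphabet: str = DNA_ALPHABET) -> list[list[int]]:
    """Return one row per residue, with a single 1 at the residue's alphabet index."""
    index = {symbol: i for i, symbol in enumerate(alphabet)}
    encoded = []
    for symbol in sequence.upper():
        row = [0] * len(alphabet)
        # Symbols outside the alphabet (for example "N") stay all-zero in this sketch.
        if symbol in index:
            row[index[symbol]] = 1
        encoded.append(row)
    return encoded


if __name__ == "__main__":
    for row in one_hot_encode("ACGN"):
        print(row)
```

Each input character becomes a vector whose length equals the alphabet size, so a sequence of length n over a k-letter alphabet yields an n × k matrix. The library's own tokenizers may use a different vocabulary and output format; consult its documentation for the actual interface.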

Installation

Install the most recent stable version from PyPI:

pip install multimolecule

Install the latest version from the source:

pip install git+https://github.com/DLS5-Omics/MultiMolecule
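
After either install command, one way to confirm which version ended up in the active environment is to query the package metadata from Python. The sketch below uses only the standard library; the helper name `installed_version` is hypothetical, and it assumes the distribution is registered under the name `multimolecule`, as in the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        # The distribution is not installed in the current environment.
        return None


if __name__ == "__main__":
    found = installed_version("multimolecule")
    print(found if found is not None else "multimolecule is not installed")
```

Reading the metadata avoids importing the package itself, so the check still works if an optional dependency is missing.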

Citation

[!NOTE] The artifacts distributed in this repository are part of the MultiMolecule project. If you use MultiMolecule in your research, you must cite the MultiMolecule project as follows:

@software{chen_2024_12638419,
  author    = {Chen, Zhiyuan and Zhu, Sophia Y.},
  title     = {MultiMolecule},
  doi       = {10.5281/zenodo.12638419},
  publisher = {Zenodo},
  url       = {https://doi.org/10.5281/zenodo.12638419},
  year      = 2024,
  month     = may,
  day       = 4
}

License

We believe openness is the Foundation of Research.

MultiMolecule is licensed under the GNU Affero General Public License.

For additional terms and clarifications, please refer to our License FAQ.

Please join us in building an open research community.

SPDX-License-Identifier: AGPL-3.0-or-later
