Skip to main content

Add your description here

Project description

RNA-SyntHub 🧬

Meta-scoring pipeline to prepare curated synthetic RNA structure data

Ruff uv Python

RNA-SyntHub is a pipeline applied to create curated RNA structures data. It has been applied to a set of 447,402 synthetic RNA structures and enforced by predictions from RNAComposer and Boltz-1.

Repo structure:

To download the data of RNA-SyntHub (structures from RFDiffusion, RNAComposer and Boltz-1), you can find the data on this link. The repo is structured as follows:

  • data/: folder containing the data used in the repo (figures, example pdb files, etc.) as well as the .pdb structures
    • img/: folder containing the figures used in the README
    • pdb/: folder containing example .pdb files
    • examples: folder containing example output files of the different steps of the pipeline
    • metadata: folder containing example extracted information from the RNA-SyntHub dataset
    • times: folder containing times benchmark on the subset of 1000 structures
  • src/: folder containing the source code of the pipeline and the visualisation
  • Makefile: file to run the different steps of the pipeline and visualisation

Installation

To compute molprobity scores, you need to install RNAqua using:

make install_rnaqua

To compute alignment, you need to install mlocarna and cd-hit. You can go to src/rna_synthub/locarna_helper.py to change the path of the binaries if needed. For cd-hit, you need to compile it first using:

make install_cd_hit

You can install the visualisation using pip:

pip install rna_synthub

Usage

Pipeline steps

If you want to reproduce the pipeline, please go the the src/analyze/script_example.py where there is a step-by-step example of the pipeline usage. You just need to uncomment the main function and run the script you want. Examples are provided with 10 .pdb files. The different steps of the pipeline are as follows:

def main():
    ScriptExample.compute_n_meta()
    ScriptExample.compute_filter()
    ScriptExample.compute_2d()
    ScriptExample.compute_clustering()

You can run the script using:

make run_example

or, if you installed the library using pip:

python synt_example

Viz steps

To run the code to obtain the different visualisation, you can use:

make run_viz

or, if you installed the library using pip:

python viz_cli

You can also have a look at the src/viz/viz_cli.py to choose which visualisation you want to run.

def main():
    VizCLI.viz_funnel()
    VizCLI.viz_filter()
    VizCLI.viz_best_worst()
    VizCLI.viz_distance()
    VizCLI.viz_sf()
    VizCLI.viz_times()
    VizCLI.viz_angles()

Citation

Authors

  • Clement Bernard
  • Marek Justyna
  • Guillaume Postic
  • Maciej Antczak
  • Fariza Tahi
  • Marta Szachniuk

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rna_synthub-0.1.0.tar.gz (101.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rna_synthub-0.1.0-py3-none-any.whl (34.3 kB view details)

Uploaded Python 3

File details

Details for the file rna_synthub-0.1.0.tar.gz.

File metadata

  • Download URL: rna_synthub-0.1.0.tar.gz
  • Upload date:
  • Size: 101.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.6

File hashes

Hashes for rna_synthub-0.1.0.tar.gz
Algorithm Hash digest
SHA256 509f3aa9a6d9dcd0c2f6fa2bf4feb9caf83027c945de2af95b20572a6f01305f
MD5 2626e3246a65dc45c6af9b39d5e2f75f
BLAKE2b-256 4bccef0f8e548d5ebb6f60bfcd7f4592f2df5b643fc6f29f7440b03b784a3cb5

See more details on using hashes here.

File details

Details for the file rna_synthub-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for rna_synthub-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f839776915663bdbd3b5e21f077894935b9b264f4dfbccbb794e59b4f9f78be3
MD5 9646f7cbf3145299a71e0f504f4198e5
BLAKE2b-256 b5f91d54b6b5c621fe164ed20e7691e296bc3cf519f0b8002b884568868b59bb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page