Skip to main content

Turn noise to read

Project description

./logo/logo.svg Documentation Status

Turn ‘noise’ to signal: accurately rectify millions of erroneous short reads through graph learning on edit distances

noise2read, originated in a computable rule translated from PCR erring mechanism that: a rare read is erroneous if it has a neighboring read of high abundance, turns erroneous reads into their original state without bringing up any non-existing sequences into the short read set(<300bp) including DNA and RNA sequencing (DNA/RNA-seq), small RNA, unique molecular identifiers (UMI) and amplicon sequencing data.

Click noise2read to jump to its documentation

Quick-run example

Quick-run example for testing noise2read by setting only 1 trial for Optuna and 10 estimators for xGboost which are not the parameters used in our paper.

Please refer to QuickStart or Installation.

  • Clone the codes with datasets in github

git clone https://github.com/Jappy0/noise2read
cd noise2read/Examples/simulated_miRNAs
  • Quick-run testing noise2read on D14

    • with high ambiguous errors correction and using GPU for training (running about 4 mins with 26 cores and GPU)

    noise2read -m correction -c ../../config/Quick_test.ini -a True -g gpu_hist

Examples for correcting simulated miRNAs data with mimic UMIs by noise2read

Take data sets D14 and D16 as examples.

Please refer to QuickStart or Installation.

  • Clone the codes with datasets in github

git clone https://github.com/Jappy0/noise2read
cd noise2read/Examples/simulated_miRNAs
  • Reproduce the evaluation results for D14 and D16 from raw, true and corrected datasets

noise2read -m evaluation -i ./simulated_miRNAs/raw/D14_umi_miRNA_mix.fa -t ./simulated_miRNAs/true/D14_umi_miRNA_mix.fa -r ./simulated_miRNAs/correct/D14_umi_miRNA_mix.fasta -d ./result
noise2read -m evaluation -i ./simulated_miRNAs/raw/D16_umi_miRNA_subs.fa -t ./simulated_miRNAs/true/D16_umi_miRNA_subs.fa -r ./simulated_miRNAs/correct/D16_umi_miRNA_subs.fasta -d ./result
  • correcting D14

    • with high ambiguous errors correction and using GPU for training

    noise2read -m correction -c ../../config/D14.ini -a True -g gpu_hist
    • without high ambiguous errors correction and using CPU (default) for training

    noise2read -m correction -c ../../config/D14.ini -a False
  • correcting D16

    • with high ambiguous errors correction and using GPU for training

    noise2read -m correction -c ../../config/D16.ini -a True -g gpu_hist
    • without high ambiguous errors correction and using CPU (default) for training

    noise2read -m correction -c ../../config/D16.ini -a False

Examples for correcting outcome sequence of ABEs and CBEs by noise2read

  • Clone the codes

git clone https://github.com/Jappy0/noise2read
cd noise2read/CaseStudies
mkdir ABEs_CBEs
cd ABEs_CBEs
  • Download datasets D32_D33.

  • Using noise2read to correct the datasets. The running time of each experiment is about 13 minutes using 26 cores and GPU for training.

noise2read -m correction -i ./D32_D33/raw/D32_ABE_outcome_seqs.fasta -a False -d ./ABE/
noise2read -m correction -i ./D32_D33/raw/D33_CBE_outcome_seqs.fasta -a False -d ./CBE/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

noise2read-0.1.2.tar.gz (61.3 kB view details)

Uploaded Source

Built Distribution

noise2read-0.1.2-py3-none-any.whl (65.0 kB view details)

Uploaded Python 3

File details

Details for the file noise2read-0.1.2.tar.gz.

File metadata

  • Download URL: noise2read-0.1.2.tar.gz
  • Upload date:
  • Size: 61.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.13

File hashes

Hashes for noise2read-0.1.2.tar.gz
Algorithm Hash digest
SHA256 12f2f1a3b4377f6a3f9c4ef8185e340f32cd00b19298389d5032642ed107c36a
MD5 6add6e206768e3e072de0d7eef3cc659
BLAKE2b-256 c22f1e48e42e05248e803574fa91c00ac2a22ac03735e241a74e0676f025f433

See more details on using hashes here.

File details

Details for the file noise2read-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: noise2read-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 65.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.13

File hashes

Hashes for noise2read-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a224de26fe6665ce83feed19fd26b139aeab8a747062e00a2fab60944219f762
MD5 c7b28efe1199ef6c3a5610c63e3211bb
BLAKE2b-256 783d1dac878a98164cf4cbf926f5bf434444ba4103a40ccae907b938407cc212

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page