Skip to main content

A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing

Project description

logo DeepChopper social

pypi license pypi version Actions status platform Space

🧬 DeepChopper leverages language model to accurately detect and chop artificial sequences which may cause chimeric reads, ensuring higher quality and more reliable sequencing results. By integrating seamlessly with existing workflows, DeepChopper provides a robust solution for researchers and bioinformatics working with NanoPore direct-RNA sequencing data.

🚀 Quick Start: Try DeepChopper Online

Experience DeepChopper instantly through our user-friendly web interface. No installation required! Simply click the button below to launch the web application and start exploring DeepChopper's capabilities:

Open in Hugging Face Spaces

What you can do online:

  • 📤 Upload your sequencing data
  • 🔬 Run DeepChopper's analysis
  • 📊 Visualize results
  • 🎛️ Experiment with different parameters

Perfect for quick tests or demonstrations! However, for extensive analyses or custom workflows, we recommend installing DeepChopper locally.

⚠️ Note: The online version is limited to one FASTQ record at a time and may not be suitable for large-scale projects.

📦 Installation

DeepChopper can be installed using pip, the Python package installer. Follow these steps to install:

  1. Ensure you have Python 3.10 or later installed on your system.

  2. Create a virtual environment (recommended):

    python -m venv deepchopper_env
    source deepchopper_env/bin/activate  # On Windows use `deepchopper_env\Scripts\activate`
    
  3. Install DeepChopper:

    pip install deepchopper
    
  4. Verify the installation:

    deepchopper --help
    

🆘 Trouble installing? Check our Troubleshooting Guide or open an issue.

🛠️ Usage

For a comprehensive guide, check out our full tutorial. Here's a quick overview:

Command-Line Interface

DeepChopper offers three main commands: encode, predict, and chop.

  1. Encode your input data:

    deepchopper encode <input.fq>
    
  2. Predict chimeric reads:

    deepchopper predict <input.parquet> --output-path predictions
    

    Using GPUs? Add the --gpus flag:

    deepchopper predict <input.parquet> --output-path predictions --gpus 2
    
  3. Chop the chimeric reads:

    deepchopper chop <predictions> raw.fq
    

Want a GUI? Launch the web interface (note: limited to one FASTQ record at a time):

deepchopper web

Python Library

Integrate DeepChopper into your Python scripts:

import deepchopper

model = deepchopper.DeepChopper.from_pretrained("yangliz5/deepchopper")
# Your analysis code here

📚 Cite

If DeepChopper aids your research, please cite our paper:


🤝 Contribution

We welcome contributions! Here's how to set up your development environment:

Build Environment

git clone https://github.com/ylab-hi/DeepChopper.git
cd DeepChopper
conda env create -n environment.yaml
conda activate deepchopper

Install Dependencies

pip install pipx
pipx install --suffix @master git+https://github.com/python-poetry/poetry.git@master
poetry@master install

🎉 Ready to contribute? Check out our Contribution Guidelines to get started!

📬 Support

Need help? Have questions?


DeepChopper is developed with ❤️ by the YLab team. Happy sequencing! 🧬🔬

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepchopper-1.2.1.tar.gz (75.6 MB view details)

Uploaded Source

Built Distributions

deepchopper-1.2.1-cp310-abi3-win_amd64.whl (4.4 MB view details)

Uploaded CPython 3.10+ Windows x86-64

deepchopper-1.2.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (5.2 MB view details)

Uploaded CPython 3.10+ manylinux: glibc 2.17+ x86-64

deepchopper-1.2.1-cp310-abi3-macosx_11_0_arm64.whl (4.3 MB view details)

Uploaded CPython 3.10+ macOS 11.0+ ARM64

deepchopper-1.2.1-cp310-abi3-macosx_10_12_x86_64.whl (4.7 MB view details)

Uploaded CPython 3.10+ macOS 10.12+ x86-64

File details

Details for the file deepchopper-1.2.1.tar.gz.

File metadata

  • Download URL: deepchopper-1.2.1.tar.gz
  • Upload date:
  • Size: 75.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.7.4

File hashes

Hashes for deepchopper-1.2.1.tar.gz
Algorithm Hash digest
SHA256 5e1bcae55a9f9d553d663771fd79c91c60d26fc329b29382ae521bdcbc0d81ed
MD5 c1c2517975e87e9eed99d6396b8eb17e
BLAKE2b-256 f5a21f2c2c728c6cbd5df618167d4b21159d2de457d4865758bdf0b4fe49b8e0

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.1-cp310-abi3-win_amd64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.1-cp310-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 2314e0cd2a7e96a8350d1d0b4f7b84fad64f377e8efe62817a218fd31021cb7a
MD5 db0e61d3ff292ce8dcd74cced617b846
BLAKE2b-256 992588952691fb5f59ca96044bbd3d1a87df21d8cf2311f1ee4d7f81c4832e42

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 346afe51eef9bf65050dc5d9b5e37e0c92afef2178dfa3f68e1ca46edc3b584e
MD5 7e7a01c279183e03215f58ce45051364
BLAKE2b-256 99065fee8e6ec45fe3f0b575db2ac0e1773019097fa30c3da2fcae83d52d1d96

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.1-cp310-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.1-cp310-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 cfa6b484e955b3023917d0be81f7261abe011ad3e9f98e8560f14133e8194dc0
MD5 9cd0324d6b4fc1a09458d24b64c27568
BLAKE2b-256 6381723fbbb509531e5c01dd95060eb4b10fa152209b0643ef09d8083d83a08f

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.1-cp310-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.1-cp310-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 3d26ba3f8aa9b287629e9e780bf52b61ec403028de968bde536342f1f534a625
MD5 decf7db3642ed284de969a29ad490127
BLAKE2b-256 86640aec551ff5192aa22b4cd46f5679a523436cf787ccecb453736574931a3a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page