Skip to main content

A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing

Project description

logo DeepChopper social

pypi PyPI - Wheel license pypi version platform Actions status Space

🧬 DeepChopper leverages language model to accurately detect and chop artificial sequences which may cause chimeric reads, ensuring higher quality and more reliable sequencing results. By integrating seamlessly with existing workflows, DeepChopper provides a robust solution for researchers and bioinformatics working with NanoPore direct-RNA sequencing data.

🚀 Quick Start: Try DeepChopper Online

Experience DeepChopper instantly through our user-friendly web interface. No installation required! Simply click the button below to launch the web application and start exploring DeepChopper's capabilities:

Open in Hugging Face Spaces

What you can do online:

  • 📤 Upload your sequencing data
  • 🔬 Run DeepChopper's analysis
  • 📊 Visualize results
  • 🎛️ Experiment with different parameters

Perfect for quick tests or demonstrations! However, for extensive analyses or custom workflows, we recommend installing DeepChopper locally.

⚠️ Note: The online version is limited to one FASTQ record at a time and may not be suitable for large-scale projects.

📦 Installation

DeepChopper can be installed using pip, the Python package installer. Follow these steps to install:

  1. Ensure you have Python 3.10 or later installed on your system.

  2. Create a virtual environment (recommended):

    python -m venv deepchopper_env
    source deepchopper_env/bin/activate  # On Windows use `deepchopper_env\Scripts\activate`
    
  3. Install DeepChopper:

    pip install deepchopper
    
  4. Verify the installation:

    deepchopper --help
    

🆘 Trouble installing? Check our Troubleshooting Guide or open an issue.

🛠️ Usage

For a comprehensive guide, check out our full tutorial. Here's a quick overview:

Command-Line Interface

DeepChopper offers three main commands: encode, predict, and chop.

  1. Encode your input data:

    deepchopper encode <input.fq>
    
  2. Predict chimeric reads:

    deepchopper predict <input.parquet> --output-path predictions
    

    Using GPUs? Add the --gpus flag:

    deepchopper predict <input.parquet> --output-path predictions --gpus 2
    
  3. Chop the chimeric reads:

    deepchopper chop <predictions> raw.fq
    

Want a GUI? Launch the web interface (note: limited to one FASTQ record at a time):

deepchopper web

Python Library

Integrate DeepChopper into your Python scripts:

import deepchopper

model = deepchopper.DeepChopper.from_pretrained("yangliz5/deepchopper")
# Your analysis code here

📚 Cite

If DeepChopper aids your research, please cite our paper:


🤝 Contribution

We welcome contributions! Here's how to set up your development environment:

Build Environment

git clone https://github.com/ylab-hi/DeepChopper.git
cd DeepChopper
conda env create -n environment.yaml
conda activate deepchopper

Install Dependencies

pip install pipx
pipx install --suffix @master git+https://github.com/python-poetry/poetry.git@master
poetry@master install

🎉 Ready to contribute? Check out our Contribution Guidelines to get started!

📬 Support

Need help? Have questions?


DeepChopper is developed with ❤️ by the YLab team. Happy sequencing! 🧬🔬

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepchopper-1.2.3.tar.gz (75.6 MB view details)

Uploaded Source

Built Distributions

deepchopper-1.2.3-cp310-abi3-win_amd64.whl (4.4 MB view details)

Uploaded CPython 3.10+ Windows x86-64

deepchopper-1.2.3-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (5.2 MB view details)

Uploaded CPython 3.10+ manylinux: glibc 2.17+ x86-64

deepchopper-1.2.3-cp310-abi3-macosx_11_0_arm64.whl (4.3 MB view details)

Uploaded CPython 3.10+ macOS 11.0+ ARM64

deepchopper-1.2.3-cp310-abi3-macosx_10_12_x86_64.whl (4.7 MB view details)

Uploaded CPython 3.10+ macOS 10.12+ x86-64

File details

Details for the file deepchopper-1.2.3.tar.gz.

File metadata

  • Download URL: deepchopper-1.2.3.tar.gz
  • Upload date:
  • Size: 75.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.7.4

File hashes

Hashes for deepchopper-1.2.3.tar.gz
Algorithm Hash digest
SHA256 fb655333519ef03bd4aaf878b6ab71cf67c93e384163882f12e38ab6f9478b83
MD5 0ce0f834074332f235caeedf1dd9b284
BLAKE2b-256 37582d7e36f3777106fc52922af2fe5ebee579902f18a8ad22c6739a6f0fb1ba

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.3-cp310-abi3-win_amd64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.3-cp310-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 97a9f418ece19b62f2b40d8ade8e465a2b8fc5607455b2a87a010db0f8dd8824
MD5 cba763fdf3c3f33add7dd20d10d4cc82
BLAKE2b-256 1360b726b712b3c07c8e6557a65d63b261571030736e3b43665bb5d46dc3123c

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.3-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.3-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 8af392688c74b9e3848f5fe85213caa68a04a175f4b73ad4dba52ba8dd549c07
MD5 ac21585892bf12d9de594eebef3b25c5
BLAKE2b-256 c7ef2c809a9e9fca9294875f5f3ee9bd2a148fde83f518d6d0c3dbafa61d22ea

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.3-cp310-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.3-cp310-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 aaf184502897e81f7ddc80f5e3e680ef324bd30cecb18a843db872bef0d0b419
MD5 39f3ef1a8c4b4b9b81fa45bc949123f6
BLAKE2b-256 025dcbcf8f78ac20f0bcd6c17886a922f12abe8e8bd3f71c265881c43580a1d1

See more details on using hashes here.

File details

Details for the file deepchopper-1.2.3-cp310-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper-1.2.3-cp310-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 99347f3baaa3a799a00f7a671b09e43d3fdbde8a2ee55dfbe4d3198b5235b9d7
MD5 e83c792d2c0ea7b417659b65067546da
BLAKE2b-256 f7ba67d77c64732473e5db4470bfad9027f5150778cf9066727d469bceff5d39

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page