Skip to main content

Genomic language model mitigates chimera artifacts in nanopore direct RNA sequencing

Project description

logo DeepChopper social

pypi PyPI - Wheel license pypi version platform Actions status Space

🧬 DeepChopper leverages a language model to accurately detect and chop artificial sequences that may cause chimeric reads, ensuring higher quality and more reliable sequencing results. By integrating seamlessly with existing workflows, DeepChopper provides a robust solution for researchers and bioinformaticians working with Nanopore direct-RNA sequencing data.

✨ What's New in v1.3.0

  • 🚀 Direct FASTQ Processing: No more encoding step! DeepChopper now works directly with FASTQ files
  • ⚡ Simplified Workflow: Go from raw data to results in just 2 commands (predictchop)
  • 📦 Auto-format Detection: Automatically handles .fastq, .fq, .fastq.gz, and .fq.gz files
  • ⚠️ Breaking Change: The encode command has been removed - update your pipelines accordingly

See full changelog →

📘 FEATURED: We provide a comprehensive tutorial that includes an example dataset in our full documentation.

🚀 Quick Start: Try DeepChopper Online

Experience DeepChopper instantly through our user-friendly web interface. No installation required! Simply click the button below to launch the web application and start exploring DeepChopper's capabilities:

Open in Hugging Face Spaces

What you can do online:

  • 📤 Upload your sequencing data
  • 🔬 Run DeepChopper's analysis
  • 📊 Visualize results
  • 🎛️ Experiment with different parameters

Perfect for quick tests or demonstrations! However, for extensive analyses or custom workflows, we recommend installing DeepChopper locally.

⚠️ Note: The online version is limited to one FASTQ record at a time and may not be suitable for large-scale projects.

📦 Installation

DeepChopper can be installed using pip, the Python package installer. Follow these steps to install:

  1. Ensure you have Python 3.10 or later installed on your system.

  2. Create a virtual environment (recommended):

    python -m venv deepchopper_env
    source deepchopper_env/bin/activate  # On Windows use `deepchopper_env\Scripts\activate`
    
  3. Install DeepChopper:

    pip install deepchopper
    
  4. Verify the installation:

    deepchopper --help
    

Compatibility and Support

DeepChopper is designed to work across various platforms and Python versions. Below are the compatibility matrices for PyPI installations:

PyPI Support

Python Version Linux x86_64 macOS Intel macOS Apple Silicon Windows x86_64
3.10
3.11
3.12

🆘 Trouble installing? Check our Troubleshooting Guide or open an issue.

🛠️ Usage

For a comprehensive guide, check out our full tutorial. Here's a quick overview:

Command-Line Interface

🎉 New in v1.3.0: DeepChopper now works directly with FASTQ files! No encoding step required.

DeepChopper offers two main commands: predict and chop.

  1. Predict chimera artifacts directly from FASTQ:

    deepchopper predict input.fastq --output predictions
    

    Using GPUs? Add the --gpus flag:

    deepchopper predict input.fastq --output predictions --gpus 2
    

    Supports all FASTQ formats: .fastq, .fq, .fastq.gz, .fq.gz

  2. Chop chimera artifacts:

    deepchopper chop predictions/0 input.fastq
    

Want a GUI? Launch the web interface (note: limited to one FASTQ record at a time):

deepchopper web

Python Library

Integrate DeepChopper into your Python scripts:

import deepchopper

model = deepchopper.DeepChopper.from_pretrained("yangliz5/deepchopper")
# Your analysis code here

📚 Cite

If DeepChopper aids your research, please cite our paper:

@article{li2026genomic,
  title = {Genomic Language Model Mitigates Chimera Artifacts in Nanopore Direct {{RNA}} Sequencing},
  author = {Li, Yangyang and Wang, Ting-You and Guo, Qingxiang and Ren, Yanan and Lu, Xiaotong and Cao, Qi and Yang, Rendong},
  date = {2026-01-19},
  journaltitle = {Nature Communications},
  shortjournal = {Nat Commun},
  publisher = {Nature Publishing Group},
  issn = {2041-1723},
  doi = {10.1038/s41467-026-68571-5},
  url = {https://www.nature.com/articles/s41467-026-68571-5},
  urldate = {2026-01-20}
}

🤝 Contribution

We welcome contributions! Here's how to set up your development environment:

Build Environment

Install UV and Rust

git clone https://github.com/ylab-hi/DeepChopper.git
cd DeepChopper

# Install dependencies
uv sync

# Run DeepChopper
uv run deepchopper --help

🎉 Ready to contribute? Check out our Contribution Guidelines to get started!

📬 Support

Need help? Have questions?


DeepChopper is developed with ❤️ by the YLab team. Happy sequencing! 🧬🔬

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepchopper-1.3.1.tar.gz (58.8 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

deepchopper-1.3.1-cp310-abi3-win_amd64.whl (5.8 MB view details)

Uploaded CPython 3.10+Windows x86-64

deepchopper-1.3.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (6.7 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.17+ x86-64

deepchopper-1.3.1-cp310-abi3-macosx_11_0_arm64.whl (5.8 MB view details)

Uploaded CPython 3.10+macOS 11.0+ ARM64

deepchopper-1.3.1-cp310-abi3-macosx_10_12_x86_64.whl (6.2 MB view details)

Uploaded CPython 3.10+macOS 10.12+ x86-64

File details

Details for the file deepchopper-1.3.1.tar.gz.

File metadata

  • Download URL: deepchopper-1.3.1.tar.gz
  • Upload date:
  • Size: 58.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.11.5

File hashes

Hashes for deepchopper-1.3.1.tar.gz
Algorithm Hash digest
SHA256 155275d659cc7d5509f0a6f20872ca0fdef8158cf00ed20a96abf726ea6c9eb5
MD5 e2fba7b915ea8b1c64af2c3ae3a4ef17
BLAKE2b-256 c299d9a87af185f3f26d3aeb4f4a6d7b1cde1f08580ef036c187fc5e56ae0085

See more details on using hashes here.

File details

Details for the file deepchopper-1.3.1-cp310-abi3-win_amd64.whl.

File metadata

File hashes

Hashes for deepchopper-1.3.1-cp310-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 56020a7fea8a7df4700510fc834f570b9ea1a18dada4e50705c93c48649b3619
MD5 b51031d5e10840fc1bf73ca02d6fb085
BLAKE2b-256 5e990cdb9ad2bf3dc51ebaa908c4f831552074840cddd86d892dc0badc397ba9

See more details on using hashes here.

File details

Details for the file deepchopper-1.3.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper-1.3.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 340561fb9ac2e00e4ef7de94a532efae7645cff7ab599cbea2efb33584e159ca
MD5 d1656e46db98ccd5009ebea79d2acda3
BLAKE2b-256 61eb6b92872c1f7e2e33f8d406d25ab7d85e1a71bdefff35f3b9d3977ea89078

See more details on using hashes here.

File details

Details for the file deepchopper-1.3.1-cp310-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for deepchopper-1.3.1-cp310-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 d8395fd32ec875ba3e56430d37c11af02a3a2fb5a79c523e3dadb2a2c2913991
MD5 8c46e6dc6f971a66f54568c5e500853b
BLAKE2b-256 89f3a06309566c016a6bfa4e671890c463ef003242f51b03b26824dba5a40c65

See more details on using hashes here.

File details

Details for the file deepchopper-1.3.1-cp310-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper-1.3.1-cp310-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 a0d3560e2e53852750d306f688cabc958b2b1747ccd328d07e85d6eca86821ad
MD5 ead48310b1b4dd1b88851899278319e2
BLAKE2b-256 5b55ac74c37154e21274f995a20e7707d673364c45796cfa40d4d67874a7fb42

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page