Genomic language model mitigates chimera artifacts in nanopore direct RNA sequencing
Project description
DeepChopper 
🧬 DeepChopper leverages a language model to accurately detect and chop artificial sequences that may cause chimeric reads, ensuring higher quality and more reliable sequencing results. By integrating seamlessly with existing workflows, DeepChopper provides a robust solution for researchers and bioinformaticians working with Nanopore direct-RNA sequencing data.
✨ What's New in v1.3.0
- 🚀 Direct FASTQ Processing: No more encoding step! DeepChopper now works directly with FASTQ files
- ⚡ Simplified Workflow: Go from raw data to results in just 2 commands (
predict→chop) - 📦 Auto-format Detection: Automatically handles
.fastq,.fq,.fastq.gz, and.fq.gzfiles - ⚠️ Breaking Change: The
encodecommand has been removed - update your pipelines accordingly
📘 FEATURED: We provide a comprehensive tutorial that includes an example dataset in our full documentation.
🚀 Quick Start: Try DeepChopper Online
Experience DeepChopper instantly through our user-friendly web interface. No installation required! Simply click the button below to launch the web application and start exploring DeepChopper's capabilities:
What you can do online:
- 📤 Upload your sequencing data
- 🔬 Run DeepChopper's analysis
- 📊 Visualize results
- 🎛️ Experiment with different parameters
Perfect for quick tests or demonstrations! However, for extensive analyses or custom workflows, we recommend installing DeepChopper locally.
⚠️ Note: The online version is limited to one FASTQ record at a time and may not be suitable for large-scale projects.
📦 Installation
DeepChopper can be installed using pip, the Python package installer. Follow these steps to install:
-
Ensure you have Python 3.10 or later installed on your system.
-
Create a virtual environment (recommended):
python -m venv deepchopper_env source deepchopper_env/bin/activate # On Windows use `deepchopper_env\Scripts\activate`
-
Install DeepChopper:
pip install deepchopper
-
Verify the installation:
deepchopper --help
Compatibility and Support
DeepChopper is designed to work across various platforms and Python versions. Below are the compatibility matrices for PyPI installations:
PyPI Support
| Python Version | Linux x86_64 | macOS Intel | macOS Apple Silicon | Windows x86_64 |
|---|---|---|---|---|
| 3.10 | ✅ | ✅ | ✅ | ✅ |
| 3.11 | ✅ | ✅ | ✅ | ✅ |
| 3.12 | ✅ | ✅ | ✅ | ✅ |
🆘 Trouble installing? Check our Troubleshooting Guide or open an issue.
🛠️ Usage
For a comprehensive guide, check out our full tutorial. Here's a quick overview:
Command-Line Interface
🎉 New in v1.3.0: DeepChopper now works directly with FASTQ files! No encoding step required.
DeepChopper offers two main commands: predict and chop.
-
Predict chimera artifacts directly from FASTQ:
deepchopper predict input.fastq --output predictions
Using GPUs? Add the
--gpusflag:deepchopper predict input.fastq --output predictions --gpus 2
Supports all FASTQ formats:
.fastq,.fq,.fastq.gz,.fq.gz -
Chop chimera artifacts:
deepchopper chop predictions/0 input.fastq
Want a GUI? Launch the web interface (note: limited to one FASTQ record at a time):
deepchopper web
Python Library
Integrate DeepChopper into your Python scripts:
import deepchopper
model = deepchopper.DeepChopper.from_pretrained("yangliz5/deepchopper")
# Your analysis code here
📚 Cite
If DeepChopper aids your research, please cite our paper:
@article{li2026genomic,
title = {Genomic Language Model Mitigates Chimera Artifacts in Nanopore Direct {{RNA}} Sequencing},
author = {Li, Yangyang and Wang, Ting-You and Guo, Qingxiang and Ren, Yanan and Lu, Xiaotong and Cao, Qi and Yang, Rendong},
date = {2026-01-19},
journaltitle = {Nature Communications},
shortjournal = {Nat Commun},
publisher = {Nature Publishing Group},
issn = {2041-1723},
doi = {10.1038/s41467-026-68571-5},
url = {https://www.nature.com/articles/s41467-026-68571-5},
urldate = {2026-01-20}
}
🤝 Contribution
We welcome contributions! Here's how to set up your development environment:
Build Environment
git clone https://github.com/ylab-hi/DeepChopper.git
cd DeepChopper
# Install dependencies
uv sync
# Run DeepChopper
uv run deepchopper --help
🎉 Ready to contribute? Check out our Contribution Guidelines to get started!
📬 Support
Need help? Have questions?
- 📖 Check our Documentation
- 🐛 Report issues
DeepChopper is developed with ❤️ by the YLab team. Happy sequencing! 🧬🔬
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file deepchopper-1.3.1.tar.gz.
File metadata
- Download URL: deepchopper-1.3.1.tar.gz
- Upload date:
- Size: 58.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
155275d659cc7d5509f0a6f20872ca0fdef8158cf00ed20a96abf726ea6c9eb5
|
|
| MD5 |
e2fba7b915ea8b1c64af2c3ae3a4ef17
|
|
| BLAKE2b-256 |
c299d9a87af185f3f26d3aeb4f4a6d7b1cde1f08580ef036c187fc5e56ae0085
|
File details
Details for the file deepchopper-1.3.1-cp310-abi3-win_amd64.whl.
File metadata
- Download URL: deepchopper-1.3.1-cp310-abi3-win_amd64.whl
- Upload date:
- Size: 5.8 MB
- Tags: CPython 3.10+, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
56020a7fea8a7df4700510fc834f570b9ea1a18dada4e50705c93c48649b3619
|
|
| MD5 |
b51031d5e10840fc1bf73ca02d6fb085
|
|
| BLAKE2b-256 |
5e990cdb9ad2bf3dc51ebaa908c4f831552074840cddd86d892dc0badc397ba9
|
File details
Details for the file deepchopper-1.3.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.
File metadata
- Download URL: deepchopper-1.3.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 6.7 MB
- Tags: CPython 3.10+, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
340561fb9ac2e00e4ef7de94a532efae7645cff7ab599cbea2efb33584e159ca
|
|
| MD5 |
d1656e46db98ccd5009ebea79d2acda3
|
|
| BLAKE2b-256 |
61eb6b92872c1f7e2e33f8d406d25ab7d85e1a71bdefff35f3b9d3977ea89078
|
File details
Details for the file deepchopper-1.3.1-cp310-abi3-macosx_11_0_arm64.whl.
File metadata
- Download URL: deepchopper-1.3.1-cp310-abi3-macosx_11_0_arm64.whl
- Upload date:
- Size: 5.8 MB
- Tags: CPython 3.10+, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d8395fd32ec875ba3e56430d37c11af02a3a2fb5a79c523e3dadb2a2c2913991
|
|
| MD5 |
8c46e6dc6f971a66f54568c5e500853b
|
|
| BLAKE2b-256 |
89f3a06309566c016a6bfa4e671890c463ef003242f51b03b26824dba5a40c65
|
File details
Details for the file deepchopper-1.3.1-cp310-abi3-macosx_10_12_x86_64.whl.
File metadata
- Download URL: deepchopper-1.3.1-cp310-abi3-macosx_10_12_x86_64.whl
- Upload date:
- Size: 6.2 MB
- Tags: CPython 3.10+, macOS 10.12+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.11.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a0d3560e2e53852750d306f688cabc958b2b1747ccd328d07e85d6eca86821ad
|
|
| MD5 |
ead48310b1b4dd1b88851899278319e2
|
|
| BLAKE2b-256 |
5b55ac74c37154e21274f995a20e7707d673364c45796cfa40d4d67874a7fb42
|