Skip to main content

CLI for DeepChopper: A Genomic Language Model for Chimera Artifact Detection

Project description

logo DeepChopper social

pypi PyPI - Wheel license pypi version platform Actions status Space

🧬 DeepChopper leverages a language model to accurately detect and chop artificial sequences that may cause chimeric reads, ensuring higher quality and more reliable sequencing results. By integrating seamlessly with existing workflows, DeepChopper provides a robust solution for researchers and bioinformaticians working with Nanopore direct-RNA sequencing data.

✨ What's New in v1.3.0

  • 🚀 Direct FASTQ Processing: No more encoding step! DeepChopper now works directly with FASTQ files
  • ⚡ Simplified Workflow: Go from raw data to results in just 2 commands (predictchop)
  • 📦 Auto-format Detection: Automatically handles .fastq, .fq, .fastq.gz, and .fq.gz files
  • ⚠️ Breaking Change: The encode command has been removed - update your pipelines accordingly

See full changelog →

📘 FEATURED: We provide a comprehensive tutorial that includes an example dataset in our full documentation.

🚀 Quick Start: Try DeepChopper Online

Experience DeepChopper instantly through our user-friendly web interface. No installation required! Simply click the button below to launch the web application and start exploring DeepChopper's capabilities:

Open in Hugging Face Spaces

What you can do online:

  • 📤 Upload your sequencing data
  • 🔬 Run DeepChopper's analysis
  • 📊 Visualize results
  • 🎛️ Experiment with different parameters

Perfect for quick tests or demonstrations! However, for extensive analyses or custom workflows, we recommend installing DeepChopper locally.

⚠️ Note: The online version is limited to one FASTQ record at a time and may not be suitable for large-scale projects.

📦 Installation

DeepChopper can be installed using pip, the Python package installer. Follow these steps to install:

  1. Ensure you have Python 3.10 or later installed on your system.

  2. Create a virtual environment (recommended):

    python -m venv deepchopper_env
    source deepchopper_env/bin/activate  # On Windows use `deepchopper_env\Scripts\activate`
    
  3. Install DeepChopper:

    pip install deepchopper
    
  4. Verify the installation:

    deepchopper --help
    

Compatibility and Support

DeepChopper is designed to work across various platforms and Python versions. Below are the compatibility matrices for PyPI installations:

PyPI Support

Python Version Linux x86_64 macOS Intel macOS Apple Silicon Windows x86_64
3.10
3.11
3.12

🆘 Trouble installing? Check our Troubleshooting Guide or open an issue.

🛠️ Usage

For a comprehensive guide, check out our full tutorial. Here's a quick overview:

Command-Line Interface

🎉 New in v1.3.0: DeepChopper now works directly with FASTQ files! No encoding step required.

DeepChopper offers two main commands: predict and chop.

  1. Predict chimera artifacts directly from FASTQ:

    deepchopper predict input.fastq --output predictions
    

    Using GPUs? Add the --gpus flag:

    deepchopper predict input.fastq --output predictions --gpus 2
    

    Supports all FASTQ formats: .fastq, .fq, .fastq.gz, .fq.gz

  2. Chop chimera artifacts:

    deepchopper chop predictions/0 input.fastq
    

Want a GUI? Launch the web interface (note: limited to one FASTQ record at a time):

deepchopper web

Python Library

Integrate DeepChopper into your Python scripts:

import deepchopper

model = deepchopper.DeepChopper.from_pretrained("yangliz5/deepchopper")
# Your analysis code here

📚 Cite

If DeepChopper aids your research, please cite our paper:

@article{li2026genomic,
  title = {Genomic Language Model Mitigates Chimera Artifacts in Nanopore Direct {{RNA}} Sequencing},
  author = {Li, Yangyang and Wang, Ting-You and Guo, Qingxiang and Ren, Yanan and Lu, Xiaotong and Cao, Qi and Yang, Rendong},
  date = {2026-01-19},
  journaltitle = {Nature Communications},
  shortjournal = {Nat Commun},
  publisher = {Nature Publishing Group},
  issn = {2041-1723},
  doi = {10.1038/s41467-026-68571-5},
  url = {https://www.nature.com/articles/s41467-026-68571-5},
  urldate = {2026-01-20}
}

🤝 Contribution

We welcome contributions! Here's how to set up your development environment:

Build Environment

Install UV and Rust

git clone https://github.com/ylab-hi/DeepChopper.git
cd DeepChopper

# Install dependencies
uv sync

# Run DeepChopper
uv run deepchopper --help

🎉 Ready to contribute? Check out our Contribution Guidelines to get started!

🔗 Related Projects

  • ChimeraLM - Identify artificial chimeric reads from whole genome amplification (WGA) processes

📬 Support

Need help? Have questions?


DeepChopper is developed with ❤️ by the YLab team. Happy sequencing! 🧬🔬

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepchopper_cli-1.3.2.tar.gz (58.8 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

deepchopper_cli-1.3.2-cp312-cp312-win_amd64.whl (1.8 MB view details)

Uploaded CPython 3.12Windows x86-64

deepchopper_cli-1.3.2-cp312-cp312-musllinux_1_2_x86_64.whl (2.2 MB view details)

Uploaded CPython 3.12musllinux: musl 1.2+ x86-64

deepchopper_cli-1.3.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

deepchopper_cli-1.3.2-cp312-cp312-macosx_11_0_arm64.whl (1.8 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

deepchopper_cli-1.3.2-cp312-cp312-macosx_10_12_x86_64.whl (1.9 MB view details)

Uploaded CPython 3.12macOS 10.12+ x86-64

deepchopper_cli-1.3.2-cp311-cp311-win_amd64.whl (1.8 MB view details)

Uploaded CPython 3.11Windows x86-64

deepchopper_cli-1.3.2-cp311-cp311-musllinux_1_2_x86_64.whl (2.2 MB view details)

Uploaded CPython 3.11musllinux: musl 1.2+ x86-64

deepchopper_cli-1.3.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

deepchopper_cli-1.3.2-cp311-cp311-macosx_11_0_arm64.whl (1.8 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

deepchopper_cli-1.3.2-cp311-cp311-macosx_10_12_x86_64.whl (1.9 MB view details)

Uploaded CPython 3.11macOS 10.12+ x86-64

deepchopper_cli-1.3.2-cp310-cp310-win_amd64.whl (1.8 MB view details)

Uploaded CPython 3.10Windows x86-64

deepchopper_cli-1.3.2-cp310-cp310-musllinux_1_2_x86_64.whl (2.2 MB view details)

Uploaded CPython 3.10musllinux: musl 1.2+ x86-64

deepchopper_cli-1.3.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.4 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

deepchopper_cli-1.3.2-cp310-cp310-macosx_11_0_arm64.whl (1.8 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

deepchopper_cli-1.3.2-cp310-cp310-macosx_10_12_x86_64.whl (1.9 MB view details)

Uploaded CPython 3.10macOS 10.12+ x86-64

File details

Details for the file deepchopper_cli-1.3.2.tar.gz.

File metadata

  • Download URL: deepchopper_cli-1.3.2.tar.gz
  • Upload date:
  • Size: 58.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.13.1

File hashes

Hashes for deepchopper_cli-1.3.2.tar.gz
Algorithm Hash digest
SHA256 bf5bb08323a13d5ad0686842033588d41b6f43a1e0f25d20ad3bf72a199049a6
MD5 7b236734a448b06ea742527d2b44a82c
BLAKE2b-256 69b25fa50435e3aa5c6b4027931ab3f57020088e82029b80051f046879613c72

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 8fdb9650cfff268af0dda86d98696e2e04357918725d536ffa07b87222e9cddd
MD5 2572764d4f29ea0e52bb976cafa304e9
BLAKE2b-256 6b868f47a5a02bab22bf95c3107d0d7bbe717c4e676e1ed05548a8f35c42c24f

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp312-cp312-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp312-cp312-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 bc9f3d76f9d817d374eb396595b5ae5ce4aad24364ab607dd36c504ab429c6b4
MD5 d193baddc1c78a6d74af72e3ad10402a
BLAKE2b-256 d1619d5e80c0330e0794a8325e0944297a4853b115a00ff6dc024cc7b79dfe54

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 b0e5dedcf62f4dd94808a62aa483098bdf5bb4ea54f726c754417044b2a8f8a0
MD5 367ee3a6fae258f8dc039921233f64ea
BLAKE2b-256 dfd5420af68c95df4b179d45bb43a90d100d612cdb2762a42c27e28adcadd1b6

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 faff96e32afbdcdb163f0c66c5c2233c0d460ab08328bb4047267c645aa7ae76
MD5 015a93da59e5ca2d377d95289e3e2a96
BLAKE2b-256 ff0e5f0bb8209eb751ea82e76669f954be249cac73119bd6e41da9722a3c21f6

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp312-cp312-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp312-cp312-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 81d4d6955a5e57b25b1c42969be6dbacd95453904060bc8009bf48975b99c1d8
MD5 ef702fd9ada911f1273fbb2f676c2b7e
BLAKE2b-256 1eeba5e313edc8243a595c50ab4928ec9fb5a0186b98854048f2d5431efaf8fa

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 4ddf22dc460d842cf1959108352012facd16dc744421c4740f477f44e771ef8c
MD5 f9953331fb2c3115c5ebca8a54dd1a3c
BLAKE2b-256 b7a06f5edb482913f7ba026ab7cdcc83593ae3aca7234a50447d12b7615c8402

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp311-cp311-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp311-cp311-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 3d685bcb150dfdcae7c9406f2cf4467c6e3cb5aaeaaa24df900411ff4dd2aa9a
MD5 9f580ec300bbaa0bb331dc9d7b164ff9
BLAKE2b-256 06b6f786c6b78140c5321734f581f82ffcf8478400def2fb37ac63b260dc3812

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9bafb79e5fdeb5ccec92a7dc54f2a6effb5670d6615e0b768d7061d7ef410e1c
MD5 bddf76f3aeaa036a97b351adecdca4d6
BLAKE2b-256 7acd337722e7a17b0c4197cc685b091e2b3d8dfd1a9971297e4c6da59641ecf0

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 7927d54060390fa20d65927c0e1d5d4d534d7efc0aba677c408aeb51b5c87ebf
MD5 adf119a2db57723cb69d4bc28159a299
BLAKE2b-256 27bc484add12ce780691ab9d680f582614c0325ca9eee8e551adc1b69ffae3b3

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp311-cp311-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp311-cp311-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 17ec3bbbf42871e75b9acd18802876ab06297a9d1de2b9c23de68b2b834ae697
MD5 9912d80e6ddcbbf524e55ef490750876
BLAKE2b-256 9edba2a44ee4e454b5a8e91d5edbb0385385c85761f7707c09ef11ca2a987feb

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 d89a42bc81ca5d05bdece3d7de5eeae6961f6ee17acc7e9dc199ae0cb63de63e
MD5 37f668db302b7ed4b9eeedaa31aa9a6e
BLAKE2b-256 e60314b4de926a881e2f53fa90688b7db691ca39ed4b235599cf3557634c9c5b

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp310-cp310-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp310-cp310-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 b337d2671530ec116e4c0d323e888802ba78983ddcaa69ea7cd36a80a54e263f
MD5 876f844a52bc4e159dc2020d714cb336
BLAKE2b-256 c8ec0525642472d43dd4bd94b614c44ab7bbdd2a14dc51e75c8989fc9002e9b2

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 075206cbc5dbb30c843512f25a1a1bc11bffdd257f0cda4fc5f4c4518c50eca8
MD5 25e8fd22675101c1c80577813c32a605
BLAKE2b-256 83fb32b3d3195d8bbe91f4e41abba325719590eca27f11909f9c3a722b2f557e

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 30cc35f479b31549f568996daf8e73ac23afb07c42c723da9707bad33944afcb
MD5 df09f0c100081922700db1919dff4278
BLAKE2b-256 973775efaa4e6565cedca284ad6469c5bc2bdf3a545fa1aea82e88608e54a824

See more details on using hashes here.

File details

Details for the file deepchopper_cli-1.3.2-cp310-cp310-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for deepchopper_cli-1.3.2-cp310-cp310-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 0a3ac39823bd7817ce65254a2f9526cbc817c594f5937587af78b61144790075
MD5 34e5c65bb7eb16b5b1bc686e3e5bb5a2
BLAKE2b-256 171679421069f93f66062e3dd0ba6136738f78692ef4a62677d536c574fec4b3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page