
A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing



🧬 DeepChopper uses a genomic language model to accurately detect and chop artificial sequences that can cause chimeric reads, improving the quality and reliability of sequencing results. It integrates with existing workflows and is aimed at researchers and bioinformaticians working with Nanopore direct RNA sequencing data.

🚀 Quick Start: Try DeepChopper Online

Experience DeepChopper instantly through our user-friendly web interface. No installation required! Simply click the button below to launch the web application and start exploring DeepChopper's capabilities:

Open in Hugging Face Spaces

What you can do online:

  • 📤 Upload your sequencing data
  • 🔬 Run DeepChopper's analysis
  • 📊 Visualize results
  • 🎛️ Experiment with different parameters

Perfect for quick tests or demonstrations! However, for extensive analyses or custom workflows, we recommend installing DeepChopper locally.

⚠️ Note: The online version is limited to one FASTQ record at a time and may not be suitable for large-scale projects.
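
Because the online version accepts a single FASTQ record, it can help to pull one read out of a larger file before uploading. The sketch below shows one possible way to do that in Python. It assumes the common four-line FASTQ layout (no wrapped sequence lines). The function name `nth_fastq_record` and the sample reads are illustrative only and are not part of DeepChopper.

```python
from itertools import islice
from typing import Iterable, Optional


def nth_fastq_record(lines: Iterable[str], index: int = 0) -> Optional[str]:
    """Return the index-th record (0-based) as a four-line string, or None."""
    start = index * 4
    record = [line.rstrip("\n") for line in islice(lines, start, start + 4)]
    if len(record) < 4:
        return None  # input ended before a full record was read
    if not record[0].startswith("@") or not record[2].startswith("+"):
        raise ValueError("input does not look like four-line FASTQ")
    return "\n".join(record) + "\n"


# Made-up reads, used only to demonstrate the helper.
demo_lines = [
    "@read_1\n", "ACGTACGT\n", "+\n", "IIIIIIII\n",
    "@read_2\n", "GGCCAATT\n", "+\n", "FFFFFFFF\n",
]
print(nth_fastq_record(demo_lines, index=1), end="")
```

With a real file, passing an open file handle instead of `demo_lines` works the same way, since the helper only iterates over lines.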

📦 Installation

DeepChopper can be installed using pip, the Python package installer. Follow these steps to install:

  1. Ensure you have Python 3.10 or later installed on your system.

  2. Create a virtual environment (recommended):

    python -m venv deepchopper_env
    source deepchopper_env/bin/activate  # On Windows use `deepchopper_env\Scripts\activate`
    
  3. Install DeepChopper:

    pip install deepchopper
    
  4. Verify the installation:

    deepchopper --help
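
If the verification step fails, a quick preflight check can narrow down the cause. The following is a minimal sketch, not part of DeepChopper: it only checks that the running interpreter meets the Python 3.10 requirement stated above and that a `deepchopper` executable is visible on `PATH`. The helper name `check_environment` is hypothetical.

```python
import shutil
import sys


def check_environment(min_version=(3, 10), command="deepchopper"):
    """Return (python_ok, cli_found) for the current interpreter and PATH."""
    python_ok = sys.version_info[:2] >= min_version
    cli_found = shutil.which(command) is not None
    return python_ok, cli_found


python_ok, cli_found = check_environment()
print(f"Python >= 3.10: {python_ok}")
print(f"deepchopper on PATH: {cli_found}")
```

If the first check passes and the second does not, the virtual environment is probably not activated in the current shell.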
    

🆘 Trouble installing? Check our Troubleshooting Guide or open an issue.

🛠️ Usage

For a comprehensive guide, check out our full tutorial. Here's a quick overview:

Command-Line Interface

DeepChopper offers three main commands: encode, predict, and chop.

  1. Encode your input data:

    deepchopper encode <input.fq>
    
  2. Predict chimeric reads:

    deepchopper predict <input.parquet> --output predictions
    

    Using GPUs? Add the --gpus flag:

    deepchopper predict <input.parquet> --output predictions --gpus 2
    
  3. Chop the chimeric reads:

    deepchopper chop <predictions> raw.fq
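
The three steps above can also be chained from a script. The sketch below builds the same command lines shown in this section and runs them with `subprocess`. Two things are assumptions rather than documented behavior: that `encode` writes a Parquet file next to the input with the same stem, and that the path given to `--output` is what `chop` expects as `<predictions>`. Adjust both to match what the commands actually produce on your system. The helper names `build_pipeline` and `run_pipeline` are hypothetical.

```python
import subprocess
from pathlib import Path


def build_pipeline(fastq, predictions="predictions", gpus=None):
    """Return the encode, predict and chop commands as argument lists."""
    fastq = Path(fastq)
    # Assumption: `encode` writes <stem>.parquet alongside the input FASTQ.
    parquet = fastq.with_suffix(".parquet")
    predict = ["deepchopper", "predict", str(parquet), "--output", str(predictions)]
    if gpus is not None:
        predict += ["--gpus", str(gpus)]
    return [
        ["deepchopper", "encode", str(fastq)],
        predict,
        # Assumption: `chop` takes the same path that was passed to --output.
        ["deepchopper", "chop", str(predictions), str(fastq)],
    ]


def run_pipeline(fastq, **kwargs):
    """Run each step in order; raise on the first non-zero exit code."""
    for command in build_pipeline(fastq, **kwargs):
        subprocess.run(command, check=True)


# Print the commands instead of running them, so this works without a GPU.
for command in build_pipeline("raw.fq", gpus=2):
    print(" ".join(command))
```

Keeping command construction separate from execution makes it easy to inspect the commands before running them on a large dataset.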
    

Want a GUI? Launch the web interface (note: limited to one FASTQ record at a time):

deepchopper web

Python Library

Integrate DeepChopper into your Python scripts:

import deepchopper

model = deepchopper.DeepChopper.from_pretrained("yangliz5/deepchopper")
# Your analysis code here

📚 Cite

If DeepChopper aids your research, please cite our paper:


🤝 Contribution

We welcome contributions! Here's how to set up your development environment:

Build Environment

git clone https://github.com/ylab-hi/DeepChopper.git
cd DeepChopper
conda env create -f environment.yaml  # -f reads the environment file; -n would set a name
conda activate deepchopper

Install Dependencies

pip install pipx
pipx install --suffix @master git+https://github.com/python-poetry/poetry.git@master
poetry@master install

🎉 Ready to contribute? Check out our Contribution Guidelines to get started!

📬 Support

Need help? Have questions?


DeepChopper is developed with ❤️ by the YLab team. Happy sequencing! 🧬🔬

Download files

Source Distribution

  • deepchopper-1.2.4.tar.gz (75.8 MB)

Built Distributions

  • deepchopper-1.2.4-cp310-abi3-win_amd64.whl (4.3 MB): CPython 3.10+, Windows x86-64
  • deepchopper-1.2.4-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (4.6 MB): CPython 3.10+, manylinux (glibc 2.17+), x86-64
  • deepchopper-1.2.4-cp310-abi3-macosx_11_0_arm64.whl (3.9 MB): CPython 3.10+, macOS 11.0+, ARM64
  • deepchopper-1.2.4-cp310-abi3-macosx_10_12_x86_64.whl (4.5 MB): CPython 3.10+, macOS 10.12+, x86-64
