
CLI for DeepChopper: A Genomic Language Model for Chimera Artifact Detection


🧬 DeepChopper leverages a language model to accurately detect and chop artificial sequences that may cause chimeric reads, ensuring higher-quality and more reliable sequencing results. By integrating seamlessly with existing workflows, DeepChopper provides a robust solution for researchers and bioinformaticians working with Nanopore direct-RNA sequencing data.

🚀 Quick Start: Try DeepChopper Online

Experience DeepChopper instantly through our user-friendly web interface. No installation required! Simply click the button below to launch the web application and start exploring DeepChopper's capabilities:

Open in Hugging Face Spaces

What you can do online:

  • 📤 Upload your sequencing data
  • 🔬 Run DeepChopper's analysis
  • 📊 Visualize results
  • 🎛️ Experiment with different parameters

Perfect for quick tests or demonstrations! However, for extensive analyses or custom workflows, we recommend installing DeepChopper locally.

⚠️ Note: The online version is limited to one FASTQ record at a time and may not be suitable for large-scale projects.
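
Because the online version accepts a single FASTQ record, it can help to cut one record out of a larger file before uploading. The sketch below is not part of DeepChopper; it assumes the common four-line FASTQ layout (header, sequence, "+", qualities), and the function name and file paths are hypothetical.

```python
from pathlib import Path


def first_fastq_record(fastq_path: str, out_path: str) -> int:
    """Copy the first FASTQ record to out_path and return the number of lines written.

    ASSUMPTION: records use the four-line layout (no wrapped sequence lines).
    """
    lines = []
    with open(fastq_path, "r", encoding="utf-8") as handle:
        for line in handle:
            lines.append(line)
            if len(lines) == 4:
                break
    # A valid record starts with "@" and has "+" on its third line.
    if len(lines) < 4 or not lines[0].startswith("@") or not lines[2].startswith("+"):
        raise ValueError("input does not start with a complete four-line FASTQ record")
    Path(out_path).write_text("".join(lines), encoding="utf-8")
    return len(lines)
```

For example, first_fastq_record("raw.fq", "single_record.fq") would write a one-record file suitable for a quick test in the web interface.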

📦 Installation

DeepChopper can be installed using pip, the Python package installer. Follow these steps to install:

  1. Ensure you have Python 3.10 or later installed on your system.

  2. Create a virtual environment (recommended):

    python -m venv deepchopper_env
    source deepchopper_env/bin/activate  # On Windows use `deepchopper_env\Scripts\activate`
    
  3. Install DeepChopper:

    pip install deepchopper
    
  4. Verify the installation:

    deepchopper --help
    

Compatibility and Support

DeepChopper is designed to work across multiple platforms and Python versions. The compatibility matrix for PyPI installations is shown below:

PyPI Support

Python Version    Linux x86_64    macOS Intel    macOS Apple Silicon    Windows x86_64
3.10              Yes             Yes            Yes                    Yes
3.11              Yes             Yes            Yes                    Yes
3.12              Yes             Yes            Yes                    Yes

🆘 Trouble installing? Check our Troubleshooting Guide or open an issue.

🛠️ Usage

For a comprehensive guide, check out our full tutorial. Here's a quick overview:

Command-Line Interface

DeepChopper offers three main commands: encode, predict, and chop.

  1. Encode your input data:

    deepchopper encode <input.fq>
    
  2. Predict chimera artifacts:

    deepchopper predict <input.parquet> --output predictions
    

    Using GPUs? Add the --gpus flag:

    deepchopper predict <input.parquet> --output predictions --gpus 2
    
  3. Chop chimera artifacts:

    deepchopper chop <predictions> raw.fq
    

Want a GUI? Launch the web interface (note: limited to one FASTQ record at a time):

deepchopper web
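
The three commands above can be chained from a script. The following is a rough sketch, not an official wrapper: it uses only the subcommands and flags shown in this section, the helper names are hypothetical, and it assumes that encode writes a Parquet file with the same stem as the input FASTQ and that the --output path from predict can be passed directly to chop. Check the actual file names produced on your system before relying on it.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_pipeline(fastq: str, gpus: Optional[int] = None,
                   output: str = "predictions") -> List[List[str]]:
    """Build the encode, predict, and chop command lines for one FASTQ file."""
    # ASSUMPTION: encode produces <stem>.parquet next to the input file.
    parquet = str(Path(fastq).with_suffix(".parquet"))
    predict = ["deepchopper", "predict", parquet, "--output", output]
    if gpus is not None:
        predict += ["--gpus", str(gpus)]
    return [
        ["deepchopper", "encode", fastq],
        predict,
        # ASSUMPTION: chop accepts the predict output path as given.
        ["deepchopper", "chop", output, fastq],
    ]


def run_pipeline(commands: List[List[str]]) -> None:
    """Run each command in order, stopping at the first failure."""
    for cmd in commands:
        subprocess.run(cmd, check=True)
```

Calling run_pipeline(build_pipeline("raw.fq", gpus=2)) would run the three steps in sequence. Keeping command construction separate from execution makes it easy to print and inspect the commands first.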

Python Library

Integrate DeepChopper into your Python scripts:

import deepchopper

model = deepchopper.DeepChopper.from_pretrained("yangliz5/deepchopper")
# Your analysis code here

📚 Cite

If DeepChopper aids your research, please cite our paper:

@article {Li2024.10.23.619929,
        author = {Li, Yangyang and Wang, Ting-You and Guo, Qingxiang and Ren, Yanan and Lu, Xiaotong and Cao, Qi and Yang, Rendong},
        title = {A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing},
        elocation-id = {2024.10.23.619929},
        year = {2024},
        doi = {10.1101/2024.10.23.619929},
        publisher = {Cold Spring Harbor Laboratory},
        abstract = {Chimera artifacts in nanopore direct RNA sequencing (dRNA-seq) data can confound transcriptome analyses, yet no existing tools are capable of detecting and removing them due to limitations in basecalling models. We present DeepChopper, a genomic language model that accurately identifies and eliminates adapter sequences within base-called dRNA-seq reads, effectively removing chimeric read artifacts. DeepChopper significantly improves critical downstream analyses, including transcript annotation and gene fusion detection, enhancing the reliability and utility of nanopore dRNA-seq for transcriptomics research. Competing Interest Statement: The authors have declared no competing interest.},
        URL = {https://www.biorxiv.org/content/early/2024/10/25/2024.10.23.619929},
        eprint = {https://www.biorxiv.org/content/early/2024/10/25/2024.10.23.619929.full.pdf},
        journal = {bioRxiv}
}

🤝 Contribution

We welcome contributions! Here's how to set up your development environment:

Build Environment

git clone https://github.com/ylab-hi/DeepChopper.git
cd DeepChopper
conda env create -f environment.yaml
conda activate deepchopper

Install Dependencies

pip install pipx
pipx install --suffix @master git+https://github.com/python-poetry/poetry.git@master
poetry@master install

🎉 Ready to contribute? Check out our Contribution Guidelines to get started!

📬 Support

Need help? Have questions?


DeepChopper is developed with ❤️ by the YLab team. Happy sequencing! 🧬🔬
