ChimeraLM: A genomic language model to identify chimera artifacts introduced by whole genome amplification (WGA).

ChimeraLM

A language model enables accurate structural variant detection in whole-genome amplified long-read sequencing



A deep learning-powered tool to identify chimeric artifacts introduced by whole genome amplification (WGA).

🌐 Try it Online

No installation required! Try ChimeraLM instantly in your browser:

🤗 Launch Web Demo on Hugging Face Spaces

Perfect for:

  • 🧪 Testing with individual sequences
  • 📊 Visualizing prediction confidence scores
  • 🎓 Learning about chimeric artifact detection
  • 🔬 Quick validation before batch processing

For production use with BAM files and batch processing, install the CLI tool below.

Installation

pip install chimeralm

Requirements: Python 3.10, 3.11, or 3.12
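
To confirm that an interpreter matches the versions listed above before installing, a small check along these lines may help. It is a minimal sketch: the helper name `is_supported` and the `SUPPORTED` set are illustrative and not part of ChimeraLM.

```python
import sys

# Versions listed under "Requirements" above; this set is transcribed
# by hand for illustration and is not read from the package metadata.
SUPPORTED = {(3, 10), (3, 11), (3, 12)}


def is_supported(version=None):
    """Return True if the (major, minor) pair is one of the listed versions."""
    major, minor = (version or sys.version_info)[:2]
    return (major, minor) in SUPPORTED


# Report on the interpreter running this script.
status = "supported" if is_supported() else "not listed as supported"
print(f"Python {sys.version_info.major}.{sys.version_info.minor} is {status}")
```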

For GPU support, installation instructions, and troubleshooting, see the Installation Guide.

Quick Start

# Predict chimeric reads (CPU)
chimeralm predict your_data.bam

# Predict with GPU acceleration
chimeralm predict your_data.bam --gpus 1 --batch-size 24

# Filter BAM to remove chimeric reads
chimeralm filter your_data.bam your_data.predictions

Output:

  • Predictions: Tab-separated file with read names and labels (0=biological, 1=chimeric)
  • Filtered BAM: {input}.filtered.sorted.bam with chimeric reads removed
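
The predictions file described above can be summarized with a few lines of Python. This is a minimal sketch, not ChimeraLM's own code. It assumes two tab-separated columns (read name, then label) and skips any line whose second column is not `0` or `1`, such as a header. The function name `summarize_predictions` and the sample read names are hypothetical.

```python
import csv
import io
from collections import Counter

# Label meanings as stated in the Output description above.
LABELS = {"0": "biological", "1": "chimeric"}


def summarize_predictions(handle):
    """Count reads per label and collect the names of chimeric reads.

    Assumes column 1 is the read name and column 2 is the label.
    """
    counts = Counter()
    chimeric = set()
    for row in csv.reader(handle, delimiter="\t"):
        # Skip blank lines, headers, or anything without a 0/1 label.
        if len(row) < 2 or row[1].strip() not in LABELS:
            continue
        name, label = row[0], row[1].strip()
        counts[LABELS[label]] += 1
        if label == "1":
            chimeric.add(name)
    return counts, chimeric


# Made-up sample data standing in for a real predictions file.
example = io.StringIO("read_a\t0\nread_b\t1\nread_c\t0\n")
counts, chimeric = summarize_predictions(example)
print(dict(counts), sorted(chimeric))
```

For a real file, open it with `open(path, newline="")` and pass the handle to the same function.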

Need more help? See the Quick Start Tutorial for a complete walkthrough.
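
The Quick Start commands can also be assembled from Python, for example to loop over several BAM files. The sketch below only builds the argument lists, using the subcommands and flags shown above (`predict`, `filter`, `--gpus`, `--batch-size`). The helper `build_commands` is hypothetical. The predictions path is passed in explicitly because where `predict` writes its output is not stated here.

```python
import shlex


def build_commands(bam, predictions, gpus=0, batch_size=None):
    """Return the predict and filter command lines as argument lists."""
    predict = ["chimeralm", "predict", bam]
    if gpus:
        predict += ["--gpus", str(gpus)]
    if batch_size is not None:
        predict += ["--batch-size", str(batch_size)]
    # The predictions path is an assumption supplied by the caller.
    filter_cmd = ["chimeralm", "filter", bam, predictions]
    return [predict, filter_cmd]


for cmd in build_commands("your_data.bam", "your_data.predictions",
                          gpus=1, batch_size=24):
    # Print the command; to execute it, pass `cmd` to
    # subprocess.run(cmd, check=True) instead.
    print(shlex.join(cmd))
```

Building argument lists rather than shell strings avoids quoting problems when file names contain spaces.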

Documentation

Full documentation is available at ylab-hi.github.io/ChimeraLM

Features

  • 🌐 Interactive Web Demo: Try it online on Hugging Face Spaces, no installation needed!
  • 🎯 High Accuracy: Deep learning model trained on real WGA data
  • ⚡ GPU Accelerated: Optimized for CUDA, MPS (Apple Silicon), and CPU
  • 🚀 Easy to Use: Simple CLI with sensible defaults
  • 📦 Fast Processing: Batch inference with configurable parallelism
  • 🖥️ Local Web Interface: Run the web UI locally with chimeralm ui
  • 🏭 Production Ready: Includes filtering, sorting, and indexing of BAM files

Related Projects

  • DeepChopper - Identify chimera artifacts induced by internal adapter sequences in Nanopore direct RNA sequencing

Contributing

Contributions are welcome! See our Contributing Guide for development setup and guidelines.

Citation

If you use ChimeraLM in your research, please cite:

@software{chimeralm2025,
  title={ChimeraLM: A genomic language model to identify chimera artifacts},
  author={Li, Yangyang and Guo, Qingxiang and Yang, Rendong},
  year={2025},
  url={https://github.com/ylab-hi/ChimeraLM}
}

License

Apache License 2.0 - see LICENSE for details.
