OmniGenome: A comprehensive toolkit for genome analysis.
Project description
**@@ #========= @@** ___ _
**@@ +----- @@** / _ \ _ __ ___ _ __ (_)
**@@ = @@** | | | || '_ ` _ \ | '_ \ | |
**@@ | |_| || | | | | || | | || |
@@** = **@@ \___/ |_| |_| |_||_| |_||_|
@@** ------+ **@@
@@** =========# **@@ ____
@@ ---------------+ @@ / ___| ___ _ __ ___ _ __ ___ ___
@@ ================== @@ | | _ / _ \| '_ \ / _ \ | '_ ` _ \ / _ \
@@ +--------------- @@ | |_| || __/| | | || (_) || | | | | || __/
@@** #========= **@@ \____| \___||_| |_| \___/ |_| |_| |_| \___|
@@** +------ **@@
@@** = **@@
@@** ____ _
**@@ = @@** | __ ) ___ _ __ ___ | |__
**@@ -----+ @@** | _ \ / _ \| '_ \ / __|| '_ \
**@@ ==========# @@** | |_) || __/| | | || (__ | | | |
@@ --------------+ @@** |____/ \___||_| |_| \___||_| |_|
OmniGenome: A Comprehensive Toolkit of Genomic Modeling and Benchmarking
Introduction
OmniGenome is a comprehensive toolkit for genomic modeling and benchmarking. It provides a unified interface for various genomic modeling tasks, including genome sequence classification, regression and so on. OmniGenome is designed to be easy to use and flexible, allowing users to customize their workflows and experiment with different algorithms and parameters. It also includes a set of benchmarking tools to evaluate the performance of different genomic foundation models and compare their results. OmniGenome is written in Python and is available as an open-source project on GitHub.
What you can do with OmniGenome
- Click-to-run tutorials of Genomic sequence modeling
- Automated benchmarking of genomic foundation models
- Instant inference using pre-trained checkpoints
- Customizable pipeline for genomic modeling tasks
Installation
before installing OmniGenome, you need to install the following dependencies:
-
Python 3.9+
-
PyTorch 2.0+
-
Transformers 4.37.0+
-
To install OmniGenome, you can use pip:
pip install omnigenome -U
or you can clone the repository and install it from source:
git clone https://github.com/yangheng95/OmniGenome.git
cd OmniGenome
pip install -e .
Quick Start
Auto-benchmark for genomic foundation models (a.k.a., pretrained models)
from omnigenome import AutoBench
gfm = 'LongSafari/hyenadna-medium-160k-seqlen-hf'
# bench_root could be "RGB", "GB", "PGB", "GUE", which will be downloaded from the Hugging Face model hub
bench_root = "RGB"
bench = AutoBench(bench_root=bench_root, model_name_or_path=gfm, overwrite=False)
bench.run(autocast=False, batch_size=bench_size, seeds=seeds)
License
OmniGenome is licensed under the Apache License 2.0. See the LICENSE file for more information.
Citation
TBC
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for OmniGenome-0.1.2a0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0f9218cd52aab84644d834b98bd4a8354ebaea660d6904cf0987b9aec84815f2 |
|
MD5 | c4563215fcb4e1061775f40ad3d9ca28 |
|
BLAKE2b-256 | 8fd7de49437533dbcbeb6bebfa09778ed0f8aef38c440ca9402b9031854f5296 |