Machine Learning for Genomics

GenoML-core

GenoML is an Automated Machine Learning (AutoML) tool for genomics. This is the core package of GenoML. This repo is under development; please report any issues (bugs, performance, documentation) on the GenoML issues page.

Here are some quick "get started" examples. Please check out the additional options and details in the Usage and CLI documentation. In general, use Linux or macOS with Python > 3.5 for best results.
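As a rough illustration of the platform advice above, the following sketch checks the running interpreter and operating system before you install anything. The function name environment_ok is hypothetical and not part of GenoML, and reading "> 3.5" as "3.6 or newer" is an assumption.

```python
import platform
import sys


def environment_ok(version_info=None, system=None):
    """Return True when the interpreter and OS match the recommendation above."""
    version_info = sys.version_info if version_info is None else version_info
    system = platform.system() if system is None else system
    # Assumption: "> 3.5" is read strictly, i.e. Python 3.6 or newer.
    new_enough = tuple(version_info[:2]) > (3, 5)
    # platform.system() reports "Linux" on Linux and "Darwin" on macOS.
    supported_os = system in ("Linux", "Darwin")
    return new_enough and supported_os


if __name__ == "__main__":
    print("environment matches recommendation:", environment_ok())
```

A False result does not necessarily mean GenoML will fail; it only means the setup differs from the one recommended here.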

Install

Run:

pip install genoml

Train the ML model

You can use the IPDGC (International Parkinson's Disease Genomics Consortium) test data. This data is a simulation of the genetic and clinical data used for Parkinson's diagnosis in previous publications. You can find it at IPDGC example data.

Download and unzip data:

wget https://github.com/ipdgc/GenoML-Brief-Intro/raw/master/exampleData.zip
unzip exampleData.zip 
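If wget or unzip is not available, the same two steps can be sketched with the Python standard library. The helper name download_and_unzip and its parameters are illustrative assumptions; only the URL comes from the commands above.

```python
import urllib.request
import zipfile

# URL taken from the wget command above.
EXAMPLE_DATA_URL = "https://github.com/ipdgc/GenoML-Brief-Intro/raw/master/exampleData.zip"


def download_and_unzip(url, archive_path, extract_dir="."):
    """Fetch a zip archive, extract it, and return the archive's member names."""
    # Save the remote file to archive_path (equivalent of the wget step).
    urllib.request.urlretrieve(url, archive_path)
    # Extract everything into extract_dir (equivalent of the unzip step).
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(extract_dir)
        return archive.namelist()
```

Calling download_and_unzip(EXAMPLE_DATA_URL, "exampleData.zip") should leave the extracted files in the current directory, as the shell commands do.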

To train, run:

genoml-train --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --model-dir=./exampleModel

The final tuned model and its performance metrics are stored in the directory passed to --model-dir.
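When training is launched from a script rather than typed by hand, the command above can be assembled and run with subprocess. This is a minimal sketch: build_train_command and run_training are hypothetical helper names, and only the three flags shown in this README are used.

```python
import subprocess


def build_train_command(geno_prefix, pheno_file, model_dir):
    """Assemble the genoml-train argument list shown in this README."""
    return [
        "genoml-train",
        "--geno-prefix=" + geno_prefix,
        "--pheno-file=" + pheno_file,
        "--model-dir=" + model_dir,
    ]


def run_training(geno_prefix, pheno_file, model_dir):
    """Run training; raises CalledProcessError if the command exits non-zero."""
    command = build_train_command(geno_prefix, pheno_file, model_dir)
    subprocess.run(command, check=True)


# Print the command for the example data without executing it.
print(" ".join(build_train_command(
    "./exampleData/training",
    "./exampleData/training.pheno",
    "./exampleModel",
)))
```

Passing the arguments as a list avoids shell quoting problems if a path contains spaces. Call run_training(...) with the same three values to start training.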

Using the trained ML model for inference

genoml-inference --model-dir=./exampleModel --valid-dir=./exampleData --valid-geno-prefix=./exampleData/validation --valid-pheno-file=./exampleData/validation.pheno

Validation results and model performance metrics are stored in the directory passed to --valid-dir.

For debugging purposes, include the -v or -vvv flags at the end of a command.
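To illustrate where the debugging flag goes, here is a small sketch that builds the inference command and appends -v or -vvv as the last argument. The helper name build_inference_command and its verbosity parameter are assumptions made for this example; the flags themselves are the ones listed in this README.

```python
def build_inference_command(model_dir, valid_dir, valid_geno_prefix,
                            valid_pheno_file, verbosity=None):
    """Assemble the genoml-inference argument list, optionally with a debug flag."""
    command = [
        "genoml-inference",
        "--model-dir=" + model_dir,
        "--valid-dir=" + valid_dir,
        "--valid-geno-prefix=" + valid_geno_prefix,
        "--valid-pheno-file=" + valid_pheno_file,
    ]
    if verbosity is not None:
        # Only the two flags mentioned in this README are accepted here.
        if verbosity not in ("-v", "-vvv"):
            raise ValueError("verbosity must be '-v' or '-vvv'")
        # The debugging flag goes at the end of the command.
        command.append(verbosity)
    return command


# Print the inference command for the example data with verbose output enabled.
print(" ".join(build_inference_command(
    "./exampleModel",
    "./exampleData",
    "./exampleData/validation",
    "./exampleData/validation.pheno",
    verbosity="-vvv",
)))
```

The resulting list can be passed to subprocess.run(command, check=True) to execute it.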

Report issues

Please report any issues or suggestions on the GenoML issues page.

Download files

Source Distribution

genoml-1.0.4.tar.gz (78.8 kB view details)

Uploaded Source

Built Distribution

genoml-1.0.4-py3-none-any.whl (250.7 kB view details)

Uploaded Python 3

File details

Details for the file genoml-1.0.4.tar.gz.

File metadata

  • Download URL: genoml-1.0.4.tar.gz
  • Size: 78.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6

File hashes

Hashes for genoml-1.0.4.tar.gz
Algorithm Hash digest
SHA256 abc55f4fbef5fc3704c2a9b756054a1513f4a67bfaa94d5ea7584d16bb4e2352
MD5 af08b34ae3b5e32dc3aa3e03e1af9675
BLAKE2b-256 295c62c41f3540bb328b4f3693235e6cfd942bdf87548292da339490397f8a11
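
A downloaded archive can be checked against the SHA256 digest listed above. The sketch below uses hashlib from the standard library; the helper names sha256_of_file and matches_published_hash are hypothetical, and the local file path is whatever you saved the download as.

```python
import hashlib

# SHA256 digest for genoml-1.0.4.tar.gz, copied from the table above.
EXPECTED_SHA256 = "abc55f4fbef5fc3704c2a9b756054a1513f4a67bfaa94d5ea7584d16bb4e2352"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the published value."""
    return sha256_of_file(path) == expected.lower()
```

For example, matches_published_hash("genoml-1.0.4.tar.gz") should return True for an intact download of the source distribution.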

File details

Details for the file genoml-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: genoml-1.0.4-py3-none-any.whl
  • Size: 250.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6

File hashes

Hashes for genoml-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 9b2acc32874ec38bd95fa685142ad0e65fabd040fa58de3e4af0706e447187c5
MD5 a1b75087104cc7af16d3cffe9901a487
BLAKE2b-256 2b34f29d6cf51ecd4f8932cd28de73c1683d429b5367e45238ad5ece066afe18
