Skip to main content

AI-powered CLI for DNA sequence and genome analysis

Project description

Genomix CLI

AI-powered CLI for DNA sequence and genome analysis.

Genomix CLI Demo

Genomix is an intelligent command-line tool that helps biologists, bioinformaticians, and researchers analyze genomic data through natural language. Ask questions about your VCF, FASTA, or FASTQ files — the AI reads them, queries real databases (NCBI, Ensembl, ClinVar), and explains results in accessible language.

Local-first. Runs with Ollama by default — your genomic data never leaves your machine.

What It Does

❯ Read raw_variants.vcf and give me a clinical summary

  ⚡ read_file(path='raw_variants.vcf')
    ↳ ##fileformat=VCFv4.2 ...

  1. BRCA1 missense (chr17:43094464): Pathogenic — increased breast/ovarian cancer risk
  2. CFTR deletion (chr7:117559593): Pathogenic — cystic fibrosis (homozygous)
  3. HBB missense (chr11:5226773): Pathogenic — sickle cell trait (carrier)
  4. APOE missense (chr19:44908822): Risk factor — Alzheimer's disease
  ...

❯ What does this reveal about the person's ancestry?

  Based on the variant profile:
  - HBB/rs334 (sickle cell trait): high frequency in African/Mediterranean populations
  - CFTR deltaF508: most common in Northern European populations
  - Combined profile suggests mixed European/African ancestry

Features

  • Natural language interface — ask questions about your genomic data in plain English or French
  • 18 MCP servers — 5 biotools (samtools, BWA, GATK, BLAST+, FastQC) + 13 databases (see below)
  • 20 slash commands/qc, /align, /variant-call, /blast, /msa, /explain, /report, /structure, and more
  • 21 built-in skills — specialized AI instructions for sequencing, comparative genomics, clinical, oncology, pharmacogenomics, and more
  • Smart analysis — reads raw VCFs (no annotations needed), identifies genes from coordinates, infers clinical significance
  • Ancestry inference — population frequency analysis via gnomAD/1000 Genomes
  • 3 AI providers — Ollama/local (default), Claude (Anthropic), OpenAI
  • Privacy mode — automatically active with local models, raw sequences never sent to cloud
  • MCP management/mcp to view, connect, and manage bioinformatics tool servers

Installation

# Install
pip install genomix-cli

# Check dependencies
genomix setup

# Initialize a project
cd my-analysis/
genomix init

Requirements

  • Python 3.11+
  • Ollama with a model (e.g., ollama pull qwen3-coder:30b)
  • Optional: samtools, BWA, GATK, BLAST+ for bioinformatics tools

Quick Start

# Start interactive mode
genomix

# Non-interactive usage
genomix ask "What is the BRCA1 gene?"
genomix ask "Read sample.vcf and summarize the variants"
genomix run /qc data/reads.fastq.gz

Interactive Session

   ██████╗ ███████╗███╗   ██╗ ██████╗ ███╗   ███╗██╗██╗  ██╗
  ...
  v0.3.0 — AI-powered genome analysis

  ┌──────────────────────────────────────────────────────┐
  │  Project    BRCA Analysis - Cohort 2026              │
  │  Organism   Homo sapiens                             │
  │  Reference  GRCh38                                   │
  │  Provider   opencode (qwen3-coder:30b)               │
  │  Privacy    🔒 ON                                    │
  │  MCP        18 registered (4 connected, 14 missing)   │
  └──────────────────────────────────────────────────────┘

  Connecting MCP servers...
  Connecting to ClinVar... ✓ (3 tools)
  Connecting to dbSNP... ✓ (3 tools)
  Connecting to Ensembl... ✓ (5 tools)
  Connecting to NCBI... ✓ (4 tools)

❯ _

Slash Commands

Command Description
Analysis
/qc Quality control (FastQC)
/align Align reads to reference genome
/variant-call Call variants (GATK/FreeBayes)
/annotate Annotate variants (SnpEff/VEP)
/pipeline Full pipeline: QC → align → call → annotate
Comparative
/blast BLAST similarity search
/msa Multiple sequence alignment
/phylo Phylogenetic tree construction
Exploration
/summary Summarize a genomic file
/search Query databases (NCBI, Ensembl...)
/explain Explain a variant, gene, or region
Session
/mcp Manage MCP servers (connect, status)
/swarm Show background analyses
/provider Switch AI provider
/model Switch model
/help Show available commands

Supported Databases

Database Description
NCBI Gene, nucleotide, and protein search
Ensembl Genome browser, gene annotations, variants
ClinVar Clinical variant interpretations
dbSNP SNP identifiers and allele frequencies
gnomAD Population allele frequencies
OMIM Mendelian disease catalog
PharmGKB Pharmacogenomics annotations
COSMIC Somatic mutations in cancer
InterPro Protein domains and families
PubMed Biomedical literature search
AlphaFold Protein structure predictions
UniProt Protein sequences and annotations
PDB Experimental protein structures

Architecture

┌─────────────────────────────────────────────┐
│              genomix-cli                     │
│                                              │
│  CLI/TUI ── Agent Loop ── Swarm Manager      │
│                 │                             │
│    ┌────────────┼────────────┐                │
│    ▼            ▼            ▼                │
│  Tool       Skills       Project              │
│  Registry   System       Manager              │
│    │                                          │
│    ▼                                          │
│  MCP Servers                                  │
│  ├── biotools: samtools, BWA, GATK,           │
│  │   BLAST+, FastQC                           │
│  └── databases: NCBI, Ensembl, ClinVar,        │
│      dbSNP, gnomAD, OMIM, PharmGKB, COSMIC,   │
│      InterPro, PubMed, AlphaFold, UniProt, PDB │
│                                               │
│  AI Providers                                 │
│  Ollama (local) │ Claude │ OpenAI             │
└───────────────────────────────────────────────┘

Configuration

# ~/.genomix/config.yaml
provider:
  default: opencode        # ollama local (default)
  model: qwen3-coder:30b

# Or use Claude/OpenAI:
# provider:
#   default: claude
#   model: claude-sonnet-4-6

API keys (if using cloud providers):

# ~/.genomix/secrets.yaml (mode 0600)
anthropic_api_key: "sk-ant-..."
openai_api_key: "sk-..."

Contributing

Contributions welcome! See CONTRIBUTING.md for development setup, project structure, and how to add new MCP servers, skills, or AI providers.

The easiest way to contribute is adding a new database MCP server — each one is a single self-contained file. See the architecture docs for an overview of the system.

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

genomix_cli-0.4.0.tar.gz (127.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

genomix_cli-0.4.0-py3-none-any.whl (96.7 kB view details)

Uploaded Python 3

File details

Details for the file genomix_cli-0.4.0.tar.gz.

File metadata

  • Download URL: genomix_cli-0.4.0.tar.gz
  • Upload date:
  • Size: 127.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for genomix_cli-0.4.0.tar.gz
Algorithm Hash digest
SHA256 837814e7e31ff67b9117193755fb3ba25b7667813f79f3cfb0bead607a066507
MD5 95960119c03b856242b723757bf9ac66
BLAKE2b-256 b03beb0e5c116f74159f750af4080c50d3f52e0c1bf0f9941d902c15e7035e18

See more details on using hashes here.

File details

Details for the file genomix_cli-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: genomix_cli-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 96.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for genomix_cli-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 28e669a9213abce71ef7312d283bc00b7a48f8df9f160d2c207534b96cddac86
MD5 369954cdb3a1e83bf39cf0b729e5ff77
BLAKE2b-256 e0a7e127cbb44ae36997827182cb4a1bb7772d64dd74b88c8db64674a3337c0f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page