Skip to main content

AI-powered CLI for DNA sequence and genome analysis

Project description

Genomix CLI

   ██████╗ ███████╗███╗   ██╗ ██████╗ ███╗   ███╗██╗██╗  ██╗
  ██╔════╝ ██╔════╝████╗  ██║██╔═══██╗████╗ ████║██║╚██╗██╔╝
  ██║  ███╗█████╗  ██╔██╗ ██║██║   ██║██╔████╔██║██║ ╚███╔╝
  ██║   ██║██╔══╝  ██║╚██╗██║██║   ██║██║╚██╔╝██║██║ ██╔██╗
  ╚██████╔╝███████╗██║ ╚████║╚██████╔╝██║ ╚═╝ ██║██║██╔╝ ██╗
   ╚═════╝ ╚══════╝╚═╝  ╚═══╝ ╚═════╝ ╚═╝     ╚═╝╚═╝╚═╝  ╚═╝

AI-powered CLI for DNA sequence and genome analysis.

Genomix CLI Demo

Genomix is an intelligent command-line tool that helps biologists, bioinformaticians, and researchers analyze genomic data through natural language. Ask questions about your VCF, FASTA, or FASTQ files — the AI reads them, queries real databases (NCBI, Ensembl, ClinVar), and explains results in accessible language.

Local-first. Runs with Ollama by default — your genomic data never leaves your machine.

What It Does

❯ Read raw_variants.vcf and give me a clinical summary

  ⚡ read_file(path='raw_variants.vcf')
    ↳ ##fileformat=VCFv4.2 ...

  1. BRCA1 missense (chr17:43094464): Pathogenic — increased breast/ovarian cancer risk
  2. CFTR deletion (chr7:117559593): Pathogenic — cystic fibrosis (homozygous)
  3. HBB missense (chr11:5226773): Pathogenic — sickle cell trait (carrier)
  4. APOE missense (chr19:44908822): Risk factor — Alzheimer's disease
  ...

❯ What does this reveal about the person's ancestry?

  Based on the variant profile:
  - HBB/rs334 (sickle cell trait): high frequency in African/Mediterranean populations
  - CFTR deltaF508: most common in Northern European populations
  - Combined profile suggests mixed European/African ancestry

Features

  • Natural language interface — ask questions about your genomic data in plain English or French
  • 9 MCP servers — samtools, BWA, GATK, BLAST+, FastQC, NCBI, Ensembl, ClinVar, dbSNP
  • Smart analysis — reads raw VCFs (no annotations needed), identifies genes from coordinates, infers clinical significance
  • Ancestry inference — population frequency analysis via gnomAD/1000 Genomes
  • 12 built-in skills — specialized AI instructions for sequencing, comparative genomics, and exploration workflows
  • 3 AI providers — Ollama/local (default), Claude (Anthropic), OpenAI
  • Privacy mode — automatically active with local models, raw sequences never sent to cloud
  • Slash commands/qc, /align, /variant-call, /blast, /msa, /explain, and more
  • MCP management/mcp to view, connect, and manage bioinformatics tool servers

Installation

# Install
pip install genomix-cli

# Check dependencies
genomix setup

# Initialize a project
cd my-analysis/
genomix init

Requirements

  • Python 3.11+
  • Ollama with a model (e.g., ollama pull qwen3-coder:30b)
  • Optional: samtools, BWA, GATK, BLAST+ for bioinformatics tools

Quick Start

# Start interactive mode
genomix

# Non-interactive usage
genomix ask "What is the BRCA1 gene?"
genomix ask "Read sample.vcf and summarize the variants"
genomix run /qc data/reads.fastq.gz

Interactive Session

   ██████╗ ███████╗███╗   ██╗ ██████╗ ███╗   ███╗██╗██╗  ██╗
  ...
  v0.1.0 — AI-powered genome analysis

  ┌──────────────────────────────────────────────────────┐
  │  Project    BRCA Analysis - Cohort 2026              │
  │  Organism   Homo sapiens                             │
  │  Reference  GRCh38                                   │
  │  Provider   opencode (qwen3-coder:30b)               │
  │  Privacy    🔒 ON                                    │
  │  MCP        9 registered (4 connected, 5 missing)    │
  └──────────────────────────────────────────────────────┘

  Connecting MCP servers...
  Connecting to ClinVar... ✓ (3 tools)
  Connecting to dbSNP... ✓ (3 tools)
  Connecting to Ensembl... ✓ (5 tools)
  Connecting to NCBI... ✓ (4 tools)

❯ _

Slash Commands

Command Description
Analysis
/qc Quality control (FastQC)
/align Align reads to reference genome
/variant-call Call variants (GATK/FreeBayes)
/annotate Annotate variants (SnpEff/VEP)
/pipeline Full pipeline: QC → align → call → annotate
Comparative
/blast BLAST similarity search
/msa Multiple sequence alignment
/phylo Phylogenetic tree construction
Exploration
/summary Summarize a genomic file
/search Query databases (NCBI, Ensembl...)
/explain Explain a variant, gene, or region
Session
/mcp Manage MCP servers (connect, status)
/swarm Show background analyses
/provider Switch AI provider
/model Switch model
/help Show available commands

Architecture

┌─────────────────────────────────────────────┐
│              genomix-cli                     │
│                                              │
│  CLI/TUI ── Agent Loop ── Swarm Manager      │
│                 │                             │
│    ┌────────────┼────────────┐                │
│    ▼            ▼            ▼                │
│  Tool       Skills       Project              │
│  Registry   System       Manager              │
│    │                                          │
│    ▼                                          │
│  MCP Servers                                  │
│  ├── biotools: samtools, BWA, GATK,           │
│  │   BLAST+, FastQC                           │
│  └── databases: NCBI, Ensembl,                │
│      ClinVar, dbSNP                           │
│                                               │
│  AI Providers                                 │
│  Ollama (local) │ Claude │ OpenAI             │
└───────────────────────────────────────────────┘

Configuration

# ~/.genomix/config.yaml
provider:
  default: opencode        # ollama local (default)
  model: qwen3-coder:30b

# Or use Claude/OpenAI:
# provider:
#   default: claude
#   model: claude-sonnet-4-6

API keys (if using cloud providers):

# ~/.genomix/secrets.yaml (mode 0600)
anthropic_api_key: "sk-ant-..."
openai_api_key: "sk-..."

Contributing

Contributions welcome! This project is in early alpha.

  • Spec: docs/superpowers/specs/2026-03-21-genomix-cli-design.md
  • Plan: docs/superpowers/plans/2026-03-21-genomix-cli.md
  • Skills guide: Create custom skills in .genomix/skills/ following the SKILL.md format

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

genomix_cli-0.1.0.tar.gz (333.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

genomix_cli-0.1.0-py3-none-any.whl (70.9 kB view details)

Uploaded Python 3

File details

Details for the file genomix_cli-0.1.0.tar.gz.

File metadata

  • Download URL: genomix_cli-0.1.0.tar.gz
  • Upload date:
  • Size: 333.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for genomix_cli-0.1.0.tar.gz
Algorithm Hash digest
SHA256 5b17d608cfc31755c923f6a95ce4a79e0e48d32b3624523609a00028b99ee8ba
MD5 751e55dd8b1767efb20399223643f40c
BLAKE2b-256 42eabdd0a196d80996a453e56cb61675e3609f3ff90f4adedf8e40a3b1f3ddd9

See more details on using hashes here.

File details

Details for the file genomix_cli-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: genomix_cli-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 70.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for genomix_cli-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5f0c7fd8d4939a6e174df511b1122ca4c4eb922f5cdd29a5e2834fff2666f3eb
MD5 f5bf253f3034bba36da51bb5c5f9aecb
BLAKE2b-256 e6d6fc50ccf1c2a3345f80cbb871832abc5a4ce4d91e5edd0acff29a48e0e0bf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page