Skip to main content

No project description provided

Project description

bio: making bioinformatics fun again

bio - command-line utilities to make bioinformatics explorations more enjoyable.

bio is a bioinformatics toy to play with.

Like LEGO pieces that match one another bio aims to provide you with commands that naturally fit together and let you express your intent with short, explicit and simple commands. It is a project in an exploratory phase, we'd welcome input and suggestions on what it should grow up into.

What does this software do?

If you've ever done bioinformatics, you know how even seemingly straightforward tasks require multiple steps, arcane incantations, and various other preparations that slow down progress.

Even well-defined, supposedly simple tasks can take a seemingly inordinate number of complicated steps. The bio package is meant to solve that tedium.

Usage examples

# Fetch genbank data
bio fetch NC_045512 MN996532 > genomes.gb

# Convert the first then bases of the genomes to FASTA.
bio fasta genomes.gb --end 10

# Align the coding sequences for the S protein
bio fasta genomes.gb --gene S --protein | bio align | head

# Print the GFF record that corresponds to the coding sequence for gene S
bio gff genomes.gb --gene S 

# Show the descendants of taxid 117565
bio taxon 117565 | head

# Show the lineage of a taxonomic rank.
bio taxon 117565 --lineage | head

# Get metadata on a viral sample
bio meta 11138 -H | head

# Define a sequence ontology terms
bio explain exon

# Define a gene ontology terms
bio explain food vacuole

Documentation

Detailed documentation is maintained at

Quick install

bio works on Linux and Mac computers and on Windows when using the Linux Subsystem.

As a rule, all Python based command line utilities should be installed via pipx to avoid conflicts with other Python packages:

pipx install bio 

Alternatively, if you can also use pip to install:

pip install bio 

See more details in the documentation.

bio is stream oriented

bio supports stream oriented programming where the output of one task may be chained into the second. Take the example above but now start with a file acc.txt that contains just the accession numbers:

NC_045512
MN996532

we can run bio to generate a VCF file with the variants of the S nucleotides forming the S protein like so:

cat acc.txt | bio fetch | bio fasta --gene S | bio align --vcf | head

to print:

##fileformat=VCFv4.2
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">
##FILTER=<ID=PASS,Description="All filters passed">
##INFO=<ID=TYPE,Number=1,Type=String,Description="Type of the variant">
##contig=<ID=YP_009724390.1,length=3822,assembly=YP_009724390.1>
#CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO    FORMAT  QHR63300.2
YP_009724390.1  33      33C/T   C       T       .       PASS    TYPE=SNP        GT      1
YP_009724390.1  54      54T/A   T       A       .       PASS    TYPE=SNP        GT      1
YP_009724390.1  60      60C/T   C       T       .       PASS    TYPE=SNP        GT      1
YP_009724390.1  69      69A/G   A       G       .       PASS    TYPE=SNP        GT      1

Who is bio designed for?

The software was written to teach bioinformatics and is the companion software to the Biostar Handbook textbook. The targeted audience comprises:

  • Students learning about bioinformatics.
  • Bioinformatics educators who need a platform to demonstrate bioinformatics concepts.
  • Scientists working with large numbers of similar genomes (bacterial/viral strains).
  • Scientists who need to investigate and understand the precise details of a genomic region closely.

The ideas and motivations fueling bio have been developed while educating the many cohorts of students who used the handbook in the classroom. bio is an opinionated take on how bioinformatics, particularly data representation and access, should be simplified and streamlined.

Development

We use the uv package system to manage the software:

uv venv

# Run the software
uv run bio

# Run the tests.
uv run bio test

Tests are automatically built from a shell script that mimics real-life usage scenarios.

Generating documentation

To generate the docs, you will need the bookdown package:

conda install r-bookdown r-servr

To run the docs in a browse:

make 

then visit http://localhost:8000

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bio-1.8.3.tar.gz (8.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bio-1.8.3-py3-none-any.whl (321.4 kB view details)

Uploaded Python 3

File details

Details for the file bio-1.8.3.tar.gz.

File metadata

  • Download URL: bio-1.8.3.tar.gz
  • Upload date:
  • Size: 8.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.15 {"installer":{"name":"uv","version":"0.9.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for bio-1.8.3.tar.gz
Algorithm Hash digest
SHA256 5c01920d842fdc7a15212c1af50296db30fa76a3bd3597f8c65724b294557937
MD5 14a5e2f85a8549e1a261da195f84c015
BLAKE2b-256 4d5c9e28f4b7902160965f7901126d303cfa8255821b25a3c8348a3d91317953

See more details on using hashes here.

File details

Details for the file bio-1.8.3-py3-none-any.whl.

File metadata

  • Download URL: bio-1.8.3-py3-none-any.whl
  • Upload date:
  • Size: 321.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.15 {"installer":{"name":"uv","version":"0.9.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for bio-1.8.3-py3-none-any.whl
Algorithm Hash digest
SHA256 52f91db8bf8b83652270610e0109d3448cfeb6e18e64b4a8778c5201dc6be44f
MD5 c1718915fc0ce846441faa2d479eed91
BLAKE2b-256 dd989521eecb05c0b30073bdf7e05ca428115ff234aea3d479d6ac642183b4a6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page