TAXonomic Profile Aggregation and STAndardisation

TAXPASTA

About

The main purpose of taxpasta is to standardise taxonomic profiles created by a range of bioinformatics tools, which we call taxonomic profilers. Each profiler comes with its own tabular output format. Across profilers, relative abundances may be reported as read counts, fractions, or percentages, and the tables may carry any number of additional columns with extra information. We therefore decided to take the lessons learnt to heart and provide our own solution to deal with this pasticcio. With taxpasta you can ingest all of those formats and, at a minimum, output taxonomy identifiers and their integer counts.
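To make the idea of a minimal standardised profile concrete, here is a small conceptual sketch in plain Python. It is not taxpasta's implementation or API: the function `standardise`, the column names `taxid`, `name`, `percent`, and `reads`, and the example rows are all hypothetical and chosen only for illustration.

```python
import csv
import io


def standardise(profile_text, id_column, count_column):
    """Reduce a tab-separated profile to (taxonomy_id, count) pairs.

    Hypothetical helper for illustration; not part of taxpasta.
    """
    reader = csv.DictReader(io.StringIO(profile_text), delimiter="\t")
    # Keep only the identifier and the count; every other column is dropped.
    return [(row[id_column], int(row[count_column])) for row in reader]


# Made-up profiler output with extra columns (name, percent).
example = (
    "taxid\tname\tpercent\treads\n"
    "562\tEscherichia coli\t75.0\t150\n"
    "1280\tStaphylococcus aureus\t25.0\t50\n"
)

print(standardise(example, id_column="taxid", count_column="reads"))
# → [('562', 150), ('1280', 50)]
```

Whatever the input layout, the result has the same two fields, which is what allows profiles from different tools to be handled uniformly downstream.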

Taxpasta can not only standardise profiles but also merge profiles from the same profiler across samples into a single table. In future, we also intend to offer methods for forming a consensus for the same sample analysed by different profilers.
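The merge step can be pictured as turning several per-sample profiles into one table with a column per sample. The sketch below is a conceptual illustration only, not taxpasta's code: `merge_profiles`, the sample names, and the counts are hypothetical, and filling taxa that are absent from a sample with zero is an assumption of this sketch.

```python
def merge_profiles(profiles):
    """Combine per-sample {taxonomy_id: count} mappings into one wide table.

    Hypothetical helper for illustration; not part of taxpasta.
    """
    samples = list(profiles)
    # Union of all taxonomy identifiers seen in any sample.
    taxa = sorted({tax for counts in profiles.values() for tax in counts})
    header = ["taxonomy_id", *samples]
    # Assumption: a taxon missing from a sample gets a count of zero.
    rows = [[tax, *(profiles[s].get(tax, 0) for s in samples)] for tax in taxa]
    return header, rows


header, rows = merge_profiles(
    {
        "sample1": {562: 150, 1280: 50},
        "sample2": {562: 90, 9606: 10},
    }
)
print(header)  # → ['taxonomy_id', 'sample1', 'sample2']
for row in rows:
    print(row)
```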

Install

It's as simple as:

pip install taxpasta

Taxpasta is also available from the Bioconda channel:

conda install -c bioconda taxpasta

Because the package is on Bioconda, automatically generated Docker and Singularity BioContainers images also exist.
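After installing by either route, one way to confirm that the package is visible to the current Python environment is to query its distribution metadata with the standard library. This is a minimal sketch; the helper name `installed_version` is hypothetical, and the only assumption about taxpasta is that its distribution name is `taxpasta`, as in the install commands.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


found = installed_version("taxpasta")
print(f"taxpasta {found}" if found else "taxpasta is not installed in this environment")
```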

Optional Dependencies

Taxpasta supports a number of extras that you can install for additional features, primarily support for additional output file formats. You can install them by specifying a comma-separated list within square brackets, for example:

pip install 'taxpasta[rich,biom]'

The following extras are available:

  • rich provides rich-formatted command line output and logging.
  • arrow supports writing output tables in Apache Arrow format.
  • parquet supports writing output tables in Apache Parquet format.
  • biom supports writing output tables in BIOM format.
  • ods supports writing output tables in ODS format.
  • xlsx supports writing output tables in Microsoft Excel format.
  • all includes all of the above.
  • dev provides all tools needed for contributing to taxpasta.
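If you want to see which extras an installed release actually declares, the package metadata can be inspected with the standard library. The sketch below reads the `Provides-Extra` metadata fields; the helper name `declared_extras` is hypothetical, and the result depends on the installed version, so no particular output is shown.

```python
from importlib.metadata import PackageNotFoundError, metadata


def declared_extras(distribution):
    """List the extras declared by an installed distribution.

    Returns an empty list if the distribution is not installed
    or declares no extras.
    """
    try:
        meta = metadata(distribution)
    except PackageNotFoundError:
        return []
    # Each declared extra appears as one "Provides-Extra" metadata field.
    return sorted(meta.get_all("Provides-Extra") or [])


print(declared_extras("taxpasta"))
```

Note that this only reports what the package declares; it does not tell you whether the dependencies of a given extra are installed.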

Copyright

  • Copyright © 2022, 2023, Moritz E. Beber, Maxime Borry, James A. Fellows Yates, and Sofia Stamouli.
  • Free software distributed under the Apache Software License 2.0.
