Skip to main content

Parsing library for extracting data frames of genomic features from GTF files

Project description

Tests Coverage Status PyPI

gtfparse

Parsing tools for GTF (gene transfer format) files.

Example usage

Parsing all rows of a GTF file into a Pandas DataFrame

from gtfparse import read_gtf

# returns GTF with essential columns such as "feature", "seqname", "start", "end"
# alongside the names of any optional keys which appeared in the attribute column
df = read_gtf("gene_annotations.gtf")

# filter DataFrame to gene entries on chrY
df_genes = df[df["feature"] == "gene"]
df_genes_chrY = df_genes[df_genes["seqname"] == "Y"]

Getting gene FPKM values from a StringTie GTF file

from gtfparse import read_gtf

df = read_gtf(
    "Transcripts.gtf",
    column_converters={"FPKM": float})

gene_fpkms = {
    gene_name: fpkm
    for (gene_name, fpkm, feature)
    in zip(df["seqname"], df["FPKM"], df["feature"])
    if feature == "gene"
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gtfparse-2.7.1.tar.gz (25.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gtfparse-2.7.1-py3-none-any.whl (17.3 kB view details)

Uploaded Python 3

File details

Details for the file gtfparse-2.7.1.tar.gz.

File metadata

  • Download URL: gtfparse-2.7.1.tar.gz
  • Upload date:
  • Size: 25.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.6

File hashes

Hashes for gtfparse-2.7.1.tar.gz
Algorithm Hash digest
SHA256 1816a40c66e486eed28499b8a4e2f1ebbfad7e540c8d540eb2f9040873a05ebe
MD5 c326265e107815e76952b4c84add11b6
BLAKE2b-256 461fa23e9b6479e87abe5928b56d2e0bd0356d185656ede1ff6fa2a268f22382

See more details on using hashes here.

File details

Details for the file gtfparse-2.7.1-py3-none-any.whl.

File metadata

  • Download URL: gtfparse-2.7.1-py3-none-any.whl
  • Upload date:
  • Size: 17.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.6

File hashes

Hashes for gtfparse-2.7.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9b88a2a15b5910ab6d67210de2d7a5f7b9b19e09275d41205a6a69aa19ba31e8
MD5 3279b312feaffbf99d5aeb53743b5faa
BLAKE2b-256 88fd38178378f41ea548e0afa0016e9c31ad272632fa2a768e35eb2b67740944

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page