
Parsing library for extracting data frames of genomic features from GTF files

gtfparse

Parsing tools for GTF (gene transfer format) files.

Example usage

Parsing all rows of a GTF file into a Pandas DataFrame

from gtfparse import read_gtf

# returns a DataFrame with the required GTF columns such as "feature", "seqname", "start", "end",
# plus one column for each optional key that appears in the attribute column
df = read_gtf("gene_annotations.gtf")

# filter DataFrame to gene entries on chrY
df_genes = df[df["feature"] == "gene"]
df_genes_chrY = df_genes[df_genes["seqname"] == "Y"]
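
To make the resulting column layout concrete, here is a minimal standard-library sketch of how a single GTF row breaks down into the eight fixed columns plus one entry per attribute key. It is an illustration only, not gtfparse's implementation: the helper name parse_gtf_line, the FIXED_COLUMNS list, and the example row (including its identifiers and coordinates) are invented for this sketch.

```python
# Names of the eight fixed GTF columns that precede the attribute column.
FIXED_COLUMNS = [
    "seqname", "source", "feature", "start", "end", "score", "strand", "frame",
]


def parse_gtf_line(line):
    """Split one GTF row into its fixed columns plus one entry per attribute key.

    Hypothetical helper for illustration; it assumes attribute values
    contain no semicolons.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 tab-separated fields, got {len(fields)}")
    row = dict(zip(FIXED_COLUMNS, fields[:8]))
    row["start"] = int(row["start"])
    row["end"] = int(row["end"])
    # Attributes are written as: key "value"; key "value";
    for item in fields[8].split(";"):
        item = item.strip()
        if not item:
            continue
        key, _, value = item.partition(" ")
        row[key] = value.strip().strip('"')
    return row


# Made-up example row; the identifiers and coordinates are not real annotations.
example_line = (
    "Y\texample_source\tgene\t100\t200\t.\t+\t.\t"
    'gene_id "GENE0001"; gene_name "EXAMPLE1"; gene_biotype "snRNA";'
)
row = parse_gtf_line(example_line)
print(row["feature"], row["seqname"], row["gene_name"])
```

Each attribute key (gene_id, gene_name, gene_biotype here) becomes its own entry, which corresponds to the optional columns mentioned in the comment above. Rows that lack a given key would have no value for that column.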

Getting gene FPKM values from a StringTie GTF file

from gtfparse import read_gtf

# convert the values of the "FPKM" attribute to floats while parsing
df = read_gtf(
    "Transcripts.gtf",
    column_converters={"FPKM": float})

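# map each gene row's gene_id to its FPKM value
# (assumes the file contains "gene" feature rows and a "gene_id" attribute)
gene_fpkms = {
    gene_id: fpkm
    for (gene_id, fpkm, feature)
    in zip(df["gene_id"], df["FPKM"], df["feature"])
    if feature == "gene"
}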
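
The idea behind column_converters can be sketched without any dependencies: attribute values arrive as strings, and a mapping from column name to function turns selected columns into typed values. The helper apply_converters and the sample rows below are hypothetical and only mirror the idea; they are not how gtfparse does the conversion internally.

```python
def apply_converters(rows, column_converters):
    """Return copies of the row dicts with each listed column converted.

    Hypothetical helper: columns missing from a row are skipped.
    """
    converted = []
    for row in rows:
        new_row = dict(row)
        for column, convert in column_converters.items():
            if column in new_row:
                new_row[column] = convert(new_row[column])
        converted.append(new_row)
    return converted


# Made-up rows standing in for parsed GTF records; values start out as strings.
rows = [
    {"feature": "gene", "gene_id": "GENE0001", "FPKM": "12.5"},
    {"feature": "transcript", "gene_id": "GENE0001", "FPKM": "7.25"},
    {"feature": "gene", "gene_id": "GENE0002", "FPKM": "0.0"},
]

typed_rows = apply_converters(rows, {"FPKM": float})

# Keep only gene-level rows and key the FPKM values by gene identifier.
gene_fpkms = {
    r["gene_id"]: r["FPKM"] for r in typed_rows if r["feature"] == "gene"
}
print(gene_fpkms)
```

Converting at parse time means later numeric work (sorting, thresholding, summing) operates on floats rather than on strings.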
