Skip to main content

GTF Parsing

Project description

[![Build Status](https://travis-ci.org/hammerlab/gtfparse.svg?branch=master)](https://travis-ci.org/hammerlab/gtfparse) [![Coverage Status](https://coveralls.io/repos/hammerlab/gtfparse/badge.svg?branch=master&service=github)](https://coveralls.io/github/hammerlab/gtfparse?branch=master)

gtfparse
========
Parsing tools for GTF (gene transfer format) files.

# Example usage

## Parsing all rows of a GTF file into a Pandas DataFrame

```python
from gtfparse import read_gtf_as_dataframe

# returns GTF with essential columns such as "feature", "seqname", "start", "end"
# alongside the names of any optional keys which appeared in the attribute column
df = read_gtf_as_dataframe("gene_annotations.gtf")

# filter DataFrame to gene entries on chrY
df_genes = df[df["feature"] == "gene"]
df_genes_chrY = df_genes[df_genes["seqname"] == "Y"]
```


## Getting gene FPKM values from a StringTie GTF file

```python
from gtfparse import read_gtf_as_dict

gtf_dict = read_gtf_as_dict(
"stringtie-output.gtf",
column_converters={"FPKM": float})

gene_fpkms = {
gene_name: fpkm
for (gene_name, fpkm, feature)
in zip(gtf_dict["gene_name"], gtf_dict["FPKM"], gtf_dict["feature"])
if feature == "gene"
}
```

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gtfparse-0.0.6.tar.gz (11.5 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page