Skip to main content

Process various file format for RNA-Seq data analysis

Project description

\n# Bioinformatics Tool: bioFile

Introduction

Retrieve data from various file formats used in RNA-Seq data analysis. The tool currently support:

  • GTF file: genomic annotations
  • GFF file: genomic annoations

quick installation

pip install biofile

Development

git clone git@github.com:Tiezhengyuan/bio_file.git
cd bio_file
source venv/bin/activate

Run unit testing:

pytest tests/unittests

Quick tour

Process GFF:

Retrieve annotations by features from <gff_file>. Multiple json files would be stored in <out_dir>

from biofile import GFF
g = GFF(gff_file, out_dir)
g.split_by_features()

Given an attribute, retrieve annotations from <gff_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.

from biofile import GFF
g = GFF(gff_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')

Process GTF:

Retrieve annotations by features from <gtf_file>. Multiple json files would be stored in <out_dir>

from biofile import GTF
g = GTF(gtf_file, out_dir)
g.split_by_features()

Given an attribute, retrieve annotations from <gtf_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.

from biofile import GTF
g = GTF(gtf_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biofile-0.1.0.tar.gz (12.8 kB view details)

Uploaded Source

Built Distribution

biofile-0.1.0-py3-none-any.whl (16.6 kB view details)

Uploaded Python 3

File details

Details for the file biofile-0.1.0.tar.gz.

File metadata

  • Download URL: biofile-0.1.0.tar.gz
  • Upload date:
  • Size: 12.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for biofile-0.1.0.tar.gz
Algorithm Hash digest
SHA256 a009d31b3c3d523e656e3d8d5ce49aee8ce0f9954e21dbc04730ecfff4a96b0c
MD5 df591be83314eced171e43c00cde736a
BLAKE2b-256 c9b6fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5

See more details on using hashes here.

File details

Details for the file biofile-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: biofile-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 16.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for biofile-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 857356795aa6d57ea374ded9c9f18abbb3a4996c88ff0e958cd3fb8a8dd1b031
MD5 8c8015ae2ff05f714f19445604412377
BLAKE2b-256 d35c21620078b8fd47d66491faa3cb33b12a058bca7b9c3f9c387e3ccb4d5f62

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page