Process various file format for RNA-Seq data analysis
Project description
\n# Bioinformatics Tool: bioFile
Introduction
Retrieve data from various file formats used in RNA-Seq data analysis. The tool currently support:
- GTF file: genomic annotations
- GFF file: genomic annoations
quick installation
pip install biofile
Development
git clone git@github.com:Tiezhengyuan/bio_file.git
cd bio_file
source venv/bin/activate
Run unit testing:
pytest tests/unittests
Quick tour
Process GFF:
Retrieve annotations by features from <gff_file>. Multiple json files would be stored in <out_dir>
from biofile import GFF
g = GFF(gff_file, out_dir)
g.split_by_features()
Given an attribute, retrieve annotations from <gff_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.
from biofile import GFF
g = GFF(gff_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')
Process GTF:
Retrieve annotations by features from <gtf_file>. Multiple json files would be stored in <out_dir>
from biofile import GTF
g = GTF(gtf_file, out_dir)
g.split_by_features()
Given an attribute, retrieve annotations from <gtf_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.
from biofile import GTF
g = GTF(gtf_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file biofile-0.1.0.tar.gz
.
File metadata
- Download URL: biofile-0.1.0.tar.gz
- Upload date:
- Size: 12.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a009d31b3c3d523e656e3d8d5ce49aee8ce0f9954e21dbc04730ecfff4a96b0c |
|
MD5 | df591be83314eced171e43c00cde736a |
|
BLAKE2b-256 | c9b6fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5 |
File details
Details for the file biofile-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: biofile-0.1.0-py3-none-any.whl
- Upload date:
- Size: 16.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 857356795aa6d57ea374ded9c9f18abbb3a4996c88ff0e958cd3fb8a8dd1b031 |
|
MD5 | 8c8015ae2ff05f714f19445604412377 |
|
BLAKE2b-256 | d35c21620078b8fd47d66491faa3cb33b12a058bca7b9c3f9c387e3ccb4d5f62 |