NeatBio - a simple Yet Another BioInformatics Library for DNA Sequence
Project description
neatbio
A Simple Yet Another Bioinformatics Library for DNA,RNA and Protein Sequencing
Installation
pip install neatbio
Benefits
- Handling Sequences(DNA,RNA,Protein)
- Protein Synthesis
- Sequence Similarity
- Kmers Generation and Kmer Distance
- Probable Back Translation of Amino Acids
- Reading FASTA files
Why NeatBio?
- NeatBio is yet another bioinformatics library along side powerful and popular bioinformatics libraries such as biopython,scikit-bio,biotite.
- It is meant to complement these powerful library in a simple way.
Usage
Handling Sequences
- Neatbio offers the ability to analyse sequences for more insight
>>>import neatbio as nt
>>> seq1 = nt.Sequence('ATGCATTGA')
>>> seq1.gc
33.33333333333333
>>> seq1.gc_frequency()
3
>>> seq1.at
66.66666666666666
>>> seq1.at_frequency()
6
>>> seq1.transcribe()
'AUGCAUUGA'
>>> mrna = seq1.transcribe()
>>> nt.Sequence(mrna).back_transcribe()
'ATGCATTGA'
>>>
>>> seq1.translate()
'MH*'
>>> seq1.translate
>>>
>>> rna_seq = nt.Sequence("AUGCAUUGA","RNA")
>>> rna_seq.seqtype
>>> Sequence(seq='AUGCAUUGA',seqtype='RNA')
Working with Proteins
>>> protein1 = nt.ProteinSeq('MIT')
>>> protein1
ProteinSeq(seq='MIT')
>>> protein1.back_translate()
'ATGATAACT'
- Note that the back_translate() function offers a probable sequence and not the exact back-translation as multiple codons can represent the same amino acids.
Convert 3 Letter Amino Acid to 1 and vice versa
>>> from neatbio.sequtils import convert_3to1,convert_1to3,get_acid_name
>>> convert_3to1('Ala')
'A'
>>> convert_1to3('L')
'Leu'
>>> get_acid_name('Ala')
'Alanine'
Generate DotPlot
>>> import neatbio as nt
>>> import neatbio.sequtils as utils
>>> seq1 = nt.Sequence('AGTCGTACT')
>>> seq2 = nt.Sequence('AGGCGCACT')
>>>
>>> utils.dotplot(seq1,seq2)
|AGGCGCACT
-----------
A|■ ■
G| ■■ ■
T| ■
C| ■ ■ ■
G| ■■ ■
T| ■
A|■ ■
C| ■ ■ ■
T| ■
>>>
Reading FASTA Files
>>> import neatbio as nt
>>> file1 = nt.read_fasta('sequence.fasta')
>>> file1['seqRecord']
>>> seq1 = nt.Sequence(file1['seqRecord'])
Documentation
- Please read the documentation for more information on what neatbio does and how to use is for your needs.
More Features To Add
- sequence alignment
- writing FASTA files
- support for more file formats
Acknowledgements
- Inspired by packages like BioPython,Scikit-Bio and Biotite
NB
- Contributions Are Welcomed
- Notice a bug, please let us know.
- Thanks A lot
By
- Jesse E.Agbe(JCharis)
- Jesus Saves @JCharisTech
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
neatbio-0.0.2.tar.gz
(12.6 kB
view details)
Built Distribution
neatbio-0.0.2-py3-none-any.whl
(12.6 kB
view details)
File details
Details for the file neatbio-0.0.2.tar.gz
.
File metadata
- Download URL: neatbio-0.0.2.tar.gz
- Upload date:
- Size: 12.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.0.5 CPython/3.6.9 Linux/5.3.0-51-generic
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a39f2ee6c7c7b8954e7016273d75779a0391836990cc8448fff05bdb232b6131 |
|
MD5 | b9a5a88654298088ef93efaf7882ed15 |
|
BLAKE2b-256 | f7335b5ceb5dd93e94ca94724e184a924ce11c34d0f412cdd9d1f74d3e3b5269 |
File details
Details for the file neatbio-0.0.2-py3-none-any.whl
.
File metadata
- Download URL: neatbio-0.0.2-py3-none-any.whl
- Upload date:
- Size: 12.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.0.5 CPython/3.6.9 Linux/5.3.0-51-generic
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e78f9edf4963fde7363b5d7fce42824ff1070b26871a57344c520d8507492e5a |
|
MD5 | 873831ce70eef14c4270d62b7deab0cb |
|
BLAKE2b-256 | 49fc2ae3fe1bebfe51f7f19b94fe0fd76aca603955d174ab794b4bc1860df5a5 |