A simple Python class for generating custom orthogonal DNA/RNA barcodes.
Project description
Orthogonal Barcodes
A simple Python class for generating custom orthogonal DNA/RNA barcodes.
Features
- Generate a configurable amount of barcodes.
- Define length of barcodes.
- GC percent filter.
- Hamming distance filter.
- Avoid common and custom restriction sites.
- Custom motifs avoidance from barcodes.
- Start and end sequence avoidance.
- Avoid barcode homology to organism of interest.
Installation
pip install OrthogonalBarcodes
Quickstart
barcodes = OrthogonalBarcode()
barcodes.length=25
barcodes.gc=50
barcodes.amount=10
barcodes.hamming_distance=4
barcodes.generate_barcodes()
print(barcodes.barcodes)
>
>
# Prints the list of barcodes below
['TTAGACGTGCAGTCGTCTTGACCCT', 'CCGTGTGGTCTCCCTCATGGTTAAA', 'GAGCCGGGTGAAACTCAAACTCAAC', 'AGTGGGGACTGTGCTAAGGCAAGAT', 'TGCAGCACCTTTGGTCACCTTTCTC', 'CAGCCGGTCTCCGATTATCTATCTC', 'TATGGGGAAGCTGTCGTGATTGGCT', 'GGCGCCACCAGAATCACTTTAAGTG', 'TACGGTCGATATGGATCCTTCCGTG', 'TGAGTCTCAGGTACGCAAGTTGCCT']
Full Usage
# Instantiate the class and set required/optional properties below.
barcodes = OrthogonalBarcode()
# Required — The length of each barcode (Default:20)
barcodes.length=25
# Required — The GC percentage for each barcode generated (Default:50)
barcodes.gc=50
# Required — The amount of barcodes to generate (Default:1)
barcodes.amount=10
# Optional — Hamming distance between all barcodes. (Default:0)
barcodes.hamming_distance=4
# Optional — Fasta file path for organism of interest. (Default:none)
barcodes.ooi_file='tests/test_ooi.fasta'
# Optional — Maximum barcode homology to any part of the oranism of interest file Default:50)
barcodes.ooi_homology_threshold = 40
# Optional — List of restriction sites to avoid when generating barcodes. (Default:none; Accepts Bio.Restriction properties)
barcodes.avoid_rs=[EcoRI,BamHI,NheI,XhoI,KasI]
# Optional — List of any motifs to filter from barcodes (Default: "AAAA","TTTT","CGCGCGCG","ATATATAT")
barcodes.avoid_motifs=["AAATTTT","TTTTTT","GGGGGGGG"]
# Optional — List of starting sequences to avoid (Default: "ATATAT")
barcodes.avoid_start=["AAATTTT","TTTTTT","GGGGGGGG"]
# Optional — List of ending sequences to avoid (Default: "ATATAT")
barcodes.avoid_end=["AAATTTT","TTTTTT","GGGGGGGG"]
# Generate the barcodes
barcodes.generate_barcodes()
# Print a list of generated barcodes.
print(barcodes.barcodes)
>
>
# Prints the list of barcodes below
['AATCGATCGTGGCATCGTCCCTATC', 'GAAATCGAGACTCCGACCGATGTCT', 'GGTAACTAGTCCTAGATCAGCGAGG', 'AACAGTTCCTGGTGGTGTCTAGGCT', 'TGTTGCGTCCGTACTGTGGCGTAAA', 'CCGATTGATCTGACGTCGTGTCAAG', 'CTAGGACCATTGACTCGGCAACAAG', 'AATACTCACGATGCGATTTCCGCGG', 'TCGGTCTGTAGAGAGAGATACGTGC', 'AGGGTCGTCAGAAACTGAACCTGCT']
Notes
This class uses a brute force method to generate barcodes. The amount of time to generate barcodes increases greatly with the amount of barcodes being generated. Additionally, when filtering against an organism of interest or increasing the hamming distance between barcodes, the class will increase its complexity and barcode generation time will increase.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file OrthogonalBarcodes-0.2.3.tar.gz
.
File metadata
- Download URL: OrthogonalBarcodes-0.2.3.tar.gz
- Upload date:
- Size: 4.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e8254eda7843d961783ae4e0687e5d4a83c0a985be84538c84109c537aa22ac0 |
|
MD5 | 5aa40beed518937b00169a690118b364 |
|
BLAKE2b-256 | 37760e63c55a67df01a222bf1692f4f7820985eb94501f568a5f671aaa7e08d4 |