Python package to parse Observed antibody sequence data for easy reporting, visualization, annotation and alignment.
Project description
OAS_wrapper
A wrapper to parse Observed Antibody Space (OAS) data for better visualization, annotation and comparison of sequence to germline.
Functionalities
The scripts include functionalities to:
-
Provide basic metrics and visualizations of sequences and their annotations
-
Provide original IMGT sequences for V, D and J calls made in OAS data using IMGT reference database
-
Align sequence and germline to highlight regions of mismatches, providing positional information e.g.,
Example Output:
Original Sequence 1: GAGGTGCAGCTGTTGGAGTCTGGGGGAGGCTTAGTACAGCCTGGGGGGTCCCTGAGACTCTCCTGTGTAGCCTCTGGATTCACCTTTAGCAGCTATGCCATGAGCTGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAGTGGGTCTCAGGTATTAGTGCTAGTGGTGCTAGCACATACTACGCAGACTCCGTGAAGGGCCGGTTCACCATCTCCAGAGACAATTCCAAGAACACGCTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCCGTATATTACTGTGCGAAAACCCCCAAATACGATGTTTGGAGTGGTTATTATACGTCCAATGCCTTTGATATCTGGGGCCAAGGGACAATGGTCACCGTCTCTTCAG
Original Sequence 2: GAGGTGCAGCTGTTGGAGTCTGGGGGAGGCTTGGTACAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGCAGCTATGCCATGAGCTGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAGTGGGTCTCAGCTATTAGTGGTAGTGGTGGTAGCACATACTACGCAGACTCCGTGAAGGGCCGGTTCACCATCTCCAGAGACAATTCCAAGAACACGCTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCCGTATATTACTGTGCGAAANNNNNNNNNTACGATTTTTGGAGTGGTTATTATACNNNNNATGCTTTTGATATCTGGGGCCAAGGGACAATGGTCACCGTCTCTTCAG
Aligned and Highlighted Differences:
GAGGTGCAGCTGTTGGAGTCTGGGGGAGGCTT[31mA[0mGTACAGCCTGGGGGGTCCCTGAGACTCTCCTGTG[31mT[0mAGCCTCTGGATTCACCTTTAGCAGCTATGCCATGAGCTGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAGTGGGTCTCAG[31mG[0mTATTAGTG[31mC[0mTAGTGGTG[31mC[0mTAGCACATACTACGCAGACTCCGTGAAGGGCCGGTTCACCATCTCCAGAGACAATTCCAAGAACACGCTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCCGTATATTACTGTGCGAAA[31mA[0m[31mC[0m[31mC[0m[31mC[0m[31mC[0m[31mC[0m[31mA[0m[31mA[0m[31mA[0mTACGAT[31mG[0mTTTGGAGTGGTTATTATAC[31mG[0m[31mT[0m[31mC[0m[31mC[0m[31mA[0mATGC[31mC[0mTTTGATATCTGGGGCCAAGGGACAATGGTCACCGTCTCTTCAG
GAGGTGCAGCTGTTGGAGTCTGGGGGAGGCTT[31mG[0mGTACAGCCTGGGGGGTCCCTGAGACTCTCCTGTG[31mC[0mAGCCTCTGGATTCACCTTTAGCAGCTATGCCATGAGCTGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAGTGGGTCTCAG[31mC[0mTATTAGTG[31mG[0mTAGTGGTG[31mG[0mTAGCACATACTACGCAGACTCCGTGAAGGGCCGGTTCACCATCTCCAGAGACAATTCCAAGAACACGCTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCCGTATATTACTGTGCGAAA[31mN[0m[31mN[0m[31mN[0m[31mN[0m[31mN[0m[31mN[0m[31mN[0m[31mN[0m[31mN[0mTACGAT[31mT[0mTTTGGAGTGGTTATTATAC[31mN[0m[31mN[0m[31mN[0m[31mN[0m[31mN[0mATGC[31mT[0mTTTGATATCTGGGGCCAAGGGACAATGGTCACCGTCTCTTCAG
Indices of Differences: [32, 67, 148, 157, 166, 294, 295, 296, 297, 298, 299, 300, 301, 302, 309, 329, 330, 331, 332, 333, 338]
--------------------------------------------------
- Group data by germline, to infer sequences that originate from germline, including providing information on V, D and J annotations
Example Output:
germline_alignment_heavy CAGGTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCAC...
number_of_sequences 16
sequence_heavy GGGAGGGTCCTGCTCACATGGGAAATACTTTCTGAGAGTCCTGGAC...
j_call_heavy IGHJ4*02
v_call_heavy IGHV4-31*03
d_call_heavy IGHD3-10*01
- Annotate sequence with CDRs and FWRs for easy inference of regions of interest
Example Output:
'AGCTCTGAGAGAGGAGCCCAGCCCTGGGATTTTCAGGTGTTTTCATTTGGTGATCAGGACTGAACAGAGAGAACTCACCATGGAGTTTGGGCTGAGCTGGCTTTTTCTTGTGGCTATTTTAAAAGGTGTCCAGTGTGAGGTGCAGCTGTTGGAGTCTGGGGGAGGCTTAGTACAGCCTGGGGGGTCCCTGAGA(cdr1 170-193) CTCTCCTGTGTAGCCTCTGGATTCACCTTTAGCAGCTATGCCATGAGCTGGGTCCGCCAG(cdr2 245-253) GCTCCAGGGAAGGGGCTGGAGTGGGTCTCAGGTATTAGTGCTAGTGGTGCTAGCACATACTACGCAGACTCCGTGAAGGGCCGGTTCACCATCTCCAGAGACAATTCCAAGAACACGCTGTATCTGCAAATGAACAGCCTG(cdr3 362-394) AGAGCCGAGGACACGGCCGTATATTACTGTGCGAAAACCCCCAAATACGATGTTTGGAGTGGTTATTATACGTCCAATGCCTTTGATATCTGGGGCCAAGGGACAATGGTCACCGTCTCTTCAGGGAGTGCATCCGCCCCAACCCTTTTCCCCCTCGTCTCCTGTGAGAATTCCCCGTCGGATACGAGCAGCGTG'
Project Organization
├── LICENSE <- Open-source license if one is chosen
├── README.md <- The top-level README for developers using this project.
├── data
├── notebooks <- Jupyter notebooks. Naming convention is a number (for ordering),
└── src <- Source code for use in this project.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file oas_wrapper-1.1.tar.gz.
File metadata
- Download URL: oas_wrapper-1.1.tar.gz
- Upload date:
- Size: 10.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
89edcd85b7c75a8369891e8e77d402cf78742b992d59450b7b4f652748b0581e
|
|
| MD5 |
47ae0fe33075e69b04ed57771919fb75
|
|
| BLAKE2b-256 |
22807652a31062e07beb8da950ad15710fef601f434c4787099a43673db2d222
|
File details
Details for the file OAS_wrapper-1.1-py3-none-any.whl.
File metadata
- Download URL: OAS_wrapper-1.1-py3-none-any.whl
- Upload date:
- Size: 13.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
87fc4166a1de6decee79fcb3a8436cc4fb3cf727e963a30b804b8e3069f09f92
|
|
| MD5 |
f29962af949613c40aab69e92feebf4c
|
|
| BLAKE2b-256 |
279a82e62df94c82df053eb6b6083defefb4491e8e4821f231eea8d9890c39dd
|