PreKO: Precise KO system
Project description
PreKO: Precise KO system
Analysis pipeline for PreKO project
Installation
PreKO에 대한 분석 프로그램은 아래와 같이 설치할 수 있습니다.
pip install PreKO
InDelSearcher: Cas9 nuclease indel analyzer
InDelSearcher는 target sequence에서 indel frequency를 분석하고 계산해주는 파이프라인이다. 특히, high-throughput screening 데이터에서 barcode에 따른 indel frequency를 분석하는 것에 특화되어 있다.
분석을 위해서, 아래와 같이 barcode와 target sequence 정보가 담긴 csv 파일이 필요하다.
| Barcode | Target_region | Reference_sequence |
|---|---|---|
| TTTGCTGTGAGCACTGCTG | TTGTGAACATAGATCCATTTTTCTTGG | CTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTNNNNNNNNTTTGCTGTGAGCACTGCTGT |
| TTTGGACGTCATAGTGAGA | TCCAGATAGTCATCAACTTTTTGTTGG | CTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTNNNNNNNNTTTGGACGTCATAGTGAGAT |
| TTTGGCTATCTGCACGTGC | GTGGGGGGCCTGGGGCCTGGAGCCTGG | CTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTNNNNNNNNTTTGGCTATCTGCACGTGCG |
| TTTGATGCGCATCTCTACG | CCCAGGCAAAACTGCAGTTTTACCTGG | CTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTNNNNNNNNTTTGATGCGCATCTCTACGC |
| TTTGACTCGAGTCTCTCAC | ACGAGGTGGCCCTGGGGGGCCCCCTGG | CTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTNNNNNNNNTTTGACTCGAGTCTCTCACA |
barcode 파일과 분석할 FASTQ 파일이 있다면, InDelSearcher를 이용한 분석을 할 수 있다.
import pandas as pd
from PreKO.indel import InDelSearcher
# Setting: required information
DIR_FASTQ = f'NGS_data/Cas9_FASTQ_combined/'
barcode = f'ref_Small_Cas9_KO_Lib.csv'
sample_name = f'HCT_Cas9_R1_Day7'
fastq_file = f'{DIR_FASTQ}/{sample_name}.fastq'
ids = InDelSearcher()
# Run and show summary
df_summary = ids.run(strFq=fastq_file, barcode=barcode, sample_name=sample_name, thread=25)
df_summary.to_csv(f'InDel_Summary_{sample_name}.csv')
InDelSearcher.run의 output (pd.DataFrame)은 아래와 같이 나온다.
| Barcode | Target_region | intSwitching | intNumOfTotal | intNumOfIns | intNumOfDel | intNumofCom | intFrame_0 | intFrame_1 | intFrame_2 | IndelFrequency |
|---|---|---|---|---|---|---|---|---|---|---|
| ATGTGATCATGC | CAGGAAAAAATATGTGCTATGGAGGGG | 8 | 17 | 1 | 7 | 0 | 9 | 2 | 6 | 47.05882 |
| GACTCTCGTCGA | TTTTAGAGAATATTCACCGTGTCACGG | 8 | 18 | 0 | 12 | 1 | 6 | 9 | 3 | 72.22222 |
| GACTGATACTGT | ACAAAGTCAACTGCCTTCAAACAAGGG | 13 | 22 | 1 | 14 | 0 | 8 | 2 | 12 | 68.18182 |
| GCTGCGCGCACT | CTTCTATAACAAGAAATCTGATGTGGG | 11 | 25 | 2 | 18 | 1 | 13 | 4 | 8 | 84 |
| TCGCTGTGACTC | CCGCGCCGCGCGTTACCTTCCGCGGGG | 6 | 11 | 0 | 11 | 0 | 2 | 0 | 9 | 100 |
이를 to_csv() 등의 함수로 저장해서 이후 분석에 사용한다.
Environments
These codes were tested in Ubuntu 22.04 LTS environments.
Requirements
- Python >= 3.8
- biopython
- pandas
- numpy
- pydantic
- tqdm
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file preko-0.3.1.tar.gz.
File metadata
- Download URL: preko-0.3.1.tar.gz
- Upload date:
- Size: 324.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.28.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0a4493243597adb8e69a5cb4e47d33ff31a8d31588258d7b560ad1a7b331af66
|
|
| MD5 |
8442d54561469f8f7d2e296223f8c98a
|
|
| BLAKE2b-256 |
73779c3fe24136aee7204b5121e7fec52c2f8a177162d54750f56a8647673ec0
|
File details
Details for the file preko-0.3.1-py3-none-any.whl.
File metadata
- Download URL: preko-0.3.1-py3-none-any.whl
- Upload date:
- Size: 26.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.28.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
23c060456a096e68603430f31e6cc144e96a656be3c63295c1a3e552623729a0
|
|
| MD5 |
5a38792c32d4622f794f2074fe62f8e3
|
|
| BLAKE2b-256 |
821e820973c75910496fe4bbbf9d3b69753781325a75b3400d16e366bc247d4e
|