A pacakge which provides various ways to analyze NGS data from phage display campaigns

Project description

Welcome to ExpoSeq

ExpoSeq is a powerful pipeline for processing and analyzing FASTQ files from sequencing phage Display panning samples. It utilizes MiXCR to align and assemble the data which you can subsequently analyze in multiple plots. The pipeline focuses on analysing the identity between samples but also applies various clustering techniques to analyse the relation between the sequences. Besides, you can add binding data to relate the clusters to affinity. overview

Installation

Open a virtual environment and type pip install ExpoSeq. Ensure that you have python > 3.11 installed.

To get started, please download and follow the instructions for MiXCR under the following link: https://docs.milaboratories.com/mixcr/getting-started/installation/ You can also only use the test version of ExpoSeq without installing it.

Importing the Plotting Tool

To access the plotting tool, you will need to import it into your console by running the following command:

from ExpoSeq.pipeline import PlotManager

Using the PlotManager

The PlotManager is the main interface for creating various plots using your FASTQ data. You can create an instance of the PlotManager by running the following command:

plot = PlotManager()

To use the PlotManager to create plots, you will need to upload your FASTQ data to the pipeline. This will automatically happen as soon as you have called the PlotManager. In the following you can obtain an insight in the worklow of the pipeline after the initial call. There, the blue boxes indicate your input, gray are optional inputs while black and red are processing steps and output, respectively.
relative_path_to_image
If you just want to test the pipeline and see its functions you can call:

plot = PlotManager(test_version = True)

Alternatively you can take a look in the Jupyter script

Once you have called the test version or have finished the data processing, you can use the PlotManager to create a variety of plots, such as an identity plot based on the jaccard similarity. Here is an example of how to create this type of plot:

plot.jaccard()

If you want to change the style of the plot you can use the PlotManager. If you called it plot you can do for instance the following:

plot.style.title_xaxis("your_title")

If you want to implement further plot change you can also refer to the matplotlib.pyplot library and change it in the same way as following:

import matplotlib.pyplot as plt

plt.xlabel("your_title")

If you would like to have details about the inputs and functions of the PlotManager call:

help(plot)

You can also call

help(plot.jaccard)

Upload binding data

If you have conducted DELFIA or other techniques to receive binding data for certain sequences (usually sanger sequenced), you can upload these in a certain format and use these for clustering to potentially find other suitable sequences with high binding. You need to import the data as csv file where the first column starts in the first row with the header: aaSeqCDR3 which are the sequences. It is very important to keep the header at this position. In the second column you can put the binding data for your epitope which you can name in the first row however you prefer. You can have a look in this csv file to see the general structure of the file. Moreover, you can download it and import it in Excel. Therefore, open Excel and choose under "Data" in the Excel header "From Text/CSV". Then make sure to delete the first column which contains the row number. After that you can delete the random data in that excel sheet and add your own. Finally you can export the data as a csv and import it with the pipeline either in the initial uploading process which will be prompted or with the command

plot.add_binding_data()

Note: If you decide to add more binding data to your analysis you can just use the same command and choose the new file with the filechooser and it will be added to the existing data.

References

[1] Dmitriy A. Bolotin, Stanislav Poslavsky, Igor Mitrophanov, Mikhail Shugay, Ilgar Z. Mamedov, Ekaterina V. Putintseva, and Dmitriy M. Chudakov. "MiXCR: software for comprehensive adaptive immunity profiling." Nature methods 12, no. 5 (2015): 380-381.

[2] Dmitriy A. Bolotin, Stanislav Poslavsky, Alexey N. Davydov, Felix E. Frenkel, Lorenzo Fanchi, Olga I. Zolotareva, Saskia Hemmers, Ekaterina V. Putintseva, Anna S. Obraztsova, Mikhail Shugay, Ravshan I. Ataullakhanov, Alexander Y. Rudensky, Ton N. Schumacher & Dmitriy M. Chudakov. "Antigen receptor repertoire profiling from RNA-seq data." Nature Biotechnology 35, 908â€“911 (2017)

[3] (1, 2) Tareen A, Kinney JB (2019) Logomaker: beautiful sequence logos in Python. Bioinformatics btz921. bioRxiv doi:10.1101/635029.

[4] M.A. Larkin and others, Clustal W and Clustal X version 2.0, Bioinformatics, Volume 23, Issue 21, November 2007, Pages 2947â€“2948, https://doi.org/10.1093/bioinformatics/btm404

Project details

Release history Release notifications | RSS feed

4.6.1

Jul 22, 2024

4.6.0

Jul 17, 2024

4.5.3

Jul 4, 2024

4.5.2

Jul 4, 2024

4.5.0

Mar 16, 2024

4.4.5

Feb 25, 2024

4.4.4

Feb 23, 2024

4.4.3

Feb 23, 2024

4.4.2

Feb 20, 2024

4.4.1

Feb 20, 2024

4.4.0

Feb 20, 2024

4.3.6

Feb 20, 2024

4.3.5

Jan 7, 2024

4.3.4

Jan 3, 2024

4.3.3

Dec 30, 2023

4.3.2

Dec 28, 2023

4.3.1

Dec 28, 2023

4.3.0

Nov 10, 2023

4.2.2

Oct 24, 2023

4.2.1

Oct 21, 2023

4.2.0

Oct 20, 2023

4.1.3

Oct 19, 2023

4.1.2

Oct 18, 2023

4.1.1

Oct 17, 2023

4.1.0

Oct 17, 2023

4.0.7

Oct 14, 2023

4.0.6

Oct 8, 2023

4.0.5

Oct 8, 2023

4.0.4

Oct 8, 2023

4.0.3

Oct 8, 2023

4.0.1

Oct 3, 2023

4.0

Oct 1, 2023

3.1.5

Oct 1, 2023

3.1.4

Sep 30, 2023

3.1.3

Sep 30, 2023

3.1.2

Sep 27, 2023

3.1.1

Sep 26, 2023

This version

3.1.0

Sep 1, 2023

3.0.7

Sep 1, 2023

3.0.6

Sep 1, 2023

3.0.5

Aug 21, 2023

3.0.4

Aug 21, 2023

3.0.3

Aug 21, 2023

3.0.2

Aug 21, 2023

3.0.1

Aug 21, 2023

2.0.3

Aug 1, 2023

2.0.2

Jul 29, 2023

2.0.1

Jul 29, 2023

2.0.0

Jul 22, 2023

1.1.19

Jul 22, 2023

1.1.18

Jul 22, 2023

1.1.17

Jul 16, 2023

1.1.16

Jul 15, 2023

1.1.14

Jul 15, 2023

1.1.13

Jul 3, 2023

1.1.12

Jul 3, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ExpoSeq-3.1.0.tar.gz (7.2 MB view details)

Uploaded Sep 1, 2023 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ExpoSeq-3.1.0-py3-none-any.whl (7.3 MB view details)

Uploaded Sep 1, 2023 Python 3

File details

Details for the file ExpoSeq-3.1.0.tar.gz.

File metadata

Download URL: ExpoSeq-3.1.0.tar.gz
Upload date: Sep 1, 2023
Size: 7.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for ExpoSeq-3.1.0.tar.gz
Algorithm	Hash digest
SHA256	`bb8873130bb1df436fff94c1ce7f29213b89de338d8c36a744d97c60d3a80a81`
MD5	`695ffc6d228b538f2ad17beee9f81469`
BLAKE2b-256	`2c59156a0aca9624addd10bae1f2d16ad3738eb04d65c8ef6c71c8779a32e9ff`

See more details on using hashes here.

File details

Details for the file ExpoSeq-3.1.0-py3-none-any.whl.

File metadata

Download URL: ExpoSeq-3.1.0-py3-none-any.whl
Upload date: Sep 1, 2023
Size: 7.3 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for ExpoSeq-3.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`18055252f70e7dccedd11274d8a3c5da780f0b62fcea7015c9fd24825e7e3fb5`
MD5	`284a8dd2fd90c57ddc6aea0d8fe2490f`
BLAKE2b-256	`d5f34e3d192bf0591630cc116e10413fe737aa90b20c999471102c8cb7ab67b6`

See more details on using hashes here.

ExpoSeq 3.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Welcome to ExpoSeq

Installation

Importing the Plotting Tool

Using the PlotManager

Upload binding data

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes