A automated script for processing and combining Byonic and PD standard output.

These details have not been verified by PyPI

Project description

GlypNirO

A automated script for processing and combining Byonic and PD standard output.

Requirements

Pythons 3.9+ with the following packages installed.

pandas
sequal
uniprotparser
requests
xlrd
openpyxl
scipy

Installation

The script can be installed using the following command. pip install glypniro

Usage

The program can be run from the location of the script using the command glypniro. The following parameters can be used for operating the script within the commandline.

Paremeters	Descriptions
`-i`, `--input-file`	Filepath to an xlsx file describing the experiment.
`-o`, `--output`	Filepath to the output xlsx file for the analysis.
`-s`, `--score-cutoff`	(Optional) Default=200. Cutoff score for filtering of Byonic output
`-t`, `--trust-byonic`	(Optional) Instruct the script to trust glycan position assignment and used them for area under the curve calculation.
`-d`, `--debug`	(Optional) In conjunction to the final output, the script would also create debug files that contain the unique PSM selected for calculation of the data in the final output.
`-p`, `--parse-uniprot`	(Optional) Attempt to parse UniProt accession ID using regular expression and use them as master id.
`-g`, `--get-uniprot`	(Optional) Using the `uniprotparser` module to access and parse protein name from uniprot accession ids of the proteins of those within the dataset.

Input file format

The input file used in the -i parameter should have the following format.

Ex:

condition_id	replicate_id	filename	area_filename
Depleted_Plasma_HCC	1	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Depleted_Plasma_HCC_1.raw_Byonic.xlsx	\data\Depleted_Plasma_HCC_1_MSnSpectrumInfo.txt
Depleted_Plasma_HCC	2	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Depleted_Plasma_HCC_2.raw_Byonic.xlsx	\data\Depleted_Plasma_HCC_2_MSnSpectrumInfo.txt
Depleted_Plasma_HCC	3	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Depleted_Plasma_HCC_3.raw_Byonic.xlsx	\data\Depleted_Plasma_HCC_3_MSnSpectrumInfo.txt
Depleted_Plasma_Nor	1	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Depleted_Plasma_Nor_1.raw_Byonic.xlsx	\data\Depleted_Plasma_Nor_1_MSnSpectrumInfo.txt
Depleted_Plasma_Nor	2	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Depleted_Plasma_Nor_2.raw_Byonic.xlsx	\data\Depleted_Plasma_Nor_2_MSnSpectrumInfo.txt
Depleted_Plasma_Nor	3	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Depleted_Plasma_Nor_3.raw_Byonic.xlsx	\data\Depleted_Plasma_Nor_3_MSnSpectrumInfo.txt
Non_Depleted_Plasma_HCC	1	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Non_Depleted_Plasma_HCC_1.raw_Byonic.xlsx	\data\Non_Depleted_Plasma_HCC_1_MSnSpectrumInfo.txt
Non_Depleted_Plasma_HCC	2	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Non_Depleted_Plasma_HCC_2.raw_Byonic.xlsx	\data\Non_Depleted_Plasma_HCC_2_MSnSpectrumInfo.txt
Non_Depleted_Plasma_HCC	3	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Non_Depleted_Plasma_HCC_3.raw_Byonic.xlsx	\data\Non_Depleted_Plasma_HCC_3_MSnSpectrumInfo.txt
Non_Depleted_Plasma_Nor	1	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Non_Depleted_Plasma_Nor_1.raw_Byonic.xlsx	\data\Non_Depleted_Plasma_Nor_1_MSnSpectrumInfo.txt
Non_Depleted_Plasma_Nor	2	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Non_Depleted_Plasma_Nor_2.raw_Byonic.xlsx	\data\Non_Depleted_Plasma_Nor_2_MSnSpectrumInfo.txt
Non_Depleted_Plasma_Nor	3	\data\NoMutliFuc_Nlink_10_20ppm_05Da_Non_Depleted_Plasma_Nor_3.raw_Byonic.xlsx	\data\Non_Depleted_Plasma_Nor_3_MSnSpectrumInfo.txt

The rows within the input file should not have to follow any particular order however the columns have to contain the necessary content:

condition_id condition label of the experiment
replicate_id replicate label for the experiment
filename filepath to the Byonic .xlsx output file of the experiment.
area_filename filepath to the PD tabulated output file of the experiment.

Output for default execution mode

The output of the script is a single .xlsx file with 4 sheets.

The first one is the output where we calculate the proportion by combining both the glycosylated and unglycosylated glycoform data.
The second one is the output where the proportion of glycosylated was calculated without the unglycosylated data while the unglycosylated data was calculated similar to the first sheet.
The third one is the only the filter of the glycosylated output from the second sheet.
The forth one is the filter for only of the unglycosylated output from the first sheet.

Example

glypniro -i test_experiment.xlsx -o test_output.xlsx -t -g

The above command would instruct the script to use the test_experiment.xlsx file as input file and output as test_output.xlsx. Inclusion of -t would mean that we trust Byonic assignment of glycan position and shall use them for calculation of that specific glycoform AUC within the proteins. Inclusion of -g would instruct the script to connect to the UniProt online database and attempt to parse protein name from the UniProt accession id contain in the protein name within the Byonic file.

glypniro -i test_experiment.xlsx -o test_output.xlsx

The above command would instruct the script to use the test_experiment.xlsx file as input file and output as test_output.xlsx. Without -t optional parameter, we only use the information of what glycans were found but not assigning them any positions. The AUC will only be combined for those PSMs with the same peptide sequence and glycan combination.

Note for using with PeakView SWATH data

The command glypniro-reformat should be used to generate appropriate input for GlypNirO from SWATH and Byonic data.

glypniro-reformat -b byonic.xlsx -p peakview_peptide.xlsx -o description_peakview.xlsx

The command above will generate input files from experiment information and peptides information from GlypNirO and SWATH and create a description file that can be directly input into the main GlypNirO script. For optimal result, the SWATH library used for the PeakView should be constructed from Byonic identification data.

After that, the glypniro command can be used to process the data with the additional argument -s 0 to instruct the script to ignore the Byonic score cutoff since the combining process does not parse Byonic score and only substitute with value 1.

Note for using with Skyline data

The command glypniro-reformats should be used to generate appropriate input for GlypNirO from Skyline and Byonic data.

glypniro-reformats -b byonic.xlsx -s skyline.csv -o description_skyline.xlsx

The command above will generate input files from experiment information and peptides information from GlypNirO and Skyline and create a description file that can be directly input into the main GlypNirO script.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.8

Jan 11, 2024

0.1.7

Sep 11, 2023

0.1.6

Sep 8, 2023

0.1.5

Jul 27, 2023

0.1.4

Jul 27, 2023

0.1.3

Jul 27, 2023

0.1.2

Jul 27, 2023

0.1.1

Jul 24, 2023

0.1.0

Jul 21, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glypniro-0.1.8.tar.gz (17.4 kB view details)

Uploaded Jan 11, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

glypniro-0.1.8-py3-none-any.whl (19.2 kB view details)

Uploaded Jan 11, 2024 Python 3

File details

Details for the file glypniro-0.1.8.tar.gz.

File metadata

Download URL: glypniro-0.1.8.tar.gz
Upload date: Jan 11, 2024
Size: 17.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.7.1 CPython/3.12.1 Windows/11

File hashes

Hashes for glypniro-0.1.8.tar.gz
Algorithm	Hash digest
SHA256	`31e899ea0a4b152c61065e770dbd63dbce4ecad3f9bf9adf2db90d7643da87b5`
MD5	`7935d7617eaeecba489db29687e80288`
BLAKE2b-256	`833906a94b340ceeadab903a226b3f5d3a101d420c45041e21507cc287642bfc`

See more details on using hashes here.

File details

Details for the file glypniro-0.1.8-py3-none-any.whl.

File metadata

Download URL: glypniro-0.1.8-py3-none-any.whl
Upload date: Jan 11, 2024
Size: 19.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.7.1 CPython/3.12.1 Windows/11

File hashes

Hashes for glypniro-0.1.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1cce3c201968906cbb5e84b3ac04e553a1789c47a4d22e47116e339f272b29b6`
MD5	`2c8d26dc970eb4d04c13312442eb38e6`
BLAKE2b-256	`ffcebe352438f56e71e9555e6e6c53eac6dc8c790094e734e5c2d7397addee1c`

See more details on using hashes here.

glypniro 0.1.8

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

GlypNirO

Requirements

Installation

Usage

Input file format

Output for default execution mode

Example

Note for using with PeakView SWATH data

Note for using with Skyline data

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes