Skip to main content

A very simple DTASelect-Filter.txt parser.

Project description

example workflow example workflow

filterframes

filterframes is a Python package that provides an easy way to parse and manipulate DTASelect filter files using pandas. The package allows you to read DTASelect-filter.txt files, create peptide and protein dataframes, modify the dataframes, and write the modified dataframes back to a new DTASelect-filter.txt file.

Note on dataframe columns:

The column names in the peptide and protein dataframes will correspond to the header lines in the DTASelect-filter.txt files. In order to edit and output a valid DTASelect-filter file you must ensure that the peptide and protein dataframe column names and order are conserved, and that no additional columns are included. Any changes in the columns order or names will be reflected in the output DTASelect-filter.txt file.

Installation

You can install filterframes using pip:

pip install filterframes

You can also install filterframes locally:

git clone https://github.com/pgarrett-scripps/FilterFrames.git
cd filterframes
pip install .

Usage

Here are some basic examples of how to use the package:

Example Python Script:

from filterframes import from_dta_select_filter, to_dta_select_filter

# Read DTASelect-filter.txt file and create peptide and protein dataframes
file_input = r'tests/data/DTASelect-filter_V2_1_12_paser.txt'
header_lines, peptide_df, protein_df, end_lines = from_dta_select_filter(file_input)

# Display the first 5 rows of the peptide and protein dataframes
print("Peptide DataFrame:")
print(peptide_df.head())
print("\nProtein DataFrame:")
print(protein_df.head())

# Modify peptide or protein dataframes as needed (e.g., filtering, normalization, etc.)
# ...

# Write modified peptide and protein dataframes back to a DTASelect-filter.txt file
file_output = r'tests/data/DTASelect-filter_V2_1_12_paser.out.txt'
with open(file_output, 'w') as f:
    output_string_io = to_dta_select_filter(header_lines, peptide_df, protein_df, end_lines)
    f.write(output_string_io.getvalue())

print(f"\nModified DTASelect-filter.txt file saved to {file_output}")

Example Streamlit App:

# app.py
from io import StringIO

import streamlit as st
from filterframes import from_dta_select_filter, to_dta_select_filter

uploaded_filter_file = st.file_uploader("Choose a DTASelect-filter.txt file", type="txt")

if uploaded_filter_file:
    header_lines, peptide_df, protein_df, end_lines = from_dta_select_filter(StringIO(uploaded_filter_file.getvalue().decode('utf-8')))
    
    st.header('Peptide df')
    st.dataframe(peptide_df)
    st.header('Protein df')
    st.dataframe(protein_df)

    # Modify peptide or protein dataframes as needed (e.g., filtering, normalization, etc.)
    # ...

    io = to_dta_select_filter(header_lines, peptide_df, protein_df, end_lines)

    st.download_button(label="Download Filter",
                       data=io.getvalue(),
                       file_name="DTASelect-filter.txt",
                       mime="text/plain")

Functions

The main functions provided by the package are:

from_dta_select_filter(file_input: Union[str, TextIOWrapper, StringIO]) -> Tuple[List[str], pd.DataFrame, pd.DataFrame, List[str]]

Reads a DTASelect-filter.txt file and returns header lines, peptide dataframe, protein dataframe, and end lines.

to_dta_select_filter(header_lines: List[str], peptide_df: pd.DataFrame, protein_df: pd.DataFrame, end_lines: List[str]) -> StringIO

Writes the given header lines, peptide dataframe, protein dataframe, and end lines to a StringIO object in the DTASelect-filter.txt format.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

filterframes-0.2.0.tar.gz (89.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

filterframes-0.2.0-py3-none-any.whl (7.8 kB view details)

Uploaded Python 3

File details

Details for the file filterframes-0.2.0.tar.gz.

File metadata

  • Download URL: filterframes-0.2.0.tar.gz
  • Upload date:
  • Size: 89.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for filterframes-0.2.0.tar.gz
Algorithm Hash digest
SHA256 2ae95b63d851863f9b2679e65a08a8b8fa35b8b33b16dd2906c188e377d79a01
MD5 740250051f561818927fbaa9fa7efa8e
BLAKE2b-256 478a6106b188080185584bc2d7dc48646943c178bc8f8de7154d02cc854a7b1a

See more details on using hashes here.

File details

Details for the file filterframes-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: filterframes-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 7.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for filterframes-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 74b079f03690d3f7b2e76a38899b0f1d70fb800ec7570aba81f376c0fd284933
MD5 3bad2bdf2b9b307e6bac95640de2ef4f
BLAKE2b-256 9f409da5c94ccf6ee5132ccd20aaa7110832d6fbb1dccde3d16c5c2f80daf6c0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page