Skip to main content

A library of universal parsers for microplate data

Project description

platechain

release

Parsing microplate data is an extremely common task in the life sciences. This package provides a simple, universal interface for parsing all different types of microplate data into a tidy format.

Note: this package is still early in development and may not work with all machines and data formats. If you are having trouble with a specific usecase, please reach out

Platechain

Motivation

One of the core problems of dealing with plate based data is that every manufacturer and machine has a different output format. Rather than build hundreds of separate parsers, we leverage the latest advancements in LLMs to build a single parser that can handle many different formats.

Raw Spark data Parsed dataframe

Platechain: From raw data output (left) to a parsed, structured format (right).

Getting started

pip install platechain

Usage

import pandas as pd
from platechain import parse_plates

# Load your data
df = pd.read_csv('examples/byonoy_absolute_example.csv')

# Parse the plates
plate_dfs = parse_plates(df)
# Output: [[96 rows x 4 columns], [96 rows x 4 columns]]

See the notebooks directory for more examples.

Implementation

The core logic is implemented in chain.py. This file contains a single function, parse_plates, which takes a dataframe and returns a list of dataframes, one for each plate.

It also exposes a LangChain chain using LCEL which can be modified to support your specific usecase.

Limitations

  • Currently only supports 24, 96, and 384 well plates. 1536 well plates are coming soon!
  • Only handles plate data that is visually laid out in a grid. If your plate data is already in a table (e.g. a timeseries), this package will not parse it correctly.

Contributing

We welcome contributions! If you'd like to contribute, please check out our contribution guidelines.

License

This project is licensed under the terms of the Apache 2.0 license.

Acknowledgements

This is a Sphinx Bio project! If you're interested in a hosted solution for your lab, please reach out.

Thanks to Harrison and the Langchain team for their help as well!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

platechain-0.0.5.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

platechain-0.0.5-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file platechain-0.0.5.tar.gz.

File metadata

  • Download URL: platechain-0.0.5.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for platechain-0.0.5.tar.gz
Algorithm Hash digest
SHA256 1703a7a34129be77335c3f7882027811c589efab52d19074c14d6a6ee54c2fbc
MD5 a396c37464866dae3119bd6abae8d437
BLAKE2b-256 45d65ba2bca2c8652ca71f93e0d8db08a8d1aed94d8aa1c76fb9aa94fc23bef7

See more details on using hashes here.

File details

Details for the file platechain-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: platechain-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for platechain-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 7b6479e791d5733125e46033b847d4fc638daa0c8997862d1646605b52701815
MD5 8ba7e825051eb8596e50ebdf25f05b39
BLAKE2b-256 007a28ffbc512bf12002a381ce323afce73514c0c77763d8d1f6fd38c84f058e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page