py-wsi

Python package for dealing with whole slide images (.svs) for machine learning, including intuitive, painless patch sampling using OpenSlide, automatic labeling from ImageScope XML annotation files, and functions for saving these patches and their meta data into lightning memory-mapped databases (LMDB) for quick reads.

These details have not been verified by PyPI

Project links

Homepage

Project description

Current version

Notice: it is strongly recommended to use py-wsi version >= 1.0.

The current update to py_wsi has added three major improvements which are essential for dealing with very large datasets of .svs images:

better memory management
error handling
functionality to allow for sampling test patches before sampling from all images

See this blog post py_wsi for computer analysis on whole slide .svs images using OpenSlide for help on understanding the relationship between patch and tile sampling. The test patch sampling functionality in this version will also help users to know exactly what they are sampling.

For any early users who have downloaded previous versions of py_wsi (< 1.0) I would strongly suggest downloading the update. Please feel free to submit any issues to the GitHub repository and I will provide help as I am able to.

While suggestions for extra/additional functionality will not be immediately considered, pull requests are welcome.

Introduction to py_wsi

py-wsi provides a series of Python classes and functions which deal with databases of whole slide images (WSI), or Aperio .svs files for machine learning, using Python OpenSlide. py-wsi provides functions to perform patch sampling from .svs files, generation of metadata, and several store options: saving to a lightning memory-mapped database (LMDB), HDF5 files, or disk.

These Python functions deal with whole slide images (WSI), or Aperio .svs files for deep learning, using OpenSlide. py-wsi provides functions to perform patch sampling from .svs files, generation of metadata, and several store options: saving to a lightning memory-mapped database (LMDB), HDF5 files, or disk.

Lim et al. in “An analysis of image storage systems for scalable training of deep neural networks” perform a thorough evaluation of the best image storage systems, taking into consideration memory usage and access speed. LMDB, a B+tree based key-value storage, is not the most memory efficient, but provides optimal read time.

py-wsi uses OpenSlide Python. According to the Python OpenSlide website, “OpenSlide is a C library that provides a simple interface for reading whole-slide images, also known as virtual slides, which are high-resolution images used in digital pathology. These images can occupy tens of gigabytes when uncompressed, and so cannot be easily read using standard tools or libraries, which are designed for images that can be comfortably uncompressed into RAM. Whole-slide images are typically multi-resolution; OpenSlide allows reading a small amount of image data at the resolution closest to a desired zoom level.”

Note: HDF5 functionality will not be available until version 1.2

Check Jupyter Notebook on GitHub to view example usage:Example usage of py-wsi

Setup

This library is dependent on the following, but may be compatible with previous versions.

python 3.6.1 numpy 1.12.1 openslide-python 1.1.1

Check dependencies listed in setup.py; notably, openslide-python which requires openslide, and lmdb. The python geometry package Shapely is used for inferring labels from XML annotations.

brew install openslide

Install py_wsi using pip.

pip install py_wsi

Check out Jupyter Notebook “Using py-wsi” to see what py-wsi can do and get started!

Feel free to contact me with any issues and feedback.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

1.1

May 22, 2018

1.0

May 22, 2018

0.21

Mar 1, 2018

0.2

Mar 1, 2018

0.1

Feb 28, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_wsi-1.1.tar.gz (12.2 kB view details)

Uploaded May 22, 2018 Source

File details

Details for the file py_wsi-1.1.tar.gz.

File metadata

Download URL: py_wsi-1.1.tar.gz
Upload date: May 22, 2018
Size: 12.2 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for py_wsi-1.1.tar.gz
Algorithm	Hash digest
SHA256	`d4cc4cc0e7e9d3d1f1ea09d589823be89e8d7f1844e4ba07aaf68d1f31dd4430`
MD5	`4f97f6fd56719090a944b8727d2c2de7`
BLAKE2b-256	`32f4a4f692b8d7791df38874e678e28c60783284ca4a5fe4f083214f45e4ede2`

See more details on using hashes here.

py-wsi 1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Current version

Introduction to py_wsi

Setup

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes