torch-adata
Create PyTorch Datasets from AnnData
Installation
Install from PyPI (current version: 0.0.23):
pip install torch-adata
Install the developer version:
git clone https://github.com/mvinyard/torch-adata.git; cd torch-adata;
pip install -e .
The main API
The primary class is AnnDataset, a subclass of the widely used torch.utils.data.Dataset. Building on the PyTorch Dataset class lets us take advantage of built-in multiprocessing (through torch.utils.data.DataLoader) and other organizational conveniences that standardize workflows and support reproducibility.
import anndata as a
import torch_adata
adata = a.read_h5ad("/path/to/data.h5ad")
dataset = torch_adata.AnnDataset(adata, use_key="X_pca", groupby="time", obs_keys=["affinity"])
[ torch-adata ]: AnnDataset object with 7131 samples
----------------------------------------------------
Grouped by: 'time' with attributes:
- X (use_key = 'X_pca') torch.Size([3, 7131, 50])
- obs: affinity: torch.Size([3, 7131, 1])
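To illustrate what being a torch.utils.data.Dataset subclass buys you, here is a minimal sketch using a hypothetical stand-in class, ToyDataset, filled with random tensors. It is not the torch-adata implementation and makes no claim about what AnnDataset returns per item; the class name, shapes, and batch size are assumptions chosen for illustration. It only shows the general pattern: any object that defines __len__ and __getitem__ can be handed to a DataLoader for batching and shuffling.

```python
import torch
from torch.utils.data import DataLoader, Dataset


class ToyDataset(Dataset):
    """Hypothetical stand-in for illustration; NOT the torch-adata AnnDataset."""

    def __init__(self, X: torch.Tensor, obs: torch.Tensor):
        if X.shape[0] != obs.shape[0]:
            raise ValueError("X and obs must have the same number of samples")
        self.X = X      # feature matrix, e.g. a PCA embedding: [n_samples, n_dims]
        self.obs = obs  # one per-sample annotation column: [n_samples, 1]

    def __len__(self) -> int:
        return self.X.shape[0]

    def __getitem__(self, idx):
        # Return the features and the annotation for a single sample.
        return self.X[idx], self.obs[idx]


torch.manual_seed(0)
X = torch.randn(100, 50)   # assumed toy data: 100 samples, 50 dimensions
obs = torch.rand(100, 1)   # assumed toy annotation

dataset = ToyDataset(X, obs)

# DataLoader handles batching and shuffling; num_workers > 0 would enable
# multiprocess loading (left at the default of 0 here).
loader = DataLoader(dataset, batch_size=32, shuffle=True)

X_batch, obs_batch = next(iter(loader))
print(X_batch.shape, obs_batch.shape)
```

Because the loader only relies on the Dataset interface, the same DataLoader call should in principle accept any Dataset subclass; check the torch-adata documentation for the exact item structure AnnDataset yields.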
An alternative approach is AnnLoader, highlighted by Sergei Rybakov in "Interfacing pytorch models with anndata".
For more information, please visit the documentation!
Problem? Open an issue