
LaminDB: Data lakes for biology

LaminDB is an API layer on top of your existing infrastructure for managing your data & analyses.

Public beta: Currently only recommended for collaborators, as we still make breaking changes.

Update 2023-06-05: We completed a major migration from SQLAlchemy/SQLModel to Django, available in pre-releases of v0.42.

Features

Free:

  • Track data lineage across notebooks, pipelines & apps.
  • Manage biological registries, ontologies & features.
  • Persist, load & stream data objects with a single line of code.
  • Query for anything & everything.
  • Define & manage your own schemas (assays, instruments, etc.).
  • Manage data on your laptop, on your server or in your cloud infra.
  • Use a mesh of distributed LaminDB instances for different teams and purposes.
  • Share instances through a Hub akin to GitHub.

Enterprise plan:

  • Explore & share data, submit samples & track lineage with LaminApp (deployable in your infra).
  • Receive support & services for a BioTech data & analytics platform.

How does it work?

LaminDB builds the semantics of R&D and biology on top of well-established tools:

  • SQLite & Postgres for SQL databases
  • S3, GCP & local storage for object storage
  • Django ORM (previously SQLAlchemy/SQLModel)
  • Configurable storage formats: pyarrow, anndata, zarr, etc.
  • Biological knowledge resources & ontologies: see Bionty
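
For example, an instance can combine cloud object storage with a Postgres database. A minimal sketch, assuming the CLI's --db flag; the bucket name & connection string are placeholders:

lamin init --storage s3://my-bucket --db postgresql://user:password@hostname:5432/dbname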

Most of LaminDB is open source.

Installation

pip install lamindb  # basic data lake
pip install 'lamindb[bionty]'  # biological entities
pip install 'lamindb[nbproject]'  # Jupyter notebook tracking
pip install 'lamindb[aws]'  # AWS dependencies (s3fs, etc.)
pip install 'lamindb[gcp]'  # GCP dependencies (gcsfs, etc.)

Quick setup

Why do I have to sign up?

  • Data lineage requires a user identity (who modified which data when?).
  • Collaboration requires a user identity (who shares this with me?).

Signing up takes 1 min.

We do not store any of your data, only basic metadata about you (email address, etc.) & your instances (S3 bucket names, etc.).

  • Sign up via lamin signup <email>.
  • Log in via lamin login <handle>.
  • Init an instance via lamin init --storage <storage>.
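
A minimal end-to-end example; the email, handle & bucket below are hypothetical placeholders:

lamin signup jane@acme.com
lamin login jane
lamin init --storage s3://acme-data-lake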

Usage overview

Track & query data lineage

import lamindb as ln

ln.track()  # auto-detect a notebook & register it as a Transform
ln.File("my_artifact.parquet").save()  # link Transform & Run objects to File object

Now, you can query, e.g., for

ln.File.select(created_by__handle="user1").df()   # a DataFrame of all files ingested by user1
ln.File.select().order_by("-updated_at").first()   # latest updated file

Or for

transforms = ln.Transform.select(  # all notebooks with 'T cell' in the title created in 2022
    name__contains="T cell", type="notebook", created_at__year=2022
).all()
ln.File.select(transform=transforms[1]).all()  # files ingested by the second notebook in transforms

Or, if you'd like to track a run of a registered pipeline (here, "Cell Ranger"):

transform = ln.Transform.select(name="Cell Ranger", version="0.7.1").one()  # select a pipeline from the registry
ln.track(transform)  # create a new global run context
ln.File("s3://my_samples01/my_artifact.fastq.gz").save()  # link file against run & transform

Now, you can query, e.g., for

runs = ln.select(ln.Run, transform__name="Cell Ranger").order_by("-created_at").df()  # DataFrame of Cell Ranger runs, latest first
# query files by selected runs, etc.

Persist & load data objects

import pandas as pd

df = pd.DataFrame({"a": [1, 2], "b": [3, 4]})

ln.File(df, name="My dataframe").save()

Get it back:

file = ln.select(ln.File, name="My dataframe").one()  # query for it
df = file.load()  # load it into memory
   a  b
0  1  3
1  2  4

Manage biological registries

lamin init --storage ./myobjects --schema bionty

...

Track biological features

...

Track biological samples

...

Manage custom schemas

  1. Create a GitHub repository with Django ORMs similar to github.com/laminlabs/lnschema-lamin1
  2. Create & deploy migrations via lamin migrate create and lamin migrate deploy

It's fastest if we do this for you based on our templates (within an enterprise plan), but you can fully manage the process yourself.
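
As a sketch of what such a schema package might contain: each registry is a Django model, so a custom entity boils down to a model class. The Treatment class & its fields below are hypothetical, and a real lnschema package would subclass lnschema-core's base ORM rather than a plain models.Model:

from django.db import models


class Treatment(models.Model):
    """A hypothetical registry for sample treatments."""

    name = models.CharField(max_length=255, unique=True)
    dose_nm = models.FloatField(null=True)  # dose in nanomolar (illustrative)
    created_at = models.DateTimeField(auto_now_add=True)

After lamin migrate create & lamin migrate deploy, records in such a registry are queryable like any core ORM.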

Notebooks

  • Find all guide notebooks here.
  • You can run these notebooks in hosted versions of JupyterLab (e.g., Saturn Cloud, Google Vertex AI) or on Google Colab.
  • JupyterLab & Jupyter Notebook offer a fully interactive experience; VS Code & other editors require using the CLI (lamin track my-notebook.ipynb).

Architecture

LaminDB consists of the lamindb Python package, which builds on a number of open-source packages developed by Lamin:

  • bionty: Biological entities (usable standalone)
  • lamindb-setup: Set up & configure LaminDB; client for Lamin Hub.
  • lnschema-core: Core schema, containing the core ORMs.
  • lnschema-bionty: Bionty schema, containing ORMs that are coupled to Bionty's biological entities.
  • lnschema-lamin1: Exemplary configured schema to track samples, treatments, etc.
  • nbproject: Parse metadata from Jupyter notebooks.

LaminHub & LaminApp are not open-sourced, and neither are the templates for modeling lab operations.

Documentation

Read the docs.
