Parallel nested-pandas with Dask.
Project description
nested-dask
ARCHIVE NOTICE: This repository is no longer being maintained. The original purpose of this package was to enable dask with the extended nested-pandas API, with the specific aim of using this to support the LSDB project. In April 2025, the contents of this package were migrated directly into LSDB to allow for tailored behavior to LSDB's specific operational needs. As a result, any needed changes to dask-compatibility for nested-pandas are happening directly within LSDB and not in this repository. Further, maintaining a generalized dask layer for nested-pandas is not directly within the critical path for LINCC-Frameworks effort at this time. If you found your way here and wished there was a maintained package that provided a dask-layer for nested-pandas for something you are working on, please feel free to voice that as an issue filed to nested-pandas.
A dask extension of nested-pandas.
Nested-pandas is a pandas extension package that empowers efficient analysis of nested associated datasets. This package wraps the majority of the nested-pandas API with Dask, which enables easy parallelization and capacity for work at scale.
Dev Guide - Getting Started
Before installing any dependencies or writing code, it's a great idea to create a
virtual environment. LINCC-Frameworks engineers primarily use conda to manage virtual
environments. If you have conda installed locally, you can run the following to
create and activate a new environment.
>> conda create env -n <env_name> python=3.10
>> conda activate <env_name>
Once you have created a new environment, you can install this project for local development using the following commands:
>> pip install -e .'[dev]'
>> pre-commit install
>> conda install pandoc
Notes:
- The single quotes around
'[dev]'may not be required for your operating system. pre-commit installwill initialize pre-commit for this local repository, so that a set of tests will be run prior to completing a local commit. For more information, see the Python Project Template documentation on pre-commit- Install
pandocallows you to verify that automatic rendering of Jupyter notebooks into documentation for ReadTheDocs works as expected. For more information, see the Python Project Template documentation on Sphinx and Python Notebooks
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nested_dask-0.3.5.tar.gz.
File metadata
- Download URL: nested_dask-0.3.5.tar.gz
- Upload date:
- Size: 51.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3b512b63d135b928b37863cfd0b71d53aa5c808cb65552b60ea0647aa2abcf98
|
|
| MD5 |
624004b7550454059e9fabd76802011f
|
|
| BLAKE2b-256 |
62fc74b66b63f4f727da3d5de8771ef3b48850ed4b05a0fb54e3fe0434a6a11a
|
Provenance
The following attestation bundles were made for nested_dask-0.3.5.tar.gz:
Publisher:
publish-to-pypi.yml on lincc-frameworks/nested-dask
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
nested_dask-0.3.5.tar.gz -
Subject digest:
3b512b63d135b928b37863cfd0b71d53aa5c808cb65552b60ea0647aa2abcf98 - Sigstore transparency entry: 734451932
- Sigstore integration time:
-
Permalink:
lincc-frameworks/nested-dask@b15e9f081b3def47ff02c45b04f277abd1223be1 -
Branch / Tag:
refs/tags/v0.3.5 - Owner: https://github.com/lincc-frameworks
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-to-pypi.yml@b15e9f081b3def47ff02c45b04f277abd1223be1 -
Trigger Event:
release
-
Statement type:
File details
Details for the file nested_dask-0.3.5-py3-none-any.whl.
File metadata
- Download URL: nested_dask-0.3.5-py3-none-any.whl
- Upload date:
- Size: 24.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0654cb2e3687bbf5e1a5a62a87a8b3700461a47163554bbd82519aea1ead5283
|
|
| MD5 |
71049bb9d8c63b6b051029a062227cf6
|
|
| BLAKE2b-256 |
5a82877e0fe71d3b2167b1100f993d4921ecff9d40cf7d795ae11a78e6a43aa0
|
Provenance
The following attestation bundles were made for nested_dask-0.3.5-py3-none-any.whl:
Publisher:
publish-to-pypi.yml on lincc-frameworks/nested-dask
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
nested_dask-0.3.5-py3-none-any.whl -
Subject digest:
0654cb2e3687bbf5e1a5a62a87a8b3700461a47163554bbd82519aea1ead5283 - Sigstore transparency entry: 734452005
- Sigstore integration time:
-
Permalink:
lincc-frameworks/nested-dask@b15e9f081b3def47ff02c45b04f277abd1223be1 -
Branch / Tag:
refs/tags/v0.3.5 - Owner: https://github.com/lincc-frameworks
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-to-pypi.yml@b15e9f081b3def47ff02c45b04f277abd1223be1 -
Trigger Event:
release
-
Statement type: