30 projects
dask
Parallel PyData with Task Scheduling
distributed
Distributed scheduler for Dask
fastparquet
Python support for Parquet file format
gcsfs
Convenient Filesystem interface over GCS
s3fs
Convenient Filesystem interface over S3
fsspec
File-system specification
kbatch-proxy
Proxy batch job requests to kubernetes
kbatch
Submit batch jobs to Kubernetes
pandas
Powerful data structures for data analysis, time series, and statistics
dask-kubernetes
Native Kubernetes integration for Dask
adlfs
Access Azure Datalake Gen1 with fsspec and dask
stac-geoparquet
None
partd
Appendable key-value storage
dask-ml
A library for distributed and parallel machine learning
dask-glm
Generalized Linear Models with Dask
xstac
xstac
rechunker
A library for rechunking arrays
stac-table
Generate STAC Collections for tabular datasets.
dask-xgboost
Interactions between Dask and XGBoost
stac_vrt
Quickly build a GDAL VRT from a STAC Item Collection.
jupyterhub_mlflow_auth
Tornado-based proxy server for MLFlow and JupyterHub.
papermill-mlflow-handler
MLFlow handler for papermill.
mlflow_nbconvert
mlflow-nbconvert
cachey
Caching mindful of computation/storage costs
cyberpandas
IP Address type for pandas
dask-tensorflow
Interactions between Dask and Tensorflow
engarde
A python package for defensive data analysis.
knotr
Reproducible report generation tool.
dsadd
A python package for defensive data analysis.
python-cps
A python package for working with the[Current Population Survey](http://www.census.gov/cps/).