Skip to main content

High Level Expressions for Dask

Project description

Dask Expressions

Dask DataFrames with query optimization.

This is a rewrite of Dask DataFrame that includes query optimization and generally improved organization.

More in our blog posts:

Example

import dask_expr as dx

df = dx.datasets.timeseries()
df.head()

df.groupby("name").x.mean().compute()

Query Representation

Dask-expr encodes user code in an expression tree:

>>> df.x.mean().pprint()

Mean:
  Projection: columns='x'
    Timeseries: seed=1896674884

This expression tree will be optimized and modified before execution:

>>> df.x.mean().optimize().pprint()

Div:
  Sum:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
  Count:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884

Stability

This is the default backend for dask.DataFrame since version 2024.3.0.

API Coverage

Dask-Expr covers almost everything of the Dask DataFrame API. The only missing features are:

  • named GroupBy Aggregations

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dask_expr-2.0.0.tar.gz (3.8 kB view details)

Uploaded Source

Built Distribution

dask_expr-2.0.0-py3-none-any.whl (3.2 kB view details)

Uploaded Python 3

File details

Details for the file dask_expr-2.0.0.tar.gz.

File metadata

  • Download URL: dask_expr-2.0.0.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for dask_expr-2.0.0.tar.gz
Algorithm Hash digest
SHA256 e6d5a7bdff927aa4a1754a76cad79c5925254c525f568038df7832fcd558fc11
MD5 b08ad4079865598dc35c78ce40d0b49d
BLAKE2b-256 c2d51c3311f7e0fee2415fe61fdd0c30e88134767cb3b5da8ee653eaf4c052de

See more details on using hashes here.

File details

Details for the file dask_expr-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: dask_expr-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 3.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for dask_expr-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8840da136aadd2c4d672990744c4b82179b69ecaba991db767c62a855a31852f
MD5 aac90b2c0906d22987ce601f2d3e0862
BLAKE2b-256 d0f5f0ae5e6b38d3383508b7a6669c90def361a4caa7b8b3b7e673828e34ac2e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page