Skip to main content

BigQuery DataFrames -- scalable analytics and machine learning with BigQuery

Project description

GA pypi versions

BigQuery DataFrames (also known as BigFrames) provides a Pythonic DataFrame and machine learning (ML) API powered by the BigQuery engine. It provides modules for many use cases, including:

  • bigframes.pandas is a pandas API for analytics. Many workloads can be migrated from pandas to bigframes by just changing a few imports.

  • bigframes.ml is a scikit-learn-like API for ML.

  • bigframes.bigquery.ai are a collection of powerful AI methods, powered by Gemini.

BigQuery DataFrames is an open-source package.

Getting started with BigQuery DataFrames

The easiest way to get started is to try the BigFrames quickstart in a notebook in BigQuery Studio.

To use BigFrames in your local development environment,

  1. Run pip install --upgrade bigframes to install the latest version.

  2. Setup Application default credentials for your local development environment enviroment.

  3. Create a GCP project with the BigQuery API enabled.

  4. Use the bigframes package to query data.

import bigframes.pandas as bpd

bpd.options.bigquery.project = your_gcp_project_id  # Optional in BQ Studio.
bpd.options.bigquery.ordering_mode = "partial"  # Recommended for performance.
df = bpd.read_gbq("bigquery-public-data.usa_names.usa_1910_2013")
print(
    df.groupby("name")
    .agg({"number": "sum"})
    .sort_values("number", ascending=False)
    .head(10)
    .to_pandas()
)

Documentation

To learn more about BigQuery DataFrames, visit these pages

License

BigQuery DataFrames is distributed with the Apache-2.0 license.

It also contains code derived from the following third-party packages:

For details, see the third_party directory.

Contact Us

For further help and provide feedback, you can email us at bigframes-feedback@google.com.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bigframes-2.40.0.tar.gz (3.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bigframes-2.40.0-py2.py3-none-any.whl (1.9 MB view details)

Uploaded Python 2Python 3

File details

Details for the file bigframes-2.40.0.tar.gz.

File metadata

  • Download URL: bigframes-2.40.0.tar.gz
  • Upload date:
  • Size: 3.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.2.0 CPython/3.11.2

File hashes

Hashes for bigframes-2.40.0.tar.gz
Algorithm Hash digest
SHA256 5c43f8b1ffbec55857e32945745e64f0fd8f02ae9939f5a1154cc57443feaeb0
MD5 4e13f7f44619af6e2643ec60524aa645
BLAKE2b-256 444a5c6dc71cf9d320fc8d502b91f2a1e92ea70d3aea23971bbb9d05ddcad9bd

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigframes-2.40.0.tar.gz:

Publisher: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
  • Statement: Publication detail:
    • Token Issuer: https://accounts.google.com
    • Service Account: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

File details

Details for the file bigframes-2.40.0-py2.py3-none-any.whl.

File metadata

  • Download URL: bigframes-2.40.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 1.9 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.2.0 CPython/3.11.2

File hashes

Hashes for bigframes-2.40.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 6585ff2eafb30143bd0e33235da29df6ed9e7bebb735bacfd7f841626f26c80a
MD5 54ecd9ca69795453730ba0d3058b0985
BLAKE2b-256 e6e3cf4be18eb373883d3117c5a78f1a0af9a09ebe60fc9cc199f61f8552f427

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigframes-2.40.0-py2.py3-none-any.whl:

Publisher: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
  • Statement: Publication detail:
    • Token Issuer: https://accounts.google.com
    • Service Account: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page