Skip to main content

BigQuery DataFrames -- scalable analytics and machine learning with BigQuery

Project description

GA pypi versions

BigQuery DataFrames (also known as BigFrames) provides a Pythonic DataFrame and machine learning (ML) API powered by the BigQuery engine. It provides modules for many use cases, including:

  • bigframes.pandas is a pandas API for analytics. Many workloads can be migrated from pandas to bigframes by just changing a few imports.

  • bigframes.ml is a scikit-learn-like API for ML.

  • bigframes.bigquery.ai are a collection of powerful AI methods, powered by Gemini.

BigQuery DataFrames is an open-source package.

Getting started with BigQuery DataFrames

The easiest way to get started is to try the BigFrames quickstart in a notebook in BigQuery Studio.

To use BigFrames in your local development environment,

  1. Run pip install --upgrade bigframes to install the latest version.

  2. Setup Application default credentials for your local development environment enviroment.

  3. Create a GCP project with the BigQuery API enabled.

  4. Use the bigframes package to query data.

import bigframes.pandas as bpd

bpd.options.bigquery.project = your_gcp_project_id  # Optional in BQ Studio.
bpd.options.bigquery.ordering_mode = "partial"  # Recommended for performance.
df = bpd.read_gbq("bigquery-public-data.usa_names.usa_1910_2013")
print(
    df.groupby("name")
    .agg({"number": "sum"})
    .sort_values("number", ascending=False)
    .head(10)
    .to_pandas()
)

Documentation

To learn more about BigQuery DataFrames, visit these pages

License

BigQuery DataFrames is distributed with the Apache-2.0 license.

It also contains code derived from the following third-party packages:

For details, see the third_party directory.

Contact Us

For further help and provide feedback, you can email us at bigframes-feedback@google.com.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bigframes-2.42.0.tar.gz (3.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bigframes-2.42.0-py2.py3-none-any.whl (2.0 MB view details)

Uploaded Python 2Python 3

File details

Details for the file bigframes-2.42.0.tar.gz.

File metadata

  • Download URL: bigframes-2.42.0.tar.gz
  • Upload date:
  • Size: 3.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.2.0 CPython/3.11.2

File hashes

Hashes for bigframes-2.42.0.tar.gz
Algorithm Hash digest
SHA256 b578a0771fc5f9c77005313dd454f72989ef8a40e70d11b6f77935f5a7fd0f07
MD5 f697e1762cc006b1828dbb31a7fbeb78
BLAKE2b-256 e8cfc66a3734d5eb088df214cafebd4fefd15a57764a222c12e3e142f4e944d0

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigframes-2.42.0.tar.gz:

Publisher: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
  • Statement: Publication detail:
    • Token Issuer: https://accounts.google.com
    • Service Account: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

File details

Details for the file bigframes-2.42.0-py2.py3-none-any.whl.

File metadata

  • Download URL: bigframes-2.42.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 2.0 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.2.0 CPython/3.11.2

File hashes

Hashes for bigframes-2.42.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 13d44bc6cec541a160b172e71f0b77cb92d77afd3374e85cc519f01b08aab4d2
MD5 1481afe5e0303bade2c68dd75cfb1dc6
BLAKE2b-256 ae6ede6d5338a3179f39290734ca4fe70501bebbefa5c3f99e097c9592fa4ec7

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigframes-2.42.0-py2.py3-none-any.whl:

Publisher: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
  • Statement: Publication detail:
    • Token Issuer: https://accounts.google.com
    • Service Account: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page