Skip to main content

BigQuery DataFrames -- scalable analytics and machine learning with BigQuery

Project description

GA pypi versions

BigQuery DataFrames (also known as BigFrames) provides a Pythonic DataFrame and machine learning (ML) API powered by the BigQuery engine. It provides modules for many use cases, including:

  • bigframes.pandas is a pandas API for analytics. Many workloads can be migrated from pandas to bigframes by just changing a few imports.

  • bigframes.ml is a scikit-learn-like API for ML.

  • bigframes.bigquery.ai are a collection of powerful AI methods, powered by Gemini.

BigQuery DataFrames is an open-source package.

Getting started with BigQuery DataFrames

The easiest way to get started is to try the BigFrames quickstart in a notebook in BigQuery Studio.

To use BigFrames in your local development environment,

  1. Run pip install --upgrade bigframes to install the latest version.

  2. Setup Application default credentials for your local development environment enviroment.

  3. Create a GCP project with the BigQuery API enabled.

  4. Use the bigframes package to query data.

import bigframes.pandas as bpd

bpd.options.bigquery.project = your_gcp_project_id  # Optional in BQ Studio.
bpd.options.bigquery.ordering_mode = "partial"  # Recommended for performance.
df = bpd.read_gbq("bigquery-public-data.usa_names.usa_1910_2013")
print(
    df.groupby("name")
    .agg({"number": "sum"})
    .sort_values("number", ascending=False)
    .head(10)
    .to_pandas()
)

Documentation

To learn more about BigQuery DataFrames, visit these pages

License

BigQuery DataFrames is distributed with the Apache-2.0 license.

It also contains code derived from the following third-party packages:

For details, see the third_party directory.

Contact Us

For further help and provide feedback, you can email us at bigframes-feedback@google.com.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bigframes-2.38.0.tar.gz (3.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bigframes-2.38.0-py2.py3-none-any.whl (1.9 MB view details)

Uploaded Python 2Python 3

File details

Details for the file bigframes-2.38.0.tar.gz.

File metadata

  • Download URL: bigframes-2.38.0.tar.gz
  • Upload date:
  • Size: 3.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.2.0 CPython/3.11.2

File hashes

Hashes for bigframes-2.38.0.tar.gz
Algorithm Hash digest
SHA256 24d23676588bcd1082664376888afcd6b31d5a592db71b01e455e1e00f94c10f
MD5 1e079cf055ca66fa1d6068a2554e1854
BLAKE2b-256 eb361f3e18a3514df46e7523b6f8f40b55ec6cd949e2d2bcc1db3fe0ce9dbbbe

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigframes-2.38.0.tar.gz:

Publisher: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
  • Statement: Publication detail:
    • Token Issuer: https://accounts.google.com
    • Service Account: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

File details

Details for the file bigframes-2.38.0-py2.py3-none-any.whl.

File metadata

  • Download URL: bigframes-2.38.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 1.9 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.2.0 CPython/3.11.2

File hashes

Hashes for bigframes-2.38.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 aa8121036659377bace1c6ff27afafe42a502f29aef37ee8bf5f113e469c6f31
MD5 55d4b53341c1232718a2e056e1980c16
BLAKE2b-256 788e46b3e1de75b2727a577aa5511eeeb7e866b37a0db18ee5639078e7bbf786

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigframes-2.38.0-py2.py3-none-any.whl:

Publisher: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
  • Statement: Publication detail:
    • Token Issuer: https://accounts.google.com
    • Service Account: google-cloud-sdk-py@oss-exit-gate-prod.iam.gserviceaccount.com

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page