Skip to main content

The standard API for egocentric data. $1/hr.

Project description

BuildAI

The standard API for egocentric data. $1/hr.

Quickstart

pip install buildai-data
from buildai import BuildAI

client = BuildAI(api_key="build_data_xxxx")
client.download(hours=10, output_dir="./data")

That's it. Ten shards download to ./data/. Each shard is one hour of egocentric video with frame-synced IMU data.

What you get

Each shard is a tar file containing 20 three-minute clips:

shard-000001.tar
  ├── 000001.mp4       # H.265, 1080p, 30fps, no audio
  ├── 000001.imu.npy   # (5400, 6) float32, frame-synced
  ├── 000002.mp4
  ├── 000002.imu.npy
  └── ...              # 20 clips total

Video: H.265, 1080p, 30fps, no audio. Three minutes per clip.

IMU: NumPy array, shape (5400, 6), float32. Columns: [acc_x, acc_y, acc_z, gyro_x, gyro_y, gyro_z]. Row N = frame N (30 fps x 180 seconds = 5400 frames). Units: accelerometer in m/s2, gyroscope in rad/s.

Shard size: ~2 GB per shard, ~1 hour of footage, 20 clips.

Shards are standard WebDataset format. If you already have a WebDataset pipeline, point it at the downloaded tars directly.

Download

from buildai import BuildAI

client = BuildAI(api_key="build_data_xxxx")

# Download 100 hours of data
client.download(hours=100, output_dir="/data/buildai/")

# Download more - you'll get new shards you haven't seen before
client.download(hours=1000, output_dir="/data/buildai/")

Every call gets new shards. You never receive the same shard twice. Shards are downloaded sequentially. Every researcher gets the same sequence.

Downloads resume on failure. If a shard already exists locally and the checksum matches, it's skipped.

Parameter Default Description
hours required Number of hours to download (1 shard = 1 hour)
output_dir ./buildai-data Where to save shard tars
workers 8 Parallel download threads
verify_checksum True SHA-256 verification after download

Visualize

from buildai import BuildAI

BuildAI.visualize(source="/data/buildai/")

Opens a local web viewer in your browser. Shows a grid of all clips across your downloaded shards. Click any clip to watch the video with accelerometer and gyroscope data graphed below, time-synced.

No data is uploaded. Everything runs locally. Press Ctrl+C to stop.

No API key needed. visualize is a static method that works on any directory of shard tars.

Pricing

$1 per hour. Billed monthly. No minimum, no commitment.

API key

Get your API key at build.ai. You can also set it via environment variable:

export BUILDAI_API_KEY=build_data_xxxx
# Picks up BUILDAI_API_KEY automatically
client = BuildAI()

Account

info = client.account
print(info["total_hours"])
print(info["total_spent_usd"])
print(info["shards_remaining"])

Requirements

  • Python 3.10+
  • httpx (HTTP client)
  • numpy (IMU data)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

buildai_data-0.2.5.tar.gz (164.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

buildai_data-0.2.5-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file buildai_data-0.2.5.tar.gz.

File metadata

  • Download URL: buildai_data-0.2.5.tar.gz
  • Upload date:
  • Size: 164.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for buildai_data-0.2.5.tar.gz
Algorithm Hash digest
SHA256 a0be73b731c0be5e9ee697baf832a83872dff02d8ea4e1f4a8595df14a166f04
MD5 e0b2969ea55a2a5e346071d010c4cf90
BLAKE2b-256 2e9a14f4e448987c9276d7fe15a24b6ba3f5f5f42f4ff10b87ae2c2cbcfe7be4

See more details on using hashes here.

File details

Details for the file buildai_data-0.2.5-py3-none-any.whl.

File metadata

  • Download URL: buildai_data-0.2.5-py3-none-any.whl
  • Upload date:
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for buildai_data-0.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 973b4ca2b4aa1b8b5f9ae47f40fa694391bca0a211f492f3d68ed51e80d52e4a
MD5 40f6c5b746b354051c50a5dba9ae90f3
BLAKE2b-256 86556d485a177d0f34e73c4d9d8b058616166a0355540a5a81cae8e0e110d03a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page