The standard API for egocentric data.
Project description
BuildAI
The standard API for egocentric data. $1/hr.
Quickstart
pip install buildai-data
from buildai import BuildAI
client = BuildAI(api_key="build_data_xxxx")
client.download(hours=10, output_dir="./data")
That's it. Ten shards download to ./data/. Each shard is one hour of egocentric video with frame-synced IMU data.
What you get
Each shard is a tar file containing 20 three-minute clips:
shard-000001.tar
├── 000001.mp4 # H.265, 1080p, 30fps, no audio
├── 000001.imu.npy # (5400, 6) float32, frame-synced
├── 000002.mp4
├── 000002.imu.npy
└── ... # 20 clips total
Video: H.265, 1080p, 30fps, no audio. Three minutes per clip.
IMU: NumPy array, shape (5400, 6), float32. Columns: [acc_x, acc_y, acc_z, gyro_x, gyro_y, gyro_z]. Row N = frame N (30 fps x 180 seconds = 5400 frames). Units: accelerometer in m/s2, gyroscope in rad/s.
Shard size: ~2 GB per shard, ~1 hour of footage, 20 clips.
Shards are standard WebDataset format. If you already have a WebDataset pipeline, point it at the downloaded tars directly.
Download
from buildai import BuildAI
client = BuildAI(api_key="build_data_xxxx")
# Download 100 hours of data
client.download(hours=100, output_dir="/data/buildai/")
# Download more - you'll get new shards you haven't seen before
client.download(hours=1000, output_dir="/data/buildai/")
Every call gets new shards. You never receive the same shard twice. Shards are downloaded sequentially. Every researcher gets the same sequence.
Downloads resume on failure. If a shard already exists locally and the checksum matches, it's skipped.
| Parameter | Default | Description |
|---|---|---|
hours |
required | Number of hours to download (1 shard = 1 hour) |
output_dir |
./buildai-data |
Where to save shard tars |
workers |
8 | Parallel download threads |
verify_checksum |
True | SHA-256 verification after download |
Visualize
from buildai import BuildAI
BuildAI.visualize(source="/data/buildai/")
Opens a local web viewer in your browser. Shows a grid of all clips across your downloaded shards. Click any clip to watch the video with accelerometer and gyroscope data graphed below, time-synced.
No data is uploaded. Everything runs locally. Press Ctrl+C to stop.
No API key needed. visualize is a static method that works on any directory of shard tars.
Pricing
$1 per hour. Billed monthly. No minimum, no commitment.
API key
Get your API key at build.ai. You can also set it via environment variable:
export BUILDAI_API_KEY=build_data_xxxx
# Picks up BUILDAI_API_KEY automatically
client = BuildAI()
Account
info = client.account
print(info["total_hours"])
print(info["total_spent_usd"])
print(info["shards_remaining"])
Requirements
- Python 3.10+
- httpx (HTTP client)
- numpy (IMU data)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file buildai_data-0.3.1.tar.gz.
File metadata
- Download URL: buildai_data-0.3.1.tar.gz
- Upload date:
- Size: 165.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d3632b6392b2b006a431a874a942d68946baf5ccf1305a88e7e49e9ca94ddee6
|
|
| MD5 |
707d1d41c3d247bbb45a5db4936f839b
|
|
| BLAKE2b-256 |
53f14dc990da02eb92d470b854065d9f17452d9206cf11b9ad89b5b04c6c448a
|
File details
Details for the file buildai_data-0.3.1-py3-none-any.whl.
File metadata
- Download URL: buildai_data-0.3.1-py3-none-any.whl
- Upload date:
- Size: 18.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b15dd10dd9c0d0b7fed91c9865a140355cd00444ebd1c2a422f74dede583b03d
|
|
| MD5 |
1746e215d9981abc4ff8483502f5e8ab
|
|
| BLAKE2b-256 |
2c88bdd3e9c826469590637f7d816c6baa94205188c9d9939d80cd3432149641
|