Geneva - Multimodal Data Lake for AI

Project description

Geneva - Multimodal Data Platform

Geneva is a petabyte-scale multimodal feature engineering and data management platform built on LanceDB.

What lives in this repo

  • src/ - Geneva client library and core runtime
  • src/tests/ - unit test suites
  • src/integ_tests/ - integration test suites
  • src/stress_tests/ - stress and load tests
  • docs/ - MkDocs configuration and API docs (source for the autogenerated public-facing API documentation page)
  • e2e/ - end-to-end test suites and UDF manifests
  • notebook/ - quickstart notebooks (TODO replace with link to colab demo notebook)
  • internal_docs/ - internal design and operational notes
  • tools/ - helper scripts for local clusters and cleanup

User-facing documentation should be submitted here.

Quickstart (local development)

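# Install dependencies from the lockfile (run in a shell at the repo root)
uv sync --all-groups --all-extras --locked

# Then, in Python:
import geneva
import pyarrow as pa

# Define a UDF whose output column has Arrow type int32
@geneva.udf(data_type=pa.int32())
def double(x: int) -> int:
    return x * 2

# Open a local database and create a ten-row table with a single column "x"
conn = geneva.connect("./db")
table = conn.create_table("numbers", [{"x": i} for i in range(10)])

# Add a "doubled" column backed by the UDF, then backfill it in a local Ray context
table.add_columns({"doubled": double})
with conn.local_ray_context():
    table.backfill("doubled")

# Read the backfilled column as an Arrow table
result = table.search().select(["doubled"]).to_arrow()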

Development

See Development for details.

Configuration

Geneva supports specifying configuration in a few different ways. Refer to CONFIGURATION.md for more details.

Download files


Source Distribution

geneva-0.11.0.tar.gz (3.2 MB)

Uploaded Source

Built Distribution


geneva-0.11.0-py3-none-any.whl (354.7 kB)

Uploaded Python 3

File details

Details for the file geneva-0.11.0.tar.gz.

File metadata

  • Download URL: geneva-0.11.0.tar.gz
  • Upload date:
  • Size: 3.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.11 (uv publish, from CI on Ubuntu 24.04)

File hashes

Hashes for geneva-0.11.0.tar.gz
Algorithm Hash digest
SHA256 d511c10eadee19ba0031ea9a6d2cfcb51f6e281874cb1dfabed063ab4819c823
MD5 da38bc4f527234c89b69b709a2ea3912
BLAKE2b-256 955db9117b7af8b52d3a96e88532032461f1b72f0597dd4affe3490c1839dc8f
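A published digest only helps if the downloaded file is actually checked against it. The following is a minimal sketch of that check using only Python's standard `hashlib` module. The helper names `sha256_of_file` and `verify_download`, and the assumption that the archive sits in the current directory, are illustrative and not part of Geneva or PyPI tooling.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for geneva-0.11.0.tar.gz.
EXPECTED_SHA256 = "d511c10eadee19ba0031ea9a6d2cfcb51f6e281874cb1dfabed063ab4819c823"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_hex):
    """Return True if the file's SHA256 digest matches expected_hex."""
    return sha256_of_file(path) == expected_hex.lower()


if __name__ == "__main__":
    # Hypothetical location: assumes the archive was saved to the current directory.
    archive = Path("geneva-0.11.0.tar.gz")
    if archive.exists():
        print("OK" if verify_download(archive, EXPECTED_SHA256) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same helper can be pointed at the wheel by passing its file name together with the SHA256 digest listed for geneva-0.11.0-py3-none-any.whl.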


File details

Details for the file geneva-0.11.0-py3-none-any.whl.

File metadata

  • Download URL: geneva-0.11.0-py3-none-any.whl
  • Upload date:
  • Size: 354.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.11 (uv publish, from CI on Ubuntu 24.04)

File hashes

Hashes for geneva-0.11.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c599a7c4b9dc687380707380a3b74c515a9b613828ffa0e614914798de9bc06b
MD5 b944be7c41f2a30dc861cc0ab75406a4
BLAKE2b-256 4b1c06f2d0d6cd441cbc85ca80a4887ba4fa22a1340f123dd30c9096c236e49d
