Skip to main content

Geneva - Multimodal Data Lake for AI

Project description

Geneva - Multimodal Data Platform

Geneva is a petabyte-scale multimodal feature engineering and data management platform built on LanceDB.

What lives in this repo

  • src/ - Geneva client library and core runtime
  • src/tests/ - unit test suites
  • src/integ_tests/ - integration test suites
  • src/stress_tests/ - stress and load tests
  • docs/ - mkdocs configuration and API docs (source for autogenerated public facing api documentation page)
  • e2e/ - end-to-end test suites and UDF manifests
  • notebook/ - quickstart notebooks (TODO replace with link to colab demo notebook)
  • internal_docs/ - internal design and operational notes
  • tools/ - helper scripts for local clusters and cleanup

User facing documentation should be submitted here.

Quickstart (local development)

uv sync --all-groups --all-extras --locked
import geneva
import pyarrow as pa

@geneva.udf(data_type=pa.int32())
def double(x: int) -> int:
    return x * 2

conn = geneva.connect("./db")
table = conn.create_table("numbers", [{"x": i} for i in range(10)])
table.add_columns({"doubled": double})
with conn.local_ray_context():
    table.backfill("doubled")
result = table.search().select(["doubled"]).to_arrow()

Development

See Development for details.

Configuration

Geneva supports specifying configuration in a few different ways. Refer to CONFIGURATION.md for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geneva-0.12.0.tar.gz (3.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geneva-0.12.0-py3-none-any.whl (371.9 kB view details)

Uploaded Python 3

File details

Details for the file geneva-0.12.0.tar.gz.

File metadata

  • Download URL: geneva-0.12.0.tar.gz
  • Upload date:
  • Size: 3.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.2 {"installer":{"name":"uv","version":"0.11.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for geneva-0.12.0.tar.gz
Algorithm Hash digest
SHA256 178f96828ed9ceeadd161e230b1bcdb55ed9ccb38e73a0d72a5024a12770c5cf
MD5 3457c2d6c55fa0035168c685771e7f62
BLAKE2b-256 eee797297410bf44686d680217b8238995c233d6222f705a03565c038cdee53a

See more details on using hashes here.

File details

Details for the file geneva-0.12.0-py3-none-any.whl.

File metadata

  • Download URL: geneva-0.12.0-py3-none-any.whl
  • Upload date:
  • Size: 371.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.2 {"installer":{"name":"uv","version":"0.11.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for geneva-0.12.0-py3-none-any.whl
Algorithm Hash digest
SHA256 86ce35551a363b1b66af8951ccb9560875b8517cd557987940ab55567401fc88
MD5 7fc5634e0461edd433c43ab1e3132df5
BLAKE2b-256 6256d0f40f5cb98892e1ad97eac382dc5109f7b7402b36fa00f390319e950d47

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page