Skip to main content

Geneva - Multimodal Data Lake for AI

Project description

Geneva - Multimodal Data Platform

Geneva is a petabyte-scale multimodal feature engineering and data management platform built on LanceDB.

What lives in this repo

  • src/ - Geneva client library and core runtime
  • src/tests/ - unit test suites
  • src/integ_tests/ - integration test suites
  • src/stress_tests/ - stress and load tests
  • docs/ - mkdocs configuration and API docs (source for autogenerated public facing api documentation page)
  • e2e/ - end-to-end test suites and UDF manifests
  • notebook/ - quickstart notebooks (TODO replace with link to colab demo notebook)
  • internal_docs/ - internal design and operational notes
  • tools/ - helper scripts for local clusters and cleanup

User facing documentation should be submitted here.

Quickstart (local development)

uv sync --all-groups --all-extras --locked
import geneva
import pyarrow as pa

@geneva.udf(data_type=pa.int32())
def double(x: int) -> int:
    return x * 2

conn = geneva.connect("./db")
table = conn.create_table("numbers", [{"x": i} for i in range(10)])
table.add_columns({"doubled": double})
with conn.local_ray_context():
    table.backfill("doubled")
result = table.search().select(["doubled"]).to_arrow()

Development

See Development for details.

Configuration

Geneva supports specifying configuration in a few different ways. Refer to CONFIGURATION.md for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geneva-0.10.2.tar.gz (7.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geneva-0.10.2-py3-none-any.whl (279.0 kB view details)

Uploaded Python 3

File details

Details for the file geneva-0.10.2.tar.gz.

File metadata

  • Download URL: geneva-0.10.2.tar.gz
  • Upload date:
  • Size: 7.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for geneva-0.10.2.tar.gz
Algorithm Hash digest
SHA256 4c76ced3fe77a32bc0022cfe79e557a3895bf1ee102adfac2b330caa4b426cf1
MD5 656d38a6bd2bc5f021700f56aaeea7dc
BLAKE2b-256 8eed52295b0ff3d42b57f7a48002fedb627423ca1cec03527f5322dcb9051de6

See more details on using hashes here.

File details

Details for the file geneva-0.10.2-py3-none-any.whl.

File metadata

  • Download URL: geneva-0.10.2-py3-none-any.whl
  • Upload date:
  • Size: 279.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.4 {"installer":{"name":"uv","version":"0.10.4","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for geneva-0.10.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6bb38d401ed3cb89e64bee8e275ce39f91b57ba95fcb167a3df2ace2b8e9eb61
MD5 aa2e0179ca7ed89ab1a46a99790ab694
BLAKE2b-256 1862a09ddb8ebda425447ce14e42159e230aebf76bf66802376cd236b92d96bb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page