Skip to main content

Geneva - Multimodal Data Lake for AI

Project description

Geneva - Multimodal Data Platform

Geneva is a petabyte-scale multimodal feature engineering and data management platform built on LanceDB.

What lives in this repo

  • src/ - Geneva client library and core runtime
  • src/tests/ - unit test suites
  • src/integ_tests/ - integration test suites
  • src/stress_tests/ - stress and load tests
  • docs/ - mkdocs configuration and API docs (source for autogenerated public facing api documentation page)
  • e2e/ - end-to-end test suites and UDF manifests
  • notebook/ - quickstart notebooks (TODO replace with link to colab demo notebook)
  • internal_docs/ - internal design and operational notes
  • tools/ - helper scripts for local clusters and cleanup

User facing documentation should be submitted here.

Quickstart (local development)

uv sync --all-groups --all-extras --locked
import geneva
import pyarrow as pa

@geneva.udf(data_type=pa.int32())
def double(x: int) -> int:
    return x * 2

conn = geneva.connect("./db")
table = conn.create_table("numbers", [{"x": i} for i in range(10)])
table.add_columns({"doubled": double})
with conn.local_ray_context():
    table.backfill("doubled")
result = table.search().select(["doubled"]).to_arrow()

Development

See Development for details.

Configuration

Geneva supports specifying configuration in a few different ways. Refer to CONFIGURATION.md for more details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geneva-0.10.0.tar.gz (7.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geneva-0.10.0-py3-none-any.whl (266.6 kB view details)

Uploaded Python 3

File details

Details for the file geneva-0.10.0.tar.gz.

File metadata

  • Download URL: geneva-0.10.0.tar.gz
  • Upload date:
  • Size: 7.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for geneva-0.10.0.tar.gz
Algorithm Hash digest
SHA256 9df9717126aa7a77ee525c406dab34937ff9bc1a20335d8125d66163efa25d1f
MD5 2f5a0f3c7bfa9af81029b96a4d306b03
BLAKE2b-256 c7910962d0ff095000bb68f63c143611e7f4c20c7594376a25d69f333d007da1

See more details on using hashes here.

File details

Details for the file geneva-0.10.0-py3-none-any.whl.

File metadata

  • Download URL: geneva-0.10.0-py3-none-any.whl
  • Upload date:
  • Size: 266.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for geneva-0.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e8330297623a67914676e70cb84c86ea1175db8325090eeeb9a2feb43f592549
MD5 5fbc8ac6bbc05d74ccabda025ef319da
BLAKE2b-256 c92b063965785134b61ac1c1a5954dffea1080bd561ad9489d148217897ccc74

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page