Geneva - Multimodal Data Lake for AI

Geneva - Multimodal Data Platform

Geneva is a petabyte-scale multimodal feature engineering and data management platform built on LanceDB.

What lives in this repo

  • src/ - Geneva client library and core runtime
  • src/tests/ - unit test suites
  • src/integ_tests/ - integration test suites
  • src/stress_tests/ - stress and load tests
  • docs/ - MkDocs configuration and API docs (source for the autogenerated public-facing API documentation page)
  • e2e/ - end-to-end test suites and UDF manifests
  • notebook/ - quickstart notebooks (TODO: replace with a link to the Colab demo notebook)
  • internal_docs/ - internal design and operational notes
  • tools/ - helper scripts for local clusters and cleanup

User-facing documentation should be submitted here.

Quickstart (local development)

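Install the dependencies from the repository root:

uv sync --all-groups --all-extras --locked

Then define a UDF, attach it to a table as a new column, and backfill it:

import geneva
import pyarrow as pa

# UDF that produces an int32 value for each input x
@geneva.udf(data_type=pa.int32())
def double(x: int) -> int:
    return x * 2

conn = geneva.connect("./db")  # local database directory
table = conn.create_table("numbers", [{"x": i} for i in range(10)])
table.add_columns({"doubled": double})  # register the UDF-backed column
with conn.local_ray_context():  # run the backfill on a local Ray context
    table.backfill("doubled")
result = table.search().select(["doubled"]).to_arrow()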
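
To make the expected result concrete, the following plain-Python sketch reproduces what the quickstart computes, without Geneva, LanceDB, or Ray. It only illustrates the data flow (define a per-row function, then materialize a new column from it). The helper name `backfill_column` is hypothetical and is not part of the Geneva API.

```python
def double(x: int) -> int:
    """Same per-row logic as the quickstart UDF."""
    return x * 2


def backfill_column(rows, name, fn, source):
    """Hypothetical helper: return new rows with `name` set to fn(row[source])."""
    return [{**row, name: fn(row[source])} for row in rows]


# Same input data as the quickstart: ten rows with x = 0..9
rows = [{"x": i} for i in range(10)]
rows = backfill_column(rows, "doubled", double, source="x")

# Equivalent of selecting only the "doubled" column
doubled = [row["doubled"] for row in rows]
print(doubled)  # [0, 2, 4, 6, 8, 10, 12, 14, 16, 18]
```

In the real quickstart the same values would be expected in the `doubled` column of `result`, returned as an Arrow table rather than a Python list.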

Development

See Development for details.

Configuration

Geneva supports specifying configuration in a few different ways. Refer to CONFIGURATION.md for more details.

Download files

Source Distribution

geneva-0.10.1.tar.gz (7.6 MB)

Uploaded Source

Built Distribution

geneva-0.10.1-py3-none-any.whl (276.2 kB)

Uploaded Python 3

File details

Details for the file geneva-0.10.1.tar.gz.

File metadata

  • Download URL: geneva-0.10.1.tar.gz
  • Upload date:
  • Size: 7.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.3

File hashes

Hashes for geneva-0.10.1.tar.gz
Algorithm Hash digest
SHA256 3d61f021da9c406b6427e7c8feb00897e294159305636340fa13857246324490
MD5 e539db1bb3f1196d2b9eb00f1ba3b4b6
BLAKE2b-256 583a3c310881f4c33366e31fa40e2d617c05f750f80187482634bac38458193a
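
A downloaded file can be checked against the SHA256 digest listed above using only the Python standard library. This is a minimal sketch, assuming the archive was saved to the current directory under its published file name. The helper `sha256_of_file` is illustrative and not part of Geneva.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for geneva-0.10.1.tar.gz
EXPECTED_SHA256 = "3d61f021da9c406b6427e7c8feb00897e294159305636340fa13857246324490"


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash the file in chunks so large archives are not loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Assumed download location; adjust to wherever the file was saved
archive = Path("geneva-0.10.1.tar.gz")
if archive.exists():
    actual = sha256_of_file(archive)
    print("OK" if actual == EXPECTED_SHA256 else f"MISMATCH: {actual}")
else:
    print(f"{archive} not found; download it first")
```

The same function works for the wheel by substituting its file name and published digest.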

File details

Details for the file geneva-0.10.1-py3-none-any.whl.

File metadata

  • Download URL: geneva-0.10.1-py3-none-any.whl
  • Upload date:
  • Size: 276.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.3

File hashes

Hashes for geneva-0.10.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6836b73e6c6dc0d217da2ebd0de176ca155e8485680e27c5898766ad0e23cf32
MD5 b327292da589139b023dbad14006e594
BLAKE2b-256 341191d015ed0545b350e87cfbb52c3a5e3a38d25c0fab1cf0225f48350433e2
