Skip to main content

With CocoIndex, users declare the transformation, CocoIndex creates & maintains an index, and keeps the derived index up to date based on source update, with minimal computation and changes.

Project description

CocoIndex

Data transformation for AI

GitHub Documentation License PyPI version

PyPI Downloads CI release Link Check prek Discord

cocoindex-io%2Fcocoindex | Trendshift

Ultra performant data transformation framework for AI, with core engine written in Rust. Support incremental processing and data lineage out-of-box. Exceptional developer velocity. Production-ready at day 0.

⭐ Drop a star to help us grow!


CocoIndex Transformation


CocoIndex makes it effortless to transform data with AI, and keep source data and target in sync. Whether you’re building a vector index, creating knowledge graphs for context engineering or performing any custom data transformations — goes beyond SQL.


CocoIndex Features


Exceptional velocity

Just declare transformation in dataflow with ~100 lines of python

# import
data['content'] = flow_builder.add_source(...)

# transform
data['out'] = data['content']
    .transform(...)
    .transform(...)

# collect data
collector.collect(...)

# export to db, vector db, graph db ...
collector.export(...)

CocoIndex follows the idea of Dataflow programming model. Each transformation creates a new field solely based on input fields, without hidden states and value mutation. All data before/after each transformation is observable, with lineage out of the box.

Particularly, developers don't explicitly mutate data by creating, updating and deleting. They just need to define transformation/formula for a set of source data.

Plug-and-Play Building Blocks

Native builtins for different source, targets and transformations. Standardize interface, make it 1-line code switch between different components - as easy as assembling building blocks.

CocoIndex Features

Data Freshness

CocoIndex keep source data and target in sync effortlessly.

Incremental Processing

It has out-of-box support for incremental indexing:

  • minimal recomputation on source or logic change.
  • (re-)processing necessary portions; reuse cache when possible

Quick Start

If you're new to CocoIndex, we recommend checking out

Setup

  1. Install CocoIndex Python library

Note: CocoIndex v1 is currently in preview (pre-release). Use the --pre flag with pip, or configure your package manager to allow pre-releases.

pip install -U --pre cocoindex
  1. Install Postgres if you don't have one. CocoIndex uses it for incremental processing.

  2. (Optional) Install Claude Code skill for enhanced development experience. Run these commands in Claude Code:

/plugin marketplace add cocoindex-io/cocoindex-claude
/plugin install cocoindex-skills@cocoindex

📖 Documentation

For detailed documentation, visit CocoIndex Documentation, including a Quickstart guide.

🤝 Contributing

We love contributions from our community ❤️. For details on contributing or running the project for development, check out our contributing guide.

👥 Community

Welcome with a huge coconut hug 🥥⋆。˚🤗. We are super excited for community contributions of all kinds - whether it's code improvements, documentation updates, issue reports, feature requests, and discussions in our Discord.

Join our community here:

Support us

We are constantly improving, and more features and examples are coming soon. If you love this project, please drop us a star ⭐ at GitHub repo GitHub to stay tuned and help us grow.

License

CocoIndex is Apache 2.0 licensed.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cocoindex-1.0.0a23.tar.gz (267.4 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cocoindex-1.0.0a23-cp314-cp314t-win_amd64.whl (6.4 MB view details)

Uploaded CPython 3.14tWindows x86-64

cocoindex-1.0.0a23-cp314-cp314t-macosx_11_0_arm64.whl (6.3 MB view details)

Uploaded CPython 3.14tmacOS 11.0+ ARM64

cocoindex-1.0.0a23-cp311-abi3-win_amd64.whl (6.4 MB view details)

Uploaded CPython 3.11+Windows x86-64

cocoindex-1.0.0a23-cp311-abi3-macosx_11_0_arm64.whl (6.3 MB view details)

Uploaded CPython 3.11+macOS 11.0+ ARM64

cocoindex-1.0.0a23-cp311-abi3-macosx_10_12_x86_64.whl (6.2 MB view details)

Uploaded CPython 3.11+macOS 10.12+ x86-64

cocoindex-1.0.0a23-cp310-cp310-manylinux_2_28_x86_64.whl (6.3 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

cocoindex-1.0.0a23-cp310-cp310-manylinux_2_28_aarch64.whl (6.1 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file cocoindex-1.0.0a23.tar.gz.

File metadata

  • Download URL: cocoindex-1.0.0a23.tar.gz
  • Upload date:
  • Size: 267.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: maturin/1.12.6

File hashes

Hashes for cocoindex-1.0.0a23.tar.gz
Algorithm Hash digest
SHA256 d50ba0a5a5a215ef76bad8031e7294cd9b76d10b397e6e714da542a373fff1c4
MD5 0cd39744ef07d3ebe9ba8f9d5b8b6ee3
BLAKE2b-256 e950d77829abff291473a858e84ed49c80f24fbf9354193090d12a23bf4eb415

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a23-cp314-cp314t-win_amd64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a23-cp314-cp314t-win_amd64.whl
Algorithm Hash digest
SHA256 cbb762d7d5ac15f358f2fa23fdc9485026f8f6f2542d20edcbf27680339137ba
MD5 7d7871e7c8a3563a8d5423a5cc83c7c7
BLAKE2b-256 1e2f1437c801b1a7c6c669f848cc097b4ef3819a6ab7d2347f50d1e88ef2f0af

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a23-cp314-cp314t-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a23-cp314-cp314t-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 d28c6d5b7faf1226ba04bbd8b84832b9c2b0502ca1a1d52ec2e9e19fdf87ebf4
MD5 590627f88e4106693821b3f9204da287
BLAKE2b-256 7771166aee5d12a1c0f9afa84243021c9aaec40604170f9aa0d1b106e7934b29

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a23-cp311-abi3-win_amd64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a23-cp311-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 8532c4aa0d599e116ee49119539f978dc3f9521c3edb0666fd86a00abddd33fe
MD5 2bd1bd9d0e3cf0e26de03f111f001d94
BLAKE2b-256 cb4856c59d5199c81bf9a733f8b49159758f00c097010f0723036762a7ba5006

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a23-cp311-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a23-cp311-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 ae27c255f4159935d6452befd52ee8e68c358dee9e18edd252094f51cd1c67f2
MD5 6ed26d7bb055baee17bddb8c1664144e
BLAKE2b-256 c13f7ddb705b665280455794011dc9200f357df767075396117662a40f3ce4fd

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a23-cp311-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a23-cp311-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 2c2e31364e78c15c8ebc6a7234fe1d211d8c36ef1b7a5bc6281c1033eab36ee0
MD5 c9c275ff52cc0231d9780f5f7585cb03
BLAKE2b-256 417a5de94e37c2e32bcbb1a5022b9c5fbfc8957f601b9478109de5f016c0bc9d

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a23-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a23-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 16cee49182fb7b58eab669ed9fc9069bafc4396e53150634ffc23fd55a5fcc1f
MD5 ec1548aecc1931115862dd8e0521ce74
BLAKE2b-256 3aa4e8dabf1e0d085fb9325b17423e38033e2c348f7dfa5bcb2f22ed7188de84

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a23-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a23-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 c3c97b296ef3d1444248ce9584cbfdeab3afdf3d4ef7300cdce294e253c17423
MD5 fd0767f11ba4e9683833a3d17a9e615c
BLAKE2b-256 e37ebce8c659c4c78d574b0614452f8b44d53cd89afe0cfff34242796d2ab00f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page