Skip to main content

With CocoIndex, users declare the transformation, CocoIndex creates & maintains an index, and keeps the derived index up to date based on source update, with minimal computation and changes.

Project description

CocoIndex

Data transformation for AI

GitHub Documentation License PyPI version

PyPI Downloads CI release Link Check Discord

cocoindex-io%2Fcocoindex | Trendshift

Ultra performant data transformation framework for AI, with core engine written in Rust. Support incremental processing and data lineage out-of-box. Exceptional developer velocity. Production-ready at day 0.

⭐ Drop a star to help us grow!


CocoIndex Transformation


CocoIndex makes it effortless to transform data with AI, and keep source data and target in sync. Whether you’re building a vector index, creating knowledge graphs for context engineering or performing any custom data transformations — goes beyond SQL.


CocoIndex Features


Exceptional velocity

Just declare transformation in dataflow with ~100 lines of python

# import
data['content'] = flow_builder.add_source(...)

# transform
data['out'] = data['content']
    .transform(...)
    .transform(...)

# collect data
collector.collect(...)

# export to db, vector db, graph db ...
collector.export(...)

CocoIndex follows the idea of Dataflow programming model. Each transformation creates a new field solely based on input fields, without hidden states and value mutation. All data before/after each transformation is observable, with lineage out of the box.

Particularly, developers don't explicitly mutate data by creating, updating and deleting. They just need to define transformation/formula for a set of source data.

Plug-and-Play Building Blocks

Native builtins for different source, targets and transformations. Standardize interface, make it 1-line code switch between different components - as easy as assembling building blocks.

CocoIndex Features

Data Freshness

CocoIndex keep source data and target in sync effortlessly.

Incremental Processing

It has out-of-box support for incremental indexing:

  • minimal recomputation on source or logic change.
  • (re-)processing necessary portions; reuse cache when possible

Quick Start

If you're new to CocoIndex, we recommend checking out

Setup

  1. Install CocoIndex Python library

Note: CocoIndex v1 is currently in preview (pre-release). Use the --pre flag with pip, or configure your package manager to allow pre-releases.

pip install -U --pre cocoindex
  1. Install Postgres if you don't have one. CocoIndex uses it for incremental processing.

  2. (Optional) Install Claude Code skill for enhanced development experience. Run these commands in Claude Code:

/plugin marketplace add cocoindex-io/cocoindex-claude
/plugin install cocoindex-skills@cocoindex

📖 Documentation

For detailed documentation, visit CocoIndex Documentation, including a Quickstart guide.

🤝 Contributing

We love contributions from our community ❤️. For details on contributing or running the project for development, check out our contributing guide.

👥 Community

Welcome with a huge coconut hug 🥥⋆。˚🤗. We are super excited for community contributions of all kinds - whether it's code improvements, documentation updates, issue reports, feature requests, and discussions in our Discord.

Join our community here:

Support us

We are constantly improving, and more features and examples are coming soon. If you love this project, please drop us a star ⭐ at GitHub repo GitHub to stay tuned and help us grow.

License

CocoIndex is Apache 2.0 licensed.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cocoindex-1.0.0a5.tar.gz (191.5 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cocoindex-1.0.0a5-cp311-abi3-win_amd64.whl (5.9 MB view details)

Uploaded CPython 3.11+Windows x86-64

cocoindex-1.0.0a5-cp311-abi3-manylinux_2_28_x86_64.whl (5.8 MB view details)

Uploaded CPython 3.11+manylinux: glibc 2.28+ x86-64

cocoindex-1.0.0a5-cp311-abi3-manylinux_2_28_aarch64.whl (5.6 MB view details)

Uploaded CPython 3.11+manylinux: glibc 2.28+ ARM64

cocoindex-1.0.0a5-cp311-abi3-macosx_11_0_arm64.whl (5.9 MB view details)

Uploaded CPython 3.11+macOS 11.0+ ARM64

cocoindex-1.0.0a5-cp311-abi3-macosx_10_12_x86_64.whl (5.7 MB view details)

Uploaded CPython 3.11+macOS 10.12+ x86-64

File details

Details for the file cocoindex-1.0.0a5.tar.gz.

File metadata

  • Download URL: cocoindex-1.0.0a5.tar.gz
  • Upload date:
  • Size: 191.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: maturin/1.11.5

File hashes

Hashes for cocoindex-1.0.0a5.tar.gz
Algorithm Hash digest
SHA256 3b4354a399ea45e7ea42332c7a4931c3dfa146664dbaabae2f87217a78301265
MD5 e7919a4a91a7dffee281d00904f1119a
BLAKE2b-256 b910f0e5fb4db390e41826c5fc2fc8775bf496e425f9f541076adcd689dd4d89

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a5-cp311-abi3-win_amd64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a5-cp311-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 067117572088533c5749abe5036851a03d188363ac3dfe6552084cc2a93684c4
MD5 32eb7d1955637984a8cc11cbabdd0271
BLAKE2b-256 d64c42ed06e1ff5779bf20232bd2927e379d5f52be8208c0e94229e203513eff

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a5-cp311-abi3-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a5-cp311-abi3-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ef4e6e2facad7f24933ded333acc0374d57899b01c2bd88a1a504313e92c6262
MD5 7c030c4de874af7a11a244735deb474d
BLAKE2b-256 4864a88d4d8652ac19120e50bae5ac66a9b29822853e254f1b2821c728243244

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a5-cp311-abi3-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a5-cp311-abi3-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 e81aa90a3f2fcea1968985460521515cdbaf55b1a36511306a131ea704fd6d6f
MD5 2a5f7d5c405d5cbe971e6b3c75ab53c6
BLAKE2b-256 be6df2a3f7373e413387caf9167550f92f97b724806ba8146f80538ffff8e169

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a5-cp311-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a5-cp311-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 0e4eda2dae59b890eced58c8252e28b285a741311f0b4cc574160e4908df277a
MD5 930c39ad978ba00a66eaa924c3ccdcdf
BLAKE2b-256 7f501ab88e4a1101657bc8fcb882577ebf92f3d23a9f8d301ee58508afd456fb

See more details on using hashes here.

File details

Details for the file cocoindex-1.0.0a5-cp311-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for cocoindex-1.0.0a5-cp311-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 a9036ef46b364d314475562b9e840656a3cf08e7971e371a307a8a4ae52b1c1d
MD5 a5a5c671ab52f833d80d64584f862839
BLAKE2b-256 5fdf9b15227823b62dd4ae19a3b4ac8b8f9dc78045ad735e2fcc6320ce10e3e4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page