Skip to main content

Monkey-patches Products.CMFCore to optimize catalog operations on object moves

Project description

experimental.catalogmoveopt

Plone add-on that optimizes catalog operations when content is moved or renamed, preserving Record IDs (RIDs) and reindexing only the indexes that actually change.

The problem

When an object is moved or renamed in stock Plone, Products.CMFCore fires a full catalog cycle: unindex at the old path, reindex everything at the new path. For large content trees this is expensive because every index is recomputed even though most of them (title, description, content body, …) have not changed at all. It also assigns a new RID to the object, which can invalidate in-flight catalog references.

How it works

The add-on monkey-patches Products.CMFCore at Zope startup (via an IProcessStarting subscriber) to replace the stock handleContentishEvent with an optimized version.

On a true object move (old parent ≠ new parent, i.e. cut-paste):

  1. IObjectWillBeMovedEvent — instead of calling unindexObject(), the object's current physical path is saved in the transaction-local registry keyed by its ZODB _p_oid. The catalog entry is left untouched.
  2. IObjectMovedEvent — the saved old path is retrieved, and CatalogTool.moveObject() (injected by this add-on) is called. It remaps old_path → same RID → new_path in the catalog's internal BTree structures, then calls reindexObject() with only the context-aware indexes.

The net result: the RID is preserved, only the path-dependent and security-dependent indexes are recomputed, and the full reindex of expensive text/metadata indexes is skipped entirely.

For renames (same parent, new id) the same path is followed — the object stays in the same container, only its path and id change.

For all other event types (add, copy, delete) the behaviour is identical to stock CMFCore.

Transaction-local path registry

The old path is stored via transaction.set_data() / transaction.data(), keyed by a stable module-level singleton object. This avoids the _v_ volatile attribute pattern, which is vulnerable to ZODB cache ghostification: for large subtrees, objects can be evicted from the ZODB cache between the WillBeMoved and Moved event phases, causing silent fallback to a full reindex. Transaction-attached data lives outside the ZODB object graph and is discarded automatically on commit or abort.

Context-aware indexes

Only indexes whose values change when an object moves need to be reindexed. This add-on ships with two built-in providers:

Provider name Indexes
cmf.location path, getId, id
cmf.security allowedRolesAndUsers

Third-party packages can contribute additional indexes by registering a named utility providing IContextAwareIndexProvider:

<!-- my.package/configure.zcml -->
<utility
    provides="experimental.catalogmoveopt.interfaces.IContextAwareIndexProvider"
    name="my.package.myindex"
    component=".providers.MyIndexProvider"
    />
# my.package/providers.py
from zope.interface import implementer
from experimental.catalogmoveopt.interfaces import IContextAwareIndexProvider

@implementer(IContextAwareIndexProvider)
class MyIndexProvider:
    def getIndexNames(self):
        return ("my_custom_index",)

If no providers are registered the optimization is disabled and the stock full-reindex path is used as a safe fallback.

Installation

Add experimental.catalogmoveopt to your Plone backend's dependencies:

# pyproject.toml
dependencies = [
    ...
    "experimental.catalogmoveopt",
]

No further configuration is required. The add-on uses z3c.autoinclude.plugin so its ZCML is loaded automatically when installed in a Plone site.

Compatibility

Plone Python
6.0 3.10, 3.11
6.1 3.10, 3.11, 3.12
6.2 3.10, 3.11, 3.12, 3.13

Development

git clone git@github.com:RedTurtle/experimental.catalogmoveopt.git
cd experimental.catalogmoveopt
make install
make test

Prior art and upstream discussion

This add-on exists as a monkey-patch package while the optimization makes its way into the Plone/CMFCore ecosystem proper. Key references:

  • 4teamwork/ftw.copymovepatches — the original proof-of-concept for Plone 4.3 that demonstrated the approach. A real-world benchmark reported an 80-second move of a folder with ~300 files dropping to ~8 seconds (~10× speedup).

  • plone/Products.CMFPlone#3834 — David Glick's draft experiment bringing the same optimization to Plone 6, using ftw.copymovepatches as the starting point. The linked comment explicitly requests that the fix land in CMFCore rather than as a monkey-patch in CMFPlone.

  • zopefoundation/Products.CMFCore#161 — the upstream CMFCore pull request (by the author of this package) that proposes adding CatalogTool.moveObject() and the IContextAwareIndexProvider interface directly to CMFCore. Once merged, this add-on will become unnecessary.

Contribute

License

The project is licensed under GPLv2.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

experimental_catalogmoveopt-1.0.0a0.tar.gz (23.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

experimental_catalogmoveopt-1.0.0a0-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file experimental_catalogmoveopt-1.0.0a0.tar.gz.

File metadata

File hashes

Hashes for experimental_catalogmoveopt-1.0.0a0.tar.gz
Algorithm Hash digest
SHA256 efe87d1571be62aee3acbbb7554fd7c99c21e204d7c473d58855c78fda5c19f1
MD5 a78070d64c347e0e776ae5fa40a723e9
BLAKE2b-256 4288c2ef87b00ece85f2ffbc3baea58714a3c355cd6158847dba7b6c6ae70ea6

See more details on using hashes here.

File details

Details for the file experimental_catalogmoveopt-1.0.0a0-py3-none-any.whl.

File metadata

File hashes

Hashes for experimental_catalogmoveopt-1.0.0a0-py3-none-any.whl
Algorithm Hash digest
SHA256 9e95d3b5d231015d6bc516cfd244c7570a90778794682832238928bfc39178b8
MD5 ab9cf84a9613cf8329865110c084a84e
BLAKE2b-256 f31e149a540b9931d3561b1234fcb9cec98f731a6c84d1cc336b7a1bcdcee24a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page