Skip to main content

Inject pandas DataFrames into Excel templates with table resizing and pivot refresh.

Project description

pivoteer

CI PyPI License: MIT

pivoteer injects pandas DataFrames into existing Excel templates by editing the underlying XML. It resizes Excel Tables (ListObjects) and forces PivotTables to refresh on open without corrupting pivot caches.

Why pivoteer

Most Python Excel libraries rewrite workbooks, which can break PivotTables, filters, and formatting in real-world templates. pivoteer is designed for enterprise reporting workflows where templates are authored in Excel and must remain intact. It surgically updates only the table data and table metadata so PivotTables remain connected and refresh correctly.

Installation

pip install pivoteer

Quick Start

from pathlib import Path
import pandas as pd

from pivoteer.core import Pivoteer

pivoteer = Pivoteer(Path("template.xlsx"))

df = pd.DataFrame(
    {
        "Category": ["Hardware", "Software"],
        "Region": ["North", "South"],
        "Amount": [120.0, 250.0],
        "Date": ["2024-01-01", "2024-01-02"],
    }
)

pivoteer.apply_dataframe("DataSource", df)
pivoteer.save("report_output.xlsx")

Architecture Overview

  • Input/output: .xlsx files are ZIP archives containing OpenXML parts.
  • Data injection: updates xl/worksheets/sheetN.xml row data using inline strings to avoid touching sharedStrings.xml.
  • Table resizing: updates xl/tables/tableN.xml by recalculating the ref range based on the DataFrame shape.
  • Pivot refresh: sets refreshOnLoad="1" in xl/pivotCache/pivotCacheDefinitionN.xml when present.
  • Pivot cache field sync (opt-in): appends missing cache field entries for table columns so new headers appear in existing PivotTables.

Features

  • Surgical Data Injection: updates worksheet XML without touching sharedStrings.
  • Table Resizing: recalculates ListObject ranges to match injected data.
  • Pivot Preservation: sets pivot caches to refresh on load when present.
  • Optional Pivot Cache Field Sync: appends missing cache field metadata for new table columns without touching PivotTable layouts.
  • Minimal IO: stream-based ZIP copy-and-replace for stability.

Pivot Cache Field Sync

When new columns are added to an Excel Table, existing PivotTables often fail to show the new fields until the PivotCache metadata is updated. pivoteer can synchronize PivotCache field definitions so new table columns appear in the PivotTable field list.

What pivoteer does:

  • Syncs PivotCache field metadata for the target table.
  • Appends missing cache fields so new columns are visible in the PivotTable UI.

What pivoteer does not do:

  • Does not create PivotTables.
  • Does not modify PivotTable layouts or filters.
  • Does not touch slicers or formatting.

Usage Patterns

Multiple table updates

from pivoteer.core import Pivoteer
import pandas as pd

p = Pivoteer("template.xlsx")
p.apply_dataframe("SalesData", pd.read_csv("sales.csv"))
p.apply_dataframe("CostData", pd.read_csv("costs.csv"))
p.save("report_output.xlsx")

Opt-in pivot cache field sync

from pivoteer.core import Pivoteer
import pandas as pd

p = Pivoteer("template.xlsx", enable_pivot_field_sync=True)
p.apply_dataframe("RawData", pd.read_csv("usage.csv"))
p.save("report_output.xlsx")

This flag is optional; when it is not set, pivoteer behaves exactly as before.

Advanced usage with TemplateEngine

from pathlib import Path
import pandas as pd

from pivoteer.template_engine import TemplateEngine

engine = TemplateEngine(Path("template.xlsx"))
engine.apply_dataframe("RawData", pd.read_csv("usage.csv"))
engine.sync_pivot_cache_fields()
engine.ensure_pivot_refresh_on_load()
parts = engine.get_modified_parts()

Low-level XML access

For custom XML inspection or modification, read_xml_part reads any XML part from an Excel archive:

import zipfile
from pivoteer.xml_engine import read_xml_part

with zipfile.ZipFile("template.xlsx", "r") as archive:
    tree = read_xml_part(archive, "xl/workbook.xml")
    print(tree.getroot().tag)

Supported data types

pivoteer handles the following DataFrame value types when injecting rows:

Type Excel representation
int, float Numeric cell (<v>)
str Inline string (<is><t>)
datetime.date, datetime.datetime Inline string (ISO 8601)
None, NaN, NaT Empty cell (no children)

Large datasets

pivoteer is optimized for replacing table data without rewriting the entire workbook. It is a good fit for large tables where preserving PivotTables and filters matters more than Excel formatting for each row.

Safety Guarantees

  • Opt-in only: the feature is disabled unless explicitly enabled.
  • Only missing cache fields are appended.
  • Existing cache field order is preserved.
  • PivotTable definitions are not modified.

Limitations

  • The PivotCache source must reference the named Excel Table.
  • The template must already contain PivotTables and pivot caches.
  • The structured table must exist and be the PivotTable cache source.
  • pivoteer does not auto-refresh the Excel UI; Excel recalculates pivots on open.

Compatibility

  • Python: 3.10+
  • Excel: Desktop Excel (Windows/macOS) supports refreshOnLoad for PivotTables.
  • Templates: Must include Excel Tables (ListObjects) with stable names.

Troubleshooting

  • "Table not found": Ensure the Excel Table name matches exactly.
  • "Pivot cache not found": The template may not include a PivotTable; this is expected for synthetic templates.
  • "DataFrame is empty": pivoteer refuses empty payloads to protect templates.

Support and Requests

  • Bugs: open a GitHub issue using the Bug Report template.
  • Feature requests: open a GitHub issue using the Feature Request template.
  • Security: follow the reporting process in SECURITY.md.

Security

If you discover a vulnerability, please read SECURITY.md for reporting instructions.

Development

python -m venv .venv
source .venv/bin/activate
pip install -e .[dev]
pytest

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pivoteer-0.2.1.tar.gz (22.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pivoteer-0.2.1-py3-none-any.whl (14.2 kB view details)

Uploaded Python 3

File details

Details for the file pivoteer-0.2.1.tar.gz.

File metadata

  • Download URL: pivoteer-0.2.1.tar.gz
  • Upload date:
  • Size: 22.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pivoteer-0.2.1.tar.gz
Algorithm Hash digest
SHA256 11d20e48e4eefcb127219d2032f9e0d54fc98a3b2e72225a2b26a96f8b61b289
MD5 7b12425c498a4c84e3ff478365561345
BLAKE2b-256 318ffab6202f58254221570e92da92ec08c839bab8f455c42062f3e14260757d

See more details on using hashes here.

Provenance

The following attestation bundles were made for pivoteer-0.2.1.tar.gz:

Publisher: release.yml on flitzrrr/pivoteer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pivoteer-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: pivoteer-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 14.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pivoteer-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 39af0cd80f135f4fa486cf75eb8fc4c092b9a96aa517869e3c07be9af82e1c98
MD5 25be8a35ce520ce5372e13bd4916fa21
BLAKE2b-256 6d6b64a334368f045f17b19dbd49bba177738ef207168790a23d7283e03c1f17

See more details on using hashes here.

Provenance

The following attestation bundles were made for pivoteer-0.2.1-py3-none-any.whl:

Publisher: release.yml on flitzrrr/pivoteer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page