Skip to main content

Lightweight Word report generation library based on python-docx

Project description

docxspec

English | 简体中文

PyPI Python License CI Publish

docxspec is a structured Word report generation library built on top of python-docx.

It provides a small, template-driven API for generating .docx reports from user-supplied templates and containerized content blocks. It is designed for automated test reports, simulation reports, and other engineering document workflows.

Features

  • Template-based Word report generation
  • Structured content container API
  • Text, image, and table insertion
  • Rich text style helpers
  • Automatic figure and table caption numbering
  • Page field helpers such as PAGE and NUMPAGES
  • Word field refresh helpers for TOC, figure lists, table lists, and captions
  • KL-standard document classification, validation, and standard document assembly
  • PyPI-ready packaging and CI/CD workflows

Installation

Install from PyPI:

pip install docxspec

Or install from source:

git clone https://github.com/Poggi-Tang/DocxSpec.git
cd DocxSpec
pip install -e .

Quick Start

from docxspec import BODY_STYLE, WordAPI, make_rich_text

api = WordAPI("your_template.docx")

text = make_rich_text(
    "This text is inserted into the template.",
    BODY_STYLE,
)

image_container = api.new_container()
image_container.add_image(
    "your_image.png",
    width_cm=8.0,
    align="center",
)

table_container = api.new_container()
table_container.add_table_by_config(
    {
        "data": [
            ["Name", "Value"],
            ["Example", "123"],
        ]
    }
)

api.render(
    {
        "text": text,
        "image": image_container.subdoc,
        "table": table_container.subdoc,
    },
    "report.docx",
)

How It Works

docxspec uses two main ideas:

  1. WordAPI loads a .docx template and renders the final output.
  2. Containers are used to build structured sub-document blocks such as images, tables, captions, and styled paragraphs.

A typical workflow is:

  1. Prepare a Word template
  2. Build text or container content
  3. Pass the rendered blocks into api.render(...)
  4. Export the final report

Template Requirements

docxspec is template-driven.

When preparing your own template, make sure that:

  • the template structure matches the placeholders or render targets used by your code
  • the required paragraph / table styles exist in the template
  • custom report styles are defined in advance if your project depends on them

The demo template in Demo/template.docx is provided as a runnable example, but demo assets are not packaged into the published wheel. The KL master template used by build_standard_docx() is bundled separately as package data.

KL Standard Documents

docxspec also provides KL-standard document utilities for workflows that start from an existing body-only .docx file and need a standardized output document.

from docxspec import build_standard_docx, check_word_standard, classify_docx_body

report = classify_docx_body("body.docx")
build_result = build_standard_docx(
    "body.docx",
    "standard.docx",
    "Demo/template.docx",
)
check_result = check_word_standard("standard.docx")

These APIs reuse the KL styles and field-code rules from the master template. They are intended for document standardization and validation, while WordAPI remains the primary API for template-driven report generation.

Refreshing Word Fields

docxspec can mark a generated .docx so Word/WPS refreshes fields when the document is opened. On Windows with Microsoft Word installed, it can also refresh fields immediately through Word COM automation.

from docxspec import refresh_docx_fields

result = refresh_docx_fields("report.docx", use_word=True)
print(result.word_refreshed, result.error)

Install the optional Word automation dependency when immediate COM refresh is needed:

pip install "docxspec[word]"

Without Microsoft Word or pywin32, refresh_docx_fields() still writes the updateFields flag so the fields can update when the file is opened.

Demo Directory

The repository includes a runnable demo set in Demo/.

Shared template:

  • Demo/template.docx

Example scripts:

  • Demo/demo1_paragraph.py
  • Demo/demo2_container_paragraph.py
  • Demo/demo3_container_image_caption.py
  • Demo/demo4_container_table_caption.py
  • Demo/demo5_container_table_image_caption.py
  • Demo/demo6_header_footer.py
  • Demo/demo7_styles_in_container.py
  • Demo/demo8_all_in_one.py
  • Demo/demo9_block_template.py
  • Demo/demo10_kl_standard.py

Run them from the repository root, for example:

python Demo/demo1_paragraph.py
python Demo/demo8_all_in_one.py
python Demo/demo9_block_template.py
python Demo/demo10_kl_standard.py

Generated files are written to Demo/output/.

Reusing Word Blocks

BlockTemplate can extract a repeated caption + table block from an existing Word template and insert cloned copies back into a document or a container. This is useful when the template already contains field captions, merged cells, images, or carefully prepared table styles.

Place a marker paragraph immediately before the source caption and table:

{{SOURCE_TABLE_BLOCK}}
Table <SEQ field> {{TABLE_TITLE}}
<table with placeholders>

Then extract and reuse the block:

from docxspec import WordAPI

api = WordAPI("template.docx")
block_template = api.extract_table_block("{{SOURCE_TABLE_BLOCK}}", remove_block=True)

container = api.new_container()
container.add_block(
    block_template.clone().replace_text(
        {
            "{{TABLE_TITLE}}": "Default icon display",
            "{{ITEM_ID}}": "FT_Demo_UI_001",
        }
    )
)

api.render({"result": container.subdoc}, "report.docx")

For documents that must preserve Word field captions as strictly as possible, prefer insert_block_at_marker(...) followed by saving the document directly.

Project Structure

DocxSpec
├── .github/
│   └── workflows/
├── Demo/
├── src/
│   └── docxspec/
│       ├── __init__.py
│       ├── word_api.py
│       └── word_styles.py
├── tests/
├── CHANGELOG.md
├── CONTRIBUTING.md
├── LICENSE
├── README.md
├── README.zh-CN.md
└── pyproject.toml

Release Automation

This repository is prepared for a professional Python package workflow:

  • CI runs lint and tests on push and pull request
  • Trusted Publishing publishes to PyPI from GitHub Actions
  • Build artifacts include both source distribution and wheel

Development

pip install -e .[dev]
pytest
ruff check .

When developing locally, prepare your own .docx template files in the repository or project workspace and pass their paths explicitly to WordAPI.

License

MIT License. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docxspec-0.0.8.tar.gz (105.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

docxspec-0.0.8-py3-none-any.whl (89.7 kB view details)

Uploaded Python 3

File details

Details for the file docxspec-0.0.8.tar.gz.

File metadata

  • Download URL: docxspec-0.0.8.tar.gz
  • Upload date:
  • Size: 105.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for docxspec-0.0.8.tar.gz
Algorithm Hash digest
SHA256 8c6098428a74e19171ef2c5069d1c8d60905e15fa46db6a3b325aaeea7c3bf67
MD5 76718a13dd7a8a9f2debb15bf658b086
BLAKE2b-256 77767a845bc16c8df49b176e39c4579318d29085a013f244fa1bf22df8061db2

See more details on using hashes here.

Provenance

The following attestation bundles were made for docxspec-0.0.8.tar.gz:

Publisher: publish.yml on Poggi-Tang/DocxSpec

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file docxspec-0.0.8-py3-none-any.whl.

File metadata

  • Download URL: docxspec-0.0.8-py3-none-any.whl
  • Upload date:
  • Size: 89.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for docxspec-0.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 1dfaae1d6a40d62f5b248b321b83090380bbdbca44fd5823aa5d3648bcccfcc4
MD5 09f24d54f18c46b9832c006104b4f86d
BLAKE2b-256 755bd8be329a619567bf0eea0612159ec83ccbafe7f1fbff695dfaf603c09b82

See more details on using hashes here.

Provenance

The following attestation bundles were made for docxspec-0.0.8-py3-none-any.whl:

Publisher: publish.yml on Poggi-Tang/DocxSpec

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page