Object-oriented documentation toolkit for authoring DOCX, PDF, and HTML documents from Python object trees.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

oodocs

OODocs is an Object-Oriented Documentation Tool: a Python-first authoring toolkit for defining structured documents as ordinary objects and rendering the same source to DOCX, PDF, and HTML.

It is aimed at report, documentation, and manuscript workflows where content already lives near Python data, figures, and scripts. Instead of treating a document as a string template or a markup stream, OODocs keeps the source of record as a typed object tree.

Object-Oriented Documentation

In OODocs, Document owns the artifact, DocumentSettings owns document-wide metadata and rendering defaults, and blocks such as Chapter, Section, Paragraph, Table, Figure, Box, and CitationSource carry document intent directly in Python.

That object model is the main design constraint:

the visible outline should match the Python tree
data-backed tables, generated figures, citations, comments, and cross-references should stay editable objects until render time
renderer-specific behavior should live behind DOCX, PDF, and HTML renderers rather than leaking into authoring code
the same source should be useful for writing, validating, reviewing, and releasing a document

Install

For normal use, install OODocs from PyPI:

pip install oodocs

To upgrade later:

pip install --upgrade oodocs

If you need the optional dependencies used by the bundled examples:

pip install "oodocs[examples]"

If you need YAML-backed repository adapters such as GitHub Actions workflow summaries:

pip install "oodocs[adapters]"

If you want to work from a repository checkout, run the bundled example scripts, or contribute locally:

git clone https://github.com/Gonie-Gonie/oo-docs.git
cd oo-docs
pip install -e .

If you want the optional example dependencies from a local checkout:

pip install -e ".[examples]"

For local development and tests:

pip install -e ".[dev]"

On Windows, the repository also includes a helper that creates .venv and installs the development dependencies from the checkout:

.\scripts\setup-repo.cmd

Quick Start

The smallest useful document is just a Document, one visible heading, one paragraph, and one save call. Start here before splitting code into helper functions:

from oodocs import Chapter, Document, DocumentSettings, Paragraph, Section, bold

report = Document(
    "Hello oodocs",
    Chapter(
        "Getting Started",
        Section(
            "Overview",
            Paragraph(
                "This document was defined with ",
                bold("Python objects"),
                ".",
            ),
        ),
    ),
    settings=DocumentSettings(metadata_author="OODocs"),
)

report.save("artifacts/hello.docx")
report.save("artifacts/hello.pdf")
report.save("artifacts/hello.html")

Document.save(...) chooses the renderer from the file extension. The explicit save_docx(...), save_pdf(...), and save_html(...) methods are still available when you want the output format to be obvious in code. Document metadata and renderer defaults live under DocumentSettings(...), so title matter and theme changes stay in one place.

When you want the normal review bundle in one call:

paths = report.save_all("artifacts")
print(paths["docx"], paths["pdf"], paths["html"])

Command Line

The installed package exposes a oodocs command for common build and conversion workflows:

oodocs build report.py --out artifacts
oodocs convert README.md --to docx,pdf,html
oodocs convert notebook.ipynb --to pdf
oodocs validate report.py

build expects a Python file that exposes a Document as document, doc, or report, or a zero-argument factory such as build_document(). Use --factory NAME when the document object or builder has a different name. convert imports Markdown and Jupyter notebooks through the same parser APIs available in Python. Both commands validate before rendering by default and stop before writing outputs when validation errors are found.

For CI and release evidence, oodocs validate --format json emits a machine-readable validation summary. oodocs build ... --fail-on-warning and oodocs convert ... --fail-on-warning treat validation warnings as failures, while --show-warnings prints warning tables without changing the default success path. Add --traceback to any command when debugging needs the full Python stack trace.

For imported Markdown or notebooks, oodocs convert ... --show-import-warnings prints diagnostics for lossy source features, and --strict-import fails on those diagnostics. Release evidence bundles can be generated with python -m oodocs.evidence build --out artifacts/evidence.

Why Not Just LaTeX?

OODocs is not trying to replace every LaTeX workflow. It is meant for documents where Python already owns the data, plots, citations, or release process and where collaborators may still need DOCX.

Common translations:

LaTeX \part -> Part(...) separator pages above chapters
LaTeX \section / \subsection -> Chapter(...), Section(...), Subsection(...)
LaTeX \textbf{...} / \emph{...} / \texttt{...} -> bold(...), italic(...), code(...)
LaTeX tag chips / compact inline labels -> tag(...), badge(...), status(...), and keyboard(...)
Word highlight / strikethrough / manual line break -> highlight(...), strike(...), line_break()
LaTeX \vspace{...} / \hrule and Notion-style separators -> VerticalSpace(...) or Divider(...)
LaTeX \includegraphics -> Figure(path_or_matplotlib_figure, caption=...)
LaTeX subfigures -> SubFigure(...) children inside a captioned SubFigureGroup(...)
LaTeX tabular or copied tables -> Table(...) or Table.from_dataframe(...)
LaTeX \label / \ref -> use reference(obj) or obj.reference() inside Paragraph(...)
LaTeX tcolorbox-style report panels -> editable Box(..., background_color=..., padding=...)
BibTeX-style references -> CitationLibrary, CitationSource.cite(...), and ReferencesPage()

The main payoff is fewer manual handoffs: a benchmark CSV can become a table, a matplotlib object can become a figure, and the same authored structure can render to DOCX for review, PDF for release, and HTML for lightweight sharing.

Authoring Model

OODocs tries to keep the source readable:

create objects with classes such as Document, Part, Chapter, Section, Paragraph, Table, and Figure
apply inline actions with helpers such as bold(...), italic(...), code(...), tag(...), badge(...), status(...), keyboard(...), Text.from_markup(...), Comment.annotated(...), Footnote.annotated(...), and CitationSource.cite()
import existing Markdown with parse_markdown(...), from_markdown(...), or Document.from_markdown(...) when release notes, README fragments, or generated Markdown should become editable OODocs objects
import Jupyter notebooks with parse_ipynb(...), from_ipynb(...), or Document.from_ipynb(...) when notebook markdown, code cells, and textual outputs should become OODocs blocks
keep the document tree explicit so the Python structure matches the final output structure
move document-wide metadata and theme options into DocumentSettings(...) when you want a single place to adjust title matter, cover pages, and renderer defaults

The default behavior is intentionally conventional:

paragraphs are justified by default
tables, figures, boxes, and their captions are centered by default
parts render on their own separator pages and do not reset chapter numbering
headings are numbered as 1, 1.1, 1.1.1, and so on
ordered and bullet lists can be customized with direct kwargs such as indent=..., marker_format=..., and bullet=...
heading numbering can be customized with HeadingNumbering(...)
article-style front matter can be left unnumbered with Section(..., numbered=False)

What To Use When

Use Paragraph(...) for prose. Pass strings and inline helpers directly; you do not need to pre-build Text(...) objects for normal writing.
Use tag(...), badge(...), status(...), and keyboard(...) for compact inline labels. They share the InlineChip(...) model; DOCX emits small inline images, while PDF and HTML keep styled text.
Use highlight(...), strike(...), and line_break() for Word-style emphasis and manual line breaks inside one paragraph.
Use Theme(paragraph_alignment=...) for the document-wide paragraph default, and direct paragraph kwargs such as alignment=... when one paragraph should override it.
Use Paragraph(left_indent=..., right_indent=..., first_line_indent=..., unit=...) when you need Word-like first-line or hanging indents. If unit is omitted, indent values follow DocumentSettings(unit=...).
Use subscript(...), superscript(...), and prescript(...) for ordinary prose. Use Math(...) or Equation(...) for lightweight LaTeX-style math, including ordinary x^2 / x_0 scripts and front scripts such as \prescript{14}{6}{C}.
Use Part(...) for book-like divisions above chapters; each part gets a separator page, while chapter numbers continue across parts by default.
Use Chapter(...), Section(...), Subsection(...), and Subsubsection(...) for the visible outline. Their nesting in Python should match how you expect the final document to read.
Use Document.add(...), Section.add(...), Box.add(...), and MultiColumn.add(...) when a longer report is easier to assemble step by step. Matching extend(...) methods accept iterables and use the same coercion rules as constructors.
Use reference(obj) or obj.reference() for cross-references to captioned media, headings, equations, paragraphs, code blocks, and boxes. Passing the raw object inside Paragraph(...) is rejected so insertion and citation are not confused.
Use Definition(...), Lemma(...), Proposition(...), Theorem(...), Corollary(...), Example(...), Remark(...), Assumption(...), Axiom(...), Claim(...), and Conjecture(...) for theorem-like blocks that share a document-wide counter; Proof(...) is unnumbered by default. Use countable_kind("Exercise", counter="exercise") to create custom countable block classes without subclassing, or pass counter="theorem" when a custom block should join the same sequence.
Use Table(...) for small authored tables, Table.from_records(...) or Table.from_mapping(...) for ordinary Python data, Table.from_csv(...) or Table.from_tsv(...) for delimited files, and Table.from_dataframe(...) when the data already lives in pandas.
Use TableStyle.plain(), TableStyle.compact(), or TableStyle.evidence() when several tables should share a preset without repeating style kwargs.
Use TableCell(horizontal_alignment=..., vertical_alignment=...) or table-wide Table(..., cell_horizontal_alignment=..., cell_vertical_alignment=...) when a table needs Word-like cell alignment. Use dictionaries in row_styles, header_row_styles, or column_styles when rows or columns need background color, text color, bold, or italic formatting.
Use Table(split=True) when a table should render in source order and may break across pages. Leave split=False when the table should stay together when possible; very long tables are automatically rendered as split repeated-header tables.
Use Figure(...) for image files or savefig()-compatible Python figure objects. Use Figure.from_bytes(...) or Figure.from_buffer(...) when image bytes are already in memory.
Use SubFigureGroup(SubFigure(...), SubFigure(...), caption=...) when related images should share one figure number and expose (a), (b), and similar subfigure references.
Use Theme(table_caption_label=..., table_reference_label=..., figure_caption_label=..., figure_reference_label=...) when captions and in-text references should use different labels such as Figure, Fig., or localized terms.
Use Theme(citation_format="apa", reference_format="apa") when inline citations and the generated references page should follow an author-year style. Numeric citation output remains the default.
Use advanced placement=... hints on tables and figures only when needed. Supported values include here, tbp/float, top, bottom, and page.
Use Box(...) for callouts, evidence panels, and tcolorbox-like report sections that should stay editable in Word.
Use Shape(...), TextBox(...), and ImageBox(...) with DocumentSettings(page_items=[...]) for page-positioned overlays that do not move the body text. Use placement="inline" when the same objects should sit in the text flow like Word's inline drawing mode.
Use DocumentSettings(...) for document-wide choices: authors, subtitle, page size, margins, units, page overlays, and theme defaults.
Use document.validate() when you want a structured preflight check before rendering. The returned ValidationResult prints as a compact table, can be serialized with to_dict() or to_json(), and each issue records whether it affects Word, PDF, HTML, or all outputs. save(...), save_docx(...), save_pdf(...), save_html(...), and save_all(...) validate before rendering by default and stop before writing when errors are found.
Use Document.from_markdown(...) when a Markdown file should become a full document. Use parse_markdown(...) when you want a list of blocks that can be inserted, reordered, wrapped in sections, or combined with tables and figures.
Use parse_markdown(..., diagnostics=True) or parse_markdown_file(..., diagnostics=True) when you need an ImportResult with import issues. Use import_policy="strict" when lossy Markdown features such as remote images should fail instead of being converted conservatively.
Use Document.from_markdown(..., numbered=False, toc=True) when imported headings should keep their source titles without generated numbers but still appear in a generated table of contents.
Use heading_level_shift=1 or heading_level_shift=-1 with Document.from_markdown(...) or parse_markdown(...) when imported Markdown should be demoted or promoted before it is inserted into a larger outline. Use shift_heading_levels(...) when already-created Chapter(...)/Section(...) objects need the same adjustment; paragraphs and other non-heading blocks stay unchanged, and shifts beyond the supported heading range fail.
Use Document.from_ipynb(...) when a notebook should become a full document. Use parse_ipynb(...) when notebook cells should be merged into a larger report; markdown cells become normal document structure, code cells become CodeBlock(...), and textual outputs can be included or filtered. Use NotebookImportOptions(...) for tag filtering, output truncation, image captions, and error-output policy.
Use oodocs.adapters when repository files should become document objects: section_from_pyproject(...), section_from_github_workflow(...), section_from_manifest(...), and build_release_evidence_document(...) are designed for release and audit reports.
Use document.save_all("artifacts") when a workflow normally needs DOCX, PDF, and HTML together.

Features

DOCX, PDF, and HTML rendering from the same document tree
Markdown and GitHub Flavored Markdown import for headings, paragraphs, lists, task-list markers, block quotes, fenced code, thematic breaks, tables, local images, links, autolinks, emphasis, inline code, strikethrough, and optional import diagnostics
Jupyter notebook import for markdown cells, code cells, raw cells, textual stream/result/error outputs, tag filtering, output truncation, image captions, and optional import diagnostics
block objects for paragraphs, lists, code blocks, equations, boxes, tables, figures, and generated pages
Pygments-backed syntax highlighting for code blocks across Python, JavaScript, SQL, YAML, shell, and other supported languages
editable report panels with Box(...) kwargs for width, alignment, title color, and per-side padding
portable comments and footnotes that stay stable across DOCX, PDF, and HTML
footnotes target page-bottom placement by default when the renderer supports it; Theme(footnote_placement="document") keeps the collected-notes pattern
captioned tables and figures with automatic numbering and in-text references
independent document-level labels for table/figure captions and in-text references
table support for TableCell(...), rowspan, colspan, banded rows, ordinary Python records, CSV/TSV files, dataframe-like inputs, and cell/row/column styling
automatic split-table rendering for long tables, with repeated headers where renderers support them
advanced table and figure placement hints for here/float/top/bottom/page-style workflows
figure support for stored image files, in-memory bytes, buffers, and savefig()-compatible Python objects
subfigure groups with automatic child labels and references such as Figure 1(a)
page-positioned Shape.rect(...), Shape.ellipse(...), Shape.line(...), TextBox(...), and ImageBox(...) objects with anchors to the page, the margin box, or an earlier named shape
inline drawing placement for Shape(...), TextBox(...), and ImageBox(...), similar to using an image directly in the document flow
inline chips through InlineChip(...), tag(...), badge(...), status(...), and keyboard(...)
bibliography support through CitationSource, CitationLibrary, direct citation objects, BibTeX import, and configurable inline/reference formats such as APA
optional title matter such as subtitle, structured Author(...) metadata, AuthorLayout(...), affiliations, and a cover page
inline hyperlinks and heading/caption anchors for cross-references
release evidence adapters for pyproject metadata, GitHub Actions workflows, JSON manifests, CSV/TSV evidence tables, checksums, and generated evidence reports

Example Scripts

The repository includes four standalone example directories:

examples/usage_guide_example/
examples/journal_paper_example/
examples/native_benchmark_report/
examples/release_notes_digest/

Run them directly from the repository checkout:

.\.venv\Scripts\python.exe .\examples\usage_guide_example\main.py
.\.venv\Scripts\python.exe .\examples\journal_paper_example\main.py
.\.venv\Scripts\python.exe .\examples\native_benchmark_report\main.py
.\.venv\Scripts\python.exe .\examples\release_notes_digest\main.py

Direct example scripts print slow major render steps. Imported build functions stay quiet by default; pass verbose=True when you want the same progress messages.

What they show:

usage_guide_example is a detailed guide that keeps almost all assembly in one main.py so the source stays easy to read; it now covers the core authoring model, validation, CLI workflows, theorem-like countable blocks, layout controls, imports, presets, and renderer differences
the usage guide includes Markdown and notebook import patterns that bring existing authored files into normal OODocs document objects
journal_paper_example shows a longer manuscript-style workflow with article-style sections, unnumbered abstract/highlights/acknowledgements, CSV-backed tables, and matplotlib figures inserted directly from Python objects
native_benchmark_report shows a compact Python-native workflow where a script generates an in-memory workload, benchmarks several callables, turns structured result objects into tables and prose, and exports one report bundle
release_notes_digest collects release-notes/*.md, sorts semantic versions from filenames, imports the Markdown bodies, and builds a release-note document with a version-management table and runbook

By default they write outputs under:

artifacts/usage-guide/
artifacts/journal-paper/
artifacts/native-benchmark-report/
artifacts/release-notes/

The main exported filenames are:

artifacts/usage-guide/oodocs-user-guide.pdf
artifacts/journal-paper/oodocs-development-philosophy.pdf
artifacts/native-benchmark-report/native-python-benchmark.pdf
artifacts/release-notes/oodocs-release-notes.pdf

Project Layout

The package is organized by responsibility:

src/oodocs/document.py for the root Document
src/oodocs/settings.py for DocumentSettings plus grouped configuration exports
src/oodocs/components/ for the concrete authoring model (base.py, blocks.py, equations.py, generated.py, inline.py, markup.py, media.py, people.py, positioning.py, and references.py)
src/oodocs/importers/ for adapters that convert external formats such as Markdown into OODocs objects
src/oodocs/adapters/ and src/oodocs/evidence.py for repository artefact adapters and release evidence bundles
src/oodocs/layout/ for low-level theme and indexing support
src/oodocs/renderers/docx.py, src/oodocs/renderers/pdf.py, and src/oodocs/renderers/html.py for format-specific layout

Development

Assuming Python 3.11 or newer is installed:

.\scripts\setup-repo.cmd

Or:

powershell -ExecutionPolicy Bypass -File .\scripts\setup-repo.ps1

That setup script creates .venv and installs .[dev]. If dependency metadata changes later, rerun the setup script or refresh the environment with:

.\.venv\Scripts\python.exe -m pip install -e ".[dev]"

Run tests:

.\.venv\Scripts\python.exe -m pytest

Build distribution artifacts:

.\.venv\Scripts\python.exe -m build

Releases

OODocs versions are derived from git tags through setuptools-scm.

Create and push a release tag like this:

.\scripts\release.ps1 1.0.3

That pushes v1.0.3, and the GitHub release workflow runs the test suite, builds the wheel/sdist artifacts, renders the release documents, builds the release evidence bundle, attaches those assets to the matching GitHub Release, and uploads the Python distributions to PyPI.

If you want a curated release body instead of GitHub's generated notes, add a file such as release-notes/v1.0.3.md before pushing the tag.

The examples/release_notes_digest/ script demonstrates the same convention as a document workflow: it scans the semantic-versioned Markdown files under release-notes/, builds an index, includes the version-management rules, and imports each release body into one DOCX/PDF/HTML bundle.

PyPI publishing uses Trusted Publishing through the pypi GitHub environment. The PyPI project or pending publisher must trust repository Gonie-Gonie/oo-docs, workflow .github/workflows/release.yml, and environment pypi.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Hyeong-Gon.Jo

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.0.4

Jun 10, 2026

This version

1.0.3

Jun 10, 2026

1.0.2

Jun 10, 2026

1.0.1

Jun 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oodocs-1.0.3.tar.gz (6.7 MB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

oodocs-1.0.3-py3-none-any.whl (160.5 kB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file oodocs-1.0.3.tar.gz.

File metadata

Download URL: oodocs-1.0.3.tar.gz
Upload date: Jun 10, 2026
Size: 6.7 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oodocs-1.0.3.tar.gz
Algorithm	Hash digest
SHA256	`b5a8ad525c4543f076532e8c11461ff760c67664f8328c720d96e3b0200162f7`
MD5	`5802e5c90f70e10190d2919e62f8e105`
BLAKE2b-256	`910d4758d09901053e5b6e83ecb8ea4eb21312f03356b246001705d85c217db1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oodocs-1.0.3.tar.gz:

Publisher: release.yml on Gonie-Gonie/oo-docs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oodocs-1.0.3.tar.gz
- Subject digest: b5a8ad525c4543f076532e8c11461ff760c67664f8328c720d96e3b0200162f7
- Sigstore transparency entry: 1772793209
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: Gonie-Gonie/oo-docs@2d8f0ae824359cc790be506463e13cbad2e8db07
- Branch / Tag: refs/tags/v1.0.3
- Owner: https://github.com/Gonie-Gonie
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@2d8f0ae824359cc790be506463e13cbad2e8db07
- Trigger Event: push

File details

Details for the file oodocs-1.0.3-py3-none-any.whl.

File metadata

Download URL: oodocs-1.0.3-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 160.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oodocs-1.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2959aa2511fd2ce8d7cd5dcd811f5c289ac71dc7aed836fd0b4f95b5fcc8f87b`
MD5	`17cc5c03da4aa9a6954a0a0c9043a7cd`
BLAKE2b-256	`8675fdd03ffeb23bf7e3226477677b244b7bf715531cce858c7a9d48776aca15`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oodocs-1.0.3-py3-none-any.whl:

Publisher: release.yml on Gonie-Gonie/oo-docs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oodocs-1.0.3-py3-none-any.whl
- Subject digest: 2959aa2511fd2ce8d7cd5dcd811f5c289ac71dc7aed836fd0b4f95b5fcc8f87b
- Sigstore transparency entry: 1772793397
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: Gonie-Gonie/oo-docs@2d8f0ae824359cc790be506463e13cbad2e8db07
- Branch / Tag: refs/tags/v1.0.3
- Owner: https://github.com/Gonie-Gonie
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@2d8f0ae824359cc790be506463e13cbad2e8db07
- Trigger Event: push

oodocs 1.0.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

oodocs

Object-Oriented Documentation

Install

Quick Start

Command Line

Why Not Just LaTeX?

Authoring Model

What To Use When

Features

Example Scripts

Project Layout

Development

Releases

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance