Skip to main content

Browse and diff Microsoft Office .docx, .xlsx, and .pptx files.

Project description

opc-diag is a command-line application for exploring Microsoft Word, Excel, and PowerPoint files from Office 2007 and later. Also known as Office Open XML, the structure of these files adheres to the Open Packaging Convention (OPC), specified by ISO/IEC 29500.

opc-diag provides the opc command, which allows OPC files to be browsed, diff-ed, extracted, repackaged, and parts from one to be substituted into another.

Its primary use is by developers of software that generates and/or manipulates Microsoft Office documents.

A typical use would be diff-ing a Word file from before and after an operation, say inserting a paragraph, to identify the specific changes Word made to the XML. This is handy when one is developing software to do the same without Word’s help:

$ opc diff before.docx after.docx

Another main use is to diagnose an issue causing an Office document to not load cleanly, typically because the software that generated it has a bug. These problems can be tedious and often difficult to diagnose without tools like opc-diag, and were the primary motivation for developing it.

More information is available in the opc-diag documentation.

History

0.9.0 (2013-09-23)

  • Initial release – supporting browse, diff, diff-item, extract, repackage, and substitute subcommands.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

opc-diag-0.9.6.tar.gz (16.6 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page