
docx (OOXML) to html converter

Project description


PyDocX is a parser that breaks down the elements of an Office Open XML (.docx) document and converts them into other markup languages. Currently, only HTML is supported; Markdown and LaTeX are planned for the future. You can extend any of the available parsers to customize its output. You can also create your own class that inherits from DocxParser and implements the methods needed for a markup language that is not yet supported.

To get started using PyDocX, see the Usage guide and also Extending PyDocX.
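
The extension idea described above (a base parser that walks the document and delegates markup decisions to overridable methods) can be sketched without PyDocX itself. Everything in this example is hypothetical: SketchParser, its method names, and the simplified (kind, runs) block structure are assumptions made for illustration and are not PyDocX's actual DocxParser interface, which is documented in the Extending PyDocX guide.

```python
import html


class SketchParser:
    """Hypothetical base class (not PyDocX's DocxParser).

    It walks a simplified document: a list of (kind, runs) blocks, where
    runs is a list of (text, is_bold) pairs, and asks hook methods for
    the markup of each piece.
    """

    def parse(self, blocks):
        rendered = []
        for kind, runs in blocks:
            inline = "".join(
                self.bold(self.escape(text)) if is_bold else self.escape(text)
                for text, is_bold in runs
            )
            if kind == "heading":
                rendered.append(self.heading(inline))
            else:
                rendered.append(self.paragraph(inline))
        return "".join(rendered)

    # Hooks that a subclass overrides for its target markup language.
    def escape(self, text):
        return text

    def bold(self, text):
        raise NotImplementedError

    def heading(self, text):
        raise NotImplementedError

    def paragraph(self, text):
        raise NotImplementedError


class SketchHtml(SketchParser):
    """Hypothetical HTML output."""

    def escape(self, text):
        return html.escape(text)

    def bold(self, text):
        return "<strong>" + text + "</strong>"

    def heading(self, text):
        return "<h1>" + text + "</h1>"

    def paragraph(self, text):
        return "<p>" + text + "</p>"


class SketchMarkdown(SketchParser):
    """Hypothetical Markdown output, reusing the same document walk."""

    def bold(self, text):
        return "**" + text + "**"

    def heading(self, text):
        return "# " + text + "\n\n"

    def paragraph(self, text):
        return text + "\n\n"


# A tiny stand-in for a parsed document.
BLOCKS = [
    ("heading", [("Title", False)]),
    ("paragraph", [("Plain & ", False), ("bold", True)]),
]

html_output = SketchHtml().parse(BLOCKS)
markdown_output = SketchMarkdown().parse(BLOCKS)
```

The point of the design is that the traversal lives in one place, so supporting a new markup language only means supplying a new set of hook methods.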


Download files


Source Distribution

PyDocX-0.5.1.tar.gz (638.5 kB)

File details

Details for the file PyDocX-0.5.1.tar.gz.

File metadata

  • Download URL: PyDocX-0.5.1.tar.gz
  • Upload date:
  • Size: 638.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for PyDocX-0.5.1.tar.gz

Algorithm    Hash digest
SHA256       bb15b7dc6fec1b8490b757811ff12f689e21cf70f0e0fbb971d4a4ee1c19e008
MD5          ad83b3cbdbcdd73f2dd105854441250c
BLAKE2b-256  92f713a9fa7b3528855f0f55f30f60dc99aebb2b2414a7df01ee9f5fb6f933e2
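
A downloaded copy of the archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library; the helper names sha256_of_file and matches_expected are illustrative, and the local file path passed to them is an assumption about where the archive was saved.

```python
import hashlib
import hmac

# SHA256 digest published for PyDocX-0.5.1.tar.gz (copied from the table above).
EXPECTED_SHA256 = "bb15b7dc6fec1b8490b757811ff12f689e21cf70f0e0fbb971d4a4ee1c19e008"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected.lower())
```

For example, matches_expected("PyDocX-0.5.1.tar.gz") should return True only if the file in the current directory is byte-for-byte identical to the published archive.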

