Skip to main content

a python module to parse a Microsoft Word form in docx format, and extract all field values with their tags into a dictionary.

Project description

pywordform: a python module to parse a Microsoft Word form in docx format, and extract all field values with their tags into a dictionary.

Project website: http://www.decalage.info/python/pywordform

INSTALLATION:

  • on Windows, launch install.bat

  • on other systems, launch: setup.py install

HOW TO USE THIS MODULE:

Open sample_form.docx in MS Word, and edit field values.

From the shell, extract all fields with tags:

> python pywordform.py sample_form.docx field1 = “hello, world.” field2 = “hello,” field3 = “value B” field4 = “04-03-2012”

In a python script:

import pywordform fields = pywordform.parse_form(‘sample_form.docx’) print fields

=> this returns a dictionary of field values indexed by tags.

See http://www.decalage.info/python/pywordform See main program at the end of the module, and also docstrings.

LICENSE:

See LICENSE.txt.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywordform-0.02.zip (23.3 kB view details)

Uploaded Source

File details

Details for the file pywordform-0.02.zip.

File metadata

  • Download URL: pywordform-0.02.zip
  • Upload date:
  • Size: 23.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pywordform-0.02.zip
Algorithm Hash digest
SHA256 154a656a592ebb52f1b94dc93fee268aa64fffd8cf0ab38351c32598372f8346
MD5 79a596b550759d3f7f524e75878b64ea
BLAKE2b-256 accb1d64d4d5df254e2dc13bd695a329cc654e2203deeff66595d916fbb18aae

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page