Skip to main content

a python module to parse a Microsoft Word form in docx format, and extract all field values with their tags into a dictionary.

Project description

pywordform: a python module to parse a Microsoft Word form in docx format, and extract all field values with their tags into a dictionary.

Project website: http://www.decalage.info/python/pywordform

INSTALLATION:

  • on Windows, launch install.bat
  • on other systems, launch: setup.py install

HOW TO USE THIS MODULE:

Open sample_form.docx in MS Word, and edit field values.

From the shell, extract all fields with tags:

> python pywordform.py sample_form.docx field1 = “hello, world.” field2 = “hello,” field3 = “value B” field4 = “04-03-2012”

In a python script:

import pywordform fields = pywordform.parse_form(‘sample_form.docx’) print fields

=> this returns a dictionary of field values indexed by tags.

See http://www.decalage.info/python/pywordform See main program at the end of the module, and also docstrings.

LICENSE:

See LICENSE.txt.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for pywordform, version 0.02
Filename, size File type Python version Upload date Hashes
Filename, size pywordform-0.02.zip (23.3 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page