Skip to main content

Get the parsed microsoft word document in a hierarchical tree structure.

Project description

mswordtree

Parse your whole word document in a hierarchical tree structure. The document content will be listed down as Heading and its children as subheading/paragraph/table etc.

Install the library using following comand

pip install mswordtree

Use the following code to parse your word document in a tree structure

from mswordtree import GetWordDocTree
root = GetWordDocTree('test.docx')

Now you can iterate over all objects of the document by using the following code

for item in root.Items:
    print('Type: {} -> Content {}\n'.format(item.Type, item.Content))

To make the json use the following code

from mswordtree import ToString
ToString([root])

Common Methods

Find(guid)

Use the root element to find any element in its tree structure by mathing its GUID.

item = root.Find('3b34509b-533e-40cc-b0dc-c44df5bcba51')

ToString_AllHeadings(root)

Returns the string of all heading elements in a tree structure, which we can use as a json string.

from mswordtree import ToString_AllHeadings
import json

data = ToString_AllHeadings(root)
json.dumps(data)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mswordtree-0.1.1.7.tar.gz (46.0 kB view details)

Uploaded Source

Built Distribution

mswordtree-0.1.1.7-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file mswordtree-0.1.1.7.tar.gz.

File metadata

  • Download URL: mswordtree-0.1.1.7.tar.gz
  • Upload date:
  • Size: 46.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.3

File hashes

Hashes for mswordtree-0.1.1.7.tar.gz
Algorithm Hash digest
SHA256 c2b39beb3c7b082dbd8dc7541c730d0d7e001a19499628c1868fe5672968cfc5
MD5 048b60d47062808d205bed0681b5e65d
BLAKE2b-256 9b8e18aec56cd095537b265c69dd18e3f22dcf5e23a0e053ac2bcabd2d7590ce

See more details on using hashes here.

File details

Details for the file mswordtree-0.1.1.7-py3-none-any.whl.

File metadata

  • Download URL: mswordtree-0.1.1.7-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.3

File hashes

Hashes for mswordtree-0.1.1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 8f3d13ea602f2ce19b0fa7fdee71ce5550d9d83209674edb234e9ed4d22487c3
MD5 c31f0156440e79f5b60c76bbafa65644
BLAKE2b-256 58455e897b9c5cb74598e6edc54e869b6410e57122c02230982a8638a00f1236

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page