Get a parsed Microsoft Word document as a hierarchical tree structure.
Project description
mswordtree
Parse a whole Word document into a hierarchical tree structure. The document content is listed as headings, and each heading's children are its subheadings, paragraphs, tables, and so on.
Install the library using the following command:
pip install mswordtree
Use the following code to parse your Word document into a tree structure:
from mswordtree import GetWordDocTree
root = GetWordDocTree('test.docx')
Now you can iterate over all objects of the document by using the following code
for item in root.Items:
    print('Type: {} -> Content {}\n'.format(item.Type, item.Content))
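The loop above reads `root.Items`. If child items also expose an `Items` list (an assumption, the description only shows it on `root`), a recursive walk along these lines would reach every level of the tree. The `walk` helper is hypothetical and not part of mswordtree, and the tree below is built from stand-in objects so the sketch runs without a .docx file.

```python
from types import SimpleNamespace


def walk(node, depth=0):
    """Yield (depth, item) for every item below node, depth-first.

    Assumes nested items carry their children in an `Items` attribute,
    like `root` does; items without one are treated as leaves.
    """
    for item in getattr(node, "Items", None) or []:
        yield depth, item
        yield from walk(item, depth + 1)


# Stand-in objects that mimic the attributes used in this description
# (Type, Content, Items). With the real library, pass `root` instead.
para = SimpleNamespace(Type="Paragraph", Content="Some text", Items=[])
sub = SimpleNamespace(Type="Heading", Content="1.1 Scope", Items=[para])
top = SimpleNamespace(Type="Heading", Content="1 Introduction", Items=[sub])
demo_root = SimpleNamespace(Items=[top])

# Indent each line by its depth to make the hierarchy visible.
lines = [
    "{}{}: {}".format("  " * depth, item.Type, item.Content)
    for depth, item in walk(demo_root)
]
print("\n".join(lines))
```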
To produce JSON from the tree, use the following code:
from mswordtree import ToString
ToString([root])
Common Methods
Find(guid)
Use the root element to find any element in its tree by matching its GUID.
item = root.Find('3b34509b-533e-40cc-b0dc-c44df5bcba51')
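To illustrate what a lookup by GUID over such a tree involves, here is a minimal depth-first sketch. It is not the library's implementation: the `find_by_guid` helper and the `GUID` attribute name are hypothetical, and the nodes are stand-ins. What `Find` returns when nothing matches is not documented here, so checking the result before using it seems prudent.

```python
from types import SimpleNamespace


def find_by_guid(node, guid):
    """Return the first node whose GUID equals `guid`, or None.

    `GUID` and `Items` are assumed attribute names for this sketch.
    """
    if getattr(node, "GUID", None) == guid:
        return node
    for child in getattr(node, "Items", None) or []:
        match = find_by_guid(child, guid)
        if match is not None:
            return match
    return None


# Stand-in tree with made-up identifiers.
leaf = SimpleNamespace(GUID="guid-leaf", Content="Some text", Items=[])
head = SimpleNamespace(GUID="guid-head", Content="1 Introduction", Items=[leaf])
demo_root = SimpleNamespace(GUID="guid-root", Content="", Items=[head])

found = find_by_guid(demo_root, "guid-leaf")
missing = find_by_guid(demo_root, "no-such-guid")

if found is not None:
    print(found.Content)
```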
ToString_AllHeadings(root)
Returns a string containing all heading elements in a tree structure, which can be used as a JSON string.
from mswordtree import ToString_AllHeadings
import json
data = ToString_AllHeadings(root)
json.dumps(data)
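If the returned value is already a JSON string, as the description suggests, `json.loads` turns it into Python lists and dictionaries, whereas `json.dumps` on a string encodes it a second time as one quoted JSON string. The sample string below is a stand-in with a made-up shape, not actual library output.

```python
import json

# Stand-in for the string returned by ToString_AllHeadings(root);
# the keys shown here are an assumption for illustration only.
data = '[{"Type": "Heading", "Content": "1 Introduction", "Items": []}]'

# Decode the JSON text into Python objects.
headings = json.loads(data)

# Encoding the string again wraps the whole text in quotes and
# escapes the inner ones; decoding it just gives the string back.
encoded_again = json.dumps(data)

# Re-serialize the decoded objects for readable output.
pretty = json.dumps(headings, indent=2)
print(pretty)
```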
Download files
File details
Details for the file mswordtree-0.1.1.7.tar.gz.
File metadata
- Download URL: mswordtree-0.1.1.7.tar.gz
- Upload date:
- Size: 46.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.3
File hashes
Algorithm | Hash digest
---|---
SHA256 | c2b39beb3c7b082dbd8dc7541c730d0d7e001a19499628c1868fe5672968cfc5
MD5 | 048b60d47062808d205bed0681b5e65d
BLAKE2b-256 | 9b8e18aec56cd095537b265c69dd18e3f22dcf5e23a0e053ac2bcabd2d7590ce
File details
Details for the file mswordtree-0.1.1.7-py3-none-any.whl.
File metadata
- Download URL: mswordtree-0.1.1.7-py3-none-any.whl
- Upload date:
- Size: 6.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.3
File hashes
Algorithm | Hash digest
---|---
SHA256 | 8f3d13ea602f2ce19b0fa7fdee71ce5550d9d83209674edb234e9ed4d22487c3
MD5 | c31f0156440e79f5b60c76bbafa65644
BLAKE2b-256 | 58455e897b9c5cb74598e6edc54e869b6410e57122c02230982a8638a00f1236