
Microsoft Compound File Binary File Format (aka OLE2) IO

Project description

CfbIO provides access to the internal structure of the Microsoft Compound File Binary File Format, also known as OLE2 or COM structured storage.

The module treats the input file like a standard Python IO object: you can seek and read it like any other file-like object (writing is not supported yet). It also exposes the internal directory structure, whose Entries are themselves standard readable and seekable objects.

Working with the module is straightforward:

from cfb import CfbIO
from cfb.directory.entry import SEEK_END

doc = CfbIO("something.doc")

root = doc.root
print(root.read())  # Read whole root entry buffer

some_entry = doc.directory[1].left
some_entry.seek(100, whence=SEEK_END)
print(some_entry.read(100))  # Read last 100 bytes from left sibling
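
For comparison, here is the same "last 100 bytes" read on a plain in-memory stream. This is a sketch that uses only the standard `io` module, not cfb. The standard convention is a negative offset with `SEEK_END`; the example above passes a positive offset, so whether `Entry.seek` counts it backwards from the end is an assumption worth checking against the library before relying on it.

```python
import io

# A 256-byte stream whose byte at index i has the value i.
buf = io.BytesIO(bytes(range(256)))

# Standard io convention: a negative offset relative to the end.
buf.seek(-100, io.SEEK_END)
tail = buf.read(100)

print(len(tail), tail[0], tail[-1])
```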

All classes are lazy, so you can read very large files without loading them fully into memory. Data is read only when you ask for it.
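
To illustrate what lazy, file-like access to one region of a larger file can look like, here is a minimal sketch. `LazySlice` is a hypothetical class written for this example and is not part of cfb or a description of how cfb is implemented; it simply shows a seekable, readable view that touches the underlying stream only when `read()` is called.

```python
import io


class LazySlice:
    """Hypothetical read-only view over a byte range of a stream (not part of cfb)."""

    def __init__(self, source, start, size):
        self._source = source  # underlying seekable stream
        self._start = start    # offset of the view within the stream
        self._size = size      # length of the view in bytes
        self._pos = 0          # current position inside the view

    def tell(self):
        return self._pos

    def seek(self, offset, whence=io.SEEK_SET):
        # Follows the standard io convention for whence.
        if whence == io.SEEK_CUR:
            offset += self._pos
        elif whence == io.SEEK_END:
            offset += self._size
        elif whence != io.SEEK_SET:
            raise ValueError("unsupported whence value")
        # Clamp the position to the bounds of the view.
        self._pos = max(0, min(offset, self._size))
        return self._pos

    def read(self, size=-1):
        remaining = self._size - self._pos
        if size is None or size < 0 or size > remaining:
            size = remaining
        # Nothing is read from the source until this point.
        self._source.seek(self._start + self._pos)
        data = self._source.read(size)
        self._pos += len(data)
        return data


# Demo data: a 13-byte "entry" embedded between a header and a trailer.
source = io.BytesIO(b"HEADER" + b"payload-bytes" + b"TRAILER")
entry = LazySlice(source, start=6, size=13)

entry.seek(-5, io.SEEK_END)
tail = entry.read()
print(tail)
```

Because the view stores only offsets, creating many such objects costs almost nothing; bytes are copied only for the ranges actually requested.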

Download files

Source Distribution

cfb-0.8.2.tar.gz (8.8 kB)
