A library to extract text and images from DOCX files, returning a flattened string or structured data.

Project description

A simple library that extracts text and images from DOCX files and returns either a flattened string (with image placeholders) or a structured list. Useful for parsing multiple-choice question (MCQ) and exam documents.
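The package's own API is not shown on this page, so the following is only a minimal sketch of the general technique, written against the Python standard library rather than pydocxextractor itself. The function names `extract_structured` and `flatten`, the item dictionaries, and the `[IMAGE:rId]` placeholder format are all assumptions made for illustration; the library may use different names and formats.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespaces used inside word/document.xml (WordprocessingML, DrawingML,
# and the relationships namespace that carries the r:embed attribute).
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"
A = "{http://schemas.openxmlformats.org/drawingml/2006/main}"
R = "{http://schemas.openxmlformats.org/officeDocument/2006/relationships}"


def extract_structured(source):
    """Return text and image items in document order.

    `source` is a path or a binary file-like object for a .docx file.
    Each item is {"type": "text" | "image", "value": str}; this shape is
    hypothetical and not taken from pydocxextractor.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    items = []
    for node in root.iter():
        if node.tag == W + "p":
            # Start of a paragraph: separate it from earlier content.
            if items:
                items.append({"type": "text", "value": "\n"})
        elif node.tag == W + "t" and node.text:
            items.append({"type": "text", "value": node.text})
        elif node.tag == A + "blip":
            # r:embed is the relationship id of the embedded image part.
            rel_id = node.get(R + "embed")
            if rel_id:
                items.append({"type": "image", "value": rel_id})
    return items


def flatten(items):
    """Join items into one string, substituting a placeholder per image."""
    parts = []
    for item in items:
        if item["type"] == "image":
            parts.append(f"[IMAGE:{item['value']}]")  # assumed placeholder format
        else:
            parts.append(item["value"])
    return "".join(parts).strip()
```

Calling `flatten(extract_structured("exam.docx"))` (with `exam.docx` as a stand-in file name) would yield one string in which each embedded image appears as a placeholder at its position in the text. Resolving a relationship id to the actual image bytes would additionally require reading `word/_rels/document.xml.rels`, which this sketch leaves out.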

Download files

Download the file for your platform.

Source Distribution

pydocxextractor-0.1.0.tar.gz (2.9 kB)

Uploaded Source

Built Distribution

pydocxextractor-0.1.0-py3-none-any.whl (3.2 kB)

Uploaded Python 3

File details

Details for the file pydocxextractor-0.1.0.tar.gz.

File metadata

  • Download URL: pydocxextractor-0.1.0.tar.gz
  • Upload date:
  • Size: 2.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.4

File hashes

Hashes for pydocxextractor-0.1.0.tar.gz
Algorithm Hash digest
SHA256 be6bbd9a493df905369ecc5fba0b6b73e9067dab8fa342228e1d58781007ba8e
MD5 0a1429edb476fa4730bba22f0f3551a7
BLAKE2b-256 644e9b7665409f2e2e3c37fb250aa2c864de622030901b928d1a45b20e60d90e
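
The published digests can be used to check a downloaded file before installing it. The snippet below is a minimal sketch using Python's standard `hashlib`; the helper names `sha256_of` and `matches` are made up for illustration, and the file is assumed to have been saved to the current directory under its published name.

```python
import hashlib

# SHA256 digest listed above for pydocxextractor-0.1.0.tar.gz.
EXPECTED_SHA256 = "be6bbd9a493df905369ecc5fba0b6b73e9067dab8fa342228e1d58781007ba8e"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """True if the file's SHA256 digest equals `expected_hex` (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the download was saved.
    ok = matches("pydocxextractor-0.1.0.tar.gz", EXPECTED_SHA256)
    print("digest matches" if ok else "digest MISMATCH, do not install")
```

The same helper works for the wheel by substituting its file name and its own SHA256 digest.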

File details

Details for the file pydocxextractor-0.1.0-py3-none-any.whl.

File hashes

Hashes for pydocxextractor-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3f8f8d04adac132aae4d7df89f9b8d1f43541881e3dc16c1d01d67232464b19e
MD5 44e0ca338c3dc04830d967d38394cfd5
BLAKE2b-256 80ec651c2f6a059e833aee2486068f09eebcfeb3cd9d02db60aacd30667ec76c
