Python package to read PDF / Word files and extract their text.

file2text

A Python package that reads a Word (.docx) file and extracts its text.

Installation

$ pip install file2text

Usage

  • Pass the path of a Word file (.docx) to convert_word; the result is the text of the file.
import file2text
text = file2text.convert_word("FILENAME.docx")
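
For readers curious what this kind of extraction involves, here is a minimal standard-library sketch. It is not file2text's implementation, which this page does not show. The helper name docx_to_text is hypothetical, and the sketch assumes the document body lives in word/document.xml, as it does in ordinary .docx files.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside word/document.xml
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(path):
    """Return the text of a .docx file, one line per paragraph.

    Hypothetical helper for illustration only; not part of file2text.
    `path` may be a filesystem path or a binary file-like object.
    """
    # A .docx file is a ZIP archive; the main body is one XML part.
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # Each paragraph stores its text in one or more <w:t> elements.
        runs = [node.text or "" for node in para.iter(W_NS + "t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)
```

This sketch reads only the main document body and ignores headers, footers, footnotes, and formatting, so its output may differ from what file2text.convert_word produces.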

Contributing

Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

License

file2text was created by Beth Ou Yang. It is licensed under the terms of the MIT license.

Credits

file2text was created with cookiecutter and the py-pkgs-cookiecutter template.

Download files

Source Distribution

file2text-0.1.0.tar.gz (2.2 kB view details)

Uploaded Source

Built Distribution

file2text-0.1.0-py3-none-any.whl (2.8 kB view details)

Uploaded Python 3

File details

Details for the file file2text-0.1.0.tar.gz.

File metadata

  • Download URL: file2text-0.1.0.tar.gz
  • Upload date:
  • Size: 2.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.7 Darwin/22.5.0

File hashes

Hashes for file2text-0.1.0.tar.gz
Algorithm    Hash digest
SHA256       1b988778f9d107aa939f2153f998ef8f31092aaacb0dbf1c3f549fdcd4df46ca
MD5          366c4bb406c3edbc89b9423d0d426b62
BLAKE2b-256  e99a18318bd205ef3e60c76541d2672d1401948665248cb11880477e46d8bb8a

File details

Details for the file file2text-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: file2text-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 2.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.7 Darwin/22.5.0

File hashes

Hashes for file2text-0.1.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       094f5b9bca2a5751b1bcddeb804ee49b80b5f5dca770ff9368ed73a5f36ebb7a
MD5          0c50f0c2786e6b39e4a7884984dcf22c
BLAKE2b-256  be13c93216ea43c973ceab1e882ee95891d0327b2679be74be0fa915335ac0d8
