Skip to main content

Added Requirements

Project description

DocuScan Package

DocuScan is a lightweight document scanner.

DocuScan allows users to open up document types docx,doc,pdf and return the information inside as strings.

DocuScan also allows for manipulation of this information via regular expressions.

Check out my other projects!


  1. zipfile

  2. io

  3. re

  4. XML


  1. run pip install DocuScan

  2. import DocuScan


  1. class DocuScan('fileName') to a variable.

###It is worth noting that the fileName must be in the directory.

  1. use print(variable.returnFileText())

  2. use print(variable.executeRegex('regex here'))

  3. use print(executeHeaderRegex('regex here'))

  4. use print(executeFooterRegex('regex here'))


  1. returnFileText() - Returns the text of a file.

  2. executeRegex(regexExpression) - creates a list of all matching cases of regexExpression

  3. executeHeaderRegex(regularExpression) - creates a list of all matching cases of regexExpression in the header XML.

  4. executeFooterRegex(regularExpression) - creates a list of all matching cases of regexExpression in the Footer XML.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for DocuScan, version
Filename, size File type Python version Upload date Hashes
Filename, size DocuScan- (2.9 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page