DocuScan is a lightweight document scanner.
DocuScan opens .docx, .doc, and .pdf documents and returns their contents as strings.
DocuScan can also search this text with regular expressions.
Run pip install DocuScan to install the package.
- Assign an instance of the class, DocuScan('fileName'), to a variable.
Note that the file named by fileName must be in the current working directory.
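Use print(variable.executeRegex('regex here')) to print the matches found in the document text.
Use print(variable.executeHeaderRegex('regex here')) to print the matches found in the header.
Use print(variable.executeFooterRegex('regex here')) to print the matches found in the footer.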
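The three calls above can be combined into one step. The sketch below is only an illustration: collect_matches is a hypothetical helper, not part of DocuScan, and it assumes nothing beyond the three method names listed in this description. The import path in the comments is also an assumption.

```python
def collect_matches(scanner, pattern):
    """Run one pattern against the body, header and footer of an opened document.

    `scanner` is expected to be an object exposing executeRegex,
    executeHeaderRegex and executeFooterRegex, such as a DocuScan instance.
    collect_matches itself is a hypothetical helper, not part of the package.
    """
    return {
        "body": scanner.executeRegex(pattern),
        "header": scanner.executeHeaderRegex(pattern),
        "footer": scanner.executeFooterRegex(pattern),
    }


# Possible usage, assuming the class can be imported like this (not confirmed
# by the description) and that report.docx is a made-up file name:
#
#   from DocuScan import DocuScan
#   variable = DocuScan('report.docx')
#   print(collect_matches(variable, r'\d+'))
```

Passing the scanner object in as an argument keeps the helper independent of how the class is imported.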
returnFileText() - Returns the text of a file.
executeRegex(regexExpression) - Returns a list of all matches of regexExpression in the file text.
executeHeaderRegex(regularExpression) - Returns a list of all matches of regularExpression in the header XML.
executeFooterRegex(regularExpression) - Returns a list of all matches of regularExpression in the footer XML.
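To get a feel for what a "list of all matches" looks like, here is a small sketch using Python's standard re module on a made-up string. It assumes the regex methods behave like re.findall over the text returned by returnFileText(); the description does not confirm how DocuScan implements them, so treat this as an approximation.

```python
import re

# Stand-in for the string that returnFileText() would return (made-up sample).
text = "Invoice 1042 issued 2023-04-01, due 2023-05-01."

# No capture groups: each list item is the full match.
dates = re.findall(r"\d{4}-\d{2}-\d{2}", text)

# One capture group: each list item is only the captured group.
years = re.findall(r"(\d{4})-\d{2}-\d{2}", text)

print(dates)  # ['2023-04-01', '2023-05-01']
print(years)  # ['2023', '2023']
```

If the methods do wrap re.findall, adding a capture group changes what each list item contains, which is worth checking against real output before relying on it.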