Extract form data from a PDF form.
Project description
Copyright (c) 2016 Bart Massey
This Python 2 library and command-line app read form data from PDF forms, using the PDFMiner (http://www.unixuser.org/~euske/python/pdfminer) library for extraction. The code is originally from http://stackoverflow.com/q/3984003/364875, with minor changes to update it for 2016 and to package it as an app.
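As a rough illustration of the approach, the sketch below reads the top-level AcroForm fields of a PDF with PDFMiner. It is not the pdfformread source: the function names `collect_fields` and `read_form_fields` are hypothetical, and the import paths assume a PDFMiner release that provides `PDFDocument` in `pdfminer.pdfdocument` (as the pdfminer.six fork does). Nested fields are not traversed.

```python
def collect_fields(fields, resolve):
    """Map each field's name (/T entry) to its value (/V entry).

    `fields` is an iterable of field dictionaries or references to them;
    `resolve` turns a reference into the object it points to.
    """
    result = {}
    for ref in fields:
        field = resolve(ref)
        # A field with no /V entry has no value yet; record it as None.
        result[field.get("T")] = resolve(field.get("V"))
    return result


def read_form_fields(path):
    """Return a dict of top-level form field names and values (hypothetical helper)."""
    # Imported here so collect_fields stays usable without PDFMiner installed.
    from pdfminer.pdfparser import PDFParser
    from pdfminer.pdfdocument import PDFDocument
    from pdfminer.pdftypes import resolve1

    # Objects are resolved lazily, so the file must stay open while reading.
    with open(path, "rb") as fp:
        doc = PDFDocument(PDFParser(fp))
        acroform = resolve1(doc.catalog.get("AcroForm"))
        if not acroform:
            return {}  # the document has no interactive form
        fields = resolve1(acroform.get("Fields", []))
        return collect_fields(fields, resolve1)
```

Names and values come back as PDFMiner returns them (typically raw byte strings), so callers would still need to decode them before display.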
There are a lot of things that need to be improved here:
- Unicode and other character encoding handling is kludgy at best.
- Only text and selection elements of forms are supported.
- Subforms aren’t supported.
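On the character encoding point, PDF text strings are stored either as UTF-16BE with a leading byte order mark or in PDFDocEncoding. The sketch below shows one way such a value might be decoded. It is not what this library currently does: `decode_pdf_text` is a hypothetical helper, and Latin-1 is used only as an approximation of PDFDocEncoding, which differs from it for some byte values.

```python
def decode_pdf_text(raw):
    """Decode a raw PDF text string into Unicode text (hypothetical helper)."""
    if raw is None:
        return None
    # UTF-16BE strings are marked by the byte order mark FE FF.
    if raw.startswith(b"\xfe\xff"):
        return raw[2:].decode("utf-16-be")
    # Assumption: Latin-1 stands in for PDFDocEncoding here.
    return raw.decode("latin-1")
```

For example, `decode_pdf_text(b"\xfe\xff\x00A\x00b")` yields the two-character string `Ab`.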
This code is available under the “MIT License”. Please see the file COPYING in this distribution for license terms.
Download files
Filename | Size | File type | Python version |
---|---|---|---|
pdfformread-0.1.2-py2-none-any.whl | 5.3 kB | Wheel | 2.7 |
pdfformread-0.1.2.tar.gz | 3.1 kB | Source | None |
Hashes for pdfformread-0.1.2-py2-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | 0091f5a1800b23040a4e4b8ce2613a8988babb396e35c5af5180cb40c86fa70a |
MD5 | 30bfcde680bcf8f0381e85d70a4d0441 |
BLAKE2-256 | c0cd0fe8ed72b705bd632fdc9588d8f1b36810f96b484c978698993b0db9aca2 |