Upload documents to Overview web server
Project description
overview-upload
Upload files to an Overview document set.
Installation
Requires python3.
pip3 install overview_upload, maybe with sudo in front. That will install a overview-upload program in your path.
Command-Line Usage
overview-upload: upload files
overview-upload --server <SERVER_URL> --token <API_TOKEN> [options] DIRECTORY
Required arguments:
SERVER_URL is the base URL for the server, which defaults to http://localhost:9000 for use with overview-local.
API_TOKEN is required. It’s the access token for a particular document set, which you get like this
Browse to your document set. The URL will look like http://localhost:9000/documentsets/123456.
Note the document set number, in this case 123456
Browse to https://www.overviewdocs.com/documentsets/[your number here]/api-tokens and click “Generate token”.
DIRECTORY is a path to the file or directory you want to upload.
Optional arguments:
--skip-duplicate (the default), --noskip: skip files that are already part of the document set your API token refers to. Files are compared by their sha1 hashes, meaning even if you move or rename the file it will not be uploaded if the document set already includes it. This feature is helpful for synchronizing a local directory with an Overview document set; however, it will not delete Overview documents corresponding to files you deleted locally.
--split-by-page: tell Overview to turn a multi-page file (like a PDF or Word document) into several Overview documents.
--ocr (the default), --no-ocr: tell Overview what to do when a PDF page has only images but no text. Overview can either try to recognize text using Tesseract (which is slow and will prevent you from viewing the document set until it finishes), or it can assume the page contains no text.
If you upload a single file, its Overview document title will be its filename, without any directory information. If you upload a directory, filenames will include subdirectory informatin: for instance, if overview-upload /some/path uploads /some/path/to/file.pdf, the Overview document title will be to/file.pdf.
overview-upload-csv: upload from a CSV manifest
Given a CSV like this:
id |
title |
url |
field1 |
field2 |
---|---|---|---|---|
1 |
doc1.pdf |
http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/pdf_open_parameters.pdf |
some metadata |
some more metadata |
2 |
doc2.pdf |
http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/pdf_open_parameters.pdf |
more metadata |
and even more metadata |
You can upload the referenced files to Overview like this:
overview-upload-csv <API_TOKEN> <manifest.csv> --url-field url --title-field title --server https://www.overviewdocs.com
If you browse to the document set and add field1 and field2 fields, you will see the values from the CSV.
API usage
You can also import overview_upload from your own Python3 program and then use the overview_upload.Upload class. See the overview-server source code for more information.
Developing
Releasing a new version
Register on PyPI and create ~/.pypirc with [pypi]\nusername = ...\npassword = ...
Update version in setup.py
rm -r dist && ./setup.py sdist
gpg --detach-sign -a dist/*.tar.gz
twine upload dist/*.tar.gz dist/*.asc
License
This software is distributed under the terms of the GNU Affero General Public License. See the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.