Pdfserver is a webservice that offers common PDF operations like joining documents, selecting pages or "n pages on one".
Pdfserver is a webservice that offers common PDF operations like joining documents, selecting pages or “n pages on one”. It is built on top of the Python based microframework Flask and depends on pyPdf to manipulate PDFs.
Powerful tools to manipulate PDF exist but they are not universally available on all systems or not simple to use. This server allows anyone to quickly solve most common PDF operations over the web.
If you don’t trust other servers with your data, deploy a copy yourself!
See http://pdfserverapp.appspot.com/ for an example installation.
Download and extract the soure code.
Create a virtualenv in the extracted folder and install requirements:
$ virtualenv env $ source env/bin/activate $ pip install -r requirements.txt
You can simply run the development server with:
$ python manage.py createdb $ mkdir uploads $ python manage.py runserver
Make sure the given upload directory and database can be written to and are not accessible from the outside (if on a public server).
When not in debug mode make sure to serve static files under static.
Give a SECRET_KEY and keep it secret so that sessions can be signed and users cannot see files uploaded by others.
Create the database by running:
$ python manage.py createdb
For optional, asynchronous generation of the resulting PDF install celery and kombu-sqlalchemy (you may also use default broker RabbitMQ, see http://celeryq.org/docs/getting-started/broker-installation.html):
$ pip install -r celery_requirements.txt
Run celeryd from the project’s directory to handle tasks asynchronously:
The Google App Engine has its own dereferred library which is automatically used.
See pdfserver.cgi for an example on how to run pdfserver through the traditional CGI interface.
For pdfserver to run on the App Engine you need to download and copy dependencies locally. Run the following in the extracted folder:
# Get dependencies $ mkdir tmp $ pip install -r requirements.txt distribute --build=tmp --src=tmp \ --no-install --ignore-installed $ mv tmp/Babel/babel/ tmp/Flask/flask/ tmp/Flask-Babel/flaskext/ \ tmp/Jinja2/jinja2/ tmp/pyPdf/pyPdf/ tmp/pytz/pytz \ tmp/speaklater/speaklater.py tmp/Werkzeug/werkzeug/ \ tmp/reportlab/src/reportlab/ tmp/distribute/pkg_resources.py . $ rm -rf tmp # Add a secret key $ $EDITOR appengine.py # Choose your application name $ $EDITOR app.yaml # Run the development server $ /usr/local/google_appengine/dev_appserver.py . # Finally upload $ /usr/local/google_appengine/appcfg.py update .
If tasks won’t get executed (you can check under http://localhost:8080/_ah/admin/tasks?queue=default), you might got hitten by bug http://code.google.com/p/appengine-mapreduce/issues/detail?id=9, see workaround there.
Please report bugs to http://github.com/cburgmer/pdfserver/issues.
Christoph Burgmer <cburgmer (at) ira uka de>