This is a pre-production deployment of Warehouse, however changes made here WILL affect the production instance of PyPI.
Latest Version Dependencies status unknown Test status unknown Test coverage unknown
Project Description
# HTTP Task Queue (htq)

The motivation for this project is to provide a standard interface for sending a request in the background to a service that performs some task with a standard mechanism for signaling it to be *cancelled*.

In this context, a *task* is classified as something that may take longer than what is appropriate for a typical HTTP response or something that is allowed to be eventually completed. Examples include executing a database query, running an analysis on some data, and injesting/scraping data from websites or other services. One side effect of the client-server model is that the server may not be aware if the client aborts the request. The server will continue to perform the task using up resources that other tasks or request handlers could be using. The mechanism for signaling a cancellation is for a subsequent DELETE request to be sent to the service which can be handled to interrupt the first request. This of course requires services to support the DELETE method and implement the logic for cancelling the task being performed. See below for a working example of a [service](#service-example) implementing this interface.

For a detailed introduction, read the [tutorial](#tutorial) below.

## Dependencies

- Python 3.3+
- Redis 2.4+

## Command-line Interface

$ htq -h
HTTP Task Queue (htq) command-line interface

htq server [--host <host>] [--port <port>] [--redis <redis>] [--debug]
htq worker [--threads <n>] [--redis <redis>] [--debug]

-h --help Show this screen.
-v --version Show version.
--debug Turns on debug logging.
--host <host> Host of the HTTP service [default: localhost].
--port <port> Port of the HTTP service [default: 5000].
--redis <redis> Host/port of the Redis server [default: localhost:6379].
--threads <n> Number of threads a worker should spawn [default: 10].

Run the server for the HTTP REST interface.

htq server

Run the worker to send requests and receive responses.

htq worker

## API

- `GET /` - Gets all queued requests.
- `POST /` - Sends (queues) a request
- `GET /<uuid>/` - Gets a request by UUID
- `DELETE /<uuid>/` - Cancels a request, deleting it's response if already received
- `GET /<uuid>/response/` - Gets a request's response if it has been received
- `DELETE /<uuid>/response/` - Delete a request's response to clear up space

## Tutorial

Start the HTQ server:

$ htq server
Starting htq REST server...
* Running on http://localhost:5000/

Send a POST to the service with a JSON encoded structure of the request to be sent. This immediately returns a `303 See Other` response with the `Location` header to the request

$ curl -i -X POST -H 'Content-Type: application/json' http://localhost:5000 -d '{"url": ""}'
Content-Type: text/html; charset=utf-8
Content-Length: 0
Location: http://localhost:5000/a936e1a1-68d8-4433-a0c0-4f4b2111670d/
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 17:18:49 GMT

See what the request resource looks like (use the URL from the `Location` header in your output):

$ curl -i http://localhost:5000/a936e1a1-68d8-4433-a0c0-4f4b2111670d/
HTTP/1.0 200 OK
Content-Type: application/json
Content-Length: 185
Link: <http: localhost:5000="" a936e1a1-68d8-4433-a0c0-4f4b2111670d="" response=""/>; rel="response",
<http: localhost:5000="" a936e1a1-68d8-4433-a0c0-4f4b2111670d=""/>; rel="self",
<http: localhost:5000="" a936e1a1-68d8-4433-a0c0-4f4b2111670d="" status=""/>; rel="status"
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 17:26:05 GMT

"timeout": 60,
"status": "queued",
"data": null,
"headers": {},
"url": "",
"method": "get",
"uuid": "a936e1a1-68d8-4433-a0c0-4f4b2111670d",
"time": 1412011537783

The above response shows the details of the request such as the URL, method, headers, and request data. Additionally metadata has been captured when the request was queued including the UUID, the time (in milliseconds) when the request was queued, the status of the request and a timeout. The `Link` header shows the related links from this resource including one to the response and the status. The status link is a lightweight:

$ curl -i http://localhost:5000/a936e1a1-68d8-4433-a0c0-4f4b2111670d/status/
HTTP/1.0 200 OK
Content-Type: application/json
Content-Length: 20
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 17:30:21 GMT

{"status": "queued"}

If we were to `curl` the response link, it would block until the response is ready. Since we have not started a worker yet, this would be forever. Let's start a worker to actual send the request. Execute this in a separate shell:

$ htq worker --debug
Started 10 workers...
[a936e1a1-68d8-4433-a0c0-4f4b2111670d] sending request...
[a936e1a1-68d8-4433-a0c0-4f4b2111670d] response received

Starting the worker daemon immediately starts consuming the queue and sending the requests. As you can see, the one sent above has been sent and the response received. Let's check the status of our request.

$ curl -i http://localhost:5000/a936e1a1-68d8-4433-a0c0-4f4b2111670d/status/
HTTP/1.0 200 OK
Content-Type: application/json
Content-Length: 21
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 17:36:56 GMT

{"status": "success"}

Success! Now let's use the link to the response itself.

$ curl -i http://localhost:5000/a936e1a1-68d8-4433-a0c0-4f4b2111670d/response/
HTTP/1.0 200 OK
Content-Type: application/json
Content-Length: 19291
Link: <http: localhost:5000="" a936e1a1-68d8-4433-a0c0-4f4b2111670d=""/>; rel="request",
<http: localhost:5000="" a936e1a1-68d8-4433-a0c0-4f4b2111670d="" response=""/>; rel="self"
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 17:38:22 GMT

"status": "success",
"time": 1412012019457,
"code": 200,
"reason": "OK",
"elapsed": 79.81,
"uuid": "62db48a4-e511-4c8e-9c11-32b39758d1ff",
"headers": {
"Age": "0",
"Content-Length": "32",
"Connection": "Keep-Alive",
"Access-Control-Allow-Origin": "*",
"Server": "gunicorn/18.0",
"Access-Control-Allow-Credentials": "true",
"Date": "Mon, 29 Sep 2014 17:50:29 GMT",
"Content-Type": "application/json"
"data": "{\n \"origin\": \"\"\n}"

The response contains all the elements of an HTTP response including code, reason, headers, and the data (which has been removed for brevity). In addition, the time (in milliseconds) the response was received and the elapsed time (in milliseconds) join the UUID and status metadata.

### Canceling a request

HTQ defines an interface for services to implement for allowing requests to be canceled. For example, if I send a request that is taking longer than I expect (delayed for 30 seconds):

$ curl -i -X POST -H Content-Type:application/json http://localhost:5000 -d '{"url": ""}'
Content-Type: text/html; charset=utf-8
Content-Length: 0
Location: http://localhost:5000/1686e1b7-3b05-4d45-95e8-caf934f540aa/
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 18:00:28 GMT

Then I can send a DELETE to the request URL:

$ curl -i -X DELETE http://localhost:5000/1686e1b7-3b05-4d45-95e8-caf934f540aa/
Content-Type: text/html; charset=utf-8
Content-Length: 0
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 18:00:41 GMT

Now if we check the status, we should see the status has changed to `canceled` (the response is also empty).

$ curl -i http://localhost:5000/9619b267-760d-4f0a-9f15-eb8ad99cd1c4/status/
HTTP/1.0 200 OK
Content-Type: application/json
Content-Length: 22
Server: Werkzeug/0.9.6 Python/3.4.1
Date: Mon, 29 Sep 2014 18:01:23 GMT

{"status": "canceled"}

Internally this interrupts the request, but also sends a DELETE request to the endpoint (in this ``). Implementors of services can support the DELETE request to cancel the underlying processing that is occurring. Of course this is specific to the underlying task being performed, but this simple service-level contract provides a consistent mechanism for signaling the the cancellation.

## Service Example

Below is a working example of a service that implements the interface HTQ requires for receiving the DELETE to signal the cancellation. This service takes a JSON-encoded statement and parameters and executes them in a local PostgreSQL instance (for simplicity, the database settings are hard-coded).

import json
import logging
import psycopg2
from flask import Flask, abort, request

logger = logging.getLogger(__name__)
handler = logging.StreamHandler()

app = Flask(__name__)

# Database connection settings
settings = {
'dbname': 'example',
'host': 'localhost',

# Currently running tasks by UUID mapped to the
# connection process ID
tasks = {}

@app.route('/<uuid>/', methods=['POST'])
def run(uuid):
data = request.json

conn = psycopg2.connect(**settings)
c = conn.cursor()

# Store the PID of the connection execute this task
pid = conn.get_backend_pid()
tasks[uuid] = pid

logger.debug('[{}] executing query for {}...'.format(pid, uuid))
c.execute(data['statement'], data.get('parameters'))
tasks.pop(uuid, None)

return json.dumps(c.fetchall()), 200

@app.route('/<uuid>/', methods=['DELETE'])
def cancel(uuid):
if uuid not in tasks:

pid = tasks.pop(uuid)

# Open new connection to cancel the query
conn = psycopg2.connect(**settings)
c = conn.cursor()

c.execute('select pg_terminate_backend(%s)', (pid,))
logger.debug('[{}] canceled query for {}'.format(pid, uuid))

return '', 204

if __name__ == '__main__':

The interaction looks as follows:

% python
* Running on
[57819] executing query for abc123...
[57819] canceled query for abc123 - - [30/Sep/2014 11:16:01] "DELETE /abc123/ HTTP/1.1" 204 - - - [30/Sep/2014 11:16:02] "POST /abc123/ HTTP/1.1" 500 -

A POST request was first sent to execute a query. A DELETE request was sent shortly after to cancel the query and returns. When the POST request does respond it is a 500 complaining the connection was closed (which is what we wanted).
Release History

Release History


This version

History Node

TODO: Figure out how to actually get changelog content.

Changelog content for this version goes here.

Donec et mollis dolor. Praesent et diam eget libero egestas mattis sit amet vitae augue. Nam tincidunt congue enim, ut porta lorem lacinia consectetur. Donec ut libero sed arcu vehicula ultricies a non tortor. Lorem ipsum dolor sit amet, consectetur adipiscing elit.

Show More

Download Files

Download Files

TODO: Brief introduction on what you do with files - including link to relevant help section.

File Name & Checksum SHA256 Checksum Help Version File Type Upload Date
htq-0.1.0.tar.gz (10.6 kB) Copy SHA256 Checksum SHA256 Source Sep 30, 2014

Supported By

WebFaction WebFaction Technical Writing Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS HPE HPE Development Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Rackspace Rackspace Cloud Servers DreamHost DreamHost Log Hosting