python-epo-ops-client

Python Client for the European Patent Office's Open Patent Services API

These details have not been verified by PyPI

Project links

Project description

python-epo-ops-client

python-epo-ops-client is an Apache2 Licensed client library for accessing the European Patent Office’s (“EPO”) Open Patent Services (“OPS”) v.3.1 (based on v 1.2.10 of the reference guide).

import epo_ops

anonymous_client = epo_ops.Client()  # Instantiate a default client
response = anonymous_client.published_data(  # Retrieve bibliography data
  reference_type = 'publication',  # publication, application, priority
  input = epo_ops.models.Docdb('1000000', 'EP', 'A1'),  # original, docdb, epodoc
  endpoint = 'biblio',  # optional, defaults to biblio in case of published_data
  constituents = []  # optional, list of constituents
)

registered_client = epo_ops.RegisteredClient(key='abc', secret='xyz')
registered_client.access_token  # To see the current token
response = registered_client.published_data(…)

Features

python_epo_ops_client abstracts away the complexities of access EPO OPS:

Format the requests properly
Bubble up quota problems as proper HTTP errors
Handle token authentication and renewals automatically
Handle throttling properly
Add optional caching to minimize impact on the OPS servers

There are two main layers to python_epo_ops_client: Client and Middleware.

Client

The Client contains all the formatting and token handling logic and is what you’ll interact with mostly.

When you issue a request, the response is a requests.Response object. If response.status_code != 200 then an exception will be raised, it’s your responsibility to handle those exceptions if you want to. The one case that’s handled by the RegisteredClient is when its access token has expired: in this case, the client will automatically handle the HTTP 400 status and renew the token.

Note that the Client does not attempt to interpret the data supplied by OPS, so it’s your responsibility to parse the XML or JSON payload for your own purpose.

The following custom exceptions are raised for cases when OPS quotas are exceeded, they are all subclasses of requests.HTTPError and offer the same behavior:

AnonymousQuotaPerMinuteExceeded
AnonymousQuotaPerDayExceeded
IndividualQuotaPerHourExceeded
RegisteredQuotaPerWeekExceeded

Again, it’s up to you to parse the response and decide what to do.

Currently the Client only knows how to issue request for the following services:

/published-data/search (search)
/published-data (retrieval)
/family (inpadoc)

Please submit pull requests for other services by enhancing the epo_ops.api.Client class.

Middleware

All requests and responses are passed through each middleware object listed in client.middlewares. Requests are processed in the order listed, and responses are processed in the reverse order.

Each middleware should subclass middlewares.Middleware and implement the process_request and process_response methods.

There are two middleware classes out of the box: Throttler and Dogpile. Throttler is in charge of the OPS throttling rules and will delay requests accordingly. Dogpile is an optional cache which will cache all HTTP status 200, 404, 405, and 413 responses.

By default, only the Throttler middleware is enabled, if you want to enable caching:

import epo_ops

middlewares = [
    epo_ops.middlewares.Dogpile(),
    epo_ops.middlewares.Throttler(),
]
registered_client = epo_ops.RegisteredClient(
    key='key',
    secret='secret',
    middlewares=middlewares,
)

Note that caching middleware should be first in most cases.

Dogpile

Dogpile is based on (surprise) dogpile.cache. By default it is instantiated with a DBMBackend region with timeout of 2 weeks.

Dogpile takes three optional instantiation parameters:

region: You can pass whatever valid dogpile.cache Region you want to backend the cache
kwargs_handlers: A list of keyword argument handlers, which it will use to process the kwargs passed to the request object in order to extract elements for generating the cache key. Currently one handler is implemented (and instantiated by default) to make sure that the X-OPS-Range request header is part of the cache key.
http_status_codes: A list of HTTP status codes that you would like to have cached. By default 200, 404, 405, and 413 responses are cached.

Note: dogpile.cache is not installed by default, if you want to use it, pip install dogpile.cache in your project.

Throttler

Throttler contains all the logic for handling different throttling scenarios. Since OPS throttling is based on a one minute rolling window, we must persist historical (at least for the past minute) throtting data in order to know what the proper request frequency is. Each Throttler must be instantiated with a Storage object.

Storage

The Storage object is responsible for:

Knowing how to update the historical record with each request (Storage.update()), making sure to observe the one minute rolling window rule.
Calculating how long to wait before issuing the next request (Storage.delay_for()).

Currently the only Storage backend provided is SQLite, but you can easily write your own Storage backend (such as file, Redis, etc.). To use a custom Storage type, just pass the Storage object when you’re instantiating a Throttler object. See epo_ops.middlewares.throttle.storages.Storage for more implementation details.

Tests

Tests are written using pytest. To run the tests:

Register a OPS user login with EPO
Create an app
Set the OPS_KEY and OPS_SECRET environment variables accordingly
make test

The tests must be run with a working internet connection, since both OPS and the mock Apiary services are online.

Release History

0.1.3 (2014-05-21)

Python 3.4 compatibility
Updated requests dependency to 2.3.0

0.1.2 (2014-03-04)

Python 2.6 and 3.3 compatibility

0.1.1 (2014-03-01)

Allow configuration of which HTTP responses (based on status code) to cache

0.1.0 (2014-02-20)

Introduced dogpile.cache for caching http200 resopnses
Introduced the concept of middleware

0.0.1 (2014-01-21)

Initial release

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

4.1.0

Jan 25, 2024

4.0.0

Sep 19, 2021

3.1.3

Sep 23, 2020

3.1.2

Jul 5, 2020

3.1.1

Oct 28, 2019

3.1.0

Oct 28, 2019

3.0.0

Oct 27, 2019

2.3.2

Jan 15, 2018

2.3.1

Nov 11, 2017

2.3.0

Oct 23, 2017

2.2.0

Mar 30, 2017

2.1.0

Feb 21, 2016

2.0.0

Dec 11, 2015

1.0.0

Sep 21, 2015

0.1.9

Jul 22, 2015

0.1.8

Jan 24, 2015

0.1.7

Jan 24, 2015

0.1.6

Dec 12, 2014

0.1.5

Oct 18, 2014

0.1.4

Oct 11, 2014

This version

0.1.3

May 21, 2014

0.1.2

Mar 5, 2014

0.1.1

Mar 2, 2014

0.1.0

Feb 20, 2014

0.0.1

Feb 17, 2014

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

python-epo-ops-client-0.1.3.tar.gz (15.6 kB view hashes)

Uploaded May 21, 2014 Source

Built Distribution

python_epo_ops_client-0.1.3-py2.py3-none-any.whl (20.5 kB view hashes)

Uploaded May 21, 2014 Python 2 Python 3

Hashes for python-epo-ops-client-0.1.3.tar.gz

Hashes for python-epo-ops-client-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`b79f040382d5dfcae6e99fca789701d0dbfcefc98658e9f0035d1b6e5afc745b`
MD5	`13649caeea0c8c9c0292833621900fab`
BLAKE2b-256	`de9899a319213cf3b8415defe574135e8cc92f6f8a848e028785ec9583749454`

Hashes for python_epo_ops_client-0.1.3-py2.py3-none-any.whl

Hashes for python_epo_ops_client-0.1.3-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`9e5b87190ac1bb6f2a6414a34554c55ae0bee322c1ac49e4f001c75bc3274491`
MD5	`8f7107f3746e7ef7676e2bbbee638e0e`
BLAKE2b-256	`1b086d4361ae28c40d8d4e26a63a4ef64bdd6871536eb269c264a4e50bed66d4`