Skip to main content

Generic framework for building modern asynchronous API clients for bibliometric and scholarly data retrieval

Project description

bibliofabric: a foundation for modern scholarly API clients

bibliofabric is a foundational, asynchronous Python framework for building modern, resilient, and maintainable clients for scholarly APIs. It is the generic engine designed to create a future toolkit of clients for services like OpenAIRE, Crossref, OpenAlex, and more.


Key Features

  • Built on httpx and asyncio for high-performance, non-blocking I/O.
  • Automatic retries for transient network errors and 5xx/429 status codes with configurable exponential backoff.
  • Supports various authentication schemes.
  • The ResponseUnwrapper can be adapted to all response structures.
  • Optional in-memory TTL caching for GET requests to reduce latency and API load.
  • Handles 429 errors by respecting Retry-After headers.
  • Inject custom logic before a request is sent or after a response is received with a hook system.

Who is this for?

bibliofabric is a framework for developers building API clients. It is not an end-user library. You would use bibliofabric to create a library like AIREloom, which can then be used by researchers, librarians, data scientists, ....

How to Build a Client with bibliofabric (Conceptual Example)

Here’s a simplified example of how you would build a client for a hypothetical "SimpleAPI". The developer only needs to define the API-specific parts; the underlying boilerplay & logic is handled by bibliofabric.

# simpleapi_client/client.py
from bibliofabric.client import BaseApiClient
from bibliofabric.config import BaseApiSettings
from bibliofabric.models import ResponseUnwrapper
from bibliofabric.resources import GettableMixin, BaseResourceClient
from pydantic import BaseModel

# 1. Define the API's data model
class SimpleWork(BaseModel):
    id: str
    title: str

# 2. Implement the ResponseUnwrapper for this specific API
class SimpleApiUnwrapper(ResponseUnwrapper[dict]):
    def unwrap_results(self, response_json: dict) -> list[dict]:
        return response_json.get("data", [])

    def unwrap_single_item(self, response_json: dict) -> dict:
        return response_json.get("item", {})

    # ... implement other protocol methods ...

# 3. Define the Resource Client using BiblioFabric's mixins
class WorksClient(GettableMixin, BaseResourceClient):
    _entity_path: str = "works"
    _entity_model: type[SimpleWork] = SimpleWork

# 4. Create the main client class
class SimpleApiClient(BaseApiClient):
    def __init__(self, settings: BaseApiSettings | None = None, auth_strategy=None):
        # Instantiate the unwrapper and pass it to the base client
        unwrapper = SimpleApiUnwrapper()
        super().__init__(
            base_url="https://api.simple.org/v1",
            settings=settings,
            auth_strategy=auth_strategy,
            response_unwrapper=unwrapper
        )
        # Attach the specific resource client
        self.works = WorksClient(api_client=self)

# 5. The end-user can now use your new client
async def main():
    async with SimpleApiClient() as client:
        work = await client.works.get("123")
        print(work.title)

Core Architectural Concepts

bibliofabric is built on a few key, decoupled concepts:

  • The heart of the framework is formed by the BaseApiClient. A robust, httpx-based asynchronous client that manages the entire request/response lifecycle, including retries, caching, rate limiting, and a hook system.
  • A pluggable system for authentication using AuthStrategy. Implementations for No-Auth, Static Bearer Token, and OAuth2 Client Credentials are provided out of the box, and adding new strategies (e.g., for Crossref's "polite pool") is straightforward.
  • ResponseUnwrapper: The key to flexibility. This protocol allows specific clients to teach the BaseApiClient how to navigate different API response "envelopes" (e.g., finding results in response['results'] vs. response['message']['items']).
  • A set of pre-built mixins (GettableMixin, SearchableMixin, CursorIterableMixin) that provide the boilerplate logic for common REST operations (.get(), .search(), .iterate()), making specific client implementations incredibly lean and declarative.

Contributing

Contributions are welcome! This project adheres to standard open-source practices. Please see our development guidelines for information on setting up your environment, coding conventions, and testing.

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bibliofabric-0.1.0.tar.gz (66.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bibliofabric-0.1.0-py3-none-any.whl (25.8 kB view details)

Uploaded Python 3

File details

Details for the file bibliofabric-0.1.0.tar.gz.

File metadata

  • Download URL: bibliofabric-0.1.0.tar.gz
  • Upload date:
  • Size: 66.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.7.15

File hashes

Hashes for bibliofabric-0.1.0.tar.gz
Algorithm Hash digest
SHA256 9fc142c1edcbaf927557cc3d1312e758469e104bc93d6ec8bb93593a31c7be3d
MD5 6349ba54deeeabea2345c643564447e8
BLAKE2b-256 0a77568541416bb635388cef8e8697309a60819f41c163fea2bd2555f9ceebda

See more details on using hashes here.

File details

Details for the file bibliofabric-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for bibliofabric-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cbc127554a4f942db10d28a861d15a0bcd623a3bc179dd99c58fd42d5e975cbb
MD5 cb4d809779cb1d0c2e1c8c965b857801
BLAKE2b-256 bf3f474867908d7371cbfff9703a6339684f837040e8c4c2adf940f5ecc39e6e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page