A base class for building robust, asynchronous API clients.

These details have not been verified by PyPI

Project description

Asynchronous API Client Base Module

Overview

This Python module provides an abstract base class, ApiBase, designed to simplify the process of building robust, asynchronous API clients. It handles common challenges such as session management, concurrency limiting, rate-limiting, error handling, and periodic saving of results, allowing developers to focus on the specific logic of the API they are integrating with.

The framework is built on aiohttp for asynchronous HTTP requests and is structured as an abstract class, requiring the user to implement API-specific logic for fetching, transforming, and saving data.

✨ Key Features

Asynchronous by Design: Utilizes asyncio and aiohttp for high-performance, non-blocking API calls.
Concurrency Control: Employs an asyncio.Semaphore to limit the number of concurrent requests, preventing the client from overwhelming the API server.
Built-in Error Handling: Gracefully handles common HTTP errors (e.g., 429 Too Many Requests, 401 Unauthorized, 403 Forbidden) and raises custom, specific exceptions (RateLimitError, APIkeyError).
Abstract Base Class Structure: Enforces a clean and consistent implementation pattern by requiring the user to define three core methods: get_item, transform_output, and save_func.
Automatic Batch Processing: Manages the processing of large lists of inputs, yielding results as they are completed.
Periodic Saving: Includes functionality to automatically save results at specified intervals, preventing data loss during long-running jobs.
Proxy Support: Easily configurable to route requests through HTTP/HTTPS proxies.

⚙️ Installation & Prerequisites

This module relies on several external libraries. You can install them using pip:

pip install aiohttp pandas python-dotenv

It is recommended to list these in a requirements.txt file for your project.

How to use

To use this module, you must create a new class that inherits from ApiBase and implement its three abstract methods.

Step 1: Create a Subclass

First, define a class that inherits from ApiBase. In the __init__ method, you should call the parent constructor and set the base_url for the API you are targeting.

from sibr_api.base import ApiBase
import pandas as pd


class MyApiClient(ApiBase):
   def __init__(self):
      super().__init__(logger_name='MyApiClient')
      self.base_url = "[https://api.example.com/v1](https://api.example.com/v1)"

Step 2: Implement the Abstract Methods

You must provide concrete implementations for the following three methods in your subclass.

get_item(self,item) This async method defines how to fetch a single item from the API. It takes one argument, item, which contains the necessary information (e.g., an ID, a search query) to build the request URL. Inside this method, you should:
- Construct the full request URL
- Call await self.fetch_single(url) to perform the request

async def get_item(self, item_id: str):
    """
    Fetches data for a single item_id.
    """
    endpoint = f"/data/{item_id}"
    url = self.base_url + endpoint
    
    raw_response = await self.fetch_single(url)
    
    if raw_response:
        self.ok_responses += 1
        return raw_response
    else:
        self.fail_responses += 1
        return None

transform_output(self,output) This method is for processing the raw JSON response from the API into a more usable format. For example, you can extract relevant fields, flatten a nested structure, or convert it into a specific object.

def transform_output(self, output: dict) -> dict:
    """
    Transforms the raw API response into a clean dictionary.
    """
    # Example: Extracting specific fields from the JSON response
    transformed_data = {
        'id': output.get('id'),
        'name': output.get('name'),
        'value': output.get('details', {}).get('value')
    }
    return transformed_data

save_func(self,results) This method defines how to persist a list of processed results. It is called automatically by the framework when the save_interval is reached. A common implementation is to save the data to a CSV or JSON file.

def save_func(self, results: list):
    """
    Saves a list of results to a CSV file.
    """
    df = pd.DataFrame(results)
    # Use mode='a' to append to the file, and header=not os.path.exists(path)
    # to write the header only once.
    df.to_csv('results.csv', mode='a', header=False, index=False)

Step 3: Run the Client

Once your class is defined, you can instantiate it and use the get_items_with_ids or get_items methods to fetch data in bulk. These methods should be run within an `async function.

import asyncio
import pandas as pd
import os

# Assuming the MyApiClient class from above is defined here

async def main():
    # A list of item IDs to fetch
    item_ids_to_fetch = [f"id_{i}" for i in range(100)]

    # Instantiate the client
    client = MyApiClient()

    print("Starting API calls...")
    
    # Fetch all items, with a maximum of 10 concurrent requests
    # The results will be saved to 'results.csv' every 50 items.
    all_results = await client.get_items_with_ids(
        inputs=item_ids_to_fetch,
        save=True,
        save_interval=50,
        concurrent_requests=10
    )

    # Ensure the session is closed gracefully
    await client.close()

    print(f"Finished. Total results processed: {len(all_results)}")
    # The `all_results` variable contains a list of tuples: [(item_id, result), ...]

if __name__ == "__main__":
    # Initialize the results file with a header before starting
    if not os.path.exists('results.csv'):
        pd.DataFrame(columns=['id', 'name', 'value']).to_csv('results.csv', index=False)
    
    asyncio.run(main())

Class References

ApiBase The abstract base class for creating an API client.

Public Methods async def get_items(self, inputs: list, save: bool = False, save_interval: int = 50000, concurrent_requests: int = 5) -> list

Fetches multiple items concurrently from a list of inputs.

Returns: A list of processed results.

async def get_items_with_ids(self, inputs: list | dict, save: bool = False, save_interval: int = 50000, concurrent_requests: int = 5) -> list

Similar to get_items, but associates each result with its original input ID.

Returns: A list of tuples, where each tuple is (item_id, result).

async def close(self)

Closes the underlying aiohttp.ClientSession. It's important to call this when you are done to release resources.

Abstract Methods (to be implemented by subclass) get_item(self, item)

transform_output(self, output)

save_func(self, results: list)

⚠️ Error Handling The module will automatically handle network errors and common HTTP status codes.

If a 429 Too Many Requests status is received, a RateLimitError is raised, and processing will stop.

If a 401 Unauthorized status is received, an APIkeyError is raised.

Other client or server errors are logged, and the corresponding item will have a result of None.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.4

Sep 18, 2025

0.3.2

Sep 10, 2025

0.3.1

Sep 10, 2025

0.3.0

Sep 10, 2025

0.2.9

Sep 10, 2025

0.2.8

Sep 9, 2025

0.2.7

Sep 8, 2025

0.2.6

Sep 5, 2025

0.2.5

Sep 3, 2025

This version

0.2.4

Sep 3, 2025

0.2.3

Sep 2, 2025

0.2.2

Sep 2, 2025

0.2.1

Sep 2, 2025

0.2.0

Sep 2, 2025

0.1.9

Sep 2, 2025

0.1.8

Sep 1, 2025

0.1.7

Sep 1, 2025

0.1.6

Sep 1, 2025

0.1.5

Sep 1, 2025

0.1.4

Sep 1, 2025

0.1.3

Sep 1, 2025

0.1.2

Aug 29, 2025

0.1.1

Aug 29, 2025

0.1.0

Aug 29, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sibr_api-0.2.4.tar.gz (8.6 kB view details)

Uploaded Sep 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sibr_api-0.2.4-py3-none-any.whl (9.6 kB view details)

Uploaded Sep 3, 2025 Python 3

File details

Details for the file sibr_api-0.2.4.tar.gz.

File metadata

Download URL: sibr_api-0.2.4.tar.gz
Upload date: Sep 3, 2025
Size: 8.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.4 CPython/3.13.5 Darwin/24.6.0

File hashes

Hashes for sibr_api-0.2.4.tar.gz
Algorithm	Hash digest
SHA256	`b5cb00a52588e92b49ba25dd8b786bf7efc5449b5c0baacdd383d9a7c3d4eedd`
MD5	`99d7d5408246a9b7359fbfa04c0c6a0a`
BLAKE2b-256	`323f85ca238d096981571c1271861bc6cda5d36c790d206711a109b22248958d`

See more details on using hashes here.

File details

Details for the file sibr_api-0.2.4-py3-none-any.whl.

File metadata

Download URL: sibr_api-0.2.4-py3-none-any.whl
Upload date: Sep 3, 2025
Size: 9.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.4 CPython/3.13.5 Darwin/24.6.0

File hashes

Hashes for sibr_api-0.2.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bf5eda3afb2fb5c307fc8ff34ed0e1780ea9322497849d3232e14b0474652fb6`
MD5	`61f00b444f933808b65cd78f8eb013e3`
BLAKE2b-256	`779a8dd338341b86dd95c0f455aeb3dcb42168281790175f5570e6ba5216dcc4`

See more details on using hashes here.

sibr-api 0.2.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Asynchronous API Client Base Module

Overview

✨ Key Features

⚙️ Installation & Prerequisites

How to use

Step 1: Create a Subclass

Step 2: Implement the Abstract Methods

Step 3: Run the Client

Class References

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes