Skip to main content

An asyncio PostgreSQL driver that returns numpy arrays

Project description

GitHub Actions status https://img.shields.io/pypi/v/asyncpg-rkt.svg

asyncpg-rkt is a fork of asyncpg, a database interface library designed specifically for PostgreSQL and Python/asyncio. asyncpg is an efficient, clean implementation of PostgreSQL server binary protocol for use with Python’s asyncio framework. You can read more about asyncpg in an introductory blog post.

asyncpg-rkt extends asyncpg as follows:

  • Backward compatible with the origin.

  • It is possible to set the numpy dtype for the fetched query.

  • Such “typed” queries return numpy arrays instead of lists of Record objects.

  • We construct numpy arrays directly from the low-level PostgreSQL protocol, without materializing any Python objects.

  • Although, we support object fields, too.

  • The time from receiving the response from PostgreSQL server until Connection.fetch() returns is ~20x less. This is because we avoid the overhead of dealing with Python objects in the result.

  • We return ravel()-ed indexes of nulls while writing NaN-s/NaT-s at the corresponding places in the array.

  • There is an option to return data by column vs. by row.

asyncpg-rkt provides the best performance when there are thousands of rows returned and the field types map to numpy.

Read the blog post with the introduction.

asyncpg-🚀 requires Python 3.8 or later and is supported for PostgreSQL versions 9.5 to 14. Older PostgreSQL versions or other databases implementing the PostgreSQL protocol may work, but are not being actively tested.

Documentation

The project documentation can be found here.

See below about how to use the fork’s special features.

Performance

In our testing asyncpg is, on average, 3x faster than psycopg2 (and its asyncio variant – aiopg).

https://raw.githubusercontent.com/athenianco/asyncpg-rkt/master/performance.png

The above results are a geometric mean of benchmarks obtained with PostgreSQL client driver benchmarking toolbench in November 2020 (click on the chart to see full details).

Further improvement from writing numpy arrays is ~20x:

https://raw.githubusercontent.com/athenianco/asyncpg-rkt/master/benchmark_20220522_142813.svg https://raw.githubusercontent.com/athenianco/asyncpg-rkt/master/benchmark_20220522_143838.svg

Features

asyncpg implements PostgreSQL server protocol natively and exposes its features directly, as opposed to hiding them behind a generic facade like DB-API.

This enables asyncpg to have easy-to-use support for:

  • prepared statements

  • scrollable cursors

  • partial iteration on query results

  • automatic encoding and decoding of composite types, arrays, and any combination of those

  • straightforward support for custom data types

Installation

asyncpg-🚀 is available on PyPI and requires numpy 1.21+. Use pip to install:

$ pip install asyncpg-rkt

Basic Usage

import asyncio
import asyncpg
from asyncpg.rkt import set_query_dtype
import numpy as np

async def run():
    conn = await asyncpg.connect(user='user', password='password',
                                 database='database', host='127.0.0.1')
    dtype = np.dtype([
        ("a", int),
        ("b", "datetime64[s]"),
    ])
    array, nulls = await conn.fetch(
        set_query_dtype('SELECT * FROM mytable WHERE id = $1', dtype),
        10,
    )
    await conn.close()

loop = asyncio.get_event_loop()
loop.run_until_complete(run())

License

asyncpg-🚀 is developed and distributed under the Apache 2.0 license, just like the original project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

asyncpg-rkt-0.27.3.tar.gz (884.9 kB view hashes)

Uploaded Source

Built Distributions

asyncpg_rkt-0.27.3-cp311-cp311-win_amd64.whl (551.2 kB view hashes)

Uploaded CPython 3.11 Windows x86-64

asyncpg_rkt-0.27.3-cp311-cp311-musllinux_1_1_x86_64.whl (3.5 MB view hashes)

Uploaded CPython 3.11 musllinux: musl 1.1+ x86-64

asyncpg_rkt-0.27.3-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.0 MB view hashes)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

asyncpg_rkt-0.27.3-cp311-cp311-macosx_10_9_x86_64.whl (670.2 kB view hashes)

Uploaded CPython 3.11 macOS 10.9+ x86-64

asyncpg_rkt-0.27.3-cp310-cp310-win_amd64.whl (559.6 kB view hashes)

Uploaded CPython 3.10 Windows x86-64

asyncpg_rkt-0.27.3-cp310-cp310-musllinux_1_1_x86_64.whl (3.4 MB view hashes)

Uploaded CPython 3.10 musllinux: musl 1.1+ x86-64

asyncpg_rkt-0.27.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.8 MB view hashes)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

asyncpg_rkt-0.27.3-cp310-cp310-macosx_10_9_x86_64.whl (686.5 kB view hashes)

Uploaded CPython 3.10 macOS 10.9+ x86-64

asyncpg_rkt-0.27.3-cp39-cp39-win_amd64.whl (574.2 kB view hashes)

Uploaded CPython 3.9 Windows x86-64

asyncpg_rkt-0.27.3-cp39-cp39-musllinux_1_1_x86_64.whl (3.4 MB view hashes)

Uploaded CPython 3.9 musllinux: musl 1.1+ x86-64

asyncpg_rkt-0.27.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.9 MB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

asyncpg_rkt-0.27.3-cp39-cp39-macosx_10_9_x86_64.whl (704.2 kB view hashes)

Uploaded CPython 3.9 macOS 10.9+ x86-64

asyncpg_rkt-0.27.3-cp38-cp38-win_amd64.whl (574.7 kB view hashes)

Uploaded CPython 3.8 Windows x86-64

asyncpg_rkt-0.27.3-cp38-cp38-musllinux_1_1_x86_64.whl (3.8 MB view hashes)

Uploaded CPython 3.8 musllinux: musl 1.1+ x86-64

asyncpg_rkt-0.27.3-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.3 MB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ x86-64

asyncpg_rkt-0.27.3-cp38-cp38-macosx_10_9_x86_64.whl (694.8 kB view hashes)

Uploaded CPython 3.8 macOS 10.9+ x86-64

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page