sqlakeyset

offset-free paging for sqlalchemy

This library implements keyset-based paging for SQLAlchemy (both ORM and core).

This library has been tested with PostgreSQL and MariaDB/MySQL. It should work with other SQLAlchemy-supported databases too, provided they support row-value comparison syntax such as row(...) (see Limitations below).

Background

A lot of people use SQL’s OFFSET syntax to implement paging of query results. The trouble is that the more pages you get through, the slower your query gets, because the database still has to step over every skipped row. Also, if the results you’re paging through change frequently, it’s possible to skip over or repeat results between pages. Keyset paging avoids these problems: selecting even the millionth page is as fast as selecting the first.
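
To make the repeat-a-row problem concrete, here is a minimal sketch that uses Python's built-in sqlite3 module rather than sqlakeyset itself. The book table and the offset_page and keyset_page helpers are names invented for this illustration, and the keyset condition is written out by hand in expanded form instead of being generated by the library.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE book (id INTEGER PRIMARY KEY, title TEXT NOT NULL)")
conn.executemany(
    "INSERT INTO book (id, title) VALUES (?, ?)",
    [(1, "B"), (2, "D"), (3, "F"), (4, "H")],
)

PER_PAGE = 2


def offset_page(page_number):
    """OFFSET paging: skip a fixed number of rows, whatever they are."""
    return conn.execute(
        "SELECT title, id FROM book ORDER BY title, id LIMIT ? OFFSET ?",
        (PER_PAGE, page_number * PER_PAGE),
    ).fetchall()


def keyset_page(after=None):
    """Keyset paging: continue strictly after the last (title, id) seen."""
    if after is None:
        return conn.execute(
            "SELECT title, id FROM book ORDER BY title, id LIMIT ?",
            (PER_PAGE,),
        ).fetchall()
    title, book_id = after
    # Expanded equivalent of the row comparison (title, id) > (?, ?).
    return conn.execute(
        "SELECT title, id FROM book "
        "WHERE title > ? OR (title = ? AND id > ?) "
        "ORDER BY title, id LIMIT ?",
        (title, title, book_id, PER_PAGE),
    ).fetchall()


first_page = keyset_page()  # the same rows OFFSET paging returns for page 0

# A new row arrives that sorts before everything already shown.
conn.execute("INSERT INTO book (id, title) VALUES (5, 'A')")

offset_second = offset_page(1)               # starts with "D" again
keyset_second = keyset_page(first_page[-1])  # continues after "D"
```

After the insert, the OFFSET version shows "D" a second time because every row has shifted down by one position, while the keyset version carries on from the last key it saw.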

sqlakeyset seems to work well so far, but is in its early stages of development, and as such, is alpha software. Treat it accordingly.

Getting Started

Here’s how it works with a typical ORM query:

from sqlakeyset import get_page
from sqlbag import S

from models import Book

with S('postgresql:///books') as s:  # create a session
    q = s.query(Book).order_by(Book.author, Book.title, Book.id)  # the ordering defines the paging key

    # gets the first page
    page1 = get_page(q, per_page=20)

    # gets the key for the next page
    next_page = page1.paging.next

    # gets the second page
    page2 = get_page(q, per_page=20, page=next_page)

    # returning to the first page, getting the key
    previous_page = page2.paging.previous

    # the first page again, backwards from the previous page
    page1 = get_page(q, per_page=20, page=previous_page)

    # what if new items were added at the start?
    if page1.paging.has_previous:

        # go back even further
        previous_page = page1.paging.previous
        page1 = get_page(q, per_page=20, page=previous_page)
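
A common follow-up is walking through every page in order. The helper below is a sketch, not part of sqlakeyset: iter_pages and fake_fetch are hypothetical names, and it assumes the paging object exposes a has_next flag mirroring the has_previous flag used above. The stand-in fetch function exists only so the sketch runs without a database; it uses plain list positions as markers, which real keyset markers are not.

```python
from types import SimpleNamespace


def iter_pages(fetch_page):
    """Yield pages in order, following each page's paging.next marker.

    fetch_page is any callable taking a marker (None for the first page)
    and returning an object with a .paging attribute.
    """
    page = fetch_page(None)
    while True:
        yield page
        if not page.paging.has_next:  # assumed attribute, see the note above
            return
        page = fetch_page(page.paging.next)


# Stand-in for get_page: pages through a list and mimics only the
# attributes that iter_pages reads.
ITEMS = list(range(1, 6))


def fake_fetch(marker, per_page=2):
    start = 0 if marker is None else marker
    page = SimpleNamespace(items=ITEMS[start:start + per_page])
    page.paging = SimpleNamespace(
        next=start + per_page,
        has_next=start + per_page < len(ITEMS),
    )
    return page


collected = [page.items for page in iter_pages(fake_fetch)]
```

With the query from the example above, the call would look something like iter_pages(lambda marker: get_page(q, per_page=20, page=marker)).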

Under the Hood

sqlakeyset does the following to your query in order to get the paged contents:

- Adds a where clause, to get only rows after the specified row key.
- If getting the previous page, reverses the order by direction in order to get the rows before the specified row key.
- Adds a limit clause, to fetch only enough items to fill the page, plus one additional row. This extra row is used only to test for the existence of further pages after the current one, and is discarded from the results.
- Returns the page contents as an ordinary list that has an attached .paging attribute with the paging information for this and related pages.
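
The steps above can be sketched by hand with Python's built-in sqlite3 module. This is an illustration of the technique, not sqlakeyset's actual code: the fetch_page function, the book table and the dictionary stored in .paging are all invented here, and the real library attaches a richer Paging object.

```python
import sqlite3


class Page(list):
    """A plain list with an extra .paging attribute (simplified stand-in)."""

    paging = None


def fetch_page(conn, per_page, place=None, backwards=False):
    # Step 2: flip the ORDER BY direction when paging backwards.
    direction = "DESC" if backwards else "ASC"
    comparison = "<" if backwards else ">"
    sql = "SELECT title, id FROM book"
    params = []
    if place is not None:
        # Step 1: keep only rows beyond the given (title, id) key.
        title, book_id = place
        sql += f" WHERE title {comparison} ? OR (title = ? AND id {comparison} ?)"
        params += [title, title, book_id]
    # Step 3: ask for one row more than the page needs.
    sql += f" ORDER BY title {direction}, id {direction} LIMIT ?"
    params.append(per_page + 1)
    rows = conn.execute(sql, params).fetchall()
    has_further = len(rows) > per_page  # the extra row only signals more pages
    rows = rows[:per_page]
    if backwards:
        rows.reverse()  # present the rows in the query's original order
    # Step 4: return a list that carries its paging information.
    page = Page(rows)
    page.paging = {"has_further": has_further, "backwards": backwards}
    return page


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE book (id INTEGER PRIMARY KEY, title TEXT NOT NULL)")
conn.executemany(
    "INSERT INTO book (id, title) VALUES (?, ?)",
    [(i, f"Title {i}") for i in range(1, 6)],
)

page1 = fetch_page(conn, per_page=2)
page2 = fetch_page(conn, per_page=2, place=page1[-1])
back_to_page1 = fetch_page(conn, per_page=2, place=page2[0], backwards=True)
```

Paging backwards from the first row of the second page lands on the first page again, and its has_further flag is false because no earlier rows exist.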

Page objects

Paged items/rows are returned in a Page object, which is a vanilla Python list with an attached Paging object holding the paging information.

Properties such as next and previous return a 2-tuple containing the ordering key for the row and a boolean giving the direction: False for forwards, True for backwards.

For a query ordered by title and id only (one column fewer than the example above, which also orders by author), the 2-tuple specifying the second page might look like:

('Catch 22', 123), False

The False means the query will fetch the page after the row containing Catch 22. The inner tuple contains two elements, title and id, to match the order by clause of the query.

The page before this row would be specified as:

('Catch 22', 123), True

The first and last pages are fetched with None instead of a tuple, so for the first page (this is also the default if the page parameter is not specified):

None, False

And the last page:

None, True
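
The four kinds of marker can be tried out on an in-memory list. The sketch below is only a model of the semantics described here, assuming rows that are already sorted and are their own (title, id) keys; rows_for_marker and the book list are invented for the illustration.

```python
def rows_for_marker(sorted_rows, per_page, marker=(None, False)):
    """Pick one page out of an already sorted list using (place, backwards)."""
    place, backwards = marker
    if not backwards:
        # Forwards: rows strictly after place; None means "from the start".
        candidates = [r for r in sorted_rows if place is None or r > place]
        return candidates[:per_page]
    # Backwards: rows strictly before place; None means "from the end".
    candidates = [r for r in sorted_rows if place is None or r < place]
    return candidates[-per_page:]


books = [  # (title, id) pairs, sorted by title and then id
    ("Animal Farm", 7),
    ("Catch 22", 123),
    ("Dune", 42),
    ("Emma", 9),
    ("Ulysses", 5),
]

after_catch22 = rows_for_marker(books, 2, (("Catch 22", 123), False))
before_catch22 = rows_for_marker(books, 2, (("Catch 22", 123), True))
first_page = rows_for_marker(books, 2, (None, False))
last_page = rows_for_marker(books, 2, (None, True))
```

Python compares tuples element by element, which is the same lexicographic ordering a multi-column order by clause produces, so the comparison r > place plays the role of the generated where clause.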

Keyset Serialization

You will probably want to turn these keysets/bookmarks into strings for passing around. sqlakeyset includes code to do this. To get a serialized bookmark, add the prefix bookmark_ to the name of the property that holds the keyset you want.

Most commonly you’ll want next and previous, so:

>>> page.paging.bookmark_previous
<i:1~i:2015~s:Bad Blood~i:34
>>> page.paging.bookmark_next
>i:1~i:2014~s:Shake It Off~i:31

sqlakeyset uses Python's csv row serializer to serialize the bookmark values (using ~ instead of , as the separator). Direction is indicated by > (forwards/next) or < (backwards/previous) at the start of the string.
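
The format shown above can be approximated in a few lines. The following is a sketch written for this README, not sqlakeyset's own serializer: serialize_bookmark and unserialize_bookmark are hypothetical names, only int and str values are handled, and the i: and s: prefixes are inferred from the example output.

```python
import csv
import io


def serialize_bookmark(marker):
    """Turn a (place, backwards) marker into a string such as '>i:1~s:abc'."""
    place, backwards = marker
    fields = []
    for value in place or ():
        if isinstance(value, int) and not isinstance(value, bool):
            fields.append(f"i:{value}")
        elif isinstance(value, str):
            fields.append(f"s:{value}")
        else:
            raise TypeError("this sketch handles only int and str values")
    buffer = io.StringIO()
    # "~" replaces the usual comma; csv quotes any value that contains "~".
    csv.writer(buffer, delimiter="~").writerow(fields)
    body = buffer.getvalue().rstrip("\r\n")  # drop the csv line terminator
    return ("<" if backwards else ">") + body


def unserialize_bookmark(text):
    """Inverse of serialize_bookmark."""
    backwards = text.startswith("<")
    body = text[1:]
    if not body:
        return None, backwards  # no key: first or last page
    fields = next(csv.reader([body], delimiter="~"))
    place = tuple(int(f[2:]) if f.startswith("i:") else f[2:] for f in fields)
    return place, backwards


bookmark = serialize_bookmark(((1, 2014, "Shake It Off", 31), False))
restored = unserialize_bookmark(bookmark)
```

Going through csv rather than a plain join means a value that itself contains the ~ separator still round-trips, because the writer quotes it and the reader removes the quotes.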

Limitations

- Golden Rule: Always ensure your keysets are unique per row. If you violate this condition, you risk skipped rows and other nasty problems. The simplest way to do this is to always include your primary key column(s) at the end of your ordering columns.
- The built-in keyset serialization only handles basic data/column types so far (strings, ints, floats, datetimes, dates, booleans, and a few others). It can be extended to serialize more advanced types as necessary (documentation on this is forthcoming).
- Known MariaDB/MySQL issue: To perform comparisons, sqlakeyset generates row-value syntax similar to the following:

where row('a', 1) > row(name, id)

  Indexing support for this syntax in MariaDB/MySQL appears to be faulty, so paging through large tables may perform poorly. PostgreSQL, by contrast, supports indexing for this syntax correctly.

- sqlakeyset is alpha software: please be aware that it is in an early stage of development. That said, please use it! Your feedback is most welcome (good or bad).
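
The Golden Rule is easiest to appreciate by breaking it on purpose. This sketch pages through an in-memory list with a hypothetical walk helper, once using the title alone as the key and once using title plus id; it models the "rows after the last key" logic rather than calling sqlakeyset.

```python
rows = [  # (title, id) pairs, already sorted by title and then id
    ("Dune", 1),
    ("Emma", 2),
    ("Emma", 3),
    ("Emma", 4),
    ("Ulysses", 5),
]


def walk(rows, key_of, per_page):
    """Collect every row reached by repeatedly asking for rows after the last key."""
    seen, last_key = [], None
    while True:
        page = [r for r in rows if last_key is None or key_of(r) > last_key]
        page = page[:per_page]
        if not page:
            return seen
        seen.extend(page)
        last_key = key_of(page[-1])


# Key is the title alone: three rows share the key "Emma".
by_title_only = walk(rows, lambda r: r[0], per_page=2)

# Key is (title, id): the primary key at the end makes every key unique.
by_title_and_id = walk(rows, lambda r: (r[0], r[1]), per_page=2)
```

With the title-only key, the first page ends on one of the "Emma" rows, and the next page asks for titles greater than "Emma", so the remaining two "Emma" rows are never returned.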

Documentation

sqlakeyset is in early alpha and documentation other than this README is scarce so far. We are working on remedying this. Watch this space.

Installation

Assuming you have pip installed, all you need to do is install as follows:

$ pip install sqlakeyset

This will install sqlakeyset, and also sqlalchemy if it is not already installed. You’ll also need the database driver for your chosen database.

Release history

2.0.1708907391 (Feb 26, 2024)
2.0.1708730727 (Feb 23, 2024)
2.0.1706067133 (Jan 24, 2024)
2.0.1701229837 (Nov 29, 2023)
2.0.1695177552 (Sep 20, 2023)
2.0.1695093069 (Sep 19, 2023)
2.0.1691149549 (Aug 4, 2023)
2.0.1687912386 (Jun 28, 2023)
2.0.1684285512 (May 17, 2023)
2.0.1681021882 (Apr 9, 2023)
2.0.1680651691 (Apr 4, 2023)
2.0.1680321678 (Apr 1, 2023)
1.0.1679485999 (Mar 22, 2023)
1.0.1679375467 (Mar 21, 2023)
1.0.1679209451 (Mar 19, 2023)
1.0.1659142803 (Jul 30, 2022)
1.0.1655946639 (Jun 23, 2022)
1.0.1655855963 (Jun 21, 2022)
1.0.1650280980 (Apr 18, 2022)
1.0.1650254349 (Apr 18, 2022)
1.0.1629029818 (Aug 15, 2021)
1.0.1628549850 (Aug 9, 2021)
1.0.1626139718 (Jul 13, 2021)
1.0.1616300210 (Mar 21, 2021)
1.0.1616299134 (Mar 21, 2021)
1.0.1615859905 (Mar 16, 2021)
1.0.1611286921 (Jan 22, 2021)
1.0.1611275164 (Jan 22, 2021)
1.0.1602495765 (Oct 12, 2020)
1.0.1602223605 (Oct 9, 2020)
1.0.1602223202 (Oct 9, 2020)
1.0.1595476224 (Jul 23, 2020)
1.0.1594957855 (Jul 17, 2020)
1.0.1594881885 (Jul 16, 2020)
0.1.1579837191 (Jan 24, 2020)
0.1.1576554700 (Dec 17, 2019)
0.1.1576552828 (Dec 17, 2019)
0.1.1559103842 (May 29, 2019)
0.1.1558438465 (May 21, 2019)
0.1.1485813522 (Jan 30, 2017)
0.1.1472125695 (Aug 25, 2016), this version
