Skip to main content

Contains a variety of ordered structures, in particular a SkipList.

Project description

SkipList

This project contains a SkipList implementation in C++ with Python bindings.

A SkipList behaves as an always sorted list with, typically, O(log(n)) cost for insertion, look-up and removal. This makes it ideal for such operations as computing the rolling median of a large dataset.

See the full documentation on this project at ReadTheDocs.

A SkipList is implemented as a singly linked list of ordered nodes where each node participates in a subset of, sparser, linked lists. These additional 'sparse' linked lists provide rapid indexing and mutation of the underlying linked list. It is a probabilistic data structure using a random function to determine how many 'sparse' linked lists any particular node participates in. As such SkipList is an alternative to binary tree, Wikipedia has a introductory page on SkipLists.

An advantage claimed for SkipLists are that the insert and remove logic is simpler (however I do not subscribe to this). The drawbacks of a SkipList include its larger space requirements and its O(log(N)) lookup behaviour compared to other, more restricted and specialised, data structures that may have either faster runtime behaviour or lower space requirements or both.

This project contains a SkipList implementation in C++ with Python bindings with:

  • No capacity restrictions apart from available memory.
  • Works with any C++ type that has meaningful comparison operators.
  • The C++ SkipList can be compiled as thread safe.
  • The Python SkipList is thread safe.
  • The SkipList has exhaustive internal integrity checks.
  • Python SkipLists can be long/float/bytes/object types, the latter can have user defined comparison functions.
  • With Python 3.8+ SkipLists can be combined with the multiprocessing.shared_memory module for concurrent operation on large arrays. For example concurrent rolling medians which speed up near linearly with the number of cores.
  • The implementation is extensively performance tested in C++ and Python.

There are a some novel features to this implementation:

Credits

Originally written by Paul Ross with credits to: Wilfred Hughes (AHL), Luke Sewell (AHL) and Terry Tsantagoeds (AHL).

Installation

C++

This SkipList requires:

  • A C++20 compiler.
  • -I<skiplist>/src/cpp as an include path.
  • <skiplist>/src/cpp/SkipList.cpp to be compiled/linked.
  • The macro SKIPLIST_THREAD_SUPPORT set if you want a thread safe SkipList using C++ mutexes.

Python

This SkipList version supports Python 3.10, 3.11, 3.12, 3.13, 3.14 (and, probably, some earlier Python 3 versions).

From PyPi

$ pip install orderedstructs

From source

$ git clone https://github.com/paulross/skiplist.git
$ cd <skiplist>
$ python setup.py install

Testing

This SkipList has extensive tests for correctness and performance.

C++

To run all the C++ functional and performance tests:

$ cd <skiplist>/src/cpp
$ make release
$ ./SkipList_R.exe

To run the C++ functional tests with agressive internal integrity checks:

$ cd <skiplist>/src/cpp
$ make debug
$ ./SkipList_D.exe

To run all the C++ functional and performance tests for a thread safe SkipList:

$ cd <skiplist>/src/cpp
$ make release CXXFLAGS=-DSKIPLIST_THREAD_SUPPORT
$ ./SkipList_R.exe

Python

Testing requires pytest and hypothesis:

To run all the C++ functional and performance tests:

$ cd <skiplist>
$ py.test -vs tests/

Examples

Here are some examples of using a SkipList in your code:

C++

#include "SkipList.h"
    
// Declare with any type that has sane comparison.
OrderedStructs::SkipList::HeadNode<double> sl;

sl.insert(42.0);
sl.insert(21.0);
sl.insert(84.0);

sl.has(42.0) // true
sl.size()    // 3
sl.at(1)     // 42.0, throws OrderedStructs::SkipList::IndexError if index out of range

sl.remove(21.0); // throws OrderedStructs::SkipList::ValueError if value not present

sl.size()    // 2
sl.at(1)     // 84.0

The C++ SkipList is thread safe when compiled with the macro SKIPLIST_THREAD_SUPPORT, then a SkipList can then be shared across threads:

#include <thread>
#include <vector>

#include "SkipList.h"

void do_something(OrderedStructs::SkipList::HeadNode<double> *pSkipList) {
    // Insert/remove items into *pSkipList
    // Read items inserted by other threads.
}

OrderedStructs::SkipList::HeadNode<double> sl;
std::vector<std::thread> threads;

for (size_t i = 0; i < thread_count; ++i) {
    threads.push_back(std::thread(do_something, &sl));
}
for (auto &t: threads) {
    t.join();
}
// The SkipList now contains the totality of the thread actions.

Python

An example of using a SkipList of always ordered floats:

import orderedstructs

# Declare with a type. Supported types are long/float/bytes/object.
sl = orderedstructs.SkipList(float)

sl.insert(42.0)
sl.insert(21.0)
sl.insert(84.0)

sl.has(42.0) # True
sl.size()    # 3
sl.at(1)     # 42.0

sl.has(42.0) # True
sl.size()    # 3
sl.at(1)     # 42.0, raises IndexError if index out of range

sl.remove(21.0); # raises ValueError if value not present

sl.size()    # 2
sl.at(1)     # 84.0

The Python SkipList can be used with user defined objects with a user defined sort order. In this example the last name of the person takes precedence over the first name:

import functools

@functools.total_ordering
class Person:
    """Simple example of ordering based on last name/first name."""
    def __init__(self, first_name, last_name):
        self.first_name = first_name
        self.last_name = last_name

    def __eq__(self, other):
        try:
            return self.last_name == other.last_name and self.first_name == other.first_name
        except AttributeError:
            return NotImplemented

    def __lt__(self, other):
        try:
            return self.last_name < other.last_name or self.first_name < other.first_name
        except AttributeError:
            return NotImplemented

    def __str__(self):
        return '{}, {}'.format(self.last_name, self.first_name)

import orderedstructs

sl = orderedstructs.SkipList(object)

sl.insert(Person('Peter', 'Pan'))
sl.insert(Person('Alan', 'Pan'))
assert sl.size() == 2
assert str(sl.at(0)) == 'Pan, Alan' 
assert str(sl.at(1)) == 'Pan, Peter' 

The Python SkipList is thread safe when using any acceptable Python type even if that type has user defined comparison methods. This uses Pythons mutex machinery which is independent of C++ mutexes.

History

0.4.5 (2026-04-20)

  • Performance of low level shared memory operations.
  • Add support for Python 3.15
  • Support for Python 3.10, 3.11, 3.12, 3.13, 3.14, 3.15.
  • Development Status :: 5 - Production/Stable

0.4.4 (2026-02-15)

  • Demonstrate reverse ordering with Python objects.
  • Add further performance analysis of rolling median with multiprocessing shared memory.
  • Identify an issue with mixing pure Python code and C code in the orderedstructs package. Fixing this is known but would mean an API change.

0.4.3 (2026-02-09)

  • Add minor gridlines to log plots.
  • Fix git corruption.

0.4.0 (2026-01-31)

  • Support for Python 3.10, 3.11, 3.12, 3.13, 3.14.
  • End support for Python 3.8, 3.9, but they will work just fine.
  • Improve the performance tests and documentation.
  • Python performance benchmarks are now automatically converted into Gnuplot .dat files.
  • Improve C++ rolling median code.
  • Reorganise C++ header files to be more conventional which also gives Doxygen a better grasp of the code.
  • Development Status :: 5 - Production/Stable

0.3.17 (2026-01-21)

  • Support for Python 3.8, 3.9, 3.10, 3.11, 3.12, 3.13, 3.14.
  • Development Status :: 5 - Production/Stable

0.3.16 (2025-04-12)

  • Improvements to performance documentation.
  • Minor fix for a linker issue.
  • Support for Python 3.8, 3.9, 3.10, 3.11, 3.12, 3.13.
  • Development Status :: 5 - Production/Stable

0.3.15 (2025-02-12)

0.3.14 (2024-08-02)

  • Support for Python 3.8, 3.9, 3.10, 3.11, 3.12, 3.13.
  • Drop support for Python 3.7.

0.3.13 (2023-09-05)

  • Documentation improvements.

0.3.12 (2023-08-29)

  • Minor fixes to the documentation and examples.

0.3.11 (2023-08-28)

  • Minor fixes missed in 0.3.10 release.

0.3.10 (2023-08-28)

  • Minor fixes missed in 0.3.9 release.

0.3.9 (2023-08-28)

  • Minor fixes missed in 0.3.8 release.

0.3.8 (2023-08-28)

  • Add cmake build.
  • Support for Python 3.7, 3.8, 3.9, 3.10, 3.11.
  • Remove mention of Python 2.

0.3.7 (2021-12-18)

  • Fix build on GCC (Issue #8).

0.3.6 (2021-12-18)

  • Add documentation on NaN in rolling median.
  • Add plots using shared_memory.
  • Add Python 3.10 support.

0.3.5 (2021-05-02)

  • Fix uncaught exception when trying to remove a NaN.

0.3.4 (2021-04-28)

  • Improve documentation mainly around multiprocessing.shared_memory and tests.

0.3.3 (2021-03-25)

  • Add Python benchmarks, fix some build issues.

0.3.2 (2021-03-18)

  • Fix lambda issues with Python 3.8, 3.9.

0.3.1 (2021-03-17)

  • Support Python 3.7, 3.8, 3.9.

0.3.0 (2017-08-18)

  • Public release.
  • Allows storing of PyObject* and rich comparison.

0.2.0

Python module now named orderedstructs.

0.1.0

Initial release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orderedstructs-0.4.5.tar.gz (66.5 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

orderedstructs-0.4.5-cp315-cp315-macosx_10_15_universal2.whl (93.8 kB view details)

Uploaded CPython 3.15macOS 10.15+ universal2 (ARM64, x86-64)

orderedstructs-0.4.5-cp314-cp314-macosx_10_15_universal2.whl (93.8 kB view details)

Uploaded CPython 3.14macOS 10.15+ universal2 (ARM64, x86-64)

orderedstructs-0.4.5-cp313-cp313-macosx_10_13_universal2.whl (93.4 kB view details)

Uploaded CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

orderedstructs-0.4.5-cp312-cp312-macosx_10_13_universal2.whl (93.4 kB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

orderedstructs-0.4.5-cp311-cp311-macosx_10_9_universal2.whl (94.0 kB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

orderedstructs-0.4.5-cp310-cp310-macosx_10_9_universal2.whl (94.0 kB view details)

Uploaded CPython 3.10macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file orderedstructs-0.4.5.tar.gz.

File metadata

  • Download URL: orderedstructs-0.4.5.tar.gz
  • Upload date:
  • Size: 66.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.11

File hashes

Hashes for orderedstructs-0.4.5.tar.gz
Algorithm Hash digest
SHA256 34a85215b59c2a29578600f19ccb4ed1f945f9a60ba20f965e4db7a5edb9a0c6
MD5 1420226c2cf4236e375b29fc5bfab233
BLAKE2b-256 b77d981335e6f091a93188c622382fe5f08189deb5b59388ed6809c06f494828

See more details on using hashes here.

File details

Details for the file orderedstructs-0.4.5-cp315-cp315-macosx_10_15_universal2.whl.

File metadata

File hashes

Hashes for orderedstructs-0.4.5-cp315-cp315-macosx_10_15_universal2.whl
Algorithm Hash digest
SHA256 18028d632dc6f83cf99d3376f5308fad912c0984ba221a57cc52a46c60c71ac2
MD5 d6295a6bf8ee9e1388501d1697b84d20
BLAKE2b-256 df754bee3e79d09fe0466f1df750d213973120edb999dc5af3d1edf78f72fc30

See more details on using hashes here.

File details

Details for the file orderedstructs-0.4.5-cp314-cp314-macosx_10_15_universal2.whl.

File metadata

File hashes

Hashes for orderedstructs-0.4.5-cp314-cp314-macosx_10_15_universal2.whl
Algorithm Hash digest
SHA256 3454dc5e33f217fb747510c21b4254a4e706e3b91209fa09aa52d030d080c4a1
MD5 d70192c66fabce8e1d605302ba97a51f
BLAKE2b-256 2d8c08aed94fae8a6ddc13a239cf2b83bbf2ce71c4e1a8696f28756a7fe7e6cc

See more details on using hashes here.

File details

Details for the file orderedstructs-0.4.5-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for orderedstructs-0.4.5-cp313-cp313-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 e4b1ca7c2ff609ca61125b130c68aca525d5774c52da2176d6ecb30e4d6e8834
MD5 731f2597beb2130a4cbc0ecc35aa7080
BLAKE2b-256 3606266cf7a18a080c8562d5b9b20566903294ec6a61639e003d0feb49be7716

See more details on using hashes here.

File details

Details for the file orderedstructs-0.4.5-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for orderedstructs-0.4.5-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 c1f83fd03e3587603e0e48fb64df8afe9da1ecf4076ac575d7830961456c4ab6
MD5 f6250b43f3a4e7824a0cec30f236192a
BLAKE2b-256 00d275e422c8cfc019ce146c5efc6b59d01188626a8315c6c961c9bfd8f3d95e

See more details on using hashes here.

File details

Details for the file orderedstructs-0.4.5-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for orderedstructs-0.4.5-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 e8537a51f14320bd761de8aa44f5e9144619abcfe03142ad107ba831e29834ad
MD5 2139b26f8e5c03cda66e00a0ffb52d09
BLAKE2b-256 e6841aa21538e4cd3c52107b580ac53596f04fae9195f5f0969417fcf0614841

See more details on using hashes here.

File details

Details for the file orderedstructs-0.4.5-cp310-cp310-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for orderedstructs-0.4.5-cp310-cp310-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 72c102a7c9da347e8d86dbbd3da5826b835d23e4f977fdad8013148ab79681c4
MD5 9fce7a1adcb6d38b1254f5eceb0878fc
BLAKE2b-256 cce2eb0c2bd38e3c6921a0f3f810ef15ba9b933e62736b63089c2732c7b50dd5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page