Skip to main content

Memory-efficient container for list-like objects

Project description

PyPI-Server Unit tests

CompressedList Implementation in Python

A Python implementation of the CompressedList class from R/Bioconductor for memory-efficient list-like objects.

CompressedList is a memory-efficient container for list-like objects. Instead of storing each list element separately, it concatenates all elements into a single vector-like object and maintains information about where each original element begins and ends. This approach is significantly more memory-efficient than standard lists, especially when dealing with many list elements.

Install

To get started, install the package from PyPI

pip install compressed-lists

Usage

from compressed_lists import CompressedIntegerList, CompressedStringList, Partitioning

# Create a CompressedIntegerList
int_data = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
names = ["A", "B", "C"]
int_list = CompressedIntegerList.from_list(int_data, names)

# Access elements
print(int_list[0])      # [1, 2, 3]
print(int_list["B"])    # [4, 5]
print(int_list[1:3])    # Slice of elements

# Apply a function to each element
squared = int_list.lapply(lambda x: [i**2 for i in x])
print(squared[0])       # [1, 4, 9]

# Convert to a regular Python list
regular_list = int_list.to_list()

# Create a CompressedStringList from lengths
import biocutils as ut
char_data = ut.StringList(["apple", "banana", "cherry", "date", "elderberry", "fig"])

char_list = CompressedStringList(char_data, partitioning=Partitioning.from_lengths([2,3,1]))
print(char_list)

Partitioning

The Partitioning class handles the information about where each element begins and ends in the concatenated data. It allows for efficient extraction of elements without storing each element separately.

from compressed_lists import Partitioning

# Create partitioning from end positions
ends = [3, 5, 10]
names = ["A", "B", "C"]
part = Partitioning(ends, names)

# Get partition range for an element
start, end = part[1]  # Returns (3, 5)

[!NOTE]

Check out the documentation for available compressed list implementations and extending CompressedLists to custom data types.

Note

This project has been set up using BiocSetup and PyScaffold.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

compressed_lists-0.4.1.tar.gz (36.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

compressed_lists-0.4.1-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file compressed_lists-0.4.1.tar.gz.

File metadata

  • Download URL: compressed_lists-0.4.1.tar.gz
  • Upload date:
  • Size: 36.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for compressed_lists-0.4.1.tar.gz
Algorithm Hash digest
SHA256 7aa53d23105ebd600424807a9019a081485312a4d470d3a4be9ca6d38fd3bec2
MD5 f1eff245345d5e8623e3c23a1babe30b
BLAKE2b-256 074a7a213ef3c714156a5ff880f336fce164a326fb41feb2fee05aec81ad1861

See more details on using hashes here.

Provenance

The following attestation bundles were made for compressed_lists-0.4.1.tar.gz:

Publisher: publish-pypi.yml on BiocPy/compressed-lists

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file compressed_lists-0.4.1-py3-none-any.whl.

File metadata

File hashes

Hashes for compressed_lists-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 cbce1a04bfe16dd024e9a4ef3b68afedef2181bc2a8fc96ad6fba0484af48b22
MD5 b8a705ab4540f4b47373ce149ccb17c5
BLAKE2b-256 2ab87a4ec7b80f999a230c0b3c682eb988fe1b0b3a24d9581168762c6a78fc91

See more details on using hashes here.

Provenance

The following attestation bundles were made for compressed_lists-0.4.1-py3-none-any.whl:

Publisher: publish-pypi.yml on BiocPy/compressed-lists

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page