Skip to main content

Python package for string case formatting; implemented in Rust.

Project description

https://img.shields.io/pypi/v/rscase.svg https://github.com/sondrelg/rs-case/workflows/tests/badge.svg https://img.shields.io/badge/Python-v3.8-blue.svg https://img.shields.io/badge/Rust-v1.43.0--nightly-red.svg https://codecov.io/gh/sondrelg/rscase/branch/master/graph/badge.svg

This module offers a handful of case-formatting utility functions. It is a very simple Python package, written in Rust and implemented using pyo3 which offers you easy Rust bindings for the Python interpreter.

Installation

Install with pip using:

pip install rscase

Note: This package requires Rust nightly 2020-02-06 or an equivalent future release.

Usage

The package provides utility functions for generating strings formatted in several different case standards.

The case-standards and their functions are listed below.

Supported cases

Function

Format example

camel case

camel_case

camelCasedValue

snake case

snake_case

snake_cased_value

pascal case

pascal_case

PascalCasedValue

kebab case

kebab_case

kebab-cased-value

train case

train_case

TRAIN-CASED-VALUE

All functions are imported and accessed the same way:

>> [in] from rscase import rscase
>> [in] rscase.camel_case('this_is-a_Test')
>> [out] thisIsATest

If you want to use this package, please note that the case functions are written to successfully convert camel case and snake case to the remaining formats. Formatting train case to itself doesn’t really make sense, and the way I would use this would be to, e.g., serialize out response data to a camelCased format.

Benchmarking Performance

This repo is a bit of an experiment, and because the functions contained in this package only do some very simple string manipulation, they seem like they might actually be good candidates for Python vs Rust performance benchmarking.

To try and make this a fair comparison - to make sure we’re comparing apples to apples - I decided to test the Rust function snake_case (see the Rust function here) to an identical Python function. The Python version is shown below:

from rscase import rscase

test_string = "thisIsALongCamelCasedAlphabeticKey"

# Test functions

def original_snake_case():
    string = test_string
    new_string = ""
    dash = "-"
    for index in range(len(string)):
        if index == 0:
            new_string += string[index].lower()
        elif string[index] == dash:
            new_string += '_'
        elif string[index].upper() == string[index]:
            new_string += f'_{string[index]}'
        else:
            new_string += string[index]
    return new_string

def rust_snake_case():
    string = test_string
    return rscase.snake_case(string)

The main difference between the two functions, flow-wise, is only that Rust won’t let you just iterate over a string, so you have to create a vector of char’s instead - or at least that’s how I did it.

Results

After running the tests, the results seems to be pretty promising - in favor of the Rust implementation.

Reps

Rust Execution Time

Python Execution Time

Difference

1

18.30 us

14.20 us

0.78x*

10

55.20 us

114.20 us

2.07x

100

.49 ms

1.11 ms

2.27x

1000

4.88 ms

11.18 ms

2.28x

10 000

47.20 ms

109.13 ms

2.31x

100 000

.47 s

1.08 s

2.31x

1000 000

4.83 s

11.12 s

2.30x

10 000 000

46.67 s

109.27 s

2.34x

100 000 000

484 s

1102 s

2.28x

The results are pretty clear: after only 100 reps, the results seem to stabilize, and flatten out at around a 2.3x longer execution time for the Python implementation.

* the 1-rep result seems to show that Python actually outperforms Rust in the scenario that would normally actually matter. Since it makes sense that variance would be high when trying to measure something at the microsecond level I decided to run this individual scenario again, another one million times, to increase the sample size. With a larger sample, the average difference for 1 rep averages to 1.85x slower in Python, and the median is 1.88x. In short, the Rust implementation seems to outperform the Python across the board.

Benchmarking Performance - Update

Thanks to Thomas Hartmann for suggesting a significant performance improvement in the packaged Rust code.

Using some experimental features, we’re able to improve the performance of the Rust code considerably. The snake_case test from above is replicated below, with the performance difference settling at 5x the Python performance.

Reps

Rust Execution Time

Python Execution Time

Difference

1

10.70 us

15.20 us

1.42x

10

28.70 us

113.30 us

3.95x

100

.24 ms

1.11 ms

4.56x

1000

2.24 ms

11.28 ms

5.03x

10 000

22.16 ms

107.79 ms

4.86x

100 000

.24 s

1.09 s

4.44x

1000 000

2.21 s

11.02 s

4.99x

10 000 000

22.09 s

110.47 s

5.00x

100 000 000

222 s

1086 s

4.88x

Running the 1 rep scenario one million times, gives an average Rust execution time of 3.84 us compared to an average Python execution time of 12.61 us (~3.3x slower for Python).

This time around, I also decided to test the camel case implementations, as the logic does behave a bit differently:

Reps

Rust Execution Time

Python Execution Time

Difference

1

10.99 us

14.40 us

1.31x

10

39.79 us

106.90 us

2.69x

100

.25 ms

1.02 ms

4.07x

1000

2.40 ms

10.24 ms

4.26x

10 000

23.55 ms

100.17 ms

4.25x

100 000

.23 s

0.98 s

4.26x

1000 000

2.34 s

9.92 s

4.23x

10 000 000

23.23 s

98.91 s

4.26x

100 000 000

232 s

990 s

4.26x

Running the 1 rep scenario one million times, gives an average Rust execution time of 3.90 us compared to an average Python execution time of 11.48 us (almost ~3x slower for Python).

In summary, the benchmarked performed similarly, with Rust pulling ahead even more, for these two implementations. At the same time, there’s probably room for improvement for both implementations still, and probably especially for the Python one.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rscase-1.1.0.tar.gz (8.6 kB view details)

Uploaded Source

Built Distribution

rscase-1.1.0-cp38-cp38-win_amd64.whl (115.7 kB view details)

Uploaded CPython 3.8 Windows x86-64

File details

Details for the file rscase-1.1.0.tar.gz.

File metadata

  • Download URL: rscase-1.1.0.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.8.1

File hashes

Hashes for rscase-1.1.0.tar.gz
Algorithm Hash digest
SHA256 7982e66dee9623e53805fc282f5d9d3b622b67837524d8e4e7d8ceb97a115950
MD5 d3030dbd46002df5ab645c19cff30b8b
BLAKE2b-256 d4cc256454f834a7601592b62f8ce731ebf8c621115209407ed0bfb8f318dd58

See more details on using hashes here.

File details

Details for the file rscase-1.1.0-cp38-cp38-win_amd64.whl.

File metadata

  • Download URL: rscase-1.1.0-cp38-cp38-win_amd64.whl
  • Upload date:
  • Size: 115.7 kB
  • Tags: CPython 3.8, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.8.1

File hashes

Hashes for rscase-1.1.0-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 b81492e4b6e07067a590cfd3d7ec19ce9f268271149d1753d1c191d7649db78e
MD5 6f9bcb83e277f6b73f37dcab482cdf1c
BLAKE2b-256 d80d4d01d241beae32845f99c16e506a608704f09a0022a6ba88aba78e746945

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page