Quickly convert strings to number types.
Project description
Convert strings to numbers quickly.
Source Code: https://github.com/SethMMorton/fastnumbers
Downloads: https://pypi.python.org/pypi/fastnumbers
Documentation: http://pythonhosted.org//fastnumbers/
This module is a Python C extension that will convert strings to numbers much faster than can be done using pure Python; numeric types can also be converted to other numeric types.
Additionally, the user has control over what happens in the event that the input string cannot be converted to a number:
the input can be returned as-is (this is the default behavior)
a ValueError can be raised (like the built-in float or int)
a default value can be returned
Examples
fastnumbers contains functions that are fast C implementations similar to the following Pure Python function:
def fast_float(input, default=None, raise_on_invalid=False, inf=None, nan=None):
import math
try:
x = float(input)
except ValueError:
if raise_on_invalid:
raise
return default if default is not None else input
else:
if inf is not None and math.isinf(x):
return inf
elif nan is not None and math.isnan(x):
return nan
else:
return x
In addition to fast_float, there are also fast_real, fast_int, fast_forceint, isreal, isfloat, isint, and isintlike - please see the API Documentation for full details.
Some example usage:
>>> from fastnumbers import fast_float
>>> # Convert string to a float
>>> fast_float('56.07')
56.07
>>> # Unconvertable string returned as-is by default
>>> fast_float('bad input')
'bad input'
>>> # Unconvertable strings can trigger a default value
>>> fast_float('bad input', default=0)
0
>>> # 'default' is also the first optional positional arg
>>> fast_float('bad input', 0)
0
>>> # Integers are converted to floats
>>> fast_float(54)
54.0
>>> # One can ask inf or nan to be substituted with another value
>>> fast_float('nan')
nan
>>> fast_float('nan', nan=0.0)
0.0
>>> fast_float(float('nan'), nan=0.0)
0.0
>>> fast_float('56.07', nan=0.0)
56.07
>>> # The default built-in float behavior can be triggered with
>>> # "raise_on_invalid" set to True.
>>> fast_float('bad input', raise_on_invalid=True) #doctest: +IGNORE_EXCEPTION_DETAIL
Traceback (most recent call last):
...
ValueError: invalid literal for float(): bad input
>>> # Single unicode characters can be converted.
>>> fast_float(u'\u2164') # Roman numeral 5 (V)
5.0
>>> fast_float(u'\u2466') # 7 enclosed in a circle
7.0
NOTE: If you need locale-dependent conversions, supply the fastnumbers function of your choice to locale.atof.
import locale
locale.setlocale(locale.LC_ALL, 'de_DE.UTF-8')
print(atof('468,5', func=fast_float)) # Prints 468.5
Timing
Just how much faster is fastnumbers than a pure python implementation? Below are the timing results for the *_float functions; please see the Timing Documentation for details into all timing results.
from timeit import timeit
float_try = '''\
def float_try(input):
"""Typical approach to this problem."""
try:
return float(input)
except ValueError:
return input
'''
float_re = '''\
import re
float_match = re.compile(r'[-+]?\d*\.?\d+(?:[eE][-+]?\d+)?$').match
def float_re(input):
"""Alternate approach to this problem."""
try:
if float_match(input):
return float(input)
else:
return input
except TypeError:
return float(input)
'''
print('Invalid input:')
print("Try:", timeit('float_try("invalid")', float_try))
print("re:", timeit('float_re("invalid")', float_re))
print("fast", timeit('fast_float("invalid")', 'from fastnumbers import fast_float'))
print()
print('Valid input:')
print("try:", timeit('float_try("56.07")', float_try))
print("re:", timeit('float_re("56.07")', float_re))
print("fast", timeit('fast_float("56.07")', 'from fastnumbers import fast_float'))
The results will be similar to the below, by vary on the system you are on:
Invalid input: Try: 2.27156710625 re: 0.570491075516 fast 0.173984050751 Valid input: try: 0.378665924072 re: 1.08740401268 fast 0.204708099365
As you can see, in all cases fastnumbers beats the pure python implementations.
History
These are the last three entries of the changelog. See the package documentation for the complete changelog.
01-18-2016 v. 0.7.0
Broke all functions into smaller components, eliminating a lot of duplication.
Sped up functions by eliminating an unnecessary string copy.
Improved documentation.
11-01-2015 v. 0.6.2
Fixed bug that caused a SystemError exception to be raised on Python 3.5 if a very large int was passed to the “fast” functions.
10-29-2015 v. 0.6.1
Fixed segfault on Python 3.5 that seemed to be related to a change in the PyObject_CallMethod C function.
Sped up unit testing.
Added tox.ini.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
File details
Details for the file fastnumbers-0.7.0.zip
.
File metadata
- Download URL: fastnumbers-0.7.0.zip
- Upload date:
- Size: 62.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c430fd6057fefb68257fde85a4dc2dc58624173bd880f19745d63e90e7594d93 |
|
MD5 | 09e9c7fae0c538e8c2fafb68cd996ac9 |
|
BLAKE2b-256 | 703b67e748a520857c2e5cf55dfd7788e154f25d1252abda2d89cee0bb290bcc |
File details
Details for the file fastnumbers-0.7.0.tar.gz
.
File metadata
- Download URL: fastnumbers-0.7.0.tar.gz
- Upload date:
- Size: 45.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6dbda719d33f4dc14ae35410e1e73444e9f868761b58b34f9259b7b744fceacc |
|
MD5 | 758a796ca1e55a4af84e9f3e1b0e05fd |
|
BLAKE2b-256 | b8f230e56b5268e4cd9d21dbc2497bc15b3a0e2ec2d1ca11d317f8d2e96465b3 |