PRECIS-i18n: Internationalized Usernames and Passwords
Project description
If you want your application to accept unicode user names and passwords, you must be careful in how you validate and compare them. The PRECIS framework makes internationalized user names and passwords safer for use by applications. PRECIS profiles transform unicode strings into a canonical form, suitable for comparison.
This module implements the PRECIS Framework as described in:
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings in Application Protocols (RFC 7564)
Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords (RFC 7613)
Preparation, Enforcement, and Comparison of Internationalized Strings Representing Nicknames (RFC 7700)
Requires Python 3.3 or later.
Usage
Use the get_profile function to obtain a profile object, then use its enforce method. The enforce method returns a Unicode string.
>>> from precis_i18n import get_profile >>> username = get_profile('UsernameCaseMapped') >>> username.enforce('Kevin') 'kevin' >>> username.enforce('\u212Aevin') 'kevin' >>> username.enforce('\uFF2Bevin') 'kevin' >>> username.enforce('\U0001F17Aevin') Traceback (most recent call last): ... UnicodeEncodeError: 'UsernameCaseMapped' codec can't encode character '\U0001f17a' in position 0: DISALLOWED/symbols
Alternatively, you can use the Python str.encode API. Import the precis_i18n.codec module to register the PRECIS codec names. Now you can use the str.encode method with any unicode string. The result will be a UTF-8 encoded byte string or a UnicodeEncodeError if the string is disallowed.
>>> import precis_i18n.codec >>> 'Kevin'.encode('UsernameCasePreserved') b'Kevin' >>> '\u212Aevin'.encode('UsernameCasePreserved') b'Kevin' >>> '\uFF2Bevin'.encode('UsernameCasePreserved') b'Kevin' >>> '\u212Aevin'.encode('UsernameCaseMapped') b'kevin' >>> '\uFF2Bevin'.encode('OpaqueString') b'\xef\xbc\xabevin' >>> '\U0001F17Aevin'.encode('UsernameCasePreserved') Traceback (most recent call last): ... UnicodeEncodeError: 'UsernameCasePreserved' codec can't encode character '\U0001f17a' in position 0: DISALLOWED/symbols
Supported Profiles and Codecs
Each PRECIS profile has a corresponding codec name. The CaseMapped variant converts the string to lower case for implementing case-insensitive comparison.
UsernameCasePreserved
UsernameCaseMapped
OpaqueString
NicknameCasePreserved
NicknameCaseMapped
The CaseMapped profiles use Unicode Default Case Folding. There are additional codecs that use Unicode ToLower to support draft RFC changes.
UsernameCaseMapped:ToLower
NicknameCaseMapped:ToLower
The PRECIS base string classes are also available:
IdentifierClass
FreeFormClass
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for precis_i18n-0.5.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c2578fae3e0cefa5becf160800546cd26b02a5916acfe7cbedb9139e453fc805 |
|
MD5 | bf38db9cf65947af8ada72d1e4f1af28 |
|
BLAKE2b-256 | 9d79c7f0274767a47edbc03915f39c5e22e7e41bd7d33e1f3d64f0d250a59f45 |