Skip to main content

Internationalized Domain Names in Applications (IDNA)

Project description

A library to support the Internationalised Domain Names in Applications (IDNA) protocol as specified in RFC 5891. This version of the protocol is often referred to as “IDNA2008”.

This library also provides support for Unicode Technical Standard 46, Unicode IDNA Compatibility Processing.

The library is also intended to act as a suitable replacement for the “encodings.idna” module that comes with the Python standard library but currently only supports the older, deprecated IDNA specification (RFC 3490).

Its basic functions are simply executed:

>>> import idna
>>> idna.encode(u'ドメイン.テスト')
'xn--eckwd4c7c.xn--zckzah'
>>> print idna.decode('xn--eckwd4c7c.xn--zckzah')
ドメイン.テスト

Packages

The latest tagged release version is published in the PyPI repository:

https://badge.fury.io/py/idna.svg

Installation

To install this library, you can use PIP:

$ pip install idna

Alternatively, you can install the package using the bundled setup script:

$ python setup.py install

This library should work with Python 2.7, and Python 3.3 or later.

Usage

For typical usage, the encode and decode functions will take a domain name argument and perform a conversion to an A-label or U-label respectively.

>>> import idna
>>> idna.encode(u'ドメイン.テスト')
'xn--eckwd4c7c.xn--zckzah'
>>> print idna.decode('xn--eckwd4c7c.xn--zckzah')
ドメイン.テスト

You may use the codec encoding and decoding methods using the idna.codec module.

>>> import idna.codec
>>> print u'домена.испытание'.encode('idna')
xn--80ahd1agd.xn--80akhbyknj4f
>>> print 'xn--80ahd1agd.xn--80akhbyknj4f'.decode('idna')
домена.испытание

Conversions can be applied at a per-label basis using the ulabel or alabel functions if necessary:

>>> idna.alabel(u'测试')
'xn--0zwm56d'

Compatibility Mapping (UTS #46)

As described in RFC 5895, the IDNA specification no longer including mappings from different forms of input that a user may enter, to the form that is provided to the IDNA functions. This functionality is now considered by the specification to be a local user-interface issue distinct from IDNA conversion functionality.

This library support one user-level mapping, that developed by the Unicode Consortium, known as Unicode IDNA Compatibility Processing. It provides for both regular mapping and transitional mapping.

For example, “Königsgäßchen” is not a permissible label as LATIN CAPITAL LETTER K is not allowed (as are capital letters in general). UTS46 will convert this into lower case.

>>> import idna
>>> idna.encode(u'Königsgäßchen')
...
idna.core.InvalidCodepoint: Codepoint U+004B at position 1 of u'K\xf6nigsg\xe4\xdfchen' not allowed
>>> idna.encode(u'Königsgäßchen', uts46=True)
'xn--knigsgchen-b4a3dun'

Transitional processing provides conversions to help transition from the older 2003 standard to the current standard. For example, in the original IDNA specification, the LATIN SMALL LETTER SHARP S (ß) was converted into two LATIN SMALL LETTER S (ss), whereas in the current IDNA specification this conversion is not performed.

>>> idna.encode(u'Königsgäßchen', uts46=True, transitional=True)
'xn--knigsgsschen-lcb0w'

Implementors should use transitional processing with caution, only in rare cases where conversion from legacy labels to current labels must be performed (i.e. IDNA implementations that pre-date 2008). For typical applications that just need to convert labels, transitional processing is unlikely to be beneficial and could produce unexpected incompatible results.

encodings.idna Compatibility

Function calls from the Python built-in encodings.idna module are mapping to their IDNA 2008 equivalents using the idna.compat module. Simply substitute the import clause in your code to refer to the new module name.

Exceptions

All errors raised during the conversion following the specification should raise an exception derived from the idna.IDNAError base class.

More specific exceptions that may be generated as idna.IDNABidiError when the error reflects an illegal combination of left-to-right and right-to-left characters in a label; idna.InvalidCodepoint when a specific codepoint is an illegal character in an IDN label (i.e. INVALID); and idna.InvalidCodepointContext when the codepoint is illegal based on its positional context (i.e. it is CONTEXTO or CONTEXTJ but the contextual requirements are not satisfied.)

Testing

The library has a test suite based on each rule of the IDNA specification, as well as test that are provided as part of the Unicode Technical Standard 46, Unicode IDNA Compatibility Processing.

The tests are run automatically on each commit to the master branch of the idna git repository at Travis CI:

https://travis-ci.org/kjd/idna.svg?branch=master

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

idna-2.1.tar.gz (129.0 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

idna-2.1-py2.py3-none-any.whl (54.2 kB view details)

Uploaded Python 2Python 3

idna-2.1-py2-none-any.whl (54.2 kB view details)

Uploaded Python 2

File details

Details for the file idna-2.1.tar.gz.

File metadata

  • Download URL: idna-2.1.tar.gz
  • Upload date:
  • Size: 129.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for idna-2.1.tar.gz
Algorithm Hash digest
SHA256 ed36f281aebf3cd0797f163bb165d84c31507cedd15928b095b1675e2d04c676
MD5 f6473caa9c5e0cc1ad3fd5d04c3c114b
BLAKE2b-256 fb848c27516fbaa8147acd2e431086b473c453c428e24e8fb99a1d89ce381851

See more details on using hashes here.

File details

Details for the file idna-2.1-py2.py3-none-any.whl.

File metadata

  • Download URL: idna-2.1-py2.py3-none-any.whl
  • Upload date:
  • Size: 54.2 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for idna-2.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 f28df695e9bede8a19b18a8e4429b4bad4d664e8e98aff27bc39b630f1ae2b42
MD5 3504ea3c28f95c5ce00fa02c07e1dba6
BLAKE2b-256 7102dee75fc3e6f7455bf69221164f94586ee13552c5f33c8002335667a3d122

See more details on using hashes here.

File details

Details for the file idna-2.1-py2-none-any.whl.

File metadata

  • Download URL: idna-2.1-py2-none-any.whl
  • Upload date:
  • Size: 54.2 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for idna-2.1-py2-none-any.whl
Algorithm Hash digest
SHA256 4cfe64db2804351249d8d13ec1f3a2b0be9dc84b409b65c2a646c4d673fa55bc
MD5 d87a2689779f273f3a67c97cdda2352d
BLAKE2b-256 ffee57ca2b52dea2df65a15d4ab2669c89ff29fa227668917cebc43e689360be

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page