Universal encoding detector for Python 2 and 3

Project description

Chardet: The Universal Character Encoding Detector

Detects
  • ASCII, UTF-8, UTF-16 (2 variants), UTF-32 (4 variants)

  • Big5, GB2312, EUC-TW, HZ-GB-2312, ISO-2022-CN (Traditional and Simplified Chinese)

  • EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP (Japanese)

  • EUC-KR, ISO-2022-KR (Korean)

  • KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, windows-1251 (Cyrillic)

  • ISO-8859-5, windows-1251 (Bulgarian)

  • ISO-8859-1, windows-1252 (Western European languages)

  • ISO-8859-7, windows-1253 (Greek)

  • ISO-8859-8, windows-1255 (Visual and Logical Hebrew)

  • TIS-620 (Thai)

Requires Python 2.6, 2.7, or 3.3+.

Installation

Install from PyPI:

pip install chardet

Documentation

For users, docs are now available at https://chardet.readthedocs.io/.

Command-line Tool

chardet comes with a command-line script which reports on the encodings of one or more files:

% chardetect somefile someotherfile
somefile: windows-1252 with confidence 0.5
someotherfile: ascii with confidence 1.0
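A similar report can be produced from Python code. The following is a minimal sketch, assuming chardet is installed and relying on `chardet.detect`, which takes raw bytes and returns a dict containing `encoding` and `confidence` keys. The helper names `format_report` and `report_file` are hypothetical and are not part of chardet.

```python
def format_report(name, result):
    """Format a detection result in the style of the chardetect output above.

    `result` is expected to be a dict with "encoding" and "confidence" keys,
    such as the one returned by chardet.detect().
    """
    return "{0}: {1} with confidence {2}".format(
        name, result["encoding"], result["confidence"]
    )


def report_file(path):
    """Detect the encoding of one file and return a single report line."""
    # Imported lazily so format_report() stays usable without chardet installed.
    import chardet

    # Detection works on raw bytes, so open the file in binary mode.
    with open(path, "rb") as handle:
        raw = handle.read()
    return format_report(path, chardet.detect(raw))
```

Calling `report_file("somefile")` reads the whole file into memory, which is fine for small files; for large inputs, an incremental approach may be preferable.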

About

This is a continuation of Mark Pilgrim’s excellent chardet. Previously, two versions needed to be maintained: one that supported Python 2.x and one that supported Python 3.x. We’ve recently merged with Ian Cordasco’s charade fork, so now we have one coherent version that works for Python 2.6+.

Maintainer: Dan Blanchard

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chardet-3.0.4.tar.gz (1.9 MB)

Built Distribution

chardet-3.0.4-py2.py3-none-any.whl (133.4 kB), for Python 2 and Python 3

File details

Details for the file chardet-3.0.4.tar.gz.

File metadata

  • Download URL: chardet-3.0.4.tar.gz
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for chardet-3.0.4.tar.gz

Algorithm    Hash digest
SHA256       84ab92ed1c4d4f16916e05906b6b75a6c0fb5db821cc65e70cbd64a3e2a5eaae
MD5          7dd1ba7f9c77e32351b0a0cfacf4055c
BLAKE2b-256  fcbba5768c230f9ddb03acc9ef3f0d4a3cf93462473795d18e9535498c8f929d
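A downloaded archive can be checked against the SHA256 digest listed above. This is a minimal sketch using only the standard library `hashlib` module; the function names `sha256_of_file` and `matches_expected` are hypothetical, and the local file name in the usage note is an assumption about where the download was saved.

```python
import hashlib

# SHA256 digest published above for chardet-3.0.4.tar.gz.
EXPECTED_SHA256 = "84ab92ed1c4d4f16916e05906b6b75a6c0fb5db821cc65e70cbd64a3e2a5eaae"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`.

    The file is read in chunks so large archives do not need to fit in memory.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Return True if the file's SHA256 digest equals `expected_hex`."""
    # Hex digests are case-insensitive, so normalise before comparing.
    return sha256_of_file(path) == expected_hex.strip().lower()
```

Assuming the archive was saved as `chardet-3.0.4.tar.gz` in the current directory, `matches_expected("chardet-3.0.4.tar.gz", EXPECTED_SHA256)` should return `True` for an intact download.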

File details

Details for the file chardet-3.0.4-py2.py3-none-any.whl.

File hashes

Hashes for chardet-3.0.4-py2.py3-none-any.whl

Algorithm    Hash digest
SHA256       fc323ffcaeaed0e0a02bf4d117757b98aed530d9ed4531e3e15460124c106691
MD5          0004b00caff7bb543a1d0d0bd0185a03
BLAKE2b-256  bca901ffebfb562e4274b6487b4bb1ddec7ca55ec7510b22e4c51f14098443b8
