Universal encoding detector for Python 2 and 3

Project description

Chardet: The Universal Character Encoding Detector

Detects
  • ASCII, UTF-8, UTF-16 (2 variants), UTF-32 (4 variants)

  • Big5, GB2312, EUC-TW, HZ-GB-2312, ISO-2022-CN (Traditional and Simplified Chinese)

  • EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP (Japanese)

  • EUC-KR, ISO-2022-KR (Korean)

  • KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, windows-1251 (Cyrillic)

  • ISO-8859-5, windows-1251 (Bulgarian)

  • ISO-8859-1, windows-1252 (Western European languages)

  • ISO-8859-7, windows-1253 (Greek)

  • ISO-8859-8, windows-1255 (Visual and Logical Hebrew)

  • TIS-620 (Thai)
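
From Python, detection is typically a single call. The sketch below assumes chardet is installed and uses `chardet.detect`, which accepts a bytes object and returns a dictionary containing `encoding` and `confidence`. The wrapper name `guess_encoding` and its `detect` parameter are illustrative and not part of chardet.

```python
try:
    import chardet  # assumed installed via `pip install chardet`
except ImportError:
    chardet = None  # lets the sketch load even where chardet is missing


def guess_encoding(data, detect=None):
    """Return (encoding, confidence) for a bytes object.

    `detect` is a hypothetical hook: any callable that returns a dict with
    'encoding' and 'confidence' keys. It defaults to chardet.detect.
    """
    if detect is None:
        if chardet is None:
            raise RuntimeError("chardet is not installed")
        detect = chardet.detect
    result = detect(data)
    return result["encoding"], result["confidence"]


if chardet is not None:
    # Short inputs give the detector little evidence, so treat the
    # confidence value as a hint rather than a guarantee.
    sample = u"Привет, мир! Это пример текста.".encode("windows-1251")
    print(guess_encoding(sample))
```

The input must be bytes, not already-decoded text, since the point is to guess how those bytes were encoded.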

Requires Python 2.6 or later

Installation

Install from PyPI:

pip install chardet

Documentation

User documentation is available at http://chardet.readthedocs.org.

Command-line Tool

chardet comes with a command-line script which reports on the encodings of one or more files:

% chardetect somefile someotherfile
somefile: windows-1252 with confidence 0.5
someotherfile: ascii with confidence 1.0
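
A similar report can be produced from Python. The following is a rough sketch, not the script's actual source: it assumes a detector object with the `reset`, `feed`, `close`, `done` and `result` members offered by `chardet.universaldetector.UniversalDetector`, and the function name `describe` is hypothetical.

```python
def describe(name, lines, detector):
    """Feed byte chunks to a detector and format its verdict.

    `lines` is any iterable of bytes, such as a file opened in "rb" mode.
    `detector` is expected to behave like chardet's UniversalDetector.
    """
    detector.reset()
    for line in lines:
        detector.feed(line)
        if detector.done:  # the detector is already certain, stop early
            break
    detector.close()  # finalizes detector.result
    result = detector.result
    return "{0}: {1} with confidence {2}".format(
        name, result["encoding"], result["confidence"]
    )


# Usage, assuming chardet is installed:
#   from chardet.universaldetector import UniversalDetector
#   with open("somefile", "rb") as handle:
#       print(describe("somefile", handle, UniversalDetector()))
```

Feeding the file incrementally avoids loading it into memory at once and lets detection stop as soon as the detector reports that it is done.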

About

This is a continuation of Mark Pilgrim’s excellent chardet. Previously, two versions had to be maintained: one that supported Python 2.x and one that supported Python 3.x. We’ve recently merged with Ian Cordasco’s charade fork, so there is now one coherent version that works for Python 2.6+.

Maintainer: Dan Blanchard

Download files

Source Distribution

chardet-3.0.2.tar.gz (1.9 MB)

Built Distribution

chardet-3.0.2-py2.py3-none-any.whl (133.3 kB, Python 2 and Python 3)

File details

Details for the file chardet-3.0.2.tar.gz.

File metadata

  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for chardet-3.0.2.tar.gz
Algorithm Hash digest
SHA256 4f7832e7c583348a9eddd927ee8514b3bf717c061f57b21dbe7697211454d9bb
MD5 96e364abdbde20b5f6dbbe2ad9d54d04
BLAKE2b-256 910528f23094cdf1410fb03533f0d71e6b4aad3c504100e83b8cea6fc899552c
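
One way to check a downloaded file against the published SHA256 digest is sketched below using only the standard library. The helper names `sha256_hex` and `file_matches` are hypothetical, and the usage comment assumes the source distribution was saved to the current directory.

```python
import hashlib
import io


def sha256_hex(stream, chunk_size=io.DEFAULT_BUFFER_SIZE):
    """Hash a binary stream in chunks so large files are not read at once."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def file_matches(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    with open(path, "rb") as handle:
        return sha256_hex(handle) == expected_hex.strip().lower()


# Usage, assuming the sdist is in the current directory:
#   file_matches(
#       "chardet-3.0.2.tar.gz",
#       "4f7832e7c583348a9eddd927ee8514b3bf717c061f57b21dbe7697211454d9bb",
#   )
```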

File details

Details for the file chardet-3.0.2-py2.py3-none-any.whl.

File hashes

Hashes for chardet-3.0.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 6ebf56457934fdce01fb5ada5582762a84eed94cad43ed877964aebbdd8174c0
MD5 f18688841820b46ecc430aabdc274cde
BLAKE2b-256 b87454fdc2fcfdd46b6c486964b64c5bb7db9a3664033ab25cf11aab06dd2a5d
