Skip to main content

Universal encoding detector for Python 2 and 3

Project description

Chardet: The Universal Character Encoding Detector

Build status https://img.shields.io/coveralls/chardet/chardet/stable.svg PyPI downloads Latest version on PyPI License
Detects
  • ASCII, UTF-8, UTF-16 (2 variants), UTF-32 (4 variants)

  • Big5, GB2312, EUC-TW, HZ-GB-2312, ISO-2022-CN (Traditional and Simplified Chinese)

  • EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP (Japanese)

  • EUC-KR, ISO-2022-KR (Korean)

  • KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, windows-1251 (Cyrillic)

  • ISO-8859-5, windows-1251 (Bulgarian)

  • ISO-8859-1, windows-1252 (Western European languages)

  • ISO-8859-7, windows-1253 (Greek)

  • ISO-8859-8, windows-1255 (Visual and Logical Hebrew)

  • TIS-620 (Thai)

Requires Python 2.6 or later

Installation

Install from PyPI:

pip install chardet

Documentation

For users, docs are now available at http://chardet.readthedocs.org.

Command-line Tool

chardet comes with a command-line script which reports on the encodings of one or more files:

% chardetect somefile someotherfile
somefile: windows-1252 with confidence 0.5
someotherfile: ascii with confidence 1.0

About

This is a continuation of Mark Pilgrim’s excellent chardet. Previously, two versions needed to be maintained: one that supported python 2.x and one that supported python 3.x. We’ve recently merged with Ian Cordasco’s charade fork, so now we have one coherent version that works for Python 2.6+.

maintainer:

Dan Blanchard

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chardet-3.0.1.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

chardet-3.0.1-py2.py3-none-any.whl (133.2 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file chardet-3.0.1.tar.gz.

File metadata

  • Download URL: chardet-3.0.1.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for chardet-3.0.1.tar.gz
Algorithm Hash digest
SHA256 0d9688b015b3493f2f486214a2181b8f29fbe21c4034711bd9140a1d3467808d
MD5 bf3ea9df23c79d134aea8ef65745da06
BLAKE2b-256 42d1c7e0023643df3c53ff72513f53f28bc1e948cb18a56f8a20aee289537ee9

See more details on using hashes here.

File details

Details for the file chardet-3.0.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for chardet-3.0.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 f986f11a01ab75cbf3b364deb41f2bb0943053adc1c5c257245e8f484d59cbba
MD5 2ecd2aac7eebe48484afb917d2bdcce3
BLAKE2b-256 dee53226e65c6f4291bc3f7ec0a9329ed9949bf074d8663a037c58c78d275745

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page