Skip to main content

Universal encoding detector for Python 2 and 3

Project description

Chardet: The Universal Character Encoding Detector

Build status https://img.shields.io/coveralls/chardet/chardet/stable.svg PyPI downloads Latest version on PyPI License
Detects
  • ASCII, UTF-8, UTF-16 (2 variants), UTF-32 (4 variants)

  • Big5, GB2312, EUC-TW, HZ-GB-2312, ISO-2022-CN (Traditional and Simplified Chinese)

  • EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP (Japanese)

  • EUC-KR, ISO-2022-KR (Korean)

  • KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, windows-1251 (Cyrillic)

  • ISO-8859-5, windows-1251 (Bulgarian)

  • ISO-8859-1, windows-1252 (Western European languages)

  • ISO-8859-7, windows-1253 (Greek)

  • ISO-8859-8, windows-1255 (Visual and Logical Hebrew)

  • TIS-620 (Thai)

Requires Python 2.6 or later

Installation

Install from PyPI:

pip install chardet

Documentation

For users, docs are now available at http://chardet.readthedocs.org.

Command-line Tool

chardet comes with a command-line script which reports on the encodings of one or more files:

% chardetect somefile someotherfile
somefile: windows-1252 with confidence 0.5
someotherfile: ascii with confidence 1.0

About

This is a continuation of Mark Pilgrim’s excellent chardet. Previously, two versions needed to be maintained: one that supported python 2.x and one that supported python 3.x. We’ve recently merged with Ian Cordasco’s charade fork, so now we have one coherent version that works for Python 2.6+.

maintainer:

Dan Blanchard

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chardet-3.0.0.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

chardet-3.0.0-py2.py3-none-any.whl (133.2 kB view details)

Uploaded Python 2Python 3

File details

Details for the file chardet-3.0.0.tar.gz.

File metadata

  • Download URL: chardet-3.0.0.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for chardet-3.0.0.tar.gz
Algorithm Hash digest
SHA256 171dfc754d56c16b82cf77ac3eee1d42db9bc2f26c2c61c6573426d2a108d9e3
MD5 f99fb635c61fe37e14570becaa281c76
BLAKE2b-256 33d943cd46d6b8cf2e393aeb60f338cfe0154cee7dd3de13a1752d3e4814aee2

See more details on using hashes here.

File details

Details for the file chardet-3.0.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for chardet-3.0.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 bedd581d3daea4180b3cb555940dcbc89916e7922b070d2a9a37e660791e90a2
MD5 9dc57a0e5c79d0cc9f45d865c7d72f0d
BLAKE2b-256 638b44ea61e33bd7d38c1416b625c87a98aeb4ee9abdad3bff630c69b6b5ffc8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page