Universal encoding detector. This library is faster than chardet.

Project description

cChardet

cChardet is a high-speed universal character encoding detector. It is a binding to charsetdetect.

Supported codecs

  • Big5
  • EUC-JP
  • EUC-KR
  • GB18030
  • HZ-GB-2312
  • IBM855
  • IBM866
  • ISO-2022-CN
  • ISO-2022-JP
  • ISO-2022-KR
  • ISO-8859-2
  • ISO-8859-5
  • ISO-8859-7
  • ISO-8859-8
  • KOI8-R
  • Shift_JIS
  • TIS-620
  • UTF-8
  • UTF-16BE
  • UTF-16LE
  • UTF-32BE
  • UTF-32LE
  • WINDOWS-1250
  • WINDOWS-1251
  • WINDOWS-1252
  • WINDOWS-1253
  • WINDOWS-1255
  • EUC-TW
  • X-ISO-10646-UCS-4-2143
  • X-ISO-10646-UCS-4-3412
  • x-mac-cyrillic

Requirements

Example

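# -*- coding: utf-8 -*-
import cchardet as chardet

# Read the file as raw bytes; detect() works on bytes, not decoded text.
with open(r"src/tests/testdata/wikipediaJa_One_Thousand_and_One_Nights_SJIS.txt", "rb") as f:
    msg = f.read()

# detect() returns a dict with "encoding" and "confidence" keys.
result = chardet.detect(msg)
print(result)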
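
Once an encoding has been detected, the usual next step is to decode the bytes. The following is a minimal sketch, not part of cChardet itself: `decode_detected` and `read_text` are hypothetical helper names, and the fallback to UTF-8 is an assumption. It also assumes the detected name may be missing or may not map to a Python codec, so both cases are handled.

```python
def decode_detected(data, result, fallback="utf-8"):
    """Decode bytes using a detect()-style result dict.

    `result` is expected to look like {"encoding": ..., "confidence": ...}.
    This helper is illustrative and is not provided by cchardet.
    """
    # Use the fallback when no encoding was detected.
    encoding = result.get("encoding") or fallback
    try:
        return data.decode(encoding)
    except (LookupError, UnicodeDecodeError):
        # Unknown codec name or a wrong guess: decode with the fallback
        # and replace undecodable bytes instead of raising.
        return data.decode(fallback, errors="replace")


def read_text(path):
    """Read a file as text, guessing its encoding with cchardet."""
    import cchardet  # imported lazily so decode_detected works without it

    with open(path, "rb") as f:
        data = f.read()
    return decode_detected(data, cchardet.detect(data))
```

Keeping the decoding step separate from the detection call makes it possible to swap in chardet, or a hand-written result, without changing the decoding logic.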

Benchmark

$ cd src/
$ pip install chardet
$ python tests/bench.py

Results

CPU: Intel(R) Core(TM) i3-4170 CPU @ 3.70GHz

RAM: DDR3 1600 MHz 16 GB

Platform: Ubuntu 16.04 amd64

Python 2.7.12

  Library     Request (call/s)
  chardet     0.26
  cchardet    1408.73

Python 3.5.2

  Library     Request (call/s)
  chardet     0.28
  cchardet    1380.40
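
The script tests/bench.py is not reproduced here. As a rough sketch of how a calls-per-second figure like the ones above could be measured, the hypothetical helper below times repeated calls to any `detect`-style function. The name `calls_per_second` and the `min_seconds` parameter are assumptions for illustration, and the sketch assumes Python 3 for `time.perf_counter`.

```python
import time


def calls_per_second(detect, data, min_seconds=1.0):
    """Call detect(data) repeatedly and return the observed calls per second.

    Illustrative only; this is not the project's bench.py.
    """
    if min_seconds <= 0:
        raise ValueError("min_seconds must be positive")
    calls = 0
    start = time.perf_counter()
    while True:
        detect(data)
        calls += 1
        elapsed = time.perf_counter() - start
        # Stop once the measurement window has been filled.
        if elapsed >= min_seconds:
            break
    return calls / elapsed


# Possible usage, assuming both libraries are installed and `msg` holds bytes:
#   import chardet, cchardet
#   print("chardet ", calls_per_second(chardet.detect, msg))
#   print("cchardet", calls_per_second(cchardet.detect, msg))
```

Measuring over a fixed time window rather than a fixed number of calls keeps the run time bounded even when one library is several orders of magnitude slower than the other.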

License

  • The MIT License: src/cchardet
  • Licenses of other libraries: see the src/ext directory.

Contact

Issues

CHANGES

1.1.0 (2016-10-17)

  • Add Detector class
  • Improve unit tests

Download files

Filename (size)                                      File type  Python version
cchardet-1.1.0-cp27-cp27m-win32.whl (76.8 kB)        Wheel      cp27
cchardet-1.1.0-cp27-cp27m-win_amd64.whl (79.3 kB)    Wheel      cp27
cchardet-1.1.0-cp34-cp34m-win32.whl (77.5 kB)        Wheel      cp34
cchardet-1.1.0-cp34-cp34m-win_amd64.whl (79.6 kB)    Wheel      cp34
cchardet-1.1.0-cp35-cp35m-win32.whl (79.2 kB)        Wheel      cp35
cchardet-1.1.0-cp35-cp35m-win_amd64.whl (82.9 kB)    Wheel      cp35
cchardet-1.1.0.tar.gz (219.3 kB)                     Source     None
