cChardet is high speed universal character encoding detector.
Reason this release was yanked:
Does not import
Project description
cChardet
NOTICE: This is a fork of the original project at https://github.com/PyYoshi/cChardet since the original project is no longer maintained.
To install:
pip install faust-cchardet
cChardet is high speed universal character encoding detector. - binding to uchardet.
Supported Languages/Encodings
International (Unicode)
UTF-8
UTF-16BE / UTF-16LE
UTF-32BE / UTF-32LE / X-ISO-10646-UCS-4-34121 / X-ISO-10646-UCS-4-21431
Arabic
ISO-8859-6
WINDOWS-1256
Bulgarian
ISO-8859-5
WINDOWS-1251
Chinese
ISO-2022-CN
BIG5
EUC-TW
GB18030
HZ-GB-2312
Croatian:
ISO-8859-2
ISO-8859-13
ISO-8859-16
Windows-1250
IBM852
MAC-CENTRALEUROPE
Czech
Windows-1250
ISO-8859-2
IBM852
MAC-CENTRALEUROPE
Danish
ISO-8859-1
ISO-8859-15
WINDOWS-1252
English
ASCII
Esperanto
ISO-8859-3
Estonian
ISO-8859-4
ISO-8859-13
ISO-8859-13
Windows-1252
Windows-1257
Finnish
ISO-8859-1
ISO-8859-4
ISO-8859-9
ISO-8859-13
ISO-8859-15
WINDOWS-1252
French
ISO-8859-1
ISO-8859-15
WINDOWS-1252
German
ISO-8859-1
WINDOWS-1252
Greek
ISO-8859-7
WINDOWS-1253
Hebrew
ISO-8859-8
WINDOWS-1255
Hungarian:
ISO-8859-2
WINDOWS-1250
Irish Gaelic
ISO-8859-1
ISO-8859-9
ISO-8859-15
WINDOWS-1252
Italian
ISO-8859-1
ISO-8859-3
ISO-8859-9
ISO-8859-15
WINDOWS-1252
Japanese
ISO-2022-JP
SHIFT_JIS
EUC-JP
Korean
ISO-2022-KR
EUC-KR / UHC
Lithuanian
ISO-8859-4
ISO-8859-10
ISO-8859-13
Latvian
ISO-8859-4
ISO-8859-10
ISO-8859-13
Maltese
ISO-8859-3
Polish:
ISO-8859-2
ISO-8859-13
ISO-8859-16
Windows-1250
IBM852
MAC-CENTRALEUROPE
Portuguese
ISO-8859-1
ISO-8859-9
ISO-8859-15
WINDOWS-1252
Romanian:
ISO-8859-2
ISO-8859-16
Windows-1250
IBM852
Russian
ISO-8859-5
KOI8-R
WINDOWS-1251
MAC-CYRILLIC
IBM866
IBM855
Slovak
Windows-1250
ISO-8859-2
IBM852
MAC-CENTRALEUROPE
Slovene
ISO-8859-2
ISO-8859-16
Windows-1250
IBM852
M
Example
# -*- coding: utf-8 -*-
import cchardet as chardet
with open(r"src/tests/samples/wikipediaJa_One_Thousand_and_One_Nights_SJIS.txt", "rb") as f:
msg = f.read()
result = chardet.detect(msg)
print(result)
Benchmark
$ cd src/
$ pip install chardet
$ python tests/bench.py
Results
CPU: Intel(R) Core(TM) i7-9700K CPU @ 3.60GHz
RAM: DDR4-3200 64GB
Platform: Ubuntu 20.04 amd64
Python 3.9.0
Request (call/s) |
|
---|---|
chardet v3.0.4 |
0.46 |
cchardet v2.1.7 |
1404.05 |
LICENSE
See COPYING file.
Contact
Platform
Support
Windows i686, x86_64
Linux i686, x86_64
macOS x86_64
Do not Support
CHANGES
2.x.x
2.1.7 (2020-10-27)
support Python 3.9
drop support for Python 3.5
2.1.6 (2020-03-17)
drop support for Python 2.7
support Github Actions
update dev-dependencies
2.1.5 (2019-09-27)
update language models (uchardet)
add iso8859-2 test but disabled it
support Python 3.8
drop support for Python 3.4
2.1.4 (2018-09-27)
disable LTO because become poor performance
2.1.3 (2018-09-26)
support Python 3.7
2.1.2 (2018-09-26)
enable LTO for wheel builds
update Cython
2.1.1 (2017-07-01)
fix that different results with different chuck sizes
fix that assignments to nsSMState in nsCodingStateMachine result in unspecified behavior
include COPYING in package
2.1.0 (2017-05-15)
2.0.1 (2017-04-25)
2.0.0 (2017-04-06)
Improve tests
2.0a4 (2017-04-05)
Update uchardet repo (Fix buffer overflow)
2.0a3 (2017-03-29)
Implement UniversalDetector (like chardet)
2.0a2 (2017-03-28)
Update uchardet repo (Fix memory leak)
2.0a1 (2017-03-28)
Replace uchardet-enhanced to uchardet
Remove Detector class
1.1.3 (2017-02-26)
Support AArch64
1.1.2 (2017-01-08)
Support Python 3.6
1.1.1 (2016-11-05)
Use len() function (9e61cb9e96b138b0d18e5f9e013e144202ae4067)
Remove detect function in _cchardet.pyx (25b581294fc0ae8f686ac9972c8549666766f695)
Support manylinux1 wheel
1.1.0 (2016-10-17)
Add Detector class
Improve unit tests
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Hashes for faust_cchardet-2.1.12rc0-cp311-cp311-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7f07d800a85f3c5b7b2488ded072e1749e804d56f4325042148b66d41045fc48 |
|
MD5 | addec528a425f4f170de4a80a6f09fce |
|
BLAKE2b-256 | d4e71e2770e1e0254f35b97aa9a385199d5af843619366d1504f51f39a337633 |
Hashes for faust_cchardet-2.1.12rc0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6800a9e70cac5783e5cb89c5098ba56463e48fcb6c9402a6bbedc02f7df7039b |
|
MD5 | d787a7e4d7365536fcaa3c45928f6f99 |
|
BLAKE2b-256 | cb4cf3b108ba71fc60d0720d1826e197e868c8755fb3d89275c3a1faddd54042 |
Hashes for faust_cchardet-2.1.12rc0-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 94afbd61f8f1e4eecdeb114edf05e813774355cd50bdaa77f4a43f32b06f6269 |
|
MD5 | 6f182f0375261ebc6d8194e7c62eb09f |
|
BLAKE2b-256 | bc5a417be7d8f2ea2277f37066eae53f002404f0d0cc06fda639204f36f668e4 |
Hashes for faust_cchardet-2.1.12rc0-cp311-cp311-macosx_10_9_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3b474d88708d779d1ca287a31e3bd32f1ec8ec2e2eaa686d2dacc3e9693ff9e0 |
|
MD5 | f038ab77328d1c21705fb200d4a2e071 |
|
BLAKE2b-256 | 946d48aa869a8d021699d923b9c6112e3fb727cd2b75fdebc52456793fd8c120 |
Hashes for faust_cchardet-2.1.12rc0-cp310-cp310-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ab90b6b2444982d66581e560f96884f0b3dbd22c2b6a86c772e70d451eee3b11 |
|
MD5 | 24863659b5b9283a33a980e09074b3c2 |
|
BLAKE2b-256 | 61f4fbf5020bf26b40f9b117133caa02633fcf6b0cc1734e6ba410121eb79425 |
Hashes for faust_cchardet-2.1.12rc0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fd50dd5d12b5386d761dac05637b9408c98c265c2c7ddcb2bbbb4c187d7ab3c1 |
|
MD5 | 937033167d5ed0e7cd8c79e04f812fb3 |
|
BLAKE2b-256 | aa934ca699e402a72181469e0b9eed3e439056d6da89071fd57bace53f765908 |
Hashes for faust_cchardet-2.1.12rc0-cp310-cp310-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9cdd34f7dd1a3e8e2a886ff1916c9c7ee976bee42f866429063dfc3fd0da975d |
|
MD5 | ac12bc89be832b8d0f24ed9155607daf |
|
BLAKE2b-256 | b71869768ce9c0f7a0a2ab4754a2e81aae6c2b421da1b4c5c8833c7653004bf7 |
Hashes for faust_cchardet-2.1.12rc0-cp310-cp310-macosx_10_9_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a811ce40e8ec6a871df1d748c3b2a004fbc0f593ddc65f1f5b2520139121385c |
|
MD5 | 711955454c51ea96e01f79bcb11e9cf3 |
|
BLAKE2b-256 | d19c62a1f9cb6ec1373506e8cf764a13bd0b27be1d701ed0de9e068c0b867486 |
Hashes for faust_cchardet-2.1.12rc0-cp39-cp39-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 68c2d652bd47ea9949287dd354822ce66a1d1fea4c8b7d4297ad78ef7bbeedd9 |
|
MD5 | f0cd4b1786259b4a2d6c0b0a8cd6d6de |
|
BLAKE2b-256 | 5709bcb129dcf5ed33209d9705cb0e9e8f8c370ff48df5591a1b8bf01b5ad4df |
Hashes for faust_cchardet-2.1.12rc0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 07d76ee856ba43e10b9e1b3c6bccd47be899f832a8a38041c2a00072bbcf77eb |
|
MD5 | 6f2458441b4d591d6ebe7fae4abb3772 |
|
BLAKE2b-256 | 5b4397f24d41a23a448fb6cf7a1d550d6bcd590459be2a6d988a0fc34fe73e9e |
Hashes for faust_cchardet-2.1.12rc0-cp39-cp39-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2739f36c1b704df2feb846ed0fbc2d9cc2a207fac856fc36090fa570a30ba99b |
|
MD5 | d1521ec2b3f65598126e980e5dcd4943 |
|
BLAKE2b-256 | 5807f7640ee593f3272cb45a12dcba2abe51953cc69eb6246b450ee32f48ebb1 |
Hashes for faust_cchardet-2.1.12rc0-cp39-cp39-macosx_10_9_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ee2442e880e9fcd666557fe18c4b49188467616e5e4f084cdb98a10ddbfd4cde |
|
MD5 | 80848c13ca2ff2a3cd0b118287594b06 |
|
BLAKE2b-256 | 3a140af9a191441255481df664dc62648d949b831e92f6c50e9c0efbf37e748b |
Hashes for faust_cchardet-2.1.12rc0-cp38-cp38-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5f04b49b7b90d43a93b5a3b75ad8d5a16476ec5ed815ad94c4de60aab15706a7 |
|
MD5 | 95277d4ee6c305667efb452152a9d19a |
|
BLAKE2b-256 | c11eb8ee52a65e2e0403e09839fc03ac371432eab2bb13c562f877a9d0eee4e6 |
Hashes for faust_cchardet-2.1.12rc0-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f1e19bcff9c690f179bad79f24b7cf2f0ae23301a94c24eb7ff8a3e1f6223ffe |
|
MD5 | 9e56c4713289f2207ee162fcb8ee5591 |
|
BLAKE2b-256 | 4650c82ef72cfe2500f54e348304a9f302ad00bbf2203276a2da7efd5c4496a6 |
Hashes for faust_cchardet-2.1.12rc0-cp38-cp38-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c566f93c75d980700af2535df7df50a50a1909bc7407e0623b63a791686ff294 |
|
MD5 | 606035c00772f848d01e2a37519fee0b |
|
BLAKE2b-256 | 530300ab1d04de96e4f3565c57678697351d5a4a909a2815e611a3a88982684a |
Hashes for faust_cchardet-2.1.12rc0-cp38-cp38-macosx_10_9_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8fd89a6c6e9af802458396253060af7610c7e862021574997182e0cc661e0e81 |
|
MD5 | f1edd5b13dc9691e8a0d19a3befc94be |
|
BLAKE2b-256 | 607d5d5d29201a2d857fac3de742f2585276a936528fb99a842447fbf5c0eb1c |
Hashes for faust_cchardet-2.1.12rc0-cp37-cp37m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b421ab9645012802c2c8a3ef25b2631825b0658a98431cae191e8bf7d8321682 |
|
MD5 | d80224fef4ea5076bc2f9a4ee0d438d6 |
|
BLAKE2b-256 | 4d2e9c2c3f7b2e35c48baf711efa760a1943a5f39e1531f6766515d9cc3e2031 |
Hashes for faust_cchardet-2.1.12rc0-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 077920bbe10e2a4bd688401500af272daa02a1f6d6a835c4b94bc6a82e803c2c |
|
MD5 | adeba9fb81d222b4dfecae7a7910b14f |
|
BLAKE2b-256 | 7f18ba43922b22e92207ea4b25c75b46e6959b67f0d0f84e6c29e32021d974ac |
Hashes for faust_cchardet-2.1.12rc0-cp37-cp37m-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 72117dc8cd841dbe25d00afa530ca8bd3df3e66f81cbb2f7525617a5a6bd365e |
|
MD5 | 61edf888cda504b00da6a16028adfbb5 |
|
BLAKE2b-256 | f9a58a23ac5ed82c674ddd03d7ae2536a50e0f2de9ca9ebea12d3a2d77193c11 |
Hashes for faust_cchardet-2.1.12rc0-cp37-cp37m-macosx_10_9_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f6323df4cab16143d59600d2671c9a2c8a498fbddd6f6f6f41135b3e40044f1c |
|
MD5 | cfb15a77f567b2169add56f905ed08c0 |
|
BLAKE2b-256 | 66d2fdba8613b26848da801345e5cb60bc667ad0108ca1d6472064a39ab6981f |
Hashes for faust_cchardet-2.1.12rc0-cp36-cp36m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b82d960061df033649884155d5b20c2c2f405a08f00cffac9e55e4c7a8c03f7f |
|
MD5 | 090a45024861c0f79deed4f412c8f6ae |
|
BLAKE2b-256 | 35a7a0bdc56174fb88bb8408977d0fec86162ba6b4c92cd37b51a8fe167d975a |
Hashes for faust_cchardet-2.1.12rc0-cp36-cp36m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6e8636fd854148e1462e34d9514a6c1893627d9523c0ab3879cff0e7512a174c |
|
MD5 | 0b6ae041a21b6fdc3d577deb40972e06 |
|
BLAKE2b-256 | b2fbc4115dbdee96180f77bdbd54c3ca817c1f90b652ff6a288d9b95633b628d |
Hashes for faust_cchardet-2.1.12rc0-cp36-cp36m-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b1d8939477637c620439c0ae9cd75f512fd607e5545e9854ddf10ac3d8181017 |
|
MD5 | f5c3afccc532dd7d389287ab3f31fff9 |
|
BLAKE2b-256 | 04e519093bdd0d8e2912a0119dd4465fe7d9659ebde6701a686eb6d91c5aa36e |
Hashes for faust_cchardet-2.1.12rc0-cp36-cp36m-macosx_10_9_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a2cfd72311a3f6fe4088c1899517af1836784471d10f3c570bc8bea13574ad53 |
|
MD5 | 1fedaaa48f0d169f083f5b5893c0ff3d |
|
BLAKE2b-256 | b51c72443fa27061a2c71542b3a4fe347c740218a6647cd4a95d39cae6c2b777 |