Skip to main content

Character encoding detecting library for Python using ICU and libmagic. Based on Ruby implementation https://github.com/brianmario/charlock_holmes and work of https://github.com/xtao/PyCharlockHolmes

Project description

Charlock Holmes

Character encoding detecting library for Python using [ICU](http://site.icu-project.org/) and libmagic. Inspired by [Charlock Holmes](https://raw.github.com/brianmario/charlock_holmes)

Dependency 1. icu 2. file(libmagic)

Gentoo emerge -av dev-libs/icu emerge -av sys-apps/file

Ubuntu apt-get install libmagic-dev apt-get install libicu-dev

Install

python setup build python setup install

Usage

from charlockholmes import detect file = open(‘test.txt’) content = file.read() print detect(content)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycharlockholmes-0.0.4.tar.gz (11.1 kB view details)

Uploaded Source

File details

Details for the file pycharlockholmes-0.0.4.tar.gz.

File metadata

File hashes

Hashes for pycharlockholmes-0.0.4.tar.gz
Algorithm Hash digest
SHA256 5556552339586a06e5251f568693b8494274f585bc6cb49ecc901bf04e686884
MD5 518f7c8a21af805e1eb41a57c28d37ca
BLAKE2b-256 42f1abde3d7882f33b62ca1508121a1bd35d8978b8c5aa31dff7b39eec9b4291

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page