Character encoding detection library for Python using ICU and libmagic. Based on the Ruby implementation https://github.com/brianmario/charlock_holmes and the work of https://github.com/xtao/PyCharlockHolmes
Charlock Holmes
Character encoding detection library for Python using [ICU](http://site.icu-project.org/) and libmagic. Inspired by [Charlock Holmes](https://github.com/brianmario/charlock_holmes).
Dependencies

1. icu
2. file (libmagic)

Gentoo

    emerge -av dev-libs/icu
    emerge -av sys-apps/file

Ubuntu

    apt-get install libmagic-dev
    apt-get install libicu-dev

Install

    python setup.py build
    python setup.py install

Usage

    from charlockholmes import detect

    # Read the raw bytes so the detector sees the file exactly as stored.
    with open('test.txt', 'rb') as f:
        content = f.read()

    print(detect(content))
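
Once an encoding has been detected, the usual next step is to decode the raw bytes with it. The following is a minimal sketch, not part of this library: `decode_with_detection` is a hypothetical helper, and it assumes the detection result is a dict-like object with an `encoding` key, as in the Ruby Charlock Holmes. Check the actual return value of `detect` before relying on that shape.

```python
def decode_with_detection(content, detection, fallback="utf-8"):
    """Decode raw bytes using the encoding named in a detection result.

    Assumption: `detection` is a mapping that may contain an 'encoding'
    key (hypothetical shape, modelled on the Ruby Charlock Holmes output).
    """
    # Use the fallback when nothing was detected or the key is missing.
    encoding = (detection or {}).get("encoding") or fallback
    try:
        return content.decode(encoding)
    except (LookupError, UnicodeDecodeError):
        # Unknown codec name or bytes that do not fit the detected encoding:
        # decode with the fallback and replace undecodable bytes.
        return content.decode(fallback, errors="replace")
```

With the library installed, this could be called as `decode_with_detection(content, detect(content))`, assuming the result shape described above holds.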