No project description provided
Reason this release was yanked:
ineffective
Project description
PyAutoCorpus
A python interface to the excellent AutoCorpus library.
Right now, it only supports the wiki markup textify
function, which strips out
markup. From my benchmarks, this ends up being ~40x faster than methods to strip
markup using other libraries:
mwparserfromhell 0.208 sec/doc
wikitextparser 0.215 sec/doc
pyautocorpus 0.005 sec/doc
where:
mwparserfromhell
ismwparserfromhell.parse(x).strip_code()
wikitextparser
iswikitextparser.parse(x).plain_text()
pyautocorpus
ispyautocorpus.Textifier().textify(x)
Installing
From pypi:
pip install pyautocorpus
From source:
Be sure to clone recursively:
git clone --recursive https://github.com/seanmacavaney/pyautocorpus.git
You will first need the pcre
library installed.
python setup.py install
Usage
Example:
import pyautocorpus
textifier = pyautocorpus.Textifier()
textifier.textify("==Wiki Marked up text==\n [[Some Page|link text]] example.")
'Wiki Marked up text\n\n\n link text example.'
Known issues
- Windows is not yet supported
Credits
Contributors to this repository:
- Sean MacAvaney (University of Glasgow)
- Thomas Jänich (University of Glasgow)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pyautocorpus-0.1.3.tar.gz
(11.3 kB
view hashes)
Built Distributions
Close
Hashes for pyautocorpus-0.1.3-pp37-pypy37_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7c8c44a804e43c55e7e84307456881a35c766561e572974b90b48b68454d37e7 |
|
MD5 | b971aaeda48a67bb01feee408f7c4c4f |
|
BLAKE2b-256 | 2d19033a75c5aea4ba3fd3cc4fb3bb34c8050b4c6c0592c0faaed9d2d7b0a927 |
Close
Hashes for pyautocorpus-0.1.3-pp37-pypy37_pp73-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 752ce634423ee3233c60b2b75c1c29acae4886557b52194b94eed87a8f925b60 |
|
MD5 | 0e0f2c32113e080c05439d0b381be8ce |
|
BLAKE2b-256 | 5c2cd7c17c57111e99e8198a664312b6bc030ab787611e7efb9abd13cd00f0fb |
Close
Hashes for pyautocorpus-0.1.3-pp37-pypy37_pp73-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b3ccfc2eeeb6172ffb18cb7c2b47e95b8c4cb4023c96cb6e9249f3284a979963 |
|
MD5 | db06831267c0b7ad18d9ce162d914bc5 |
|
BLAKE2b-256 | 5433dff408a713d37869221dba9544338f73b79a464fa521234274bfa66aef8b |
Close
Hashes for pyautocorpus-0.1.3-pp37-pypy37_pp73-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 27b987ffef7f9737afea95fba828446176d8067fd7745c6cc6ad8b5c27e72a62 |
|
MD5 | 581d7182ecefc54306941ae984914ad7 |
|
BLAKE2b-256 | 6ca8b41bd9619d8e578554eb8b53b0287052aef6f12e0060bbc4656934d3c873 |
Close
Hashes for pyautocorpus-0.1.3-cp39-cp39-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d2cca388aa5124142c62d1fe1f101885d97e585c07f745f25872efa8e4758a2 |
|
MD5 | 4462be1ae0341623e859a2ca67988278 |
|
BLAKE2b-256 | 3857958c8953042cc399abc27bd15d97dd044b3c2921894602395f041bd973e1 |
Close
Hashes for pyautocorpus-0.1.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ac70ec63796495f1bc72a86f9e01ed0366adee14732229ce52d474d47d50a015 |
|
MD5 | 70bf67d29ec50035a9b0858c16d3da8a |
|
BLAKE2b-256 | bff68407838612c01aa330070a7e3035bc232138f8a3e062eb39383ffa7c5eb4 |
Close
Hashes for pyautocorpus-0.1.3-cp39-cp39-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b89b589ad3c591da7e6003d038ee44bc603d63610801478a79084def13004e31 |
|
MD5 | 8e772190eef38836976ea23d2cf3cdbd |
|
BLAKE2b-256 | 455b33b2842ddecc82ee60d94c38f3011c79603b95312b5a85baf271c99e9750 |
Close
Hashes for pyautocorpus-0.1.3-cp39-cp39-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c5e859ad23d7cd0ba9122423c9aeca52014e05a0cb5f02e19bf7193e91fe2733 |
|
MD5 | d83dc465979cd899f5c0dce5b46360fc |
|
BLAKE2b-256 | 92fb622500747fa349912f115e21c3b25af1d7ba936d715f1c20154f3d847ea4 |
Close
Hashes for pyautocorpus-0.1.3-cp39-cp39-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7d312d26373cb1a36a4c0cf1812061c26b0df9c8bdd59f274e5cfdea81c27b39 |
|
MD5 | 2cff81de5dbea1b12d24309c2312b833 |
|
BLAKE2b-256 | b2998477b0555fe0f4f5434e8842db26d2e5d4c9a121ffc32f78460ebe4873cb |
Close
Hashes for pyautocorpus-0.1.3-cp39-cp39-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 55b12068a93b91c943312ea24ca246d76f4de89cf28b49ff1e3ec07f7ad45220 |
|
MD5 | 04e36b7c8101c3cea1e641cea9f4b620 |
|
BLAKE2b-256 | 9cf69eb36f9768ee8754e2b1a521ab7d01a9707cfc21b2af43e5893166312b00 |
Close
Hashes for pyautocorpus-0.1.3-cp38-cp38-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2345b1c7cc8556639c60a46fce1fb4359d1073b017aa04a31955f8e8b4bcf663 |
|
MD5 | 5f68624048626b96a0ac7b502ddbb545 |
|
BLAKE2b-256 | f1ea166706b4c4b7077c4a2fad9c4728a74a7d5ab6b3017e8741449b33216403 |
Close
Hashes for pyautocorpus-0.1.3-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 527eff83967fb18660ae3c40ddfc1219043d4f03598d22f3aaeea84b646bba67 |
|
MD5 | 7fc3287dc8a3f66489168b9466e79bc8 |
|
BLAKE2b-256 | b8354928cde6017b0b9ec405b2cebd1773e0ad65b6fb769c2abf0c31b065775b |
Close
Hashes for pyautocorpus-0.1.3-cp38-cp38-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 86efb753d618818253b80485785a0b1ff408bdce0e1a6af45042cc920d16d69e |
|
MD5 | 9467a3e62a48e7353e4d17d1fb651238 |
|
BLAKE2b-256 | 11b940128dab47da68be606053919cf06ab9fe0ffe6b95649dea84e114561276 |
Close
Hashes for pyautocorpus-0.1.3-cp38-cp38-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 547496c862c19ecaae056adb2ccdd68f231d98f2c1e10dd0cd1a74c1e74d5e01 |
|
MD5 | 8116cdad6ced54ebfe575fd1faa156a5 |
|
BLAKE2b-256 | 8d637672907364e0054a33730bf7a4bd168b1d145d4cf8213d729c3d2e98e2ef |
Close
Hashes for pyautocorpus-0.1.3-cp38-cp38-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a6a14d183da81870e9bb8929d65db9cc053e46a7c2d3e01a053376b355b446a3 |
|
MD5 | 54763d3a5fe42e5ed04c40724b75f391 |
|
BLAKE2b-256 | 916b52c92640024648788c2405b9dd24f7e229e239776096aefde6f9b15cbf31 |
Close
Hashes for pyautocorpus-0.1.3-cp38-cp38-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 499fb36676b5e34794069e2665574a272a2561a57b0437a9202e9aacba133b02 |
|
MD5 | f72ec4a09ffbd7cf7340e327c3006620 |
|
BLAKE2b-256 | c6bd6a98c612a4543ee0033b0ce7fb8f7f788147d836dc7a1798bceeb94bc683 |
Close
Hashes for pyautocorpus-0.1.3-cp37-cp37m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8dba2911e579f3ff1f08f6b60855b08d0b0055c93c4aec1e0056124b1c00c82b |
|
MD5 | b967a108716c5ea1d04c2500eedcd4b2 |
|
BLAKE2b-256 | f8b2f086a78e597869d90adf7afba48a7fc1259033096cd8f9118c4d090668b7 |
Close
Hashes for pyautocorpus-0.1.3-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 985df6355657663fb945897e8cb67011e983bae5c66d0ce96baab1ce1a85ec71 |
|
MD5 | 9ef7a72a219c2b580b5fada0016a7d9a |
|
BLAKE2b-256 | aa3f7b20b030e21b643c77206709c8948929465bbde6bdabcf33c8511eb55221 |
Close
Hashes for pyautocorpus-0.1.3-cp37-cp37m-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6bf7490c534b14e393c780eba85a8bf04beebf8d4f5ae7a804481ea51e3cf4a6 |
|
MD5 | c4a83efe4048c2523cf23309c0dbe121 |
|
BLAKE2b-256 | 7d96dd7f4bb56ada2524faf19761213c7eab20aca407e29ba94bd57584fc1a41 |
Close
Hashes for pyautocorpus-0.1.3-cp37-cp37m-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | aa977aa29eaaf19f291442b2182e6eec8e484e39673d520b256ddc556ef4618b |
|
MD5 | 751419e2a2a3a8ee0802f327281ac9bc |
|
BLAKE2b-256 | 33e98f5861104bf724636bbb0cf9787e1611f217252d5339a523a1f81b9f6ce6 |
Close
Hashes for pyautocorpus-0.1.3-cp37-cp37m-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e00bef15c40b3909cc028a0d88fb57543c29c046b4d49c28e737c6d15bafe92f |
|
MD5 | f72be1fe34149d387dcfb046f1e15a22 |
|
BLAKE2b-256 | 32327c0f01b19b003a2c9710703582b63984d0774052f164e703ede6fa0b70a9 |
Close
Hashes for pyautocorpus-0.1.3-cp37-cp37m-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0f890f4da29889a7f56abf48d15ffb146c13fb36acba1006391eaf75266594ff |
|
MD5 | e04e3719cfbbf6a43a3566dd49734ab6 |
|
BLAKE2b-256 | c24cf8c12a9da65698fa2e6b6d969d68b46eb51c3c27f3f280d256a3f4f26f19 |
Close
Hashes for pyautocorpus-0.1.3-cp36-cp36m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f1d85b73e552f7b6bfbb4072f1fc808904e7c31ed50494d0ddd9563caaacf649 |
|
MD5 | 024ad12eb490f1e77de0d4fff0e9f73f |
|
BLAKE2b-256 | 95b56c78e00d73bc0ddb2e103780b3156e284973f38928d2c069bd78871c6d38 |
Close
Hashes for pyautocorpus-0.1.3-cp36-cp36m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c25f67ebc942d33aa06ab86198c6a0556b99da2bced8b3af97aca7b4a3185c97 |
|
MD5 | 30bbe96024712a64ea4255cf60b4fd4c |
|
BLAKE2b-256 | 87d0c1dc947963f0113dfd9454951cbbaf13b6741b6e62cde03ef5e9a7520332 |
Close
Hashes for pyautocorpus-0.1.3-cp36-cp36m-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fcb4b9e5e46038f1470c1f263d7aee2b69e3a4382e93c17f2359980e2f631b70 |
|
MD5 | 91acd03c73bd02fedcf62e56daa0b75a |
|
BLAKE2b-256 | b69b11be8203dc7e251c2f739ba1533f7de8c15cf11f15039064a8af16406b54 |
Close
Hashes for pyautocorpus-0.1.3-cp36-cp36m-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 24f3eb05a3c03f22acec5d0098720a3946f624c7daac19aa99cc0165b780da5d |
|
MD5 | c1c5d3b39792ca9605eec88bcc810a05 |
|
BLAKE2b-256 | 548ebcdd2a2d14fdb24959a7a52b450f3fa14e9c820fe03902db5ad43c494ada |
Close
Hashes for pyautocorpus-0.1.3-cp36-cp36m-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9d5cb30f9fc4a5d2aa4edb655665ca022e9415acbbc1dd46ee2b4a456cfabd70 |
|
MD5 | ec19ed7380b4d03ef5ce27347fae8330 |
|
BLAKE2b-256 | 65d55a7878c2eb258ce294da7a011d0da494592d0ef2a2c8b51938962a67ecdf |
Close
Hashes for pyautocorpus-0.1.3-cp36-cp36m-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ad3f47e4fcd6667089cf8cb5a573979efb8f59826bb90ec7e6ca927d6fa7cfeb |
|
MD5 | 8059581f23e14a1b0794024aac83d9aa |
|
BLAKE2b-256 | 506b8aa34d54ff910d6f2540f65ec2282c8285b9aeaaf4337e4f106125bbb8cf |