No project description provided
Reason this release was yanked:
ineffective
Project description
PyAutoCorpus
A python interface to the excellent AutoCorpus library.
Right now, it only supports the wiki markup textify
function, which strips out
markup. From my benchmarks, this ends up being ~40x faster than methods to strip
markup using other libraries:
mwparserfromhell 0.208 sec/doc
wikitextparser 0.215 sec/doc
pyautocorpus 0.005 sec/doc
where:
mwparserfromhell
ismwparserfromhell.parse(x).strip_code()
wikitextparser
iswikitextparser.parse(x).plain_text()
pyautocorpus
ispyautocorpus.Textifier().textify(x)
Installing
From pypi:
pip install pyautocorpus
From source:
Be sure to clone recursively:
git clone --recursive https://github.com/seanmacavaney/pyautocorpus.git
You will first need the pcre
library installed.
python setup.py install
Usage
Example:
import pyautocorpus
textifier = pyautocorpus.Textifier()
textifier.textify("==Wiki Marked up text==\n [[Some Page|link text]] example.")
'Wiki Marked up text\n\n\n link text example.'
Known issues
- Windows is not yet supported
Credits
Contributors to this repository:
- Sean MacAvaney (University of Glasgow)
- Thomas Jänich (University of Glasgow)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pyautocorpus-0.1.5.tar.gz
(11.4 kB
view hashes)
Built Distributions
Close
Hashes for pyautocorpus-0.1.5-pp37-pypy37_pp73-manylinux_2_18_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | dad198ba2170604d2051a63ef5139d9c42f80e77752cca9d3bc611cde8c78b65 |
|
MD5 | 92fa44d5bfc6ad0adb1438e516c2248d |
|
BLAKE2b-256 | 8139faae879142d659914751674e7ba699e9661ba789f016a53b70ade9d447a3 |
Close
Hashes for pyautocorpus-0.1.5-pp37-pypy37_pp73-manylinux_2_18_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b38cabd6645fb258148732ad4a4d0fe18d18d10daa24e7abda2030417b585381 |
|
MD5 | 7c67ad27ae4877ef2307f4aebf1b4714 |
|
BLAKE2b-256 | be251b634fd1303d5ef07f0accf8b2d815357e46c0a144206830c6fd25409d28 |
Close
Hashes for pyautocorpus-0.1.5-pp37-pypy37_pp73-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 14f8a2f53c92bb34f2bd55e956277c0ab0a38417ae5f18da32075e5b87df16c0 |
|
MD5 | 6951c49a37be5514864a573d76a2960e |
|
BLAKE2b-256 | c6412b6fc84529df7959d00be2f71bb150303cd74b0fe2590b9fcd987b8cb474 |
Close
Hashes for pyautocorpus-0.1.5-pp37-pypy37_pp73-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | be54c809fe447b6925f2ccb84f8d8b837bfc2ec512fe9e0325f74f98e49137ac |
|
MD5 | 5357413c0839f6a05078b5f4bf409564 |
|
BLAKE2b-256 | 27b00470e36661b80df524cfaa0a9adb1808a80fa200de9b96336b772bc0445a |
Close
Hashes for pyautocorpus-0.1.5-cp39-cp39-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3cf6e81e55f0ccda20f538463cf239c32ccb23a4fbf6f77a1e71f017d81e613d |
|
MD5 | 358ef89acba12d50a41467f58ff4fbd2 |
|
BLAKE2b-256 | cd79a17201e5d01e74ab3f42eae7e656ab497ca634ba368b04847a07ffb63870 |
Close
Hashes for pyautocorpus-0.1.5-cp39-cp39-manylinux_2_18_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 28aec6d186ad50542edf54043047b329b8b5e6c4f63fcb839360439ea6d3b54a |
|
MD5 | 6396fd4f28766ada3bde2a8c89778b7a |
|
BLAKE2b-256 | 712c90ebb802c893c03d3685909217d1eea168732038bbc3a3025ac8e0b957d7 |
Close
Hashes for pyautocorpus-0.1.5-cp39-cp39-manylinux_2_18_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0aeec06be94f24e7b2ee829d729fbdbaaaaaed4f3160ea0649b9d5132299bb1a |
|
MD5 | e735bce44d8e17a498a18bfec87233bb |
|
BLAKE2b-256 | 0ea6d458017b8332a85c83be2e019528fe99ce6beca30bb770df2de3b7a237e9 |
Close
Hashes for pyautocorpus-0.1.5-cp39-cp39-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 54388ec8a58c78f408bc905caca26ad2c0998696b42b1b9b8347e5051ace1b4e |
|
MD5 | 80edfab60110a0d8c2ae1bd09ed22b2b |
|
BLAKE2b-256 | 88d53b622db580886de2414e0afcb19774196396a101367d902443f9cc62e203 |
Close
Hashes for pyautocorpus-0.1.5-cp39-cp39-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fcb249be53501ec88fd3c026ce79349613a4f27088a6d5617ff0998052e8a3b7 |
|
MD5 | b5394b3f171e9e87927299390ee4b303 |
|
BLAKE2b-256 | 9142d12ca08f280b28f2903963f78a5abcd9d5561b5744a209960702d68e6691 |
Close
Hashes for pyautocorpus-0.1.5-cp39-cp39-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2dbb79ab8b8dc238c37159da51bde04541d31bf36b56cf74a64d5218558dcb75 |
|
MD5 | fb7e3bec6bb971f1980912544b85d5a2 |
|
BLAKE2b-256 | 617d93da79c09f42a6a2be79d3390d18191c619629c687efd0c3682284a23852 |
Close
Hashes for pyautocorpus-0.1.5-cp38-cp38-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | afa873c97bd431dcb9164dce89d90802dabb2859a6434d48eb87057e23dc51ab |
|
MD5 | 6d701ad42b398aa49f884dd97cb786d2 |
|
BLAKE2b-256 | 44530b4b71bcd9381b79923c1d9892443241d9e127974484bc8394bbe5fefec5 |
Close
Hashes for pyautocorpus-0.1.5-cp38-cp38-manylinux_2_18_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | acaeb6dc3e2fa7bae323a491a11b1bf57ed4bf6cbd430c27d6b41e52f1c389af |
|
MD5 | 9d923bc17f61a2024da7a6901373fade |
|
BLAKE2b-256 | 06cfc35a7fc597d51c830e9ee44c301866369c782c77dfb07aca97f7d3ad2f20 |
Close
Hashes for pyautocorpus-0.1.5-cp38-cp38-manylinux_2_18_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b668126147c1a0657dae902476fdcc46babc5971b922d00cbbdcb899ef321ab1 |
|
MD5 | feef45a6dca00080c5132406b08d0f0b |
|
BLAKE2b-256 | bd6f8e187aff04ec1a015b9cb1b73386c04d616495a3d5e000e66affdf339758 |
Close
Hashes for pyautocorpus-0.1.5-cp38-cp38-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3ca9828ca3ec2babd540b5c9edb2da490929076c24d4c22b18a0689576e8c3ac |
|
MD5 | 420b176f2a64717857668e488afe268c |
|
BLAKE2b-256 | 3854c7eeed8bf1da4df4513a935c5503ada066b58cd50e72bbd858e31365b8af |
Close
Hashes for pyautocorpus-0.1.5-cp38-cp38-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0be33a994313a88308c44432ec1f86fa36bb672b81b2ee9ee9566d33aa9d4dba |
|
MD5 | 2b98a43f64a387268d14f4d76cbf8893 |
|
BLAKE2b-256 | ebd426c5ba8f3c18fdb3ae9397f6b9139a36efb689fcd1166badb076ab2f5218 |
Close
Hashes for pyautocorpus-0.1.5-cp38-cp38-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4c080ca1edc9e5f0e31324bbe7b89cafa25f6c7236adfbccd35eefc0a9eab8c1 |
|
MD5 | 57d186b86221f6939a2581388c8907d2 |
|
BLAKE2b-256 | 1f764bead37ec0ca64e8dd543c68d5afaa8053fae13e5927b2e81f77e6bc348d |
Close
Hashes for pyautocorpus-0.1.5-cp37-cp37m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6b29105d3d4496798a7371a2c284f6da65a332e20d8abc6611a5547fe143b5a0 |
|
MD5 | 05c4218c9f314b3cc321827cc05a17db |
|
BLAKE2b-256 | a201cee634cf1931071c2f6b3bd6060ae4673c7ab6b682ba10829c71e55a3c81 |
Close
Hashes for pyautocorpus-0.1.5-cp37-cp37m-manylinux_2_18_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ef6b7e2f38eecbd6ce614baa21f88a55f99b3d4ceb5b36cf4c0b88fa309dda35 |
|
MD5 | 364d7ec1d1cd35da0aba7c716b2cb767 |
|
BLAKE2b-256 | 9e74f6b2054927d04c9c6b78b5a215a9e62da0245102eb6d944a98327fc57886 |
Close
Hashes for pyautocorpus-0.1.5-cp37-cp37m-manylinux_2_18_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 31789f2820737ad916320ba48eaeac1c383c65eb5ec75b9ff87a0b49306df4ef |
|
MD5 | efa6247d6d4758a9da56d33159ef5b12 |
|
BLAKE2b-256 | 9da3fa7391255b06cb052cb2253ad56395eaac14832759dfbf8626370f5bd2b1 |
Close
Hashes for pyautocorpus-0.1.5-cp37-cp37m-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9c0ce67cba78027002689ea482013825d4d12f2bd02dcce2c3990e39da4628c6 |
|
MD5 | 5c69b1fb11d901bb79114bd62f661dec |
|
BLAKE2b-256 | 78dbddbdf3b1e5785a314e2e8f826b6d8449bfa581092b6cccfdbeb474dacd9b |
Close
Hashes for pyautocorpus-0.1.5-cp37-cp37m-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 774305c52ed39112fa6820779d97987de732aafd8e16f6e75c739782f8ae2969 |
|
MD5 | 4660c766203ecb60b25fd5d97972cb49 |
|
BLAKE2b-256 | f783794f2be963596421177c8bc77d2b9297e4c26a7820b9fd2965d865775542 |
Close
Hashes for pyautocorpus-0.1.5-cp37-cp37m-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a291aa4b82fb84c32f0cb3fbd4644bc34d4f052f5d8c0dd903f296a05e478133 |
|
MD5 | b4bc5c9ea973e137d28b7569cb7660af |
|
BLAKE2b-256 | c37c8c76f97ae0f681a77015f06ba18829a791ad85150ef2abe250d61eb0a427 |
Close
Hashes for pyautocorpus-0.1.5-cp36-cp36m-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8b98b08ccb5751429bb5a1372aa7858622a93061e9b5689fdd499bbc4de7dcfe |
|
MD5 | a9057cd756231f1f0045f5793a9f06bd |
|
BLAKE2b-256 | eeaae5391e2cf78c01c7299694629a0723dd3c99b87f882906cbb7f47274537e |
Close
Hashes for pyautocorpus-0.1.5-cp36-cp36m-manylinux_2_18_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a78b5c546b9525bef1fadf936ba10abbf1aba624911437bc55080e30c302dc3e |
|
MD5 | e0ee24e1283de97923dc95231989760a |
|
BLAKE2b-256 | e990b4e9fb6191abd4ac1bf74931213fe890f79c3597707a9826072bf2c3baf3 |
Close
Hashes for pyautocorpus-0.1.5-cp36-cp36m-manylinux_2_18_i686.manylinux2014_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 682a41555539e907d6546d1121b5cb5b5d4ff195ba759d1da60cf2c3878d8e77 |
|
MD5 | c0f36bb67dc73e266d780e9db1415508 |
|
BLAKE2b-256 | c58ce137290d25308e9fcc7edaf2e67b57af621bf0c4cbc6e8d55b54c8aacb81 |
Close
Hashes for pyautocorpus-0.1.5-cp36-cp36m-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f7b97e4245b1e70d6a3f032b3bb6a6268103492256a180dca59684f78dfea730 |
|
MD5 | 07bdb16ef7f56e5f51f5a57ed9ae042f |
|
BLAKE2b-256 | 1e4a47d336654a099f428186ff0ba94de8ce7f2c962cd137f3a3443ae060880a |
Close
Hashes for pyautocorpus-0.1.5-cp36-cp36m-manylinux_2_12_i686.manylinux2010_i686.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d0ae2696854788881bc0628f90cb7e4613db09a22229e1b73aefb35e0a397e92 |
|
MD5 | 262c2f916391d34325bb83303af59383 |
|
BLAKE2b-256 | c7aabae1c732857e3bb3e150b45f6599616413a25b10a21329d551392df00ecd |
Close
Hashes for pyautocorpus-0.1.5-cp36-cp36m-macosx_10_14_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 578eca18680c8bc0fd0bb32df76d9aad529805bc6365c37cf8ca8ce5cc1816bc |
|
MD5 | dbf2fdab60c4eca08db995b95262312f |
|
BLAKE2b-256 | 87319a26cc27bb6d4e11ef57c857d6db180a6c3527e941e480e5ad5a7efb1040 |