smc.mw

A MediaWiki-compatible parser for Python.

Using

To run the tests:

$ make -C tests

The test results can be found in tests/out/report.html.

A command-line tool is also available (installed as “mw”):

$ echo "''Hello World''" | python smc/mw/tool.py
<html><body><p><i>Hello World</i>
</p></body></html>
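
The same pipeline can be driven from Python by spawning the tool as a subprocess. The following is only a minimal sketch, assuming the tool reads wikitext on standard input and writes HTML to standard output, as in the command above. The names render_wikitext, TOOL_PATH, and the runner parameter are hypothetical and introduced here for illustration; they are not part of smc.mw.

```python
import subprocess
import sys

# Path taken from the command-line example above; adjust it to your checkout.
TOOL_PATH = "smc/mw/tool.py"


def render_wikitext(wikitext, tool_path=TOOL_PATH, runner=subprocess.run):
    """Pipe wikitext into the command-line tool and return what it prints.

    Hypothetical helper, not part of smc.mw. The runner argument is
    injectable so the function can be exercised without the tool installed.
    """
    completed = runner(
        [sys.executable, tool_path],
        input=wikitext,        # sent to the tool's standard input
        capture_output=True,   # collect stdout and stderr
        text=True,             # exchange str instead of bytes
        check=True,            # raise CalledProcessError on non-zero exit
    )
    return completed.stdout
```

Calling render_wikitext("''Hello World''\n") from the root of a source checkout should correspond to the shell command shown above. Passing a different runner keeps the helper testable in isolation.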

Differences

For specific differences, see the test results.

  • __TOC__ and other magic words must appear on a line of their own, whereas MediaWiki allows them everywhere, with some strange consequences.

  • Not supported right now: __NOEDITSECTION__, __NEWSECTIONLINK__, __NONEWSECTIONLINK__.
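
To check existing wikitext against the first rule above, a small script can separate magic words that stand on a line of their own from those embedded in other text. This is an illustrative sketch only and does not show how smc.mw detects magic words; the function name split_magic_words is hypothetical, and tolerating whitespace around a standalone magic word is an assumption.

```python
import re

# A magic word is written as an upper-case name wrapped in double underscores.
MAGIC_WORD = re.compile(r"__([A-Z]+)__")


def split_magic_words(wikitext):
    """Return (standalone, inline) lists of magic-word names.

    standalone: names found alone on a line (surrounding whitespace is
                ignored, which is an assumption of this sketch).
    inline:     names that share a line with other content.
    """
    standalone, inline = [], []
    for line in wikitext.splitlines():
        match = MAGIC_WORD.fullmatch(line.strip())
        if match:
            standalone.append(match.group(1))
        else:
            inline.extend(MAGIC_WORD.findall(line))
    return standalone, inline
```

Any name reported in the inline list marks a spot where the wikitext relies on MediaWiki's more permissive placement.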

ToDo

  • Test 843: Section extraction, <pre> around bogus header (bug 10309). This should be fixed, because it also affects parser extensions (the math element, etc.). To fix it, the preprocessor grammar needs to parse specific (but not all) XML elements. Counterexample: div elements are transparent for headings, whereas pre elements are not.

Thanks

The parser uses the grako parser generator for PEG grammars by ResQSoft Inc. and Juancarlo Añez.

Authors
