Skip to main content

MediaWiki-compatible parser for Python.

Project description

smc.mw

A mediawiki-compatible parser for Python.

Current status:

https://travis-ci.org/lambdafu/smc.mw.png https://coveralls.io/repos/lambdafu/smc.mw/badge.png?branch=master https://pypip.in/v/smc.mw/badge.png https://pypip.in/d/smc.mw/badge.png

Using

To run the tests:

$ make -C tests

The test result can be found in tests/out/report.html.

A command line tool is available, too (installed as “mw”):

$ echo "''Hello World''" | python smc/mw/tool.py
<html><body><p><i>Hello World</i>
</p></body></html>

Differences

For specific differences, see the test results.

  • __TOC__ and other magic words must appear on a line on their own, while MediaWiki allows them everyhwere with some strange consequences.
  • Not supported right now: __NOEDITSECTION__, __NEWSECTIONLINK, __NONEWSECTIONLINK__.

ToDo

  • Test 843: Section extraction, <pre> around bogus header (bug 10309) This should be fixed, as it affects parser extensions (math element, etc), too. For this, the preprocessor grammar needs to include specific (but not all) XML elements parsed (counterexample: div elements are transparent for headings, pre elements are not).

Thanks

The parser uses the grako parser generator for PEG grammars by ResQSoft Inc. and Juancarlo Añez.

Authors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for smc.mw, version 0.3
Filename, size File type Python version Upload date Hashes
Filename, size smc.mw-0.3.zip (777.4 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page