Skip to main content

Python wrapper for libparsing, a PEG-based parsing library written in C

Project description

libparsing is a parsing element grammar (PEG) library written in C with Python bindings. It offers good performance while allowing for a lot of flexibility. It is mainly intended to be used to create programming languages and software engineering tools.

As opposed to more traditional parsing techniques, the grammar is not compiled but constructed using an API that allows for the dynamic update of the grammar.

The parser does not do any tokeninzation, instead, an input stream is consumed and parsing elements are dynamically asked to match the next element of it. Once parsing elements match, the resulting matched input is processed and an action is triggered.

libparsing supports the following features:

  • backtracking, ie. going back in the input stream if a match is not found

  • cherry-picking, ie. skipping unrecognized input

  • contextual rules, ie. a rule that will match or not depending on external variables

Parsing elements are usually slower than compiled or FSM-based parsers as they trade performance for flexibility. It’s probably not a great idea to use libparsing if the parsing has to happen as fast as possible (ie. a protocol implementation), but it is a great use for programming languages, as it opens up the door to dynamic syntax plug-ins and multiple language embedding.

If you’re interested in PEG, you can start reading Brian Ford’s original article. Projects such as PEG/LEG by Ian Piumarta http://piumarta.com/software/peg/ ,OMeta by Alessandro Warth http://www.tinlizzie.org/ometa/ or Haskell’s Parsec library https://www.haskell.org/haskellwiki/Parsec are of particular interest in the field.

Here is a short example of what creating a simple grammar looks like in Python:

g = Grammar()
s = g.symbols
g.token("WS",       "\s+")
g.token("NUMBER",   "\d+(\.\d+)?")
g.token("VARIABLE", "\w+")
g.token("OPERATOR", "[\/\+\-\*]")
g.group("Value",     s.NUMBER, s.VARIABLE)
g.rule("Suffix",     s.OPERATOR._as("operator"), s.Value._as("value"))
g.rule("Expression", s.Value, s.Suffix.zeroOrMore())
g.axiom(s.Expression)
g.skip(s.WS)
match = g.parseString("10 + 20 / 5")

and the equivalent code in C

Grammar* g = Grammar_new()
SYMBOL(WS,         TOKEN("\\s+"))
SYMBOL(NUMBER,     TOKEN("\\d+(\\.\\d+)?"))
SYMBOL(VARIABLE,   TOKEN("\\w+"))
SYMBOL(OPERATOR,   GROUP("[\\/\\+\\-\\*]"))
SYMBOL(Value,      GOUP(_S(NUMBER), _S(VARIABLE)))
SYMBOL(Suffix,     RULE(_AS(_S(OPERATOR), "operator"), _AS(_S(Value), "value")))
SYMBOL(Expression, RULE(_S(Value), _MO(Suffix))
g->axiom = s_Expression;
g->skip(s_WS);
Grammar_prepare(g);
Match* match = Grammar_parseString(g, "10 + 20 / 5")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

libparsing-0.9.2.tar.gz (33.3 kB view details)

Uploaded Source

Built Distribution

libparsing-0.9.2.linux-x86_64.tar.gz (280.0 kB view details)

Uploaded Source

File details

Details for the file libparsing-0.9.2.tar.gz.

File metadata

  • Download URL: libparsing-0.9.2.tar.gz
  • Upload date:
  • Size: 33.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for libparsing-0.9.2.tar.gz
Algorithm Hash digest
SHA256 b841ac1aced0b395bdab1f193792b156a538073093cd84f96fa0f7c95fa02c89
MD5 48ced31585033955085a21224a5b6806
BLAKE2b-256 06be1459ca3bfb8b12b8eaf35924744fe5d7ed6deaec2cd315e1078975331bee

See more details on using hashes here.

File details

Details for the file libparsing-0.9.2.linux-x86_64.tar.gz.

File metadata

File hashes

Hashes for libparsing-0.9.2.linux-x86_64.tar.gz
Algorithm Hash digest
SHA256 02ba3aa42da78677962e51e7c1dae04f0e842ee04c88b78ac5948fcaca1bbb3e
MD5 04a588e8808fe304b2e1317a242ca176
BLAKE2b-256 bf4872e4ffadd58801ebd617656a63755c6bf31c6cde1b3dc831cbee85580628

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page