RPly 木兰编程语言定制版
Project description
在 rply 0.7.8 的基础上,作了如下修改:
为减小工作量,暂时放弃对 pypy、Python 2 等版本的支持,仅支持 Python 3.7。
各方面中文化:异常信息与标识符命名等;添加中文 API,同时仍然支持原英文 API。
- 改进:
添加 shift/reduce 歧义的调试信息,详见此文。
【试验】通过根据语法规则分词,支持中文无空格语法,演示在此。
### 测试
pytest 运行所有测试用例,或运行单个用例,如 pytest tests/test_both.py
以下为 rply 原始文档。
Welcome to RPLY! A pure Python parser generator, that also works with RPython. It is a more-or-less direct port of David Beazley’s awesome PLY, with a new public API, and RPython support.
You can find the documentation online.
Basic API:
from rply import ParserGenerator, LexerGenerator
from rply.token import BaseBox
lg = LexerGenerator()
# Add takes a rule name, and a regular expression that defines the rule.
lg.add("PLUS", r"\+")
lg.add("MINUS", r"-")
lg.add("NUMBER", r"\d+")
lg.ignore(r"\s+")
# This is a list of the token names. precedence is an optional list of
# tuples which specifies order of operation for avoiding ambiguity.
# precedence must be one of "left", "right", "nonassoc".
# cache_id is an optional string which specifies an ID to use for
# caching. It should *always* be safe to use caching,
# RPly will automatically detect when your grammar is
# changed and refresh the cache for you.
pg = ParserGenerator(["NUMBER", "PLUS", "MINUS"],
precedence=[("left", ['PLUS', 'MINUS'])], cache_id="myparser")
@pg.production("main : expr")
def main(p):
# p is a list, of each of the pieces on the right hand side of the
# grammar rule
return p[0]
@pg.production("expr : expr PLUS expr")
@pg.production("expr : expr MINUS expr")
def expr_op(p):
lhs = p[0].getint()
rhs = p[2].getint()
if p[1].gettokentype() == "PLUS":
return BoxInt(lhs + rhs)
elif p[1].gettokentype() == "MINUS":
return BoxInt(lhs - rhs)
else:
raise AssertionError("This is impossible, abort the time machine!")
@pg.production("expr : NUMBER")
def expr_num(p):
return BoxInt(int(p[0].getstr()))
lexer = lg.build()
parser = pg.build()
class BoxInt(BaseBox):
def __init__(self, value):
self.value = value
def getint(self):
return self.value
Then you can do:
parser.parse(lexer.lex("1 + 3 - 2+12-32"))
You can also substitute your own lexer. A lexer is an object with a next() method that returns either the next token in sequence, or None if the token stream has been exhausted.
Why do we have the boxes?
In RPython, like other statically typed languages, a variable must have a specific type, we take advantage of polymorphism to keep values in a box so that everything is statically typed. You can write whatever boxes you need for your project.
If you don’t intend to use your parser from RPython, and just want a cool pure Python parser you can ignore all the box stuff and just return whatever you like from each production method.
Error handling
By default, when a parsing error is encountered, an rply.ParsingError is raised, it has a method getsourcepos(), which returns an rply.token.SourcePosition object.
You may also provide an error handler, which, at the moment, must raise an exception. It receives the Token object that the parser errored on.
pg = ParserGenerator(...)
@pg.error
def error_handler(token):
raise ValueError("Ran into a %s where it wasn't expected" % token.gettokentype())
Python compatibility
RPly is tested and known to work under Python 2.7, 3.4+, and PyPy. It is also valid RPython for PyPy checkouts from 6c642ae7a0ea onwards.
Links
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file rply-ulang-0.8.3.tar.gz
.
File metadata
- Download URL: rply-ulang-0.8.3.tar.gz
- Upload date:
- Size: 21.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 30bdd9b5e505d54c8af6e484898d6699bc282b0e3ea2c168e4ad298881207b37 |
|
MD5 | 49a115fdce6b4d76c291f634a8e7618c |
|
BLAKE2b-256 | 056b69e7fc0a1c957a1ad06b46934da11fcf806492592bd170dbe6fcac71d2fd |
File details
Details for the file rply_ulang-0.8.3-py2.py3-none-any.whl
.
File metadata
- Download URL: rply_ulang-0.8.3-py2.py3-none-any.whl
- Upload date:
- Size: 22.0 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e05985bc1ea90e0b290324fe2a9b8412b1ae6d2fb049ace2936d431727b27ace |
|
MD5 | 348e5c72b2dd642f534b4244cbb94876 |
|
BLAKE2b-256 | 7049eeb520a6d81456ab93cb5a128bde1c6bc3a90cfd6a18dc6dfd154fe01f58 |