Skip to main content

A French-English text realizer

Project description

pyRealB - A Python Bilingual Text Realizer

Version 2.3 - February 2023

pyRealB is a Python adaptation of the JavaScript jsRealB text realizer with the same constituent and dependency syntax notation. It facilitates its integration within Python applications by simply adding

from pyrealb import *

Online documentation

Installing the distribution package from PyPI

pip install pyrealb

Upgrading the version

pip install pyrealb --upgrade

Building and installing the package from the sources

  1. cd into this directory (with pyproject.toml file)
  2. Build the distribution package python3 -m build
  3. Install with python3 -m pip install .

First realization tests at the Python 3 prompt

  1. from pyrealb import *
  2. loadEn()
  3. print(S(Pro("I").g("f"),VP(V("say"),"hello",PP(P("to"),NP(D("the"),N("world"))))))
  4. this should print She says hello to the world.
  5. print(root(V("say").t("ps"),subj(Pro("him").c("nom")),comp(N("goodbye"))).typ({"neg":True}))
  6. this should print He did not say goodbye.

Use pyrealb in a Jupyter notebook thru Binder

  1. in a browser, load one of these links:

Directories

  • src
    • __init__.py : empty program that imports subpackages and exports relevant symbols.
    • Constituent.py: Constituent is the top class for methods shared between Phrases and Terminals
    • Lexicon.py: class to access lexicon entries and syntactic rules
    • Number.py : utility functions for dealing with number formatting
    • Phrase.py : subclass of Constituent for creating complex phrases
    • Dependent.py : subclass of Constituent for creating complex phrases using dependencies
    • Terminal.py : subclass of Constituent for creating a single unit (most often a single word)
    • util.py : some useful functions
    • Warning.py : function to generate warnings in case of erroneous specifications using pyRealB itself
    • data:
      • lexicon-en.json : English lexicon (33,932 entries) in json format
      • rule-en.js : English conjugation and declension tables
      • lexicon-fr.json : French lexicon (52,547 entries) in json format
      • rule-fr.js : French conjugation and declension tables

Nota bene:

  1. In the following directories, the __init__.py file is used to set the appropriate search path for pyRealB functions; this ensures that the current Python source files are used for execution.
  2. Some directories include markup.py which should be loaded using pip. Unfortunately I never managed to make this "piped" version work, it does not import the name onelineralthough it should. It works once the file is in the local directory.
  • docs: in both English and French.

    • documentation.html : generated documentation (used for consultation) DO NOT EDIT directly Online version
    • documentation.py: Python program for generating documentation.html using markup.py
      once this is run documentation.html should be copied at http://www.iro.umontreal.ca/~lapalme/pyrealb/documentation.html which is used for consultation
    • style.css: style sheet for the documentation
    • userinfos.py: definitions of variables containing the examples
    • user.js : Python helper script.
  • IDE : Integrated Development Environment

    • ide.py: built on the Python read-eval-print loop, it imports pyRealB to get the realization of an expression, to consult the lexicon, the conjugation and declension tables. It is also possible to get a lemmatization: i.e. the pyRealB expression corresponding to a form.
    • README.html: documentation and examples
  • tests : unit tests of special features of pyRealB in both French and English. Files have the pattern *_{en|fr}.py

    • test.py: simplistic function to check if a function returns the expected answer and display appropriate message
    • testAll.html : run this file to run all tests

Demos

  • 99bottlesofbeer/99bottlesofbeer.py : simple generation of a classic repetitive text in English.
  • evenementsDemo/evenements.py : Description (in French) of a list of events, it creates HTML.
  • flight_infos/README.md : development of a RASA NLG server giving information about flights, aircrafts, etc...
  • gophypi/amr2text.py : generate a literal reading of an AMR (Abstract Meaning Representation); paper describing the approach
  • inflectionDemo/inflection.py : French or English conjugation and declension of a form.
  • kilometresapied/kilometresapied.py : simple generation of a classic repetitive text in French.
  • methodius/methodius.py : generation of English sentences from a logical form expressed in XML.
  • randomgen/randomgen.py: Generation of random English sentences
  • RDFpyrealb/WebGenerate.py : Generation from RDF triples
  • report/report.py : Single sentence parameterized by language, tense and subject
  • simple_example/simple_example.py: a few examples of English and French expressions to be realized, useful for debugging
  • variantes/variantes.py: French or English sentences realized with all possible sentence modifiers; some challenging examples are in examples.py.
  • weather/Bulletin.py: French and English weather bulletins generated from information in a json-line file. (weather-data.jsonl). It uses the packages in the Realization directory.

Contact

Guy Lapalme

Acknowledgement

Thanks to Fabrizio Gotti for helping to organize the Python package.

For the maintainer mainly

Updating package version on PyPI

see this tutorial

These steps take for granted that the password for PyPI has already been given...

  1. update version number in setup.cfg (it should be the same as python_version in src/pyrealb/utils.py)
  2. cd into the directory with the pyproject.toml file (the same as this README.md)
  3. Build the distribution package
    python3 -m build
  4. Upload to PyPi
    twine upload dist/*
  5. Install new version from PyPI
    python3 -m pip install pyrealb --upgrade

Useful trick for debugging with breaking point and tracing

  1. add pyrealb expression to debug at the end of demo/simple_example.py
  2. comment the line calling testPrevious()
  3. debug demo/simple_example.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyrealb-2.3.2.tar.gz (525.4 kB view hashes)

Uploaded Source

Built Distribution

pyrealb-2.3.2-py3-none-any.whl (541.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page