ReaderBench library written in Python
ReaderBench Python
Install
pip3 install --user rbpy-rb
./install.sh
You may also need some spaCy models. They are not installed automatically.
Download each model you need yourself, using the command:
python3 -m spacy download name_of_the_model
The logger will also write instructions on which models you need, and how to download them.
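The download step can also be scripted. The following is a minimal sketch, not part of ReaderBench: it assumes the model is installed as a regular Python package (which is how spaCy distributes models) and simply wraps the command shown above. The helper names `is_model_installed` and `ensure_model` are hypothetical, and the model name in the final comment is only an example.

```python
import importlib.util
import subprocess
import sys


def is_model_installed(model_name: str) -> bool:
    """Return True if a package with this name can be imported.

    spaCy models are installed as ordinary Python packages, so an
    importable package of that name is used here as a proxy.
    """
    return importlib.util.find_spec(model_name) is not None


def ensure_model(model_name: str) -> bool:
    """Run `python -m spacy download <model_name>` if the model is missing.

    Returns True if a download was run, False if the model was already there.
    """
    if is_model_installed(model_name):
        return False
    # Same command as in the instructions above, using the current interpreter.
    subprocess.run(
        [sys.executable, "-m", "spacy", "download", model_name],
        check=True,
    )
    return True


# Example call (the model name is an assumption; use the one the logger reports):
# ensure_model("ro_core_news_sm")
```

Using `sys.executable` keeps the download in the same environment that runs the script, which matters when several Python installations are present.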
Usage
For tokenization, lemmatization, and POS tagging, use:
from rb.parser.spacy_parser import SpacyParser
from rb.core.lang import Lang
from rb.core.document import Document
nlp_ro = SpacyParser.get_instance().get_model(Lang.RO)
test_text_ro = "Am mers repede la magazinul frumos."
# tokenize
docs_ro = nlp_ro(test_text_ro)
# print all attributes of token objects
print(dir(docs_ro[0]))
for token in docs_ro:
    print(token.lemma_, token.is_stop, token.tag_, token.pos_)
For indices use:
from rb.core.lang import Lang
from rb.core.document import Document
doc = Document(Lang.EN, 'This is a sample document. It can contain multiple sentences and paragraphs')
See examples.py for usage examples.
Dev instructions
How to use the logger
In each file you have to initialize the logger:
from rb.utils.rblogger import Logger
logger = Logger.get_logger()
logger.info("info msg")
logger.warning("warning msg")
logger.error("error msg")
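The internals of `rb.utils.rblogger` are not shown here. As a rough analogy only, assuming `Logger.get_logger()` hands back a shared, preconfigured logger, the same pattern can be sketched with the standard `logging` module. The function `get_logger` and the logger name `rb_example` below are hypothetical and are not the library's API.

```python
import logging


def get_logger(name: str = "rb_example") -> logging.Logger:
    """Return a named logger, attaching a stream handler only once."""
    logger = logging.getLogger(name)
    if not logger.handlers:
        # Configure on first use so repeated calls do not duplicate output.
        handler = logging.StreamHandler()
        handler.setFormatter(
            logging.Formatter("%(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
        logger.setLevel(logging.INFO)
    return logger


logger = get_logger()
logger.info("info msg")
logger.warning("warning msg")
logger.error("error msg")  # every level method needs a message argument
```

Because `logging.getLogger` returns the same object for the same name, every file that calls the helper shares one configuration.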