doctalk: Dialogue agent handling queries about a given text document
Project description
doctalk
Python-based summary and keyword extractor and question answering system with optional BERT-based post-processing filter and spoken output
##INSTALL
See the requirements.txt
file if installing directly. To make an editable package locally, use
pip3 install -e .
To embed in a system as is, from pypi.org
use
pip3 install -U doctalk
##USAGE:
python3 -i
>>> from doctalk.talk import run_with
>>> run_with(fname)
>>> from doctalk.think import reason_with
>>> reason_with(fname)
This activates dialog about document in <fname>.txt
with questions in <fname>_quests.txt
See some examples at :
https://github.com/ptarau/pytalk , where, after installing the system, you can run
python3 -i tests.py
>>> go()
>>> tgo()
To run the system one will need to start the Stanford Corenlp Server
, listening on port 9000
with all annotators in params.py started, i.e., with something like:
java -mx16g -cp "stanford-corenlp-full-2018-10-05/*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer
-preload tokenize,ssplit,pos,lemma,depparse,natlog,openie
To play with various parameter settings, edit the doctalk/params.py
file.
Please see the install hints for stanfordnlp
, that might involve torch binaries, and require anaconda on some systems.
Web App
To use the included ``docbootWeb app, after installing
Flask``` and ```waitress```, run the ```app.py``` file in directory ```docbot``` with ```python3 app.py```.
The docbot uses a JSON-based API, documented in doctalk/api.py
. In fact, this is the simplest way to integrate the summarizer and the dialog agent into a production system.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file doctalk-0.4.0.tar.gz
.
File metadata
- Download URL: doctalk-0.4.0.tar.gz
- Upload date:
- Size: 146.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0597c5f6500962c343bc78393ca216476f5335cd5a3e1a49a03af4007d4299d2 |
|
MD5 | 75ef03ce02498d850f414b133ee0da38 |
|
BLAKE2b-256 | 12714851c1ade7073765facb35ebfee58f3263cd17b925810d725af3b9c748ee |
File details
Details for the file doctalk-0.4.0-py3-none-any.whl
.
File metadata
- Download URL: doctalk-0.4.0-py3-none-any.whl
- Upload date:
- Size: 146.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 939e58ab5378705bafa27de4c7e8ba1d41ea051cf0095c5ddc28fc68b542ca66 |
|
MD5 | dd03b61e1d00242213da5a09af8f2339 |
|
BLAKE2b-256 | 699af0d7aa897e41153484a2eec9444fc57c6effb92f234694493c5c92214794 |