Skip to main content

doctalk: Dialogue agent handling queries about a given text document

Project description

doctalk

Python-based summary and keyword extractor and question answering system with optional BERT-based post-processing filter and spoken output

##INSTALL See the requirements.txt file if installing directly. To make an editable package locally, use

pip3 install -e .

To embed in a system as is, from pypi.org use

pip3 install -U doctalk

##USAGE:

python3 -i

>>> from doctalk.talk import run_with
>>> run_with(fname)
>>> from doctalk.think import reason_with
>>> reason_with(fname)

This activates dialog about document in <fname>.txt with questions in <fname>_quests.txt

See some examples at :

https://github.com/ptarau/pytalk , where, after installing the system, you can run

python3 -i tests.py
>>> go()
>>> tgo()

To run the system one will need to start the Stanford Corenlp Server, listening on port 9000 with all annotators in params.py started, i.e., with something like:

java -mx16g -cp "stanford-corenlp-full-2018-10-05/*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer 
-preload tokenize,ssplit,pos,lemma,depparse,natlog,openie

To play with various parameter settings, edit the doctalk/params.py file.

Please see the install hints for stanfordnlp, that might involve torch binaries, and require anaconda on some systems.

Web App

To use the included ``docbootWeb app, after installingFlask``` and ```waitress```, run the ```app.py``` file in directory ```docbot``` with ```python3 app.py```.

The docbot uses a JSON-based API, documented in doctalk/api.py. In fact, this is the simplest way to integrate the summarizer and the dialog agent into a production system.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

doctalk-0.4.0.tar.gz (146.7 kB view details)

Uploaded Source

Built Distribution

doctalk-0.4.0-py3-none-any.whl (146.6 kB view details)

Uploaded Python 3

File details

Details for the file doctalk-0.4.0.tar.gz.

File metadata

  • Download URL: doctalk-0.4.0.tar.gz
  • Upload date:
  • Size: 146.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.6

File hashes

Hashes for doctalk-0.4.0.tar.gz
Algorithm Hash digest
SHA256 0597c5f6500962c343bc78393ca216476f5335cd5a3e1a49a03af4007d4299d2
MD5 75ef03ce02498d850f414b133ee0da38
BLAKE2b-256 12714851c1ade7073765facb35ebfee58f3263cd17b925810d725af3b9c748ee

See more details on using hashes here.

File details

Details for the file doctalk-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: doctalk-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 146.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.6

File hashes

Hashes for doctalk-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 939e58ab5378705bafa27de4c7e8ba1d41ea051cf0095c5ddc28fc68b542ca66
MD5 dd03b61e1d00242213da5a09af8f2339
BLAKE2b-256 699af0d7aa897e41153484a2eec9444fc57c6effb92f234694493c5c92214794

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page