Skip to main content

A machine readable multilingual dialog corpus

Project description

# ChatterBot Language Training Corpus

[![Package Version](https://img.shields.io/pypi/v/chatterbot-corpus.svg)](https://pypi.python.org/pypi/chatterbot-corpus/)
[![Build Status](https://travis-ci.org/gunthercox/chatterbot-corpus.svg?branch=master)](https://travis-ci.org/gunthercox/chatterbot-corpus)

These modules are used to quickly train ChatterBot to respond to various inputs in different languages.
Although much of ChatterBot is designed to be language independent, it is still useful to have these
training sets available to prime a fresh database and make the variety of responses that a bot can yield
much more diverse.

For instructions on how to use these data sets, please refer to the [project documentation](http://chatterbot.readthedocs.io/en/latest/training.html#training-with-corpus-data).

All training data contained within this corpus is user contributed.

If you are interested in contributing support for a new language please create a pull request. Additions are welcomed!

# Create your own Corpus Training data

Chatterbot is a very flexible and dynamic chatbot that you easily can create your own training data and structure.

Create or copy an existing .yml file and put that file in a existing or a new directory you created under `chatterbot_corpus\data\<NEW DIRECTORY>`
Edit that file with any text editor that you like to work with.

In the beginning of the file you set one or two categories.
```
categories:
- myown
- my own categories
````

Then can you start your actual training conversation data.

```
conversations:
- - Hello
- Hello
- - Hi
- Hello
```

# Install your training corpus data to Django
You need to install chatterbot as the [Quick Start Guide](http://chatterbot.readthedocs.io/en/stable/quickstart.html).
When the installation are done, please go to `(Virtual Env)/lib/pythonX.X/site-packages/chatterbot_corpus/data/` directory.
Here is the same structure as you can find in this GitHub repo, here is the area where you can create your own directories and conversation files.

When you are done with your files, then can you edit the Django `setting.py` file and locate the chatterbot training section.
Here do you need to add `chatterbot.corpus.<DIRECTORY>.<FILENAME>`

```
'training_data': [
'chatterbot.corpus.english.greeting',
'chatterbot.corpus.custom.myown',
'chatterbot.corpus.swedish.food'
]
```

When you are done, please proceed with the Django Chatterbot Training session.

# Unit Testing

“A true professional does not waste the time and money of other people by handing over software that is not reasonably free of obvious bugs; that has not undergone minimal unit testing; that does not meet the specifications and requirements; that is gold-plated with unnecessary features; or that looks like junk.” – Daniel Read

```
pip install -r requirements.txt
nosetests
```

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

landa-chatterbot-corpus-1.2.0.tar.gz (124.3 kB view details)

Uploaded Source

File details

Details for the file landa-chatterbot-corpus-1.2.0.tar.gz.

File metadata

File hashes

Hashes for landa-chatterbot-corpus-1.2.0.tar.gz
Algorithm Hash digest
SHA256 dfb15184ab4dd4b52f35a3d42e9314e50f7fdf6a46e9208d1515d515a6b14b68
MD5 57983003969fd2f8b6b6b549e72deaf9
BLAKE2b-256 a816fcd0c0cad84404b4984a29bc44bc002e8e388cc9baba634c1eb3a8b359fb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page