Skip to main content

A collection of basic python modules for spoken natural language processing

Project description

py-nltools
==========

+-----------------------------------------------------------------------------------------------+
| nltools |
| +-----------+ +-----------+ +------------+ |
| | tokenizer | | phonetics | | threadpool | |
| +-----------+ +-----------+ +------------+ |
| |
| +-----------+ +-----------+ +-----------+ +-----------+ +-----------+ |
| | tts | | asr | | vad | | g2p | | audio | |
| +-----------+ +-----------+ +-----------+ +-----------+ +-----------+ |
| | | | | | |
+-----------------------------------------------------------------------------------------------+
| | | | |
+--------+---------+ +------+----+ | | |
| | | | | | | |
v v v v v v v v
+------+ +--------+ +------+ +-------+ +-----------+ +--------+ +----------+ +------------+
| mary | | eSpeak | | pico | | kaldi | | cmusphinx | | webrtc | | sequitur | | pulseaudio |
+------+ +--------+ +------+ +-------+ +-----------+ +--------+ +----------+ +------------+

A collection of abstraction layers and support functions that form the
natural language processing foundation of the Zamia AI project:

- `phonetics`: translation functions between various phonetic
alphabets (IPA, X-SAMPA, X-ARPABET, …)

- `tts`: abstraction layer towards using eSpeak NG, MaryTTS, SVOX Pico
TTS or a remote TTS server and sequitur g2p

- `asr`: abstraction layer towards using kaldi-asr and pocketsphinx,
models can be found here: <http://goofy.zamia.org/voxforge/>

- `sequiturclient`: g2p using sequitur

- `pulseplayer`: audio playback through pulseaudio

- `pulserecorder`: audio recording through pulseaudio

- `tokenizer`: english and german word tokenizer aimed at spoken
language applications

- `threadpool`: simple thread pool implementation

- `vad`: Voice Activity Detection finite state machine based on webrtc
VAD

I plan to add modules as I need them in the Zamia AI projects. Some
modules like `phonetics` and `tokenizer` have some overlap with larger
projects like NLTK or spaCy - my modules tend to be more hands-on and
simple minded than these and therefore are in no way meant to replace
them.

Requirements
------------

**Note**: probably incomplete.

- Python 2.7

- for TTS one or more of:

- MaryTTS, py-marytts

- espeak-ng, py-espeak-ng

- SVOX Pico TTS, py-picotts

- for ASR one or more of:

- kaldi-asr 5.1, py-kaldi-asr

- pocketsphinx

- sequitur

- pulseaudio

- webrtc

License
-------

My own code is LGPLv3 licensed unless otherwise noted in the script’s
copyright headers.

Some scripts and files are based on works of others, in those cases it
is my intention to keep the original license intact. Please make sure to
check the copyright headers inside for more information.

Author
------

Guenter Bartsch \<<guenter@zamia.org>\>


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py-nltools-0.1.4.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

py_nltools-0.1.4-py2.py3-none-any.whl (31.1 kB view details)

Uploaded Python 2Python 3

File details

Details for the file py-nltools-0.1.4.tar.gz.

File metadata

  • Download URL: py-nltools-0.1.4.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for py-nltools-0.1.4.tar.gz
Algorithm Hash digest
SHA256 98d833d3d19fc98208f88065b4bad2daa55ed9fcd741cf65d73c0d58f77c3bde
MD5 08726625313084d22e13aa0f7f5dbff9
BLAKE2b-256 edcd8c6934234fade64dd5cb9c7326b0079ec33f583f3d50874e400541752171

See more details on using hashes here.

File details

Details for the file py_nltools-0.1.4-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for py_nltools-0.1.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 9ebe713f5db3b59b76a966c7580b52b1c0071679c7ba736c2d851578a63cd931
MD5 d48f6163eaef61b7b1b1df32c576a6b2
BLAKE2b-256 558e760c16bba637c7bcfaed2e1c97bc504f50a4b1ff0e808f917121ffd2b47a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page