A package that can convert a paragraph of spoken english to written english
Project description
Spoken English To Written English Conversion system (s2wconversion)
- This repository contain library which convert spoken english like "Triple A" to written English like AAA or "Double A" to AA.
- To use this library you have to simply install it using pip and write from conversion import conversion (See demo.py)
Installation
Dependencies
s2wconversion requires:
- Python (>= 3.5)
- NumPy (>= 1.11.0)
- speech_recognition
User installation
If you already have a working installation of numpy and speech_recognition, the easiest way to install s2wconversion is using pip
pip install s2w
Conversion
- conversion directory contain converion.py file which have spoken to written funtion as per above rules
- run demo.py file to see how to convert trascript paragraph such as "I worked at Triple a" to "I worked at AAA". After run demo.py it says "Say Something" then spoke anything you want then by using google speech recognition services it trascript paragraph spoken with abbrevations conversion rules.
conversion rules
- conversion rules directory contain rules.md file and rules.py file.
- In rules.md file i explained about adding some grammer rules such as Part of speech in a paragraph or decontracted phrases
- rules.py file contain some of these rules implemented
- We convert transcript speech into tokens then after apply nltk pos_tagg to detect pos for words and implement some rules (See rules.md)
How to use
- Clone or download this repository.
- run demo.py file to see how it works.
- after run demo.py it ask some input "Say Something".
- then it converts speech into written text usin Google Speech Recognition Services, having rules like "Triple A" is "AAA" etc
- See conversion rule\rules.py file to checkout more rules like punctuations or different pos tags.
Future implementations and contributions
- This repo always open for open source contributions.
- We can add some more pos tagging rules and implement it to our library.
- Also we can use deep learning pos tagging for get better accuracy and add more grammer rules.
- Can add some more abbrevations such as we can add rules for "AAAA", "AAAAA" etc.I think of quad a, pent a etc.
LICENSE
GNU General Public License v3.0 (gplv3)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
s2w-1.0.0.tar.gz
(3.0 kB
view details)
Built Distribution
s2w-1.0.0-py3-none-any.whl
(15.7 kB
view details)
File details
Details for the file s2w-1.0.0.tar.gz
.
File metadata
- Download URL: s2w-1.0.0.tar.gz
- Upload date:
- Size: 3.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e1065400c01831c01e6036f9d6517c66b8dd75e45afbdcc40e9e9e4c345a05b1 |
|
MD5 | 63021f4fc6a1f9a88bb448c8fa529206 |
|
BLAKE2b-256 | fee579912488be6a6e8ece0c08088bc648042489f9285fe20686e1ce02dbe79b |
File details
Details for the file s2w-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: s2w-1.0.0-py3-none-any.whl
- Upload date:
- Size: 15.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2980ef88707dfeb3505319bb1c5153de6a2e6ce545271478d3c68b9571c92d0c |
|
MD5 | 7b9359dbf549eca73168a32f9b2e0229 |
|
BLAKE2b-256 | f68240e0cd9c5970fa6a25602cb39f353f897dfffeafba13cb0ef619db489f9b |