A package that converts a paragraph of spoken English to written English

Project description

Spoken English To Written English Conversion system (s2wconversion)

  • This repository contains a library that converts spoken English such as "Triple A" into written English such as "AAA", or "Double A" into "AA".
  • To use the library, install it with pip and write from conversion import conversion (see demo.py).

Installation

Dependencies

s2wconversion requires:

  • Python (>= 3.5)
  • NumPy (>= 1.11.0)
  • speech_recognition

User installation

If you already have a working installation of NumPy and speech_recognition, the easiest way to install s2wconversion is with pip:

pip install s2w

Conversion

  • The conversion directory contains the conversion.py file, which holds the spoken-to-written function that follows the rules described above.
  • Run demo.py to see how a transcribed paragraph such as "I worked at Triple a" is converted to "I worked at AAA". When run, demo.py prompts "Say Something"; say anything you like, and the script transcribes your speech with the Google Speech Recognition service and applies the abbreviation conversion rules.
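
The "Triple a" to "AAA" rule can be illustrated with a short regular-expression sketch. This is not the package's own conversion.py; the names MULTIPLIERS and spoken_to_written are hypothetical, and the table only covers the two multipliers the description mentions.

```python
import re

# Hypothetical multiplier table; the description only names "Double" and "Triple".
MULTIPLIERS = {"double": 2, "triple": 3}

# A multiplier word followed by a single standalone letter, e.g. "Triple a".
_PATTERN = re.compile(
    r"\b(" + "|".join(MULTIPLIERS) + r")\s+([A-Za-z])\b",
    re.IGNORECASE,
)


def spoken_to_written(text: str) -> str:
    """Replace phrases like 'Triple a' with the repeated upper-case letter."""

    def expand(match: re.Match) -> str:
        count = MULTIPLIERS[match.group(1).lower()]
        return match.group(2).upper() * count

    return _PATTERN.sub(expand, text)


print(spoken_to_written("I worked at Triple a"))
```

Because the pattern requires a single letter followed by a word boundary, a phrase such as "triple apple" is left alone. A phrase like "a double a day" would still be rewritten, so a real implementation probably needs extra context checks.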

Conversion rules

  • The conversion rules directory contains the rules.md and rules.py files.
  • rules.md explains additional grammar rules, such as rules based on part-of-speech (POS) tags in a paragraph or on decontracted phrases.
  • rules.py implements some of these rules.
  • The transcribed speech is split into tokens, NLTK's pos_tag is applied to detect the part of speech of each word, and the rules are then applied to the tagged tokens (see rules.md).
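
As an illustration of what a decontraction rule might look like, here is a minimal sketch using only the standard library. The rule list and the name decontract are assumptions made for this example; the actual rules in rules.md and rules.py may differ.

```python
import re

# Illustrative contraction rules (assumed, not taken from rules.py).
# Order matters: irregular forms must come before the generic "n't" rule.
CONTRACTIONS = [
    (r"\bwon't\b", "will not"),
    (r"\bcan't\b", "cannot"),
    (r"n't\b", " not"),
    (r"'re\b", " are"),
    (r"'ll\b", " will"),
    (r"'ve\b", " have"),
    (r"'m\b", " am"),
]


def decontract(text: str) -> str:
    """Expand common English contractions; original capitalisation is not preserved."""
    for pattern, replacement in CONTRACTIONS:
        text = re.sub(pattern, replacement, text, flags=re.IGNORECASE)
    return text


print(decontract("I can't say they're wrong"))
```

The "'s" suffix is left out on purpose: it can mean "is", "has", or a possessive, and telling these apart is the kind of decision a POS tag could inform.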

How to use

  • Clone or download this repository.
  • Run demo.py to see how it works.
  • When run, demo.py prompts for input with "Say Something".
  • It then converts the speech into written text using the Google Speech Recognition service and applies rules such as "Triple A" becoming "AAA".
  • See conversion rule\rules.py for more rules, such as punctuation or different POS tags.
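
The flow described in these steps can be sketched roughly as follows. This is not the contents of demo.py; transcribe_from_microphone and run_demo are hypothetical names, and the conversion step is passed in as a function because the package's exact call signature is not documented here. The speech_recognition calls shown (Recognizer, Microphone, listen, recognize_google) are part of that library, and Microphone additionally needs PyAudio installed.

```python
from typing import Callable


def transcribe_from_microphone() -> str:
    """Record one utterance and transcribe it with Google Speech Recognition."""
    # Imported lazily so the rest of the sketch loads without the package or a microphone.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        print("Say Something")
        audio = recognizer.listen(source)
    # Raises sr.UnknownValueError if the speech cannot be understood and
    # sr.RequestError if the recognition service cannot be reached.
    return recognizer.recognize_google(audio)


def run_demo(
    convert: Callable[[str], str],
    transcribe: Callable[[], str] = transcribe_from_microphone,
) -> str:
    """Transcribe speech, then apply the spoken-to-written conversion."""
    transcript = transcribe()
    return convert(transcript)
```

Calling run_demo with the package's conversion function records from the default microphone. Passing a stand-in for transcribe, for example lambda: "I worked at Triple a", makes it possible to try the conversion step without any audio hardware.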

Future implementations and contributions

  • This repo is always open to open source contributions.
  • More POS tagging rules can be added and implemented in the library.
  • Deep learning POS tagging could also be used to get better accuracy and to support more grammar rules.
  • More abbreviations can be added, such as rules for "AAAA" and "AAAAA", possibly spoken as "quad a" and "pent a".

LICENSE

GNU General Public License v3.0 (gplv3)
