spoken2written

This package translates spoken language into its written form.

These details have not been verified by PyPI

Project description

# Spoken English to Written English translator

There is a difference between how we speak and how we write. For example, while speaking we say “I paid twenty thousand dollars to xyz organization”, but we do not write it that way; instead we write “I paid $20000 to xyz organization.” This Python module translates such spoken English into its written form.

For example, it will translate:

- “I watched movie named triple H .” to “I watched movie named HHH”
- “My weight is fifty five kilograms .” to “My weight is 55 kg”
- “I paid twenty thousand dollars to xyz organization .” to “I paid $20000 to xyz organization .”

<h1>Installation guide</h1>

Run this command in a terminal: `pip install spoken2written` The dependencies spaCy and word2number are installed along with the package. The package also relies on spaCy's English model, en_core_web_sm, which is installed separately.

To install en_core_web_sm, run the following command in a terminal: `python -m spacy download en_core_web_sm`

<h1>Usage</h1>

First, import the module: `import spoken2written` If the import raises an error, the most likely cause is that the spaCy English model is not installed on your machine. In that case, install en_core_web_sm using the command given above.
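
The import check described above can also be done up front. The following is a minimal sketch and not part of spoken2written: it assumes only that the spaCy model is installed as an importable Python package named `en_core_web_sm` (which is how `spacy download` installs it), and the helper name `spacy_model_installed` is hypothetical.

```python
import importlib.util


def spacy_model_installed(model_name: str = "en_core_web_sm") -> bool:
    """Return True if a top-level package called `model_name` can be found."""
    # find_spec returns None when no importable top-level package has this name.
    return importlib.util.find_spec(model_name) is not None


if spacy_model_installed():
    print("en_core_web_sm is available; importing spoken2written should work.")
else:
    print("Model missing. Run: python -m spacy download en_core_web_sm")
```

Running a check like this before `import spoken2written` gives a clearer message than a failed import.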

After importing the package, use `TextTranslator` to translate spoken English to its written form.

Example script:

`>>> from spoken2written import TextTranslator`

`>>> test = "My life is triple B . European authorities fined Google a record sixty five thousand dollars on Wednesday for abusing its power in the mobile phone market and ordered the company to alter its practices . Furthermore , My T - Shirt size is double X in 2019 and it costs six dollars . My weight is fifty kilograms ."`

`>>> result = TextTranslator(test)`

`>>> print(result)`

Output:

`My life is BBB . European authorities fined Google a record $65000 on Wednesday for abusing its power in the mobile phone market and ordered the company to alter its practices . Furthermore , My T - Shirt size is XX in 2019 and it costs $6 . My weight is 50 kg .`

<h1>Features Used to Develop this package</h1>

Named Entity Recognition (NER) is used to detect entities in the given input. NER is performed with the spaCy library. Entities such as QUANTITY (e.g. weight: fifty kilograms), MONEY (e.g. amount: thousand dollars), and proper nouns are detected using this technique.
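
As a rough illustration of the entity detection step, the sketch below filters spaCy's named entities by label. It is not the package's own code: the helper names `entities_of_interest` and `detect_entities` are hypothetical, and it assumes spaCy and the `en_core_web_sm` model are installed. Which spans the model actually tags depends on the model version, so no output is shown.

```python
# Labels this sketch keeps; MONEY and QUANTITY are entity labels
# produced by spaCy's English models.
WANTED_LABELS = ("MONEY", "QUANTITY")


def entities_of_interest(doc, labels=WANTED_LABELS):
    """Return (text, label) pairs for entities whose label is in `labels`."""
    return [(ent.text, ent.label_) for ent in doc.ents if ent.label_ in labels]


def detect_entities(text):
    """Run spaCy's English pipeline on `text` and keep the wanted entities."""
    import spacy  # imported lazily so the helper above works without spaCy

    nlp = spacy.load("en_core_web_sm")
    return entities_of_interest(nlp(text))
```

Calling `detect_entities("My weight is fifty kilograms .")` is one way to try it; `entities_of_interest` also accepts a `Doc` you have already parsed.
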
The package word2number is used to convert numbers written as words, such as ‘two thousand’, to digits (‘2000’). A few lines of logic then add a prefix or suffix such as $ or kg, depending on the type of entity.
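
The conversion and the prefix/suffix step can be sketched in plain Python. This is a simplified stand-in, not the code of word2number or of this package: the names `words_to_number`, `format_entity`, and `UNIT_SYMBOLS` are hypothetical, only a small vocabulary is covered, and the last word of the phrase is assumed to be the unit.

```python
SMALL = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}
SCALES = {"thousand": 1_000, "million": 1_000_000}

# Hypothetical mapping from a spoken unit to a (prefix, suffix) pair.
UNIT_SYMBOLS = {
    "dollar": ("$", ""), "dollars": ("$", ""),
    "kilogram": ("", " kg"), "kilograms": ("", " kg"),
}


def words_to_number(words: str) -> int:
    """Convert a phrase such as 'sixty five thousand' to an integer."""
    total = 0    # value of groups already closed by "thousand" or "million"
    current = 0  # value of the group currently being read
    for word in words.lower().split():
        if word in SMALL:
            current += SMALL[word]
        elif word in TENS:
            current += TENS[word]
        elif word == "hundred":
            current = max(current, 1) * 100
        elif word in SCALES:
            total += max(current, 1) * SCALES[word]
            current = 0
        else:
            raise ValueError(f"unknown number word: {word!r}")
    return total + current


def format_entity(phrase: str) -> str:
    """Turn 'twenty thousand dollars' into '$20000' using UNIT_SYMBOLS."""
    *number_words, unit = phrase.split()
    prefix, suffix = UNIT_SYMBOLS[unit.lower()]
    return f"{prefix}{words_to_number(' '.join(number_words))}{suffix}"


print(format_entity("twenty thousand dollars"))  # $20000
print(format_entity("fifty five kilograms"))     # 55 kg
```

Keeping the unit table separate from the number parser means new units can be added without touching the conversion logic.
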
Some texts contain expressions such as “double X”. Here the word “double” acts as an adjective and is followed by the noun “X”. To detect such expressions together with their parts of speech, spaCy's token-based Matcher is used. After detection, a few lines of logic translate “double X” to “XX”.
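
The “double X” rewrite can be illustrated without spaCy. The sketch below swaps in a regular expression for the token Matcher, so it ignores parts of speech and only approximates the behaviour described; `collapse_repeats`, `MULTIPLIERS`, and `PATTERN` are hypothetical names.

```python
import re

# How many times the following letter is repeated.
MULTIPLIERS = {"double": 2, "triple": 3}

# A multiplier word followed by a single standalone letter.
PATTERN = re.compile(r"\b(double|triple)\s+([A-Za-z])\b", re.IGNORECASE)


def collapse_repeats(text: str) -> str:
    """Rewrite 'double X' as 'XX' and 'triple H' as 'HHH'."""
    def expand(match):
        count = MULTIPLIERS[match.group(1).lower()]
        return match.group(2) * count

    return PATTERN.sub(expand, text)


print(collapse_repeats("My T - Shirt size is double X in 2019"))
# My T - Shirt size is XX in 2019
print(collapse_repeats("I watched movie named triple H ."))
# I watched movie named HHH .
```

Because the pattern requires a single standalone letter, ordinary phrases such as "double check" are left unchanged.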

<b>The code for all functions in this package can be found in the file spoken2written/spoken2written/spoken2written.py of this repository.</b>

<h1>Bugs/ Errors</h1>

Please ensure that you have installed the spaCy model en_core_web_sm before importing the package spoken2written. If you find any bugs or errors when using the code above, please raise an issue on <a href="https://github.com/HerambVD/spoken2written">GitHub</a>. Otherwise, send an email to <a href="mailto:heramb1711@gmail.com">heramb1711@gmail.com</a>.

Project details

Release history

0.1.4 (this version): Oct 26, 2019

0.1.2: Oct 24, 2019

0.1.1: Oct 24, 2019

Download files

Source Distribution

spoken2written-0.1.4.tar.gz (4.4 kB)

Uploaded Oct 26, 2019 Source

Built Distribution

spoken2written-0.1.4-py3-none-any.whl (4.7 kB)

Uploaded Oct 26, 2019 Python 3

File details

Details for the file spoken2written-0.1.4.tar.gz.

File metadata

Download URL: spoken2written-0.1.4.tar.gz
Upload date: Oct 26, 2019
Size: 4.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3

File hashes

Hashes for spoken2written-0.1.4.tar.gz
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `cf2018c519e103222811e204a41872255068ce0af7aab534c1ce584134b3eb0e` |
| MD5 | `50cacee7d25dcba9726a9968f3da08e4` |
| BLAKE2b-256 | `28d0e044f9531b970b4208303cd22823257982e16e7b88169a2a73c148f0df11` |

File details

Details for the file spoken2written-0.1.4-py3-none-any.whl.

File metadata

Download URL: spoken2written-0.1.4-py3-none-any.whl
Upload date: Oct 26, 2019
Size: 4.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3

File hashes

Hashes for spoken2written-0.1.4-py3-none-any.whl
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `b5e93903c0f79825203ad624ca793f286917a8284bec3a632dae53695ab733e4` |
| MD5 | `0b6e83121b4459705b7a8b701301eeec` |
| BLAKE2b-256 | `e894603cbff1564a07c55d2553778b9b366f583ef0c557b81eba3bb3a0a38abe` |
