A list of similar sounding words to help disambiguate voice coding
Project description
Similar sounding words
This is a list of similar sounding words that I have collected from various sources on the web and added to as I find new pairs.
Unlike most homophone, homograph, and homonym resources this list is not targeting ESL or educational use. Instead it is designed for finding common errors in speech recognition texts. Specifically I use it with Caster for voice programming.
In addition to my custom file I currently have five different sources:
- https://7esl.com/homophones/
- https://web.ku.edu/~edit/wordsall.html
- http://www.singularis.ltd.uk/bifroest/misc/homophones-list.html
- https://www.teachingtreasures.com.au/teaching-tools/Basic-worksheets/worksheets-english/upper/homophones-list.htm
- https://www.thoughtco.com/homonyms-homophones-and-homographs-a-b-1692660
I preprocessed some of these files in a text editor and random three Jupiter notebook available in the github repository to generate the index.
All source files are copyright their respective authors.
TODO
- add mapping of words to numbers
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for similar-sounding-words-0.1.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | e8187c091a4e00c655413f43eaeb7a783fd3219a58dac0d299bf5c9d03aaf3d3 |
|
MD5 | 5bfff5c3765bb8e88cd847b052fbb656 |
|
BLAKE2b-256 | bc7c49deef2136bbf0183c11f8dc0e411d8fad0749c6c6c28a1bac5cae905c5a |
Hashes for similar_sounding_words-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a76087c8ec732e7a6a4079e647b05bdc213d20249a6ee32224a4522e3b0bb0d9 |
|
MD5 | 8c943e94f8a88c04d935aa7c8e5c1cfd |
|
BLAKE2b-256 | 38dc967605ad4d904c11f7baa4daaf27254f868e9f75461d20aea7090000a54a |