TMH Speech package

These details have not been verified by PyPI

Project links

Development Status
- 1 - Planning
Intended Audience
- Developers
Operating System
Programming Language
- Python :: 3

Project description

TMH Speech

TMH Speech is a library that gives access to open source models for transcription.

Read the docs

https://tmh-docs.readthedocs.io/en/latest/docs.html#getting-started

Getting started

To get started, first install tmh, then install pyannote from its develop branch, since we are using newer packages.

pip install tmh
pip install https://github.com/pyannote/pyannote-audio/archive/develop.zip

Example usage

Transcription

from tmh.transcribe import transcribe_from_audio_path
file_path = "./sv.wav"
transcription = "Nu prövar vi att spela in ljud på svenska sex laxar i en laxask de finns en stor banan"
print("creating transcription")
asr_transcription = transcribe_from_audio_path(file_path)
print("output")
print(asr_transcription)
print("the transcription is", transcription)
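
The example above prints the ASR output next to a reference transcription. One common way to compare the two is the word error rate (WER). The following is a minimal sketch, not part of tmh: the helper name word_error_rate is hypothetical, and it assumes the transcription is available as a plain string, which this page does not confirm.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # previous[j] is the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(ref)


reference = "sex laxar i en laxask"
hypothesis = "sex laxar i en lax ask"  # made-up ASR output, for illustration only
print(word_error_rate(reference, hypothesis))
```

A WER of 0.0 means the two word sequences are identical. Values can exceed 1.0 when the hypothesis contains many extra words.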

Transcribe with VAD

from tmh.transcribe_with_vad import transcribe_from_audio_path_split_on_speech
file_path = "./sv.wav"
print("creating transcription")
asr_transcription_with_vad = transcribe_from_audio_path_split_on_speech(file_path)
print("transcription")
print(asr_transcription_with_vad)

Overlap detection

from tmh.overlap import overlap_detection

file_path = "./sv.wav"
overlap = overlap_detection(file_path)
print(overlap)

Language classification

from tmh.transcribe import classify_language
file_path = "./sv.wav"
print("classifying language")
language = classify_language(file_path)
print("the language is", language)

Classify emotion

from tmh.transcribe import classify_emotion
file_path = "./sv.wav"
print("classifying emotion")
emotion = classify_emotion(file_path)
print("the emotion is", emotion)

Speaker embeddings

The speaker embeddings are computed with the following SpeechBrain model: https://huggingface.co/speechbrain/spkrec-xvect-voxceleb

Extract speaker embedding

from tmh.transcribe import extract_speaker_embedding
file_path = "./sv.wav"
print("extracting speaker embedding")
embeddings = extract_speaker_embedding(file_path)
print("the speaker embedding is", embeddings)
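
Speaker embeddings are typically compared with cosine similarity: values close to 1 suggest the same speaker. The sketch below is not part of tmh. The helper cosine_similarity is hypothetical, and it assumes each embedding can be flattened to a sequence of floats; the return type of extract_speaker_embedding is not documented on this page.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Toy 3-dimensional vectors standing in for real embeddings.
same_direction = cosine_similarity([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
orthogonal = cosine_similarity([1.0, 0.0, 0.0], [0.0, 1.0, 0.0])
print(same_direction, orthogonal)
```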

Voice activity detection

from tmh.vad import extract_silences
file_path = "./sv.wav"
print("extracting silences")
silences = extract_silences(file_path)
print("the silences are", silences)
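
Detected silences are often turned into the complementary speech segments. The sketch below illustrates that step under a loud assumption: silences are represented as (start, end) pairs in seconds. The output format of extract_silences is not documented here, so this representation and the helper speech_segments are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[float, float]


def speech_segments(silences: List[Interval], duration: float) -> List[Interval]:
    """Return the gaps between silence intervals within [0, duration]."""
    segments = []
    cursor = 0.0  # end of the latest silence processed so far
    for start, end in sorted(silences):
        if start > cursor and cursor < duration:
            segments.append((cursor, min(start, duration)))
        cursor = max(cursor, end)
    if cursor < duration:
        segments.append((cursor, duration))
    return segments


# Hypothetical silences in a 10-second recording.
print(speech_segments([(0.0, 1.5), (4.0, 5.0)], duration=10.0))
```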

Speech Generation

Tacotron 2

Make sure you install these packages before running Tacotron 2:

pip install numpy scipy librosa unidecode inflect
apt-get update
apt-get install -y libsndfile1

Text generation

You can use the text generation API to generate text with any pretrained model from Hugging Face.

Example Swedish

from tmh.text import generate_text

output = generate_text(model='birgermoell/swedish-gpt', prompt="AI har möjligheten att", min_length=150)
print(output)

Example GPT-Neo

from tmh.text import generate_text

output = generate_text(model='EleutherAI/gpt-neo-2.7B', prompt="EleutherAI has", min_length=150)
print(output)

Codex

Generate code from a prompt and save it to a file.

from tmh.code import generate_from_prompt, write_to_file
response = generate_from_prompt('''
A pytorch neural network model for MNIST
'''
)
write_to_file(response, "generated.py")

Build instructions

Change the version number, then build and upload the package:

python3 -m build
twine upload --skip-existing dist/*


GitHub

https://github.com/BirgerMoell/tmh


Release history

Version	Release date
0.0.109	Oct 7, 2022
0.0.108	Oct 7, 2022
0.0.107	Oct 7, 2022
0.0.106	Oct 7, 2022
0.0.105	Oct 7, 2022
0.0.104	Oct 7, 2022
0.0.102	Oct 7, 2022
0.0.101	Oct 7, 2022
0.0.100	Oct 7, 2022
0.0.99	Oct 7, 2022
0.0.98	Oct 7, 2022
0.0.97	Jun 16, 2022
0.0.96	Jun 3, 2022
0.0.95	Jun 3, 2022
0.0.94	Jun 3, 2022
0.0.93	Jun 3, 2022
0.0.92	Jun 2, 2022
0.0.91	Jun 1, 2022
0.0.90	Mar 8, 2022
0.0.89	Mar 8, 2022
0.0.88	Mar 8, 2022
0.0.87	Mar 1, 2022
0.0.86	Mar 1, 2022
0.0.82	Mar 1, 2022
0.0.81	Mar 1, 2022
0.0.80	Mar 1, 2022
0.0.79	Mar 1, 2022
0.0.78	Mar 1, 2022
0.0.77	Feb 9, 2022
0.0.76	Nov 9, 2021
0.0.75	Sep 23, 2021
0.0.74	Sep 23, 2021
0.0.73	Sep 17, 2021
0.0.72	Sep 17, 2021
0.0.71	Sep 16, 2021
0.0.70	Sep 16, 2021
0.0.69	Sep 16, 2021
0.0.68	Sep 16, 2021
0.0.67	Sep 16, 2021
0.0.66	Sep 15, 2021
0.0.65	Sep 15, 2021
0.0.63	Sep 15, 2021
0.0.62	Sep 15, 2021
0.0.61	Sep 15, 2021
0.0.60	Sep 15, 2021
0.0.59	Sep 15, 2021
0.0.58	Sep 14, 2021
0.0.57	Sep 14, 2021
0.0.56	Sep 14, 2021
0.0.55	Sep 14, 2021
0.0.54	Sep 14, 2021
0.0.53	Sep 13, 2021
0.0.52	Sep 13, 2021
0.0.51	Sep 13, 2021
0.0.50	Sep 13, 2021
0.0.49	Sep 13, 2021
0.0.48	Sep 13, 2021
0.0.47	Sep 13, 2021
0.0.46 (this version)	Sep 13, 2021
0.0.45	Sep 13, 2021
0.0.44	Sep 13, 2021
0.0.43	Sep 10, 2021
0.0.42	Sep 10, 2021
0.0.41	Sep 10, 2021
0.0.39	Sep 10, 2021
0.0.38	Sep 10, 2021
0.0.36	Sep 9, 2021
0.0.35	Sep 9, 2021
0.0.33	Sep 9, 2021
0.0.32	Sep 9, 2021
0.0.31	Sep 9, 2021
0.0.30	Sep 9, 2021
0.0.29	Sep 9, 2021
0.0.28	Sep 9, 2021
0.0.27	Sep 9, 2021
0.0.26	Sep 9, 2021
0.0.25	Sep 9, 2021
0.0.24	Sep 9, 2021
0.0.23	Sep 9, 2021
0.0.22	Sep 9, 2021
0.0.21	Sep 9, 2021
0.0.20	Sep 9, 2021
0.0.19	Sep 9, 2021
0.0.18	Sep 9, 2021
0.0.17	Sep 8, 2021
0.0.16	Sep 8, 2021
0.0.15	Sep 8, 2021
0.0.14	Sep 8, 2021
0.0.12	Sep 8, 2021
0.0.11	Sep 8, 2021
0.0.10	Sep 8, 2021
0.0.9	Sep 8, 2021
0.0.8	Sep 8, 2021
0.0.7	Sep 8, 2021
0.0.6	Sep 7, 2021
0.0.5	Sep 7, 2021
0.0.4	Sep 7, 2021
0.0.3	Sep 7, 2021
0.0.2	Sep 7, 2021
0.0.1	Sep 7, 2021

Download files


Source Distribution

tmh-0.0.46.tar.gz (9.2 kB)

Uploaded Sep 13, 2021 Source

Built Distribution

tmh-0.0.46-py3-none-any.whl (11.6 kB)

Uploaded Sep 13, 2021 Python 3

Hashes for tmh-0.0.46.tar.gz

Algorithm	Hash digest
SHA256	`d3bd5fd78f84f92082602a1b60b3a9d3ce13896a3e9ad8e0657bc824acf926d5`
MD5	`dcf2883cfdca4eb4d0b3ec7e119b90e8`
BLAKE2b-256	`20b6dd3eaa5fe27f3aa5695eca091524fc727a6ef9fd8926e19f01e29488dd26`

Hashes for tmh-0.0.46-py3-none-any.whl

Algorithm	Hash digest
SHA256	`4a3b2ca8d7849064075c097c2df72cda3841d1e82914d4b6a20ac448f641366f`
MD5	`31b247356bb85386651426186c814ef4`
BLAKE2b-256	`524f52a48e77928cbe4d1ca3e994589bc632f7cf6ebe12c77fd21c1203b1294f`
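
The published digests can be used to check a downloaded file. The following is a minimal sketch using only Python's standard hashlib module. The helper names sha256_of_file and matches_published_hash are hypothetical, and the file path in the usage comment assumes the archive was saved to the current directory.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage, assuming the source distribution was downloaded to the current directory:
# matches_published_hash(
#     "tmh-0.0.46.tar.gz",
#     "d3bd5fd78f84f92082602a1b60b3a9d3ce13896a3e9ad8e0657bc824acf926d5",
# )
```

pip can perform a similar check itself when hashes are pinned in a requirements file and the --require-hashes option is used.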