Skip to main content

Python package for AssemblyAI

Project description

assemblyai-python-sdk

Transcribe audio into text. Create custom language models for higher accuracy.

Quickstart

import assemblyai

aai = assemblyai.Client(token='your-secret-token')

transcript = aai.transcribe('https://example.com/sample.wav')

while transcript.status is not 'completed':
    transcript = transcript.get()

text = transcript.text

Transcripts take about half the duration of the audio to complete.

Custom language models

The quickstart example transcribes audio using a generic English language model.

In order to retain accuracy with unique word sets, create a custom language model.

For this example we'll train a new model using Pokemon characters from Wikipedia.

import assemblyai
import wikipedia

aai = assemblyai.Client(token='your-secret-token')

phrases = wikipedia.page("Pokemon characters").content.split('. ')

model = aai.train(phrases)

transcript = aai.transcribe('https://example.com/pokemon.wav', model=model)

while transcript.status is not 'completed':
    transcript = transcript.get()

text = transcript.text

Initially, models take six minutes to train, but afterwords they can be invoked by ID.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
assemblyai-0.1.5-py2.py3-none-any.whl (4.9 kB) Copy SHA256 hash SHA256 Wheel py2.py3
assemblyai-0.1.5.tar.gz (11.4 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page