Python client for the Gaston API (transcription, translation and sentence search).

These details have not been verified by PyPI

Project links

Project description

Gaston API Client

A small, typed Python client for the Gaston API: transcription, translation and full-text search of sentences within transcribed recordings.

Requires a Gaston account and an API token (see Configuration).

Installation

pip install gaston

Requires Python 3.10+.

For local development from a checkout instead:

pip install -e .

Quick start

from gaston import GastonClient

client = GastonClient(token="gapi-...")

# Who am I + remaining quota
me = client.me()
print(me.email, "files left:", me.usage.files_left)

# Transcribe a local file
result = client.transcribe("interview.mp4", lang="en", title="My interview")
print(result.id, result.state)

# Transcribe from a URL (YouTube or web)
client.transcribe_url("https://youtu.be/dQw4w9WgXcQ", lang="en")

# Translate an existing transcription
client.translate(result.id, target_lang="de")

# Speaker diarization (requires a completed translation in that language)
client.diarize(result.id, lang="de", speakers=2)

# Fetch a media item with its sentences
media = client.get_media(result.id, lang="en")
for sentence in media.sentences:
    print(sentence.id, sentence.text, sentence.speaker)

# Full text search across the whole library
results = client.search("climate change", max_=20)
print("total matches:", results.total)
for hit in results:
    print(hit["_sentence"]["body"], "->", hit["_highlight"]["body"])

See Search for query syntax and filtering options.

Configuration

Generate an API token in the Gaston app under Settings -> API. Full endpoint documentation is available at https://www.gaston.live/en/api.

The token can be supplied directly or via an environment variable:

Argument	Environment variable	Default
`token`	`GASTON_API_TOKEN`	(required)

# Uses GASTON_API_TOKEN from the environment
with GastonClient() as client:
    ...

Timeouts

Ordinary requests use a 30s timeout. The file upload in transcribe can take minutes for large files, so it uses a separate, more generous upload_timeout (default (10s connect, 600s read)).

A timeout may be a single float, a (connect, read) tuple, or None to wait indefinitely.

# Customise the defaults for all calls
client = GastonClient(
    token="gapi-...",
    timeout=30,
    upload_timeout=(10, 1800),   # allow up to 30 min to upload large files
)

# Or override per call (e.g. no read timeout for a very large file)
client.transcribe("huge-recording.mp4", timeout=(10, None))

Directories

folder = client.create_directory("Podcasts")
client.update_directory(folder.id, title="Podcast archive")
client.move_media(media_id="me...", dir_id=folder.id)
tree = client.directory_tree()
client.delete_directory(folder.id)

Search

client.search(query, from_=0, max_=50, dir_ids=None, lang=None) runs a full-text search over every sentence in your transcribed media.

Query syntax

The query supports a subset of the Lucene query_string syntax:

Feature	Example	Notes
Boolean `AND`	`cats AND dogs`	both terms must appear
Boolean `OR`	`cats OR dogs`	either term
Boolean `NOT`	`cats NOT dogs`	exclude a term
Grouping	`(cats OR dogs) AND vet`	combine operators with parentheses
Exact phrase	`"climate change"`	quoted terms match as a phrase
Trailing wildcard	`transcri*`	matches `transcribe`, `transcription`...

Leading wildcards (*tion), field selectors, fuzzy (~), boosts (^) and ranges are not supported and are stripped server-side. Queries must be at least 3 characters.

results = client.search('(invoice OR receipt) AND "due date" NOT draft')

Filtering and pagination

# Search within a single directory
client.search("budget", dir_ids=[42])

# Search across several directories
client.search("budget", dir_ids=[42, 43, 7])

# Restrict to one language, and page through results
page2 = client.search("budget", from_=50, max_=50, lang="en")

Reading results

search() returns a SearchResults object. Iterate it for hits, or read .total for the overall match count. Each hit is a dict with:

_sentence - the matched sentence plus its media metadata (id, title, duration, directory, thumbnail, file, originUrl).
_highlight - matched fragments with the hit terms wrapped in <hlt>...</hlt> tags.

results = client.search("climate change", max_=20)
print("total matches:", results.total)
for hit in results:
    sentence = hit["_sentence"]
    print(sentence["media"]["title"], "|", hit["_highlight"]["body"])

Error handling

All failures raise a subclass of GastonError:

from gaston import GastonClient, AuthenticationError, RateLimitError, NotFoundError

try:
    client.transcribe("clip.mp4")
except RateLimitError:
    print("File limit reached")
except AuthenticationError:
    print("Bad token / disabled account")
except NotFoundError as e:
    print("Not found:", e.message)

Exception	Trigger
`AuthenticationError`	HTTP 403, invalid token / disabled user
`BadRequestError`	HTTP 400, invalid parameters
`NotFoundError`	HTTP 404, resource not found
`RateLimitError`	HTTP 429, usage limit exceeded
`GastonAPIError`	any other API error

Every exception carries .status_code, .message, .details and the raw .payload.

Supported languages

from gaston import SUPPORTED_LANGUAGES, TRANSLATION_LANGUAGES

SUPPORTED_LANGUAGES lists transcription source languages; TRANSLATION_LANGUAGES lists the available translation targets.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.3.0

Jun 27, 2026

0.2.0

Jun 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gaston-0.3.0.tar.gz (16.0 kB view details)

Uploaded Jun 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gaston-0.3.0-py3-none-any.whl (13.4 kB view details)

Uploaded Jun 27, 2026 Python 3

File details

Details for the file gaston-0.3.0.tar.gz.

File metadata

Download URL: gaston-0.3.0.tar.gz
Upload date: Jun 27, 2026
Size: 16.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.14

File hashes

Hashes for gaston-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`d4bdc5e033176913332702785ef7f101c263d378d7013ef1cfa20cc7a8e7b24c`
MD5	`112459f0ce6c5a7ba45a241df7f39873`
BLAKE2b-256	`68edc874b89f6f608e84c7d264234d78b21bbbb98514abb5496b0de8ee4bb965`

See more details on using hashes here.

File details

Details for the file gaston-0.3.0-py3-none-any.whl.

File metadata

Download URL: gaston-0.3.0-py3-none-any.whl
Upload date: Jun 27, 2026
Size: 13.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.14

File hashes

Hashes for gaston-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3af1fb77e3e5eb357fbcc775a79eff2c38307e564d6e1511e6b68e84ec0f407f`
MD5	`dfbd54ae96761838a0174efedeaa39a9`
BLAKE2b-256	`0be91f87bacbf5f3d8ca054b943c6e2b822789c7a5f12037f5204b840aa58408`

See more details on using hashes here.

gaston 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Gaston API Client

Installation

Quick start

Configuration

Timeouts

Directories

Search

Query syntax

Filtering and pagination

Reading results

Error handling

Supported languages

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes