ivrit.ai helper package

These details have not been verified by PyPI

Project description

ivrit

Python package providing wrappers around ivrit.ai's capabilities.

Installation

pip install ivrit

Usage

Audio Transcription

The ivrit package provides audio transcription functionality using multiple engines.

Basic Usage

import ivrit

# Transcribe a local audio file
model = ivrit.load_model(engine="faster-whisper", model="ivrit-ai/whisper-large-v3-turbo-ct2")
result = model.transcribe(path="audio.mp3")

# With custom device
model = ivrit.load_model(engine="faster-whisper", model="ivrit-ai/whisper-large-v3-turbo-ct2", device="cpu")
result = model.transcribe(path="audio.mp3")

print(result["text"])

Transcribe from URL

# Transcribe audio from a URL
model = ivrit.load_model(engine="faster-whisper", model="ivrit-ai/whisper-large-v3-turbo-ct2")
result = model.transcribe(url="https://example.com/audio.mp3")

print(result["text"])

Streaming Results

# Get results as a stream (generator)
model = ivrit.load_model(engine="faster-whisper", model="base")
for segment in model.transcribe(path="audio.mp3", stream=True, verbose=True):
    print(f"{segment.start:.2f}s - {segment.end:.2f}s: {segment.text}")

# Or use the model directly
model = ivrit.FasterWhisperModel(model="base")
for segment in model.transcribe(path="audio.mp3", stream=True):
    print(f"{segment.start:.2f}s - {segment.end:.2f}s: {segment.text}")

# Access word-level timing
for segment in model.transcribe(path="audio.mp3", stream=True):
    print(f"Segment: {segment.text}")
    for word in segment.extra_data.get('words', []):
        print(f"  {word['start']:.2f}s - {word['end']:.2f}s: '{word['word']}'")

Async Transcription (RunPod Only)

For RunPod models, you can use async transcription for better performance:

import asyncio
from ivrit.audio import load_model

async def transcribe_async():
    # Load RunPod model
    model = load_model(
        engine="runpod",
        model="large-v3-turbo",
        api_key="your-api-key",
        endpoint_id="your-endpoint-id"
    )
    
    # Stream results asynchronously
    async for segment in model.transcribe_async(path="audio.mp3", language="he"):
        print(f"{segment.start:.2f}s - {segment.end:.2f}s: {segment.text}")

# Run the async function
asyncio.run(transcribe_async())

Note: Async transcription is only available for RunPod models. The sync transcribe() method uses the original sync implementation.

API Reference

`load_model()`

Load a transcription model for the specified engine and model.

Parameters

engine (str): Transcription engine to use. Options: "faster-whisper", "stable-ts"
model (str): Model name for the selected engine
device (str, optional): Device to use for inference. Default: "auto". Options: "auto", "cpu", "cuda", "cuda:0", etc.
model_path (str, optional): Custom path to the model (for faster-whisper)

Returns

TranscriptionModel object that can be used for transcription

Raises

ValueError: If the engine is not supported
ImportError: If required dependencies are not installed

`transcribe()` and `transcribe_async()`

Transcribe audio using the loaded model.

Parameters

path (str, optional): Path to the audio file to transcribe
url (str, optional): URL to download and transcribe
blob (str, optional): Base64 encoded blob data to transcribe
language (str, optional): Language code for transcription (e.g., 'he' for Hebrew, 'en' for English)
stream (bool, optional): Whether to return results as a generator (True) or full result (False) - only for transcribe()
diarize (bool, optional): Whether to enable speaker diarization
verbose (bool, optional): Whether to enable verbose output
**kwargs: Additional keyword arguments for the transcription model

Returns

transcribe(): If stream=True: Generator yielding transcription segments, If stream=False: Complete transcription result as dictionary
transcribe_async(): AsyncGenerator yielding transcription segments

Raises

ValueError: If multiple input sources are provided, or none is provided
FileNotFoundError: If the specified path doesn't exist
Exception: For other transcription errors

Note: transcribe_async() is only available for RunPod models and always returns an AsyncGenerator.

Architecture

The ivrit package uses an object-oriented design with a base TranscriptionModel class and specific implementations for each transcription engine.

Model Classes

TranscriptionModel: Abstract base class for all transcription models
FasterWhisperModel: Implementation for the Faster Whisper engine

Usage Patterns

Pattern 1: Using `load_model()` (Recommended)

# Step 1: Load the model
model = ivrit.load_model(engine="faster-whisper", model="base")

# Step 2: Transcribe audio
result = model.transcribe(path="audio.mp3")

Pattern 2: Direct Model Creation

# Create model directly
model = ivrit.FasterWhisperModel(model="base")

# Use the model
result = model.transcribe(path="audio.mp3")

Multiple Transcriptions

For multiple transcriptions, load the model once and reuse it:

# Load model once
model = ivrit.load_model(engine="faster-whisper", model="base")

# Use for multiple transcriptions
result1 = model.transcribe(path="audio1.mp3")
result2 = model.transcribe(path="audio2.mp3")
result3 = model.transcribe(path="audio3.mp3")

Installation

Basic Installation

pip install ivrit

With Faster Whisper Support

pip install ivrit[faster-whisper]

Supported Engines

faster-whisper

Fast and accurate speech recognition using the Faster Whisper model.

Model Class: FasterWhisperModel

Available Models: base, large, small, medium, large-v2, large-v3

Features:

Word-level timing information
Language detection with confidence scores
Support for custom devices (CPU, CUDA, etc.)
Support for custom model paths
Streaming transcription

Dependencies: faster-whisper>=1.1.1

stable-ts

Stable and reliable transcription using Stable-TS models.

Status: Not yet implemented

Development

Installation for Development

git clone <repository-url>
cd ivrit
pip install -e ".[dev]"

Running Tests

pytest

Code Formatting

black .
isort .

Bounty rules

Like our bounties, and want to help? Here's how this works:

You pick a bounty you're interested in, and let us know. We discuss it together to make sure you understand the issue.
You let us know you're on it; we lock it for you for 2 weeks so you can develop, review and merge your code.
The ONLY metric for whether you met the bounty goal is whether we decide to merge your PR. Our key focus with reviews is to ensure high code and product quality.
Once your PR is merged, you receive the bounty award.

You can use any tool you'd like to write your code, including AI. Note that during review you will be asked questions about the code; if you are unable to explain what it does, or how (sometimes the case when doing Vibe coding), your PR will be discarded and you will not be able to reapply for this issue.

Reviews may be done live.

License

MIT License - see LICENSE file for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.6

May 30, 2026

0.2.5

Apr 15, 2026

0.2.4

Apr 14, 2026

0.2.3

Apr 13, 2026

0.2.2

Apr 11, 2026

0.2.1

Feb 26, 2026

0.2.0

Jan 11, 2026

0.1.9

Nov 25, 2025

0.1.8

Oct 28, 2025

0.1.7

Oct 27, 2025

0.1.6

Sep 21, 2025

0.1.5

Sep 19, 2025

0.1.4

Sep 18, 2025

0.1.3

Sep 16, 2025

0.1.2

Sep 15, 2025

0.1.1

Sep 15, 2025

0.1.0

Sep 14, 2025

0.0.12

Aug 26, 2025

0.0.11

Aug 21, 2025

0.0.10

Aug 16, 2025

0.0.9

Aug 15, 2025

0.0.8

Aug 15, 2025

0.0.7

Aug 15, 2025

0.0.6

Aug 4, 2025

0.0.5

Aug 3, 2025

0.0.4

Jul 20, 2025

0.0.3

Jul 12, 2025

0.0.2

Jul 12, 2025

0.0.1

Jul 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ivrit-0.2.6.tar.gz (1.2 MB view details)

Uploaded May 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ivrit-0.2.6-py3-none-any.whl (29.8 kB view details)

Uploaded May 30, 2026 Python 3

File details

Details for the file ivrit-0.2.6.tar.gz.

File metadata

Download URL: ivrit-0.2.6.tar.gz
Upload date: May 30, 2026
Size: 1.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.2

File hashes

Hashes for ivrit-0.2.6.tar.gz
Algorithm	Hash digest
SHA256	`372b2923a2b6b4b8930c45565a97d90b482d9c54dfec59a544995fb3f415f7ec`
MD5	`6e002a7f895ac4d3079fa170306c2e69`
BLAKE2b-256	`e29bbe868127f441926a8eaf128074643b2e16f6d45f0087408ea6772c97d1f7`

See more details on using hashes here.

File details

Details for the file ivrit-0.2.6-py3-none-any.whl.

File metadata

Download URL: ivrit-0.2.6-py3-none-any.whl
Upload date: May 30, 2026
Size: 29.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.2

File hashes

Hashes for ivrit-0.2.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c74af06d185666d7307e5491fef03487f0dcef203b32b46792921e8d00674dc3`
MD5	`009227135d8b7347210f09e18d073321`
BLAKE2b-256	`4b1fecbc0954fd614a847363945ea701d2b1be13652db182a14f7f3b26b448d9`

See more details on using hashes here.

ivrit 0.2.6

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

ivrit

Installation

Usage

Audio Transcription

Basic Usage

Transcribe from URL

Streaming Results

Async Transcription (RunPod Only)

API Reference

load_model()

Parameters

Returns

Raises

transcribe() and transcribe_async()

Parameters

Returns

Raises

Architecture

Model Classes

Usage Patterns

Pattern 1: Using load_model() (Recommended)

Pattern 2: Direct Model Creation

Multiple Transcriptions

Installation

Basic Installation

With Faster Whisper Support

Supported Engines

faster-whisper

stable-ts

Development

Installation for Development

Running Tests

Code Formatting

Bounty rules

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`load_model()`

`transcribe()` and `transcribe_async()`

Pattern 1: Using `load_model()` (Recommended)