Typed forced-alignment task-adapter interface — ForcedAlignmentAdapter ABC, ForcedAlignResult/ForcedAlignItem wire DTOs, and forced-alignment persistence helpers.
Project description
cjm-forced-alignment-adapter-interface
Install latest from the GitHub [repository][repo]:
pip install cjm_forced_alignment_adapter_interface
Project Structure
nbs/
├── adapter.ipynb # The typed forced-alignment task contract — `ForcedAlignmentAdapter` ABC +
├── core.ipynb # Standardized word-level forced-alignment DTOs — wire-registered so
└── storage.ipynb # Standardized SQLite storage for forced alignment results with content hashing
Total: 3 notebooks
Module Dependencies
graph LR
adapter["adapter<br/>Forced Alignment Adapter"]
core["core<br/>Core Data Structures"]
storage["storage<br/>Forced Alignment Storage"]
adapter --> core
1 cross-module dependencies detected
CLI Reference
No CLI commands found in this project.
Module Overview
Detailed documentation for each module in the project:
Forced Alignment Adapter (adapter.ipynb)
The typed forced-alignment task contract —
ForcedAlignmentAdapterABC +
Import
from cjm_forced_alignment_adapter_interface.adapter import (
ForcedAlignmentToolProtocol,
ForcedAlignmentAdapter
)
Classes
@runtime_checkable
class ForcedAlignmentToolProtocol(Protocol):
"""
PROVISIONAL structural contract for forced-alignment-capable tools.
Mirrors the fused-era surface (task-shaped `execute`); re-derived from
native tool surfaces when the Option C cascade splits tools (stage 8).
"""
def execute(self, audio: Union[str, Path], text: str, **kwargs) -> Any: ...
class ForcedAlignmentAdapter(TaskAdapter):
"""
Typed forced-alignment task adapter: model-ready audio + transcript
text in, `ForcedAlignResult` (word-level timings) out.
Input contract (carried over from the fused-era ForcedAlignmentPlugin):
the caller guarantees MODEL-READY audio; `text` is the transcript to
align against the audio.
Persistence sits BESIDE the task method: the storage module's
`ForcedAlignmentStorage` is the cache / persist seam — note the
FOUR-key cache identity (`audio_path`, `audio_hash`, `text_hash`,
`config_hash`): the task input is the (audio, transcript) PAIR, so the
text hash is part of the cache key (unlike transcription's three-key).
The result DTO is wire-registered ("forced_alignment.result"): items
arrive at the host as typed `ForcedAlignItem`s, not dicts.
"""
def align(
self,
audio: Union[str, Path], # Path to MODEL-READY audio (converted upstream)
text: str, # Transcript text to align against the audio
**kwargs, # Adapter-specific options (language, ...)
) -> ForcedAlignResult: # Word-level alignment output
"Align transcript text to model-ready audio at word level."
Core Data Structures (core.ipynb)
Standardized word-level forced-alignment DTOs — wire-registered so
Import
from cjm_forced_alignment_adapter_interface.core import (
ForcedAlignItem,
ForcedAlignResult
)
Classes
@dataclass
class ForcedAlignItem:
"A single word-level alignment result."
text: str # The aligned word (punctuation typically stripped by model)
start_time: float # Start time in seconds
end_time: float # End time in seconds
@dataclass
class ForcedAlignResult:
"Standardized output for all forced alignment plugins."
items: List[ForcedAlignItem] # Word-level alignments
metadata: Dict[str, Any] = field(...) # Plugin-specific metadata
def from_dict(
"Reconstruct from a wire payload, re-typing nested items.
Nested-DTO reconstruction is why this class defines its own
`from_dict` instead of relying on @wire_type's flat default
(which would leave `items` as plain dicts)."
Forced Alignment Storage (storage.ipynb)
Standardized SQLite storage for forced alignment results with content hashing
Import
from cjm_forced_alignment_adapter_interface.storage import (
ForcedAlignmentRow,
ForcedAlignmentStorage
)
Classes
@dataclass
class ForcedAlignmentRow:
"A single row from the forced_alignments table."
job_id: str # Unique job identifier
audio_path: str # Path to the source audio file
audio_hash: str # Hash of source audio in "algo:hexdigest" format
text: str # Input transcript text that was aligned
text_hash: str # Hash of input text in "algo:hexdigest" format
config_hash: str # Hash of the effective alignment config used
items: Optional[List[Dict[str, Any]]] # Serialized ForcedAlignItems
metadata: Optional[Dict[str, Any]] # Plugin metadata
created_at: Optional[float] # Unix timestamp
class ForcedAlignmentStorage:
def __init__(
self,
db_path: str # Absolute path to the SQLite database file
)
"Standardized SQLite storage for forced alignment results."
def __init__(
self,
db_path: str # Absolute path to the SQLite database file
)
"Initialize storage, create table, run migrations, and build indexes."
def save(
self,
job_id: str, # Unique job identifier
audio_path: str, # Path to the source audio file
audio_hash: str, # Hash of source audio in "algo:hexdigest" format
text: str, # Input transcript text
text_hash: str, # Hash of input text in "algo:hexdigest" format
config_hash: str, # Hash of the effective alignment config
items: Optional[List[Dict[str, Any]]] = None, # Serialized ForcedAlignItems
metadata: Optional[Dict[str, Any]] = None # Plugin metadata
) -> None
"Save or replace a forced alignment result (upsert by audio_path + text_hash + config_hash)."
def save_with_logging(
self,
*,
job_id: str, # Unique job identifier
audio_path: str, # Path to the source audio file
audio_hash: str, # Hash of source audio in "algo:hexdigest" format
text: str, # Input transcript text
text_hash: str, # Hash of input text in "algo:hexdigest" format
config_hash: str, # Hash of the effective alignment config
items: Optional[List[Dict[str, Any]]] = None, # Serialized ForcedAlignItems
metadata: Optional[Dict[str, Any]] = None, # Plugin metadata
logger: Optional[logging.Logger] = None # Optional logger for success/failure messages
) -> bool: # True if saved; False if the save failed (error logged, not raised)
"Save a result, logging success/failure. Failures are logged and swallowed (returns False).
Centralizes the try/save/log/except block every forced-alignment plugin reimplements.
Returns True on success so callers can gate post-save side effects on the result."
def get_cached(
self,
audio_path: str, # Path to the source audio file
audio_hash: str, # Content hash of the audio (cache miss if the file changed)
text_hash: str, # Hash of the input transcript text (part of the cache key)
config_hash: str # Hash of the effective alignment config
) -> Optional[ForcedAlignmentRow]: # Cached row or None
"Retrieve a content-correct cached alignment for an (audio, transcript) pair.
Matches on audio_path + audio_hash + text_hash + config_hash. A changed audio
file (new audio_hash) misses even if a stale row exists at the same
(audio_path, text_hash, config_hash) — the next save() replaces it."
def get_by_job_id(
self,
job_id: str # Job identifier to look up
) -> Optional[ForcedAlignmentRow]: # Row or None if not found
"Retrieve a forced alignment result by job ID."
def list_jobs(
self,
limit: int = 100 # Maximum number of rows to return
) -> List[ForcedAlignmentRow]: # List of forced alignment rows
"List forced alignment jobs ordered by creation time (newest first)."
def verify_audio(
self,
job_id: str # Job identifier to verify
) -> Optional[bool]: # True if audio matches, False if tampered, None if job not found
"Verify the source audio file still matches its stored hash."
def verify_text(
self,
job_id: str # Job identifier to verify
) -> Optional[bool]: # True if text matches, False if tampered, None if job not found
"Verify the input text still matches its stored hash."
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cjm_forced_alignment_adapter_interface-0.0.2.tar.gz.
File metadata
- Download URL: cjm_forced_alignment_adapter_interface-0.0.2.tar.gz
- Upload date:
- Size: 13.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cec69170ba6c12adacd2a73983e3a7825890e0346b408ca8b4bd91d1ff719d79
|
|
| MD5 |
e245ce60776de68788a945221b420bec
|
|
| BLAKE2b-256 |
0d4b387e13dcc971f5c8a81a72618387bc4ddaa9a55106139e23e4aef09811c9
|
File details
Details for the file cjm_forced_alignment_adapter_interface-0.0.2-py3-none-any.whl.
File metadata
- Download URL: cjm_forced_alignment_adapter_interface-0.0.2-py3-none-any.whl
- Upload date:
- Size: 15.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1c4506805f953067528b0e2dd10be51d7e07ebc7ad86d530c5880e14543c03b7
|
|
| MD5 |
662aadc909f89ed58aeb5b1d4f8ae27c
|
|
| BLAKE2b-256 |
cb85dbbf356020fee8bbabda647f930ce306e996c1162395a16453b4984a3655
|