Python client library for IDTAP - Interactive Digital Transcription and Analysis Platform for Hindustani music
Project description
IDTAP Python API
Python client library for IDTAP (Interactive Digital Transcription and Analysis Platform) - a web-based research platform developed at UC Santa Cruz for transcribing, analyzing, and archiving Hindustani (North Indian classical) music recordings using trajectory-based notation designed specifically for oral melodic traditions.
About IDTAP
IDTAP represents a paradigm shift in musical transcription and analysis. Rather than forcing oral traditions into Western notational frameworks, it uses trajectories as the fundamental musical unit—archetypal paths between pitches that capture the continuous melodic movement central to Hindustani music.
Key Innovation: Instead of discrete notes, IDTAP models music through:
- Trajectory-based notation - Continuous pitch contours rather than fixed notes
- Microtonal precision - Cent-based tuning with flexible raga systems
- Idiomatic articulations - Performance techniques specific to each instrument
- Hierarchical segmentation - Phrases, sections, and formal structures
Features
- Trajectory-Based Data Access - Load and analyze transcriptions using the trajectory notation system
- Hindustani Music Analysis - Work with raga-aware transcriptions and microtonal pitch data
- Audio Download - Retrieve associated audio recordings in multiple formats
- Secure Authentication - OAuth integration with encrypted token storage
Installation
pip install idtap
Optional Dependencies
For enhanced Linux keyring support:
pip install idtap[linux]
For development:
pip install idtap[dev]
Quick Start
Authentication & Basic Usage
from idtap import SwaraClient, Piece, Instrument
# Initialize client - connects to swara.studio platform
client = SwaraClient() # Automatic OAuth via Google
# Browse available transcriptions
transcriptions = client.get_viewable_transcriptions()
print(f"Found {len(transcriptions)} transcriptions")
# Load a Hindustani music transcription
piece_data = client.get_piece("transcription-id")
piece = Piece.from_json(piece_data)
print(f"Transcription: {piece.title}")
print(f"Raga: {piece.raga.name if piece.raga else 'Unknown'}")
print(f"Instrument: {piece.instrumentation}")
print(f"Trajectories: {sum(len(p.trajectories) for p in piece.phrases)}")
Working with Trajectory-Based Transcriptions
# Analyze trajectory-based musical structure
for phrase in piece.phrases:
print(f"Phrase {phrase.phrase_number}: {len(phrase.trajectories)} trajectories")
# Examine individual trajectories (fundamental units of IDTAP)
for traj in phrase.trajectories:
if traj.pitch_array:
# Each trajectory contains continuous pitch movement
start_pitch = traj.pitch_array[0].pitch_number
end_pitch = traj.pitch_array[-1].pitch_number
print(f" Trajectory {traj.traj_number}: {start_pitch:.2f} → {end_pitch:.2f}")
# Check for articulations (performance techniques)
if traj.articulation:
techniques = [art.stroke for art in traj.articulation if art.stroke]
print(f" Articulations: {', '.join(techniques)}")
# Raga analysis (theoretical framework)
if piece.raga:
print(f"Raga: {piece.raga.name}")
if hasattr(piece.raga, 'aroha') and piece.raga.aroha:
print(f"Aroha (ascending): {piece.raga.aroha}")
if hasattr(piece.raga, 'avaroha') and piece.raga.avaroha:
print(f"Avaroha (descending): {piece.raga.avaroha}")
Audio Handling
# Download audio in different formats
audio_bytes = client.download_audio("audio-id", format="wav")
with open("recording.wav", "wb") as f:
f.write(audio_bytes)
# Download all audio associated with a transcription
client.download_and_save_transcription_audio(piece, directory="./audio/")
Data Export
# Export transcription data
excel_data = client.excel_data(piece_id)
with open("analysis.xlsx", "wb") as f:
f.write(excel_data)
json_data = client.json_data(piece_id)
with open("transcription.json", "wb") as f:
f.write(json_data)
Working with Hindustani Music Data
from idtap import Piece, Phrase, Trajectory, Pitch, Raga, Instrument
# Example: Analyze a sitar transcription
sitar_pieces = [t for t in transcriptions if t.get('instrumentation') == 'Sitar']
for trans_meta in sitar_pieces[:3]: # First 3 sitar pieces
piece = Piece.from_json(client.get_piece(trans_meta['_id']))
# Count different types of trajectories (IDTAP's innovation)
trajectory_types = {}
for phrase in piece.phrases:
for traj in phrase.trajectories:
traj_type = getattr(traj, 'curve_type', 'straight')
trajectory_types[traj_type] = trajectory_types.get(traj_type, 0) + 1
print(f"{piece.title}:")
print(f" Raga: {piece.raga.name if piece.raga else 'Unknown'}")
print(f" Trajectory types: {trajectory_types}")
# Analyze articulation patterns (performance techniques)
articulations = []
for phrase in piece.phrases:
for traj in phrase.trajectories:
if traj.articulation:
articulations.extend([art.stroke for art in traj.articulation])
unique_arts = list(set(articulations))
print(f" Articulations used: {', '.join(unique_arts[:5])}") # First 5
Key Classes
SwaraClient
The main HTTP client for interacting with the IDTAP server.
Key Methods:
get_viewable_transcriptions()- List accessible transcriptionsget_piece(id)- Load transcription datasave_piece(data)- Save transcriptionexcel_data(id)/json_data(id)- Export datadownload_audio(id, format)- Download audio filesget_waiver_text()- Display the research waiver text that must be readagree_to_waiver(i_agree=True)- Accept research waiver (required for first-time users)has_agreed_to_waiver()- Check if waiver has been accepted
Musical Data Models
Piece- Central transcription container with metadata, audio association, and musical contentPhrase- Musical phrase containing trajectory data and categorizationsTrajectory- Detailed pitch movement data with timing and articulationsPitch- Individual pitch points with frequency and timing informationRaga- Indian musical scale/mode definitions with theoretical rulesSection- Large structural divisions (alap, composition, etc.)Meter- Rhythmic cycle and tempo informationArticulation- Performance technique annotations (meend, andolan, etc.)
Specialized Features
- Microtonal Pitch System - Precise cent-based pitch representation
- Hindustani Music Theory - Raga rules, sargam notation, gharana traditions
- Performance Analysis - Ornament detection, phrase categorization
- Multi-Track Support - Simultaneous transcription of melody and drone
Authentication
The client uses OAuth 2.0 flow with Google authentication. On first use, it will:
- Open a browser for Google OAuth login
- Securely store the authentication token using:
- OS keyring (preferred)
- Encrypted local file (fallback)
- Plain text (legacy, discouraged)
Research Waiver Requirement
First-time users must agree to a research waiver before accessing transcription data. If you haven't agreed yet, you'll see an error when trying to access transcriptions:
client = SwaraClient()
transcriptions = client.get_viewable_transcriptions() # Will raise RuntimeError
# First, read the waiver text
waiver_text = client.get_waiver_text()
print("Research Waiver:")
print(waiver_text)
# After reading, agree to the waiver
client.agree_to_waiver(i_agree=True)
transcriptions = client.get_viewable_transcriptions() # Now works
# Check waiver status
if client.has_agreed_to_waiver():
print("Waiver agreed - full access available")
Manual Token Management
# Initialize without auto-login
client = SwaraClient(auto_login=False)
# Login manually when needed
from idtap import login_google
login_google()
Advanced Usage
Batch Processing
# Process multiple transcriptions
transcriptions = client.get_viewable_transcriptions()
for trans in transcriptions:
if trans.get('instrumentation') == 'Sitar':
piece = Piece.from_json(client.get_piece(trans['_id']))
# Analyze sitar-specific features
total_meends = sum(
len([art for art in traj.articulation if art.stroke == 'meend'])
for phrase in piece.phrases
for traj in phrase.trajectories
)
print(f"{piece.title}: {total_meends} meends")
Research Applications
# Raga analysis across corpus
raga_stats = {}
for trans in transcriptions:
piece = Piece.from_json(client.get_piece(trans['_id']))
if piece.raga:
raga_name = piece.raga.name
raga_stats[raga_name] = raga_stats.get(raga_name, 0) + 1
print("Raga distribution:", raga_stats)
Development
Running Tests
# Unit tests
pytest idtap/tests/
# Integration tests (requires authentication)
python api_testing/api_test.py
Contributing
- Fork the repository
- Create a feature branch
- Add tests for new functionality
- Ensure all tests pass
- Submit a pull request
Documentation
- API Reference: Full documentation of all classes and methods
- Musical Concepts: Guide to Hindustani music terminology and theory
- Research Examples: Academic use cases and analysis workflows
Platform Access
- IDTAP Web Platform: swara.studio
- Source Code: github.com/jon-myers/idtap
- Research Paper: "Beyond Notation: A Digital Platform for Transcribing and Analyzing Oral Melodic Traditions" (ISMIR 2025)
Documentation
📖 Complete documentation is available at idtap-python-api.readthedocs.io
- Installation Guide - Detailed setup instructions
- Authentication - OAuth setup and token management
- Quickstart Tutorial - Get started in minutes
- API Reference - Complete class and method documentation
- Examples - Real-world usage examples
Support
- GitHub Issues: Report bugs and request features
- Research Contact: Jonathan Myers & Dard Neuman, UC Santa Cruz
- Platform: swara.studio
License
MIT License - see LICENSE file for details.
Citation
If you use IDTAP in academic research, please cite the ISMIR 2025 paper:
@inproceedings{myers2025beyond,
title={Beyond Notation: A Digital Platform for Transcribing and Analyzing Oral Melodic Traditions},
author={Myers, Jonathan and Neuman, Dard},
booktitle={Proceedings of the 26th International Society for Music Information Retrieval Conference},
pages={},
year={2025},
address={Daejeon, South Korea},
url={https://swara.studio}
}
IDTAP was developed at UC Santa Cruz with support from the National Endowment for the Humanities. The platform challenges Western-centric approaches to music representation by creating tools designed specifically for oral melodic traditions, enabling scholars to study Hindustani music on its own terms while applying cutting-edge computational methodologies.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file idtap-0.1.11.tar.gz.
File metadata
- Download URL: idtap-0.1.11.tar.gz
- Upload date:
- Size: 124.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
252991e459361dd5bbf50d6e3b3f5136c718b36b9d6eab68225527939b5fa2c8
|
|
| MD5 |
84f8bb0f79c4d3b2ed10ffa2177d9f7f
|
|
| BLAKE2b-256 |
4c7d7c0a859139cc60553791b57b0467fef70cbd07d3143d20f297f7d90e43ee
|
File details
Details for the file idtap-0.1.11-py3-none-any.whl.
File metadata
- Download URL: idtap-0.1.11-py3-none-any.whl
- Upload date:
- Size: 130.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3f2f02cc16d257f6f5755f309f55009ef26850f78a3df7be8791811e6d9d7cd3
|
|
| MD5 |
8c753ad99d5a476957d558e325a14a0e
|
|
| BLAKE2b-256 |
23d337e44382f2f16043033e1402fea1b59135b4d28eef4996d1a08ee8e5ff15
|