Skip to main content

Interactive bioacoustic annotation tool for measuring vocalizations

Project description

YAAAT: Yet Another Audio Annotation Tool

Interactive bioacoustic annotation tool for measuring vocalizations.

Features:

  1. Changepoint Annotator, for marking temporal onset, offset, and changepoints in vocalizations.
  2. Peak Annotator, for marking dominant frequency peaks on the power spectrum.
Changepoint Annotator Peak Annotator
Changepoint Annotator Peak Annotator

Installation

Via PyPI (Recommended)

pip install yaaat

From Source

git clone https://github.com/laelume/yaaat.git
cd yaaat
pip install -e .

Usage

Launch the Application

yaaat

Opens a tabbed interface with both annotators. Includes test audio files to get started immediately.

Use in Python Scripts

from yaaat import ChangepointAnnotator, PeakAnnotator
import tkinter as tk

# Launch changepoint annotator
root = tk.Tk()
app = ChangepointAnnotator(root)
root.mainloop()

# Or launch peak annotator
root = tk.Tk()
app = PeakAnnotator(root)
root.mainloop()

Getting Started

  1. Click Load Audio Directory to select files or Load Test Audio to explore test audio
  2. Choose where to save annotations (existing, new, or default directory)
  3. Click on the spectrogram to add annotation points
  4. Click Finish Syllable when done with annotation
  5. Move between files using Next/Previous buttons
  6. Annotations auto-save on file navigation or Finish syllable

Navigation & Features

  • Intuitive real-time interactive visualization with zoom, pan, and keycommand + mousewheel navigation

  • Visualize harmonics with adjustable multipliers and draggable bounding boxes

  • JSON annotations saved per-file to minimize corruption

  • Mark and track unusable files

  • Adjust spectrogram resolution for accuracy comparison

  • TODO: implement ranking system for annotation quality; inject as learning feedback mechanism

Requirements

  • Python ≥3.8 (built using 3.11)
  • numpy
  • matplotlib
  • librosa
  • scipy
  • natsort
  • sounddevice
  • soundfile

License

MIT License - Copyright (c) 2025 laelume

Contributing

Contributions welcome! Please open an issue or submit a pull request. I'm especially interested in talking to people about using this in their existing AI workflows, so please feel free to reach out !!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yaaat-0.1.4.tar.gz (1.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

yaaat-0.1.4-py3-none-any.whl (1.5 MB view details)

Uploaded Python 3

File details

Details for the file yaaat-0.1.4.tar.gz.

File metadata

  • Download URL: yaaat-0.1.4.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for yaaat-0.1.4.tar.gz
Algorithm Hash digest
SHA256 0fccbe080942e220d0cfdcd3146829042775b1fc5a399aa1d4c9b9308fe22824
MD5 678258849a99f49a404ab9252b6f81b8
BLAKE2b-256 66e73fefb117ae22059c81c2e4305378cb81dd3fd49994ef89e3a332a658297a

See more details on using hashes here.

File details

Details for the file yaaat-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: yaaat-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 1.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for yaaat-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 bb0ea882fb81b4085a17ad72317c911fca31efbe27a2f10cb812778d1fe4efe3
MD5 5948c331c0bc1e6a23b22318f088e834
BLAKE2b-256 b8225f59171af374872518bb191f84c1379d6d514ad82dcaf99df2f79d4c9b5a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page