Last released Jan 29, 2024
ACES metric for evaluating automated audio captioning models based on the semantics of sounds
Supported by