#SonyCSLMusic
Assessing the Alignment of Audio Representations with Timbre Similarity Ratings
Psychoacoustical so-called "timbre spaces" map perceptual similarity ratings of instrument sounds onto low-dimensional embeddings via multidimensional scaling, but suffer from scalability issues and a...

🎶 New paper alert!
Do AI audio embeddings *hear* timbre like we do?
โžก๏ธ Benchmarked 18 reps vs 2.6 K human ratings (21 datasets)
๐Ÿ… Style embeddings from CLAP & our sound-matching model are best aligned!
Paper: arxiv.org/abs/2507.07764
#ISMIR2025 #MIR #AudioAI #SonyCSLMusic


We also show that our IC estimates can help predict EEG measurements. 💆‍♀️

Surprisal can be used for segment boundary detection and to simulate the information processing of a listener. 🎶 🧠

📜 Link to the paper: arxiv.org/pdf/2501.07474

Model weights are coming soon! 🏋️

💫✨ #SonyCSLMusic 💫✨
