Advertisement · 728 × 90

Posts by Tanel Alumäe

📢 This week at our Conversational AI Reading Group, We’re happy to welcome :
📖 Improving spoken language identification for non-native speech.
📅 Thursday, April 23 | 11:00 AM – 12:00 PM EST
🎙 Speaker: Tanel Alumäe ( @tanelalumae.bsky.social ) - Tallinn University of Technology

21 hours ago 0 1 0 0
Preview
TalTech Systems for the PROCESS Signal Processing Grand Challenge The PROCESS Challenge aims to detect cognitive decline, including early stages like mild cognitive impairment, through spontaneous speech. This paper describes TalTech’s systems prepared for the chall...

Last year, our lab ventured into a new domain: detecting cognitive decline from speech recordings. We competed in the
ICASSP PROCESS Challenge and secured 2nd place in the MMSE prediction task (out of ~30 teams)! 🏆 Our paper is now published: ieeexplore.ieee.org/abstract/doc...

1 year ago 0 0 0 0
Coat of arms of Ukraine.

Coat of arms of Ukraine.

I just donated to u24.gov.ua

1 year ago 25 8 0 2
Preview
University lecturer in Finno-Ugric linguistics University lecturer in Finno-Ugric linguistics

The University of Helsinki is looking for a lecturer in Finno-Ugric linguistics: jobs.helsinki.fi/job/Helsinki...

1 year ago 11 7 0 0

"And I'll see the day that anyone gives us #1 without being forced to do so ..."

There are many LLM projects that are open about training and evaluation data, such as AllenAI OLMo, several EU projects (EuroGPT, HPLT), and several Huggingface projects. I don't think anybody forced them to do so.

1 year ago 2 0 0 0
Preview
Joint speech and text machine translation for up to 100 languages - Nature SEAMLESSM4T is a single machine translation tool that supports speech-to-speech translation, speech-to-text translation, text-to-speech translation, text-to-text translation and automatic speech recog...

@nature.com asked me to write a short comment piece about the SeamlessM4T paper from @metaai.bsky.social
(nature.com/articles/s41...), here it is: nature.com/articles/d41.... I think SeamlessM4T is still the best publicly available multilingual ASR/speech-translation model.

1 year ago 1 0 0 0

Great challenge but very little time...

What is the maximum length of a test utterance (important considering limited GPU RAM on the test server)?

Is ASR CER case sensitive? Are spaces taken into account when computing CER?

1 year ago 0 0 0 0

What is the maximum length of a test utterance (important considering limited GPU RAM on the test server)?

Is ASR CER case sensitive? Are spaces taken into account when computing CER?

1 year ago 0 0 0 0

Very interesting challenge! Unfortunately there is very little time, considering that participants would have to prepare some kind of container that decodes the test data on the Dynabench server.
Some questions follow...

1 year ago 1 0 1 0