#MOSEL: 950K hours of #speech data for #opensource #AI model training in 24 #EU languages 🇪🇺🗣️
• 441K hours from #VoxPopuli & #LibriLight
• Transcribed by #Whisper
• Covers labeled & unlabeled corpora
• #CCBY40 license
#NLP #ML
huggingface.co/datasets/FBK...
1
0
0
0