Prompt-Aware Mixture Improves Speech LLMs for Transcription and Captioning
The Prompt‑aware Mixture (PaM) lets a speech LLM pick audio encoders, achieving better accuracy than any single‑encoder model on ASR and audio captioning. Read more: getnews.me/prompt-aware-mixture-imp... #promptawaremixture #speechllm
0
0
0
0