New Mamba-3 halves the state size while matching Mamba-2 perplexity, adds a ~4% language-modeling gain, and cuts inference latency. Curious how the architecture pulls this off? Dive into the details! #Mamba3 #Mamba2 #InferenceLatency
🔗 aidailypost.com/news/mamba3-...
Mamba-2 Advances Audio Captioning via Design Space Exploration
Mamba-2, a state-space language model, handles audio captioning with fewer parameters than the larger transformer LLMs it is compared against, using LoRA fine-tuning and connector modules (a rough sketch of that recipe follows). Read more: getnews.me/mamba-2-advances-audio-c... #mamba2 #audiocaptioning #statespacemodel
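For readers unfamiliar with the recipe this post alludes to, here is a minimal, self-contained PyTorch sketch of the general idea: a frozen language-model backbone is adapted with LoRA-style low-rank updates while a small connector projects audio-encoder features into the LM's embedding space as prefix tokens. This is not the paper's code; the module names, dimensions, and prefix-token scheme are illustrative assumptions.

```python
# Illustrative sketch (not the paper's code): adapting a frozen state-space LM
# to audio captioning with (a) a small "connector" that maps audio-encoder
# features into the LM's embedding space and (b) LoRA-style low-rank updates
# on the LM's linear projections. All names and sizes here are hypothetical.
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    """Frozen base linear layer plus a trainable low-rank update: W x + (alpha/r) * B A x."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # base weights stay frozen
        self.lora_a = nn.Linear(base.in_features, rank, bias=False)
        self.lora_b = nn.Linear(rank, base.out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)   # low-rank update starts as a no-op
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * self.lora_b(self.lora_a(x))


class AudioConnector(nn.Module):
    """Maps pooled audio-encoder features to a few 'prefix' embeddings in LM space."""

    def __init__(self, audio_dim: int, lm_dim: int, num_prefix_tokens: int = 8):
        super().__init__()
        self.num_prefix_tokens = num_prefix_tokens
        self.proj = nn.Sequential(
            nn.Linear(audio_dim, lm_dim),
            nn.GELU(),
            nn.Linear(lm_dim, lm_dim * num_prefix_tokens),
        )

    def forward(self, audio_feats: torch.Tensor) -> torch.Tensor:
        # audio_feats: (batch, audio_dim) -> (batch, num_prefix_tokens, lm_dim)
        batch = audio_feats.shape[0]
        return self.proj(audio_feats).view(batch, self.num_prefix_tokens, -1)


if __name__ == "__main__":
    lm_dim, audio_dim = 256, 128
    connector = AudioConnector(audio_dim, lm_dim)
    in_proj = LoRALinear(nn.Linear(lm_dim, lm_dim))  # stand-in for one LM projection

    audio_feats = torch.randn(2, audio_dim)          # pooled audio-encoder output
    text_embeds = torch.randn(2, 16, lm_dim)         # embedded caption tokens
    prefix = connector(audio_feats)                  # audio features -> LM-space prefix
    lm_input = torch.cat([prefix, text_embeds], dim=1)
    print(in_proj(lm_input).shape)                   # torch.Size([2, 24, 256])
```

Only the connector and the LoRA matrices are trainable here, which is what keeps the parameter count far below full fine-tuning of a transformer LLM.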
Novel IBM Bamba Hybrid AI Model Targets Speed Limits of Transformer Architecture
#AI #GenAI #Transformers #IBM #BambaAI #LLMs #MachineLearning #DeepLearning #SSM #StateSpaceModel #Mamba2 #AIResearch #CMU #Princeton #UIUC #GraniteAI #AIEfficiency
winbuzzer.com/2025/04/29/i...
French #yapayzeka (AI) startup #MistralAI has released two specialized language models and one general language model: #Mathstral, with 7 billion parameters for mathematical reasoning; #CodestralMamba, built on the new #Mamba2 architecture; and #MistralNeMo, with 12 billion parameters.
yapayzeka.news/mistral-mate...