Speed up your LLMs! IndexCache’s sparse attention speeds up long‑context inference by 1.82×, blending dense and sparse attention inside transformer blocks. Curious how it works? Dive in for the details. #IndexCache #SparseAttention #LongContextAI
🔗 aidailypost.com/news/indexca...
MSA brings 100M token context to LLMs, but there's a catch. We break down how sparse attention trades deep reasoning for massive scale, and what it means for AI's future.
thepixelspulse.com/posts/msa-memory-sparse-...
#msa #sparseattention #llm
DeepSeek V3.2 AI Model Matches OpenAI's GPT-5 with Lower Training Costs
techlife.blog/posts/deepse...
#DeepSeek #AImodel #GPT5 #SparseAttention
DeepSeek V3.2 just dropped a major upgrade—sparse attention for long‑context, solid tool‑use reasoning, and built‑in formatting cues. Open‑source power is finally ready for production. Dive into the details! #DeepSeekV32 #OpenSourceLLM #SparseAttention
🔗 aidailypost.com/news/deepsee...
Input-Aware Sparse Attention Enables Real-Time Co-Speech Video Generation
A new input‑aware sparse attention method cuts compute by up to 40% and enables real‑time co‑speech video generation with better lip‑sync. Submitted 2 Oct 2025. getnews.me/input-aware-sparse-atten... #sparseattention #realtime
DeepSeek unveils sparse attention model to halve API costs
DeepSeek released the V3.2‑exp model on Monday, using sparse attention to cut API costs for long‑context tasks by half. The model weights are open‑source on Hugging Face. Read more: getnews.me/deepseek-unveils-sparse-... #deepseek #sparseattention
ProxyAttn Introduces Guided Sparse Attention Using Representative Heads
ProxyAttn, a training‑free method that approximates attention using representative heads, claims up to 10.3× faster raw attention and 2.4× speed‑up in LLM pre‑fill. Read more: getnews.me/proxyattn-introduces-gui... #proxyattn #sparseattention
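The posts above keep mentioning sparse attention without showing what it means in practice. A minimal sketch of the core idea, in NumPy: each query attends only to its top‑k highest‑scoring keys instead of all of them. This is a generic illustration of top‑k sparse attention, not ProxyAttn's or DeepSeek's actual algorithm; the function name and k value are made up for the example.

```python
# Generic top-k sparse attention sketch (NumPy), for illustration only.
import numpy as np

def topk_sparse_attention(Q, K, V, k=4):
    """Q, K, V: (seq_len, d). Each query keeps only its k largest scores."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)  # (seq_len, seq_len) raw attention scores
    # Threshold = each row's k-th largest score; mask everything below it.
    kth = np.partition(scores, -k, axis=-1)[:, -k:].min(axis=-1, keepdims=True)
    masked = np.where(scores >= kth, scores, -np.inf)
    # Softmax over the surviving entries (masked ones contribute exp(-inf) = 0).
    weights = np.exp(masked - masked.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(8, 16)) for _ in range(3))
out = topk_sparse_attention(Q, K, V, k=4)
print(out.shape)  # (8, 16)
```

The compute savings in the real systems come from never materializing the masked scores at all (block‑sparse kernels), which this dense sketch does not attempt.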
DeepSeek tests “sparse attention” to slash AI processing costs https://arstechni.ca #computationalefficiency #transformerarchitecture #long-contextprocessing #AIdevelopmenttools #AIinfrastructure #machinelearning #sparseattention #AIefficiency #AIresearch #opensource #ChineseAI #deepseek…
DeepSeek unveils V3.2-exp model with sparse attention, slashing AI inference costs by up to 50%. A game-changer for long-context operations! #AI #DeepSeek #SparseAttention #TechInnovation Link: thedailytechfeed.com/deepseeks-v3...
DeepSeek Releases Experimental V3.2 AI Model with ‘Sparse Attention’ to Boost Efficiency
#AI #DeepSeek #ChinaAI #OpenSource #LLM #TechWar #SparseAttention #China
winbuzzer.com/2025/09/29/d...
FG‑Attn brings fine‑grained sparse attention to speed video diffusion
FG‑Attn applies fine‑grained sparse attention, yielding an average 1.55× speedup on a single NVIDIA H100 GPU for five‑second 480p clips (up to 1.65×). Read more: getnews.me/fg-attn-brings-fine-grai... #fgattn #sparseattention
DeepSeek’s NSA Outperforms Full Attention, Making AI Models Faster and Smarter 🚀💡📖 www.azoai.com/news/2025021... #AI #MachineLearning #DeepLearning #SparseAttention #LLMs #ArtificialIntelligence #NeuralNetworks #Tech #GPUs #Research #DeepSeek