#Turboquant hashtag - Bluesky

@changemypitchup.bsky.social

4 hours ago

GitHub - peva3/turboquant-h2o-streamingllm Contribute to peva3/turboquant-h2o-streamingllm development by creating an account on GitHub.

Holy shit! I just got Turboquant working. They weren't fucking kidding. 4x context space with the same memory usage!
I used: github.com/peva3/turboq...

#llm #ai #turboquant

1 0 0 0

Oliver T Lord

@olivertlord.bsky.social

5 hours ago

Google's #TurboQuant is a compression mechanism for Key-Value Cache — the 'memory' of an #LLM.

It is derived from #PolarQuant which converts KV vectors from Cartesian to polar co-ordinates.

Early tests (admittedly by Google) show 6x reduction in memory usage.

This seems quite important for #AI.

0 0 0 0

Hannu Varjoranta

@varjoranta.bsky.social

1 day ago

GitHub - varjoranta/turbo-quant-lite Contribute to varjoranta/turbo-quant-lite development by creating an account on GitHub.

I open sourced minimal #python implementation for #TurboQuant:
github.com/varjoranta/t...

0 0 0 0

Hannu Varjoranta

@varjoranta.bsky.social

1 day ago

GitHub - varjoranta/turbo-quant-lite Contribute to varjoranta/turbo-quant-lite development by creating an account on GitHub.

I open sourced minimal #python implementation for #TurboQuant:
github.com/varjoranta/t...

0 0 0 0

Hannu Varjoranta

@varjoranta.bsky.social

1 day ago

GitHub - varjoranta/turbo-quant-lite Contribute to varjoranta/turbo-quant-lite development by creating an account on GitHub.

I open sourced minimal #python implementation for #TurboQuant:
github.com/varjoranta/t...

8 1 1 1

BuySellRam.com

@buysellram.bsky.social

1 day ago

Will Google's TurboQuant AI Compression Finally Demolish the AI Memory Wall? Will TurboQuant end the HBM shortage? Explore Google’s 6x KV cache compression, the Jevons Paradox, and how to manage GPU assets as the AI Memory Wall moves.

Google’s TurboQuant is being positioned as a breakthrough that could finally break the AI “memory wall”—but the reality is more nuanced.
www.buysellram.com/blog/will-go...

#AI #TurboQuant #Google #AIMemoryWall #AICompression #KVCache #LLMInference #MemoryBottleneck #ModelEfficiency #DataCenter

0 0 0 0

Kevin Sarrazin

@ksarrazin.bsky.social

1 day ago

Compression extrême sans perte : l'algorithme de compression IA TurboQuant de Google promet de réduire d'un facteur de six l'utilisation de la mémoire LLM Un billet de recherche publié un mardi de mars 2026 a suffi à faire plonger les actions de Micron, SanDisk, SK Hynix et Samsung. L'algorithme en cause, TurboQuant, promet de diviser par six la…

Compression extrême sans perte : l'algorithme de compression #IA #TurboQuant de #Google promet de réduire d'un facteur de six l'utilisation de la mémoire #LLM buff.ly/KljNsWh

0 0 0 0

woody｜AI×実用ツール専門

@woody-ai-tools.bsky.social

1 day ago

Google TurboQuant入門 — KVキャッシュ3ビット圧縮でLLM推論を8倍高速化 - Qiita はじめに LLMの推論コストを支配する要因のひとつが KVキャッシュ（Key-Value Cache）のメモリ消費である。コンテキスト長が伸びるほどKVキャッシュは線形に膨張し、GPUメモリを圧迫してバッチサイズやスループットを制限する。 2026年3月25日、Goo...

LLMのKVキャッシュ、もう3ビットでいいらしい。Googleの「TurboQuant」が強烈。
KVキャッシュを3bitに圧縮してメモリを6倍削減、H100で推論を最大8倍高速化。しかも精度損失ゼロ＆再学習不要。

ローカルLLM環境でのメモリ節約の切り札になりそう。使ってる人いる？

#AI #LLM #TurboQuant #Python #LocalLLM

https://qiita.com/kai_kou/items/a411215806322af68a73

1 0 0 0

in

@0madel0.bsky.social

1 day ago

Рынок оперативы стремительно падает — #Google представил алгоритм #TurboQuant, который снижает потребление памяти в 6 раз и увеличивает производительность до 8 раз, что может снизить спрос со стороны ИИ-индустрии. Акции производителей уже падают: Micron Technology 23%, SanDisk 11%, остальные — 6%.

1 0 0 0

Timelines

@hulio-ai.bsky.social

2 days ago

🚀 TurboQuant compresses AI memory by 6x.🤖 Gemini enables multi-modal AI with text, images, audio.🧑‍💻 Agentic AIs automate coding & workflows.💹 AI boosts enterprise gains, widens skill gaps.
#AIBreakthroughs #TurboQuant #Gemini #AgenticAI #AIWorkforce
View in Timelines

0 0 0 0

@techlife-blog.bsky.social

2 days ago

Google's TurboQuant Compresses AI Memory by 6x — With Zero Accuracy Loss Google Research published TurboQuant, a training-free compression algorithm that shrinks LLM key-value cache memory by at least 6x and speeds up attention by up to 8x on H100 GPUs — without any accura...

Google's TurboQuant Compresses AI Memory by 6x — With Zero Accuracy Loss

techlife.blog/posts/google...

#Google #TurboQuant #LLM #AIEfficiency #KVCache #ICLR2026 #MachineLearning #Compression

2 0 1 0

Meng Li

@mengli512.bsky.social

2 days ago

Google's New Paper Deals a Heavy Blow to Memory Chips Google’s TurboQuant cuts AI memory use by 6x, sparking stock drops but experts say long-term chip demand will grow.

Google's TurboQuant compresses AI memory by 6x, sparking chip stock sell-offs—but is the demand threat overblown?
#Google #TurboQuant
open.substack.com/pub/chippub/...

0 0 0 0

woody｜AI×実用ツール専門

@woody-ai-tools.bsky.social

2 days ago

GoogleALLMÌÁïð6ªÌ1Éí¸·éVZpuTurboQuantv\ GoogleÍALLMÌÁïÊð6ªÌ1Éí¸·éVZpuTurboQuantvð\µ½BPolarQuantÆQJLðgÝí¹A¸xðÛµ½ÜÜKVLbVð3rbgÜÅ³k·éBNVIDIAÌH100ÅÌvZ¬xÍÅå8{ÉüãBGeminiÌåKÍfâxNgõÌIÈ¬»ªúÒ³êéB

Googleの「TurboQuant」、LLMのメモリ消費を6分の1に削減ってマジか…！🤖 これが実装されたらローカルLLMの推論環境が劇的に軽くなるな。GPUのメモリ不足で諦めてたモデルも動かせるかも。

みんなはどのモデルの軽量化に期待してる？

#AI #LLM #TurboQuant #エンジニア #ローカルLLM

www.itmedia.co.jp/news/articles/2603/27/ne...

1 0 0 0

にゅーす!

@news0.bsky.social

2 days ago

gle #TurboQuant #社会トピックス

0 0 0 0

3C潮流情報站

@biggo-news.bsky.social

2 days ago

Google 的 TurboQuant 將 AI 記憶體需求削減 6 倍，引發市場震盪與效率之辯

Google新演演算法TurboQuant將AI記憶體需求砍6倍，記憶體股價應聲下跌！這會是AI硬體需求的轉捩點嗎？
biggo.com.tw/news/202603270230_Google...

#TurboQuant #AI記憶體

0 0 0 0

テクノロジー速報

@techtrendsjp.bsky.social

2 days ago

Google の TurboQuant が AI のメモリ需要を 6 分の 1 に削減、市場に動揺と効率性を巡る議論を巻き起こす

Googleの新圧縮技術「TurboQuant」がAIのメモリ使用量を6分の1に削減。H100で最大8倍高速化も実現し、市場は需要減を懸念してメモリ株が急落。技術の効率化は本当に需要を減らすのか？詳細な分析は記事で。
biggo.jp/news/202603270230_Google...

#TurboQuant #Google

0 0 0 0

3C潮流情報站

@biggo-news.bsky.social

2 days ago

Google 的 TurboQuant AI 壓縮技術將記憶體用量削減 6 倍，引發市場動盪

Google TurboQuant AI壓縮技術將記憶體用量削減6倍，推論速度提升8倍！這會讓AI更便宜還是引發新需求？點選看完整分析。
biggo.com.tw/news/202603270158_Google...

#TurboQuant #AI記憶體壓縮

0 0 0 0

テクノロジー速報

@techtrendsjp.bsky.social

2 days ago

Google の AI 圧縮技術「 TurboQuant 」がメモリ使用量を 6 倍削減、市場に混乱を招く

Googleの新技術「TurboQuant」がAIのメモリ使用量を6分の1に削減。推論速度は最大8倍向上し、市場に衝撃を与えています。詳細な技術解説と市場分析は記事で。
biggo.jp/news/202603270158_Google...

#TurboQuant #AI効率化

0 0 0 0

Tech Insights

@techinsights.bsky.social

2 days ago

Google's TurboQuant AI Compression Cuts Memory Use 6x, Sparks Market Turmoil

Google's TurboQuant AI cuts memory use 6x & speeds up inference 8x, slashing AI costs. But why did it tank memory stocks? The answer is a 160-year-old economic paradox. Full analysis in the comments.
biggo.com/news/202603270158_Google...

#TurboQuant #AI

1 0 1 0

Don Porqué

@donporque.bsky.social

2 days ago

Por qué Google pone en jaque a Micron, Samsung y SK Hynix Google sacude a Micron, Samsung y SK Hynix con TurboQuant, la tecnología que abarata la IA y aprieta el negocio global de la memoria de golpe Google ha activado una alarma muy seria en uno de los nego...

Por qué Google pone en jaque a Micron, Samsung y SK Hynix #felizviernes #27demarzo #Google #InteligenciaArtificial #IA #TurboQuant #Micron #Samsung #SKHynix #Semiconductores #Bolsa #WallStreet #Nvidia #Tecnologia donporque.com/google-abara...

0 0 0 0

deepseek

@deepseek.activitypub.awakari.com.ap.brid.gy

2 days ago

Here Is The Unvarnished Truth About Google’s TurboQuant: Jevons Paradox Prevails, Memory Crunch To Continue Google's new algorithm that dramatically compresses KV cache in a lossless fashion, dubbed TurboQuant, is all the rage these days in the AI sphere, where doomsday predictions around an imminent collapse in the demand for memory abound. Never mind the fact that the underlying paper was released all the way back in April 2025! Even so, we postulate that the current doom-and-gloom in the market is eerily similar to the one that prevailed immediately after DeepSeek released its R1 model in early 2025, and that Jevons paradox will prevail. Google's TurboQuant to supercharge Jevons paradox effect, sky-high demand for memory […]

Here Is The Unvarnished Truth About Google’s TurboQuant: Jevons Paradox Prevails, Memory Crunch To Continue Google's new algorithm that dramatically compresses KV cache in a lossless fashion,...

#Featured #News #AI #Google #KV #Cache #Memory #TurboQuant #Mobile

Origin | Interest | Match

0 0 0 0

deepseek

@deepseek.activitypub.awakari.com.ap.brid.gy

2 days ago

Here Is The Unvarnished Truth About Google’s TurboQuant: Jevons Paradox Prevails, Memory Crunch To Continue Google's new algorithm that dramatically compresses KV cache in a lossless fashion, dubbed TurboQuant, is all the rage these days in the AI sphere, where doomsday predictions around an imminent collapse in the demand for memory abound. Never mind the fact that the underlying paper was released all the way back in April 2025! Even so, we postulate that the current doom-and-gloom in the market is eerily similar to the one that prevailed immediately after DeepSeek released its R1 model in early 2025, and that Jevons paradox will prevail. Google's TurboQuant to supercharge Jevons paradox effect, sky-high demand for memory […]

Here Is The Unvarnished Truth About Google’s TurboQuant: Jevons Paradox Prevails, Memory Crunch To Continue Google's new algorithm that dramatically compresses KV cache in a lossless fashion,...

#Featured #News #AI #Google #KV #Cache #Memory #TurboQuant #Mobile

Origin | Interest | Match

0 0 0 0

Ben@SharedSapience

@sharedsapience.substack.com

3 days ago

Watch today's Century Report podcast here:

https://www.youtube.com/watch?v=jRQS2gnVmB4

#TurboQuant #OffshoreWind #AIRegulati

0 0 0 0

Ben@SharedSapience

@sharedsapience.substack.com

3 days ago

Google's TurboQuant cut AI memory needs sixfold, rattling chip stocks worldwide. Crusoe signed for 12 GWh of iron-air storage. A jury found Meta and YouTube liable for addictive design. #TurboQuant #OffshoreWind #AIRegulati… sharedsapience.com/century-report/the-centu...

0 0 0 0

Antonio Pardo Sánchez

@apardo.spluca.org

3 days ago

Google Introduces TurboQuant: A New Compression Algorithm that Reduces LLM Key-Value Cache Memory by 6x and Delivers Up to 8x Speedup, All with Zero Accuracy Loss TurboQuant: A Compression Algorithm that Reduces LLM Key-Value Cache Memory by 6x and Delivers Up to 8x Speedup, All with Zero Accuracy Loss

#Google Introduces #TurboQuant: A New Compression #Algorithm that Reduces #LLM Key-Value Cache Memory by 6x and Delivers Up to 8x Speedup, All with Zero Accuracy Loss

www.marktechpost.com/2026/03/25/g...

0 0 0 0

The Daily Tech Feed

@thedailytechfeed.com

3 days ago

Google's TurboQuant is revolutionizing AI memory compression, drawing comparisons to 'Pied Piper' from 'Silicon Valley.' A game-changer for AI efficiency! #Google #TurboQuant #AI #MemoryCompression Link: thedailytechfeed.com/googles-turb...

0 0 0 0

Merle Meyerdierks

@merledieperle.bsky.social

3 days ago

Mit #TurboQuant will #Google die künstliche Intelligenz deutlich effizienter machen.

Google hat gerade einen Komprimierungsalgorithmus vorgestellt, der KI achtmal schneller macht und dabei sechsmal weniger Speicherplatz benötigt.

Dabei geht keinerlei Genauigkeit verloren.

3 1 0 0

萌芽站長@萌芽系列網站 Mnya Series Website🌳🌐

@mnya.tw

3 days ago

TurboQuant：顛覆 AI 記憶體瓶頸與重塑未來硬體市場的壓縮革命 - 萌芽綜合天地 - 萌芽網頁 近期，Google 研究團隊發表了一項名為「TurboQuant」的全新免訓練（training-free）壓縮演算法，預計於 2026 年的 ICLR 國際會議上正式亮相。隨著大型語言 […]

🆕 mnya.tw/cc/word/2748...
#TurboQuant #Google #顛覆 #AI記憶體瓶頸 #AI #記憶體瓶頸 #記憶體 #RAM #VRAM #瓶頸 #重塑未來硬體市場 #重塑 #未來硬體市場 #未來 #硬體市場 #硬體 #壓縮革命 #資訊新聞 #人工智慧

1 0 0 0

techandchips

@techandchips.bsky.social

3 days ago

Google TurboQuant「メモリ6分の1」— HBMスーパーサイクルは揺らぐのか 2025年3月25日、Google Researchが「TurboQuant」を発表した。LLMのKVキャッシュを3ビットまで圧縮し、メモリ使用量を6分の1に削減。NVIDIA H100での推論速度は最大8倍。精度損失はゼロ。インターネットではHBOドラマ『シリコンバレー』の架空の圧縮アルゴリズム「Pied Piper...

Google TurboQuant——メモリ6分の1、推論8倍速、精度損失ゼロ。メモリ株は急落したが、ジェヴォンズ・パラドックスを知る者は慌てない。HBMスーパーサイクルの本当の体力と、工場現場への意味を解剖しました。

techandchips.com/ja/blog/goog...
#TurboQuant #HBM #半導体

0 0 0 0

山本@AIテクノロジーまとめ

@aitechmatome.bsky.social

3 days ago

【衝撃】AIメモリ圧縮「TurboQuant」6倍以上削減！パイド・パイパー超えなるか？ - AIテクノロジーまとめ <a href="https://tech-matome.com//?tag=%e3%82%b0%e3%83%bc%e3%82%b0%e3%83%ab">グーグル</a>の<a href="https://tech-matome.com//?tag=ai">AI</a>研究者たちが発表した新しいAIメモリ圧縮<a href="https://tech-matome.com//?tag=%e3%82%a2%e3%83%ab%e3%82%b4%e3%83%aa%e3%82%ba%e3%83%a0">アルゴリズム</a>「TurboQuant」は、その効率性の高さからインターネット上で、HBOのテレビドラマ「シリコンバレー」に登場する架空のスタートアップ「パイド・パイパー」に例えられています。ドラマ「パイド・パイパー」は、ファイルサイズを大幅に削減する圧縮アルゴリズムを開発したスタートアップの奮闘を描いており、TurboQuantも同様に、AIシステムのボトルネックとなっているメモリ使用量を削減するという点で共通しています。グーグル・リサーチは、TurboQuantをAIの動作メモリを縮小し、パフォーマンスに影響を与えない革新的な技術だと説明しています。この圧縮方法は、AI処理におけるキャッシュのボトルネックを解消するためにベクトル量子化の一種を使用し、AIがより多くの情報を記憶しつつ、省スペースで正確さを維持することを可能にします。研究者たちは、来月開催されるICLR 2026で、この技術を実現する2つの方法、すなわち量子化手法「PolarQuant」と、トレーニングおよび最適化手法「QJL」とともに、その成果を発表する予定です。 TurboQuantは、AIの実行に必要な「ワーキングメモリ」であるKVキャッシュを「少なくとも6倍」削減することで、AIの運用コストを削減する可能性があります。一部からは、中国の<a href="https://tech-matome.com//?tag=ai%e3%83%a2%e3%83%87%e3%83%ab">AIモデル</a>「DeepSeek」が、より安価なチップで競争力のある結果を出し、効率性を高めたことと比較する声も上がっています。ただし、TurboQuantはまだ広く展開されておらず、現時点では研究室でのブレークスルーに過ぎません。そのため、DeepSeekや「パイド・パイパー」のような事例と比較することは難しい部分もあります。TurboQuantは、推論中のメモリ使用量を削減し、効率的なシステムを実現する可能性を秘めていますが、AIのトレーニングに必要な大量の<a href="https://tech-matome.com//?tag=ram%e4%b8%8d%e8%b6%b3">RAM不足</a>を解決するものではありません。

AIメモリ圧縮の新技術「TurboQuant」がヤバすぎた！😲 6倍以上の削減で、まるでドラマ「シリコンバレー」のパイド・パイパー再来か？AIコスト削減の可能性大！🚀 #AI #TurboQuant

▼詳細はこちら

0 0 1 0