George Z Lin (@gzlin) Bsky

World's worst comms department

16 hours ago 7 0 0 0

Meanwhile the internet is still discussing whether "encrypted at rest" means "not stolen."
Classic.

16 hours ago 0 0 0 0

Context – AI agents that get smarter every week Context deploys agents inside your enterprise systems. They execute real workflows, learn from your team's corrections, and measurably improve.

A Roblox cheat, an AI tool, and a Vercel employee walk into a bar. It's not a joke—it's a supply chain attack. 🍺

Vercel's platform got owned by:
1. Someone installing a Roblox cheat on their work laptop
2. It contained an infostealer
3. They'd also granted "Allow All" permissions to context.ai

16 hours ago 2 0 1 0

Google's AI coding problem: 6+ tools scattered across DeepMind, Cloud, Android, Labs & more.

Gemini Code Assist, Gemini CLI, Antigravity, Firebase Studio, Jules, AI Studio — all overlapping, none dominating.

Fragmentation > focus. Meanwhile rivals Claude & Codex win.

22 hours ago 1 0 1 0

Kimi team creates PrfaaS, fixes LLM deployment memory bottlenecks by offloading prefill tasks, improving KV-cache scalability and performance cross-datacenter.
arxiv.org/abs/2604.150...

23 hours ago 0 0 0 0

China's humanoid "Lightning" crushed the human half-marathon world record at a Beijing race, finishing in 50:26 and beating all 12,000 human runners. The event showcased 40% autonomous robots, all achieving sub-one-hour times. A stunning reversal from last year's failures.

2 days ago 0 0 0 0

Anthropic launches Claude Design. Cool demo, AI produces competent templates but is terrible at the one thing that matters. Making Choices. AI design equals same rounded-corner-card aesthetic given training data. So we'll get an army of mediocre design, just at AI speed.

4 days ago 0 0 0 0

Anthropic team study on Claude Sonnet 4.5 reveals LLMs simulate emotions, influencing behavior and alignment, crucial for ethical AI development.
arxiv.org/abs/2604.07729

5 days ago 1 0 0 0

OpenAI just dropped Codex update: agents that can control your Mac. See, click, type across apps, browse the web, generate images. 90+ plugins, memory for preferences, auto-resume tasks. RIP to corporate token budgets.

5 days ago 0 0 0 0

Claude Opus 4.7 uses an updated tokenizer that improves how the model processes text. The tradeoff is that the same input can map to more tokens—roughly 1.0–1.35× depending on the content type..... Technical Details matter. Guess will drive Anthropic revenue up another 20% or so.

6 days ago 0 0 0 0

Claude is down again, instead of cybersecurity, they should be pointing their frontier mythos models at SRE

1 week ago 0 0 0 0

Allbirds is becoming an AI cloud, going from merino wool sneakers to MoE Neural Networks. Consider me a skeptic. April 1st was 2 weeks ago.

1 week ago 0 0 0 0

Bytedance's In-Place Test-Time Training enhances LLMs with dynamic adaptation during inference, improving performance on long-context tasks without extensive retraining.
arxiv.org/abs/2604.06169

1 week ago 0 0 0 0

Can't have good open source AI without actual hardware support. The reality is that AMD's open ROCm only works on recent cards, making the open source argument meaningless for 90% of users. CUDA dominates because it just works.

1 week ago 0 0 0 0

Apple's Simple Self-Distillation enhances LLM code generation, improving accuracy and simplifying training without external verification or teacher models through embarrassingly simple supervised learning
arxiv.org/abs/2604.01193

2 weeks ago 0 0 0 0

Tsinghua proposes Natural Language Agentic Harnesses with Intelligent Harness Runtimes to fix agent performance by externalizing harness logic, improving clarity, portability, and tool usage across multi-turn agentic systems.
arxiv.org/abs/2603.25723

2 weeks ago 1 0 0 0

UW / CMU team creates efficient coding agents with SERA framework, reducing costs and complexity while enhancing performance through innovative training techniques and privacy protection.
arxiv.org/abs/2601.20789

3 weeks ago 0 0 0 0

History rhymes. AC/DC debate is back (at least for AI Data Centers). Direct DC power to accelerators skips conversion loss, saves on heat. Expect infrastructure conversions to start later this year.

3 weeks ago 0 0 0 0

Attention Residuals Residual connections with PreNorm are standard in modern LLMs, yet they accumulate all layer outputs with fixed unit weights. This uniform aggregation causes uncontrolled hidden-state growth with dept...

arxiv.org/abs/2603.15031

1 month ago 0 0 0 0

AttnRes from Moonshot team enhances LLMs with learned attention, further enhancement with Block AttnRes improves memory utilization as an alternative residual architecture

1 month ago 0 0 1 0

SII team shows Geometric Autoencoder, enhances high-resolution visual generation, improving semantic discriminability and reconstruction fidelity
arxiv.org/abs/2603.10365

1 month ago 0 1 0 0

Berkeley team creates V1, enhances LLMs' parallel reasoning through pairwise self-verification, improving accuracy and robustness in solution selection and reasoning tasks.
arxiv.org/abs/2603.04304

1 month ago 0 0 0 0

CMDM enhances human motion synthesis with real-time, realistic generation using a novel framework combining MAC-VAE and Causal-DiT.
arxiv.org/abs/2602.22594

1 month ago 0 0 0 0

UniFreiburg and MSR work uses linear RNNs to excel in state-tracking tasks, outperforming dense supervision Transformers, showing a path forward in code execution and paving the way for hybrid nonlinear RNNs.
arxiv.org/abs/2602.14814

1 month ago 0 0 0 0

Rutgers team shows A-MEM enhances LLM agents' memory management with dynamic organization, intelligent synaptic linking, and improving long-term interaction capabilities.
arxiv.org/abs/2502.12110

2 months ago 0 0 0 0

Meta's TinyLoRA achieves effective reasoning in language models with just 13 parameters, enabling compute and energy efficient model customization
arxiv.org/abs/2602.04118

2 months ago 0 0 0 0

LoongFlow is an advanced AI framework by Baidu, enhancing evolutionary processes through structured reasoning and outperforming traditional methods in efficiency.
arxiv.org/abs/2512.24077

2 months ago 0 0 0 0

Mechanistic Analysis of Hierarchical Reasoning Model performance in Sudoku shows strengths in complex tasks but highlights guessing tendencies and fixed point violations. arxiv.org/abs/2601.10679

2 months ago 0 0 0 0

CALM by Tencent Team uses continuous vector prediction, improving efficiency and performance while reducing computational costs significantly in LLM design. arxiv.org/abs/2510.27688

2 months ago 1 0 0 0

RelayLLM enhances reasoning in language models by enabling smaller models to efficiently collaborate with larger ones, reducing costs significantly. arxiv.org/abs/2601.05167

3 months ago 0 0 0 0

Posts by George Z Lin