#llmsteering hashtag - Bluesky

@getnews-me.bsky.social

5 months ago

Interpretability vs Utility in Sparse Autoencoders for LLM Steering

Across 90 SAEs on three LLMs, interpretability and steering utility showed only a modest rank correlation (Kendall's tau‑b ≈ 0.298), and Delta Token Confidence boosted steering performance by ~52.5%. Read more: getnews.me/interpretability-vs-util... #sparseautoencoders #llmsteering


GetNews.me

@getnews-me.bsky.social

6 months ago

REAL Framework Boosts Inference-Time Steering of LLMs

REAL, an inference‑time steering framework, was tested on eight Llama and Qwen LLMs, achieving gains of up to 81.5% and an average improvement of 20%. Published October 2025. Read more: getnews.me/real-framework-boosts-in... #real #llmsteering #ai


GetNews.me

@getnews-me.bsky.social

6 months ago

Refining LLM Steering Vectors with Sparse Autoencoders

Researchers use sparse autoencoders to refine LLM steering vectors via denoising and augmentation, improving performance on limited data. Submitted 28 Sep 2025 (arXiv:2509.23799). Read more: getnews.me/refining-llm-steering-ve... #llmsteering #sparseautoencoder
