SeaPO Boosts LLM Truthfulness with Strategic Error Amplification
SeaPO injects targeted errors into LLMs during preference optimization, boosting truthfulness by 5-10 percentage points in models from 1.5B to 14B parameters. getnews.me/seapo-boosts-llm-truthfu... #seapo #truthfulness