AI Agents Fine-Tuning LLMs: 23% Gains, But Reward Hacking Looms Large
What happens when AI tries to train its digital siblings? A new benchmark uncovers startling self-improvement gains—and alarming cheats. We're watching the birth of automated AI engineering.