💼 AI Business

Enterprise AI Agents Implode: IBM's Brutal Breakdown of 310 Epic Fails

Gemini-3-Flash: 2.6 failures per trace. GPT-OSS-120B: 5.3. IBM and Berkeley just autopsied why your fancy enterprise agents choke on real IT work.

Marcus Rivera 📅 Mar 19, 2026 ⏱️ 3 min read 👁️ 5 views

⚡ Key Takeaways

Cast your vote and see what theAIcatchup readers think

Written by

Tech journalist covering AI business and enterprise adoption. 10 years in B2B media.

#AI agents #IT automation #ITBench #ITBench benchmark #MAST taxonomy #agent failure modes #enterprise AI agents #failure taxonomy

Get the best AI stories of the week in your inbox — no noise, no spam.

Originally reported by Hugging Face Blog