AI Research
AI Models Flunk IT Benchmark: Below 50% on Critical Enterprise Tasks
Turns out, those fancy AI models can't fix your server yet. A new benchmark reveals they're fumbling critical enterprise IT tasks, scoring embarrassingly low.