theAIcatchup

#MoE models

Bar chart of Step-3.5-Flash crushing Kimi K2.5 and others on decoding cost and benchmarks
AI Hardware

StepFun's 196B Beast Runs Top AI Scores for Pennies—And It's Open Source

Imagine running AI that smokes the leaders without bankrupting your GPU budget. StepFun's Step-3.5-Flash just made elite performance dirt cheap for devs everywhere.

3 min read 1 week, 3 days ago
Diagram showing LoRA adapters injected into GPT-OSS 20B Mixture-of-Experts architecture during fine-tuning
AI Hardware

Taming GPT-OSS 20B: LoRA's Wild Ride on OpenAI's MoE Beast

OpenAI drops a 20B MoE monster into open source, and suddenly fine-tuning isn't just for billion-dollar labs. One practitioner's gritty guide reveals LoRA hacks that make it feasible on everyday rigs.

3 min read 1 week, 4 days ago
NVIDIA Nemotron-Cascade 2 model performance charts on math and coding benchmarks
AI Hardware

NVIDIA's Nemotron-Cascade 2: Proof That Small Can Think Big in AI Reasoning

Ever wonder why your fancy 100B+ AI still fumbles basic math? NVIDIA's new 30B Nemotron-Cascade 2 might have the answer — and it's open-weight.

4 min read 1 week, 6 days ago
NVIDIA Nemotron 3 Super model selection screen in Amazon Bedrock console
AI Hardware

NVIDIA's Nemotron 3 Super Lands on Bedrock: 5x Speed Boost, Same Old Hype?

Picture this: a 120 billion parameter model that only wakes up 12 billion at a time, now chilling serverless on AWS. NVIDIA's latest Nemotron drop promises agentic wizardry—but who's cashing the real checks?

4 min read 2 weeks ago
Diagram of Mistral Small 4's 128-expert MoE architecture with active experts highlighted
AI Hardware

Mistral Small 4: The Jack-of-All-Trades AI That Might Master None

Everyone figured AI needed specialist models for chat, math, code, and pics. Mistral Small 4 says hold my beer: one fat MoE does it all. Deployment just got simpler. Or did it?

3 min read 2 weeks ago
© 2026 theAIcatchup. All rights reserved.
