StepFun's 196B Beast Runs Top AI Scores for Pennies—And It's Open Source
Imagine running AI that smokes the leaders without bankrupting your GPU budget. StepFun's Step-3.5-Flash just made elite performance dirt cheap for devs everywhere.
OpenAI drops a 20B MoE monster into open source, and suddenly fine-tuning isn't just for billion-dollar labs. One practitioner's hands-on guide reveals LoRA tricks that make it feasible on everyday rigs.
Ever wonder why your fancy 100B+ AI still fumbles basic math? NVIDIA's new 30B Nemotron-Cascade 2 might have the answer — and it's open-weight.
Picture this: a 120-billion-parameter model that only wakes up 12 billion at a time, now chilling serverless on AWS. NVIDIA's latest Nemotron drop promises agentic wizardry, but who's cashing the real checks?
Everyone figured AI needed specialist models for chat, math, code, and pics. Mistral Small 4 says hold my beer: one fat MoE does it all. Deployment just got simpler. Or did it?