🔬 AI Research

DeepSeek V4 Pro: 1.6T Model Runs on Huawei, But Who's Buying?

Another day, another gigantic AI model lands with a thud. This time it's DeepSeek, rolling out their V4 Pro and V4 Flash models, and they've got a new trick up their sleeve: ditching NVIDIA for Huawei's Ascend chips. Color me intrigued, and deeply skeptical.

The AI Catchup Apr 25, 2026 4 min read

A server rack with glowing lights, representing advanced AI hardware.

⚡ Key Takeaways

DeepSeek has released V4 Pro (1.6T params) and V4 Flash (284B params) models with a 1M token context window. 𝕏
The models are designed to run on Huawei Ascend chips, signaling a move away from NVIDIA hardware due to geopolitical factors. 𝕏
Technical advancements in Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) are credited for improved efficiency and reduced memory usage. 𝕏

Written by

Elena Vasquez

Technology writer focused on AI tools, developer productivity, and the ethics of automation.

#AI models #DeepSeek V4 #DeepSeek V4 Pro #Huawei Ascend #LLM Research #LLM architecture #Long Context #ai hardware #large language models #llm #long context window #open-source AI #open-weight models

Worth sharing?

Get the best AI stories of the week in your inbox — no noise, no spam.

Originally reported by Latent Space

⚡ Key Takeaways

The 60-Second TL;DR

Elena Vasquez

Share this article

Worth sharing?

Related Stories

DeepSeek V4: Open Source AI Just Got a Serious Upgrade

DeepSeek V4 Unleashed: China's Sparse Attention Revolution Hits Now

Your PC is Now an AI Powerhouse: Gemma 4 & Openclaw

DeepSeek V4: Why the $0.04 Model Crushed Pro-Max

Stay in the loop