Newsgather
BackDeepSeek Releases V4 Open-Source AI Model with 1.6 Trillion Parameters
DeepSeek Releases V4 Open-Source AI Model with 1.6 Trillion Parameters
Urgent
SCMP Tech4/24/2026Tech1 min readChina

DeepSeek Releases V4 Open-Source AI Model with 1.6 Trillion Parameters

Chinese AI startup's flagship model boasts 1M token context window, aims to compete with OpenAI and Google DeepMind

Quick Look

  • DeepSeek released its V4 foundational AI model in two versions - V4-pro with 1.6 trillion parameters and V4-flash with 284 billion parameters.
  • Both feature a 1 million token context window, up from 128,000 in the previous model.
  • The open-source models aim to compete with US leaders OpenAI and Google DeepMind, with Huawei and Cambricon quickly announcing chip compatibility support.

AI-generated summary

Why It Matters

DeepSeek is a Hangzhou-based AI startup that has been gaining recognition for its open-source models. The V4 release marks a significant upgrade from its previous flagship which had a 128,000 token context window. The model architecture and training techniques are outlined in an extended technical report.

Font size

DeepSeek has finally released its much-anticipated next-generation foundational artificial intelligence model, the open-source V4, which it said was competitive with leading US closed-source models from the likes of OpenAI and Google DeepMind. The Hangzhou-based AI start-up released two versions of the model on Friday, with the V4-pro model boasting 1.6 trillion parameters, making the company's biggest-ever model by that metric, while the smaller V4-flash model has 284 billion parameters. A higher parameter count generally correlates with greater capabilities for a model, while also increasing the computational demands of training and serving it.

Both models have a context window of 1 million tokens, a critical feature that determines the amount of information an AI system is able to process, which DeepSeek said was achieved with "world-leading" cost efficiency. DeepSeek's previous flagship model had a context window of 128,000 tokens. Soon after DeepSeek's release, Huawei announced "full support" of its range of Ascend chips, along with its supernode systems, to serve V4 models for model inference. The Shenzhen-based tech giant is set to reveal more details about the collaboration in a livestream on Friday afternoon. AI chipmaker Cambricon Technologies also moved quickly to announce compatibility with DeepSeek's new models.

"The release of V4 explicitly mentions compatibility with domestic chips," said analysts from Huatai Securities in a note to clients. "We can look forward to a significant improvement in the capabilities of domestic graphics cards and their widespread adoption this year."

While the parameter size of V4-pro makes it prohibitively large to be run locally on consumer-grade hardware, the extended technical report outlining V4's model architecture and training techniques is likely to be beneficial for global AI developers. The V4-flash model is also one of the cheapest cutting-edge models available on the market, with token pricing identical to DeepSeek's V2 model released in June 2024.

What to Watch

AI outlook — possibilities, not facts

  • Huawei will reveal Ascend chip collaboration details in Friday afternoon livestream

    Very likely · Within days

  • More Chinese semiconductor companies will announce DeepSeek V4 compatibility

    Likely · Within weeks

Open Questions

  • What are the specific performance benchmarks for V4 compared to GPT-4o and Gemini?
  • What is the exact pricing for V4-flash tokens?
  • What are the computational requirements for running V4-flash?

Related Topics

This article was originally published by SCMP Tech.

Related Stories

科技创新成为天津东丽区高质量发展的核心引擎
Developing·3h ago

科技创新成为天津东丽区高质量发展的核心引擎

天津市东丽区通过科技创新,特别是航天科技和新能源汽车领域,正成为区域高质量发展的核心引擎。爱思达航天科技在商业航天领域取得显著成就,中汽中心新能源汽车科技创新基地则为新能源汽车提供严苛的测试验证。东丽区正积极融入京津冀科技创新中心建设,构建先进制造业与现代服务业协同发展的产业体系。

中国新闻网
More on this topicdeepseek v4