SCMP Tech · 1d ago · Tech · 1 min read · China

DSpark Module Enhances AI Response Generation Efficiency

Quick Look

DeepSeek's DSpark module accelerates AI inference by using a lightweight draft model for candidate responses, verified in batches by a larger model, and employs semi-autoregressive generation and confidence-based scheduling for balanced speed and quality.

AI-generated summary

Why It Matters

DeepSeek aims to improve AI service efficiency.

AI models’ conventional token-by-token output often slowed when responses were lengthy, leading to low utilisation of graphics processing units (GPUs) and long user-perceived waiting times, which was a “primary bottleneck in serving AI”, DeepSeek said in research published on Saturday.

The company said the DSpark module accelerated AI response generation, also known as AI inference, which refers to serving a trained model to respond to user queries. It did so by using a lightweight draft model to propose candidate responses and then verifying them in batches with a larger model, so that several tokens could be accepted per pass of the larger model.

DSpark further refined the approach with a semi-autoregressive generation method, allowing the draft model to produce small chunks of tokens rather than strictly one at a time. It also introduced a confidence-based scheduling system that dynamically adjusted how much verification was applied based on computing demand, helping balance speed and output quality.
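The draft-and-verify idea described above can be illustrated with a toy sketch of greedy speculative decoding. This is not DeepSeek's implementation: `target_model`, `draft_model`, `max_draft` and `min_conf` are hypothetical stand-ins invented for illustration, the "models" are simple deterministic functions over integer tokens, and the chunk-wise semi-autoregressive drafting is not modelled. The confidence cut-off is only a rough analogue of confidence-based scheduling.

```python
from typing import Callable, List, Tuple

Token = int
# A "model" here is any function mapping a token prefix to (next_token, confidence).
Model = Callable[[List[Token]], Tuple[Token, float]]


def target_model(prefix: List[Token]) -> Tuple[Token, float]:
    """Hypothetical large model: a deterministic toy rule standing in for a real LLM."""
    return (sum(prefix) * 31 + len(prefix)) % 50, 1.0


def draft_model(prefix: List[Token]) -> Tuple[Token, float]:
    """Hypothetical small model: matches the target except when len(prefix) % 4 == 3."""
    token, _ = target_model(prefix)
    if len(prefix) % 4 == 3:
        return (token + 1) % 50, 0.4  # wrong guess, reported with low confidence
    return token, 0.9


def propose(draft: Model, prefix: List[Token], max_draft: int, min_conf: float) -> List[Token]:
    """Draft up to max_draft tokens, stopping early once confidence drops below min_conf."""
    proposed: List[Token] = []
    for _ in range(max_draft):
        token, conf = draft(prefix + proposed)
        if conf < min_conf:
            break
        proposed.append(token)
    return proposed


def speculative_decode(target: Model, draft: Model, prompt: List[Token], n_new: int,
                       max_draft: int = 4, min_conf: float = 0.5) -> Tuple[List[Token], int]:
    """Return (tokens, number of verification rounds run by the target)."""
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < n_new:
        proposed = propose(draft, out, max_draft, min_conf)
        # A real system would score every drafted position in one batched pass
        # of the large model; this toy calls it position by position.
        rounds += 1
        accepted: List[Token] = []
        for tok in proposed:
            expected, _ = target(out + accepted)
            if tok != expected:
                break  # first mismatch: discard this and all later drafted tokens
            accepted.append(tok)
        # The target always adds one token: the correction at the first
        # mismatch, or a bonus token if every drafted token was accepted.
        accepted.append(target(out + accepted)[0])
        out.extend(accepted)
    return out[: len(prompt) + n_new], rounds


def greedy_decode(target: Model, prompt: List[Token], n_new: int) -> List[Token]:
    """Baseline: one target call per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(target(out)[0])
    return out


if __name__ == "__main__":
    tokens, rounds = speculative_decode(target_model, draft_model, [1, 2, 3], n_new=12)
    assert tokens == greedy_decode(target_model, [1, 2, 3], n_new=12)
    print(f"12 tokens generated in {rounds} verification rounds")
```

Because every accepted token is checked against the larger model's own choice, the output in this sketch is identical to plain token-by-token decoding, only with fewer verification rounds. Raising `min_conf` makes the draft stop sooner and shortens each verification batch; a scheduler could in principle tune such a threshold to current load, though how DSpark actually does this is not described in the article.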

What to Watch

AI outlook — possibilities, not facts

Increased adoption of DSpark in AI services
Likely · Within months

Open Questions

Impact on user experience
Broader industry adoption plans

Related Stories

China Closing Innovation Gap with US in Space Sector: Report

Chinese SiC Manufacturer Basic Prepares for Public Debut Amid National Push for Tech Dominance

Australia Toughens Social Media Ban for Minors, Increases Penalties to AUD 99 Million

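vivo X Fold6 Officially Launched: Focusing on "Large Screen + AI Productivity", Opening a New Chapter for Foldables

Samsung Galaxy S8, S8+ and Note 8 Receive Unexpected Software Update

As the AI Industry Booms, Tech Giants Actively Recruit Philosophers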
