Última hora
GLOBALSupreme Court Rules Trump Can Fire FTC Commissioners, Overturning Key PrecedentDEBerenberg-Bank: BaFin entmachtet Geschäftsleitung - Hintergründe unklarDEWM als Menschenbringer? Rassismus-Skandal um SchweinsteigerFRCouple placé en garde à vue après le décès de leurs jumelles de 15 mois à BeuvragesKR모나코 주거용 건물 폭발, 3명 부상…용의자 추적 중ESReal Madrid y Valencia Basket se disputan a Pedro MartínezTRFransa'da Sıcak Hava Dalgası Sonrası 1.000 Ek Ölüm GözlendiITProcesso Crollo Ponte Morandi: Udienza del 16 Luglio, Probabile Sentenza la Stessa SerataUKEU and China Launch 3-Month Talks to Avert Trade War Over €360bn ImbalanceUSVirginia Approves Legal Recreational Marijuana Sales, Set to Begin July 2027GLOBALSupreme Court Rules Trump Can Fire FTC Commissioners, Overturning Key PrecedentDEBerenberg-Bank: BaFin entmachtet Geschäftsleitung - Hintergründe unklarDEWM als Menschenbringer? Rassismus-Skandal um SchweinsteigerFRCouple placé en garde à vue après le décès de leurs jumelles de 15 mois à BeuvragesKR모나코 주거용 건물 폭발, 3명 부상…용의자 추적 중ESReal Madrid y Valencia Basket se disputan a Pedro MartínezTRFransa'da Sıcak Hava Dalgası Sonrası 1.000 Ek Ölüm GözlendiITProcesso Crollo Ponte Morandi: Udienza del 16 Luglio, Probabile Sentenza la Stessa SerataUKEU and China Launch 3-Month Talks to Avert Trade War Over €360bn ImbalanceUSVirginia Approves Legal Recreational Marijuana Sales, Set to Begin July 2027
Newsgather
BackDSpark Module Enhances AI Response Generation Efficiency
DSpark Module Enhances AI Response Generation Efficiency
Tecnología
SCMP Tech1 g önceTecnología1 dk okumaChina

DSpark Module Enhances AI Response Generation Efficiency

En resumen

DeepSeek's DSpark module accelerates AI inference by using a lightweight draft model for candidate responses, verified in batches by a larger model, and employs semi-autoregressive generation and confidence-based scheduling for balanced speed and quality.

Resumen generado por IA

Por qué importa

DeepSeek aims to improve AI service efficiency.

Tamaño de fuente

AI models’ conventional token-by-token output often slowed when responses were lengthy, leading to low utilisation of graphics processing units (GPU) and high user-perceived waiting time, which was a “primary bottleneck in serving AI”, the company said in research published on Saturday. DeepSeek said the DSpark module accelerated AI response generation – also known as AI inference, which refers to serving a trained model to respond to user queries – by using a lightweight draft model to propose candidate responses and then verifying them in batches with a larger model, speeding up output. DSpark further refined the approach with a semi-autoregressive generation method, allowing the model to produce small chunks of tokens rather than strictly one at a time. It also introduced a confidence-based scheduling system that dynamically adjusted how much verification was applied based on computing demand, helping balance speed and output quality.

Qué observar

Perspectiva de IA — posibilidades, no hechos

  • Increased adoption of DSpark in AI services

    Probable · En meses

Preguntas abiertas

  • Impact on user experience
  • Broader industry adoption plans

Temas relacionados

This article was originally published by SCMP Tech.

Noticias relacionadas

Más sobre este temaAI Efficiency