DSpark Module Enhances AI Response Generation Efficiency

Technology

SCMP Tech · 1 day ago · Technology · 1 min read · China

AI Efficiency · DSpark Module · DeepSeek

At a glance

DeepSeek's DSpark module accelerates AI inference by using a lightweight draft model for candidate responses, verified in batches by a larger model, and employs semi-autoregressive generation and confidence-based scheduling for balanced speed and quality.

AI-generated summary

Why it matters

DeepSeek aims to improve AI service efficiency.

Conventional token-by-token generation often slowed when responses were lengthy, leaving graphics processing units (GPUs) underutilised and users waiting longer, which was a “primary bottleneck in serving AI”, DeepSeek said in research published on Saturday.

The company said the DSpark module accelerated AI response generation (also known as AI inference, the process of serving a trained model to respond to user queries) by using a lightweight draft model to propose candidate responses and then verifying them in batches with a larger model.

DSpark further refined the approach with a semi-autoregressive generation method, which allowed the model to produce small chunks of tokens rather than strictly one at a time. It also introduced a confidence-based scheduling system that dynamically adjusted how much verification was applied based on computing demand, helping balance speed and output quality.
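
The draft-and-verify idea described here can be illustrated with a minimal Python sketch. It is not DeepSeek's implementation: `target_next`, `draft_chunk`, `speculative_decode` and the `min_conf` threshold are hypothetical names, both "models" are toy arithmetic functions, and the confidence rule (stop sending drafted tokens to verification once the draft is unsure) is only one assumed reading of "confidence-based scheduling". The verification loop stands in for what a real system would run as a single batched pass of the larger model.

```python
from typing import List, Tuple

Token = int


def target_next(context: List[Token]) -> Token:
    """Toy stand-in for the large model: deterministic next token."""
    return (sum(context) * 31 + len(context)) % 50


def greedy_decode(prompt: List[Token], n_new: int) -> List[Token]:
    """Baseline: one call to the large model per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out


def draft_chunk(context: List[Token], size: int) -> List[Tuple[Token, float]]:
    """Toy draft model: returns a chunk of (token, confidence) pairs.

    It agrees with the target except at every 4th position, where it
    guesses wrong and reports low confidence (an assumption of this toy).
    """
    ctx, chunk = list(context), []
    for _ in range(size):
        guess = target_next(ctx)
        if len(ctx) % 4 == 0:
            guess, conf = (guess + 1) % 50, 0.3
        else:
            conf = 0.9
        chunk.append((guess, conf))
        ctx.append(guess)
    return chunk


def speculative_decode(
    prompt: List[Token], n_new: int, max_draft: int = 4, min_conf: float = 0.5
) -> Tuple[List[Token], int]:
    """Greedy draft-and-verify decoding; returns (tokens, verify passes)."""
    out = list(prompt)
    verify_passes = 0
    while len(out) < len(prompt) + n_new:
        # 1. Draft a chunk, keeping tokens only while confidence stays high.
        proposal: List[Token] = []
        for tok, conf in draft_chunk(out, max_draft):
            if conf < min_conf:
                break
            proposal.append(tok)
        # 2. Verify the proposal (one batched pass in a real system).
        verify_passes += 1
        accepted: List[Token] = []
        for tok in proposal:
            expected = target_next(out + accepted)
            if tok != expected:
                accepted.append(expected)  # replace the first wrong draft token
                break
            accepted.append(tok)
        else:
            # Every drafted token matched: the target adds one more token.
            accepted.append(target_next(out + accepted))
        out.extend(accepted)
    return out[: len(prompt) + n_new], verify_passes


if __name__ == "__main__":
    tokens, passes = speculative_decode([1, 2, 3], 12)
    print(tokens == greedy_decode([1, 2, 3], 12), passes)
```

Because every emitted token is either confirmed or supplied by the target function, the output matches plain greedy decoding; the saving in this toy comes from needing fewer verification passes than generated tokens.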

What to watch

AI outlook: possibilities, not facts

Increased adoption of DSpark in AI services
Likely · Within months

Open questions

Impact on user experience
Broader industry adoption plans

Related topics

Organizations: DeepSeek

Topics: DSpark Module, DeepSeek, AI Inference

This article was originally published by SCMP Tech.
