TECH

DeepSeek Unveils New Model With Sparse Attention, Slashes API Costs

Published: Sep. 30, 2025 4:56 a.m. GMT+8

00:00

00:00/00:00

Listen to this article 1x

A user interface message on the DeepSeek artificial intelligence app on a mobile phone on Feb. 28, 2025.

Chinese AI startup DeepSeek has unveiled a new experimental large language model and slashed prices for its services, intensifying the price war in China’s crowded AI market.

The company on Monday released DeepSeek-V3.2-Exp, featuring a proprietary “DeepSeek Sparse Attention” mechanism that it says boosts training and inference efficiency for long texts. The model is now available on DeepSeek’s app, website, and mini program.

You've accessed an article available only to subscribers

Subscribe today for just $.99.

VIEW OPTIONS

Unlock exclusive discounts with a Caixin group subscription — ideal for teams and organizations.

Save an extra $50. Introductory offer for new readers. Subscribe now.

Disclaimer

This is an AI-generated English rendering of original reporting or commentary published by Caixin Media. In the event of any discrepancies, the Chinese version shall prevail.

Share this article

Open WeChat and scan the QR code

DIGEST HUB

Digest Hub

Back

Explore the story in 30 seconds

DeepSeek launched DeepSeek-V3.2-Exp with “Sparse Attention” for efficient long-text processing and cut API prices by up to 75%.
The model’s performance is similar to V3.1-Terminus; earlier upgrades included faster inference and reduced hallucinations by 45–50%.
DeepSeek’s R1 model research, featured in Nature, showed LLMs can learn reasoning autonomously via reinforcement learning.

AI generated, for reference only

Who’s Who

DeepSeek: DeepSeek is a Chinese AI startup that has launched an experimental large language model, DeepSeek-V3.2-Exp. This new model incorporates a "DeepSeek Sparse Attention" mechanism to enhance efficiency with long texts. The company has also significantly cut its API prices. DeepSeek's research has been recognized internationally, with its DeepSeek-R1 model's scientific principles featured in the journal Nature.

AI generated, for reference only

What Happened When

May 2025：: DeepSeek upgraded its R1 model, significantly improving performance in complex reasoning and reducing the hallucination rate by 45%–50%.

August 2025：: DeepSeek launched its V3.1 model, featuring a mixed inference architecture for faster responses.

September 18, 2025：: The scientific principles behind DeepSeek-R1 were featured as a cover story in Nature journal.

September 22, 2025：: DeepSeek released the V3.1-Terminus update, optimizing model performance in code and search agent tasks.

September 29, 2025：: DeepSeek released DeepSeek-V3.2-Exp with a new sparse attention mechanism and announced significant API price cuts.

AI generated, for reference only

GALLERY

: Gallery: Huaiyang Cuisine Takes Center Stage at Trump Banquet

NEWSLETTERS

Get our CX Daily, weekly Must-Read and China Green Bulletin newsletters delivered free to your inbox, bringing you China's top headlines.

We ‘ve added you to our subscriber list.

Manage subscription

PODCAST

China Business Uncovered Podcast: How a Lost Masterpiece Sparked a Museum Scandal

00:00

00:00/00:00

MOST POPULAR

1: Chinese Politburo Member’s Downfall Exposes Shadowy Network Tied to Evergrande, Vanke

2: Hong Kong Grapples With Near-One-Year High in Covid-19 Infections

3: Chinese Consumers Tighten Purse Strings Despite Modest Income Gains

4: Analysis: China’s AI Boom Masks a Deepening Economic Divide

5: Alibaba Leads $439 Million Funding Round for AI Video Startup AIsphere