Categories: विज्ञान

China's DeepSeek releases 'intermediate' AI model on route to next generation

BEIJING (Reuters) -Chinese AI developer DeepSeek has released its "experimental" latest model, which it said was more efficient to train and better at processing long sequences of text than previous iterations of its large language models. The Hangzhou-based company called DeepSeek-V3.2-Exp an "intermediate step toward our next-generation architecture" in a post on developer forum Hugging Face. That architecture will likely be DeepSeek's most important product release since V3 and R1 shocked Silicon Valley and tech investors outside China. The V3.2-Exp model includes a mechanism called DeepSeek Sparse Attention, which the Chinese firm says can cut computing costs and boost some types of model performance. DeepSeek said in a post on X on Monday that it is cutting API prices by "50%+". While DeepSeek's next-generation architecture is unlikely to roil markets as previous versions did in January, it could still put significant pressure on domestic rivals like Alibaba's Qwen and U.S. counterparts like OpenAI if it can repeat the success of DeepSeek R1 and V3. That would require it to demonstrate high capability for a fraction of what competitors charge and spend in model training. (Reporting by Eduardo Baptista and Beijing Newsroom; Editing by Toby Chopra and Jan Harvey)

(The article has been published through a syndicated feed. Except for the headline, the content has been published verbatim. Liability lies with original publisher.)

Inkhabar webdesk

Share
Published by Inkhabar webdesk

Recent Posts

CBOT Trends-Wheat up 1-3 cents, corn steady-down 1, soybeans steady-down 1

CHICAGO, Oct 3 (Reuters) - The following are U.S. expectations for the resumption of grain…

29 seconds ago

Foreign investors can exploit cheaper dollar hedges as Fed easing resumes

(Corrects firm name to MillTech from MillTechFX in paragraph 22) By Laura Matthews and Saqib…

4 minutes ago

Indian states to raise 2.82 trillion rupees through debt in current quarter, RBI says

Oct 3 (Reuters) - Indian states will likely borrow 2.82 trillion rupees ($31.76 billion) via…

5 minutes ago

RPT-UPDATE 2-Death toll rises to 13 as rescuers search for trapped Indonesian students

(Repeats with wider coding) SIDOARJO, Indonesia, Oct 3 (Reuters) - The number of students confirmed…

10 minutes ago

Real Madrid's Alonso plays diplomatic defence amid Valverde controversy

VIDEO SHOWS: REAL MADRID SQUAD TRAINING, REMARKS BY REAL MADRID COACH XABI ALONSO SHOWS: COMPLETE SCRIPT…

11 minutes ago

GIP in talks to buy Aligned Data Centers, sources say

By Akash Sriram (Reuters) -BlackRock-owned Global Infrastructure Partners (GIP) is in talks to acquire Macquarie-backed…

15 minutes ago