Please ensure Javascript is enabled for purposes of website accessibility
What to Know About DeepSeek and How It Is Upending Artificial Intelligence
d8a347b41db1ddee634e2d67d08798c102ef09ac
By The New York Times
Published 1 day ago on
January 29, 2025

Nvidia’s chief executive, Jensen Huang, speaks at CES 2025 in Las Vegas, Jan. 7, 2025. DeepSeek’s engineers said they needed only about 2,000 Nvidia chips to train the startup’s AI system. (Stella Kalinina/The New York Times)

Share

Getting your Trinity Audio player ready...

SAN FRANCISCO — Tech stocks tumbled. Giant companies such as Meta and Nvidia faced a barrage of questions about their future. And tech executives took to social media to proclaim their fears.

And it was all because of a little-known Chinese artificial intelligence startup called DeepSeek.

DeepSeek caused waves all over the world Monday as one of its accomplishments — having created a very powerful AI model with far less money than many AI experts thought possible — raised a host of questions, including whether U.S. companies were even competitive in AI anymore.

DeepSeek is “AI’s Sputnik moment,” Marc Andreessen, a tech venture capitalist, posted on social media Sunday.

How could a company that few people had heard of have such an effect?

What Is DeepSeek?

DeepSeek is a startup founded and owned by the Chinese stock trading firm High-Flyer. Its goal is to build AI technologies along the lines of OpenAI’s ChatGPT chatbot or Google’s Gemini. By 2021, DeepSeek had acquired thousands of computer chips from the U.S. chipmaker Nvidia, which are a fundamental part of any effort to create powerful AI systems.

In China, the startup is known for grabbing young and talented AI researchers from top universities, promising high salaries and an opportunity to work on cutting-edge research projects. Both High-Flyer and DeepSeek are run by Liang Wenfeng, a Chinese entrepreneur.

Over the past few years, DeepSeek has released several large language models, which is the kind of technology that underpins chatbots such as ChatGPT and Gemini. On Jan. 10, it released its first free chatbot app, which was based on a new model called DeepSeek-V3.

Why Did the Stock Market React to It Now?

When DeepSeek introduced its DeepSeek-V3 model the day after Christmas, it matched the abilities of the best chatbots from U.S. companies such as OpenAI and Google. That alone would have been impressive.

But the team behind the new system also revealed a bigger step forward. In a research paper explaining how it built the technology, DeepSeek said it used only a fraction of the computer chips that leading AI companies relied on to train their systems.

The world’s top companies typically train their chatbots with supercomputers that use as many as 16,000 chips or more. DeepSeek’s engineers said they needed only about 2,000 Nvidia chips.

Why Is That Important?

Since late 2022, when OpenAI set off the AI boom, the prevailing notion had been that the most powerful AI systems could not be built without investing billions of dollars in specialized AI chips. That would mean that only the biggest tech companies — such as Microsoft, Google and Meta, all of which are based in the United States — could afford to build the leading technologies.

But DeepSeek’s engineers said they needed only about $6 million in raw computing power to train their new system. That was roughly 10 times less than what Meta spent building its latest AI technology.

How Did DeepSeek Make Its Tech With Fewer AI Chips?

Top AI engineers in the United States say that DeepSeek’s research paper laid out clever and impressive ways of building AI technology with fewer chips.

In short, the startup’s engineers demonstrated a more efficient way of analyzing data using the chips. Leading AI systems learn their skills by pinpointing patterns in huge amounts of data, including text, images and sounds. DeepSeek described a way of spreading this data analysis across several specialized AI models — what researchers call a “mixture of experts” method — while minimizing the time lost by moving data from place to place.

Others have used similar methods before, but moving information between the models tended to reduce efficiency. DeepSeek did this in a way that allowed it to use less computing power.

Is Deepseek’s Tech as Good as Systems From OpenAI and Google?

DeepSeek-V3 can answer questions, solve logic problems and write its own computer programs as effectively as anything already on the market, according to standard benchmark tests.

Just before DeepSeek released its technology, OpenAI had unveiled a new system, called OpenAI o3, which seemed more powerful than DeepSeek-V3. But OpenAI has not released this system to the wider public.

OpenAI o3 was designed to “reason” through problems involving math, science and computer programming. Many experts pointed out that DeepSeek had not built a reasoning model along these lines, which is seen as the future of AI.

Then, on Jan. 20, DeepSeek released its own reasoning model called DeepSeek R1, and it, too, impressed the experts. That eventually sent U.S. investors and others into a panic late last week and over the weekend as they realized the importance of DeepSeek’s new technology.

US Tech Giants Are Building Data Centers With Specialized AI Chips. Does This Still Matter, Given What DeepSeek Has Done?

Yes, it still matters.

Large numbers of AI chips can still help companies in many ways. With more chips, they can run more experiments as they explore new ways of building AI. In other words, more chips can still give companies a technical and competitive advantage.

Hasn’t the United States Limited the Number of Nvidia Chips Sold to China?

Yes. To maintain the U.S. lead in the global AI race, the Biden administration had put in place rules limiting the number of powerful chips that could be sold to China and other rivals.

Does Deepseek’s Tech Mean That China Is Now Ahead of the United States in AI?

No. The world has not yet seen OpenAI’s o3 model, and its performance on standard benchmark tests was more impressive than anything else on the market. But experts are concerned that China is jumping ahead on open-source AI systems.

What Exactly Is Open-Source AI?

Like many other companies, DeepSeek has “open sourced” its latest AI system, which means that it has shared the underlying computer code with other businesses and researchers. This allows others to build and distribute their own products using the same technologies.

This is part of the reason DeepSeek and others in China have been able to build competitive AI systems so quickly and inexpensively.

In the AI world, open source first gathered steam in 2023 when Meta freely shared an AI system called Llama. At the time, many assumed that the open-source ecosystem would flourish only if companies such as Meta — giant firms with huge data centers filled with specialized chips — continued to open source their technologies.

But DeepSeek and others have shown that this ecosystem can thrive in ways that extend beyond the American tech giants.

Why Is That Important?

Many experts have argued that the big American companies should not open source their technologies because they could be used to spread disinformation or cause other serious harm. Some U.S. lawmakers have explored the possibility of preventing or throttling the practice.

But other experts have argued that if regulators stifle the progress of open-source technology in the United States, China will gain a significant edge. If the best open-source technologies come from China, these experts argue, U.S. researchers and companies will build their systems atop those technologies.

In the long run, that could put China at the heart of AI research and development, which could further accelerate its effort to build a wide range of AI technologies, including autonomous weapons and other military systems.

This article originally appeared in The New York Times.

By Cade Metz/Stella Kalinina
c. 2025 The New York Times Company

RELATED TOPICS:

DON'T MISS

World Champion Russian Skaters on American Airlines Jet Built a New Life as Coaches in the US

DON'T MISS

Fresno County Confirms Two Flu Deaths While Nationwide Stats Rise

DON'T MISS

Kings County Children Found After Amber Alert Issued, Suspect in Custody

DON'T MISS

Amazon Sues to Block Release of Trade Secrets to Washington Post

DON'T MISS

President Trump Blames DEI and Biden for Crash Under Trump’s Watch

DON'T MISS

Reds Acquire Left-Handed Reliever Taylor Rogers From Giants

DON'T MISS

Driver Vanishes After Crash on Highway 41. CHP Says Hit-and-Runs on the Rise.

DON'T MISS

Gilgeous-Alexander Scores 52 but Curry and Wiggins Lead Balanced Warriors Past Thunder 116-109

DON'T MISS

Tulsi Gabbard, Trump’s Pick to Oversee US Spy Agencies, Grilled on Snowden, Syria and Russia

DON'T MISS

Rihanna Appears at Trial of A$AP Rocky and Outshines Key Testimony on Alleged Shooting

UP NEXT

Fresno County Confirms Two Flu Deaths While Nationwide Stats Rise

UP NEXT

Kings County Children Found After Amber Alert Issued, Suspect in Custody

UP NEXT

Amazon Sues to Block Release of Trade Secrets to Washington Post

UP NEXT

President Trump Blames DEI and Biden for Crash Under Trump’s Watch

UP NEXT

Reds Acquire Left-Handed Reliever Taylor Rogers From Giants

UP NEXT

Driver Vanishes After Crash on Highway 41. CHP Says Hit-and-Runs on the Rise.

UP NEXT

Gilgeous-Alexander Scores 52 but Curry and Wiggins Lead Balanced Warriors Past Thunder 116-109

UP NEXT

Tulsi Gabbard, Trump’s Pick to Oversee US Spy Agencies, Grilled on Snowden, Syria and Russia

UP NEXT

Rihanna Appears at Trial of A$AP Rocky and Outshines Key Testimony on Alleged Shooting

UP NEXT

FireAid, a Benefit for LA Wildfire Relief, Is Almost Here. Here’s How to Watch and Donate

Amazon Sues to Block Release of Trade Secrets to Washington Post

2 hours ago

President Trump Blames DEI and Biden for Crash Under Trump’s Watch

3 hours ago

Reds Acquire Left-Handed Reliever Taylor Rogers From Giants

3 hours ago

Driver Vanishes After Crash on Highway 41. CHP Says Hit-and-Runs on the Rise.

3 hours ago

Gilgeous-Alexander Scores 52 but Curry and Wiggins Lead Balanced Warriors Past Thunder 116-109

3 hours ago

Tulsi Gabbard, Trump’s Pick to Oversee US Spy Agencies, Grilled on Snowden, Syria and Russia

3 hours ago

Rihanna Appears at Trial of A$AP Rocky and Outshines Key Testimony on Alleged Shooting

4 hours ago

FireAid, a Benefit for LA Wildfire Relief, Is Almost Here. Here’s How to Watch and Donate

4 hours ago

Here Are Some of the Deadliest Plane Crashes in US History

4 hours ago

With Sweeping Executive Orders, Trump Tests Local Control of Schools

4 hours ago

World Champion Russian Skaters on American Airlines Jet Built a New Life as Coaches in the US

TALLINN, Estonia — The two Russian figure skating coaches killed in the American Airlines crash were two-time Olympians and former world cha...

5 minutes ago

5 minutes ago

World Champion Russian Skaters on American Airlines Jet Built a New Life as Coaches in the US

18 minutes ago

Fresno County Confirms Two Flu Deaths While Nationwide Stats Rise

The Kern County Sheriff's Office has located missing children Alana and Arya Maldonado safe and arrested Jonathan Maldonado-Cruz. (Kern County SO)
2 hours ago

Kings County Children Found After Amber Alert Issued, Suspect in Custody

2 hours ago

Amazon Sues to Block Release of Trade Secrets to Washington Post

President Donald Trump speaks to reporters aboard Air Force One en route from Miami to Joint Base Andrews, Md., Monday, Jan. 27, 2025. (AP/Mark Schiefelbein)
3 hours ago

President Trump Blames DEI and Biden for Crash Under Trump’s Watch

3 hours ago

Reds Acquire Left-Handed Reliever Taylor Rogers From Giants

A driver vanished after rolling a car on Freeway 41, highlighting growing concerns over hit-and-run incidents and their consequences. (CHP)
3 hours ago

Driver Vanishes After Crash on Highway 41. CHP Says Hit-and-Runs on the Rise.

3 hours ago

Gilgeous-Alexander Scores 52 but Curry and Wiggins Lead Balanced Warriors Past Thunder 116-109

Help continue the work that gets you the news that matters most.

Search

Send this to a friend