Please ensure Javascript is enabled for purposes of website accessibility
What to Know About DeepSeek and How It Is Upending Artificial Intelligence
d8a347b41db1ddee634e2d67d08798c102ef09ac
By The New York Times
Published 18 hours ago on
January 29, 2025

Nvidia’s chief executive, Jensen Huang, speaks at CES 2025 in Las Vegas, Jan. 7, 2025. DeepSeek’s engineers said they needed only about 2,000 Nvidia chips to train the startup’s AI system. (Stella Kalinina/The New York Times)

Share

Getting your Trinity Audio player ready...

SAN FRANCISCO — Tech stocks tumbled. Giant companies such as Meta and Nvidia faced a barrage of questions about their future. And tech executives took to social media to proclaim their fears.

And it was all because of a little-known Chinese artificial intelligence startup called DeepSeek.

DeepSeek caused waves all over the world Monday as one of its accomplishments — having created a very powerful AI model with far less money than many AI experts thought possible — raised a host of questions, including whether U.S. companies were even competitive in AI anymore.

DeepSeek is “AI’s Sputnik moment,” Marc Andreessen, a tech venture capitalist, posted on social media Sunday.

How could a company that few people had heard of have such an effect?

What Is DeepSeek?

DeepSeek is a startup founded and owned by the Chinese stock trading firm High-Flyer. Its goal is to build AI technologies along the lines of OpenAI’s ChatGPT chatbot or Google’s Gemini. By 2021, DeepSeek had acquired thousands of computer chips from the U.S. chipmaker Nvidia, which are a fundamental part of any effort to create powerful AI systems.

In China, the startup is known for grabbing young and talented AI researchers from top universities, promising high salaries and an opportunity to work on cutting-edge research projects. Both High-Flyer and DeepSeek are run by Liang Wenfeng, a Chinese entrepreneur.

Over the past few years, DeepSeek has released several large language models, which is the kind of technology that underpins chatbots such as ChatGPT and Gemini. On Jan. 10, it released its first free chatbot app, which was based on a new model called DeepSeek-V3.

Why Did the Stock Market React to It Now?

When DeepSeek introduced its DeepSeek-V3 model the day after Christmas, it matched the abilities of the best chatbots from U.S. companies such as OpenAI and Google. That alone would have been impressive.

But the team behind the new system also revealed a bigger step forward. In a research paper explaining how it built the technology, DeepSeek said it used only a fraction of the computer chips that leading AI companies relied on to train their systems.

The world’s top companies typically train their chatbots with supercomputers that use as many as 16,000 chips or more. DeepSeek’s engineers said they needed only about 2,000 Nvidia chips.

Why Is That Important?

Since late 2022, when OpenAI set off the AI boom, the prevailing notion had been that the most powerful AI systems could not be built without investing billions of dollars in specialized AI chips. That would mean that only the biggest tech companies — such as Microsoft, Google and Meta, all of which are based in the United States — could afford to build the leading technologies.

But DeepSeek’s engineers said they needed only about $6 million in raw computing power to train their new system. That was roughly 10 times less than what Meta spent building its latest AI technology.

How Did DeepSeek Make Its Tech With Fewer AI Chips?

Top AI engineers in the United States say that DeepSeek’s research paper laid out clever and impressive ways of building AI technology with fewer chips.

In short, the startup’s engineers demonstrated a more efficient way of analyzing data using the chips. Leading AI systems learn their skills by pinpointing patterns in huge amounts of data, including text, images and sounds. DeepSeek described a way of spreading this data analysis across several specialized AI models — what researchers call a “mixture of experts” method — while minimizing the time lost by moving data from place to place.

Others have used similar methods before, but moving information between the models tended to reduce efficiency. DeepSeek did this in a way that allowed it to use less computing power.

Is Deepseek’s Tech as Good as Systems From OpenAI and Google?

DeepSeek-V3 can answer questions, solve logic problems and write its own computer programs as effectively as anything already on the market, according to standard benchmark tests.

Just before DeepSeek released its technology, OpenAI had unveiled a new system, called OpenAI o3, which seemed more powerful than DeepSeek-V3. But OpenAI has not released this system to the wider public.

OpenAI o3 was designed to “reason” through problems involving math, science and computer programming. Many experts pointed out that DeepSeek had not built a reasoning model along these lines, which is seen as the future of AI.

Then, on Jan. 20, DeepSeek released its own reasoning model called DeepSeek R1, and it, too, impressed the experts. That eventually sent U.S. investors and others into a panic late last week and over the weekend as they realized the importance of DeepSeek’s new technology.

US Tech Giants Are Building Data Centers With Specialized AI Chips. Does This Still Matter, Given What DeepSeek Has Done?

Yes, it still matters.

Large numbers of AI chips can still help companies in many ways. With more chips, they can run more experiments as they explore new ways of building AI. In other words, more chips can still give companies a technical and competitive advantage.

Hasn’t the United States Limited the Number of Nvidia Chips Sold to China?

Yes. To maintain the U.S. lead in the global AI race, the Biden administration had put in place rules limiting the number of powerful chips that could be sold to China and other rivals.

Does Deepseek’s Tech Mean That China Is Now Ahead of the United States in AI?

No. The world has not yet seen OpenAI’s o3 model, and its performance on standard benchmark tests was more impressive than anything else on the market. But experts are concerned that China is jumping ahead on open-source AI systems.

What Exactly Is Open-Source AI?

Like many other companies, DeepSeek has “open sourced” its latest AI system, which means that it has shared the underlying computer code with other businesses and researchers. This allows others to build and distribute their own products using the same technologies.

This is part of the reason DeepSeek and others in China have been able to build competitive AI systems so quickly and inexpensively.

In the AI world, open source first gathered steam in 2023 when Meta freely shared an AI system called Llama. At the time, many assumed that the open-source ecosystem would flourish only if companies such as Meta — giant firms with huge data centers filled with specialized chips — continued to open source their technologies.

But DeepSeek and others have shown that this ecosystem can thrive in ways that extend beyond the American tech giants.

Why Is That Important?

Many experts have argued that the big American companies should not open source their technologies because they could be used to spread disinformation or cause other serious harm. Some U.S. lawmakers have explored the possibility of preventing or throttling the practice.

But other experts have argued that if regulators stifle the progress of open-source technology in the United States, China will gain a significant edge. If the best open-source technologies come from China, these experts argue, U.S. researchers and companies will build their systems atop those technologies.

In the long run, that could put China at the heart of AI research and development, which could further accelerate its effort to build a wide range of AI technologies, including autonomous weapons and other military systems.

This article originally appeared in The New York Times.

By Cade Metz/Stella Kalinina
c. 2025 The New York Times Company

RELATED TOPICS:

DON'T MISS

DEI Will Not Be Missed

DON'T MISS

FACT FOCUS: No Evidence That $50 Million Was Designated by the US to Buy Condoms for Hamas

DON'T MISS

Community Health System Announces $30M Milestone for Neuroscience Institute

DON'T MISS

Visalia Man Arrested on Child Pornography Charge

DON'T MISS

Eagles’ Victory Celebration Turns Tragic for Temple Student

DON'T MISS

Mayor Dyer Addresses Police Chief Search, Immigration Raids, High-Speed Rail

DON'T MISS

Fed Holds Rates Steady, Hitting Pause After a Series of Cuts

DON'T MISS

Senate Confirms Zeldin to Lead EPA as Trump Vows to Cut Climate Rules

DON'T MISS

Clovis Is Rewarding Diners for Eating and Drinking Local

DON'T MISS

How Much Rain Will Fresno Get From Storms Slamming NorCal?

UP NEXT

FACT FOCUS: No Evidence That $50 Million Was Designated by the US to Buy Condoms for Hamas

UP NEXT

Community Health System Announces $30M Milestone for Neuroscience Institute

UP NEXT

Visalia Man Arrested on Child Pornography Charge

UP NEXT

Eagles’ Victory Celebration Turns Tragic for Temple Student

UP NEXT

Mayor Dyer Addresses Police Chief Search, Immigration Raids, High-Speed Rail

UP NEXT

Fed Holds Rates Steady, Hitting Pause After a Series of Cuts

UP NEXT

Senate Confirms Zeldin to Lead EPA as Trump Vows to Cut Climate Rules

UP NEXT

Clovis Is Rewarding Diners for Eating and Drinking Local

UP NEXT

How Much Rain Will Fresno Get From Storms Slamming NorCal?

UP NEXT

Trump’s Orders Aim at Critical Race Theory and Antisemitism on Campuses

Visalia Man Arrested on Child Pornography Charge

10 hours ago

Eagles’ Victory Celebration Turns Tragic for Temple Student

10 hours ago

Mayor Dyer Addresses Police Chief Search, Immigration Raids, High-Speed Rail

11 hours ago

Fed Holds Rates Steady, Hitting Pause After a Series of Cuts

11 hours ago

Senate Confirms Zeldin to Lead EPA as Trump Vows to Cut Climate Rules

11 hours ago

Clovis Is Rewarding Diners for Eating and Drinking Local

12 hours ago

How Much Rain Will Fresno Get From Storms Slamming NorCal?

12 hours ago

Trump’s Orders Aim at Critical Race Theory and Antisemitism on Campuses

13 hours ago

At Signing of Laken Riley Act, Trump Says He Plans to Send Migrants in US Illegally to Guantanamo

13 hours ago

Authorities Seize $160K, 100 Pounds of Marijuana in Merced County Traffic Stop

13 hours ago

DEI Will Not Be Missed

Bret Stephens Opinion Jan. 28, 2025 In December 2015, the Obama administration decided to allow women to serve in all combat roles. “There w...

9 hours ago

Soldiers at the Army’s jungle training school on Oahu, in Hawaii, practice tactical movements in the pouring rain, Nov. 28, 2023. (Mark Abramson/The New York Times)
9 hours ago

DEI Will Not Be Missed

10 hours ago

FACT FOCUS: No Evidence That $50 Million Was Designated by the US to Buy Condoms for Hamas

10 hours ago

Community Health System Announces $30M Milestone for Neuroscience Institute

10 hours ago

Visalia Man Arrested on Child Pornography Charge

10 hours ago

Eagles’ Victory Celebration Turns Tragic for Temple Student

11 hours ago

Mayor Dyer Addresses Police Chief Search, Immigration Raids, High-Speed Rail

The Federal Reserve building in Washington, Nov 3, 2024. The Federal Reserve is set to stand pat at its first gathering of 2025, pressing pause on interest rate cuts as policymakers take stock of how the world’s largest economy is faring. (Anna Rose Layden/The New York Times)
11 hours ago

Fed Holds Rates Steady, Hitting Pause After a Series of Cuts

11 hours ago

Senate Confirms Zeldin to Lead EPA as Trump Vows to Cut Climate Rules

Help continue the work that gets you the news that matters most.

Search

Send this to a friend