What Is China’s DeepSeek and Why Is It Freaking Out the AI World?

What Is China’s DeepSeek and Why Is It Freaking Out the AI World?

(Bloomberg) — DeepSeek, an AI startup just over a year old, stirred awe and consternation in Silicon Valley with its breakthrough artificial intelligence model that offered comparable performance to the world’s best chatbots at seemingly a fraction of the cost. Created in China’s Hangzhou, DeepSeek carries far-reaching implications for the global tech industry and supply chain, offering a counterpoint to the widespread belief that the future of AI will require ever-increasing amounts of power and energy to develop.

Most Read from Bloomberg

What exactly is DeepSeek?

DeepSeek is a Chinese startup founded in 2023 by Liang Wenfeng, the chief of AI-driven quant hedge fund High-Flyer. The company develops open-source AI models, and its eponymous mobile app surged to the top of the iPhone’s download charts in the US after its release in early January.

The DeepSeek app distinguishes itself from other chatbots like OpenAI’s ChatGPT by articulating its reasoning before delivering a response to a prompt. The company claims its R1 release offers performance on par with OpenAI’s latest and has granted license for individuals interested in developing chatbots on the technology to build on it.

How does DeepSeek R1 compare to OpenAI or Meta AI?

Though not fully detailed by the company, the cost of training and developing DeepSeek’s models appears to be only a fraction of what’s required for OpenAI or Meta Platforms Inc.’s best products. The much better efficiency of the model puts into question the need for vast expenditures of capital to acquire the latest and most powerful AI accelerators from the likes of Nvidia Corp. That also amplifies attention on US export curbs of such advanced semiconductors to China — which were intended to prevent a breakthrough of the sort that DeepSeek seems to represent.

DeepSeek R1 is near or better than rival models in several leading benchmarks such as AIME 2024 for mathematical tasks, MMLU for general knowledge and AlpacaEval 2.0 for question-and-answer performance. It also ranks among the top performers on a UC Berkeley-affiliated leaderboard called Chatbot Arena.

What’s raising alarm in the US?

Washington has banned the export of high-end technologies like GPU semiconductors to China, in a bid to stall the country’s advances in AI, the pivotal frontier in the US-China contest for tech supremacy. But DeepSeek’s progress suggests Chinese AI engineers have worked their way around the restrictions, focusing on greater efficiency with limited resources. While it remains unclear how much advanced AI-training hardware DeepSeek has had access to, the company’s demonstrated enough to suggest the trade restrictions have not been entirely effective in stymying China’s progress.

Source link

Visited 1 times, 1 visit(s) today

Leave a Reply

Your email address will not be published. Required fields are marked *