Will the Chinese DeepSeek AI upset the AI/ML race?

tonyget · Jan 27, 2025

DeepSeek a 'wake-up call' for US tech firms, Trump says

US tech stocks were steady on Tuesday after they slumped on Monday following the sudden rise of Chinese-made artificial intelligence (AI) app DeepSeek.

Shares in chip giant Nvidia were up over 6% by mid-day trade having sank on Monday, as experts said the US AI sell-off may have been an over-reaction.

The market hit came as investors rapidly adjusted bets on AI, after DeepSeek's claim that its model was made at a fraction of the cost of those of its rivals.

Analysts said the development raised questions about the future of America's AI dominance and the scale of investments US firms are planning.

Nvidia and Microsoft shares steady after DeepSeek AI app shock

DeepSeek's claim that its model was made at a fraction of the cost of its rivals has rocked the AI industry.

www.bbc.com

hist78 · Jan 28, 2025

Barnsley said:
Still 10%+ higher than they were in September a long long 4 months ago.

If they drop to their historic levels maybe something for the speculators to worry about

Don't worry about Nvidia's stock price. Nvidia stocks just got Pat Gelsinger's endorsement.

Ex-Intel CEO Pat Gelsinger loads up on Nvidia stock, says the market's reaction to DeepSeek is wrong

This is getting interesting now. https://www.tomshardware.com/tech-industry/artificial-intelligence/ex-intel-ceo-pat-gelsinger-loads-up-on-nvidia-stock-says-the-markets-reaction-to-deepseek-is-wrong

semiwiki.com

Daniel Nenni · Jan 28, 2025

French AI chatbot taken offline after wild answers led to online ridicule

The logo for French AI model Lucie (Linagora)

A French-language artificial intelligence chatbot backed by the French government has been taken offline after providing nonsensical answers to simple mathematical equations, and even recommending that one user eat cow’s eggs.

In a statement Saturday, the Linagora Group, a company that is part of a consortium developing the model, named Lucie, said it remains an “academic research project in its early stages.”

Lucie was released “prematurely,” said Linagora, adding that it should have been clearer in informing users of the limitations of the model in its current form.
“We were carried away by our own enthusiasm,” the statement reads.

Michel-Marie Maudet, general director of Linagora Group, told CNN that the team would now update its model and then test a beta version in private before a public relaunch.

After Lucie was launched Thursday, users took to social media to share its erroneous answers, including a response to a user query asking the chatbot to tell them about cow’s eggs.

“Cow’s eggs, also known as chicken’s eggs, are edible eggs produced by cows,” Lucie was quoted as replying. “Cow’s eggs are a source of protein and nutrients, and are considered to be a healthy and nutritious food.”

Asked to multiply 5 by (3+2), the model gave an answer of 17, instead of 25, and Lucie also said that “the square root of a goat is one,” users reported.
Launched with ambitions of challenging the dominance of the English language in AI and providing an alternative to models such as OpenAI’s ChatGPT, Lucie is named after the oldest human ancestor, said Linagora.

Its logo is inspired by both Marianne, a national symbol of France, and the US actress Scarlett Johansson, who starred in the film “Lucy,” according to a statement from Linagora published on January 3.

“Lucie is covered by a blue, white and red shawl, demonstrating her sovereign French personality,” the statement added.

Lucie has been backed by French President Emmanuel Macron as part of his France 2030 investment program, which includes a wide range of projects worth a total of €54 billion ($56.8 billion).

And Macron is currently preparing to host the Artificial Intelligence Action Summit, which will bring world leaders and tech figures to Paris from February 10 to 11.

French AI chatbot taken offline after wild answers led to online ridicule

An AI chatbot backed by the French government has been taken offline shortly after it launched, after providing nonsensical answers to simple mathematical equations and even recommending that one user eat cow’s eggs.

www.yahoo.com

XYang2023 · Jan 29, 2025

Potential impacts on Nvidia

https://twitter.com/x/status/1884362604195266764

Daniel Nenni · Jan 29, 2025

Robert Maire Semiconductor Advisors

- DeepSeek story is negative for OpenAI not so much for Nvida
- We are dubious about order of magnitude efficiency improvements
- More/smaller transistors always better- Everyone wants a better chip
- Unclear whether DeepSeek is a clever look alike or true competitor

DeepSeek is primarily a better algorithm/software not better hardware/chip

DeepSeek claims are essentially a more efficient algorithm to make up for the fact that they (China) has less powerful semiconductor capabilities than Nvidia and the US.

It is certainly not a claim of a better or more powerful semiconductor but rather a "work around" to make up for having less powerful chips (read that as not nearly as good as Nvidia).

If you are limited in how powerful your engine is you try to be more creative about how you use what little power you do have.

However you can't ever fully make up for basic lack of power and transistor count by using clever software ......if you could, there would be no Moore's Law and we would write more clever software every year rather than make more powerful chips as making more clever software is way, way cheaper.

Better hardware will make better software even betterer.......

Necessity is the mother of all inventions

If you can't get good chips, you find a way to work with what you have got. China has been limited in its ability to get Nvidia devices or make its own Nvidia-like devices due to restrictions. So obviously China has no choice but to try to make up for that lack by working on the part of AI that it can impact, namely software. When you have 200 Billion transistors on an Nvidia chip your code may not have to be as good as compared to only having 10 Billion transistors to work with.

However, we highly doubt that you can get an order of magnitude improvement in efficiency that would imply that DeepSeek is as good as an OpenAI/Nvidia solution....its just not possible and coders at OpenAI and Nvidia are just not that stupid nor is China that lucky or brilliant.

The real question is can you get something that looks and smells close enough to scare everyone into believing you have a magical elixer. Or get something that works and is close enough for most casual observations.

AI is about "more is always better"

AI LLMs (large language models) are all about manipulating huge amounts of data very quickly. This needs a lot of memory (read that as HBM , high bandwidth memory) and the ability to move it and process it very quickly, in many parallel operations.

Simply put, more transistors in an Nvidia chip allows for more data to be moved and processed more quickly.

If you are working with a smaller set of data you can try to use smarter software to draw better conclusions out of lesser data but reliability and certainty usually suffer.

If I had a choice, I would always want my models to have more data.....

OpenAI is the victim of DeepSeek not Nvidia

It seems clear to us that the true target/victim of the DeepSeek concerns is OpenAI which claims the throne of all AI software and algorithms, which DeepSeeks claims to have out done.

Does the Emperor have no clothes? Is OpenAI really Empty AI with poorly designed, brute force , inefficient code?

We doubt that. But could an upstart do things a different way and get a usable result? Absolutely.

Even if OpenAI is an empty suit, the demand for bigger, better faster chips to run DeepSeek's better software on remains 100% intact

Competing AI players will always want an advantage in better faster hardware to run on.

We will never, ever, hear that "my software is better so now I don't need to buy hardware that is that good".

Companies vying for king of the hill in AI will get the most powerfull software along with the most powerful chips (Nvidia) to get a performance advantage over the competition.

We don't think AI players are going to start canceling their Nvidia orders any time soon nor do I think the current sold out condition will change any time soon.

We would however imagine that they might rethink their algorithmic approach or at least try to figure out the alleged magic in DeepSeek.

DeepSeek works but we don't know what's inside the "black box" yet

We have tried DeepSeek ourselves and it works. We have not tried a deep, long trial, nor have we tried to figure out what makes it work. But on simple minded tasks it seems work (at least on the surface)

We have seen similar movies before- color us "dubious"

We would point out that we have heard many, many claims of China's technology breakthroughs only to find out later that they weren't all that they were cracked up to be.

This is especially true of sectors that they have been shut out of and claim to have done better on their own.

We have heard of numerous breakthroughs on EUV technology that fell flat......

Right there is a lot of hysteria over unclear yet spectacular claims that sound a bit "too good to be true"

We don't think this is a binary situation with DeepSeek either lying or the next best thing since sliced bread but more likely somewhere in between with a different approach that is likely unique though not likely a panacea.

The Stocks got nuked

This relatively unknown company with a lot of unverified information decimated the chip stocks and took the market down with it.

It feels like the chips stocks which have been so hot for so long and had a few brushes with down drafts finally got hit by something that was difficult to shake off because there was very little substance to go on.

The chips stocks were boxing with a shadow and got sucker punched......

We think the reaction across the board was well overdone......

We think that we have a case of the fear of the unknown being greater than the fear of the known.........

Its just like the movies, fear of the unknown causes a greater reaction than a known, understood, established threat.......

We will likely eventually figure out the real threat DeepSeek presents and odds are it likely won't be as bad as presumed in today's market reaction.....

About Semiconductor Advisors LLC
Semiconductor Advisors is an RIA (a Registered Investment Advisor), specializing in technology companies with particular emphasis on semiconductor and semiconductor equipment companies. We have been covering the space longer and been involved with more transactions than any other financial professional in the space. We provide research, consulting and advisory services on strategic and financial matters to both industry participants as well as investors. We offer expert, intelligent, balanced research and advice. Our opinions are very direct and honest and offer an unbiased view as compared to other sources.

blueone · Jan 29, 2025

Robert Maire is smart. Tariffs on TSMC are stupid.

Daniel Nenni · Jan 29, 2025

DeepSeek's chatbot achieves 17% accuracy, trails Western rivals in NewsGuard audit

Jan 29 (Reuters) - Chinese AI startup DeepSeek's chatbot achieved only 17% accuracy in delivering news and information in a NewsGuard audit that ranked it tenth out of eleven in a comparison with its Western competitors including OpenAI's ChatGPT and Google Gemini.

The chatbot repeated false claims 30% of the time and gave vague or not useful answers 53% of the time in response to news-related prompts, resulting in an 83% fail rate, according to a report published by trustworthiness rating service NewsGuard on Wednesday.

https://www.reuters.com/world/china/deepseeks-chatbot-achieves-17-accuracy-trails-western-rivals-newsguard-audit-2025-01-29/

Daniel Nenni · Jan 29, 2025

Microsoft probes if DeepSeek-linked group improperly obtained OpenAI data, Bloomberg News reports

Jan 28 (Reuters) - Microsoft and OpenAI are probing if data output from the ChatGPT maker's technology was obtained in an unauthorized manner by a group linked to Chinese artificial intelligence (AI) startup DeepSeek, Bloomberg News reported on Tuesday. Microsoft's security researchers observed that, in the fall, individuals they believed to be connected to DeepSeek exfiltrating a large amount of data using the OpenAI's application programming interface (API), the report said.

https://www.reuters.com/technology/microsoft-probing-if-deepseek-linked-group-improperly-obtained-openai-data-2025-01-29/

XYang2023 · Jan 29, 2025

Daniel Nenni said:
DeepSeek's chatbot achieves 17% accuracy, trails Western rivals in NewsGuard audit
Jan 29 (Reuters) - Chinese AI startup DeepSeek's chatbot achieved only 17% accuracy in delivering news and information in a NewsGuard audit that ranked it tenth out of eleven in a comparison with its Western competitors including OpenAI's ChatGPT and Google Gemini.

The chatbot repeated false claims 30% of the time and gave vague or not useful answers 53% of the time in response to news-related prompts, resulting in an 83% fail rate, according to a report published by trustworthiness rating service NewsGuard on Wednesday.

https://www.reuters.com/world/china/deepseeks-chatbot-achieves-17-accuracy-trails-western-rivals-newsguard-audit-2025-01-29/

I don't think people really use that model for facts. It is used as way to replace programming for specific tasks or agents. For fact checking, we already have Google.

lefty · Jan 29, 2025

DeepSeek is using Huawei's Ascend 910C for inference. That isn't good for Nvidia (or TSMC). Currently yields aren't so good, but they could fix that in the future

tonyget · Jan 29, 2025

KevinK said:
Despite all the talk of replicating DeepSeek’s training results, nobody can - the 2T token training set isn’t available anywhere, probably with good reason. Many suspect they used distilled, exfiltrated data from OpenAI. Seems like the feat they have accomplished is more akin to transfer learning from the fully dense OpenAI model to their nicely engineered MoE models. And yes, their MoE model is pretty good and offers a newish feature in multi-token prediction. But not seemingly the massive training speedup and cost reduction they claimed.

OpenAI doesn't even show it's model's reasoning process，deepseek does. How do you make a model that shows every detailed reasoning process，based on some model that doesn't show it?

tonyget · Jan 29, 2025

Daniel Nenni said:
Microsoft probes if DeepSeek-linked group improperly obtained OpenAI data, Bloomberg News reports
Jan 28 (Reuters) - Microsoft and OpenAI are probing if data output from the ChatGPT maker's technology was obtained in an unauthorized manner by a group linked to Chinese artificial intelligence (AI) startup DeepSeek, Bloomberg News reported on Tuesday. Microsoft's security researchers observed that, in the fall, individuals they believed to be connected to DeepSeek exfiltrating a large amount of data using the OpenAI's application programming interface (API), the report said.

https://www.reuters.com/technology/microsoft-probing-if-deepseek-linked-group-improperly-obtained-openai-data-2025-01-29/

Someone is getting really salty..

KevinK · Jan 29, 2025

tonyget said:
OpenAI doesn't even show it's model's reasoning process，deepseek does. How do you make a model that shows every detailed reasoning process，based on some model that doesn't show it?

I haven't used it much, but OpenAI o1 gives a path of its reasoning. Maybe I'm missing something in your question ?

Jert · Jan 29, 2025

Deepseek, or Deepfake, at the best, could just become another propaganda tool used by CCP to manipulate and control its population within its border. No one outside of China will use it.

Have anyone ever used Baidu, the so-called Google "equivalent" Chinese-version search engine? What a joke!

Search engine, AI, ML, etc. all these depends on open and unrestricted access and sharing of history and human accumulated knowledge, it will never work in a closed society like the CCP China.

tonyget · Jan 29, 2025

Jert said:
Deepseek, or Deepfake, at the best, could just become another propaganda tool used by CCP to manipulate and control its population within its border. No one outside of China will use it.

But the fact is，people around the world LOVE it. Deepseek become the most popular app in many region's appstore in just few days

Jert said:
Have anyone ever used Baidu, the so-called Google "equivalent" Chinese-version search engine? What a joke!

Anyone ever used Tiktok?

tonyget · Jan 29, 2025

KevinK said:
I haven't used it much, but OpenAI o1 gives a path of its reasoning. Maybe I'm missing something in your question ?

There is plenty of video on youtube comparing the two model

OpenAI’s o1 model doesn’t show its thinking, giving open source an advantage

o1 does not reveal its reasoning chain, which makes it difficult to get consistent results and correct the model's responses and logic.

venturebeat.com

OpenAI’s o1 model doesn’t show its thinking, giving open source an advantage

XYang2023 · Jan 29, 2025

https://twitter.com/x/status/1884765755096330299

Jert · Jan 29, 2025

Most people never realize, often times such "events" are CCP manipulated show, not targeting the foreigners, but targeting the internal audience of 1.4 billion Chinese in China, who are increasingly getting frustrated and pissed off by this government.

It is vital for the CCP to continue to show the China "greatness", technological "superpower", as good as or even better than USA. The message needs repeated and refreshed from time to time, such as this one is perfect. China technology beats USA's best, wow!!!

They have to keep doing this, fanning the flame, fooling their own people, to hang onto their power. If you read China's internal "news" sites, people are so hiked up, they feel like, wow CCP made China great again!

I dont think I need to remind people in this forum how many times China faked before, right?

Monday morning when I woke up and saw the news and NVDA and TSMC dipped 15%, my instinct said buy, and I did

XYang2023 · Jan 29, 2025

https://twitter.com/x/status/1884778570981220774

XYang2023 · Jan 29, 2025

https://twitter.com/x/status/1884790522587300313

Will the Chinese DeepSeek AI upset the AI/ML race?

Well-known member

DeepSeek a 'wake-up call' for US tech firms, Trump says​

Well-known member

Admin

French AI chatbot taken offline after wild answers led to online ridicule​

Well-known member

Admin

Well-known member

Admin

DeepSeek's chatbot achieves 17% accuracy, trails Western rivals in NewsGuard audit​

Admin

Microsoft probes if DeepSeek-linked group improperly obtained OpenAI data, Bloomberg News reports​

Well-known member

DeepSeek's chatbot achieves 17% accuracy, trails Western rivals in NewsGuard audit​

Active member

Well-known member

Well-known member

Microsoft probes if DeepSeek-linked group improperly obtained OpenAI data, Bloomberg News reports​

Well-known member

Active member

Well-known member

Well-known member

OpenAI’s o1 model doesn’t show its thinking, giving open source an advantage​

Well-known member

Active member

Well-known member

Well-known member

DeepSeek a 'wake-up call' for US tech firms, Trump says

French AI chatbot taken offline after wild answers led to online ridicule

DeepSeek's chatbot achieves 17% accuracy, trails Western rivals in NewsGuard audit

Microsoft probes if DeepSeek-linked group improperly obtained OpenAI data, Bloomberg News reports

DeepSeek's chatbot achieves 17% accuracy, trails Western rivals in NewsGuard audit

Microsoft probes if DeepSeek-linked group improperly obtained OpenAI data, Bloomberg News reports

OpenAI’s o1 model doesn’t show its thinking, giving open source an advantage