
How China’s New AI Model DeepSeek Is Threatening U.S. Dominance

XYang2023

The video contains a 30-minute interview with Perplexity CEO Aravind Srinivas, which I found to be very informative.

 
You can test the model on your own computer: download and install Ollama, then enter ollama run deepseek-r1 on the command line. It should download and run the 7B model.

You can also test different sizes or different models from the library: https://ollama.com/library/deepseek-r1

I think this model is pretty good, but size is still a limiting factor, at least for personal (local) use. I spent 30 minutes trying to get it to fix one function, unsuccessfully. Copilot (o1) fixed the same issue instantly (literally just a "fix it" prompt). But again, that is more an issue of size, and the 600B+ model would probably do better...
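Beyond the interactive CLI, you can also script against the local model. A minimal sketch using the ollama Python package (assumes Ollama is installed and the deepseek-r1 model has already been pulled; the prompt is just an example):

    import ollama

    # Ask the locally running deepseek-r1 model (the default tag is a 7B
    # distill) a question through the Ollama Python client.
    response = ollama.chat(
        model="deepseek-r1",
        messages=[{"role": "user", "content": "Explain the KV cache in two sentences."}],
    )

    # The reply text lives under message.content; R1-style models prepend
    # their reasoning monologue inside <think>...</think> tags.
    print(response["message"]["content"])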
 
Ollama distributes quantized versions. With quantization, perplexity increases. But as you said, it could be due to model size:

[attached image: perplexity comparison across quantization levels]


14B @ 4-bit -> ~12 GB graphics memory requirement. The Intel B580 should work for the 14B model size.

32B @ 4-bit -> ~24 GB graphics memory requirement.
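Those numbers follow from simple arithmetic: at 4-bit quantization each parameter takes roughly half a byte, plus room for the KV cache and runtime buffers. A back-of-envelope sketch (the 1.5x overhead factor is my own assumption, not a measured figure):

    # Rule-of-thumb VRAM estimate for running a quantized model locally.
    # overhead multiplies raw weight size to cover KV cache, activations,
    # and runtime buffers; 1.5x is a rough assumption.
    def vram_gb(params_b: float, bits: int = 4, overhead: float = 1.5) -> float:
        weight_gb = params_b * bits / 8  # 14B @ 4-bit -> 7 GB of raw weights
        return weight_gb * overhead

    print(f"14B @ 4-bit: ~{vram_gb(14):.1f} GB")  # ~10.5 GB; ~12 GB quoted above
    print(f"32B @ 4-bit: ~{vram_gb(32):.1f} GB")  # ~24.0 GB, matching the figure above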

 
I really hope Intel improves its efforts to promote its GPUs. I have many ideas it could consider, but I don't work for Intel, and it should strive to do better in this area.
You might consider posting your ideas on the intel.com community forum for graphics products, which includes the B580. Perhaps someone from Intel will see the value in them and contact you.

 

I’m not sure. I’ve made several videos about the B580 and tagged @Intel and @MJHolthaus, but I haven’t received any direct feedback from Intel.

One of my videos has thousands of views, clearly showing that many people are interested in using the B580 for machine learning and AI. I genuinely feel that, instead of sampling the cards to some YouTubers, Intel could consider sending samples to me or my school. We could test them and provide valuable feedback.

I really hope Intel's marketing team becomes more proactive in addressing the market and takes steps to actively prepare for Falcon Shores.

I work in a university robotics lab, and we use quite a lot of GPUs.
 

I don't understand why this is a threat. And who said the US is dominant? Because you read it on the internet?

Who at Intel did you contact? I may be able to help. Send me private email through SemiWiki.
 
Thank you. I’ll think about that. I believe the lab I work in is definitely open to collaborations.

Recently, I tagged Intel and MJ on Twitter in a post suggesting they should do more to promote the B580 to the ML/AI audience. I also shared a video I created analyzing the B580 for machine learning purposes. I felt this should be Intel’s responsibility, not mine. Today, I tagged Intel and MJ again with a new video.

Additionally, I tagged Robert Hallock on Bluesky regarding AMD's misleading tweet about the Linus/Jimmy Fallon show. I suggested he consider being more active on X (formerly Twitter) since misinformation can spread quickly there. I also mentioned that Intel should work to address and correct such issues.

However, it feels like this communication is entirely one-sided...
 
The Ollama deepseek-r1 model is a distilled version; it's not the DeepSeek-V3-based R1. The name chosen by Ollama is very misleading.
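If you want to check which variant you actually pulled, the Ollama Python client can report a model's details. A small sketch (field names as returned by recent versions of the ollama package; treat them as assumptions if your version differs):

    import ollama

    # Inspect the locally pulled model. The plain "deepseek-r1" tag resolves
    # to a distilled ~7B variant, not the full 671B DeepSeek-R1.
    info = ollama.show("deepseek-r1")

    details = info["details"]
    print(details["family"])              # base-model family of the distill
    print(details["parameter_size"])      # e.g. "7.6B"
    print(details["quantization_level"])  # e.g. "Q4_K_M"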
 
I think it’s fine. My understanding is that R1 stands for Reasoning model 1. Depending on the parameter size, the base models vary. I used the benchmark table to select the model, which I discussed in my video.

[attached image: benchmark table for the DeepSeek-R1 distilled models]
 
The model's repeated monologue responses wear on me quickly.
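For what it's worth, the distilled R1 builds served through Ollama wrap that monologue in <think> tags, so it can be filtered out before display. A minimal sketch (assumes the model consistently emits <think>...</think> around its reasoning):

    import re

    def strip_reasoning(text: str) -> str:
        """Drop the model's <think>...</think> monologue, keeping only the answer."""
        return re.sub(r"<think>.*?</think>", "", text, flags=re.DOTALL).strip()

    sample = "<think>Let me reason this out...</think>\nThe answer is 42."
    print(strip_reasoning(sample))  # -> "The answer is 42."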
 