Zhang Pingan: 5nm and 7nm are not the core. Huawei's computing power is already three times that of Nvidia chips.

Fred Chen · Oct 11, 2025

According to Fast Technology on October 4, in the view of Zhang Pingan, Huawei's executive director and CEO of Huawei Cloud, chip process is not the core, what customers really need is high-quality computing results.

Recently, Zhang Pingan publicly stated that Huawei Cloud Service has achieved a breakthrough in computing power and efficiency, and its production efficiency has reached three times that of Nvidia's H20 chip.

"The chip process (such as 5nm and 7nm) is not the core. What customers really need is high-quality computing results." Zhang Pingan introduced that through technological innovation, Huawei Cloud Service has achieved the ability to generate 2,400 tokens per second on a single card with a latency of 50 milliseconds.

Currently, Huawei's Ascend Cloud service not only supports our own large-scale Pangu models, but also fully supports third-party models such as DeepSeek and Kimi. "We hope that all large-scale models will run faster and better on Ascend Cloud," said Zhang Pingan.

Zhang Pingan pointed out that China's "computing power hub" is gradually becoming an AI computing power center for global customers. During Huawei's overseas expansion, it discovered that while intelligent computing centers in China all utilize liquid cooling technology, liquid cooling data centers overseas are still relatively rare. Renovating overseas data centers is not only time-consuming, but also lacks sufficient fiber optic network bandwidth.

Zhang Pingan finally emphasized that according to regulations, every cloud vendor must publicly disclose any major incidents online, and Huawei Cloud has maintained a "zero major incident record" for 756 consecutive days. "We are confident that we can maintain this achievement," he said.

张平安：5、7nm并非核心华为算力能力已超英伟达芯片3倍

快科技10月4日消息，在华为常务董事、华为云CEO张平安看来，芯片制程并非核心，客户真正需要的是优质的计算结果。近日，张平安公开表示，华为云服务在算力效能方面实现突破，其生产效率已达到

news.mydrivers.com

LLL0955 · Oct 12, 2025

If the chip technology is not the key, why do not just use SMIC 14nm technology which must have higher yield and low cost compared to SMIC 7nm lol ?

Fred Chen · Oct 12, 2025

LLL0955 said:
If the chip technology is not the key, why do not just use SMIC 14nm technology which must have higher yield and low cost compared to SMIC 7nm lol ?

Yes, I was actually thinking the same

DanX · Oct 12, 2025

Didn't the Pangu involve in a scandal alleging it copied Alibaba's (technologies)?
Huawei is really the expert of managing up.

DanX · Oct 12, 2025

Fred Chen said:
Zhang Pingan finally emphasized that according to regulations, every cloud vendor must publicly disclose any major incidents online, and Huawei Cloud has maintained a "zero major incident record" for 756 consecutive days. "We are confident that we can maintain this achievement," he said.

If you do nothing, you can have "zero major incident record" forever.

ai268 · Oct 12, 2025

What China hope is that "Apple II is faster than Cray YMP". Anyone read that paper, published almost 40 years ago? So, why Apple II is faster, which is physically impossible?

soAsian · Oct 12, 2025

LLL0955 said:
If the chip technology is not the key, why do not just use SMIC 14nm technology which must have higher yield and low cost compared to SMIC 7nm lol ?

It's how China responds to anything. I remember when China didn't have an aircraft carrier. The Chinese said their rocket force could shut down US carriers thus carriers are useless. They don't say that now since China has its own carriers. When China gets its own EUV or whatever can make better chips, they'll sing a different tune

KevinK · Oct 13, 2025

Fred Chen said:
"We hope that all large-scale models will run faster and better on Ascend Cloud," said Zhang Pingan.

Lot's of indicators of how empty a claim he is making:
* Comparison against H20 not a modern full-featured chip from NVIDIA, AMD or Cerebras.
* No mention of which model they are using to benchmark their per-card performance.
* The "We hope" disclaimer in the statement above.

The only real neutral and open chip-focused AI data-center level inference benchmarking is being done here. Maybe some day we'll see how they really perform, or maybe not.

InferenceMAX™: Open Source Inference Benchmarking

NVIDIA GB200 NVL72, AMD MI355X, Throughput Token per GPU, Latency Tok/s/user, Perf per Dollar, Tokens per Provisioned Megawatt, DeepSeek R1 670B, GPTOSS 120B, Llama3 70B

newsletter.semianalysis.com

Real benchmarking has to look at performance per CPU, cost per token, tokens / sec per mW, and other key performance and user parameters on near-leading edge frontier models.

Fred Chen · Oct 13, 2025

KevinK said:
Lot's of indicators of how empty a claim he is making:
* Comparison against H20 not a modern full-featured chip from NVIDIA, AMD or Cerebras.
* No mention of which model they are using to benchmark their per-card performance.
* The "We hope" disclaimer in the statement above.

The only real neutral and open chip-focused AI data-center level inference benchmarking is being done here:

InferenceMAX™: Open Source Inference Benchmarking

NVIDIA GB200 NVL72, AMD MI355X, Throughput Token per GPU, Latency Tok/s/user, Perf per Dollar, Tokens per Provisioned Megawatt, DeepSeek R1 670B, GPTOSS 120B, Llama3 70B

newsletter.semianalysis.com

Real benchmarking has to look at performance per CPU, cost per token, tokens / sec per mW, and other key performance and user parameters on near-leading edge frontier models.

His chief concern seems to be to play down the importance of chip process.

Xebec · Oct 13, 2025

ai268 said:
What China hope is that "Apple II is faster than Cray YMP". Anyone read that paper, published almost 40 years ago? So, why Apple II is faster, which is physically impossible?

Plot twist: they were running JAVA+Python code on the Cray

bilau · Oct 13, 2025

Fred Chen said:
Huawei Cloud Service has achieved the ability to generate 2,400 tokens per second on a single card with a latency of 50 milliseconds.

Is that good? I know there are other considerations like joules, and $, but simplistically, is the above something to write home about?

I mean all the other stuff he said is to be expected, where he sits dictates what where he stands.

DanX · Oct 13, 2025

bilau said:
Is that good? I know there are other considerations like joules, and $, but simplistically, is the above something to write home about?

I mean all the other stuff he said is to be expected, where he sits dictates what where he stands.

The biggest issue is still the software which is the weakest link in Chinese company.
7nm process chip is better than H20 (which downgrade to 28% of H100), but the software is so bad.
There are a lot of discussions in Zhihu(Chinese Reddit) on this.

bilau · Oct 13, 2025

DanX said:
The biggest issue is still the software which is the weakest link in Chinese company.
7nm process chip is better than H20 (which downgrade to 28% of H100), but the software is so bad.
There are a lot of discussions in Zhihu(Chinese Reddit) on this.

Wouldn't it be true that they can overcome a software deficit easier than a hardware one? There isn't much obstacle (like EUV) that can be deployed to cripple software development.

Search

Zhang Pingan: 5nm and 7nm are not the core. Huawei's computing power is already three times that of Nvidia chips.

Fred Chen

Moderator

张平安：5、7nm并非核心华为算力能力已超英伟达芯片3倍

LLL0955

Member

Fred Chen

Moderator

DanX

Active member

DanX

Active member

ai268

Member

soAsian

Well-known member

KevinK

Well-known member

InferenceMAX™: Open Source Inference Benchmarking

Fred Chen

Moderator

InferenceMAX™: Open Source Inference Benchmarking

Xebec

Well-known member

bilau

Active member

DanX

Active member

bilau

Active member