Array
(
    [content] => 
    [params] => Array
        (
            [0] => /forum/threads/zhang-pingan-5nm-and-7nm-are-not-the-core-huaweis-computing-power-is-already-three-times-that-of-nvidia-chips.23779/
        )

    [addOns] => Array
        (
            [DL6/MLTP] => 13
            [Hampel/TimeZoneDebug] => 1000070
            [SV/ChangePostDate] => 2010200
            [SemiWiki/Newsletter] => 1000010
            [SemiWiki/WPMenu] => 1000010
            [SemiWiki/XPressExtend] => 1000010
            [ThemeHouse/XLink] => 1000970
            [ThemeHouse/XPress] => 1010570
            [XF] => 2030770
            [XFI] => 1060170
        )

    [wordpress] => /var/www/html
)

Zhang Pingan: 5nm and 7nm are not the core. Huawei's computing power is already three times that of Nvidia chips.

Fred Chen

Moderator
According to Fast Technology on October 4, in the view of Zhang Pingan, Huawei's executive director and CEO of Huawei Cloud, chip process is not the core, what customers really need is high-quality computing results.

Recently, Zhang Pingan publicly stated that Huawei Cloud Service has achieved a breakthrough in computing power and efficiency, and its production efficiency has reached three times that of Nvidia's H20 chip.

"The chip process (such as 5nm and 7nm) is not the core. What customers really need is high-quality computing results." Zhang Pingan introduced that through technological innovation, Huawei Cloud Service has achieved the ability to generate 2,400 tokens per second on a single card with a latency of 50 milliseconds.

Currently, Huawei's Ascend Cloud service not only supports our own large-scale Pangu models, but also fully supports third-party models such as DeepSeek and Kimi. "We hope that all large-scale models will run faster and better on Ascend Cloud," said Zhang Pingan.

Zhang Pingan pointed out that China's "computing power hub" is gradually becoming an AI computing power center for global customers. During Huawei's overseas expansion, it discovered that while intelligent computing centers in China all utilize liquid cooling technology, liquid cooling data centers overseas are still relatively rare. Renovating overseas data centers is not only time-consuming, but also lacks sufficient fiber optic network bandwidth.

Zhang Pingan finally emphasized that according to regulations, every cloud vendor must publicly disclose any major incidents online, and Huawei Cloud has maintained a "zero major incident record" for 756 consecutive days. "We are confident that we can maintain this achievement," he said.

 
Zhang Pingan finally emphasized that according to regulations, every cloud vendor must publicly disclose any major incidents online, and Huawei Cloud has maintained a "zero major incident record" for 756 consecutive days. "We are confident that we can maintain this achievement," he said.
If you do nothing, you can have "zero major incident record" forever.
 
What China hope is that "Apple II is faster than Cray YMP". Anyone read that paper, published almost 40 years ago? So, why Apple II is faster, which is physically impossible?
 
If the chip technology is not the key, why do not just use SMIC 14nm technology which must have higher yield and low cost compared to SMIC 7nm lol ?
It's how China responds to anything. I remember when China didn't have an aircraft carrier. The Chinese said their rocket force could shut down US carriers thus carriers are useless. They don't say that now since China has its own carriers. When China gets its own EUV or whatever can make better chips, they'll sing a different tune
 
"We hope that all large-scale models will run faster and better on Ascend Cloud," said Zhang Pingan.
Lot's of indicators of how empty a claim he is making:
* Comparison against H20 not a modern full-featured chip from NVIDIA, AMD or Cerebras.
* No mention of which model they are using to benchmark their per-card performance.
* The "We hope" disclaimer in the statement above.

The only real neutral and open chip-focused AI data-center level inference benchmarking is being done here. Maybe some day we'll see how they really perform, or maybe not.


Real benchmarking has to look at performance per CPU, cost per token, tokens / sec per mW, and other key performance and user parameters on near-leading edge frontier models.
 
Last edited:
Lot's of indicators of how empty a claim he is making:
* Comparison against H20 not a modern full-featured chip from NVIDIA, AMD or Cerebras.
* No mention of which model they are using to benchmark their per-card performance.
* The "We hope" disclaimer in the statement above.

The only real neutral and open chip-focused AI data-center level inference benchmarking is being done here:


Real benchmarking has to look at performance per CPU, cost per token, tokens / sec per mW, and other key performance and user parameters on near-leading edge frontier models.
His chief concern seems to be to play down the importance of chip process.
 
What China hope is that "Apple II is faster than Cray YMP". Anyone read that paper, published almost 40 years ago? So, why Apple II is faster, which is physically impossible?
Plot twist: they were running JAVA+Python code on the Cray
 
Huawei Cloud Service has achieved the ability to generate 2,400 tokens per second on a single card with a latency of 50 milliseconds.
Is that good? I know there are other considerations like joules, and $, but simplistically, is the above something to write home about?

I mean all the other stuff he said is to be expected, where he sits dictates what where he stands.
 
Is that good? I know there are other considerations like joules, and $, but simplistically, is the above something to write home about?

I mean all the other stuff he said is to be expected, where he sits dictates what where he stands.
The biggest issue is still the software which is the weakest link in Chinese company.
7nm process chip is better than H20 (which downgrade to 28% of H100), but the software is so bad.
There are a lot of discussions in Zhihu(Chinese Reddit) on this.
 
The biggest issue is still the software which is the weakest link in Chinese company.
7nm process chip is better than H20 (which downgrade to 28% of H100), but the software is so bad.
There are a lot of discussions in Zhihu(Chinese Reddit) on this.
Wouldn't it be true that they can overcome a software deficit easier than a hardware one? There isn't much obstacle (like EUV) that can be deployed to cripple software development.
 
Back
Top