Array
(
    [content] => 
    [params] => Array
        (
            [0] => /forum/index.php?threads/deepseek-reportedly-prepares-to-use-chinese-domestic-gpus-bypassing-nvidias-restrictions.22025/
        )

    [addOns] => Array
        (
            [DL6/MLTP] => 13
            [Hampel/TimeZoneDebug] => 1000070
            [SV/ChangePostDate] => 2010200
            [SemiWiki/Newsletter] => 1000010
            [SemiWiki/WPMenu] => 1000010
            [SemiWiki/XPressExtend] => 1000010
            [ThemeHouse/XLink] => 1000970
            [ThemeHouse/XPress] => 1010570
            [XF] => 2021770
            [XFI] => 1050270
        )

    [wordpress] => /var/www/html
)

DeepSeek Reportedly Prepares to Use Chinese Domestic GPUs, Bypassing NVIDIA's Restrictions

XYang2023

Well-known member
Machine translation:

(Central News Agency, Taipei, 3rd) Chinese AI startup DeepSeek has reportedly developed a large language model that circumvents NVIDIA’s CUDA framework, as it prepares for future adaptation to domestically-produced Chinese GPU chips.

According to a report from Hong Kong's Sing Tao Daily, NVIDIA’s Compute Unified Device Architecture (CUDA) significantly reduces the difficulty of developing large AI models, making it widely adopted by developers worldwide and securing NVIDIA’s dominant position in artificial intelligence (AI) development.

The report, citing a U.S. tech website, states that while DeepSeek currently trains its models using NVIDIA’s H800 chips, it utilizes NVIDIA’s low-level hardware instruction language PTX (Parallel Thread Execution) instead of the higher-level programming language CUDA.

Huang Lei, an associate professor at Beihang University (Beijing University of Aeronautics and Astronautics), explained that bypassing CUDA means DeepSeek can develop directly based on GPU driver functions, enabling more fine-tuned operations.

The report further notes that DeepSeek has internal developers skilled in PTX language, which could facilitate its transition to Chinese domestic GPUs in the future. By understanding the basic function interfaces provided by GPU drivers, DeepSeek can mimic NVIDIA’s GPU hardware programming interfaces to develop relevant code. This capability would enhance its large language model’s adaptability to Chinese-made hardware.

(Edited by: Zhou Huiying / Zhang Shuling) 1140203

 
The founder of DeepSeek already hinted at having an end-to-end ecosystem in an interview that took place in July 2024:

 
Cars, batteries, chips and AI is going completely seperate.

Would it be better if it was Nvidia, TSMC, ASML, AMD and others getting business and scale ?
 
Cars, batteries, chips and AI is going completely seperate.

Would it be better if it was Nvidia, TSMC, ASML, AMD and others getting business and scale ?
I think related reports mentioned Huawei specifically. It also implies the CUDA moat can be surpassed.
 
Back
Top