This is from Mark Hibben who I have read for years and have grown to trust and respect. The other so called analsystes who dumped on TSMC and Nvidia for the delay are click baiters and not to be trusted, my opinion.
My take on the Blackwell delay
On August 2, The Information released a report that Nvidia’s Blackwell would be delayed by at least a quarter due to unspecified “design flaws.” Normally, when one hears about the design flaws of a chip, logic design problems or bugs come to mind. But Nvidia management made clear that this was not the case during their fiscal Q2 results conference call.
Instead, there was an issue with a mask that impacted chip yield, said CFO Colette Kress:
Presumably, the chip yield affects the number of good chips resulting from each silicon wafer fabricated by foundry partner TSMC (TSM). CEO Jensen Huang reiterated that there was nothing wrong with the functional design of the chip. The mask change did not change the functional logic of the chip.
Developing the mask to implement a given layer of circuitry on an advanced chip has become an enormously complex process. This is because at the extreme ultraviolet wavelength of 13.5 nm, diffraction effects and optical distortion mean that the light pattern produced on the chip doesn’t look like the mask.
Optical physicists try to work backwards from the desired pattern to predict what the mask should look like, depending on the EUV machine and other factors. The process is computationally intensive, and TSMC has invested in an Nvidia supercomputer to perform what is called “computational lithography.”
The process isn’t perfect, and the actual pattern produced by a mask may not match the computational lithography prediction. Since the issue is one of yield, this is probably what happened in the case of Blackwell, and why it’s going to take several months to find a more optimal mask solution and begin mass production.
Asa Fitch’s article in the WSJ reported on the Blackwell delay:
I disagree with this interpretation. Nvidia has been making its flagship GPU accelerators (such as Hopper), at TSMC’s reticle limit, for years. This sets the maximum physical size of a chip that TSMC can produce using EUV lithography machines produced by ASML Holding (ASML).
Blackwell does consist of two such chips, but the process to make each is essentially unchanged from Hopper. Blackwell is fabbed, using basically the same TSMC N4 process as Hopper. Fitch continues:
Once again, I have a different interpretation. The fact that the package consists of two chips doesn’t make the chips harder to make. Each chip already needs to be “close to perfect.” The added complexity is in packaging, not in making the silicon. And it certainly doesn’t have anything to do with the mask issue.
As I pointed out in my investing group article on the Blackwell debut back in March, the approach had already been pioneered by Apple (AAPL) and TSMC in their M-series Ultra processors.
My take on the Blackwell delay
On August 2, The Information released a report that Nvidia’s Blackwell would be delayed by at least a quarter due to unspecified “design flaws.” Normally, when one hears about the design flaws of a chip, logic design problems or bugs come to mind. But Nvidia management made clear that this was not the case during their fiscal Q2 results conference call.
Instead, there was an issue with a mask that impacted chip yield, said CFO Colette Kress:
Hopper demand is strong and Blackwell is widely sampling. We executed a change to the Blackwell GPU mask to improve production yields. Blackwell production ramp is scheduled to begin in the fourth quarter and continue into fiscal year '26.
Presumably, the chip yield affects the number of good chips resulting from each silicon wafer fabricated by foundry partner TSMC (TSM). CEO Jensen Huang reiterated that there was nothing wrong with the functional design of the chip. The mask change did not change the functional logic of the chip.
Developing the mask to implement a given layer of circuitry on an advanced chip has become an enormously complex process. This is because at the extreme ultraviolet wavelength of 13.5 nm, diffraction effects and optical distortion mean that the light pattern produced on the chip doesn’t look like the mask.
Optical physicists try to work backwards from the desired pattern to predict what the mask should look like, depending on the EUV machine and other factors. The process is computationally intensive, and TSMC has invested in an Nvidia supercomputer to perform what is called “computational lithography.”
The process isn’t perfect, and the actual pattern produced by a mask may not match the computational lithography prediction. Since the issue is one of yield, this is probably what happened in the case of Blackwell, and why it’s going to take several months to find a more optimal mask solution and begin mass production.
Asa Fitch’s article in the WSJ reported on the Blackwell delay:
Nvidia hasn’t detailed the nature of the issue. But analysts and industry executives say its engineering challenges stem mostly from the size of the Blackwell chips, which require a significant departure in design.
I disagree with this interpretation. Nvidia has been making its flagship GPU accelerators (such as Hopper), at TSMC’s reticle limit, for years. This sets the maximum physical size of a chip that TSMC can produce using EUV lithography machines produced by ASML Holding (ASML).
Blackwell does consist of two such chips, but the process to make each is essentially unchanged from Hopper. Blackwell is fabbed, using basically the same TSMC N4 process as Hopper. Fitch continues:
Instead of one big piece of silicon, Blackwell consists of two advanced new Nvidia processors and numerous memory components joined in a single, delicate mesh of silicon, metal and plastic.
The manufacturing of each chip has to be close to perfect: serious defects in any one part can spell disaster, and with more components involved, there is a greater chance of that happening.
Once again, I have a different interpretation. The fact that the package consists of two chips doesn’t make the chips harder to make. Each chip already needs to be “close to perfect.” The added complexity is in packaging, not in making the silicon. And it certainly doesn’t have anything to do with the mask issue.
As I pointed out in my investing group article on the Blackwell debut back in March, the approach had already been pioneered by Apple (AAPL) and TSMC in their M-series Ultra processors.
Last edited: