
Impact of Varying Electron Blur and Yield on Stochastic Fluctuations in EUV Resist
by Fred Chen on 05-03-2025 at 4:00 pm


A comprehensive update to the EUV stochastic image model

In extreme ultraviolet (EUV) lithography, photoelectron/secondary electron blur and secondary electron yield are known to drive stochastic fluctuations in the resist [1-3], leading to the formation of random defects and the degradation of pattern fidelity at advanced nodes. For simplicity, blur and electron yield per photon are often taken to be fixed parameters for a given EUV resist film. However, there is no reason to expect this to be true, since the resist is inhomogeneous on the nanoscale [4-6].

I have updated the model I have been using to analyze EUV stochastics with the following:

  • Image fading from EUV pole-specific shift is not included (expected to be minor)

  • Polarization consideration: assume 50% TE/50% TM for the 2-beam image commonly used for EUV lines and spaces [7]

  • Poisson statistics are applied to the absorbed photon density (photons/nm²)

  • The electron blur function is zero at zero distance and matches the exponential attenuation length at larger distances, i.e., exp(-x/attenuation length)

  • Electron blur is treated as locally varying rather than a fixed number

  • The electron (or acid) number per absorbed EUV photon has minimum and maximum values

  • Acid blur still uses a Gaussian form (σ = 5 nm)

A key feature is that the electron yield per photon is drawn from a distribution. This distribution is often modeled as Poissonian but in actuality can deviate from it significantly [8]. The maximum number of electrons is roughly the EUV photon energy (~92 eV) divided by the ionization potential (~10 eV), giving 9. The minimum number of electrons released can be estimated as 6 from the Auger emission scenario in Figure 1; with one electron assumed lost to the underlayer, this gives a minimum of 5.

Figure 1. EUV Auger emission scenario releasing minimal number of electrons.
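
To make the counting statistics concrete, below is a minimal Python sketch of the per-pixel sampling implied by the above: a Poisson-distributed number of absorbed photons, each releasing an electron count bounded between the minimum and maximum. The clipped-Poisson yield distribution and the mean values are illustrative assumptions only; as noted, the actual yield distribution can deviate from Poisson [8].

import numpy as np

rng = np.random.default_rng(0)

def electrons_in_pixel(mean_photons=10.0, mean_yield=7.0, min_e=5, max_e=9):
    # Poisson-distributed absorbed photon count for one pixel, with each
    # photon's electron yield clipped to the [min_e, max_e] range.
    n_photons = rng.poisson(mean_photons)
    yields = np.clip(rng.poisson(mean_yield, size=n_photons), min_e, max_e)
    return yields.sum()

samples = np.array([electrons_in_pixel() for _ in range(20000)])
print(samples.mean(), samples.std())  # pixel-to-pixel stochastic fluctuation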

We also recall that electron blur is not a fixed value, but can take on different values at different locations [4-6]. As constructed previously [3], the electron blur function shape arises from the difference of two exponential functions. To maintain zero probability at zero distance, an exponential function with 0.4 nm attenuation length is subtracted from the normalized exponential function with target attenuation length.
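
As a rough numerical sketch of that construction, the unnormalized blur weight at distance x is the difference of the two exponentials, which is zero at x = 0 by design. Only the 0.4 nm and ~2 nm attenuation lengths come from the text; the normalization over a finite range is an assumption for illustration.

import numpy as np

def electron_blur(x, attn_len=2.0, short_len=0.4):
    # Difference of two exponentials: zero at x = 0 nm, decaying with
    # the target attenuation length at larger distances.
    return np.exp(-x / attn_len) - np.exp(-x / short_len)

x = np.linspace(0.0, 10.0, 501)
w = electron_blur(x)
w /= w.sum() * (x[1] - x[0])  # normalize to unit area over the sampled range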

While a typical blur can correspond to an attenuation length of ~2 nm, the inhomogeneity within the resist can lead to a rare occurrence of higher blur, e.g., corresponding to an attenuation length of ~4 nm (Figure 2).

Figure 2. A “typical” electron blur distribution could peak at just under 1 nm distance and decay exponentially with an attenuation length of ~2 nm. On the other hand, a “rare” electron blur distribution may peak at ~1 nm distance and decay exponentially with an attenuation length of ~ 4 nm.

The impact of varying blur is to increase the percentage of defective pixels in the image (Figure 3), i.e., higher stochastic defectivity. This is to be expected, as increasing blur decreases the max-min difference, making it more likely for particle number fluctuations to cross a given threshold.

Figure 3. A higher local blur (i.e., attenuation length) reduces image contrast more, increasing the likelihood of electron number fluctuations to cross the printing threshold.

Etching can affect the stochastics in the final image. Depending on its direction, etch bias can make obstructions within a trench more likely, or openings between adjacent trenches more likely. This is the origin of the “valley” between the stochastic defectivity cliffs (Figure 4).

Figure 4. Etch bias can also affect the stochastic defectivity. Increasing etch bias (right) means undersizing the feature, which makes it more likely for trenches to be blocked.

As pitch increases, the contrast loss from the “typical” electron blur shown in Figure 2 reaches a minimum value of 20% [3]. However, the stochastic behavior for larger pitches improves, as thicker resists may be used, increasing the absorbed photon density. Going from 30 nm to 40 nm pitch (Figure 5), the absorbed photon density increases, and the contrast reduction from electron blur is also improved. However, there is still significant noise in the electron density at the feature edge, as well as defectivity with etch bias.

Figure 5. Same as Figure 4 but for 40 nm pitch. The absorbed photon density and contrast are increased, but the defectivity is only slightly improved.

When chemically amplified resists are used, acid blur must be added. Acid blur is usually modeled as a Gaussian function with sigma on the order of 5 nm [9]. The extra blur from acid aggravates the stochastic behavior even more (Figure 6).
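
As a minimal illustration (not the article's actual model), acid blur can be layered on as a Gaussian convolution with sigma of about 5 nm; the grid spacing and the toy input profile below are assumptions chosen only to show the extra contrast loss.

import numpy as np
from scipy.ndimage import gaussian_filter1d

dx = 0.25                                             # nm per grid point (assumed)
x = np.arange(0.0, 60.0, dx)
electron_profile = (np.sin(2 * np.pi * x / 30.0) > 0).astype(float)  # toy 30 nm pitch pattern
acid_profile = gaussian_filter1d(electron_profile, sigma=5.0 / dx)   # sigma = 5 nm in grid units
print(np.ptp(electron_profile), np.ptp(acid_profile))  # contrast before/after acid blur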

Figure 6. Acid blur is a strong blur component in chemically amplified EUV resists.

Consequently, double or multiple patterning is used with EUV, comprising exposures at >40 nm pitch with larger features and thicker resists. DUV multipatterning will still have the following advantages over EUV multipatterning:

  • Polarization restriction (not unpolarized)
  • No electron blur
  • Higher photon density
  • 2-beam imaging (instead of 3- or 4-beam)
  • Well-developed phase-shift mask technology

Most importantly, DUV double patterning has lower cost than EUV double patterning.


References

[1] Z. Belete et al., J. Micro/Nanopattern. Mater. Metrol. 20, 014801 (2021).

[2] F. Chen, A Perfect Storm for EUV Lithography.

[3] F. Chen, A Realistic Electron Blur Function Shape for EUV Resist Modeling.

[4] F. Chen, Measuring Local EUV Resist Blur with Machine Learning.

[5] F. Chen, Stochastic Effects Blur the Resolution Limit of EUV Lithography.

[6] G. Denbeaux et al., “Understanding EUV resist stochastic effects through surface roughness measurements,” IEUVI Resist TWG meeting, February 23, 2020.

[7] H. J. Levinson, Jpn. J. Appl. Phys. 61, SD0803 (2022).

[8] E. F. da Silveira et al., Surf. Sci. 408, 28 (1998); Z. Bay and G. Papp, IEEE Trans. Nucl. Sci. 11, 160 (1964); L. Frank, J. Elec. Microsc. 54, 361 (2005).

[9] H. Fukuda, J. Micro/Nanolith. MEMS MOEMS 19, 024601 (2020).

This article is based on the presentation in the following video: Comprehensive Update to EUV Stochastic Image Model.



Executive Interview with Koji Motomori, Senior Director of Marketing and Business Development at Numem
by Daniel Nenni on 05-03-2025 at 2:00 pm

Koji Motomori

Koji Motomori is a seasoned business leader and technologist with 30+ years of experience in semiconductors, AI, embedded systems, data centers, mobile, and memory solutions, backed by an engineering background. Over 26 years at Intel, he drove strategic growth initiatives, securing $2B+ in contracts with OEMs and partners. His expertise spans product marketing, GTM strategy, business development, deal-making, and ecosystem enablement, accelerating the adoption of CPU, memory, SSD, and interconnect technologies.

Tell us about your company.

At Numem, we’re all about taking memory technology to the next level, especially for AI, Edge Devices, and Data Centers. Our NuRAM SmartMem™ is a high-performance, ultra-low-power memory solution built on MRAM technology. So, what makes it special? It brings together the best of different memory types—SRAM-like read speeds, DRAM-like write performance, non-volatility, and ultra-low power.

With AI and advanced computing evolving fast, the demand for efficient, high-density memory is skyrocketing. That’s where we come in. Our solutions help cut energy consumption while delivering the speed and reliability needed for AI training, inference, and mission-critical applications. Simply put, we’re making memory smarter, faster, and more power-efficient to power the future of computing.

What problems are you solving?

That’s a great question. The memory industry is really struggling to keep up with the growing demands of AI and high-performance computing. Right now, we need memory that’s not just fast, but also power-efficient and high-capacity. The problem is, existing technologies all have major limitations.

Let’s take SRAM, for example: it’s fast but has high leakage power and doesn’t scale well at advanced nodes. HBM DRAM is another option, but it’s higher cost, power-hungry, and still not fast enough to fully meet AI’s needs. And then there’s DDR DRAM, which has low bandwidth, making it a bottleneck for high-performance AI workloads.

That’s exactly why we developed NuRAM SmartMem to solve these challenges. It combines the best of different memory types:

  • It gives you SRAM-like read speeds and DRAM-like write speeds, so AI workloads run smoothly.
  • It has 200x lower standby power than SRAM, which is huge for energy efficiency.
  • It’s 2.5x denser than SRAM, helping reduce cost and die size.
  • It delivers over 3x the bandwidth of HBM, eliminating AI bottlenecks.
  • And it’s non-volatile, meaning it retains data even when the power is off.

So, with NuRAM SmartMem™, we’re not just making memory faster, we’re making it more efficient and scalable for AI, Edge, and Data Center applications. It’s really a game-changer for the industry.

What application areas are your strongest?

That’s another great question. Our memory technology is designed to bring big improvements across a wide range of applications, but we’re especially strong in a few key areas.

For data centers, we help make AI model training and inference more efficient while cutting power consumption. Since our technology reduces the need for SRAM and DRAM, companies see significant Total Cost of Ownership (TCO) benefits. Plus, the non-volatility of our memory enables instant-on capabilities, meaning servers can reboot much faster.

In automotive, especially for EVs, real-time decision-making is critical. Our low-power memory helps extend battery life, and by consolidating multiple memory types like NOR Flash and LPDDR, we save space, power, cost, and weight—while also improving reliability.

For Edge AI devices and IoT applications, power efficiency is a huge concern. Our ultra-low-power memory helps reduce energy consumption, making these devices more sustainable and efficient.

Aerospace is another area where we stand out. Mission-critical applications demand reliability, energy efficiency, and radiation immunity—all of which our memory provides.

Then there are security cameras—with ultra-low power consumption and high bandwidth, our memory helps extend battery life while supporting high-resolution data transmission. And since we can replace memory types like NOR Flash and LPDDR, we also optimize space, power, and cost.

For wearable devices, battery life is everything. Our technology reduces power consumption, enabling lighter, more compact designs that last longer—something consumers really appreciate.

And finally, in PCs and smartphones, AI-driven features need better memory performance. Our non-volatile memory allows for instant-on capabilities, extends battery life, and replaces traditional memory types like boot NOR Flash and DDR, leading to power and space savings, plus faster boot times and overall better performance.

So overall, our memory technology delivers real advantages across multiple industries.

What keeps your customers up at night?

A lot of things. AI workloads are becoming more demanding, and our customers are constantly looking for ways to stay ahead.

One big concern is power efficiency and thermal management. AI systems push power budgets to the limit, and with rising energy costs, the total cost of ownership (TCO) becomes a huge factor. Keeping power consumption low is critical, not just for efficiency, but for performance and profitability.

Then there’s the issue of memory bandwidth bottlenecks. Traditional memory architectures simply can’t keep up with the growing performance demands of AI, which creates bottlenecks and limits system scalability.

Scalability and cost are also major worries. AI applications need more memory, but scaling up can drive spending up quickly. Our customers want solutions that provide higher capacity without blowing the budget.

And finally, reliability and data retention are key, especially for AI and data-heavy applications. These workloads require memory that’s not just fast, but also non-volatile, secure, and long-lasting while still keeping power consumption low.

That’s exactly where NuRAM SmartMem comes in. Our technology delivers ultra-low power, high-density, and high-bandwidth memory solutions that help customers overcome these challenges and future-proof their AI-driven applications.

What does the competitive landscape look like, and how do you differentiate?

The high-performance memory market is dominated by SRAM, LPDDR DRAM, and HBM. Each of these technologies has strengths, but they also come with some major challenges.

SRAM, for example, is fast, but it has high standby power and scalability limitations at advanced nodes. LPDDR DRAM is designed to be lower power than standard DRAM, but it still consumes a lot of energy. And HBM DRAM delivers high bandwidth, but it comes with high cost, power constraints, and integration complexity.

That’s where NuRAM SmartMem™ stands out. We’ve built a memory solution that outperforms these technologies in key areas:

  • 200x lower standby power than SRAM, making it perfect for always-on AI applications that need ultra-low power.
  • 5x higher density than SRAM, reducing die size and overall memory costs.
  • Non-volatility: unlike SRAM and DRAM, NuRAM retains data even without power, adding both energy efficiency and reliability.
  • Over 3x faster bandwidth than HBM3E, solving AI’s growing memory bandwidth challenges.
  • Over 260x lower standby power than HBM3E, thanks to non-volatility and our flexible power management feature per block.
  • Scalability & Customization—NuRAM SmartMem™ is available as both IP cores and chiplets, making integration seamless for AI, IoT, and Data Center applications.

So, what really differentiates us? We’re offering a next-generation memory solution that maximizes performance while dramatically reducing power and cost. It’s a game-changer compared to traditional memory options.

What new features/technology are you working on?

We’re constantly pushing the boundaries of AI memory innovation, focusing on performance, power efficiency, and scalability. A few exciting things we’re working on right now include:

  • Smart Memory Subsystems – We’re making memory smarter. Our self-optimizing memory technology is designed to adapt and accelerate AI workloads more efficiently.
  • 2nd-Gen NuRAM SmartMem™ Chiplets – We’re taking things to the next level with even higher bandwidth, faster read/write speeds, lower power consumption, and greater scalability than our first generation.
  • AI Optimized Solutions – We’re fine-tuning our memory for LLM inference, AI Edge devices, and ultra-low-power AI chips, ensuring they get the best performance possible.
  • High-Capacity & Scalable Operation – As AI models keep growing, memory needs to scale with them. We’re expanding die capacity and improving stacking while working closely with foundries to boost manufacturability and yield for high-volume production.
  • Memory Security & Reliability Enhancements – AI applications rely on secure, stable memory. We’re enhancing data integrity, security, and protection against corruption and cyber threats to ensure reliable AI operations.

For the future, we’re on track to deliver our first-generation chiplet samples in Q4 2025 and second-generation chiplets samples in Q2 2026. With these advancements, we’re setting a new benchmark for efficiency, performance, and power optimization in AI memory.

How do customers normally engage with your company?

We work closely with a wide range of customers, including AI chip makers, MCU/ASIC designers, SoC vendors, Data Centers, and Edge computing companies. Our goal is to integrate our advanced memory solutions into their systems in the most effective way possible.

There are several ways customers typically engage with us:

  • NuRAM + SmartMem™ IP Licensing – Some customers embed our NuRAM SmartMem™ technology directly into their ASICs, MCUs, MPUs, and SoCs, boosting performance and efficiency for next-gen AI and computing applications.
  • SmartMem™ IP Licensing—Others use our SmartMem™ technology on top of their existing memory architectures, whether Flash, RRAM, PCRAM, traditional MRAM, or DRAM, to improve memory performance and power efficiency.
  • Chiplet Partnerships – For customers looking for a plug-and-play solution, we offer SmartMem™ chiplets that deliver high bandwidth and ultra-low power, specifically designed for server and Edge AI accelerators while seamlessly aligning with industry-standard memory interfaces.
  • Custom Memory Solutions – We also work with customers to customize memory architectures to their specific AI and Edge workloads, ensuring optimal performance and power efficiency.
  • Collaborations & Joint Development – We actively partner with industry leaders to co-develop next-generation memory solutions, maximizing AI processing efficiency and scalability.

At the end of the day, working with Numem gives customers access to ultra-low-power, high-performance, and scalable memory solutions that help them meet AI’s growing demands while significantly reducing energy consumption and cost.

Also Read:

Executive Interview with Leo Linehan, President, Electronic Materials, Materion Corporation

CEO Interview with Ronald Glibbery of Peraso

CEO Interview with Pierre Laboisse of Aledia


Video EP3: A Discussion of Challenges and Strategies for Heterogeneous 3D Integration with Anna Fontanelli
by Daniel Nenni on 05-02-2025 at 10:00 am

In this episode of the Semiconductor Insiders video series, Dan is joined by Anna Fontanelli, founder and CEO of MZ Technologies. Anna explains some of the substantial challenges associated with heterogeneous 3D integration. Dan then begins to explore some of the capabilities of GenioEVO, the first integrated chiplet/package EDA tool to address, in the pre-layout stage, the two major issues of 3D-IC design: thermal and mechanical stress.

Contact MZ Technologies

The views, thoughts, and opinions expressed in these videos belong solely to the speaker, and not to the speaker’s employer, organization, committee or any other group or individual.


CEO Interview with Richard Hegberg of Caspia Technologies
by Daniel Nenni on 05-02-2025 at 6:00 am

Richard Hegberg

Rick has a long and diverse career in the semiconductor industry. He began as VP of sales at Lucent Microelectronics. He has held executive roles at several high-profile companies and participated in several acquisitions along the way. These include NetApp, SanDisk/WD, Atheros/Qualcomm, Numonyx/Micron, ATI/AMD, and VLSI Technology.

Rick was CEO of three semiconductor start-ups. He has a deep knowledge and passion for semiconductor systems with a special focus on analog and AI. Rick joined Caspia Technologies as CEO in September 2024.

Tell us about your company

Caspia Technologies was born out of pioneering work in cybersecurity at the University of Florida in Gainesville.  The founding team includes Dr. Mark Tehranipoor, Department Chair of Electrical and Computer Engineering. Mark is also the founding director of the Florida Institute for Cybersecurity Research.

He and his research team have driven an expanded understanding of the complex world of cybersecurity. This team began developing unique GenAI assisted tools to assure new chip designs are resistant to current and future cyberattacks. That commercial application of the team’s work is what created Caspia Technologies and got my attention to lead the effort.

What problems are you solving?

Caspia is delivering technology and know-how to ensure chip designs are resistant to cyberattacks. If you think about current production chip design flows, there are well-developed processes to verify the functionality, timing and power of new chip designs. What is missing is a process to verify how secure and resistant to attack these chip designs are. A security verification flow is needed, and Caspia is delivering that.

There is more to this story. We all know that AI is driving many new and highly complex chip designs. Design teams are now using AI technology to make it easier and faster to design the required chips. Using AI to design AI chips if you will. While this approach has shown substantial results, there are two critical risks that emerge.

First, the vast data sets that are being used to train AI models are typically not managed and secured appropriately. This opens the door to faulty design practices. And second, these same AI algorithms can be used to mount very sophisticated cyberattacks. AI makes it easier to design chips but also easier to attack those same chips.

Using GenAI technology and a deep understanding of cyberattack methods, Caspia is addressing these problems.

What application areas are your strongest?

Assurance of security at the hardware level is our focus and our core strength. We are presently deploying a GenAI assisted security platform that examines pre-silicon designs using three approaches.

First, we perform static checking of RTL code to identify design practices that can lead to security weaknesses. The tool also helps designers fix the problems it finds. We use a large set of checks that is constantly updated with our security-trained large language models (LLMs).

Second, we use GenAI to create security-based assertions that can be fed to standard formal verification tools. This opens the door to deep analysis of new designs from a security standpoint.

And third, we use GenAI to set up scenarios using standard co-simulation and emulation technology to ensure the design is indeed resistant to real-world cyberattacks.

What keeps your customers up at night?

That depends on the customer. Those that are new to the use of AI technology for chip design may not realize the risks they are facing. Here, we provide training to create a deeper understanding of the risks and how to address them.

Design teams that are already using AI in the design flow better understand the risks they face. What concerns these teams usually falls into two categories:

  1. How can I protect the data I am using to train my LLMs, and how can I use those LLMs more effectively?
  2. How can I be sure my design is resistant to current and future cyberattacks?

Caspia’s security platform addresses both of these issues.

What does the competitive landscape look like and how do you differentiate?

There are many commercially available IP blocks that claim various degrees of security hardness. There are also processes, procedures and tools that can help ensure a better design process.

But there is no fully integrated platform that uses GenAI to verify new chip designs are as secure as they can be. This is the unique technology that is only available from Caspia.

What new features/technology are you working on?

Our current security verification platform is a great start. This technology has received an enthusiastic response from both small and very large companies. The ability to add a security verification to your existing design flow is quite compelling. No new design flow, just new security insights.

We are constantly updating our GenAI models to ensure we are tracking and responding to the latest threats. Beyond that, we will be adding additional ways to check a design. Side-channel assessment, fault injection, IP protection and silicon backside protection are all on our roadmap.

How do customers normally engage with your company?

We are starting to be visible at more industry events. For example, we are a Corporate Sponsor at the IEEE International Symposium on Hardware Oriented Security and Trust (HOST) in San Jose on May 5-8 this year and Dr. Tehranipoor will be giving a SkyTalk at DAC as well in June. You can stop by and see us at these events. You can also reach out to us via our website here. We’ll take it from there.

Also Read:

CEO Interview with Dr. Michael Förtsch of Q.ANT

Executive Interview with Leo Linehan, President, Electronic Materials, Materion Corporation

CEO Interview with Ronald Glibbery of Peraso


TSMC Describes Technology Innovation Beyond A14
by Mike Gianfagna on 05-01-2025 at 10:00 am

Device Architecture Outlook

The inaugural event for the 2025 TSMC Technology Symposium recently concluded in Santa Clara, California. This will be followed by events around the world over the next two months. We have summarized information from this event regarding process technology innovation and advanced packaging innovation. Overall, the A14 process node was presented as the most advanced technology available from TSMC. Recently, a presentation from the event was posted that discusses technology leadership and, in that presentation, what lies beyond A14. Seeing what’s around the next corner is always interesting. Let’s look at how TSMC describes technology innovation beyond A14.

The Presenter

Dr. Yuh-Jier Mii

The presenter was Dr. Yuh-Jier Mii, EVP and Co-Chief Operating Officer at TSMC. Dr. Mii is an excellent presenter. He describes very complex work in language everyone can understand. His presentation builds on work he presented at last year’s IEDM event. Dr. Mii covered a lot of information. A link is coming. But first, I’d like to focus on his comments on innovation at TSMC beyond A14. 

What Was Said

Dr. Mii’s discussion focused broadly on new transistor architectures and new materials. He began by discussing device architectures. The current evolution is from FinFET to Nanosheet. Beyond these technologies, vertically stacked NFET and PFET devices, called CFETs, are a likely scaling candidate. Beyond CFET, there are breakthroughs in channel materials that can enable further dimensional scaling and energy reduction. These developments are summarized in the graphic above.

Dr. Mii reported that TSMC has been actively building CFET devices on silicon to enable the next level of scaling. TSMC presented its first CFET transistor at a 48nm gate pitch at IEDM 2023. This year at IEDM, TSMC presented the smallest CFET inverter. The figure below illustrates the well-balanced performance characteristics of this device up to 1.2V.

He explained that this demonstration achieved a significant milestone in CFET technology development that will help to drive future technology scaling.

Dr. Mii reported that great progress has also been made on transistors with 2D channel materials. TSMC has demonstrated the first electrical performance using a monolayer channel in stacked nanosheet architecture similar to the N2 technology. An inverter has also been developed using well-matched N and P channel devices operating at 1V. This work is summarized in the figure below.

Going forward, there are plans to continue to develop new interconnect technologies to improve interconnect performance. For copper interconnect, the plan is to use a new via scheme to reduce via resistance and coupling capacitance. Work is also underway on a new copper barrier to reduce copper line resistance.

Beyond copper, there is work underway on new metal materials with an air gap that could further reduce resistance and coupling capacitance. Intercalated Graphene is another new and promising metal material that could significantly reduce interconnect delay in the future. This work is summarized in the graphic below.

To Learn More

Dr. Mii covered many other topics. You can view his entire presentation here. And that’s how TSMC describes technology innovation beyond A14.

Also Read:

TSMC Brings Packaging Center Stage with Silicon

TSMC 2025 Technical Symposium Briefing

IEDM 2025 – TSMC 2nm Process Disclosure – How Does it Measure Up?


SNUG 2025: A Watershed Moment for EDA – Part 2
by Lauro Rizzatti on 05-01-2025 at 6:00 am


At this year’s SNUG (Synopsys Users Group) conference, Richard Ho, Head of Hardware, OpenAI, delivered the second keynote, titled “Scaling Compute for the Age of Intelligence.” In his presentation, Richard guided the audience through the transformative trends and implications of the intelligence era now unfolding before our eyes.

Recapping the current state of AI models, including the emergence of advanced reasoning models in fall 2024, Richard highlighted the profound societal impact of AI technology, extending well beyond technical progress alone. He followed with a discussion on the necessary scaling of compute and its implications for silicon technology. The talk emphasized the distinction between traditional data centers and AI-optimized data centers, concluding with an analysis of how these shifts are reshaping silicon design processes and the role of EDA engineers.

Reasoning Models: The Spark of General Intelligence

AI entered mainstream consciousness in November 2022 with the launch of ChatGPT. Its rapid adoption surprised everyone, including OpenAI. Today, the latest version, GPT-4o—introduced in 2024 with advanced reasoning models—has surpassed 400 million monthly users, solidifying its role as an essential tool for coding, education, writing, and translation. Additionally, generative AI tools like Sora are revolutionizing video creation, enabling users to express their ideas more effectively.

Richard believes that the emergence of reasoning models heralds the arrival of AGI. To illustrate the concept, he used a popular mathematical riddle, known as the locker problem. Imagine a corridor with 1,000 lockers, all initially closed. A thousand people walk down the corridor in sequence, toggling the lockers. The first person toggles every locker, the second every other locker, the third every third locker, and so on. The 1,000th person toggles only the 1,000th locker. How many lockers remain open?

Richard presented this problem to a reasoning model in ChatGPT, requesting its chain of thought. Within seconds, ChatGPT concluded that 31 lockers remain open.

More important than the result was the process. In summary, the model performed:
  • Problem Extraction: The model identified the core problem: toggling and counting divisors.
  • Pattern Recognition: It recognized that only perfect squares have an odd number of divisors.
  • Logical Deduction: It counted the perfect squares up to 1,000 (1² through 31²) and concluded that 31 lockers remain open.

This example, while relatively simple, encapsulated the promise of reasoning models: to generalize, abstract, and solve non-trivial problems across domains.
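
For the curious, the riddle is easy to verify by brute force; the short sketch below toggles the lockers exactly as described and confirms the count of 31 (the lockers at perfect-square positions).

open_lockers = [False] * 1001                  # lockers 1..1000, all start closed
for person in range(1, 1001):
    for locker in range(person, 1001, person):
        open_lockers[locker] = not open_lockers[locker]
print(sum(open_lockers[1:]))                   # 31: exactly the perfect squares up to 1000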

The Economic Value of Intelligence

The keynote then pivoted from theory to impact, discussing the impact of AI on global productivity. Richard started from the Industrial Revolution and moved to the Information Age, showing how each major technological shift triggered steep GDP growth. With the advent of AI, particularly language models and autonomous agents, another leap is expected — not because of chatbots alone, but because of how AI enables new services and levels of accessibility.

Richard emphasized the potential of AI in a variety of fields as:

  • Medical expertise: available anywhere, anytime,
  • Personalized education: tuned to individual learners’ needs,
  • Elder care and assistance: affordable, consistent, and scalable,
  • Scientific collaboration: AI as an assistant for tackling climate change, medicine, and energy.

To illustrate the transformative potential of AI agents, Richard recalled his experience at D.E. Shaw Research working on the protein folding challenge. Fifteen years ago, D.E. Shaw Research designed a line of purpose-built supercomputers, called Anton 1 & 2, to accelerate the physics calculations required to model the forces—electrostatics, van der Waals, covalent bonds—acting on every atom in a protein, across femtosecond time steps. It was like brute-forcing nature in silicon, and it worked.

In late 2020, DeepMind tackled the same problem, using AI. Instead of simulating molecular physics in fine-grained time steps, they trained a model and succeeded. Their AlphaFold system was able to predict protein structures with astonishing accuracy, bypassing a decade of painstaking simulation with a powerful new approach, and rightly earned one of the highest recognitions in science, a Nobel Prize.

AI is no longer just mimicking human language or playing games. It’s becoming a tool for accelerated discovery, capable of transformative contributions to science.

Scaling Laws and Predictable Progress

The talk then shifted gears to a foundational law: increasing compute power leads to improved model capabilities. In fact, the scaling of compute has been central to AI’s progress.

In 2020, OpenAI observed that increasing compute consistently improves model quality, a relationship that appears as a straight line on a log-log plot. The relationship carries over to practical tasks, such as solving coding and math problems. The evolution of GPT is living proof of this dependency:

  • GPT-1 enabled state-of-the-art word prediction.
  • GPT-2 generated coherent paragraphs.
  • GPT-3 handled few-shot learning.
  • GPT-4 delivered real-world utility across domains.

To exemplify this, Richard pointed to the MMLU benchmark, a rigorous suite of high school and university-level problems. Within just two years, GPT-4 was scoring near 90%, showing that exponential improvements were real — and happening fast.
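
The “straight line on a log-log plot” simply reflects a power-law relationship. A small sketch, with made-up constants rather than OpenAI’s fitted values, shows why the slope is constant on log-log axes:

import numpy as np

a, alpha = 10.0, 0.05                      # illustrative constants, not fitted values
compute = np.logspace(0, 8, 9)             # arbitrary compute units
loss = a * compute ** (-alpha)             # power-law scaling: loss falls as compute grows
log_slope = np.diff(np.log(loss)) / np.diff(np.log(compute))
print(log_slope)                           # constant -alpha, i.e., a straight line on log-log axes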

The Infrastructure Demands of AI

The compute required to train these large models has been increasing at eye-popping rates — 6.7X annually pre-2018, and still 4X per year since then, far exceeding Moore’s Law.

Despite industry buzz about smaller, distilled models, Richard argued that scaling laws are far from dead. Reasoning models benefit from compute not only at training time, but also at inference time — especially when using techniques like chain-of-thought prompting. In short, better thinking still requires more compute.

This growth in demand is reflected in investment. OpenAI alone is committing hundreds of billions toward infrastructure. The industry as a whole is approaching $1 trillion in total AI infrastructure commitments.

The result of all this investment is a new class of infrastructure that goes beyond what we used to call “warehouse-scale computing” for serving web services. Now we must build planet-spanning AI systems to meet the needs of training and deploying large models. These training centers don’t just span racks or rooms—they span continents. Multiple clusters, interconnected across diverse geographies, collaborating in real time to train a single model. It’s a planetary-scale effort to build intelligence.

Designing AI Systems

Today, we must focus on the full stack of AI compute: the model, the compiler, the chip, the system, the kernels—every layer matters. Richard recalled Amdahl’s Law, namely that optimizing a single component in the chain doesn’t guarantee a proportional improvement in overall performance. To make real gains, we have to improve the stack holistically—software, hardware, and systems working in concert.
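
Amdahl’s Law makes the point quantitatively; a quick sketch with arbitrary numbers shows how a 10x speedup of one component yields far less than 10x overall:

def amdahl_speedup(fraction_improved, component_speedup):
    # Overall speedup when only a fraction of total time benefits from the improvement.
    return 1.0 / ((1.0 - fraction_improved) + fraction_improved / component_speedup)

print(amdahl_speedup(0.5, 10.0))  # ~1.8x overall, despite a 10x faster component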

Within this vertically integrated stack, one of the most difficult components to design is the chip accelerator. Designing the right hardware means striking a careful balance across multiple factors. It’s not just about raw compute power—it’s also about throughput, and in some cases, latency. Sometimes batching is acceptable; other times, real-time response matters. Memory bandwidth and capacity remain absolutely critical, but increasingly, so does bandwidth between dies and between chips. And perhaps most importantly, network bandwidth and latency are becoming integral to system-level performance.

Peak performance numbers often advertised are rarely achieved in practice. The real value comes from the delivered performance in processing AI software workloads, and that requires full-stack visibility: from compiler efficiency to how instructions are executed, where bottlenecks form, and where idle cycles (the “white spaces”) appear. Only then can realistic system throughput and latency be assessed. Co-design across model, software, and hardware is absolutely critical.

Richard emphasized a critical yet often overlooked aspect of scaling large infrastructure: reliability. Beyond raw performance, he stressed the importance of keeping massive hardware clusters consistently operational. From hardware faults and rare software bugs triggered in edge-case conditions to network instability—such as intermittent port flaps—and physical data center issues like grid power fluctuations, cooling failures, or even human error, he argued that every layer of the stack must be designed with resilience in mind, not just speed or scale.

Resiliency is perhaps the most underappreciated challenge. For AI systems to succeed, they must be reliable across thousands of components. Unlike web services, where asynchronous progress is common, AI training jobs are synchronous — every node must make progress together. One failure can stall the entire job.

Thus, engineering resilient AI infrastructure is not just important, it’s existential.

From Chip Design to Agentic Collaboration

The keynote closed with a return to the world of chip design. Richard offered a candid view of the challenges in hardware timelines — often 12 to 24 months — and contrasted this with the rapid cadence of ML research, which can shift in weeks.

His dream? Compress the design cycle dramatically.

Through examples from his own research, including reinforcement learning for macro placement and graph neural networks for test coverage prediction, he illustrated how AI could help shrink the chip design timeline. And now, with large language models in the mix, the promise grows further.

He recounted asking ChatGPT to design an async FIFO for clock domain crossings (CDC) — a common interview question from his early career. The model produced functional SystemVerilog code and even generated a UVM testbench. While not yet ready for full-chip design, it marks a meaningful step toward co-designing hardware with AI tools.

This led to his final reflection: In the age of intelligence, engineers must think beyond lines of code. With tools so powerful, small teams can accomplish what once took hundreds of people.

Richard concluded by accentuating that the next era of AI will be shaped by engineers like those attending SNUG, visionaries whose work in compute, architecture, and silicon will define the future of intelligence. He closed with a call to action: “Let’s keep pushing the boundaries together in this incredibly exciting time.”

Key Takeaways
  • AI capabilities are accelerating — not just in chatbot quality, but in math, reasoning, science, and infrastructure design.
  • Scaling laws remain reliable — compute continues to drive capability in predictable and exponential ways.
  • The infrastructure must evolve — to handle synchronous workloads, planetary-scale compute, and the resiliency demands of large-scale training.
  • Hardware co-design is essential — model, software, and system improvements must be approached holistically.
  • AI for chip design is here — not yet complete, but showing promise in coding, verification, and architecture exploration.
Also Read:

SNUG 2025: A Watershed Moment for EDA – Part 1

DVCon 2025: AI and the Future of Verification Take Center Stage

The Double-Edged Sword of AI Processors: Batch Sizes, Token Rates, and the Hardware Hurdles in Large Language Model Processing


Emerging NVM Technologies: ReRAM Gains Visibility in 2024 Industry Survey
by Daniel Nenni on 04-30-2025 at 11:00 am


A recent survey of more than 120 anonymous semiconductor professionals offers a grounded view of how the industry is evaluating non-volatile memory (NVM) technologies—and where things may be heading next.

The 2024 NVM Survey, run in late 2024 and promoted through various semiconductor-related platforms and portals including SemiWiki, drew responses from engineers, architects, and decision-makers across North America, Europe, and Asia. It focused on how memory IP is being selected, which technologies are under review, and what factors matter most to the people making those calls.

81% of respondents said they’re currently evaluating or have previously used NVM IP. These are teams with real-world design decisions in motion. The respondent base included a mix of semiconductor vendors, IP companies, and system developers—ranging from large global firms to focused design teams. Job titles covered everything from engineers to CTOs.

When asked about emerging NVM types, ReRAM (Resistive RAM) ranked among the most recognized technologies. Over 60% of respondents were familiar with it, placing it slightly ahead of MRAM. While embedded flash remains dominant, newer options like ReRAM are clearly on the radar as potential alternatives. That recognition doesn’t guarantee adoption. But it does indicate that ReRAM is part of the memory conversation for more companies than in years past.

A notable number of respondents expect to select NVM IP within the next six to 12 months. Some respondents are also evaluating multiple NVM options in parallel, which reflects a shifting landscape. Cost, power, integration complexity, and endurance are all forcing companies to think beyond the status quo.

When asked about the criteria driving their NVM IP selection, respondents cited power efficiency, reliability, integration flexibility, and scalability. Two factors stood out: reliability (42%) and high-temperature performance (37%). Reliability shows up in two columns—technical and commercial—which makes sense. Especially in markets like automotive, industrial, and IoT, that’s not negotiable.

Respondents also shared what’s not working with existing solutions. Top issues included limited endurance, high power consumption, and complex integration workflows. These pain points explain the interest in exploring new NVM types. But most emerging options still have hurdles to clear—scalability, ecosystem maturity, and total cost of ownership being the most cited.

Survey participants are building for a wide range of markets, with a few recurring themes: IoT, where power efficiency and size matter most; automotive, where memory must survive heat and stress; and AI/ML, where fast, reliable access drives performance. These are sectors with sharp constraints—and they’re forcing design teams to re-evaluate long-held assumptions about memory.

The survey also asked how professionals stay informed. The most common answers: technical content from vendors, peer recommendations, and webinars or conference sessions. That may not surprise anyone, but it reinforces a key point: decisions are being made by people actively looking for clarity, not just headlines.

This year’s survey shows a market in transition. Traditional NVM, notably flash, isn’t going anywhere just yet, but it’s no longer the only path forward. Newer technologies—like ReRAM—are being seriously evaluated, something more possible now that major foundries like TSMC are offering ReRAM IP as a main part of their portfolio.

There will be another survey later this year. Stay tuned to see how things progress.

Also Read:

Designing and Simulating Next Generation Data Centers and AI Factories

How Cadence is Building the Physical Infrastructure of the AI Era

Achieving Seamless 1.6 Tbps Interoperability for High BW HPC AI/ML SoCs: A Technical Webinar with Samtec and Synopsys


Podcast EP300: Next Generation Metalization Innovations with Lam’s Kaihan Ashtiani
by Daniel Nenni on 04-30-2025 at 10:00 am

Dan is joined by Kaihan Ashtiani, Corporate Vice President and General Manager of atomic layer deposition and chemical vapor deposition metals in Lam’s Deposition Business Unit. Kaihan has more than 30 years of experience in technical and management roles, working on a variety of semiconductor tools and processes.

Dan explores the challenges of metallization for advanced semiconductor devices with Kaihan, where billions of connections must be patterned reliably to counteract heat and signal integrity problems. Kaihan describes the move from chemical vapor deposition to the atomic layer deposition approach used for advanced nodes. He also discusses the motivations for the move from tungsten to molybdenum for metallization.

He explains that thin film resistivity challenges make molybdenum a superior choice, but working with this material requires process innovations that Lam has been leading. Kaihan describes the ALTUS Halo tool developed by Lam and the ways this technology addresses the challenges of metallization patterning for molybdenum, both in terms of quality of results and speed of processing.

The views, thoughts, and opinions expressed in these podcasts belong solely to the speaker, and not to the speaker’s employer, organization, committee or any other group or individual.


Feeding the Beast: The Real Cost of Speculative Execution in AI Data Centers
by Jonah McLeod on 04-30-2025 at 10:00 am


For decades, speculative execution was a brilliant solution to a fundamental bottleneck: CPUs were fast, but memory access was slow. Rather than wait idly, processors guessed the next instruction or data fetch and executed it ‘just in case.’ Speculative execution traces its lineage back to Robert Tomasulo’s work at IBM in the 1960s. His algorithm—developed for the IBM System/360 Model 91—introduced out-of-order execution and register renaming. This foundational work powered performance gains for over half a century and remains embedded in most high-performance processors today.

But as workloads have shifted—from serial code to massively parallel AI inference—speculation has become more burden than blessing. Today’s data centers dedicate massive silicon and power budgets to hiding memory latency through out-of-order execution, register renaming, deep cache hierarchies, and predictive prefetching. These mechanisms are no longer helping—they’re hurting. The effort to keep speculative engines fed has outpaced the benefit they provide.

It’s time to rethink the model. This article explores the economic, architectural, and environmental case for moving beyond speculation—and how a predictive execution interface can dramatically reduce system cost, complexity, and energy use in AI data centers. See Fig. 1, which shows a side-by-side comparison of integration costs per module: predictive interface SoCs eliminate the need for HBM3 and complex speculative logic, slashing integration cost by more than 3×.

When IBM introduced Tomasulo’s algorithm in the 1960s, “Think” was the company’s unofficial motto—a call to push computing forward. In the 21st century, it’s time for a new mindset, one that echoes Apple’s challenge to the status quo: “Think Different.” Tomasulo changed computing for his era. Today, Dr. Thang Tran is picking up that torch—with a new architecture that reimagines how CPUs coordinate with accelerators. Predictive execution is more than an improvement—it’s the next inflection point.

Figure 1: Per-Module Cost Breakdown – Grace Hopper Superchip (GH200) vs. Predictive Interface SoC

Freeway Traffic Analogy: Speculative vs. Predictive Execution

Imagine you’re driving on a crowded freeway during rush hour. Speculative execution is like changing lanes the moment you see a temporary opening—hoping it will be faster. You swerve into that new lane, pass 20 cars… and then hit the brakes. That lane just slowed to a crawl, and you have to switch again, wasting time and fuel with every guess.

Predictive execution gives you a drone’s-eye view of the next 255 car lengths. You can see where slowdowns will happen and where the traffic flow is smooth. With that insight, you plan your lane changes in advance—no jerky swerves, no hard stops. You glide through traffic efficiently, never getting stuck. This is exactly what predictive interfaces bring to chip architectures: fewer stalls, smoother data flow, and far less waste.

Let’s examine the cost of speculative computing in current hyperscaler designs. The NVIDIA Grace Hopper Superchip (GH200) integrates a 72-core Grace CPU with a Hopper GPU via NVLink-C2C and feeds them using LPDDR5x and HBM3 memory, respectively. While this architecture delivers impressive performance, it also incurs massive BoM costs due to its reliance on HBM3 high-bandwidth memory (96–144 GB), CoWoS packaging to integrate GPU and HBM stacks, deep caches, register renaming, warp scheduling logic, and power delivery for high-performance memory subsystems.

GH200 vs. Predictive Interface: Module Cost Comparison

| GH200 Module Components                          | Cost          | Architecture with Predictive Interface                     | Cost        |
|--------------------------------------------------|---------------|------------------------------------------------------------|-------------|
| HBM3 (GPU-side)                                  | $2,000–$2,500 | DDR5/LPDDR5 memory (shared)                                 | $300–$500   |
| LPDDR5x (CPU-side)                               | $350–$500     | Interface control fabric (scheduler + memory coordination)  | $100–$150   |
| Interconnect & Control Logic (NVLink-C2C + PHYs) | $250–$350     | Standard packaging (no CoWoS)                               | $250–$400   |
| Packaging & Power Delivery (CoWoS, PMICs)        | $600–$1,000   | Simplified power delivery                                   | $100–$150   |
| Total per GH200 module                           | $3,200–$4,350 | Total cost per module                                       | $750–$1,200 |
A Cost-Optimized Alternative

An architecture with predictive interface eliminates speculative execution and instead employs time-scheduled, deterministic coordination between scalar CPUs and vector/matrix accelerators. This approach eliminates speculative logic (OOO, warp schedulers), makes memory latency predictable—reducing cache and bandwidth pressure, enables use of standard DDR5/LPDDR memory, and requires simpler packaging and power delivery. In the same data center configuration, this would yield a total integration cost of $2.4M–$3.8M, resulting in a total estimated savings: $7.8M–$10.1M per deployment.
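
For reference, a quick sketch totaling the ranges in the table above reproduces the per-module figures and the more-than-3x cost ratio cited earlier (taken at the midpoints of the ranges; the module counts behind the deployment-level totals are not broken out here):

gh200 = {"HBM3 (GPU-side)": (2000, 2500), "LPDDR5x (CPU-side)": (350, 500),
         "Interconnect & control logic": (250, 350), "Packaging & power delivery": (600, 1000)}
predictive = {"DDR5/LPDDR5 (shared)": (300, 500), "Interface control fabric": (100, 150),
              "Standard packaging": (250, 400), "Simplified power delivery": (100, 150)}

def total_range(parts):
    # Sum the low and high ends of each component's cost range.
    return sum(lo for lo, _ in parts.values()), sum(hi for _, hi in parts.values())

gh_lo, gh_hi = total_range(gh200)          # (3200, 4350) per GH200 module
pr_lo, pr_hi = total_range(predictive)     # (750, 1200) per predictive-interface module
print((gh_lo + gh_hi) / (pr_lo + pr_hi))   # ~3.9x at the midpoints of the ranges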

While the benefits of predictive execution are substantial, implementing it does not require a complete redesign of a speculative computing system. In most cases, the predictive interface can be retrofitted into the existing instruction execution unit—replacing the speculative logic block with a deterministic scheduler and timing controller. This retrofit eliminates complex out-of-order execution structures, speculative branching, and register renaming, removing approximately 20–25 million gates. In their place, the predictive interface introduces a timing-coordinated execution fabric that adds 4–5 million gates, resulting in a net simplification of silicon complexity. The result is a cleaner, more power-efficient design that accelerates time-to-market and reduces verification burden.

Is $10M in Savings Meaningful for NVIDIA?

At NVIDIA’s global revenue scale (~$60B in FY2024), a $10M delta is negligible. But for a single data center deployment, it can directly impact total cost of ownership, pricing, and margins. Scaled across 10–20 deployments, savings exceed $100M. As competitive pressure rises from RISC-V and low-cost inference chipmakers, speculative execution becomes a liability. Predictive interfaces offer not just architectural efficiency but a competitive edge.

Environmental Impact

Beyond cost and performance, replacing speculative execution with a predictive interface can yield significant environmental benefits. By reducing compute power requirements, eliminating the need for HBM and liquid cooling, and improving overall system efficiency, data centers can significantly lower their carbon footprint.

  • Annual energy use is reduced by ~16,240 MWh
  • CO₂ emissions drop by ~6,500 metric tons
  • Up to 2 million gallons of water saved annually by eliminating liquid cooling
Conclusion: A Call for Predictable Progress

Speculative execution has long served as the backbone of high-performance computing, but its era is showing cracks—both in cost and efficiency. As AI workloads scale exponentially, the tolerance for waste—whether in power, hardware, or system complexity—shrinks. Predictive execution offers a forward-looking alternative that aligns not only with performance needs but also with business economics and environmental sustainability.

The data presented here makes a compelling case: predictive interface architectures can slash costs, lower emissions, and simplify designs—without compromising on throughput. For hyperscalers like NVIDIA and its peers, the question is no longer whether speculative execution can keep up, but whether it’s time to leap ahead with a smarter, deterministic approach.

As we reach the tipping point of compute demand, predictive execution isn’t just a refinement—it’s a revolution waiting to be adopted.

Also Read:

LLMs Raise Game in Assertion Gen. Innovation in Verification

Scaling AI Infrastructure with Next-Gen Interconnects

Siemens Describes its System-Level Prototyping and Planning Cockpit


LLMs Raise Game in Assertion Gen. Innovation in Verification
by Bernard Murphy on 04-30-2025 at 6:00 am


LLMs are already simplifying assertion generation but still depend on human-generated natural language prompts. Can LLMs go further, drawing semantic guidance from the RTL and domain-specific training? Paul Cunningham (GM, Verification at Cadence), Raúl Camposano (Silicon Catalyst, entrepreneur, former Synopsys CTO and lecturer at Stanford, EE292A) and I continue our series on research ideas. As always, feedback welcome.

The Innovation

This month’s pick is Using LLMs to Facilitate Formal Verification of RTL, published on arXiv.org in 2023. The authors are from Princeton University. The paper has 27 citations according to Google Scholar.

The authors acknowledge that there is already published work on using LLMs to generate SVA assertions from natural language prompts but point out that the common approach doesn’t alleviate much of the burden on test writers, who must still reason about and express test intent in natural language. Their goal is to explore whether LLMs can generate correct SVA for a given design without any specification beyond the RTL—even when the RTL contains bugs. They partially succeed, though still depend on designer review/correction.

Paul’s view

Great find by Bernard this month – paper out of Princeton on prompt engineering to improve GPT4’s ability to generate SVAs.  The intended application is small units of code that can achieve full statement and toggle coverage based only on SVAs and model checking.

The authors refine their prompt by taking RTL for a simple FIFO module which is known to be correct and repeatedly asking GPT4 to “write SVA assertions to check correctness of ALL the functionality” of that module. After each iteration they review the SVAs and add hints to their prompt to help GPT4 generate a better result. For example, “on the postcondition of next-cycle assertions (|=>), USE $past() to refer to the value of wires.” After 23 iterations and about 8 hours of manual effort they come up with a prompt that generates a complete and correct set of assertions for the FIFO.

Next, the authors take their engineered prompt and try it on a more complex module – the page table walker (PTW) of an opensource RISC-V core. They identify a recent bug fix to the PTW and take an RTL snapshot from before that bug fix. After calling GPT4 8 times (for a total of 80 SVAs generated), they are able to get an SVA generated that catches the bug. An encouraging step in the right direction, but of course it’s much easier to find an SVA to match a known bug vs. looking at several failing auto-generated SVAs and wondering which ones are due to a real bug in the RTL vs. the SVA itself being buggy.

The latter part of the paper investigates if auto-generated SVAs can improve RTL generation: the authors take a 50-word plain text description of a FIFO queue and ask GPT4 to generate RTL for it. They generate SVAs for this RTL, manually fix any errors, and add the fixed SVAs back into the prompt. After 2 iterations of this process they get clean RTL and SVAs with full coverage. Neat idea, and another encouraging result, but I do wonder if the effort required to review and fix the SVAs was any less than the effort that would have been required to review and fix the first RTL generated by GPT4.

Raúl’s view

Formal property verification (FPV) utilizing SystemVerilog Assertions (SVA) is essential for effective design verification. Researchers are actively investigating the application of large language models (LLMs) in this area, such as generating assertions from natural language, producing liveness properties from an annotated RTL module interface, and creating a model of the design from a functional specification for comparison with the RTL implementation. This paper examines whether LLMs can generate accurate SVA for a given design solely based on the RTL, without any additional specifications – which has evident advantages. The study builds upon the previously established framework, AutoSVA, which uses GPT-4 to generate end-to-end liveness properties from an annotated RTL module interface. The enhanced framework is referred to as AutoSVA2.

The methodology involves iteratively refining prompts with rules to teach GPT-4 how to generate correct SVA (even state-of-the-art GPT-4 generates syntactically and semantically wrong SVA by default) and crafting rules to guide GPT-4 in generating SVA output, published as open-source artifacts [2]. Two examples of such rules are: “signals ending in _reg are registers: the assigned value changes in the next cycle” and “DO NOT USE $past() on postcondition of same-cycle assertion”.

The paper details extensive experimentation that identified a previously undetected bug in the RISC-V CVA6 Ariane core. AutoSVA2 also allows the generation of Register Transfer Level (RTL) code for a FIFO queue based on a fifty-word specification. To illustrate the process, here is an excerpt from the paper describing the workflow:

  1. Start with a high-level specification in English
  2. The LLM generates a first version of the RTL based on the specification, the module interface, and an instruction to generate synthesizable Verilog
  3. AutoSVA2 generates an FPV Testbench (FT) based on the RTL
  4. JasperGold evaluates the FT
  5. The engineer audits and fixes the SVA
  6. The LLM generates a new version of the RTL after appending the SVA to the previous prompt.
  7. Steps 3 to 6 are then repeated until convergence: either (a) full proof and coverage of the FT or (b) a plateau in the improvements of the RTL and SVA.
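
A minimal Python-style sketch of that loop is shown below. The helper functions (call_llm, generate_fpv_testbench, run_formal, engineer_review) are hypothetical placeholders standing in for GPT-4, AutoSVA2, JasperGold, and the human audit step; they are not real APIs.

def iterative_rtl_sva_refinement(spec, module_interface, max_iters=10):
    # Steps 1-2: first RTL version from the high-level spec and module interface.
    prompt = f"{spec}\n{module_interface}\nGenerate synthesizable Verilog."
    rtl = call_llm(prompt)
    for _ in range(max_iters):
        sva = generate_fpv_testbench(rtl)        # step 3: FPV testbench from the RTL
        results = run_formal(rtl, sva)           # step 4: formal evaluation
        sva = engineer_review(sva, results)      # step 5: engineer audits and fixes the SVA
        if results.fully_proven_and_covered:     # step 7(a): convergence
            break                                # (plateau detection, 7(b), omitted)
        prompt += "\n" + sva                     # step 6: append fixed SVA to the prompt
        rtl = call_llm(prompt)                   # regenerate RTL and repeat
    return rtl, sva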

This process differs significantly from the role of a designer or verification engineer. GPT-4’s creativity allows it to generate SVA from buggy RTL as well as create buggy SVA for correct RTL; reproducibility presents a challenge; internal signals, timing, syntax, and semantics may be partially incorrect and are partly corrected by the rules mentioned above.

On the positive side, AutoSVA2-generated properties improved coverage of RTL behavior by up to six times over AutoSVA-generated ones with less effort and exposed an undiscovered bug. The authors think that the approach has the potential to expand the adoption of FPV and pave the way for safer LLM-assisted RTL design methodologies. The Times They Are A-Changin’?

Also Read:

High-speed PCB Design Flow

Perspectives from Cadence on Data Center Challenges and Trends

Designing and Simulating Next Generation Data Centers and AI Factories