ML and Multiphysics Corral 3D and HBM
by Bernard Murphy on 01-07-2025 at 6:00 am

3D design with high-bandwidth memory (HBM) stacks has become essential for leading-edge semiconductor systems across multiple applications. Hyperscalers depend on large AI accelerator cores supported by 100GB or more of in-package HBM to handle trillion-parameter AI models. Autonomous Drive (AD) vehicles may handle smaller individual tasks, but many more of them, spanning multiple levels of sensing and fusion, computer vision, graphics, safety, security, and communication objectives. Similar requirements are appearing in aerospace and other applications. All require 3D integration with HBM to maximize performance while minimizing latency and power consumption. Manufacturing technologies to build such systems are already in place, but optimizing such a design introduces new physics challenges in performance and reliability.

The Physics of Large System Design

There’s only so much circuitry you can fit on a single silicon die, even in the most advanced processes. Bigger designs must be split across multiple chiplets (die), which can now be connected very effectively inside a single package, greatly reducing the performance and power hit compared to an equivalent circuit split between packaged components on a PCB. The advantage is especially clear for large memories implemented as stacked HBM chiplets within the same package, whose access latencies are greatly improved over off-chip DRAM.

Managing the physics of large semiconductor designs was already prominent before multi-chiplet designs appeared. Beyond the usual design objectives (functionality, performance, power, and area/cost), a product design team must guard against: overheating, with the potential to damage or compromise the system; inadequate power distribution for functional demand, undermining performance and reliability; electronic crosstalk impacting signal integrity; and die/chiplet warping through heating, resulting in broken bond connections. Tools to analyze these factors for a single die are already familiar and a well-understood strength for Ansys: power integrity (EM/IR) analysis, thermal analysis, signal integrity, and mechanical analysis coupled with thermal.

Scaling multiphysics analysis up to multi-chiplet designs introduces new challenges. Thermal becomes a bigger issue, especially in stacked structures where thin chiplet substrates provide little thermal isolation between layers. This analysis problem isn’t just bigger than an already complex single-die multiphysics analysis; in a 3D/HBM structure all these factors are coupled and must be co-optimized.

Multiphysics, Coupling and ML

Lang Lin (Principal Product Manager at Ansys) recently gave an excellent webinar talk on this topic, illustrating with emphasis on an HBM stack sitting on top of a logic die, next to CPUs or GPUs on the interposer. One point he made is that traditional PVT (process-voltage-temperature) corner analysis for a single die won’t necessarily work for analysis of a complete structure of stacked chiplets. In an HBM stack, chiplets may have different assembly corners due to coupling effects. One might best be assigned a temperature of 90 degrees, at 0.8 volts and a fast-fast process. Another (in the same stack) should be assigned a temperature of 100 degrees, 0.9 volts, and a typical-typical process. And so on, down the stack. Which raises an obvious question: how do you figure this out?
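
To make this concrete, here is a minimal sketch of per-chiplet assembly corners captured as data rather than as one global PVT corner. The values echo Lang’s examples above; the structure and field names are my own illustration, not an Ansys format.

```python
from dataclasses import dataclass

# Hypothetical illustration only: one corner record per chiplet in the stack.
@dataclass
class AssemblyCorner:
    chiplet: str
    temp_c: float    # local temperature, set by thermal coupling in the stack
    vdd: float       # local supply voltage after IR drop
    process: str     # per-die process corner

stack_corners = [
    AssemblyCorner("hbm_die_0", temp_c=90.0,  vdd=0.80, process="FF"),
    AssemblyCorner("hbm_die_1", temp_c=100.0, vdd=0.90, process="TT"),
    # ...one entry per chiplet, on down the stack
]
```

The point is that these per-chiplet values are outputs of a coupled analysis, not inputs you can read from a standard corner table.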

A key point Lang made is that physics factors in this tightly packed environment are strongly coupled and all centered around thermal considerations.

Coupling implies that you can’t just optimize for one factor at a time. Temperature affects the power delivery network, timing, and signal integrity, which in turn can affect temperature. In a heterogeneous integration with HBM, CPUs, GPUs, and so on, optimizing across all these factors by hand would become a nightmare. Converging to an optimal physics solution requires (no surprise) intelligent and automated guidance. Ansys accomplishes this through their OptiSLang system, which searches intelligently through vast parametric spaces to find robust solutions automatically. I’m convinced this is the way of the future in system-level optimization tasks of all kinds.
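
As a toy illustration of why coupling forces iteration (my own sketch, not Ansys’s algorithm, with invented model constants): power depends on temperature through leakage, and temperature depends on power, so the analysis must loop to a self-consistent operating point before any outer search engine can compare design candidates.

```python
def solve_power(vdd, temp_c):
    """Toy power model: dynamic power plus leakage that grows with temperature."""
    dynamic = 10.0 * vdd ** 2                  # W, C*V^2*f lumped into one constant
    leakage = 0.5 * 1.04 ** (temp_c - 25.0)    # W, roughly exponential in temperature
    return dynamic + leakage

def solve_thermal(power_w, theta_ja=2.0, ambient_c=25.0):
    """Toy thermal model: junction temperature from a lumped thermal resistance."""
    return ambient_c + theta_ja * power_w

def coupled_point(vdd, tol=0.01, max_iters=100):
    """Iterate thermal <-> power until the two fields are self-consistent."""
    temp = 25.0
    for _ in range(max_iters):
        power = solve_power(vdd, temp)
        new_temp = solve_thermal(power)
        if abs(new_temp - temp) < tol:
            return power, new_temp
        temp = new_temp
    raise RuntimeError("thermal/power loop did not converge")

# An OptiSLang-style engine would replace this simple voltage sweep with an
# intelligent search over many more parameters at once.
for vdd in (0.7, 0.8, 0.9):
    p, t = coupled_point(vdd)
    print(f"vdd={vdd:.1f} V -> {p:.2f} W at {t:.1f} C")
```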

Ground Proofs

Lang illustrated with a couple of real-world examples, the first a collaboration with TSMC on HBM optimization for warpage/stress in assembling a stack, which can result in yield loss. Here thermal cycling is based on manufacturing requirements rather than on use cases; however, I would think the temperature ranges they show in manufacturing are at least as stressful as in mission mode. TSMC and Ansys used the flow to estimate warpage/stress at each assembly step and come up with an optimal manufacturing assembly sequence.

In another case study, Ansys worked with a different company to optimize the signal integrity of high-speed HBM interconnect (connecting to compute chiplets) on a 2.5D interposer. Here they were able to propose an optimal routing pattern for the multi-bit interconnect to minimize (transmission line) overshoot.

Pretty impressive. You can register to watch the webinar HERE.

Also Read:

A Master Class with Ansys and Synopsys, The Latest Advances in Multi-Die Design

Synopsys-Ansys 2.5D/3D Multi-Die Design Update: Learning from the Early Adopters

Ansys and eShard Sign Agreement to Deliver Comprehensive Hardware Security Solution for Semiconductor Products


CES 2025 and all things Cycling
by Daniel Payne on 01-06-2025 at 10:00 am

UrbanGlide 3

CES 2025 was held from January 7-10, once again in Las Vegas, so I attended virtually to gather all of the tech and trends related to cycling, which is becoming more electrified each year. E-bikes return as the biggest category, and from my cycling rides I can see how popular these bikes are becoming in Oregon for commuters and people who enjoy being outdoors without having to sweat as much. The latest e-bikes have electric motors and batteries neatly hidden and integrated into the frame or rear hub, along with a display on the handlebars to let you know how much charge remains and other metrics.

E-Bikes

This is still the fastest-growing and most profitable sector in the cycling world, with higher ASPs than pedal-powered bikes.

Livall conversion kit

With this accessory your regular bike can be converted into an e-bike, as it connects to the seat tube and drives your rear wheel by friction. It even earned a 2025 Innovation Award from CES. I question the efficiency of power transfer and the robustness of the design, but marvel at the retrofit market.

LIVALL PikaBoost 2

Urtopia – Titanium e-bike

This e-bike has a strong titanium frame, great for commuting or gravel riding, is compatible with traditional shifting, and weighs only 23.8 pounds.

Urtopia Titanium Zero

Urtopia

Yes, with AI technology powered by ChatGPT, this bike provides navigation and safety alerts and gives motivation to stay fit. It comes in three models.

Fusion GT

Vanpowers

Four e-bike models were shown this year at CES, and the UrbanCross-Ultra provides a 60 mile range.

UrbanCross-Ultra

For off-road cycling there’s the GrandTeton-Ultra with a 65 mile range.

GrandTeton-Ultra

If you prefer a fat tire e-bike, then there’s the Cycanon with a 60 mile range.

Cycanon

Urban commuting with shocks and a rack is covered by the UrbanGlide-Standard e-bike.

UrbanGlide-Standard

Muon

This German vendor has e-bikes in several styles, all named after elementary particles: Elon, Axion, Lepton, Bradyon. Some of their models use a belt drive instead of a chain for easier maintenance.

Elon

ENGWE

This brand has multiple e-bike categories: commuter, fat-tire off-road, folding, step-through, e-scooter. The M20 2.0 looks more like a motorcycle, has an 80 mile range, and comes with full suspension.

M20 2.0

Blaupunkt

I remember this brand mostly for their car audio gear; they’ve expanded into Class 2 e-bikes with a 40 mile range that are also foldable.

Fiene eBike

SOL

A 2025 CES Innovation Award honoree, SOL showed off their Pocket Rocket S, capable of 55 mph speeds and a 70 mile range. The design is rather artistic, making it look like you’re sitting on a cylindrical rocket.

Pocket Rocket S

HIMIWAY

This Shanghai-based company offers a full range of e-bikes: mountain, kids, city, cargo, folding, step-through, full suspension, motorbike.

City eBike

Heybike

They showed several new models of e-bikes this year: Polaris, Helio, Alpha (commuter), X (folding).

Helio Series

AIMA

With 400 dealers around the globe, this brand offers an e-bike with 20-inch fat tires, the Big Sur, and a step-thru bike with a 750W motor, the Santa Monica.

Santa Monica

OKAI

Another company from China with a presence in Europe and North America, OKAI showed off a line of five e-bikes and eight e-scooters this year.

OKAI

Bosch

From Germany we have e-bike motors and apps.

Bosch Technology

C-Star Industrial Limited

From Shenzhen, China, this 24-year-old company showed off its e-scooters.

CS-P14

Komda

Based in Hong Kong, this vendor has a range of e-bikes: cargo, mountain bike, folding, city. They also offer traditional bikes: kids, folding, city.

Electric Cargo Bike

Moqous

From South Korea comes a new company that provides both a folding e-bike and a folding pedal bike, designed for people who want to conserve space and have short commutes.

Pop-Cycle-E

Oh Wow

Offering both e-bikes and e-trikes, this California company has many models to choose from.

Conductor Trike

Altovetti

It sounds Italian, but this is a Chinese brand with a step-through e-bike, available in a variety of colors, designed for commuting.

Altovetti

Rundeer

Designed for off-road and commute use, this company offers three models.

Rundeer Starry Sky

Spard

All of these e-bikes require a battery and that’s where Spard comes in, supplying the many different battery form factors: external, in-tube, dual system, bottle cage, integrated, semi-integrated, rear carrier.

e-bike battery

Pedal Bicycles

Biky

This brand has kids’ bikes that look like a fun way to get started in cycling.

Biky Air 12

Trainers

On rainy or cold days a cyclist can opt to ride indoors using some type of trainer setup.

Speediance

At first this looked like just another Peloton competitor, but it actually integrates with all the popular platforms: Zwift, Strava, Apple Health, Samsung, Garmin.

VeloNix

Yesoul

This Chinese vendor offers a few models of spin bike with an integrated display, where you follow sessions to stay fit.

Yesoul G1M Max

Garmin

This American brand has a range of indoor trainers from low-cost on-wheel, up to their smart trainers that work with popular apps like Zwift.

Tacx Flux 2

VirtuRide

This spin bike comes with a VR headset, so you get to view real terrain while your bike tilts side to side and even pitches up and down hills.

VirtuRide

Real Design Tech

Smart rollers with a sturdy arm to hold your bike upright, keeping you from falling over, and compatible with the most popular indoor cycling app, Zwift.

Ultiracer

Sunny

With over 20 varieties of trainers on offer, you will find something to fit your taste and budget.

Smart Pro Belt Drive Indoor Cycling Exercise Bike

Accessories

Alps Alpine

This rear-facing camera sends a live view of traffic coming from behind you to a display on your handlebars, keeping you safer around motorists.

RS 1000 Bike Camera

Aizip

Trek Bike and Aizip collaborated to create a demo of a Small Language Model assistant for biking, bringing together Aizip technology and Trek Bike domain expertise. The Trek e-MTB has an Aizip demo helmet that makes off-road rides and routes easier to plan so you don’t get lost, recommends new routes, offers coaching, and provides a safety alert in case of a crash.

Aizip and Trek

Livall

Smart helmets for road and mountain bikers include lighting to indicate braking, fall detection with SOS alert, voice navigation, and an alarm for security. My friends with rear lights on their helmets really improve their visibility to motorists, as the high position of the lights puts them closer to eye level.

BH60SE Neo

SuperTooth

This gizmo clips onto your existing bike helmet and provides a hands-free Bluetooth connection for listening to music, making phone calls, or as an intercom, all while cancelling out wind noise.

Roamee

aabo

Instead of measuring heart rate on a chest-strap monitor, why not use a ring that also tracks your sleep, stress and other physical activities?

aaboRing

CRNK

Protecting your head while cycling is paramount, and CRNK has designed over a dozen helmet models, some with added lighting to improve visibility.

Genetic Alpha

Circular

Another ring-based fitness device to monitor your heart rate, with haptic feedback and an app to show you what’s happening throughout the day and even during sleep.

Circular Pro

Garmin

Garmin is a long-time provider of bike computers, ranging from entry-level models to the flagship 1050 device.

Garmin 1050

Riduck

Get some pro-level cardio training with this AI-based app, which analyzes the heart rate monitor and cycling power meter data recorded in Strava. Learn what your FTP, VO2MAX, and FATMAX numbers are.

Riduck app

Bosch

Locking your e-bike makes sense, but even more security comes from Bosch’s eBike Lock system, which turns your battery off when you step away from the e-bike. A thief can still cut your physical bike lock off, but will get no motor support when riding away because the battery remains off. You install an app on your phone, or use the Bosch Kiox or Nyon removable e-bike displays, as your key to lock and unlock the battery.

eBike Lock

Related Blogs


Can LELE Multipatterning Help Against EUV Stochastics?
by Fred Chen on 01-06-2025 at 6:00 am

Previously, I indicated how detrimental stochastic effects at pitches below 50 nm should lead to reconsidering the practical resolution limit for EUV lithography [1]. This is no exaggeration, as stochastic effects were observed at 24 nm half-pitch several years ago [2,3]. This leads to the question of whether using multipatterning to get below the practical resolution limit can be of any help in avoiding these stochastic effects.

Multi-patterning in its most basic form involves forming a layer pattern with at least two mask exposures. In the simplest case, the LELE (Litho-Etch-Litho-Etch) approach, the target layer pattern is divided into two portions, which are combined by interleaving features such as lines. This is necessary when a single exposure cannot resolve (without defects, deformation, or feature loss) the minimum pitch between two features. The two features separated by this minimum pitch must then be exposed separately. For example, two 15 nm lines separated by a 30 nm pitch would need two exposures: a first exposure to pattern one 15 nm line in resist, followed by an etch, and a second exposure to pattern the second 15 nm line in a subsequently recoated resist, followed by another etch.
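
As a minimal sketch of the decomposition step (my illustration), interleaving the target lines means each exposure only has to resolve twice the minimum pitch:

```python
# Hypothetical LELE decomposition: split 15 nm lines on 30 nm pitch into two
# masks, each with lines on a relaxed 60 nm pitch.
target_pitch_nm = 30
line_centers = [i * target_pitch_nm for i in range(8)]

mask_a = line_centers[0::2]   # first litho-etch pass: lines on 60 nm pitch
mask_b = line_centers[1::2]   # second litho-etch pass (after recoat): 60 nm pitch

print(mask_a)   # [0, 60, 120, 180]
print(mask_b)   # [30, 90, 150, 210]
```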

Given that 15 nm lines on 30 nm pitch are known to be impacted by stochastic effects, it is natural to ask whether the same lines fare better on 60 nm pitch. Figure 1 shows a qualitative comparison. Without the actual mask structure details being known, a classical binary grating is used to represent the mask, and the illumination is the dipole used for 30 nm pitch, which generates two beams from the mask in the 30 nm pitch case and three beams from the mask (the 1st order being the middle beam) in the 60 nm pitch case. The same simulation model and conditions were used as in [1].

Figure 1. A 15 nm drawn line on 60 nm pitch (left) and 30 nm pitch (right). The same illumination is used for both cases (dipole for 30 nm pitch). The lines were modeled using binary grating features as the mask pattern. Under the shown modeling conditions, the 60 nm pitch is slightly better due to the higher photon and electron density in the exposed area.

The photon density happens to be higher for the 15 nm drawn line on 60 nm pitch. However, the NILS of the 15 nm line on 60 nm pitch is lower than on 30 nm pitch (1.7 vs. 2.8, even without blur). This means more pixels within the exposed region have a higher chance of the local photon density dropping below the threshold and becoming defective. The dark defect pixel % is a little lower for the 60 nm pitch compared to the 30 nm pitch (~23% compared to ~30%). While it visually looks a little better, the edge definition from the scattered electron density is still poor. In addition, the stochastic defect rate has been found to be higher when the CD is less than the half-pitch [5,6]. Therefore, we must conclude that LELE patterning does not help avoid detrimental stochastic effects. The key reason is that the CD being printed is still too small.
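
For intuition on where dark-defect pixel percentages come from, here is a minimal Poisson shot-noise sketch. This is my illustration, not the simulation model of [1]; the dose, pixel size, relative intensities, and threshold are invented stand-ins. A pixel in the exposed region turns defective when its absorbed photon count falls below the printing threshold.

```python
import numpy as np

# ~92 eV EUV photons: 1 mJ/cm^2 corresponds to roughly 0.68 photons per nm^2.
PHOTONS_PER_NM2_PER_MJ = 0.68

def dark_defect_fraction(dose_mj_cm2, rel_intensity, threshold=0.5, trials=200_000):
    """Fraction of 1 nm^2 pixels whose photon count lands below the threshold."""
    mean_clear = dose_mj_cm2 * PHOTONS_PER_NM2_PER_MJ    # photons in a fully open area
    counts = np.random.poisson(mean_clear * rel_intensity, trials)  # shot noise
    return np.mean(counts < threshold * mean_clear)

# A lower NILS leaves more of the exposed region near the threshold intensity;
# the relative intensities below are invented purely to show the trend.
for label, rel_i in [("30 nm pitch", 0.60), ("60 nm pitch", 0.66)]:
    print(label, f"~{dark_defect_fraction(60, rel_i):.1%} dark defect pixels")
```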

A way around this is to use spacers to define the CD. This is used in self-aligned double patterning (SADP) as well as SALELE (self-aligned LELE) [6]. This allows larger features to be printed by the exposure, e.g., 30 nm instead of 15 nm on 60 nm pitch. Interestingly, at around 40 nm pitch, double patterning by SADP or SALELE may overlap for DUV and EUV, since stochastic effects still look severe at 40 nm pitch [1], while 80 nm pitch is achievable by DUV single exposure [7].

References

[1] F. Chen, Stochastic Effects Blur the Resolution Limit of EUV Lithography.

[2] D. van den Heuvel et al., “Process Window Discovery Methodology Development for Advanced Lithography,” ASMC 2016.

[3] S. Das et al., “E-beam inspection of single exposure EUV direct print of M2 layer of N10 node test vehicle,” Proc. SPIE 10959, 109590H (2019).

[4] P. de Bisschop and E. Hendrickx, “On the dependencies of the stochastic patterning-failure cliffs in EUV lithography,” Proc. SPIE 11323, 113230J (2020).

[5] J. Church et al., “Fundamental characterization of stochastic variation for improved single-expose extreme ultraviolet patterning at aggressive pitch,” J. Micro/Nanolith. MEMS MOEMS 19, 034001 (2020).

[6] F. Chen, SALELE Double Patterning for 7nm and 5nm Nodes.

[7] H. Hu et al., “k1=0.266 immersion lithography patterning and its challenge for NAND FLASH,” CSTIC 2015.

Also Read:

Stochastic Pupil Fill in EUV Lithography

Application-Specific Lithography: Patterning 5nm 5.5-Track Metal by DUV

Why NA is Not Relevant to Resolution in EUV Lithography


WEBINAR: 2025 Semiconductor Year in Preview
by Daniel Nenni on 01-03-2025 at 6:00 am

TechInsights has been in the semiconductor analysis business for more than 35 years and is THE most trusted source of semiconductor information. TechInsights started as a reverse engineering and IP analysis company but has grown into much more. I remember waiting for the teardown reports before buying electronics to make sure I knew what was inside. Now I read them to get detailed information on semiconductor process technologies.

SemiWiki blogger Scotten Jones sold his company IC Knowledge to TechInsights two years ago, and before that TechInsights bought Dan Hutcheson’s research company VLSI Research. They also acquired The Linley Group and The McClean Report, amongst others. Rest assured, our semiconductor secrets are safe with TechInsights.

To start the new year TechInsights is hosting a free webinar preview of 2025. I hope to see you there:

2025 will be an eventful year in the semiconductor industry—don’t expect the unexpected, be prepared!

Join the TechInsights experts behind The McClean Report for our latest webinar, 2025 Semiconductor Year in Preview. Get advance insight into the key events of 2025 and what they mean to your business:

Register: January 15, 2025 – 10:00 AM EST
Register: January 16, 2025 – 11:00 AM JST / KST

Key Topics to be Covered:
2025 Tariff Shake-Up: Are You Ready?

January starts with a new US administration promising to shake up tariffs—what are the scenarios you should plan for?

NVIDIA vs. AMD & Intel: AI Accelerator Showdown

In March, NVIDIA will unveil the successor to Grace Hopper, and later in the year AMD and Intel will launch next-generation AI accelerators. Will NVIDIA keep its crown?

2nm Breakthroughs: Intel, TSMC, and Rapidus Lead

2025 will be the year of 2nm and beyond, with Intel 18A and TSMC N2 coming online in the first and second half of 2025 respectively, and details of Rapidus’ 2nm process expected to emerge.

And that’s not all… Altera, Cerebras, CoreWeave, KIOXIA, and SanDisk will all woo Wall Street with new listings, Apple will launch new devices at WWDC in June, and HBM4 specifications will be announced. 2025 will be an eventful year in the industry—don’t expect the unexpected, be prepared!

Presenters:

David MacQueen, Director, Executive Insights, and James Sanders, Senior Analyst.

As Director of Executive Insights, David MacQueen is tasked with covering the semiconductor value chain to identify emerging technologies and new opportunities for research teams, while ensuring alignment across the research produced by TechInsights. He has over 20 years of experience in semiconductor-related industries and an inherent ability to contextualize data that helps clients understand the big picture.

James Sanders is a Senior Analyst at TechInsights with over 5 years of experience as an industry analyst, and 7 years of experience as a technology journalist. James has a passion for researching the innovations being made possible through high performance and quantum computing and is fascinated by application processor and system architecture design that enables these innovations. He enjoys having the ability to work closely with clients and provide insights on the value and utility of these new advancements.

About TechInsights

Regarded as the most trusted source of actionable, in-depth intelligence related to semiconductor innovation and surrounding markets, TechInsights’ content informs decision makers and professionals whose success depends on accurate knowledge of the semiconductor industry—past, present, or future.

Over 650 companies and 100,000 users access the TechInsights Platform, the world’s largest vertically integrated collection of unmatched reverse engineering, teardown, and market analysis in the semiconductor industry. This collection includes detailed circuit analysis, imagery, semiconductor process flows, device teardowns, illustrations, costing and pricing information, forecasts, market analysis, and expert commentary. TechInsights’ customers include the most successful technology companies who rely on TechInsights’ analysis to make informed business, design, and product decisions faster and with greater confidence. For more information, visit www.techinsights.com.

Also Read:

5 Expectations for the Memory Markets in 2025

VLSI Technology Symposium – Intel describes i3 process, how does it measure up?

Intel High NA Adoption


Accelerating Automotive SoC Design with Chiplets
by Kalar Rajendiran on 01-02-2025 at 10:00 am

The automotive industry is evolving rapidly with the increasing demand for intelligent, connected, and autonomous vehicles. Central to this transformation are System-on-Chip (SoC) designs, which integrate multiple processing units into a single chip for managing everything from safety systems to in-car entertainment. However, as these systems become more complex, traditional SoC designs face challenges around performance, power, and scalability. Chiplet-based architectures are now driving innovation by offering more flexible, efficient, and customizable solutions for automotive SoCs.

Cadence recently hosted a webinar on this topic, with Moshiko Emmer, a Distinguished Engineer in the company’s Silicon Solutions Group (SSG), presenting.

Benefits of Leveraging Chiplets in Automotive SoC Design

Chiplet-based designs are reshaping automotive SoC development by offering a modular and scalable approach. Each chiplet, such as CPU cores, memory units, or specialized processing units like NPUs, is a self-contained module that can be easily integrated into larger systems. As automotive systems advance, especially with ADAS, infotainment, and autonomous driving, chiplet architectures provide several key advantages.

-Enable engineers to focus on specific value-added functions, reducing development time, cost, and risk while improving time-to-market.

-Allow for highly scalable designs, meeting the varying performance and power needs of different vehicle segments.

-Ensure long-term cost efficiency and adaptability to new technologies through reuse of chiplets across multiple generations of SoCs.

By combining off-the-shelf chiplets with specialized automotive IP, manufacturers can build comprehensive solutions, benefiting from a broad ecosystem of reference designs and industry-standard IP.

Accelerating Automotive SoC Design

Adopting chiplet-based designs for automotive SoCs involves several essential efforts to ensure performance, safety, and reliability.

Chiplet Frameworks

A robust chiplet framework ensures the seamless integration of chiplets from different vendors. Standardized protocols and interfaces, such as UCIe™, streamline the integration process, allowing for more efficient chiplet-based SoC designs. Cadence’s System Chiplet framework enables designers to integrate and connect multiple chiplets in a cohesive architecture, facilitating high-performance, scalable designs tailored for automotive applications.

SoC Design Cockpit

The SoC Design Cockpit approach helps automate the design process with correct-by-construction tools, ensuring that the final system meets all performance and safety requirements. This platform enables extensibility for customizing automotive-specific features like real-time processing and high-speed data handling. For example, the Cadence System Chiplet comes with pre-designed frameworks for automotive applications, allowing engineers to quickly select the necessary chiplets and integrate them efficiently into a full SoC. The cockpit’s automated tools help reduce manual intervention, ensuring high-quality and safe designs for automotive use.

Virtual Platforms

Virtual platforms enable early software development before hardware is available, which is especially valuable for complex systems like ADAS and infotainment. Tools like the Cadence Helium™ software digital twin allow engineers to simulate hardware, test software, and avoid costly errors before physical hardware is built. By integrating Cadence’s Neo NPU chiplet, which is designed for AI and machine learning tasks, into a virtual platform, developers can simulate the performance of advanced automotive applications such as real-time object detection, predictive analytics, and autonomous driving algorithms.

Design Services and Ecosystem Collaboration

Collaborating with design services partners and leveraging off-the-shelf chiplets accelerates the integration of complex systems such as sensor fusion and machine learning. Working within a broad ecosystem of partners can also speed up the development of automotive SoCs. Cadence’s Neo NPU chiplet enables integration with machine learning workflows, supporting the development of intelligent, real-time systems for automotive applications. Together with the System Chiplet framework, these chiplets facilitate rapid prototyping and customization, accelerating time-to-market.

Chiplet Testchips: Ensuring Automotive SoC Reliability

Given the critical nature of automotive applications, ensuring the reliability and safety of chiplet-based SoCs is paramount. Chiplet testchips validate the performance and functionality of individual chiplets before integration into the full SoC. Testchips are essential for verifying that chiplets meet the functional requirements of automotive systems like ADAS and infotainment, as well as for ensuring compliance with safety standards such as ISO 26262.

Summary

Chiplet-based architectures are transforming automotive SoC design by offering scalable, customizable, and cost-efficient solutions. By leveraging Cadence’s System Chiplet and Neo NPU chiplet frameworks, as well as tools like the SoC Design Cockpit and Cadence Helium™ software digital-twin, automotive manufacturers can accelerate the development of next-generation vehicle technologies like ADAS, autonomous driving, and infotainment. Chiplet testchips further ensure the reliability and safety of these designs. As chiplet technology continues to evolve, it will unlock new opportunities for the automotive sector, driving smarter, safer, and more connected vehicles.

For more details, refer to the following:

Cadence Automotive Solutions page.

You can access this webinar on-demand from here.

Also Read:

Accelerating Simulation. Innovation in Verification

Accelerating Electric Vehicle Development – Through Integrated Design Flow for Power Modules

Compiler Tuning for Simulator Speedup. Innovation in Verification


AI PC momentum building with business adoption anticipated
by Don Dingee on 01-02-2025 at 6:00 am

And just like that, the AI PC arrived. It will be hard to miss high-profile advertising campaigns like the one just launched by Microsoft touting them. Gartner said this September that AI PCs will be 43% of all PC shipments in 2025 (with 114M units projected) and that by 2026, AI PCs will be the only choice for business laptop users. Other analysts are going with even bigger numbers. The idea of an AI-enabled personal assistant is intriguing, and with AI PC momentum building fast, waiting to create and adopt implementations may be an expensive miss. We spoke with Ceva’s Roni Sadeh, VP of Technologies in their CTO office, to get some insight. Let’s look at what an AI PC might do for users and how designers might construct one quickly.

What would an AI PC do for users?

So far, the push for AI implementations has been primarily in two areas: server-class implementations running GPU clusters or large NPU chips in the cloud, and embedded NPU chip implementations in autos, defense systems, drones, vision systems, and more applications. The debate continues between GPUs, which are powerful for AI training and inference but highly inefficient, and tuned NPU accelerators designed for efficient inference.

This bifurcation developed because embedded systems must be much smaller, use less power and cooling, and render decisions in real-time with low latency. The cloud, as powerful as it can be, also has some inherent challenges: an application might be down temporarily, latency can spike when many users hit the same platform simultaneously, and privacy and data security are unclear. However, tight constraints on embedded systems limit how much processing power they can offer.

Now, something in the middle is developing if it can get enough computing power – a use case for an AI PC as a productivity assistant for personal or business use. Three immediate advantages are clear: an AI PC would be available anytime, analysis works without sending sensitive data to the cloud, and PCs are inherently single-user, so there is no competition for resources. “It’s a great opportunity for a dedicated AI accelerator,” says Sadeh. “It would be like talking to a person, gathering data stored on an AI PC, and responding to a prompt in around a second.”

This use case already exists for personal use with mobile phones, but they lean heavily on a cloud connection for the data behind their conclusions. An AI PC could handle more data in various formats without cloud resources. It could be transformative for data analysts, researchers, and Excel power users who are used to grinding through analysis looking for something, and need to produce professional-quality documents with results rapidly.

How could designers construct AI PCs?

Of course, there are a couple of catches that merit design attention. Sadeh indicates that 40 TOPS, toward the upper end of embedded NPU chip capability today, won’t be enough for AI PCs to be useful as users throw more complex queries and more data at them. “We’ll need a few hundred TOPS in AI PCs soon and, ultimately, a few thousand,” says Sadeh. However, the power budget for designers is more or less fixed – scaling TOPS can’t come at the expense of rapid AI PC battery drain.

There is also the PC lifecycle and the question of upgrades. NPU designs for AI PCs will probably iterate very quickly, keeping pace with the speed of new AI model introductions. This pace suggests that in at least the first couple of rounds, AI PC designers will probably want to keep the NPU on an expansion module, such as M.2 or miniPCIe, instead of putting chips down on the motherboard.

Unlike embedded NPUs, which sell into various applications and require reconfigurability, Sadeh sees a closed AI PC solution. “Users will likely be unaware of the specifics of the AI model running,” he says. “Currently, 6GB of memory runs a large model, but it’s not hard to project memory needs getting larger.” The NPU, its memory, and its AI model would be fixed as shipped by the manufacturer but could upgrade if housed on an expansion module.

So, NPU chips must be small enough to fit on an expansion module form factor, fast enough to provide LLM support within a perception of good user experience, and low-power enough not to eat a battery too quickly. That’s where Ceva enters the picture with its IP product experience in AI applications.

Remember that hardware IP and software advances are cooperative in reaching higher TOPS performance. For instance, Meta has just released a new, more efficient version of its Llama model. As these newer models emerge, improved NPU tuning follows. Ceva’s NeuPro-M IP combines heterogeneous multi-core processing with orthogonal memory bandwidth reduction techniques and optimization for more data types, larger batch sizes, and improved sparse data handling. Ceva is already working with customers on creative NPU designs using NeuPro-M.

If businesses adopt the technology quickly, as Gartner anticipates, growth in AI PCs could take off. Read more of Sadeh’s thoughts on AI PC momentum in his recent Ceva blog post:

The AIPC is Reinventing PC Hardware


Happy New Year from SemiWiki
by Daniel Nenni on 01-01-2025 at 6:00 am

As SemiWiki celebrates our 14th anniversary I wanted to wish you all a happy New Year! Working in the semiconductor industry for the past 40 years has been rewarding beyond belief. Working in the trenches and traveling the world has been an education in itself, more so than any other career that I could imagine. SemiWiki has broadened that experience and continues to do so every year. For that I am forever grateful.

I always ask my podcast guests how they got started in semiconductors. This is my story:

My father was a pilot so that is what I wanted to be when I grew up. Unfortunately, he died in an airplane crash when I was 13 so that complicated my career path. I started flying after I turned 18 without telling my family. The electronics in aviation led me to computers which took me straight to semiconductors. After finishing flight school, I switched to computer science and electrical engineering and never looked back. After I graduated, I married my college sweetheart and went to Silicon Valley to make my fortune.

I remember attending my first Design Automation Conference in Albuquerque, New Mexico right after graduation. It was more of a party than a conference, so I felt right at home. The next year DAC was in Las Vegas, and I took my new bride with me since we could not really manage a honeymoon the previous year. It was an even bigger party, and I remember my wife giving me the side-eye as to my choice of careers.  She is okay with it now of course and is an integral part of SemiWiki.

Throughout the years I have worked on many different levels of compute projects. It started with mini-computers, supercomputers, desktops, laptops, smartphones, IoT, automobiles, and of course cloud computing. I have had security clearances and have traveled the world so much that I had to add additional pages to my passports. I have worked with thousands of people in a dozen different countries, what an amazing experience.

As I approached my 50th birthday I had a midlife crisis of sorts and decided I wanted to do more for the industry that richly rewarded me, so I started writing. This is when blogging was first catching on in the 2000s. I started by researching forums and other grassroots technology sites. What I found is that semiconductors did not get the media attention they deserved, and even when they did it was sensationalized to the point of disinformation, not unlike today. The authors of course were journalists, hobbyists, hackers, or people like me who were trying something different.

I told my wife I would blog once a week for a year and then decide what to do next. I quickly gathered more than 10,000 followers and saw many other bloggers come and go. Being one of the first bloggers in the semiconductor industry came with a bit of fame and some amazing industry networking opportunities, but of course I wanted more. My son and I started designing SemiWiki in the summer of 2010 and launched it in 2011. We began with the same software as the popular forums (vBulletin) but with a beta version which included a blogging interface. It was a bit buggy, but we got through it. I gathered other experienced bloggers (Dr. Paul McLellan and Daniel Payne) plus recruited some industry experts (Dr. Eric Esteve, Scotten Jones, Robert Maire, Dr. Bernard Murphy, etc.) to start blogging as well.

I can confidently say that SemiWiki was the first of its kind and is still unique. The site tagline “The Open Forum for Semiconductor Professionals” has never changed.  The mission of SemiWiki was always to give semiconductor professionals a platform to speak their minds and to network. Today we have 253,701 registered members and have published 12,670 forum threads, 9,160 blogs, 6 books, and 268 podcasts. “Mission accomplished”, I would say.

The latest ChatGPT description of SemiWiki is pretty good:

SemiWiki is an online community and collaborative platform dedicated to the semiconductor industry. It provides a space for professionals, experts, and enthusiasts to share insights, news, and discussions about various aspects of the industry, including electronic design automation (EDA), intellectual property (IP), semiconductor manufacturing, and emerging technologies.

The platform features:

  • Blogs: Written by industry experts and contributors, covering technical topics, trends, and innovations.
  • Forums: Allowing members to discuss and exchange ideas on semiconductor-related topics.
  • Technical Resources: Including white papers, webinars, and other educational materials.
  • Industry News: Regular updates on developments in the semiconductor sector.

Founded by Daniel Nenni, SemiWiki aims to foster collaboration and knowledge sharing within the semiconductor ecosystem, serving as a valuable resource for engineers, researchers, and business professionals.

The media landscape has recently changed with semiconductors being front page news and a geopolitical football. Unfortunately, sensationalism and disinformation plague us more now than ever before. The good news is that anybody can publish semiconductor content, the bad news is that anybody can publish semiconductor content.

We first started with blogs, a forum, wikis, and a community calendar then added webinars, a job board, press release support, and podcasts. SemiWiki is now cloud based using industry standard software and we have new video series starting in Q1 so stay tuned. There seems to be another media shake-up coming but SemiWiki will continue to be an industry leader, absolutely.

Also Read:

The Intel Common Platform Foundry Alliance

What would you do if you were the CEO of Intel?

What is Wrong with Intel?


CEO Interview: Subi Krishnamurthy of PIMIC
by Daniel Nenni on 12-31-2024 at 10:00 am

Subi Krishnamurthy is the Founder and CEO of PIMIC, an AI semiconductor company pioneering processing-in-memory (PiM) technology for ultra-low-power AI solutions. With over 30 years of experience in silicon design and product development, Subi has led the mass production of 12+ silicon projects and holds 30+ patents. He began his leadership journey at Force10 Networks, advancing networking silicon as a lead designer and architect, and later served as Executive Director and CTO of Dell Networking, driving technology strategy, product architecture and technology partnerships.

Subi founded Viveka Systems to innovate in networking software and silicon solutions and later consulted for various companies on Smart NICs, AI pipelines, gaming silicon, and AI inference engines. Subi holds an M.S. in Computer Science from Southern Illinois University, Carbondale, and a Bachelor of Engineering in Computer Science from the National Institute of Technology, Tiruchirappalli.

Tell us about your company?

PIMIC is a groundbreaking AI semiconductor startup delivering highly efficient edge AI solutions with unparalleled performance and energy savings. PIMIC’s proprietary Jetstreme™ Processing-in-Memory (PIM) acceleration architecture brings remarkable gains in AI computing efficiency by addressing the key requirements in edge environments, including low power, compact design, and superior AI model parameter update performance. PIMIC is set to launch two ultra-efficient AI model silicon chips for edge applications at CES 2025, delivering 10x to 20x power savings. We are also advancing our efforts on a breakthrough AI inference silicon platform designed for large-scale models, with a focus on achieving unprecedented efficiency.

What problems are you solving?

By delivering the most efficient and scalable AI inference platform for tiny to large AI models, PIMIC’s solutions meet or exceed the rapidly increasing demand for the performance and efficiency required to run the AI agentic workflows and large multimodal modeling. Our solutions also address the need to run AI inferencing tasks seamlessly and effectively on local (at the edge), battery-powered devices.

What application areas are your strongest?

Initially, PIMIC’s focus is on tiny AI model inference applications such as keyword spotting and single-microphone noise cancellation (running at 20uA and 150uA respectively) for wearables and other battery-operated devices. These solutions deliver 10x to 20x power savings while reducing system costs through a highly integrated design.

What keeps your customers up at night?

Our customers are finding that the rapid increase in AI model size, complex agentic workflows, and multimodal models requires much more inference compute power, outpacing the architectural capabilities of current edge AI silicon. The demand for inference compute performance is set to far exceed what existing hardware can deliver, creating a significant disparity. This challenge necessitates a new generation of silicon with breakthrough improvements in efficiency and performance.

What does the competitive landscape look like and how do you differentiate?

Most AI inference silicon architectures currently on the market were designed over the past six years. These older designs are struggling to meet the performance and efficiency demands of rapidly evolving AI modeling.

PIMIC’s solutions are built on a brand-new architecture that incorporates a number of AI innovations to significantly improve efficiency and scalability, including our proprietary Jetstreme™ Processing-in-Memory (PIM) technology. Our focus is on delivering an efficient, scalable silicon platform capable of handling everything from tiny to large AI models with billions of parameters, offering significant PPA (performance, power, area) advantages that we believe can keep up with performance demands, and enabling the latest AI models to run seamlessly and effectively on any local edge device. PIMIC’s first two AI inference silicon chips based on this architecture have already demonstrated 10x to 20x improvements in PPA compared to competitors. We are confident that PIMIC holds a distinct edge in addressing the future needs of AI inference.

What new features/technology are you working on?

We are leveraging our Jetstreme Processing-in-Memory (PIM) architecture, together with a number of other critical silicon innovations, to dramatically improve compute efficiency and scalability. We are working on enabling the next generation of AI modeling.

How do customers normally engage with your company?

We have a flexible approach. We provide unpackaged chips, packaged SoCs, or ASIC solutions with specific functional requirements.

What challenges are you solving for edge devices in particular?

Edge devices—devices that act as endpoints between the data center and the real world—encompass a wide range of products, all with challenging performance requirements. Edge devices generally fall into two main categories: tiny edge devices and high-performance edge devices. PIMIC’s solutions address the challenges of both categories of device.

Tiny Edge Devices:

These devices, often located near sensors, must operate with extremely low power and cost constraints to achieve widespread adoption. The primary challenges for this category include energy efficiency, cost optimization, and low latency for real-time response.

High-Performance Edge Devices:

Devices such as smartphones, smart TVs, and AI-powered PCs must run large AI models in real time, ensuring seamless user interactions by balancing computational demands, latency, privacy, and energy efficiency. The key challenges include overcoming hardware limitations in power, memory bandwidth, and computational throughput to enable advanced AI tasks locally, all while scaling to meet the performance demands by the latest AI models mentioned earlier.

About PIMIC

Founded in 2022 and based in Cupertino, California, PIMIC is an AI semiconductor company specializing in ultra-efficient silicon solutions for edge AI applications. The company’s chip products deliver industry-leading performance and power efficiency, enabling advanced AI capabilities in compact, low-power devices. With a focus on empowering devices at the edge, PIMIC aims to redefine how AI is integrated into everyday technology.

For more information, visit www.pimic.ai.

Also Read:

CEO Interview: Dr Josep Montanyà of Nanusens

CEO Interview: Marc Engel of Agileo Automation

CEO Interview with Dr. Dennis Michaelis of GEMESYS


CEO Interview: Dr Josep Montanyà of Nanusens
by Daniel Nenni on 12-31-2024 at 6:00 am

Dr. Josep Montanyà, Chief Executive Officer (UK/Spain), is a co-founder leading the company, with 18+ years of experience in MEMS, patents, and the semiconductor industry. He founded Baolab Microsystems prior to Nanusens.

Tell us a little bit about your company?

We have a patented technology that allows us to build chips with nano-mechanisms inside (called NEMS) using the de facto standard manufacturing for solid state chips (called CMOS). This allows us to have better performance, smaller size and lower cost.

There are many applications for this technology, including sensors, RF tunable devices and even AI processors with higher speeds and less consumption. Our initial focus is on a particular product called RF DTC (Digitally Tunable Capacitor). We have a very clear route to market, with signed LOI, and we will place it into Tier 1 phones (and more) that will hit the market in 4 years.

Beyond this initial product there are more RF devices we can build, and a large variety of sensors. Until now, inside each wearable or portable system you have a digital processor and one or more external sensor chips. We have the capability to change this by embedding the sensors into the digital processors. Today it is possible to do this using complex multi-die packages. We don’t need to do that. Instead, at Nanusens, we can build all these sensors monolithically on the same CMOS die where the digital processor is built. And this is done without impacting yield and using minimal additional area. This dramatically reduces the size of the overall system, and it also reduces power consumption to levels that today are unseen.

The company is split between its HQ in Paignton (UK) and a subsidiary in Cerdanyola (Spain).

What was the most exciting high point of 2024 for your company? 

This year we got our RF DTC prototype measured by a large corporation. This was a very important milestone, because not only does it show the interest of the industry in our product, it also validated all our measurements.

These measurements proved the incredible performance that our RF DTC can achieve, and they have helped us to better understand our route to market. Being able to increase the antenna efficiency of cell phones by 30% means that we increase talk time by 30%; we also increase the range from the base antenna by 14% (in free space, range grows as the square root of radiated power, and √1.3 ≈ 1.14), meaning that many areas of poor reception disappear. And for the smartphone OEM, the size of the PCB can be reduced, given that with our solution there is no need to switch standalone external capacitors. Reducing PCB size inside the phone is a key driver for smartphone OEMs, as this means having more space for battery.

With our chip, we aim to monopolize the +$800m market of smartphone aperture antenna. This is because we have unmatched performance, small size and low cost. And all this comes from the fact that we use our patented technology to build NEMS devices in CMOS.

What was the biggest challenge your company faced in 2024? 

The main difficulty for a pre-revenue start-up like Nanusens, developing semiconductor and even more so MEMS technology, is fundraising. Our goal for 2024 on this front was to raise an £8m Series A to produce prototypes of inertial sensor and RF DTC devices, so that next year we would be in the market with our first products, for which we have customers waiting. This has been moved to 2025, as we will have achieved more significant milestones by then that will facilitate the process.

How is your company’s work addressing this biggest challenge?

We have decided to focus on the RF products, leaving sensors and other devices in our future roadmap. This has allowed us to reduce costs and be more efficient.

What do you think the biggest growth area for 2025 will be, and why?

I think AI processors will keep being the dominant area in semiconductors. The incredible success of NVIDIA, plus all the big techs jumping in, forecasts a very interesting year. At the same time, however, the market is starting to adjust itself, and I believe we will start seeing more start-up failures in this field as well. You need something really different to succeed in such a competitive field, dominated by giant players.

How is your company’s work addressing this growth? 

We put limited effort into studying the possibility of building better AI processors using our NEMS-in-CMOS technology. We discovered that it is possible for us to build vacuum transistors in CMOS. This has the potential to enable AI processors that are 10x faster while consuming half the power.

Vacuum transistors enjoy the terahertz-range bandwidths of vacuum tubes, but without their problems of large size, mechanical fragility, low reliability, and large power consumption. In fact, given the very small, nano-sized gaps of vacuum transistors, there is not even a need to heat the metals to high temperatures. Instead, a low voltage generates such a strong electric field across this small gap that electrons fly between the cathode and the anode by field emission.

There are research papers on vacuum transistors, which have been built using custom NEMS processes. At Nanusens, we have the capability to build them using standard CMOS processing. This has the potential to build AI processors far beyond the state of the art, and with a process ready to produce them in high volumes. This is a project for after the Series A round is completed.

How do customers engage with your company?

Although technically we can sell IP and have already done so, our business model is to sell product (ICs) directly to our customers or through distributors.

Additional questions or final comments? 

It is always difficult to predict the future, but 2025 will be a very interesting year. I will be especially interested to see what happens in this race to dominate the AI digital processor market. But whoever wins next year, we have a technology that will surpass them in the years after!

Also Read:

CEO Interview: Marc Engel of Agileo Automation

CEO Interview with Dr. Dennis Michaelis of GEMESYS

CEO Interview: Slava Libman of FTD Solutions


Accelerating Simulation. Innovation in Verification
by Bernard Murphy on 12-30-2024 at 6:00 am

Following a similar topic we covered early last year, here we look at updated research on accelerating RTL simulation through domain-specific hardware. Paul Cunningham (GM, Verification at Cadence), Raúl Camposano (Silicon Catalyst, entrepreneur, former Synopsys CTO and lecturer at Stanford, EE292A) and I continue our series on research ideas. As always, feedback welcome.

The Innovation

This month’s pick is Accelerating RTL Simulation with Hardware-Software Co-Design. This was published in the 2023 IEEE/ACM International Symposium on Microarchitecture and has 2 citations. The authors are from MIT CSAIL (CS/AI Lab).

This work is from the same group as the earlier paper. Their new approach, ASH, adds dataflow acceleration, not available in the earlier work, which together with speculation provides the large net performance gain in this research.

Paul’s view

An important blog to end our year. This paper is a heavy read, but it’s on a billion-dollar topic for verification EDA: how to get a good speed-up from parallelizing logic simulation. The paper is out of MIT, from the same team that published the Chronos paper we blogged on back in March 2023 (see here). This team is researching hardware accelerators that operate by scheduling timestamped tasks across an array of processing elements (PEs). The event queue semantics of RTL logic simulation map well to this architecture. Their accelerators also include the ability to do speculative execution of tasks to further enhance parallelism.

As we blogged in 2023, while Chronos showed some impressive speed-ups, the only result shared was for the gate-level simulation of a single 32-bit adder. Fast forward to today’s blog and we have some serious results on 4 credible RTL testcases, including an open-source GPU and an open-source RISC-V core. Chronos doesn’t cut it on these more credible testcases – it actually appears to slow down the simulations. However, this month’s paper describes some major improvements on Chronos that look very exciting on these more credible benchmarks – in the range of a 50x speed-up over a single-core simulation. The new architecture is called SASH, a Speculative Accelerator for Simulated Hardware.

In Chronos, each task can input and output only one wire/reg value change. This limits it to a low level of abstraction (i.e. gate-level), and also conceptually means that any reconvergence in logic is “unfolded” into cones, causing significant unnecessary replication of tasks. In SASH each task can input and output multiple reg/wire changes, so tasks can be more like RTL always blocks. Input/output events are passed as “arguments” through an on-chip network and queued at PEs until all arguments for a task are ready. Speculative task execution is also elegantly implemented with some efficient HW. The authors modify Verilator (an open-source RTL simulator) to compile to SASH. Overall, very impressive work.
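
To make the execution model concrete, here is a much-simplified toy scheduler in Python. This is my reading of the idea, not the SASH microarchitecture: tasks are timestamped, a task fires only once all of its input arguments have arrived, and an event that doesn’t change a wire’s value is dropped (the selective-execution trick). Speculation is omitted.

```python
import heapq

class Task:
    """One 'always block'-like task; fires when all input arguments are ready."""
    def __init__(self, name, inputs, fn):
        self.name, self.inputs, self.fn = name, inputs, fn
        self.pending = {}   # input arguments queued here until all have arrived

def run(tasks, initial_events):
    fanout = {}                                  # wire -> tasks consuming it
    for t in tasks:
        for w in t.inputs:
            fanout.setdefault(w, []).append(t)
    values, eventq = {}, list(initial_events)    # events are (time, wire, value)
    heapq.heapify(eventq)
    while eventq:
        time, wire, value = heapq.heappop(eventq)
        if values.get(wire) == value:
            continue                             # value unchanged: skip all work
        values[wire] = value
        for t in fanout.get(wire, []):
            t.pending[wire] = value              # queue the argument at the "PE"
            if len(t.pending) == len(t.inputs):  # all arguments ready: execute
                for out_wire, out_val in t.fn(t.pending).items():
                    heapq.heappush(eventq, (time + 1, out_wire, out_val))
                t.pending = {}
    return values

# Usage: a 1-bit AND gate as a single two-input task.
and_task = Task("and", ["a", "b"], lambda args: {"y": args["a"] & args["b"]})
print(run([and_task], [(0, "a", 1), (0, "b", 1)]))  # {'a': 1, 'b': 1, 'y': 1}
```

The real accelerator additionally executes tasks speculatively out of timestamp order, with hardware to detect conflicts and roll back, which is how it extracts further parallelism.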

One important thing to note: the authors do not actually implement SASH in an ASIC or on an FPGA. A virtual model of SASH was built using Intel’s Pin utility (a low-level x86 virtual machine utility with just-in-time code instrumentation capabilities). I look forward to seeing a future paper that puts it in silicon!

Raúl’s view

In March of 2023 we reviewed Chronos (published in March 2020), based on the Spatially Located Ordered Tasks (SLOT) execution model. This model is particularly efficient for hardware accelerators that leverage parallelism and speculation, as well as for applications that dynamically generate tasks at runtime. Chronos was implemented on FPGAs and, on a single processing element (PE), outperformed a comparable CPU baseline by 2.45x. It demonstrated the potential for greater scalability, achieving a 15.3x speedup on 32 PEs.

Fast forward roughly three and a half years, and the same research group published the paper we review here, on ASH (Accelerator of Simulated Hardware), a co-designed architecture and compiler specifically for RTL simulation. ASH was benchmarked on 256 cores, achieving a 32.4x acceleration over an AMD Zen2 based system, and a 21.3x speedup compared to a simulated, special-purpose multicore system.

The paper is not easy to read. The initial discussion on why RTL simulation is difficult and needs fine-grained parallelism to handle both dataflow parallelism and selective execution / low activity factors is still easy to follow. The ASH architecture comes in two flavors: DASH (Dataflow ASH) provides novel hardware mechanisms for dataflow execution of small tasks, and SASH (Selective event-driven ASH) extends DASH with selective execution, running only tasks whose inputs change during a given cycle. The latter is obviously the more effective one.

The compiler implementation for these architectures adds 12K lines of code to Verilator, while maintaining Verilator’s fast compilation times (Verilator is a full-featured open-source simulator for Verilog/SystemVerilog). The HW implementation is evaluated “using a simulator based on Swarm’s simulator [2, 27, 76], which is execution-driven using Pin [36, 43]”. The area of a HW implementation of SASH in a 7nm process is estimated to be a modest 115 mm². These descriptions, however, are not self-contained and require additional reading for a full understanding. The paper includes a detailed architectural analysis, covering aspects such as prefetching instructions, prioritized dataflow, queue utilization, etc. It also compares ASH to related work, including of course Chronos and other dataflow / speculative execution architectures, as well as HW emulators and GPU acceleration.

The paper addresses specifically accelerating RTL simulation. It tackles the challenges of RTL simulation through a combination of hardware and software, using dataflow techniques and selective execution. Given the sizable market for emulators in the EDA industry, there is potential for these ideas to be commercially adopted, which could significantly accelerate RTL simulation.