SemiWiki – Page 266 – The Open Forum for Semiconductor Professionals

Podcast EP85: How Expedera is Revolutionizing AI Deployment

Podcast EP85: How Expedera is Revolutionizing AI Deployment
by Daniel Nenni on 06-10-2022 at 10:00 am

Dan is joined by Sharad Chole, chief scientist & co-founder at Expedera. Sharad is an expert in AI frameworks, power-aware neural network optimizations, and programmable dataflow architectures. Previously, he was an architect at Cisco, Memoir Systems, and Microsoft.

Dan and Shared explore Expedera’s unique AI accelerator architecture. Sharad provides a broad overview of the various challenges of AI deployment and how Expedera is changing the landscape.

The views, thoughts, and opinions expressed in these podcasts belong solely to the speaker, and not to the speaker’s employer, organization, committee or any other group or individual.

June 10, 2022January 3, 2023

WEBINAR: 5G is moving to a new and Open Platform O-RAN or Open Radio Access Network

WEBINAR: 5G is moving to a new and Open Platform O-RAN or Open Radio Access Network
by Daniel Nenni on 06-10-2022 at 6:00 am
Categories: eFPGA, Events, Flex Logix

The demands of 5G requires new designs to not only save power but also increase performance and by moving to advance power-saving nodes and by using eFPGAs will help to achieve these goals. This paper will introduce 5G and O-RAN, the complexity of these systems, and how flexibility could be beneficial. Then we will dive into how eFPGA can save power, cost and increase flexibility. By providing some examples of how eFPGA can be used for reconfigurability, it can also deliver to customers a flexible platform for carrier personalization with less power.

Watch the replay here

5G is known as a faster mobile phone experience but it is so much more. The changes include a 90% reduction in network energy, 1-millisecond latency, 10-year battery life for IoT devices, 100x more connected devices, 1000x more bandwidth and many others. These changes not only impact mobile devices but there are many other devices envisioned to connect to a 5G network across a large span of frequencies. These 5G New Radios (NR) will operate from below 1G to 100GHz supplying data to many different services.

The understood use case is Enhanced Mobile Broadband (eMBB) which includes enhance data rates, reduced latency, higher user density, more capacity and coverage of mobile devices. A better mobile phone experience (Fig. 1)

Fig 1- 5G use cases based on channel frequency used

Other applications will leverage the lower frequency channels and are referred to as Ultra-Reliable Low-Latency Communications (URLLC.) These devices require ultra-reliability, very low latency and high availability for vehicular communication, industrial control, factory automation, remote surgery, smart grids and public safety.

On the other end of the frequency spectrum, we have Massive Machine-Type Communications (mMTC.) The devices taking advantage of this very high frequency will be communication of low-cost, massive in number, battery-powered devices such as smart metering, logistics, field and body sensors. These devices will be on for a very short time, burst data and then shut down using very little power.

All these new devices and applications will need many 5G New Radios to serve them and a lot of equipment needs to be installed and tested. One proposal, to help speed this is to make the interfaces between the New Radio and the Distributed Unit (DU) open which is called Open Radio Access Network or O-RAN for short (fig 2.) where the DU is virtualized in the cloud on standard servers that can be bought off the shelf.

This allows the possibility of having more than one provider for the RAN and mixing with different backends. There will also be many different networks with different Radio Units for Macro sites, Micro sites and Pico sites. The combinations could be endless.

This transition is paved with many good intentions and uncertainty. Although based on “enhanced CPRI” or “eCPRI” there are unknown sideband signals or custom commands. Learn more about how eFPGA can help this transition and other 5G applications for eFPGA to save cost, power, and reduce latency by joining this webinar.

Watch the replay here

Also Read:

Why Software Rules AI Success at the Edge

High Efficiency Edge Vision Processing Based on Dynamically Reconfigurable TPU Technology

A Flexible and Efficient Edge-AI Solution Using InferX X1 and InferX SDK

June 9, 2022September 11, 2022

Standardization of Chiplet Models for Heterogeneous Integration

Standardization of Chiplet Models for Heterogeneous Integration
by Tom Dillinger on 06-09-2022 at 10:00 am
Categories: Chiplet, EDA, Siemens EDA

The emergence of 2.5D packaging technology for heterogeneous die integration offers significant benefits to system architects. Functional units may be implemented using discrete die – aka “chiplets” – which may be fabricated in different process nodes. The power, performance, and cost for each unit may be optimized separately.

In particular, the potential to use chiplets fabricated in an older process node may save considerable cost over an equivalent system fabricated in a large SoC die using an advanced node, if only a subset of the overall functionality requires leading-edge performance, power dissipation, and/or circuit density. The fabrication yield of the monolithic SoC will be adversely impacted by the larger die area due to full integration of the chiplet functionality.

Additionally, cost savings will accrue if chiplets are re-used in multiple products, amortizing the development expense across a larger shipped volume. And, product time-to-market may be accelerated if existing specialty chiplets are used, rather than incur the time (and NRE) to design, fabricate, and qualify a new circuit block in a test shuttle for an advanced process node.

The disadvantages to a 2.5D heterogeneous package design relative to a monolithic implementation are:

- larger overall package area
- higher power dissipation from chiplet-to-chiplet data interface switching, due to the larger inter-die signal loading
- design and NRE cost to develop the interposer: the internal die-to-die signals, the through-interposer vias from the die to package pins, and the power delivery to the die
- (potential) difficulty in partitioning the system design to manage the number of interconnects to be routed on the interposer, with a coarser routing pitch

Parenthetically, a number of interposer technologies for 2.5D package design are available, with different relative tradeoffs in the list above: (full area) silicon interposer; (full area) organic-based interposer; and, (reduced area) embedded bridges spanning die-to-die edges.

Also, note that chiplets are not intended to be packaged individually. The I/O circuitry incorporated on the chiplet for intra-package connections is intended for the low loading of very short reach signals. And, the chiplet I/Os for internal signals may have unique specifications for exposure to ESD and overshoot/undershoot events.

Chiplets incorporated into a 2.5D package design share many characteristics with “hard IP” offerings for direct SoC integration. Perhaps the best example of the similarities is the availability of hard IP for industry-standard interfaces, both parallel and high-speed SerDes lanes. The opportunities for chiplets to provide these package-level interfaces are great, as opposed to hard IP integration in a large SoC.

The SoC IP market relies on the delivery of models to compile into EDA flows. Industry (and de facto) standards have emerged to enable hard IP integration from external vendors. The nascent chiplet market leaders have recognized the importance of having a clear definition of the design enablement model set required for a 2.5D package integration.

The Chiplet Design Exchange (CDX) is a group representing chiplet providers, EDA vendors, and end customers developing system-in-package (SiP) designs. They are working to establish standards and guidelines for the release of chiplet models. A recent whitepaper titles “Proposed standardization of chiplet models for heterogeneous integration”, authored by Anthony Mastroianni and Joseph Reynick from Siemens EDA with other CDX members, provides a blueprint for chiplet model development.

(The CDX working group is part of the larger “Open Domain-Specific Architecture” (ODSA) initiative. Other working groups in the ODSA are focused on standards for the physical design and electrical protocols for die-to-die interfaces on an SiP – e.g., the large number of parallel interface signals between a high-bandwidth memory (HBM) stack chiplet and the rest of the SiP die.)

CDX Model Standards

The figures below capture the chiplet data model format for each design methodology area. In many cases, there will be similarities to the models developed for hard IP, as alluded to above. Note that there are additional categories unique to chiplet IP. Specifically, mechanical models for chiplets (with materials properties) are needed for assembly evaluation and structural reliability analysis.

- Behavioral models

The end-customer will need to collaborate with the chiplet provider to decide whether a full behavioral model or an abstracted bus functional model (BFM) will be part of system simulation. The chiplet provider may include a testbench to assist with verification. If the chiplet has mixed-signal circuits, an AMS model may be provided.

- Power models for dissipation analysis and functional verification

Similarly to hard IP, functional power states and power domain information about the chiplet would be provided with a separate UPF file. The SiP physical power distribution network would be verified against the chiplet UPF description.

- Signal integrity and power integrity analysis

IBIS models for chiplet I/Os would be used for signal integrity analysis. IBIS-AMI models and/or S-parameter channel models would be provided for chiplets incorporating off-package SerDes lanes.

- Physical, mechanical, and electrical properties

Of particular note is the CDX recommendation to adopt the JEDEC JEP30-P101 model format (link). This schema is a “container” for property and value information for the chiplet and its pins. Electrical properties would include chiplet operating range limits and individual pin characteristics (e.g., receiver voltage levels, driver I-V values, loading, ESD rating). Mechanical properties would be needed for both assembly (e.g., chiplet x, y, and z data, pin info, microbump composition/profile) and reliability analysis (e.g., materials data, such as coefficient of thermal expansion, fracture strength).

- Thermal

Package-level thermal analysis is critical in 2.5D SiP implementations. The SiP end customer and chiplet provider will need to review the model granularity needed for thermal analysis – i.e., a uniform dissipation map across the chiplet or a more detailed block/grid-level thermal model.

- Test

As is evident from the list of model deliverables in the figure above, SiP test with chiplets requires some challenging methodology decisions.

Chiplets would be delivered to package assembly as “known good die” (KGD). Typically, it would suffice post-assembly to test the I/O connectivity on the interposer between die, using a (reduced) boundary scan-based pattern set. As many of the die-to-die connections will be internal to the package, the SiP test architecture needs to provide an external access method to the individual chiplet boundary scan I/Os.

However, if there is a risk that a chiplet itself may be damaged during the assembly process, a more extensive test of the internal functionality of each chiplet would be required, necessitating delivery of more extensive chiplet test models and/or pattern data (adding considerably to the post-assembly tester time). This could become quite an involved procedure, if the chiplet contains unique analog circuitry that needs to be re-tested at the package level.

Test models for chiplets become even more intricate if there is the SiP developer needs to pursue post-assembly failure analysis, on defects of a class beyond interposer interconnect validation.

The whitepaper goes into further detail about the chiplet model requirements if an interface design includes redundant lanes to replace/repair interposer interconnect defects found during post-assembly test.

- Documentation

And, last but certainly not least, the whitepaper stresses the importance for the chiplet provider to release extensive “datasheet” information, ranging from recommendations for design and analysis methodology flows to detailed functional and physical information. Again, the JEDEC JEP30 Part Model file format is recommended.

And, to be sure, any chiplet firmware code to be integrated by the end-customer needs to be thoroughly documented.

Futures

The whitepaper briefly discusses some of the future areas of modeling focus to be pursued by the CDX working group:

- a definition for hardware and software security features, providing cryptographic-based validation of chiplet hardware to the system and chiplet-level verification of firmware releases
- chiplet SerDes receiver eye diagram opening definition
- chiplet modeling standards for vertically-stacked die in 3D package technologies

If you are involved in a 2.5D SiP project incorporating chiplets, this whitepaper from the CDX working group is a must read (link).

Also Read:

Using EM/IR Analysis for Efinix FPGAs

Methods for Current Density and Point-to-point Resistance Calculations

3D IC Update from User2User

June 9, 2022December 27, 2023

LIDAR-based SLAM, What’s New in Autonomous Navigation

LIDAR-based SLAM, What’s New in Autonomous Navigation
by Bernard Murphy on 06-09-2022 at 6:00 am
Categories: Automotive, Ceva, IP

SLAM – simultaneous localization and mapping – is already a well-established technology in robotics. This generally starts with visual SLAM, using object recognition to detect landmarks and obstacles. VSLAM alone uses a 2D view of a 3D environment, challenging accuracy; improvements depend on complementary sensing inputs such as inertial measurement. VISLAM, as this approach is known, works well in good lighting conditions and does not necessarily depend on fast frame rates for visual recognition. Now automotive applications are adopting SLAM but cannot guarantee good seeing and demand fast response times. LIDAR-based SLAM, aka LOAM – LIDAR Odometry and Mapping – is a key driver in this area.

SLAM in automotive

Before we think about broader autonomy, consider self-parking. Parallel parking is one obvious example, already available in some models. More elaborate is the ability for a car to valet park itself in a parking lot (and return to you when needed). Parking assist functions may not require SLAM, but true autonomous parking absolutely requires that capability and is generating a lot of research and industry attention.

2D-imaging alone is not sufficient to support this level of autonomy, where awareness of distances to obstacles around the car is critical. Inertial measurement and other types of sensing can plug this hole, but there is a more basic problem in these self-parking applications. Poor or confusing lighting conditions in parking structures or streets at nighttime can make visual SLAM a low-quality option. Without that, the whole localization and mapping objective is compromised.

LIDAR is the obvious solution at first glance. Works well in poor lighting, at night, in fog, etc. But there is another problem. The nature of LIDAR requires a somewhat different approach to SLAM.

The SLAM challenge

SLAM implementations, for example OrbSLAM, perform three major functions. Tracking does (visual) frame-to-frame registration and localizes a new frame on the current map. Mapping adds points to the map and optimizes locally by creating and solving a complex set of linear equations. These estimates are subject to drift due to accumulating errors. The third function, loop closure, corrects for that drift by adjusting the map when points already visited are visited again. SLAM accomplishes this by solving a large set of linear equations.

Some of these functions can run very effectively on a host CPU. Others, such as the linear algebra, run best on a heavily parallel system, for which the obvious platform will be DSP-based. CEVA already offers a platform to support VSLAM development through their SensPro solution. Providing effective real-time SLAM support in daytime lighting, at up to 30 frames per second.

LIDAR SLAM as an alternative to VSLAM for poor light conditions presents a different problem. LIDAR works by mechanically or electronically spinning a laser beam. From this it builds up a point cloud of reflections from surrounding objects, together with ranging information for those points. This point cloud starts out distorted due to the motion of the LIDAR platform. One piece of research suggests a solution to mitigate this distortion through two algorithms running at different frequencies: one to estimate the velocity of the LIDAR and one to perform mapping. Through this analysis alone, without inertial corrections or loop closure, they assert they can get to accuracy comparable to conventional batch SLAM calculations. That paper does suggest that adding IM and loop closure will be obvious next steps 

Looking forward

Autonomous navigation still has much to offer, even before we get to fully driverless cars. Any such solution operating without detailed maps – for parking applications for example – must depend on SLAM. VISLAM for daytime outdoors navigation and LOAM for bad seeing and indoor navigation in constrained spaces. As an absolutely hopeless parallel parker, I can’t wait!

Podcast EP84: MegaChips and Their Launch in the US with Doug Fairbairn

Podcast EP84: MegaChips and Their Launch in the US with Doug Fairbairn
by Daniel Nenni on 06-08-2022 at 10:00 am

Dan is joined by semiconductor and EDA industry veteran Douglas Fairbairn. Doug provides details about MegaChips, where he currently heads business development. MegaChips is a large, successful 30-year old semiconductor company based in Japan.

Doug is helping MegaChips launch in the US with a focus on ASIC design and delivery. While the initial focus is on AI at the edge. MegaChips has substantial design, IP and manufacturing depth in many areas, making them an excellent partner for many custom chip projects.

The views, thoughts, and opinions expressed in these podcasts belong solely to the speaker, and not to the speaker’s employer, organization, committee or any other group or individual.

June 8, 2022December 16, 2022

Closing the Communication Chasms in the SoC Design and Manufacturing Supply Chain

Closing the Communication Chasms in the SoC Design and Manufacturing Supply Chain
by Kalar Rajendiran on 06-08-2022 at 6:00 am
Categories: Aion Silicon, Semiconductor Services

In sports, we’re all familiar with how even a team with the best individual players for every role needs to be coordinated as a team to win a championship. In healthcare, a patient is better served with a well-trained primary physician to coordinate with the various medical specialists. The field of semiconductors involves a series of complex functional steps from architecting a chip to designing it all the way through to manufacturing, quality control and logistics. Driven by complexity increases, the semiconductor supply chain has gotten disaggregated over the last few decades.

The above transformation has led to specialization by individual players in the ecosystem and enabled rapid advances within each respective functional areas. Whether it is chip design, package design, package and test or even logistics for that matter, there are subcontractors that are involved. While the functional specializations have opened up tremendous opportunities for System-on-Chips (SoCs) to push the performance, power and area (PPA) benefits, they have also introduced some vulnerabilities. The modern SoC design and manufacturing supply chain can lead to communication chasms between different players and missteps in various phases of the process. In a recent press announcement, Sondrel describes how these communication chasms can ripple through the entire supply chain, creating delays and huge cost overruns.

With chip development costs running in the millions of dollars and time to market schedules ever so tight, even a single misstep can be disastrous for a chip company. Extending the sports and healthcare analogy, chip development needs an entity to oversee all phases of the entire process. This entity should possess not only deep consulting capabilities but also have in-house expertise to deliver complete turnkey services to transform designs into tested, volume-packaged semiconductor chips. That’s why Sondrel offers a complete turnkey service from concept to shipping silicon so there is no possibility of any communication chasms. Sondrel takes total responsibility for the smooth running of every stage and every subcontractor in the supply chain. In addition, Sondrel offers many market-specific, reusable and customizable, reference platforms, which it calls Architecting the Future. These enable it to rapidly develop differentiated chip products for its customers as designing does not start from scratch every time. It also offers chip development consulting services including chip architectural study reports.

Sondrel has been serving fabless and systems companies in this capacity for a long time. It serves the Automotive, AI at the Edge, 8K Video, Smart Homes/Smart Cities, Consumer Devices, and Wearables markets and more. Sondrel’s designs have been incorporated into mobile phones, game consoles, security systems, AR/VR systems, network switches and routers, cameras, computer systems and many more. With a long successful track record of customer products in various end markets, Sondrel has mastered a holistic approach to developing chips for its customers. It deploys its deep understanding of all aspects of chip development to produce designs optimized for PPA and time to market requirements.

Sondrel’s Offerings

Full Turnkey Service

Sondrel’s full turnkey service manages every stage of the process from chip concept to final silicon. Its Operations Team manages all downstream stages after the design is done. This includes liaising with the fabs through to selecting the most appropriate packaging OSAT, Test Development and logistics partner. Sondrel’s software team develops the required software to get to Board Support Package level. The software team is also experienced in producing software for drivers and validation tests for the whole SoC. For more details about Sondrel’s Full Turnkey Service, please see here.

Architecting the Future® Reusable IP Platforms

Sondrel has developed a family of reference designs for major applications to help reduce risk and time to market for its customers. Customers can easily add their own IP as well as third party IP to rapidly create a differentiated solution. Then, Sondrel’s manufacturing service team provides a total unit cost estimate based on foundry, test, qualification and packaging choices. This is a very important part of viability analysis. For more details about the various Reusable IP Platforms offered, please visit here.

Architecture Study Service

The world of electronics is full of creative ideas and the biggest challenge for a customer is to understand which ones can be turned into realities. Of course, one would want to find out the commercial viability before starting the whole process of building a chip. Sondrel’s architects explore different options using abstracted what-if modelling to test alternatives and arrive at a candidate architecture. The modelling and analysis of key system behaviors validate that the architecture will behave as intended. With its experience designing hundreds of chips, Sondrel provides a detailed architectural study report that includes a high accuracy cost analysis for building a chip. For more details about its Architecture Study Service, visit here.

Also Read:

SoC Application Usecase Capture For System Architecture Exploration

Sondrel explains the 10 steps to model and design a complex SoC

Build a Sophisticated Edge Processing ASIC FAST and EASY with Sondrel

June 7, 2022September 11, 2022

RISC-V embedded software gets teams coding faster

RISC-V embedded software gets teams coding faster
by Don Dingee on 06-07-2022 at 10:00 am
Categories: EDA, RISC-V, Siemens EDA

RISC-V processor IP is abundant. Open-source code for RISC-V is also widely available, but typically project-based code solves one specific problem. Using only pieces of code, it’s often up to a development team integrate a complete application-ready stack for creating an embedded device. A commercial embedded software development stack puts proven tools together and gets teams to application coding faster. For RISC-V embedded software development, a new port of Siemens’ Nucleus ReadyStart provides that complete solution.

Five areas RISC-V embedded software stacks should cover

Embedded software development is a bit different than enterprise software or EDA software development. There’s a host-target model, where the target machine is very different from the host where coding is done. Target devices are often resource-constrained in size, memory and storage, connectivity, or power consumption. Performance is usually real-time aware in some way, with deadlines that must be met. Sometimes, visibility to what’s going on inside a device is more challenging because a user interface isn’t part of its deployed configuration.

Those differences call for an RISC-V embedded software development stack optimized for the job. State-of-the-art tools like Nucleus ReadyStart set the bar for such an environment in at least these five areas:

Toolchain and debugging tools – A good stack would tie tools into an integrated design environment (IDE), with easy editing and project management features. A performance-optimized C/C++ compiler is a must. JTAG or BDM debugging provide local or remote (with a debug agent on the target) capability, with awareness of threads.
System-level trace and analysis tools – Being able to see performance bottlenecks in processes and threading is one capability. Another is the ability to see how power consumption relates to code activity, helping developers adjust efficiency.
RTOS kernel – Nucleus RTOS is a proven hard real-time operating system (RTOS) kernel for embedded devices. The RISC-V port brings the same small footprint and security features developers have come to expect. It is multicore enabled, has a power management API, and offers a path to safety certification for mission critical devices.
Connectivity protocols – For deeply embedded devices, support for USB and wireless protocols like Bluetooth and Zigbee enables many applications. Support for IPv4 and IPv6 helps add heavier protocols like Ethernet and Wi-Fi.
User interface framework – Not every embedded device has a user interface, but when one is called for a robust framework like Qt significantly reduces development and debug time. One innovation is footprint management capability, helping to reduce target code size by configuring which library modules are included for run-time.

For embedded devices at the edge, integration with the cloud is becoming more common. An optional add-on to Nucleus ReadyStart for RISC-V is the Nucleus IoT Framework. It adds support for connecting devices to Amazon Web Services (AWS), Microsoft Azure, and MindSphere from Siemens.

The entire idea is providing foundational value

Many users are turning to RISC-V because they see open-source hardware as the future. For them, it may seem strange to turn to commercial software running on their solution. But the entire idea behind open sourcing is providing foundational value development teams can leverage to create project-specific value at higher levels of an application.

Yes, it may be possible for an embedded development team to piece together an open-source software development stack for RISC-V. It would take precious time and resources to create and maintain that effort. Teams without hard real-time requirements have more choices, for example, a RISC-V Linux distribution. Layering on requirements for multicore and threading, low power consumption, user interface, and connectivity might complicate that effort.

Meanwhile, competitors adopting a commercial solution like Nucleus ReadyStart would be up and running, with support from the teams at Siemens and the confidence gained from a solution deployed on millions of other devices. The Nucleus RTOS kernel is royalty-free, so there is no financial trade-off compared with deploying open-source RTOS or Linux offerings.

Most RISC-V hardware developers are using commercial EDA tools to create their designs, right? Why? Because there’s a huge amount of foundational value a team doesn’t have to recreate to design a chip successfully. The same thinking applies to tools for RISC-V embedded software development. The foundational value in Nucleus ReadyStart for RISC-V brings teams value they can build on right away in creating an embedded device application successfully.

For RISC-V developers evaluating embedded software options, a good place to start is with this short video demonstrating Nucleus RTOS for RISC-V.

June 7, 2022June 8, 2022

Advanced Packaging Analysis at DesignCon

Advanced Packaging Analysis at DesignCon
by Tom Dillinger on 06-07-2022 at 10:00 am
Categories: EDA, Xpeedic

The slogan for the DesignCon conference has been “where the chip meets the board”. Traditionally, the conference has provided a breadth of technical presentations covering the design and analysis of high-speed communication interfaces and power integrity evaluations between chip, board, and system.

The recent DesignCon event at the Santa Clara Convention Center conveyed a noticeably different theme. The emergence of 2.5D and 3D advanced packaging has necessitated the development of new tools and techniques for the extraction and channel simulation of the disparate interface topologies provided with these packages.

New classes of design requirements have emerged. Whereas interfaces were typically denoted as long reach (LR), medium reach (MR), or short reach (SR), designers are now addressing the unique electrical requirements associated with very short reach (VSR), extra short reach (XSR), and ultra short reach (USR) connections. Each of these interface types have stringent allowable signal loss and crosstalk constraints, at ever higher transmission frequencies.

In addition to the growing diversity of interface types, there is a growing demand for quick turnaround time for design and analysis iterations, while concurrently managing the increasing physical data volume. The interconnect density available on these advanced packages combined with the options for (clock-forwarded) parallel and (NRZ, PAM) serial data transmission requires extensive focus on initial package route planning. The need for a “shift left” approach to advanced packaging design closure needs fast analysis evaluation throughput.

At DesignCon, Feng Ling, CEO of Xpeedic Technology Inc., gave an insightful presentation entitled “High-Performance EM Simulation Solution for Advanced Packaging”. He highlighted how their advanced packaging analysis platform development is addressing these capacity, accuracy, and throughput challenges.

To frame the problem, Feng used the figure below to illustrate the range of physical dimensions which the electromagnetic solver must encompass.

Examples of the types of elements in a 2.5D model to be extracted are shown below, from extremely dense parallel wires associated with HBM die stack signals to embedded routes in a 2.5D interposer to through silicon vias (TSVs) and package substrate traces.

Feng focused on three key features of the Xpeedic Metis EM extraction approach:

- support for data input formats that encompass the disparity, in die, interposer, and package substrate design representations
- an optimal coupling of boundary element method (BEM, aka “method of moments”, for linear piecewise-isotropic materials) and finite element method (FEM) solution algorithms for different domains
- an optimized meshing strategy of the advanced packaging material properties and geometries – e.g., a combined rectangular and triangular surface decomposition

The figures below highlight these features:

To illustrate the efficiency and accuracy of the Metis EM solver, Feng presented comparisons of the computational resources and the insertion loss plus return loss versus frequency results for several advanced package elements (against a reference tool).

An example of these comparisons is illustrated below, for the case of an HBM channel on a 2.5D package:

- CoWoS-R package from TSMC, with an organic interposer
- co-planar signals in the channel in an interspersed supply-signal (GSGSG) configuration
- 2um signal width with 3um spacing

Representative return loss (S-parameter S11) and crosstalk (S13) curves for Metis versus a reference tool are shown, along with the computational resources required.

Specifically, the Metis solver evaluation time efficiencies support the need for fast “pathfinding” design and analysis iterations, to achieve an optimal physical implementation that confirms signal loss budgets are met.

Advanced package design technology has enabled a diverse set of electrical interfaces to be integrated, with high interconnect density provided over short distances. The target frequency of the data rates across these interfaces and the tight signal losses allowed necessitate accurate EM analysis. The design complexity of these packages means that tools must support large dataset size and simultaneously provide fast analysis throughput for signal and power implementation planning. The Metis extraction solution from Xpeedic addresses these requirements.

For more information on Xpeedic Metis, please follow this link.

PS. Perhaps DesignCon could update their slogan, “where heterogeneous chips integrated on an advanced package meets the board”.

The Electron Spread Function in EUV Lithography

The Electron Spread Function in EUV Lithography
by Fred Chen on 06-07-2022 at 6:00 am
Categories: Lithography

To the general public, EUV lithography’s resolution can be traced back to its short wavelengths (13.2-13.8 nm), but the true printed resolution has always been affected by the stochastic behavior of the electrons released by EUV absorption [1-5].

A 0.33 NA EUV system is expected to have a diffraction-limited point spread function (minimum spot size) represented by an Airy disk [6] with a full width at half-maximum level of over 20 nm. On the other hand, the electron spread function, which represents how far the EUV-released electrons migrate before driving chemical reactions in the resist, is typically fit as an exponential function [3,7]. The convolution of the electron spread function with the optical point spread function, shown in Figure 1, mathematically adds up the effects of electron spread from each point of the optical point spread function.

Figure 1. The optical point spread function of a 0.33 NA 13.5 nm wavelength system (blue) is fit well with a Gaussian with sigma=8.5 nm (orange). The electron spread function is typically fit with an exponential function (gray) with lambda as a fitting parameter. Here lambda corresponds to a decay length of 3 nm. The resulting final spread function (yellow) has a peak that is not at zero radius but at some few nm radial distance.

The electron spread inevitably worsens the resolution of EUV compared to the optical-only expectation. The overlap of optical+electron spread functions degrades the ability to resolve the gap between two features, as the deposited energy in between is at least doubled compared to an isolated feature (Figure 2).

Figure 2. Overlap of optical+electron spread functions degrades the ability to resolve the gap between two features. Here the feature separation is 40 nm. (Note: there is a local minimum in the peaks at 0 and 40 nm, so the feature pitch does not change in the image.)

The degradation is aggravated further by the inevitable stochastic variation, which causes the value of lambda to be random. This causes significant gap CD variation (Figure 3).

Figure 3. Stochastically varying electron spread (random values of lambda) causes CD variation in closely paired features. The blue curve indicates the expected image from two points separated by 40 nm in an 0.33 NA 13.5 nm wavelength system. The orange curves are the expected images from three random values of lambda, which reflect random degrees of electron spread. The black dashed line indicates the threshold level for printing in the resist. The grey curves indicate the extent of electron spread, with r=0 being the location of photon absorption.

For a 40 nm separation, the gap CD in Figure 3 spans from 14 nm to 18.5 nm as the decay length (1/lambda) decreases from 3 nm to 1.25 nm. Since the CD impact is quite significant (>10%), it is definitely necessary for the application of EUV lithography to include stochastic electron spread functions into the algorithms for optimization and resolution enhancements such as OPC and SMO.

References

[1] https://www.linkedin.com/pulse/blur-wavelength-determines-resolution-advanced-nodes-frederick-chen/; https://semiwiki.com/lithography/303429-blur-not-wavelength-determines-resolution-at-advanced-nodes/

[2] https://www.linkedin.com/pulse/adding-random-secondary-electron-generation-photon-shot-chen/; https://semiwiki.com/lithography/311874-adding-random-secondary-electron-generation-to-photon-shot-noise-compounding-euv-stochastic-edge-roughness/

[3] https://www.linkedin.com/pulse/demonstration-dose-driven-photoelectron-spread-euv-resists-chen/; https://semiwiki.com/lithography/312476-demonstration-of-dose-driven-photoelectron-spread-in-euv-resists/

[4] https://www.spiedigitallibrary.org/journals/Journal-of-MicroNanolithography-MEMS-and-MOEMS/volume-19/issue-2/024601/Cascade-and-cluster-of-correlated-reactions-as-causes-of-stochastic/10.1117/1.JMM.19.2.024601.pdf

[5] https://www.spiedigitallibrary.org/journals/journal-of-micro-nanolithography-mems-and-moems/volume-18/issue-1/013503/Localized-and-cascading-secondary-electron-generation-as-causes-of-stochastic/10.1117/1.JMM.18.1.013503.pdf

[6] https://en.wikipedia.org/wiki/Airy_disk

[7] M. Kotera et al., Jpn. J. Appl. Phys. 47, 4944 (2008).

This article first appeared in LinkedIn Pulse: The Electron Spread Function in EUV Lithography

Also read:

Double Diffraction in EUV Masks: Seeing Through The Illusion of Symmetry

Demonstration of Dose-Driven Photoelectron Spread in EUV Resists

Adding Random Secondary Electron Generation to Photon Shot Noise: Compounding EUV Stochastic Edge Roughness

June 6, 2022July 18, 2025

DesignDash: ML-Driven Big Data Analytics Technology for Smarter SoC Design

DesignDash: ML-Driven Big Data Analytics Technology for Smarter SoC Design
by Kalar Rajendiran on 06-06-2022 at 10:00 am
Categories: EDA, Synopsys

With time-to-market pressures ever increasing, companies are continually seeking enhanced designer productivity, faster design closure and improved project management efficiency. To accomplish these, organizations invest a lot in implementing both standardized approaches and proprietary techniques. With ever increasing product complexities, more and more engineers are needed to implement the designs. Consequently, regular onboarding of a mix of fresh engineers and experienced ones is an ongoing process.

A typical chip project includes thousands of tool-flow runs with different setup configurations. A problem that gets introduced in one flow could have ramifications later on within the same flow or in a different flow. Understanding the relationship and dependencies is critical for rapidly debugging an issue, better still for avoiding such issues in the first place. An SoC design produces a massive amount of data that gets archived away and doesn’t see the light of day after a project completes. The learnings generally don’t get documented as the team members are off to work on the next project. So, critical knowledge essentially resides among the various team members.

Being able to leverage the learnings from one project is not only helpful at the start of future projects but is even more useful when facing problems that need to be debugged. This is where and why the institutional knowledge developed over time becomes very valuable. Everything is fine until one or more team members leave and/or when many fresh engineers start working on a project. One of the biggest gripes at many companies is the loss of talent along with the institutional learnings. What if there is a way to leverage the massive amount of data to enhance productivity and efficiency of an SoC design? A way for engineering teams to benefit from the valuable insights that are hidden within the archived big data. Until recent years, compute power and machine learning technology were not available in a commercially viable scale to mine this big data.

Earlier this week, Synopsys launched an ML-Driven Big Data Analytics technology which helps solve this long standing problem of lost knowledge. You can access the entire press release here. I had a chat with Mark Richards, Sr. Staff Product Marketing at Synopsys to gain more insights about the recently announced technology. This article is a summary of the salient points I garnered from our discussion.

Synopsys DesignDash Solution

The Synopsys DesignDash solution is an EDA-tuned big data analytics tool that leverages machine-learning techniques to yield enhanced designer productivity and faster design closures. It brings immense value, particularly in the context of high system complexities, shrinking time to market windows and challenging talent resource landscape. The tool delivers a real-time, unified, 360-degree view of all design activities for faster decision making, a deeper understanding of run-to-run, design-to-design and project-to-project trends, and enhanced collaboration within an SoC development environment.

Some salient features and benefits offered by the tool include:

Extensive real-time design status through powerful visualizations and interactive dashboards
Actionable insights from structured and unstructured EDA metrics and tool-flow data
Classification of trends and identification of design limitations
Guided root-cause analysis and delivery of flow consumable, prescriptive resolutions
Team-wide dashboard for consistent and comprehensive data views to make status comparisons easy, informative, and actionable
Simple metric tracking (e.g., machine, license, and other KPIs) to optimize project management

The cloud-optimized DesignDash solution is natively integrated with Synopsys Digital Design Family of tools and offers easy 3^rd party tools support too. The solution complements the Synopsys SiliconDash product and thus enables valuable data analysis across the complete design-to-silicon lifecycle.

You can visit the DesignDash product page for more details.

Customer Experience

Customer demand for the DesignDash solution is very strong and a number of customers have already benefitted by using it for their projects. IDMs, established Fabless and startup companies alike have leveraged the automated insights and prescriptive guidance provided by the tool.

For more details, visit Synopsys.com or call your local Synopsys representative.

DesignDash and DSO.ai: A Powerful Interplay

Synopsys is already known for DSO.ai solution, an AI-based design space optimization tool. Together with DesignDash, customers enjoy the benefit of finding the best solution. While DesignDash recommends a range of suitable system architectures to consider given the product requirements and constraints, DSO.ai will help deliver the best implementation for each of these architectures.

Also Read:

Coding Guidelines for Datapath Verification

Very Short Reach (VSR) Connectivity for Optical Modules

Bigger, Faster and Better AI: Synopsys NPUs