Instance

Array
(
    [title] => Recent Forum Threads
    [title_url] => 
    [ignore_sticky] => 0
    [exclude_current] => 0
    [limit] => 10
    [sluglist] => ["jobs-dashboard"]
    [rw_opt] => Array
        (
            [widget_select] => 1
            [pageid_281769] => 1
            [pageid_281772] => 1
        )

    [display_widget_mobile] => 
    [rw_opt_exclude] => Array
        (
            [pageid_274493] => 1
            [cpt_podcast] => 1
            [cpta_podcast] => 1
            [category_16613] => 1
            [category_16631] => 1
            [taxonomy_series] => 1
            [pageid_354254] => 1
        )

    [node_id] => Array
        (
            [0] => 2
        )

)

Threads

Recent Article Comments

Silicon Insurance: Why eFPGA is Cheaper Than a Respin — and Why It Matters in the Intel 18A Era
@kingmouf - Functionally, the fabrics are very similar (6-input LUTS, DSPs, BRAM, interconnect, etc.). DSPs are slightly different and both…

— ajaros925 on March 31, 2026
Silicon Insurance: Why eFPGA is Cheaper Than a Respin — and Why It Matters in the Intel 18A Era
How does the eFPGA fabric mentioned here compares to AMD(Xilinx)/Altera fabrics? How do you address potential security issues?

— kingmouf on March 31, 2026
Silicon Insurance: Why eFPGA is Cheaper Than a Respin — and Why It Matters in the Intel 18A Era
Interesting article. eFPGA is clearly valuable as silicon insurance, but it still buys that flexibility with meaningful area, power, and…

— TomJackson on March 30, 2026
Musk’s Orbital Compute Vision: TERAFAB and the End of the Terrestrial Data Center
Your point that radiation accelerates device aging is a real constraint. But it’s also a predictable one. Space hardware is…

— Jonah McLeod on March 29, 2026
Musk’s Orbital Compute Vision: TERAFAB and the End of the Terrestrial Data Center
He's fixated on the heating thing because it's the only theoretically viable aspect of his new scam. After considering what…

— coldsolder215 on March 29, 2026
Chemical Origins of Environmental Modifications to MOR Lithographic Chemistry
This is an important finding for understanding how MORs work, but it clearly puts oxygen in the role that acids…

— Fred Chen on March 26, 2026
Captain America: Can Elon Musk Save America’s Chip Manufacturing Industry?
No, Elon won’t turn into LBT but he doesn’t need to. All he needs is to create an culture where…

— Jonah McLeod on March 25, 2026
Captain America: Can Elon Musk Save America’s Chip Manufacturing Industry?
That is the first time I hear "egos in check" and "Elon" in the same sentence. Not going to happen,…

— jmlobert on March 25, 2026
Captain America: Can Elon Musk Save America’s Chip Manufacturing Industry?
The “trust” people are talking about isn’t about tax credits — it’s about whether a fab partner tells the truth…

— Jonah McLeod on March 24, 2026
Captain America: Can Elon Musk Save America’s Chip Manufacturing Industry?
I see several comments on trust and Elon in the same sentence but I don't see how the two go…

— katgod on March 24, 2026

RVN! 26 Banner revised (800 x 100 px) (600 x 100 px)

WP_Term Object
(
    [term_id] => 95
    [name] => Automotive
    [slug] => automotive
    [term_group] => 0
    [term_taxonomy_id] => 95
    [taxonomy] => category
    [description] => 
    [parent] => 0
    [count] => 780
    [filter] => raw
    [cat_ID] => 95
    [category_count] => 780
    [category_description] => 
    [cat_name] => Automotive
    [category_nicename] => automotive
    [category_parent] => 0
)

October 11, 2023November 30, 2023 by Lauro Rizzatti

Long-standing Roadblock to Viable L4/L5 Autonomous Driving and Generative AI Inference at the Edge

Long-standing Roadblock to Viable L4/L5 Autonomous Driving and Generative AI Inference at the Edge
by Lauro Rizzatti on 10-11-2023 at 6:00 am
Categories: AI, Automotive

Two recent software-based algorithmic technologies –– autonomous driving (ADAS/AD) and generative AI (GenAI) –– are keeping the semiconductor engineering community up at night.

While ADAS at Level 2 and Level 3 are on track, AD at Levels 4 and 5 are far from reality, causing a drop in venture capital enthusiasm and money. Today, GenAI gets the attention, and VCs eagerly invest billions of dollars.

Both technologies are based on modern, complex algorithms. The processing of their training and inference shares a few attributes, some critical, others important but not essential: See table I.

Generative AI Inference at the Edge — Table I caption: Algorithm training and inference share some but not all critical attributes. Source: VSORA

The remarkable software progress in these technologies has until now not been replicated by advancements in algorithmic hardware to accelerate their execution. For example, state-of-the-art algorithmic processors do not have the performance to answer ChatGPT-4 queries in one or two seconds at a cost of ¢2 per query, the benchmark established by Google search, or to process the massive data collected by the AD sensors in less than 20 milliseconds.

That is until French startup VSORA invested brainpower to address the memory bottleneck known as the memory wall.

The Memory Wall

The memory wall of the CPU was first described by Wulf and McKee in 1994. Ever since, memory accesses have become the bottleneck of computing performance. Advancements in processor performance have not been mirrored in memory access progress, driving processors to wait ever longer for data delivered by memories. At the end, processor efficiency drops way below 100% utilization.

To solve the problem, the semiconductor industry created a multi-level hierarchical memory structure with multiple levels of cache nearer the processor that reduces the amount of traffic with the slower main and external memories.

Performance of AD and GenAI processors depends more than other types of computing devices on wide memory bandwidth.

VSORA, founded in 2015 to target 5G applications, invented a patented architecture that collapses the hierarchical memory structure into a large high bandwidth, tightly coupled memory (TCM) accessed in one clock cycle.

From the perspective of the processor cores, the TCM looks and acts like a sea of registers in the amount of MBytes versus kBytes of actual physical registers. The ability to access any memory cell in the TMC in one cycle yields high execution speed, low latency, and low-power consumption. It also requires less silicon area. Loading new data from external memory into the TCM while the current data is processed does not affect system throughput. Basically, the architecture allows for 80+% utilization of the processing units through its design. Still, there is a possibility to add cache and scratchpad memory if a system designer so wishes. See figure 1.

Autonomous Driving and Generative AI Inference at the Edge — Figure 1 caption: The traditional hierarchical memory structure is dense and complicated. VSORA’s approach is streamlined and hierarchical.

Through a register-like memory structure implemented in virtually all memories across all applications, the advantage of the VSORA memory approach cannot be overstated. Typically, cutting-edge GenAI processors deliver single digits percentage efficiency. For instance, a GenAI processor with nominal throughput of one Petaflops of nominal performance but less than 5% efficiency delivers usable performance of less than 50 Teraflops. Instead, the VSORA architecture achieves more than 10 times greater efficiency.

VSORA’s Algorithmic Accelerators

VSORA introduced two classes of algorithmic accelerators –– the Tyr family for AD applications and the Jotunn family for GenAI acceleration. Both deliver stellar throughput, minimal latency, low-power consumption in a small silicon footprint.

With nominal performance of up to three Petaflops, they boast a typical implementation efficiency of 50-80% regardless of algorithm type, and a peak power consumption of 30 Watts/Petaflops. These are stellar attributes, not reported by any competitive AI accelerator yet.

Tyr and Jotunn are fully programmable and integrate AI and DSP capabilities, albeit in different amounts, and support on-the-fly selection of arithmetic from 8-bit to 64-bit either integer or floating-point based. Their programmability accommodates a universe of algorithms, making them algorithm agnostic. Several different types of sparsity are also supported.

VSORA processors’ attributes propel them to forefront of the competitive algorithmic processing landscape.

VSORA Supporting Software

VSORA designed a unique compilation/validation platform tailored to its hardware architecture to ensure its complex, high-performance SoC devices have plenty of software support.

Meant to put the algorithmic designer in the cockpit, a range of hierarchical verification/validation levels –– ESL, hybrid, RTL and gate –– deliver push-button feedback to the algorithmic engineer in response to design space explorations. This helps him or her select the best compromise between performance, latency, power and area. Programming code written at a high level of abstraction can be mapped targeting different processing cores transparently to the user.

Interfacing between cores can be implemented within the same silicon, between chips on the same PCB or through an IP connection. Synchronization between cores is managed automatically at compilation time and does not require real-time software operations.

Roadblock to L4/L5 Autonomous Driving and Generative AI Inference at the Edge

A successful solution should also include in-field programmability. Algorithms evolve rapidly, driven by new ideas that obsolete overnight yesterday’s state of the art. The ability to upgrade an algorithm in the field is a noteworthy advantage.

While hyperscale companies have been assembling huge compute farms with multitudes of their highest performance processors to handle advanced software algorithms, the approach is only practical for training, not for inference at the edge.

Training is typically based on 32-bit or 64-bit floating-point arithmetic that generates large data volumes. It does not impose stringent latency and tolerates high-power consumption as well as substantial cost.

Inference at the edge is typically performed on 8-bit floating-point arithmetic that generates somewhat less amounts of data, but mandates uncompromising latency, low energy consumption, and low cost.

Impact of Energy Consumption on Latency and Efficiency

Power consumption in CMOS ICs is dominated by data movement not data processing.

A Stanford University study led by Professor Mark Horowitz showed that the power consumption of memory access consumes orders of magnitude more energy than basic digital logic computations. See table II.

AD and GenAI accelerators are prime examples of devices dominated by data movement posing a challenge to contain power consumption.

Conclusion

AD and GenAI inference pose non-trivial challenges to achieve successful implementations. VSORA can deliver a comprehensive hardware solution and supporting software to meet all critical requirements to handle AD L4/L5 and GenAI like GPT-4 acceleration at commercially viable costs.

More details about VSORA and its Tyr and Jotunn can be found at www.vsora.com.

About Lauro Rizzatti

Lauro Rizzatti is a business advisor to VSORA, an innovative startup offering silicon IP solutions and silicon chips, and a noted verification consultant and industry expert on hardware emulation. Previously, he held positions in management, product marketing, technical marketing and engineering.

Also Read:

Soitec is Engineering the Future of the Semiconductor Industry

ISO 21434 for Cybersecurity-Aware SoC Development

Predictive Maintenance in the Context of Automotive Functional Safety

Share this post via:

Comments

There are no comments yet.

You must register or log in to view/post comments.

Silicon Insurance: Why eFPGA is Cheaper Than a Respin — and Why It Matters in the Intel 18A Era
@kingmouf - Functionally, the fabrics are very similar (6-input LUTS, DSPs, BRAM, interconnect, etc.). DSPs are slightly different and both…

— ajaros925 on March 31, 2026
Silicon Insurance: Why eFPGA is Cheaper Than a Respin — and Why It Matters in the Intel 18A Era
How does the eFPGA fabric mentioned here compares to AMD(Xilinx)/Altera fabrics? How do you address potential security issues?

— kingmouf on March 31, 2026
Silicon Insurance: Why eFPGA is Cheaper Than a Respin — and Why It Matters in the Intel 18A Era
Interesting article. eFPGA is clearly valuable as silicon insurance, but it still buys that flexibility with meaningful area, power, and…

— TomJackson on March 30, 2026
Musk’s Orbital Compute Vision: TERAFAB and the End of the Terrestrial Data Center
Your point that radiation accelerates device aging is a real constraint. But it’s also a predictable one. Space hardware is…

— Jonah McLeod on March 29, 2026
Musk’s Orbital Compute Vision: TERAFAB and the End of the Terrestrial Data Center
He's fixated on the heating thing because it's the only theoretically viable aspect of his new scam. After considering what…

— coldsolder215 on March 29, 2026
Chemical Origins of Environmental Modifications to MOR Lithographic Chemistry
This is an important finding for understanding how MORs work, but it clearly puts oxygen in the role that acids…

— Fred Chen on March 26, 2026
Captain America: Can Elon Musk Save America’s Chip Manufacturing Industry?
No, Elon won’t turn into LBT but he doesn’t need to. All he needs is to create an culture where…

— Jonah McLeod on March 25, 2026
Captain America: Can Elon Musk Save America’s Chip Manufacturing Industry?
That is the first time I hear "egos in check" and "Elon" in the same sentence. Not going to happen,…

— jmlobert on March 25, 2026

Search Semiwiki

Recent Forum Threads

Recent Article Comments

Recent Podcast Episodes

The Memory Wall

VSORA’s Algorithmic Accelerators

VSORA Supporting Software

Roadblock to L4/L5 Autonomous Driving and Generative AI Inference at the Edge

Impact of Energy Consumption on Latency and Efficiency

Conclusion

About Lauro Rizzatti

Also Read:

Comments

Recent Forum Threads

Recent Article Comments