At the Linley processor conference this week, Chris Rowen, CTO of Tensilica, presented on the protocol-processing dataplane. That sounds superficially like networking, but true protocol processing is just one part of adding powerful compute features to the dataplane. Other applications include video, audio, security, voice recognition and so on. All of these applications are inherently parallel and data-rich, and on a general-purpose control processor such as an ARM they are either impossible to run (not enough performance) or extremely power-hungry.
Depending on the application, different kinds of parallelism are required, from single-instruction multiple-data (SIMD) vector processing to homogeneous threads (all doing the same thing) or heterogeneous threads.
The Tensilica Xtensa dataplane processor units (DPUs) are highly customizable and thus suitable for all these applications. The processors generated range from 11.5K gates up to huge beasts with large numbers of execution units. In addition, they can have a huge range of I/O architectures with FIFOs, lookup tables, or very wide direct connections. After all, a high-performance DPU isn’t much use if you can’t get the data in and out to the rest of the design with high enough bandwidth.
Probably the most demanding application, requiring very high I/O performance and high performance in the compute fabric, is network data forwarding (such as in a high-performance router). The most generic way to do this would be to use a cache-coherent memory system and just put the packets in off-chip DRAM. But Chris has a rule-of-thumb that, since energy is proportional to distance, if a direct wire connect is 1 unit of energy, local memory is 4, on-chip NoC is 16 and going off-chip is 256.
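Rowen's rule of thumb lends itself to a quick back-of-the-envelope comparison. The sketch below is illustrative only; the 1/4/16/256 ratios are the only figures taken from the talk, and the function name and access counts are made up for the example:

```python
# Relative energy cost per data access, from Chris Rowen's rule of thumb:
# direct wire = 1, local memory = 4x, on-chip NoC = 16x, off-chip DRAM = 256x.
ENERGY = {"direct_wire": 1, "local_memory": 4, "on_chip_noc": 16, "off_chip_dram": 256}

def relative_packet_energy(path, accesses_per_packet):
    """Relative energy to forward one packet that touches the given
    interconnect 'accesses_per_packet' times (arbitrary units)."""
    return ENERGY[path] * accesses_per_packet

# A generic design that parks every packet in off-chip DRAM pays 256x
# the energy of a direct-connect fabric for the same number of accesses.
generic = relative_packet_energy("off_chip_dram", 2)  # one write + one read
direct = relative_packet_energy("direct_wire", 2)
print(generic // direct)  # prints 256
```

The ratio is independent of how many times the packet is touched: whatever the access pattern, keeping data on direct wires rather than in DRAM saves two orders of magnitude in interconnect energy.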
There is thus an enormous difference in energy efficiency between building the best possible fabric on-chip to keep everything fed and building something completely general-purpose, as can be seen from the above diagram comparing a cache-coherent cluster, a design where DMA is used to offload the processors, and one with direct connections.
The savings from using a DPU versus a standard microprocessor are huge. The pink bars show the efficiency of the Tensilica Xtensa DPU, the blue bars ARM, and the green bars Intel Atom. Higher numbers are better (this is efficiency; Xtensa has been scaled to 1).
To take another demanding example, consider LTE-Advanced level 7. The block diagram is complex and requires a huge amount of data, 6.5Gb/s, to be moved between the blocks. Again, comparing the general-purpose solution to building direct connections on-chip shows the enormous difference in efficiency.
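At 6.5Gb/s of sustained inter-block traffic, the 1:256 energy ratio translates directly into power. The sketch below is a rough estimate, not a figure from the talk: the 0.1pJ/bit direct-wire energy is an assumed placeholder, and only the 6.5Gb/s bandwidth and the 256x off-chip multiplier come from the article:

```python
# Back-of-the-envelope power estimate for moving 6.5 Gb/s between blocks.
DIRECT_WIRE_PJ_PER_BIT = 0.1   # assumed for illustration, not from the talk
OFF_CHIP_MULTIPLIER = 256      # Rowen's direct-wire vs off-chip ratio

bandwidth_bps = 6.5e9          # 6.5 Gb/s from the LTE-Advanced example

# power (W) = bits/s * J/bit; convert pJ -> J (1e-12) and W -> mW (1e3)
direct_mw = bandwidth_bps * DIRECT_WIRE_PJ_PER_BIT * 1e-12 * 1e3
off_chip_mw = direct_mw * OFF_CHIP_MULTIPLIER

print(f"direct connect: {direct_mw:.2f} mW, off-chip: {off_chip_mw:.1f} mW")
```

Under these assumptions the direct-connect fabric spends well under a milliwatt on data movement, while routing the same traffic off-chip costs on the order of 150-170mW, which is why the general-purpose solution fares so badly here.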