Many moons ago in the Innovation series we explored techniques like spectrum analysis to root-cause bugs. While these methods provide some value, they don’t get as close as we would like to isolating a root cause. In hindsight, given what we know about the complexity of conventional debug, it is unsurprising that we can’t root-cause in one shot. Hence the rise of agentic debug solutions from companies like ChipAgents and ChipStack. Agentic systems can reason through a root-cause analysis in multiple steps, just as we do in human-based analysis. What follows is a very intriguing parallel from our sister field (software debug), posted as a YouTube session from the CppCon C++ conference.

(Image courtesy of CppCon)
Background and bugs
This event was a joint presentation between UnDo.io (who provide time-travel debugging for C++ and Java; think something like gdb with full-context replay) and Anthropic. Their goal was to explore live (not in a canned demo) what agentic debugging would look like. A gutsy move, because the reality was messy, though still very informative. They tested two cases in parallel: a segfault in the Python interpreter, and unexpected behaviors (which prove not to be bugs) in Doom.
The Python bug should attract the interest of hardware designers: it is effectively a cache coherency issue in software. The code caches pointers to objects allocated in memory, and entries in the cache can be tested without incrementing reference counts for those objects. The coherency risk is that a referenced object may be freed without clearing the cache entry, a worthy test for the value of agentic debugging. The Doom exploration is primarily interesting for how it influences the debugging process, localizing a behavior within a playback to get close to whatever triggered that behavior. This case may be even more interesting for hardware debug, where unexpected behavior is much more likely than anything comparable to a crash.
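To make that failure mode concrete, here is a minimal sketch in Python with a toy reference count. The names and mechanics here are illustrative assumptions, not the actual CPython code: a cache stores a "borrowed" pointer without bumping the reference count, the last strong reference is later dropped, and a subsequent cache hit hands back a freed object.

```python
class Obj:
    """Toy refcounted object, loosely mimicking CPython's ob_refcnt."""
    def __init__(self, value):
        self.value = value
        self.refcnt = 1
        self.freed = False

def incref(o):
    o.refcnt += 1

def decref(o):
    o.refcnt -= 1
    if o.refcnt == 0:
        o.freed = True   # simulate free(); real code would release memory

cache = {}

def cache_put(key, obj):
    cache[key] = obj     # BUG: borrowed reference, refcnt not incremented

def cache_get(key):
    return cache.get(key)  # may hand back an already-freed object

o = Obj("payload")
cache_put("k", o)
decref(o)                # last strong reference dropped; object is freed

stale = cache_get("k")   # cache entry was never cleared
assert stale is not None and stale.freed   # use-after-free in miniature
```

The hardware analogy holds: the cache and the allocator each behave correctly in isolation; the bug lives in the missing coherency protocol between them.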
My takeaways from the demo
The Python debug demo is, as far as I can tell, hands-free apart from the initial setup. Analysis starts with the crash and iterates backwards, bouncing between multiple types of agents, trying different hypotheses and testing with different techniques to eliminate possibilities. UnDo added an adversarial “bug diagnosis validator” agent (Claude Code provides support for this). As agentic analysis progresses, discoveries start to converge towards the right area, ultimately getting pretty darn close to the root cause.
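The internals of the demo’s agents are not public, but the hypothesize/test/validate loop described above can be sketched conceptually. Everything here is hypothetical (the names `debugger_agent`, `validator_agent`, and the hypothesis records are my own illustration, not UnDo’s or Anthropic’s code):

```python
# Conceptual sketch of an agentic triage loop, not the demo's actual code.

def debugger_agent(hypotheses):
    """Take the next open hypothesis and 'test' it.
    A real agent would replay the time-travel recording backwards
    from the crash to gather evidence."""
    h = hypotheses.pop(0)
    evidence = h["test"]()       # run a check against the recording
    return h, evidence

def validator_agent(h, evidence):
    """Adversarial check: does the evidence actually support the diagnosis?"""
    return evidence is True

def triage(hypotheses):
    """Iterate until a hypothesis survives adversarial validation."""
    while hypotheses:
        h, evidence = debugger_agent(hypotheses)
        if validator_agent(h, evidence):
            return h["name"]     # converged on a validated root cause
    return None                  # nothing survived; need new hypotheses

hypotheses = [
    {"name": "heap corruption",       "test": lambda: False},
    {"name": "stale cached pointer",  "test": lambda: True},
]
assert triage(hypotheses) == "stale cached pointer"
```

The adversarial validator is the interesting design choice: rejecting plausible-but-unsupported diagnoses is what keeps the loop converging rather than stopping at the first confident-sounding answer.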
As expected, Claude builds a ToDo list of tasks it believes it needs to perform to work towards a goal (e.g. find when the second zombie was killed in the Doom debug, see below), and checks these off as it progresses. An interesting revelation is that it can apparently lose the plot periodically, at which point it needs to be reminded to revisit the list. This didn’t seem to happen in the Python case.
The Doom analysis is more collaborative, I imagine because they don’t have a bug to target. Instead, they are trying to understand unexpected behaviors. For example, why did the player get stuck in the map room after killing the second zombie? Here the presenter asked, “when was the second zombie killed during this playthrough (recorded playback)?” Claude got him to this point, from which he could ask it to drill down further. Note the value of being able to use a high-level reference (the second zombie) in prompting next steps.
The demo often ran into system problems (repeated “server overloaded” errors with the Opus model, Anthropic’s top-end Claude model), which seemed to reflect server-busy problems on the Claude side. These issues are now apparently fixed (or at least improved); this demo was running with a pre-release of the Claude Code API.
There was a question from the audience about token costs. The UnDo speaker suggested single-digit dollars for the Doom example (56k LOC), and much higher costs ($$ numbers not cited) to track down the Python interpreter bug mentioned earlier (350k lines of C, 800k lines of Python).
It’s a long video (about an hour) but well worth watching all the way through for the insights it provides. You can find the video HERE.