Agentic methods are hot right now because single LLM models seem limited to point tool applications. Each such application is impressive, but each is still a single step in the more complex chains of reasoning we want to automate, where agentic methods should shine. I have been hearing that software engineering (SWE) teams are advancing faster in AI adoption than hardware teams, so I thought it would be useful to run a quick reality check on status. Getting into the spirit of the idea, I used Gemini Deep Research to find sources for this article, selectively sampling a few surveys it offered while adding a couple of my own finds. My quick summary: first, what counts as progress depends on the application; convenience-based use-models are more within reach today, while precision use-models are also possible but more bounded. Second, advances are more evident in automating subtasks within a natural framework of crosschecks and human monitoring than in a hands-free, end-to-end SWE objective.
Automation for convenience
One intriguing paper suggests that for convenience needs we should move away from apps toward prompt-based queries that serve the same objectives. This approach can in principle do better than apps: prompt-based systems eliminate the need for app development, can be controlled through the language we all speak rather than cryptic human-machine interfaces, and can adapt more easily to variations in needs.
Effective prompt engineering may still be more of an art than we would prefer, but the author suggests we can learn to become more effective, and (my interpretation) perhaps we only need to learn this skill once rather than for every unique app.
Even technology engineers need this kind of support, not in deep development or analysis but for routine yet important questions: “who else is using this feature, when was it most recently used, what problems have others seen?” Traditionally these might be answered by a help library or an in-house data management app, but what if you want to cross your question with other sources or constraints outside the scope of that app? In hardware development, imagine the discovery power available if you could run prompt-based searches across all design data: spec, use cases, source code, logs, waveforms, revisions, and so on.
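To make that concrete, here is a minimal sketch (mine, not from any of the papers referenced) of how one natural-language question could be crossed against several design-data sources. Every function below is a stand-in for illustration, not a real API.

```python
# Hypothetical sketch: one prompt spans sources that today sit behind separate apps.
# All data sources and ask_llm() are stand-ins, not real tools.

def fetch_spec_sections(q: str) -> str:
    return "Spec 4.2: retention feature required for low-power island"        # stand-in

def fetch_checkin_log(q: str) -> str:
    return "2024-11-03: retention controller updated by power team"           # stand-in

def fetch_sim_logs(q: str) -> str:
    return "Regression 1182: 2 retention-related failures, since waived"      # stand-in

def ask_llm(prompt: str) -> str:
    return f"(LLM response to: {prompt[:60]}...)"  # replace with a real model call

def answer_design_question(question: str) -> str:
    # Gather context across spec, revision history and simulation logs in one pass
    context = "\n".join([
        fetch_spec_sections(question),
        fetch_checkin_log(question),
        fetch_sim_logs(question),
    ])
    prompt = f"Question: {question}\nContext:\n{context}\nAnswer concisely."
    return ask_llm(prompt)

print(answer_design_question(
    "Who else uses the retention feature, when was it last exercised, "
    "and what problems have others seen?"))
```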
Automating precision development
This paper describes an LLM-based agentic system, with agents for management, code generation, optimization, QA, iterative refinement, and final verification, used to develop quite complex functions: a face recognition system, a chatbot, a face mask detection tool, a snake game, a calculator, and a Tic-Tac-Toe game. It claims 85% or better code accuracy against a standard benchmark, building and testing these systems in minutes. At 85% accuracy we must still follow that initial code with developer effort to verify and correct to production quality. But assuming this level of accuracy is repeatable, it is not hard to believe that even allowing a few weeks or months of developer testing and refinement, the net gain in productivity, without loss of quality, could be considerable.
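As a rough illustration (my own sketch, not the paper’s implementation), the manage / generate / QA / refine / verify loop could be structured something like this, with call_llm() standing in for any real chat-completion call:

```python
# A minimal sketch of the multi-agent pattern, not the paper's actual system.
# call_llm() is a stand-in; replace it with a real model call.

def call_llm(role: str, task: str) -> str:
    return f"[{role} output for: {task[:50]}...]"  # stand-in response

def build_feature(spec: str, max_iterations: int = 3):
    plan = call_llm("manager", f"Break this spec into coding tasks: {spec}")
    code = call_llm("code generator", f"Write code for: {plan}")
    for _ in range(max_iterations):
        review = call_llm("QA", f"Test and critique this code: {code}")
        if "no issues" in review.lower():      # crude convergence check
            break
        code = call_llm("refinement", f"Revise the code to address: {review}")
    verdict = call_llm("verification", f"Final check against the spec: {code}")
    # At ~85% claimed accuracy, human verification still follows this automated pass
    return code, verdict

code, verdict = build_feature("A command-line Tic-Tac-Toe game")
```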
Another paper points out that in SWE there is still a trust issue with automatically developed code. However, they add that most large-scale software development is more about assembling code from multiple sources than developing code from scratch, which changes the trust question to how much you can trust the components and the assembly. I’m guessing they consider assembly in DevOps to be relatively trivial; in hardware design, SoC-level assembly (or even multi-die system assembly) is more complex, though still primarily mechanical rather than creative. The scope for mistakes is certainly more limited than it would be in creating a complete new function from scratch. I know of an AI-based system from over a decade ago which could create most of the integration infrastructure for an SoC: clocking, reset, interrupt, bus fabric, and so on. This was long before we’d heard of LLMs and agents.
Meanwhile, agentic/generative AI isn’t only useful for code development. Tools are appearing to automate test design, generation, and execution, to assist debug, and more generally to support DevOps. Many of these systems in effect crosscheck each other and are further complemented by human oversight. Mistakes might happen, but perhaps no more often than in an AI-free flow.
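For example (again a hypothetical sketch, not a description of any specific tool), a test-design agent can crosscheck a code-generation agent, with a human sign-off as the final gate:

```python
# Hypothetical sketch of agents crosschecking each other under human oversight.
# generate_tests(), run_tests() and the code under test are stand-ins, not real tooling.

def generate_tests(spec: str) -> list[str]:
    # Produced by a test-design agent, independent of the code-writing agent
    return [f"{spec}: handles empty input", f"{spec}: rejects bad config"]

def run_tests(code: str, tests: list[str]) -> list[str]:
    return []  # stand-in: a real harness would return the list of failing tests

def gated_merge(spec: str, generated_code: str) -> bool:
    tests = generate_tests(spec)
    failures = run_tests(generated_code, tests)
    summary = f"{len(tests)} tests run, {len(failures)} failures for '{spec}'"
    approved = input(f"{summary}\nApprove merge? [y/N] ").lower().startswith("y")
    return not failures and approved   # human sign-off is the final gate

# gated_merge("config parser", "<generated code>")
```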
Convenience, precision or a bit of both?
Engineers obsess about precision, especially around AI. But much of what we do during our day doesn’t require precision; “good enough” answers are OK if we can get them quickly. Search, summarizing key points from an email or paper, and generating a first-draft document are all areas where we depend on (or would like) the convenience of a quick and “good enough” first pass. On the other hand, precision is vital in some contexts. For financial transactions, jet engine modeling, or logic simulation we want the most accurate answers possible; here “good enough” isn’t good enough.
Even so, AI can still offer an advantage in precision applications. If it can provide a good enough starting point very quickly (minutes), and if we can manage our expectations by accepting the need to refine and verify beyond that starting point, then the net benefit in shortened schedule and reduced effort may be worth the investment, as long as you can build trust in the quality the AI system can provide.
Incidentally, my own experience (I tried Deep Research (DR) options in Gemini, Perplexity, and ChatGPT) backs up these conclusions. Each DR analysis appeared in about 10 minutes and was mostly useful to me for the references it provided rather than for the DR summary itself. Some of these references were new to me, some I already knew. That might have been enough if my research were purely for my own interest, but since I’m aiming to provide reliable insight I also looked for other references through more conventional online libraries. Combining both methods proved productive!