Limits to Deep Reasoning in Vision
by Bernard Murphy on 08-01-2016 at 7:00 am

If you are a regular reader, you’ll know I like to explore the boundaries of technology. Readers I respect sometimes interpret this as a laughable attempt to oppose the inevitable march of progress, but that is not my purpose. In understanding the limits of a particular technology, it becomes possible to envision what properties a successor technology should have. And that, to me, seems more interesting than assuming all further progress in that direction will be no more than fine-tuning.

Take deep learning and vision. Recent progress has been quite astounding; in one example, systems have bested humans at identifying dog breeds. These systems are now used in cars for driver assistance and safety applications – detecting lane markings, collision hazards, even traffic signs. Increasingly, Google and Facebook use image recognition to search for and tag people, animals and objects in images. It seems we have all but conquered automated image recognition, at a level better than humans. But have we really, and if so, is that good enough?

While progress in deep reasoning has been impressive, there have also been some fairly spectacular failures. Microsoft was forced to retire a chatbot after it developed racist and other unpleasant tendencies. Google had to remove the “gorilla” tag from its Photos app after complaints that it was identifying dark-skinned people as gorillas. And Google released open-source software that finds surrealist collages of faces in what we would consider perfectly ordinary images (in fairness, Google was pushing the software to see what would happen).

You could argue that this is just the normal progression for a technology: perhaps once the bugs are worked out, these problems will be rare. But I am skeptical that the solutions as they stand just need better training. Our own fallibility in image recognition should be a hint. It’s common to see faces and other images in complex irregular patterns if we stare at them for a while, a phenomenon called pareidolia: a bias of the brain to find patterns, particularly faces, in random images. I can’t imagine why deep reasoning should be immune from this problem; after all, we modeled the method on human reasoning, so it would be surprising if it did not also inherit the weaknesses of that approach. In fact the Google software that produced the surrealist images is known to have this bias.
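The bias is easy enough to poke at. Below is a minimal sketch (my own illustration, not anything Google published) that feeds pure random noise to a pretrained ImageNet classifier; the model choice and confidence readout are assumptions for demonstration. The net cannot answer “this is just noise” – it was only ever trained to choose among its categories, which is its own small form of pareidolia.

```python
# Illustrative sketch: a pretrained classifier confronted with pure noise.
# The model choice (torchvision's resnet18) is an assumption for
# demonstration; any ImageNet classifier behaves similarly.
import torch
import torchvision.models as models

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
model.eval()

noise = torch.rand(1, 3, 224, 224)   # a random "image" with no structure at all
with torch.no_grad():
    probs = torch.softmax(model(noise), dim=1)

conf, label = probs.max(dim=1)
print(f"predicted class {label.item()} with confidence {conf.item():.2f}")
# The net always commits to one of its 1000 categories; there is no
# built-in way for it to say "I don't see anything here."
```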

How good the recognition has to be may depend on the application, but clearly there is room for improvement, and for some applications the bar is going to be very high. More training might help, up to a point. So might more hidden layers, though apparently the value of adding layers drops off sharply beyond a relatively small number. Ultimately we have to acknowledge that the only straightforward way to fix deep reasoning problems within the current approach is to do more of the same, which is not an encouraging place to start when you want to find breakthrough solutions.

Or perhaps we could go back to how we ourselves think. Most of us don’t instantly convert what we think we see into action; we consider multiple factors and pass our conclusions through multiple filters. This is so apparent that we all know people who seem to lack these safeguards; we consider them socially challenged (or worse). Now think of a cascade of neural nets, where each net is trained in a different way. Deep learning for particle detection at the Large Hadron Collider (LHC) already works along these lines, combining different approaches – neural nets and binary decision trees – to weed out false positives. This alone might be a good start, with a first-order goal of defaulting to “I don’t know” when there is ambiguity in recognition.
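As a concrete sketch of that first-order goal, imagine an ensemble that only commits to an answer when independently trained recognizers agree with high confidence. The models, the threshold and the unanimous-vote rule below are illustrative assumptions of mine, not a description of the LHC pipeline or of any production system.

```python
# Hedged sketch of a "cascade of differently-trained recognizers" that
# defaults to "I don't know" on disagreement. All names, the threshold
# and the voting rule are illustrative assumptions.
import numpy as np

def recognize(image, models, min_conf=0.9):
    """Return a class label only when every model agrees with high
    confidence; otherwise return None ("I don't know")."""
    votes = []
    for m in models:
        probs = m(image)                   # each model maps image -> class probabilities
        label = int(np.argmax(probs))
        votes.append((label, float(probs[label])))

    labels = {label for label, _ in votes}
    if len(labels) == 1 and all(conf >= min_conf for _, conf in votes):
        return votes[0][0]                 # unanimous and confident
    return None                            # ambiguous: defer rather than guess
```

A system built this way trades coverage for reliability: it answers less often, but a wrong-yet-confident output – the failure mode behind the examples above – becomes much harder to produce.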

Training more nets and combining more methods would be more expensive, and the outcome may initially be more ambiguous than we might like. But maybe that’s an inescapable reality of improved recognition. Perhaps we should think of what we have today as hind-brain recognition – good for a quick fight-or-flight response but, like the hind-brain, not good at the ultra-high-fidelity recognition for which we will need improved tools.

I’m sure that however this evolves, the field will continue to be called deep learning, but that’s just a label. For one insight into the limitations of existing architectures, and into newer methods, see HERE. You can see the Google surrealist art HERE.
