May 27, 2016 by Bernard Murphy

Google, Deep Reasoning and Natural Language Understanding

by Bernard Murphy on 05-27-2016 at 7:00 am
Categories: General

Understanding natural language is considered a hard problem in artificial intelligence. You could be forgiven for thinking this can’t be right – surely language recognition systems already have this problem mostly solved? If so, you might be confusing recognition with understanding – loosely, recognition is the phonology (for voice) and syntax part of the problem and understanding is the semantic part.

A lot of progress has been made in recognition and this is largely thanks to deep reasoning. Voice recognition is a natural fit for these methods – systems can be trained on a voice or a range of voices and can then, thanks to probabilistic weighting, recognize a pre-determined vocabulary with high accuracy. The same applies to text recognition trained for reading selected content (stories, web content, etc.).

The quality of recognition depends on a few things – a relevant vocabulary, a sufficient grammar and a method to resolve the ambiguities which are typical in natural language. A typical English speaker has a vocabulary of ~20k words – very manageable with a large-enough neural net, though most applications today work with a much smaller task-specific vocabulary (for example in voice commands for your car). Grammars on the other hand tend to be quite simple in most applications. They throw away most of what they see and look for a likely verb and object (assuming you are the subject) to decide what you want. There are much more capable systems like IBM’s Watson, but these have required massive investment to get to better recognition.
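To make the "throw away most of what they see" idea concrete, here is a minimal Python sketch of a task-specific grammar for in-car commands. The vocabulary and the function name parse_command are invented for illustration and are not taken from any real product; the sketch simply keeps the first known verb and the first known object that follows it.

```python
import re

# Hypothetical task-specific vocabulary for in-car voice commands.
VERBS = {"play", "call", "navigate", "open"}
OBJECTS = {"music", "radio", "home", "window", "mom"}


def parse_command(utterance):
    """Return (verb, object), ignoring every word outside the vocabulary.

    The speaker is assumed to be the subject, so only a verb and an
    object are looked for. Either element is None when it is not found.
    """
    words = re.findall(r"[a-z']+", utterance.lower())
    verb = None
    for word in words:
        if verb is None:
            if word in VERBS:
                verb = word          # first recognized verb wins
        elif word in OBJECTS:
            return verb, word        # first recognized object after the verb
    return verb, None


if __name__ == "__main__":
    print(parse_command("Could you please play some music"))
    print(parse_command("Navigate me home, please."))
```

A grammar this small is cheap and robust for a handful of commands, but it has no way to deal with relative clauses or ambiguous attachments, which is where a real parser comes in.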

But now there’s a big assist to building equally capable systems, one that helps with the ambiguity problem. Google recently released SyntaxNet (which runs on top of TensorFlow) as an open-source syntax engine to recognize syntax structures in a text sentence. The release also includes an English language parser called Parsey McParseface, which identifies the syntax tree for a sentence, including relative clauses, and tags parts of speech like nouns, verbs (including tense and mode), pronouns and more.

While the system works with text, it is also built on deep reasoning to handle ambiguity in sentence structure. An example given in the link below considers “Alice drove down the street in her car”. Sounds pretty simple to us, but a possible machine interpretation is that she drove down a street which is inside her car. Trained neural net processing helps resolve these ambiguities.
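The choice between the two readings can be illustrated with a toy Python sketch. Each candidate parse is written as a list of (head, dependent) arcs, and each arc gets a probability. The numbers in ARC_PROB are made up for illustration and stand in for what a trained model might output. This is not how SyntaxNet is implemented; it only shows the idea of picking the most probable structure.

```python
from math import prod

# Two candidate parses of "Alice drove down the street in her car",
# written as (head, dependent) arcs. Determiners are omitted for brevity.
PARSE_DROVE_IN_CAR = [
    ("drove", "Alice"), ("drove", "down"), ("down", "street"),
    ("drove", "in"), ("in", "car"),      # "in her car" modifies "drove"
]
PARSE_STREET_IN_CAR = [
    ("drove", "Alice"), ("drove", "down"), ("down", "street"),
    ("street", "in"), ("in", "car"),     # "in her car" modifies "street"
]

# Hypothetical arc probabilities (assumed values, not model output).
ARC_PROB = {
    ("drove", "Alice"): 0.99,
    ("drove", "down"): 0.95,
    ("down", "street"): 0.97,
    ("drove", "in"): 0.90,
    ("street", "in"): 0.10,
    ("in", "car"): 0.98,
}


def score(parse):
    """Score a parse as the product of its arc probabilities."""
    return prod(ARC_PROB[arc] for arc in parse)


def best_parse(parses):
    """Return the candidate parse with the highest score."""
    return max(parses, key=score)


if __name__ == "__main__":
    winner = best_parse([PARSE_DROVE_IN_CAR, PARSE_STREET_IN_CAR])
    print(("drove", "in") in winner)
```

The two parses differ in a single arc, so the decision comes down to whether "in" is more likely to attach to "drove" or to "street".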

Based on training with carefully-labelled newswire texts (the Penn Treebank, drawn from the Wall Street Journal), the parser is able to come very close to human accuracy in structuring sentences. It doesn’t do quite as well with unlabeled text, especially web examples, showing there is still more research required in self-guided training.

Google’s goal in this release is to encourage wider research on the deeper problems in natural language understanding, for example completing parts of speech identification (identifying that this is the subject, not just a noun or pronoun) and the semantics. SyntaxNet helps other researchers and commercial developers avoid needing to reinvent a solution to a solved problem (and presumably they can now be confident that Google will be sympathetic to fair-use claims for products based on this software).

A lot of the interesting semantic challenges revolve around ambiguity and context-awareness: “Everyone loves someone” (one fortunate person is loved by everyone or possibly many people are loved?) and “John kissed his wife and so did Tom” (Tom kissed John’s wife or his own wife?). These problems might also be amenable to deep reasoning (what is the most probable interpretation) but it’s not yet as clear how you would constrain training examples for specific applications.
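One way to see why these sentences are ambiguous is to write each reading in first-order logic. The predicate and function names below (loves, kissed, wife) are notation chosen for illustration. The first pair gives the two quantifier orderings for "Everyone loves someone"; the second pair gives the two readings of "so did Tom".

```latex
% "Everyone loves someone"
\[ \forall x\, \exists y\; \mathrm{loves}(x, y) \quad \text{(each person loves some, possibly different, person)} \]
\[ \exists y\, \forall x\; \mathrm{loves}(x, y) \quad \text{(one particular person is loved by everyone)} \]

% "John kissed his wife and so did Tom"
\[ \mathrm{kissed}(\mathit{john}, \mathrm{wife}(\mathit{john})) \land \mathrm{kissed}(\mathit{tom}, \mathrm{wife}(\mathit{john})) \quad \text{(Tom kissed John's wife)} \]
\[ \mathrm{kissed}(\mathit{john}, \mathrm{wife}(\mathit{john})) \land \mathrm{kissed}(\mathit{tom}, \mathrm{wife}(\mathit{tom})) \quad \text{(Tom kissed his own wife)} \]
```

In both cases the syntax tree is the same for either reading, which is why a syntax parser alone cannot settle the question.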

Natural language processing is becoming a competitive frontier as personal assistant software and translation tools become more popular and as our expectations for accuracy in dictation continue to rise (who wouldn’t love to get rid of keyboards?). This is a domain worth watching. You can read more about the Google release HERE. And HERE is a Berkeley paper on training neural nets to recognize continuous speech with a 65k word lexicon.
