Instance

Array
(
    [title] => Recent Forum Threads
    [title_url] => 
    [ignore_sticky] => 0
    [exclude_current] => 0
    [limit] => 10
    [sluglist] => ["jobs-dashboard"]
    [rw_opt] => Array
        (
            [widget_select] => 1
            [pageid_281769] => 1
            [pageid_281772] => 1
        )

    [display_widget_mobile] => 
    [rw_opt_exclude] => Array
        (
            [pageid_274493] => 1
            [cpt_podcast] => 1
            [cpta_podcast] => 1
            [category_16613] => 1
            [category_16631] => 1
            [taxonomy_series] => 1
            [pageid_354254] => 1
        )

    [node_id] => Array
        (
            [0] => 2
        )

)

Threads

AI Inference Software Architect

AI Inference Software Architect
by Daniel Nenni on 09-09-2020 at 7:55 pm

Full Time
Mountain View, CA
Posted 5 years ago
Applications have closed

Flex Logix has finished the hardware design and is fabricating it’s first Inference Accelerator CoProcessor, InferX X1, which is based on our nnMAX Inference IP. We will have chips and PCIe boards this autumn. Our software team is preparing our Inference Model Compiler to be ready to run deep neural network models on X1.

We have begun architecting the follow-on chip. InferX has industry-best inference efficiency: more inference throughput per $ and per watt. We excel on larger models and megapixel images, but can run any neural network.

RESPONSIBILITIES
Part of the small but excellent team responsible for our nnMAX Model Compiler: a DNN (Deep Neural Network) Model-to-binary flow – we are looking for a software architect-level position to expand architecture of our Model Compiler, written in modern C++, for addition of functionality for support of additional capabilities, in particular:
– Parsing of TensorflowLite/ONNX/other DNN model description languages to our internal model
format, support of custom-defined operators
– Consider numerous parameters (memory bandwidth, memory access pattern, memory & compute resource allocation, etc.) to arrive at an optimal computation strategy for each new operator
– Mapping of each operator and its computational strategy to Verilog RTL code, running on EFLX eFPGA inside nnMAX chip and controlling Flex computation & memory engines.

This is a software architect/developer role but your activities will include integration with Flex-Logix hardware-based computation architecture for DNN as well as representation of computational architecture in software abstractions.

EXPERIENCE AND SKILL REQUIRED
Expertise in developing software compilers for one or more hardware-accelerated computational
engines, preferably for DNN training or inference
Abilities to take complex problems and come up with efficient, innovative solutions
Experience with modern AI frameworks and inference engines – TensorFlow, PyTorch, TfLite, ONNX

Experience with Verilog and hardware computation architectures.

Experience with FPGA synthesis tools such as Synopsys Synplify is a plus

Very good grasp of proper software/hardware development engineering practices, modeling, design, representation in documentation, excellent communication skills.

Must be very smart and very motivated, must be a quick learner, proactive and curious.
Must be passionate about being part of an aggressive, venture-backed startup team that is changing the way chips and supporting software are architected, designed, and programmed.
Must be entrepreneurial, innovative problem solver and willing to work hard.

Must live in Silicon Valley. Strong preference for US citizenship or permanent residency (“green card”);
will consider candidates with current H1-B visas who are willing to transfer promptly.

Share this post via:

Facing the Quantum Nature of EUV Lithography
This presentation considers 5 nm Gaussian acid blur: https://www.youtube.com/watch?v=MYLdE69RDBg

— Fred Chen on July 7, 2025
Flynn Was Right: How a 2003 Warning Foretold Today’s Architectural Pivot
Appreciate your take, Rahul. You’re absolutely right that market scale drives architectural investment—scalar dominated when desktop and enterprise ruled, and…

— Jonah McLeod on June 29, 2025
Flynn Was Right: How a 2003 Warning Foretold Today’s Architectural Pivot
Well.. I found this to be a funny article. Flynn's critique is fine and good...but not really the driving factor…

— Rahul Razdan on June 29, 2025
Reachability in Analog and AMS. Innovation in Verification
Apologies for that slip-up on our part. Failing memories!

— Bernard Murphy on June 27, 2025
Reachability in Analog and AMS. Innovation in Verification
swka: This is true, I worked with MunEDA up until the Cadence acquisition. Before that I worked with Solido up…

— Daniel Nenni on June 26, 2025
Reachability in Analog and AMS. Innovation in Verification
One quick correction. WiCkeD was MunEDA tool, which was acquired by Cadence. So it is never part of Synopsys. Synopsy…

— swka on June 26, 2025
Flynn Was Right: How a 2003 Warning Foretold Today’s Architectural Pivot
At Simplex Micro, the name says it all. Founder Dr. Thang Tran chose it to reflect his belief that in…

— Jonah McLeod on June 25, 2025
Flynn Was Right: How a 2003 Warning Foretold Today’s Architectural Pivot
Thanks for the thoughtful read—and you're right, we’re in a fascinating inflection point. On your first point: Lunar Lake doesn’t…

— Jonah McLeod on June 24, 2025

Search Semiwiki

Recent Forum Threads

Recent Article Comments

Recent Podcast Episodes

Recent Forum Threads

Recent Article Comments