AI Inference Software Architect
by Admin on 06-22-2020 at 9:23 am

Full Time
Silicon Valley
Posted 6 years ago
Applications have closed

Flex Logix has finished the hardware design and is fabricating its first Inference Accelerator CoProcessor, InferX X1, which is based on our nnMAX Inference IP. We will have chips and PCIe boards this autumn. Our software team is preparing our Inference Model Compiler to be ready to run deep neural network models on X1.

We have begun architecting the follow-on chip. InferX has industry-best inference efficiency: more inference throughput per $ and per watt. We excel on larger models and megapixel images, but can run any neural network.

RESPONSIBILITIES

You will be part of the small but excellent team responsible for our nnMAX Model Compiler, a DNN (Deep Neural Network) model-to-binary flow. We are looking for an architect-level software engineer to expand the architecture of our Model Compiler, written in modern C++, to support additional capabilities, in particular:

Parsing of TensorFlow Lite, ONNX, and other DNN model description languages into our internal model format, including support for custom-defined operators
Considering numerous parameters (memory bandwidth, memory access pattern, memory and compute resource allocation, etc.) to arrive at an optimal computation strategy for each new operator
Mapping of each operator and its computation strategy to Verilog RTL code, running on the EFLX eFPGA inside the nnMAX chip and controlling the Flex computation and memory engines

This is a software architect/developer role, but your activities will include integration with the Flex Logix hardware-based computation architecture for DNNs, as well as representing that computation architecture in software abstractions.

EXPERIENCE AND SKILL REQUIRED

Expertise in developing software compilers for one or more hardware-accelerated computational engines, preferably for DNN training or inference
Ability to take on complex problems and come up with efficient, innovative solutions
Experience with modern AI frameworks and inference engines: TensorFlow, PyTorch, TensorFlow Lite, ONNX
Experience with Verilog and hardware computation architectures
Experience with FPGA synthesis tools such as Synopsys Synplify is a plus
Very good grasp of proper software/hardware engineering practices (modeling, design, and documentation) and excellent communication skills
Must be very smart and very motivated, must be a quick learner, proactive and curious.
Must be passionate about being part of an aggressive, venture-backed startup team that is changing the way chips and supporting software are architected, designed, and programmed.
Must be an entrepreneurial, innovative problem solver and willing to work hard.
Must live in Silicon Valley. Strong preference for US citizenship or permanent residency (“green card”); will consider candidates with current H-1B visas who are willing to transfer promptly.
