Software Developers Turn to CacheQ for Multi-Threading CPU Acceleration
by Lauro Rizzatti on 06-07-2021 at 10:00 am

Three-year-old CacheQ, founded by two former Xilinx executives and a clever group of engineers, produces a distributed heterogeneous compute development environment targeting software developers with limited knowledge of hardware architecture.

The promise of compiler tools for heterogeneous compute systems intrigued me when I first read about CacheQ back at the end of 2019. The Xilinx reference was even more intriguing because I worked for a hardware emulation scale-up company that powered its platform with high-performance Xilinx FPGAs. My relationship with Xilinx was a positive one, so I’m rooting for CacheQ.

All through last year, CacheQ was quietly selling its FPGA-based computing platforms for life sciences, financial trading, government, oil and gas exploration, and industrial IoT applications. It also began expanding its reach outside the FPGA space and recently announced a new feature of the CacheQ Compiler Collection that lets software developers develop and deploy custom hardware accelerators for heterogeneous compute systems, including FPGAs, CPUs and GPUs. The advantage is that no manual code rewriting is required, and there is no need for threading libraries or complex parallel-execution APIs. Software developers can focus on their algorithms and let the compiler extract the parallelism needed to run on the available cores.

According to the news release, the compiler takes single-threaded C code and generates executables that run on CPUs, taking advantage of multiple physical x86 cores, with or without hyperthreading, as well as Arm and RISC-V cores. Code can be generated for multicore processors of the same or different architectures, and usage can be benchmarked with runtime variables. The environment is flexible: hardware can be added for performance, or the number of cores can be reduced and other processes allocated, to improve performance per watt of power consumed.
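
To make that concrete, here is a minimal sketch in plain C of the kind of single-threaded loop an auto-parallelizing compiler can distribute across cores; the function and names are my own illustration, not code from CacheQ’s tools. Because no iteration depends on another, the work can be split among threads without any threading library calls or pragmas in the source.

```c
/* Illustrative single-threaded C: each iteration is independent, so an
 * auto-parallelizing compiler can split the loop across cores without
 * any threading library calls or pragmas appearing in the source. */
#include <stddef.h>

void scale_and_offset(const float *in, float *out, size_t n,
                      float scale, float offset)
{
    for (size_t i = 0; i < n; i++) {
        out[i] = in[i] * scale + offset;   /* no loop-carried dependency */
    }
}
```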

The compiler’s results are impressive, showing a speedup of more than 486% over single-thread execution on x86 processors with 12 logical cores. An Apple M1 processor with eight Arm cores runs 400% faster than the single-threaded gcc build. The benchmarks are based on the Black-Scholes financial algorithm, widely used to price stock options.


Caption: Execution time for the Black-Scholes algorithm simulating 20,000 stock option trades, compiled single-threaded with gcc, compared with the same unmodified code compiled for one to eight threads on an Intel i7 x86 CPU with 12 logical cores and on Apple M1 silicon with eight Arm cores.

Source: CacheQ
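
For readers unfamiliar with the benchmark, the following is a generic Black-Scholes call-option pricer in plain C, shown only to illustrate the kind of per-option loop such a workload exercises; it is a sketch of mine, not CacheQ’s benchmark code.

```c
/* A minimal Black-Scholes call-option pricer in plain C. Each option is
 * priced from its own inputs only, so the loop over options has no
 * dependencies between iterations. Compile with -lm for the math library. */
#include <math.h>
#include <stddef.h>

static double norm_cdf(double x)            /* standard normal CDF */
{
    return 0.5 * erfc(-x / sqrt(2.0));
}

/* Price n European call options at a constant risk-free rate. */
void black_scholes_calls(const double *spot, const double *strike,
                         const double *vol, const double *expiry,
                         double rate, double *price, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        double sigma_sqrt_t = vol[i] * sqrt(expiry[i]);
        double d1 = (log(spot[i] / strike[i]) +
                     (rate + 0.5 * vol[i] * vol[i]) * expiry[i]) / sigma_sqrt_t;
        double d2 = d1 - sigma_sqrt_t;
        price[i] = spot[i] * norm_cdf(d1) -
                   strike[i] * exp(-rate * expiry[i]) * norm_cdf(d2);
    }
}
```

Each option is priced independently of the others, which is what makes this loop a natural candidate for multi-threading across CPU cores.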

The idea for the CacheQ Compiler Collection, says CacheQ’s CEO Clay Johnson, was something he and co-founder and CTO Dave Bennett talked about for more than 10 years. While distributed processing offers numerous performance advantages, programming continued to be a daunting challenge. They agreed the market was ready for a fast, intuitive and easy-to-use compiler targeting embedded software developers who were not hardware designers.

And that’s what CacheQ delivered. The CacheQ Compiler Collection is modeled after the gcc tool suite, with a user interface similar to that of common open-source compilers. It requires limited code modification, shortening development time and improving system quality.

In addition to the compiler, analysis tools help software developers understand performance bottlenecks and report which loops may not be threadable, for example because of loop-carried dependencies. The collection includes a compiler, a partitioner for assigning code to heterogeneous compute elements, linting, profiling and performance prediction, capabilities that do not exist with OpenMP, the primary competing technology.
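
For contrast, the sketch below shows the explicit annotation OpenMP expects for a threadable loop, next to a loop with a loop-carried dependency of the kind such analysis tools would flag; this is a generic illustration of mine, not CacheQ or OpenMP documentation.

```c
/* Sketch of the explicit annotation OpenMP requires, versus a loop that
 * naive parallelization would break because of a loop-carried dependency. */
#include <stddef.h>

/* Threadable: iterations are independent, but the developer must still add
 * the pragma (and compile with -fopenmp) to get parallel execution. */
void vector_add(const float *a, const float *b, float *c, size_t n)
{
    #pragma omp parallel for
    for (size_t i = 0; i < n; i++)
        c[i] = a[i] + b[i];
}

/* Not trivially threadable: each iteration reads the previous result
 * (a loop-carried dependency), which analysis tools should flag. */
void prefix_sum(const float *a, float *out, size_t n)
{
    float running = 0.0f;
    for (size_t i = 0; i < n; i++) {
        running += a[i];
        out[i] = running;
    }
}
```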

CacheQ’s website includes a video that explains how the compiler works.

Hardware emulation remains my area of expertise, and chip designs for those platforms aren’t a good fit for CacheQ right now. Nonetheless, CacheQ’s new compiler looks like a winner for embedded software developers who need help mastering parallel processing.
