When we think of datacenters, we think of serried ranks of high-performance servers. Recent announcements from Google (on the Tensor Processing Unit), Facebook and others have opened our eyes to the role that specialized hardware and/or GPUs can play in support of deep/machine learning and big data analytics. But most of us would probably still consider those applications, while important, somewhat niche in their role in the datacenter.
Several years ago, motivated by what they knew was already happening at Google and Amazon, Microsoft started to build their own machine-learning system to enhance the capabilities of Bing. But rather than develop a custom device, or build on a GPU platform, they decided to build on FPGAs. As we know, FPGA-based solutions can be significantly cheaper to build and deploy than custom silicon when you know you are going to be the sole customer. And of course FPGAs have the advantage of re-programmability. The Microsoft team built an FPGA-based platform they called Catapult and demonstrated that it significantly accelerated machine-learning algorithms in Bing (over previous software-only approaches, I assume).
Fast forward to 2015. Even the most starry-eyed Microsoft supporter would admit that Bing has a long way to go to catch up with the leader in search and is unlikely to drive significant revenue for Microsoft in the near future. What the company really wants are more ways to propel their major online services – Azure (the Microsoft cloud) and Office 365. Catapult was appealing for both of these services, though not necessarily for machine learning.
A major problem for Azure has been managing the high volume of PCIe network traffic to and from virtual machines through virtual network (VN) adapters. When this reaches GB/sec for a VM, the VN management load on the CPU becomes substantial. Offloading this work to hardware that can carry the physical traffic and handle network virtualization can significantly improve throughput. Conventional network cards would be one solution, but the Azure team didn't find that approach adaptable enough to support what they needed in a flexible VN fabric on the server side. After all, if you want maximum flexibility in VM management in the cloud, you need corresponding flexibility in VN management. The Azure team felt this could best be handled through FPGAs, particularly for the programmability they offer in load balancing and other rule processing.
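To make the offload idea concrete, here is a minimal sketch in Python of the match-action flow-cache pattern such offloads typically follow. This is purely illustrative – the names (FlowKey, FlowTable, demo_rules) and structure are my assumptions, not Microsoft's actual Catapult/Azure design. The first packet of a flow runs through the full, programmable rule pipeline on the CPU; the resulting action is cached so every subsequent packet of that flow hits the fast path, which is the part an FPGA would implement in hardware.

```python
# Illustrative sketch of a match-action flow cache for virtual-network offload.
# Names and structure are hypothetical; this shows the general pattern,
# not Microsoft's actual implementation.

from dataclasses import dataclass

@dataclass(frozen=True)
class FlowKey:
    """Identifies a flow: the fields the fast path matches on."""
    src_ip: str
    dst_ip: str
    src_port: int
    dst_port: int
    protocol: str  # e.g. "tcp" or "udp"

@dataclass
class Action:
    """What to do with packets of a matched flow."""
    rewrite_dst: str   # e.g. NAT / load-balancer target address
    encap_vni: int     # virtual network ID for tunnel encapsulation

class FlowTable:
    def __init__(self, slow_path_rules):
        self.cache = {}                          # fast path: exact-match cache (FPGA/NIC role)
        self.slow_path_rules = slow_path_rules   # full programmable pipeline (CPU role)

    def process(self, key: FlowKey) -> Action:
        # Fast path: already-seen flows hit the cache directly.
        if key in self.cache:
            return self.cache[key]
        # Slow path: run the full rule pipeline once, then install
        # the result so later packets of this flow skip the CPU.
        action = self.slow_path_rules(key)
        self.cache[key] = action
        return action

# Example slow-path pipeline: hash-based load balancing across backends.
BACKENDS = ["10.0.0.11", "10.0.0.12", "10.0.0.13"]

def demo_rules(key: FlowKey) -> Action:
    backend = BACKENDS[hash(key) % len(BACKENDS)]
    return Action(rewrite_dst=backend, encap_vni=4001)

table = FlowTable(demo_rules)
pkt = FlowKey("192.0.2.7", "203.0.113.10", 49152, 443, "tcp")
print(table.process(pkt))   # first packet: slow path, installs a cache entry
print(table.process(pkt))   # subsequent packets: fast-path hit
```

The reason to put the fast path in an FPGA rather than a fixed-function NIC is exactly the flexibility the Azure team wanted: when the provider adds new load-balancing, metering, or security policies, the matching logic itself can be reprogrammed in the field rather than waiting for new silicon.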
All of this required a major rework of Catapult, but now the hardware is done and being rolled out. And this is no longer a few specialized boxes serving specialized needs. Azure needs a Catapult system per server (exact details are difficult to find, but it looks like one per server). And you can add to that the deep/machine-learning requirements to support Bing and, later, encryption/compression and machine-learning requirements to support Office 365.
This is a whole new ball game for FPGA deployment. Since a large datacenter contains many hundreds of thousands of servers, Microsoft's demand alone has apparently shifted worldwide FPGA volumes significantly. You should know, by the way, that Catapult is based on Altera FPGAs. Intel EVP Diane Bryant is on record as saying this is why Intel bought Altera last year. She also anticipates that, for similar reasons, one third of all servers in datacenters will contain FPGAs (presumably optical connectivity sets the limit on volume, since that is one area where FPGAs maybe can't help – for now, but stay tuned, since Intel was talking about both FPGAs and photonics at the OCP Summit this year).
Of course you could argue that Microsoft and Intel have misread the market and that the virtual networking functionality will be replaced by ASIC hardware solutions (especially optical ones). I'm not so sure, at least for the next few years. This is an area of critical differentiation for cloud services providers, so they'll each want their own solutions. The economics of ASICs may not be a big factor in those budgets, but adaptability could be a very big factor, especially as capabilities in cloud services are evolving quickly. Eventually differentiation always moves on to other factors, but it's not clear that will happen here anytime soon.
You can read the Wired article on Catapult HERE and a slightly more detailed article on the Azure need for networking flexibility HERE.