Meta's first bespoke AI chips

blueone · May 18, 2023

Meta unveils its first custom AI chip

The circuits come with software optimized to run PyTorch, and emphasize the task of making recommendations.

www.zdnet.com

Definitely going for efficiency over highest processing power, which IMO is smarter for initial custom hardware designs.

7nm TSMC.

64 custom PEs for acceleration, lots of memory channels for bandwidth, 128MB of on-die SRAM.

A couple of RISC-V cores, not Arm.

hist78 · May 18, 2023

More technical details can be found here:

MTIA v1: Meta’s first-generation AI inference accelerator

In 2020, we initiated the Meta Training and Inference Accelerator (MTIA) family of chips to support our evolving AI workloads, starting with an inference accelerator ASIC for deep learning recommendation models (DLRMs).

ai.facebook.com

"Lessons for the future

Building custom silicon solutions, especially for the first time, is a significant undertaking. From this initial program, we have learned invaluable lessons that we are incorporating into our roadmap, including architectural insights and software stack enhancements that will lead to improved performance and scale of future systems.

The challenges we need to address are becoming increasingly complicated. Looking at historical trends in the industry for scaling compute, as well as memory and interconnect bandwidth, we can see that memory and interconnect bandwidth are scaling at a much lower pace compared with compute over the last several generations of hardware platforms.

The lagging performance of memory and interconnect bandwidth has also manifested itself in the final performance of our workloads as well. For example, we see a significant portion of a workload’s execution time spent on networking and communication.

Moving forward, as part of building a better and more efficient solution, we are focused on striking a balance between these three axes (compute power, memory bandwidth, and interconnect bandwidth) to achieve the best performance for Meta’s workloads. This is an exciting journey, and we’re just getting started."

hist78 · May 18, 2023

I'm wondering if anything came out from the old Intel-Facebook chip design project.

Intel working with Facebook on chips for AI

Intel chief Brian Krzanich said Tuesday his company is working on a super-fast chip designed specifically for artificial intelligence.

phys.org

blueone · May 18, 2023

hist78 said:
More technical details can be found here:

MTIA v1: Meta’s first-generation AI inference accelerator

In 2020, we initiated the Meta Training and Inference Accelerator (MTIA) family of chips to support our evolving AI workloads, starting with an inference accelerator ASIC for deep learning recommendation models (DLRMs).

ai.facebook.com

"Lessons for the future

Building custom silicon solutions, especially for the first time, is a significant undertaking. From this initial program, we have learned invaluable lessons that we are incorporating into our roadmap, including architectural insights and software stack enhancements that will lead to improved performance and scale of future systems.

The challenges we need to address are becoming increasingly complicated. Looking at historical trends in the industry for scaling compute, as well as memory and interconnect bandwidth, we can see that memory and interconnect bandwidth are scaling at a much lower pace compared with compute over the last several generations of hardware platforms.

The lagging performance of memory and interconnect bandwidth has also manifested itself in the final performance of our workloads as well. For example, we see a significant portion of a workload’s execution time spent on networking and communication.

Moving forward, as part of building a better and more efficient solution, we are focused on striking a balance between these three axes (compute power, memory bandwidth, and interconnect bandwidth) to achieve the best performance for Meta’s workloads. This is an exciting journey, and we’re just getting started."

A great find. Thanks.

Daniel Nenni · May 18, 2023

Meta, Google, Microsoft and Amazon are all at TSMC. N5 in progress and N3 coming up. Fast times at TSMC.

MTIA v1: Meta’s first-generation AI inference accelerator

In 2020, we initiated the Meta Training and Inference Accelerator (MTIA) family of chips to support our evolving AI workloads, starting with an inference accelerator ASIC for deep learning recommendation models (DLRMs).

ai.facebook.com

hist78 · May 18, 2023

Daniel Nenni said:
Meta, Google, Microsoft and Amazon are all at TSMC. N5 in progress and N3 coming up. Fast times at TSMC.

MTIA v1: Meta’s first-generation AI inference accelerator

In 2020, we initiated the Meta Training and Inference Accelerator (MTIA) family of chips to support our evolving AI workloads, starting with an inference accelerator ASIC for deep learning recommendation models (DLRMs).

ai.facebook.com

You can add Ampere Computing/Oracle at TSMC to the list.

This growing list of big companies with in-house chip products is a serious problem for Intel now and in the coming years.

Barnsley · May 18, 2023

What does the chip do?

Get folk ads faster on their feed?

blueone · May 18, 2023

hist78 said:
You can add Ampere Computing/Oracle at TSMC to the list.

This growing list of big companies with in-house chip products is a serious problem for Intel now and in the coming years.

A while ago I predicted Ampere would get acquired by a cloud company, but I was figuring it would be Microsoft for Azure. I know Oracle is an Ampere investor, but Oracle's cloud business probably isn't big enough to support Ampere. And a lot of Oracle's cloud is still running on SPARC.

AMD is at risk too, but probably not as much as Intel.

blueone · May 18, 2023

Barnsley said:
What does the chip do?

Get folk ads faster on their feed?

Only Meta knows that for now.

Daniel Nenni · May 18, 2023

Meta Discloses Its Second Custom Processor, And This Should Interest Investors

#1-Ranked Industry Analyst Patrick Moorhead dives into Meta's a "full-stack" infrastructure approach to silicon.

www.forbes.com

Daniel Nenni · May 18, 2023

Ampere Computing Unveils New AmpereOne Processor Family with 192 Custom Cores

/PRNewswire/ -- Ampere® Computing today announced a new AmpereOne™ Family of processors with up to 192 single threaded Ampere cores – the highest core count in...

www.prnewswire.com

Maxim · May 18, 2023

Barnsley said:
What does the chip do?

Get folk ads faster on their feed?

You can read about that in the cited article:

"Meta describes the chip as being tuned for one particular type of AI program: deep learning recommendation models. These are programs that can look at a pattern of activity, such as clicking on posts on a social network, and predict related, possibly relevant material to recommend to the user. "

Barnsley · May 19, 2023

Maxim said:
You can read about that in the cited article:

"Meta describes the chip as being tuned for one particular type of AI program: deep learning recommendation models. These are programs that can look at a pattern of activity, such as clicking on posts on a social network, and predict related, possibly relevant material to recommend to the user. "

This functionality got anything useful to do?

blueone · May 19, 2023

Barnsley said:
This functionality got anything useful to do?

Yes, for PyTorch applications. Your question seems to what the specific Meta applications are, and Meta hasn’t said from what I’ve seen.

blueone · May 19, 2023

Another article with some additional information. Be wary of TP Morgan's architectural analyses, he likes to think he is much more technical than he really is.

Meta Platforms Crafts Homegrown AI Inference Chip, AI Training Next

As we pointed out a year ago when some key silicon experts were hired from Intel and Broadcom to come work for Meta Platforms, the company formerly known

www.nextplatform.com

hist78 · May 19, 2023

This a arms race.

Microsoft seeks electrical engineers for custom DC chips

Redmond see, Redmond do... what AWS and Google are also doing

www.theregister.com

blueone · May 19, 2023

hist78 said:
This is an arms race.

Microsoft seeks electrical engineers for custom DC chips

Redmond see, Redmond do... what AWS and Google are also doing

www.theregister.com

Agreed. And Microsoft is behind Amazon, Google, and Meta in chip development. I think they know it and are desperate, otherwise they would not have acquired Fungible, which had the least successful of the startup DPU projects in the industry. Several people I know think they acquired Fungible just for the chip design team. But as one would expect, that wasn't enough. I still can't believe AMD paid $1.9B for Pensando, which was better than Fungible, but the value is difficult to reconcile with the price. Nvidia still has far and away the best DPU with Bluefield 3. Although the proprietary nature of all of these DPUs makes me wonder if there is a sustainable market for them.

Maxim · May 19, 2023

Barnsley said:
This functionality got anything useful to do?

I do not think so.

But the whole world thinks differently.

Daniel Nenni · May 19, 2023

blueone said:
Agreed. And Microsoft is behind Amazon, Google, and Meta in chip development. I think they know it and are desperate, otherwise they would not have acquired Fungible, which had the least successful of the startup DPU projects in the industry. Several people I know think they acquired Fungible just for the chip design team. But as one would expect, that wasn't enough. I still can't believe AMD paid $1.9B for Pensando, which was better than Fungible, but the value is difficult to reconcile with the price. Nvidia still has far and away the best DPU with Bluefield 3. Although the proprietary nature of all of these DPUs makes me wonder if there is a sustainable market for them.

Microsoft has a very large chip team so I would not count them out. The difference I see is that companies like Amazon and Google are spending large sums for chip design while others are following the fabless model of cutting budgets/corners to protect chip margins.

Meta's first bespoke AI chips

Well-known member

Well-known member

Well-known member

Well-known member

Admin

Well-known member

Well-known member

Well-known member

Well-known member

Admin

Admin

Member

Well-known member

Well-known member

Well-known member

Well-known member

Well-known member

Member

Admin