Do Amazon's chips have any fundamental improvements that AMD, Nvidia, Intel, or Qualcomm don't already offer, or is this more of "we have chips available to buy"?
OK, here are the claimed benefits as summarized by Perplexity, with the associated caveats below:
Amazon’s in-house chips are mainly claimed to excel on price-performance, throughput, latency, memory bandwidth, and energy efficiency versus general-purpose GPUs. The strongest claims are for AWS Inferentia and Trainium: lower inference cost, higher throughput, and better performance per watt for AI workloads.[247wallst +1]
Claimed strengths (mostly comparisons against earlier Amazon chips; no direct references to merchant-market competitors)
• Lower cost for AI inference and training. AWS says Inferentia2 delivers up to 4x higher throughput and up to 10x lower latency than first-generation Inferentia, while Inf1 instances can provide up to 70% lower cost per inference than comparable EC2 instances (see the cost arithmetic sketch after this list).[aws.amazon]
• Better price-performance than GPUs. Amazon has said its Trainium-based systems can cut AI training and inference costs by up to half versus comparable GPU setups, framing custom silicon as a cost-leadership play.[247wallst]
• Higher throughput and memory capacity. AWS says Inferentia2 has 32 GB of HBM per chip, 4x the memory of Inferentia, and 10x the memory bandwidth, which helps with larger and more complex models.[aws.amazon]
• Scale-out inference support. Inf2 instances are described as the first inference-optimized EC2 instances with ultra-high-speed chip-to-chip connectivity for distributed inference.[aws.amazon]
• Framework compatibility. AWS says Neuron integrates natively with PyTorch and TensorFlow, so customers can use existing workflows with fewer code changes (a minimal tracing sketch also follows the list).[aws.amazon]
• Energy efficiency. AWS claims Inf2 instances offer up to 50% better performance per watt than comparable EC2 instances.[aws.amazon]
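To make the cost claims concrete, here is the back-of-envelope arithmetic behind a figure like "up to 70% lower cost per inference": cost per inference is just the hourly instance price divided by sustained throughput. The prices and throughputs below are hypothetical placeholders, not quoted AWS or GPU rates; only the formula is the point.

```python
# Cost-per-inference arithmetic behind claims like "up to 70% lower cost
# per inference". All prices and throughputs are hypothetical placeholders,
# not quoted AWS rates.
def cost_per_million_inferences(hourly_price_usd: float, throughput_per_sec: float) -> float:
    """USD to serve one million inferences at a sustained throughput."""
    inferences_per_hour = throughput_per_sec * 3600
    return hourly_price_usd / inferences_per_hour * 1_000_000

gpu_cost = cost_per_million_inferences(hourly_price_usd=4.00, throughput_per_sec=2000)
inf2_cost = cost_per_million_inferences(hourly_price_usd=2.00, throughput_per_sec=3000)

print(f"GPU:  ${gpu_cost:.2f} per 1M inferences")   # $0.56 with these made-up inputs
print(f"Inf2: ${inf2_cost:.2f} per 1M inferences")  # $0.19
print(f"Savings: {1 - inf2_cost / gpu_cost:.0%}")   # ~67%
```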
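On the framework-compatibility point, this is roughly what the Neuron flow looks like from PyTorch, assuming an Inf2/Trn1 instance with the torch-neuronx package installed. The tiny model and shapes are illustrative placeholders, not a benchmark setup.

```python
# Sketch of the AWS Neuron PyTorch flow, assuming an Inf2/Trn1 instance
# with the Neuron SDK (torch-neuronx) installed.
import torch
import torch_neuronx

# Placeholder model: any traceable torch.nn.Module works the same way.
model = torch.nn.Sequential(
    torch.nn.Linear(128, 64),
    torch.nn.ReLU(),
    torch.nn.Linear(64, 10),
).eval()
example = torch.rand(1, 128)  # fixed example input for ahead-of-time tracing

# torch_neuronx.trace compiles the model for NeuronCores; the result is a
# regular TorchScript module, so the rest looks like ordinary PyTorch.
neuron_model = torch_neuronx.trace(model, example)
torch.jit.save(neuron_model, "model_neuron.pt")

loaded = torch.jit.load("model_neuron.pt")
with torch.no_grad():
    print(loaded(example).shape)  # torch.Size([1, 10])
```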
Important caveat
These are Amazon’s own claims, and some outside reporting says the chips still trail Nvidia in certain areas, especially latency and software maturity. So the core technical story is not “best overall chip,” but rather “optimized for AWS workloads at lower cost, with improving performance generation by generation”.
Real Benchmark Results
I’m personally waiting to see how Amazon Trainium does on InferenceMAX, SemiAnalysis’s data-center-scale inference benchmark, to see whether any of these claims hold up in a third-party assessment. Per a couple of sources, Amazon has signed up to do the InferenceMAX work with SemiAnalysis but hasn’t produced any real results yet against the existing NVIDIA and AMD comparisons. The longer it takes, the less likely it is that they have a competitive product at the rack level.
“The Artist Known as InferenceMAX” (GB300 NVL72, MI355X, B200, H100, Disaggregated Serving, Wide Expert Parallelism, Large Mixture of Experts, SGLang, vLLM, TRTLLM), newsletter.semianalysis.com
That Amazon has paired with Cerebras on disaggregated inference also tells me that their chips aren't particularly good at decode compared to other new solutions: in a prefill/decode split, decode is the memory-bandwidth-bound, latency-critical stage, so handing it to Cerebras suggests Trainium is being kept on the prefill side.
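For context on why that pairing is telling, here is a toy sketch of prefill/decode disaggregation under stated assumptions: prefill (compute-bound prompt processing) runs on one hardware pool, decode (memory-bandwidth-bound, one token at a time) runs on another, and the KV cache is handed off between them. Everything below is a simplified illustration; real systems ship KV caches over a network fabric, and the pool labels are my reading of the Amazon/Cerebras split, not a confirmed architecture.

```python
# Toy model of prefill/decode disaggregated serving. The pool comments
# reflect an assumption about the Amazon/Cerebras division of labor,
# not a confirmed deployment.
from dataclasses import dataclass

@dataclass
class KVCache:
    prompt: str
    state: list            # stand-in for per-layer key/value tensors

def prefill(prompt: str) -> KVCache:
    # Prefill processes the whole prompt in parallel: compute-bound,
    # a reasonable fit for a throughput-oriented chip (Trainium, here).
    return KVCache(prompt=prompt, state=prompt.split())

def decode(cache: KVCache, max_tokens: int) -> str:
    # Decode emits one token per step and rereads the growing KV cache
    # each time, so it is bound by memory bandwidth and latency: the
    # stage a decode-specialized chip (Cerebras, here) would take over.
    tokens = []
    for i in range(max_tokens):
        tokens.append(f"tok{i}")        # placeholder "next token"
        cache.state.append(tokens[-1])  # KV cache grows every step
    return " ".join(tokens)

cache = prefill("why pair trainium with cerebras")  # pool A: prefill
print(decode(cache, max_tokens=5))                  # pool B: decode
```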