AWS may be able to reduce reliance on Nvidia by developing its own AI infrastructure.
Matt Garman, the CEO of Amazon Web Services, made several significant new AWS announcements at the re:Invent conference on Tuesday.
His two-and-a-half hour keynote delved into AWS's current software and hardware offerings and updates, with words from clients including Apple and JPMorgan. Graphics processing units (GPUs), supercomputers, and a surprise Apple cameo stuck out among the slew of information.
AWS, the cloud computing arm of Amazon, has been developing its own semiconductors to train AI. On Tuesday, Garman said it's creating UltraServers -- containing 64 of its Trainium 2 chips -- so companies can scale up their GenAI workloads.
Moreover, it's also building an AI supercomputer, an UltraCluster made up of UltraServers, in partnership with AI startup Anthropic. Named Project Rainier, it will be "the world's largest AI compute cluster reported to date available for Anthropic to build and deploy its future models on" when completed, according to an Amazon blog post. Amazon has invested $8 billion in Anthropic.
Such strides could push AWS further into competition with other tech firms in the ongoing AI arms race, including AI chip giant Nvidia.
Here are four takeaways from Garman's full keynote on Tuesday.
Nvidia currently dominates the AI chip market with its sought-after and pricey GPUs, but Garman backed AWS's homegrown silicon during his keynote on Tuesday. His company's goal is to reduce the cost of AI, he said.
"Today, there's really only one choice on the GPU side, and it's just Nvidia. We think that customers would appreciate having multiple choices," Garman told the Wall Street Journal.
AI is growing rapidly, and the demand for chips that make the technology possible is poised to grow alongside it. Major tech companies, like Google and Microsoft, are venturing into chip creation as well to find an alternative to Nvidia.
However, Garman told The Journal the doesn't expect Trainium to dethrone Nvidia "for a long time."
"But, hopefully, Trainium can carve out a good niche where I actually think it's going to be a great option for many workloads -- not all workloads," he said.
AWS also introduced Trainium3, its next-gen chip.
According to The Journal, the chip cluster known as Project Rainier is expected to be available in 2025. Once it is ready, Anthropic plans to use it to train AI models.
With "hundreds of thousands" of Trainium chips, it would challenge Elon Musk's xAI's Colossus -- a supercomputer with 100,000 of Nvidia's Hopper chips.
Garman said that Apple is one of its customers using AWS chips, like Amazon Graviton and Inferentia, for services including Siri.
Benoit Dupin, senior director of AI and machine learning at Apple, then took to the stage at the Las Vegas conference. He said the company worked with AWS for "virtually all phases" of its AI and machine learning life cycle.
"One of the unique elements of Apple business is the scale at which we operate and the speed with which we innovate," Dupin said.
He added, "AWS has been able to keep the pace, and we've been customers for more than a decade."
Now, Dupin said Apple is in the early stages of testing Trainium 2 chips to potentially help train Apple Intelligence.
Amazon announced some new kids on the GenAI block.
AWS customers will be able to use Amazon Nova-powered GenAI applications "to understand videos, charts, and documents, or generate videos and other multimedia content," Amazon said. There are a range of models available at different costs, it said.