Gorodenkoff - stock.adobe.com

News

AMD Instinct MI300 AI accelerator takes aim at Nvidia GPUs

Data center-grade GPUs and accelerators for enterprise customers and cloud vendors are the new battleground for AI hardware. AMD and Google advance the race with new chips.

Don Fluckinger

By

Don Fluckinger, Senior News Writer

Published: 06 Dec 2023

Both AMD and Google released AI accelerators today: AMD Instinct MI300 and Google TPU v5e. Both are data center-grade processors that speed AI tasks, such as training large language models.

AMD is playing catch-up to Nvidia, which has parleyed its gaming tech expertise into an AI processing superpower. AI typically runs on chips adjacent to CPUs; AMD's accelerator is a GPU, while Google's is a proprietary tensor processing unit (TPU) that powers AI in the Google Cloud.

What do the 153 billion transistors in AMD's MI300 accelerator -- and its claimed 17TB/second bandwidth -- get enterprise IT buyers? The Instinct MI300 chips run AI operations much faster, AMD CEO Lisa Su said at a launch event.

AMD customers and partners there, including Dell, HPE, Microsoft, Meta, Oracle, Databricks and others, said they had the chips either running in their products and services, are testing them, or plan to use them soon. Not only are the chips faster than their predecessors, but they can be combined to further improve performance.

"Generative AI is the most demanding data center workload ever," Su said. "It requires tens of thousands of accelerators to train and refine models with billions of parameters. And that same infrastructure is also needed to answer the millions of queries from everyone around the world.

A graphic representation of the AMD Instinct MI300 GPU Accelerator. — AMD Instinct MI300 GPU Accelerator.

"It's very simple: The more compute you have, the more capable the model, the faster the answers are generated. And the GPU is at the center of this generative AI world," she said.

The hardware upon which AI accelerators run has become a key feature of AI accelerators, said Daniel Newman, Futurum Research founder. It's not just speeds and feeds anymore but open source platforms that let developers build software and connect their large language models to the hardware.

"Today is all about AMD entering with valid, competitive capabilities and products using open source in the era of an incredibly strong or even dominant Nvidia in the AI training [chip] and overall AI chip," said Daniel Newman, Futurum Research founder. "It isn't just about performance. It is also about availability, viability, capability, and the world understanding that open-source collaborative ecosystems for AI are important."

Enterprise AI buyers, take note

Many companies still field their own GPUs in their data centers or colocations -- even in the cloud-first era -- Gartner analyst Chirag Dekate said. Data privacy regulations or the need for intellectual property protection force companies to take a hybrid approach that mixes their own data centers and public clouds such as Google, AWS and Microsoft.

In some cases, an enterprise might run its proprietary LLM in its own data center to keep it off a public cloud.

The AMD GPU accelerators will be adopted not only by large public clouds but also by individual enterprise customers, Dekate predicted. The combination of hardware, software and partnerships will help those customers set up their AI operations faster.

"What AMD is announcing today is not just a GPU that can be deployed in the data center," Dekate said. "They're also announcing cloud partnerships. They're announcing platforms and software stacks. [Together they will] enable enterprises to hit the ground running with an AMD-native strategy."

Google delivers new AI accelerators

Amid its Gemini general AI model release and unveiling of plans to be the first manufacturer to put generative AI on smartphones, Google also released the TPU v5e, its latest AI accelerator. TPUs power Google's own AI in apps such as Maps, YouTube and Gmail, and it hopes Google Cloud Platform customers will follow suit.

In the future, it's likely that enterprise cloud services buyers will have different AI services powered by different manufacturers' chips, Dekate said. Some enterprise applications and operations will work best -- or cheapest -- on one chipmaker's array compared to the others. It will depend on the scale and bandwidth required for a job, such as training a large enterprise language model.

Competition will be the key to keeping AI chips viable and to keep advancements moving in the AI hardware race as each manufacturer tries to outdo the others, Newman said.

"Ultimately we need a highly competitive marketplace for AI infrastructure, chipsets, software, and more," Newman said. "[Generative AI represents] the biggest transformation our world has seen technologically, and a healthy, vibrant, competitive ecosystem is critical."

Don Fluckinger covers digital experience management, end-user computing, CPUs and assorted other topics for TechTarget Editorial. Got a tip? Email him here.

Dig Deeper on Data center hardware and strategy

SearchWindows Server

Configure domain controllers after Server 2025 upgrade
Windows Server 2025 has many new features, but how can you get the most from them? Use this tutorial to configure AD domain ...
Microsoft Applied Skills program puts expertise to the test
Microsoft's Applied Skills help IT pros validate hands-on technical expertise and real-world skills. But what sets these ...
Understand the basics of Microsoft hybrid identity
Microsoft hybrid identity combines on-premises AD resources and cloud-based Entra ID capabilities to create a seamless access ...

Search Cloud Computing

Cloud infrastructure suffers AI growing pains
Will $5 trillion in AI infrastructure investment be enough? Cloud providers facing that question must also yield a return, ...
8 reasons why IT leaders are embracing cloud repatriation
As IT leaders aggressively re-allocate capital to fund new AI initiatives, repatriation offers both savings and greater control, ...
Microsoft Maia 200 AI chip could boost cloud GPU supply
Industry watchers predict ancillary effects for enterprise cloud buyers from Microsoft's AI accelerator launch this week, from ...

Search Storage

AI, flash highlighted in 2025 data storage conference lineup
The 2025 storage conference calendar featured shows where vendors released major product updates and experts discussed top trends...
How Chiplets will Accelerate Storage
Chiplets are a newer approach to chips in processors, where a smaller collection of chips is packaged together to emulate a ...
Choosing from a universe of SSD form factors
There are a number of different types of SSD storage form factors to choose from. Learn more about each different type, along ...

Sustainability
and ESG

Build a comprehensive supply chain traceability checklist
Start a supply chain traceability journey with this comprehensive checklist to drive efficiency, improve risk management, ...
The CIO's guide to equitable emerging tech
CIOs must prioritize equity when adopting new technologies to prevent harm, improve accessibility and make sure the technology ...
Offshore wind project suspensions pose challenges for CIOs
Trump administration offshore wind suspensions disrupt data center clean energy supply, raise power costs, threaten grid ...

Close