Capture the full potential of IT

AMD Instinct MI300A blends GPU, CPU for super-speedy AI/HPC

Featured content

AMD Instinct MI300A blends GPU, CPU for super-speedy AI/HPC

CPU or GPU for AI and HPC? You can get the best of both with the AMD Instinct MI300A.

Applications:
Featured Technologies:

The AMD Instinct MI300A is the world’s first data center accelerated processing unit for high-performance computing and AI. It does this by integrating both CPU and GPU cores on a single package.

That makes the AMD Instinct MI300A highly efficient at running both HPC and AI workloads. It also makes the MI300A powerful enough to accelerate training the latest AI models.

Introduced about a year ago, the AMD Instinct MI300A accelerator is shipping soon. So are two Supermicro servers—one a liquid-cooled 2U system, the other an air-cooled 4U—each powered by four MI300A units.

Under the Hood

The technology of the AMD Instinct MI300A is impressive. Each MI300A integrates 24 AMD ‘Zen 4’ x86 CPU cores with 228 AMD CDNA 3 high-throughput GPU compute units.

You also get 128GB of unified HBM3 memory. This presents a single shared address space to CPU and GPU, all of which are interconnected into the coherent 4th Gen AMD Infinity architecture.

Also, the AMD Instinct MI300A is designed to be used in a multi-unit configuration. This means you can connect up to four of them in a single server.

To make this work, each APU has 1 TB/sec. of bidirectional connectivity through eight 128 GB/sec. AMD Infinity Fabric interfaces. Four of the interfaces are dedicated Infinity Fabric links. The other four can be flexibly assigned to deliver either Infinity Fabric or PCIe Gen 5 connectivity.

In a typical four-APU configuration, six interfaces are dedicated to inter-GPU Infinity Fabric connectivity. That supplies a total of 384 GB/sec. of peer-to-peer connectivity per APU. One interface is assigned to support x16 PCIe Gen 5 connectivity to external I/O devices. In addition, each MI300A includes two x4 interfaces to storage, such as M.2 boot drives, plus two USB Gen 2 or 3 interfaces.

Converged Computing

There’s more. The AMD Instinct MI300A was designed to handle today’s convergence of HPC and AI applications at scale.

To meet the increasing demands of AI applications, the APU is optimized for widely used data types. These include FP64, FP32, FP16, BF16, TF32, FP8 and INT8.

The MI300A also supports native hardware sparsity for efficiently gathering data from sparse matrices. This saves power and compute cycles, and it also lowers memory use.

Another element of the design aims at high efficiency by eliminating time-consuming data copy operations. The MI300A can easily offload tasks easily between the CPU and GPU. And it’s all supported by AMD’s ROCm 6 open software platform, built for HPC, AI and machine learning workloads.

Finally, virtualized environments are supported on the MI300A through SR-IOV to share resources with up to three partitions per APU. SR-IOV—short for single-root, input/output virtualization—is an extension of the PCIe spec. It allows a device to separate access to its resources among various PCIe functions. The goal: improved manageability and performance.

Fun fact: The AMD Instinct MI300A is a key design component of the El Capitan supercomputer recently dedicated by Lawrence Livermore Labs. This system can process over two quintillion (10¹⁸) calculations per second.

Supermicro Servers

As mentioned above, Supermicro now offers two server systems based on the AMD Instinct MI300A APU. They’re 2U and 4U systems.

These servers both take advantage of AMD’s integration features by combining four MI300A units in a single system. That gives you a total of 912 GPUs, 96 CPUs, and 512GB of HBM3 memory.

Supermicro says these systems can push HPC processing to Exascale levels, meaning they’re very, very fast. “Flop” is short for floating point operations per second, and “exa” indicates a 1 with 18 zeros after it. That’s fast.

Supermicro’s 2U server (model number AS -2145GH-TNMR-LCC) is liquid-cooled and aimed at HPC workloads. Supermicro says direct-to-chip liquid-cooling technology enables a nice TCO with over 51% data center energy cost savings. The company also cites a 70% reduction in fan power usage, compared with air-cooled solutions.

If you’re looking for big HPC horsepower, Supermicro’s got your back with this 2U system. The company’s rack-scale integration is optimized with dual AIOM (advanced I/O modules) and 400G networking. This means you can create a high-density supercomputing cluster with as many as 21 of Supermicro’s 2U systems in a 48U rack. With each system combining four MI300A units, that would give you a total of 84 APUs.

The other Supermicro server (model number AS -4145GH-TNMR) is an air-cooled 4U system, also equipped with four AMD Instinct MI300A accelerators, and it’s intended for converged HPC-AI workloads. The system’s mechanical airflow design keeps thermal throttling at bay; if that’s not enough, the system also has 10 heavy-duty 80mm fans.

Do More:

Learn more: AMD Instinct MI300A

Download a data sheet: AMD Instinct MI300

Check out Supermicro servers powered by AMD Instinct accelerators

CPU and GPU for AI? Learn why from the recent Tech Explainer

Read an AMD ROCm blog post: MI300A — Exploring the APU advantage

Featured videos

Events

IDC CIO Summit

Riyadh, Saudi Arabia; Sept. 17–18, 2025

Architecting an AI-fueled business

Learn more >

Computerworld Cloud & AI Festival

Copenhagen, DE; Sept. 17-18, 2025

Join 2,400+ IT pros to learn about infrastructure, security & more

Learn more >

JURES 25

Pamplona, Spain; Sept. 18-19, 2025

Spain's supercomputing network user conference

Learn more >

Find AMD & Supermicro Elsewhere

Research Roundup: AI edition

Featured content

Research Roundup: AI edition

Catch up on the latest AI trends spotted by leading IT market watchers.

Applications:

Spending on artificial intelligence infrastructure is exploding. So is spending on AI for supply chains. But disappointing results on early GenAI tests is causing some CIOs to worry about the ROI.

That’s some of the latest intelligence from leading IT market watchers and researchers. And here’s your research roundup.

AI Infrastructure: $100B and Beyond

Behind every AI implementation is the need for high-end infrastructure. And spending on this type of equipment is expected to grow rapidly.

Market watcher IDC predicts that global spending on AI infrastructure will exceed $100 billion by 2028. Last year this spending totaled roughly $70 billion.

The AI infrastructure market has enjoyed double-digit growth for the last four and a half years, driven primarily by investments in servers, IDC says. In the first half of 2024, servers accounted for nearly 90% of all AI infrastructure spending.

Covered in IDC’s definition of AI infrastructure are servers and storage used for AI platforms, AI and AI-enabled applications, and AI applications development & deployment software.

AI Servers: $200B

That could be only the tip of the iceberg. Gartner researchers now predict worldwide spending on AI-optimized servers will top $200 billion this year. They also say that’s more than double what’s expected to be spent on more traditional servers.

About 70% of that $200 billion will be spent not by end users, but instead by big IT services companies and hyperscalers, Gartner expects. By 2028, the hyperscalers—large cloud providers including AWS, Google Cloud and Microsoft Azure—will operate AI-optimized servers collectively worth about $1 trillion.

Worth noting: This AI spending is part of an even bigger trend. Gartner predicts overall IT spending will rise this year by nearly 10%, reaching a global total of $5.6 trillion.

AI for Supply Chain: Huge

The use of AI in supply chain management is growing at a super-fast compound annual growth rate (CAGR) of 30%. This spending will jump from $3.5 billion in 2023 to $22.7 billion by 2030, according to a new forecast from ResearchAndMarkets.

Supply chain health became a major concern during the pandemic. Now companies realize they need supply chains that are resilient, adaptable and efficient. And AI can help.

The fastest-growing supply chain sector for AI is expected to be forecasting. There, AI can be used to predict future demand for various products. These forecasts can then be used by manufacturers and their partners to optimize inventories and production plans.

GenAI: Where’s the Value?

This year, Generative AI will fail to create its expected value, predicts ABI Research.

Many GenAI proof-of-concept trials have been disappointing, with failure rates as high as 80% to 90%, ABI says. This is seriously cooling some red-hot expectations.

As a result, some enterprise CIOs will turn away from GenAI. Instead, ABI expects, they’ll adopt more traditional AI approaches that solve business problems and deliver a clearer ROI.

ABI’s jaundiced view of GenAI gets some support from Gartner. In its 2025 IT market forecast, Gartner says GenAI is sliding toward the “trough of disillusionment.”

That phrase comes from Gartner’s Hype Cycle. It states that most innovations progress through a pattern of over-enthusiasm and disillusionment, followed by eventual productivity.

While businesses may still be searching for GenAI’s ROI, a growing number of teens are certainly finding it. About one in four U.S. teens (26%) used ChatGPT for schoolwork last year, according to a new Pew Research Center survey. That’s double the percentage of teens who did so in 2023.

AI’s New Mandate: Trust

A much more positive view comes from Accenture’s 25th annual technology vision report. The consulting firm's report says AI is accelerating across enterprises faster than any other prior technology.

What’s more, nearly 70% of executives polled by Accenture said they believe AI brings new urgency to re-invention and how tech systems and processes are designed, built and run.

An even bigger group, 80% of those polled, told Accenture that natural language processing (NLP) will increase collaboration between humans and robots.

One possible barrier to AI progress is the matter of trust. More than 75% of the executives polled by Accenture believe AI’s true benefits must be built on a foundation of trust.

Accenture CEO Julie Sweet agrees. “Unlocking the benefits of AI,” she says, “will only be possible if leaders seize the opportunity to inject and develop trust in its performance and outcomes.”

Featured videos

Events