AMD Instinct MI300X

Arch: CDNA 3 | Production Ready | TDP: 750 W
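Compute Performance (FP8)
2.6 PetaFLOPS peak dense FP8 matrix performance across 304 compute units (5.2 PetaFLOPS with structured sparsity)
About 1.3x vs H100 dense FP8
Peak matrix performance by accelerator:
MI300X (CDNA 3): 2.6 PF dense FP8 (5.2 PF with sparsity)
H100 (Hopper): 2.0 PF dense FP8
MI250X (CDNA 2): 0.4 PF FP16 (no FP8 support)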
Memory System
192 GB HBM3
8 HBM3 stacks / 8192-bit interface
5.3 TB/s peak bandwidth
Interconnect & I/O
896 GB/s Infinity Fabric
7 xGMI links, bi-directional
PCIe Gen 5.0 x16
Real-World Applications
Large Language Model Inference

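The MI300X's 192 GB of HBM3 lets a single GPU hold a 70B-parameter model at 16-bit precision (about 140 GB of weights) without tensor parallelism. Combined with 5.3 TB/s of peak memory bandwidth, this suits memory-bound LLM serving, where decode throughput is limited by how quickly weights can be streamed from memory.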
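As a rough back-of-the-envelope sketch of the sizing argument above: weight memory is parameter count times bytes per parameter, and in memory-bound decoding a single sequence can produce at most one token per full pass over the weights. The helper names and the 16-bit and 8-bit precisions are illustrative assumptions. KV cache, activations, and runtime overhead are ignored, so real headroom and throughput will be lower.

```python
# Back-of-the-envelope sizing; all helper names are hypothetical.
HBM_CAPACITY_GB = 192        # memory capacity from the spec sheet
HBM_BANDWIDTH_GB_S = 5300    # 5.3 TB/s peak bandwidth, expressed in GB/s


def weight_footprint_gb(params_billion: float, bytes_per_param: float) -> float:
    """Weight memory in GB (1 GB = 1e9 bytes); ignores KV cache and activations."""
    return params_billion * bytes_per_param


def fits_on_one_gpu(params_billion: float, bytes_per_param: float) -> bool:
    """True if the weights alone fit in a single GPU's HBM."""
    return weight_footprint_gb(params_billion, bytes_per_param) <= HBM_CAPACITY_GB


def decode_tokens_per_s_upper_bound(params_billion: float, bytes_per_param: float) -> float:
    """Bandwidth-bound ceiling for one sequence: each token reads all weights once."""
    return HBM_BANDWIDTH_GB_S / weight_footprint_gb(params_billion, bytes_per_param)


for label, params, nbytes in [("70B @ 16-bit", 70, 2), ("70B @ 8-bit", 70, 1)]:
    size = weight_footprint_gb(params, nbytes)
    ceiling = decode_tokens_per_s_upper_bound(params, nbytes)
    print(f"{label}: {size:.0f} GB, fits={fits_on_one_gpu(params, nbytes)}, "
          f"<= {ceiling:.1f} tokens/s per sequence")
```

The ceiling is a theoretical bound for a single sequence at batch size 1. Batching amortizes each weight read across many sequences, which is why aggregate serving throughput can be far higher than this per-sequence figure.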

Open-Source AI Training

Supported by ROCm and the PyTorch ecosystem, the MI300X is a strong option for organizations training models with open-source frameworks. It has been adopted for training runs of Llama, Mistral, and other open models.

Scientific HPC Workloads

Building on the MI250X that powers the Frontier supercomputer, the MI300X handles traditional HPC workloads such as climate simulation, molecular dynamics, and computational physics, with native FP64 support and 5.3 TB/s of memory bandwidth.

Multi-Model Serving

With 192 GB of memory, a single MI300X can host multiple AI models at once, for example a routing model, an embedding model, and several LLMs on one GPU. This consolidation can reduce infrastructure costs for AI platforms by cutting the number of accelerators needed.
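A minimal sketch of the capacity check behind this kind of consolidation, assuming weights-only footprints plus a fixed reserve for KV cache and runtime overhead. The model names, sizes, and the 40 GB reserve are made-up illustrative values, not measurements, and `plan_fits` is a hypothetical helper.

```python
# Hypothetical capacity check for co-locating several models on one 192 GB GPU.
HBM_CAPACITY_GB = 192


def plan_fits(models_gb: dict[str, float], reserve_gb: float = 0.0) -> tuple[bool, float]:
    """Return (fits, remaining_gb) after summing model footprints plus a reserve."""
    used = sum(models_gb.values()) + reserve_gb
    return used <= HBM_CAPACITY_GB, HBM_CAPACITY_GB - used


# Illustrative footprints (assumed, not measured): weights only, in GB.
example_plan = {
    "router (1B @ 16-bit)": 2.0,
    "embedder (0.5B @ 16-bit)": 1.0,
    "llm-a (8B @ 16-bit)": 16.0,
    "llm-b (70B @ 8-bit)": 70.0,
}

# Reserve headroom for KV cache, activations, and allocator overhead.
fits, remaining = plan_fits(example_plan, reserve_gb=40.0)
print(f"fits={fits}, remaining={remaining:.1f} GB")
```

In practice the reserve would be sized from expected batch sizes and context lengths, since KV cache grows with both.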

Full Technical Specifications
GPU Architecture: AMD CDNA 3
Process Node: TSMC 5nm / 6nm (3D chiplet)
Compute Units: 304
Stream Processors: 19,456
Matrix Cores: 1,216 (AI Accelerators)
Memory Capacity: 192 GB HBM3
Memory Interface: 8192-bit
Memory Bandwidth: 5.3 TB/s
Infinity Cache: 256 MB
Form Factor: OAM (OCP Accelerator Module)
Thermal Design Power: 750 W
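The figures listed on this page can be cross-checked against each other with simple arithmetic. The sketch below derives per-compute-unit, per-stack, per-pin, and per-link values using only numbers stated above; the dictionary keys and the `derived` helper are hypothetical names chosen for illustration.

```python
# Cross-check derived quantities from the figures listed on this page.
SPEC = {
    "compute_units": 304,
    "stream_processors": 19_456,
    "matrix_cores": 1_216,
    "memory_capacity_gb": 192,
    "memory_interface_bits": 8192,
    "memory_bandwidth_tb_s": 5.3,
    "hbm_stacks": 8,
    "infinity_fabric_gb_s": 896,
    "xgmi_links": 7,
}


def derived(spec: dict) -> dict:
    """Compute per-unit figures implied by the headline specifications."""
    cu = spec["compute_units"]
    stacks = spec["hbm_stacks"]
    # Peak bandwidth in bits/s divided by the number of data pins, in Gbit/s.
    bits_per_s = spec["memory_bandwidth_tb_s"] * 1e12 * 8
    return {
        "stream_processors_per_cu": spec["stream_processors"] // cu,
        "matrix_cores_per_cu": spec["matrix_cores"] // cu,
        "interface_bits_per_stack": spec["memory_interface_bits"] // stacks,
        "capacity_gb_per_stack": spec["memory_capacity_gb"] // stacks,
        "gbit_s_per_pin": bits_per_s / spec["memory_interface_bits"] / 1e9,
        "gb_s_per_xgmi_link": spec["infinity_fabric_gb_s"] // spec["xgmi_links"],
    }


for key, value in derived(SPEC).items():
    print(f"{key}: {value}")
```

The integer divisions come out exact for these figures, which is a quick sanity check that the headline numbers are mutually consistent.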