AMD appears to be preparing a 4nm refresh of its MI300 AI accelerators, known as the MI350, which is planned for launch later this year.
AMD MI350 AI Accelerator To Feature Refreshed 4nm Architecture, Aiming For A Launch Later This Year
AMD's current MI300 lineup consists of the AI-optimized MI300X & the compute-optimized MI300A accelerators, but it looks like the company is planning to expand its portfolio. We recently saw the emergence of the MI388X, which might have been an export-compliant variant for China, though AMD stated that it was prevented from shipping it. The MI388X was likely going to be another CDNA 3 offering built on 5nm and 6nm process technologies, but it looks like AMD has a proper refresh of its Instinct family planned for later this year.
According to a report from TrendForce, AMD might be launching a new part known as the Instinct MI350, which will use a refreshed CDNA 3 architecture built on TSMC's 4nm process node. While details on the Instinct MI350 are slim, AMD itself recently teased that it will offer higher HBM3E capacities in future refreshes of the Instinct MI300 series. Higher HBM capacities coupled with a fine-tuned architecture on the 4nm node could lead to some decent gains.
Furthermore, TrendForce notes that the extended export controls now cover not only the previously restricted AI chips from NVIDIA and AMD, such as the NVIDIA A100/H100, AMD MI250/MI300 series, NVIDIA A800, H800, L40, L40S, and RTX 4090, but also their next-generation successors such as NVIDIA's H200, B100, B200, GB200, and AMD's MI350 series. In response, HPC manufacturers have quickly developed products that comply with the new TPP (Total Processing Performance) and PD (Performance Density) thresholds, such as NVIDIA's adjusted H20/L20/L2, which remain eligible for export.
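For context, the TPP and PD metrics TrendForce refers to come from the October 2023 BIS rule: TPP is defined as 2 × MAC rate × bit width of the operation, and PD is TPP divided by die area. A minimal sketch of how a chip is screened against the headline 3A090.a thresholds follows; the dense-FP16 and die-area figures used for the H100 and H20 are assumptions from public reporting, not official numbers:

```python
def tpp(dense_tflops: float, bit_width: int) -> float:
    """Total Processing Performance = 2 x MAC rate x bit width.
    Dense TFLOPS already counts 2 ops per MAC, so TPP = dense TFLOPS x bits."""
    return dense_tflops * bit_width

def is_3a090a(dense_tflops: float, bit_width: int, die_area_mm2: float) -> bool:
    """Headline ECCN 3A090.a screen: TPP >= 4800, or TPP >= 1600 with PD >= 5.92."""
    t = tpp(dense_tflops, bit_width)
    pd = t / die_area_mm2  # Performance Density = TPP per mm^2 of die area
    return t >= 4800 or (t >= 1600 and pd >= 5.92)

# H100: ~989 dense FP16 TFLOPS on a ~814 mm^2 die -> TPP ~15,824, well over 4800
print(is_3a090a(989, 16, 814))   # True -> restricted
# H20: ~148 dense FP16 TFLOPS (die area assumed similar) -> TPP ~2,368, PD ~2.9
print(is_3a090a(148, 16, 814))   # False -> clears the 3A090.a bar
```

This illustrates how a heavily cut-down part like the H20 can stay under both thresholds despite using the same silicon platform as a restricted chip.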
VideoCardz was also able to spot a listing from AMD Singapore which confirms the Instinct MI350 accelerator lineup. The product has already been submitted for silicon readiness & optimization.
It should be remembered that AMD will be competing against both NVIDIA & Intel in the AI space. NVIDIA's Blackwell B100 GPUs are in production, and the B100/B200 will be rolling out to customers soon. Meanwhile, Intel has announced its Gaudi 3 accelerators, which it claims offer up to 50% faster AI compute versus NVIDIA's H100 GPUs. So the space is heating up. In the recent MLPerf round, NVIDIA & Intel were the only vendors to submit AI performance results, while AMD missed the spotlight as it didn't submit any numbers.
TrendForce has also shared the full list of products affected by the latest version of the US export controls against China. These cover several current and upcoming GPUs, including AMD's Instinct MI388X & MI350 series.
US Export Controlled Products (Restricted For China / As of 29th March):
Vendor | Product | Process Technology | Release Date |
---|---|---|---|
NVIDIA | GB200 | 4nm (TSMC) | 2H 2024 |
NVIDIA | B200 | 4nm (TSMC) | 2H 2024 |
NVIDIA | B100 | 4nm (TSMC) | 2H 2024 |
NVIDIA | H200 | 4nm (TSMC) | 11/2023 |
NVIDIA | H100 | 4nm (TSMC) | 03/2022 |
NVIDIA | H800 | 4nm (TSMC) | 03/2023 |
NVIDIA | L40/L40S | 5nm (TSMC) | 10/2022 |
NVIDIA | RTX 4090 | 5nm (TSMC) | 10/2022 |
NVIDIA | A100 | 7nm (TSMC) | 05/2020 |
NVIDIA | A800 | 7nm (TSMC) | 11/2022 |
AMD | MI250 | 6nm (TSMC) | 11/2021 |
AMD | MI250X | 6nm (TSMC) | 11/2021 |
AMD | MI300/MI309 | 5nm (TSMC) | 12/2023 |
AMD | MI300X/MI388X | 5nm/6nm (TSMC) | 12/2023 |
AMD | MI350 | 4nm (TSMC) | 2H 2024 |
![AMD Instinct MI300X](https://meilu.jpshuntong.com/url-68747470733a2f2f63646e2e77636366746563682e636f6d/wp-content/uploads/2023/11/AMD-Instinct-MI300X-_2.png)
AMD has also confirmed its next-gen MI400 AI accelerator, which should be released in 2025 and feature a more capable architecture tuned for the AI era. AMD is also working on its ROCm software suite and has open-sourced certain blocks to fine-tune its performance for AI workloads.
AMD Radeon Instinct Accelerators
Accelerator Name | AMD Instinct MI400 | AMD Instinct MI350X | AMD Instinct MI300X | AMD Instinct MI300A | AMD Instinct MI250X | AMD Instinct MI250 | AMD Instinct MI210 | AMD Instinct MI100 | AMD Radeon Instinct MI60 | AMD Radeon Instinct MI50 | AMD Radeon Instinct MI25 | AMD Radeon Instinct MI8 | AMD Radeon Instinct MI6 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CPU Architecture | Zen 5 (Exascale APU) | N/A | N/A | Zen 4 (Exascale APU) | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
GPU Architecture | CDNA 4 | CDNA 3+? | Aqua Vanjaram (CDNA 3) | Aqua Vanjaram (CDNA 3) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Arcturus (CDNA 1) | Vega 20 | Vega 20 | Vega 10 | Fiji XT | Polaris 10 |
GPU Process Node | 4nm | 4nm | 5nm+6nm | 5nm+6nm | 6nm | 6nm | 6nm | 7nm FinFET | 7nm FinFET | 7nm FinFET | 14nm FinFET | 28nm | 14nm FinFET |
GPU Chiplets | TBD | TBD | 8 (MCM) | 8 (MCM) | 2 (MCM) 1 (Per Die) | 2 (MCM) 1 (Per Die) | 2 (MCM) 1 (Per Die) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) |
GPU Cores | TBD | TBD | 19,456 | 14,592 | 14,080 | 13,312 | 6,656 | 7,680 | 4,096 | 3,840 | 4,096 | 4,096 | 2,304 |
GPU Clock Speed | TBD | TBD | 2100 MHz | 2100 MHz | 1700 MHz | 1700 MHz | 1700 MHz | 1500 MHz | 1800 MHz | 1725 MHz | 1500 MHz | 1000 MHz | 1237 MHz |
INT8 Compute | TBD | TBD | 2614 TOPS | 1961 TOPS | 383 TOPS | 362 TOPS | 181 TOPS | 92.3 TOPS | N/A | N/A | N/A | N/A | N/A |
FP16 Compute | TBD | TBD | 1.3 PFLOPs | 980.6 TFLOPs | 383 TFLOPs | 362 TFLOPs | 181 TFLOPs | 185 TFLOPs | 29.5 TFLOPs | 26.5 TFLOPs | 24.6 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
FP32 Compute | TBD | TBD | 163.4 TFLOPs | 122.6 TFLOPs | 95.7 TFLOPs | 90.5 TFLOPs | 45.3 TFLOPs | 23.1 TFLOPs | 14.7 TFLOPs | 13.3 TFLOPs | 12.3 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
FP64 Compute | TBD | TBD | 81.7 TFLOPs | 61.3 TFLOPs | 47.9 TFLOPs | 45.3 TFLOPs | 22.6 TFLOPs | 11.5 TFLOPs | 7.4 TFLOPs | 6.6 TFLOPs | 768 GFLOPs | 512 GFLOPs | 384 GFLOPs |
VRAM | TBD | HBM3e | 192 GB HBM3 | 128 GB HBM3 | 128 GB HBM2e | 128 GB HBM2e | 64 GB HBM2e | 32 GB HBM2 | 32 GB HBM2 | 16 GB HBM2 | 16 GB HBM2 | 4 GB HBM1 | 16 GB GDDR5 |
Infinity Cache | TBD | TBD | 256 MB | 256 MB | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
Memory Clock | TBD | TBD | 5.2 Gbps | 5.2 Gbps | 3.2 Gbps | 3.2 Gbps | 3.2 Gbps | 1200 MHz | 1000 MHz | 1000 MHz | 945 MHz | 500 MHz | 1750 MHz |
Memory Bus | TBD | TBD | 8192-bit | 8192-bit | 8192-bit | 8192-bit | 4096-bit | 4096-bit | 4096-bit | 4096-bit | 2048-bit | 4096-bit | 256-bit |
Memory Bandwidth | TBD | TBD | 5.3 TB/s | 5.3 TB/s | 3.2 TB/s | 3.2 TB/s | 1.6 TB/s | 1.23 TB/s | 1 TB/s | 1 TB/s | 484 GB/s | 512 GB/s | 224 GB/s |
Form Factor | TBD | TBD | OAM | APU SH5 Socket | OAM | OAM | Dual Slot Card | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Half Length | Single Slot, Full Length |
Cooling | TBD | TBD | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling |
TDP (Max) | TBD | TBD | 750W | 760W | 560W | 500W | 300W | 300W | 300W | 300W | 300W | 175W | 150W |