AMD Instinct MI200 ‘Aldebaran’ GPU seemingly confirmed to be dual-die –

AMD Aldebaran might be dual-die MCM GPU

AMD MI200 had only just appeared in Linux patches, but due to the extensivity of the data, we are still learning new details hidden within the code. A patch from February 24th appears to confirm that Aldebaran will be a dual-die GPU. This was discovered by Coelacanth’s Dream. The patch clearly refers to alde_die_0 and alde_die_1, indicating that the chip is dual-die. In this case, “Alde” refers to Aldebaran, the upcoming compute accelerator for Instinct MI200, not Intel Alder Lake.

AMD Aldebaran in Linux patch, Source: Freedesktop

Aldebaran will compete with Intel Xe-HP(C) and NVIDIA Hopper as the first multi-chip module graphics processors, or more precisely compute accelerators. The same patches have already indicated that the MI200 will have an updated variant of High-Bandwidth-Memory called HBM2E. This means that AMD does not need to increase the number of stacks, but rather add more DRAM layers to increase capacity.

The multi-chip module will require new chiplets to be added to the package, such as the I/O module, which could theoretically be infused into each GPU module (like we had with the first-gen EPYC), but that seems rather unlikely.

Monolithic vs MCM design, Source: VideoCardz

AMD continues to claim that CDNA2-based GPU will be manufactured using “Advanced Node”. This likely means that each chiplets will be using different nodes, such as 7nm and 5nm within the same package. MCM GPUs such as Aldebaran will shape the future of graphics accelerators. It may take a while to see MCM in gaming GPUs though.

AMD CDNA Roadmap, Source: AMD

AMD Instinct Accelerators
Accelerator Name AMD Radeon Instinct MI60 AMD Instinct MI100 AMD Instinct MI200
Architecture 7nm GCN5 7nm CDNA1 (GFX908) CDNA2 (GFX90A)
GPU Vega 20 Arcturus Aldebaran (MCM)
GPU Cores 4096 7680 TBC
GPU Clock Speed 1800 MHz ~1500 MHz TBC
FP16 Compute 29.5 TFLOPs 185 TFLOPs TBC
FP32 Compute 14.7 TFLOPs 23.1 TFLOPs TBC
FP64 Compute 7.4 TFLOPs 11.5 TFLOPs TBC
Memory Clock 1000 MHz 1200 MHz TBC
Memory Bus 4096-bit bus 4096-bit bus TBC
Memory Bandwidth 1 TB/s 1.23 TB/s TBC
Form Factor Dual Slot, Full Length Dual Slot, Full Length OAM
Cooling Passive Cooling Passive Cooling TBC
TDP 300W 300W TBC

Source: Freedesktop via Coelacanth’s Dream

Source link

Leave a Reply

Your email address will not be published. Required fields are marked *

%d bloggers like this: