التخطي إلى المحتوى

Amazon Web Services this week introduced its next-generation Trainium3 accelerator for AI training and inference. As AWS puts it, the new processor is twice as fast as its predecessor and is four times more efficient. This makes it one of the best solutions for AI training and inference in terms of cost. In absolute numbers, Trainium3 offers up to 2,517 MXFP8 TFLOPS, which is nearly two times lower compared to Nvidia’s Blackwell Ultra. However, AWS’s Trn3 UltraServer packs 144 Trainium3 chips per rack, and offers 0.36 ExaFLOPS of FP8 performance, therefore matching the performance of Nvidia’s NVL72 GB300. This is a very big deal, as very few companies can challenge Nvidia’s rack-scale AI systems.

AWS Trainium3

Swipe to scroll horizontally
AWS Trainium vs Nvidia Blackwell

Accelerator name

Trainium2

Trainium3

B200

B300 (Ultra)

Architecture

Trainium2

Trainium3

Blackwell

Blackwell Ultra

Process Technology

?

N3E or N3P

4NP

4NP

Physical Configuration

2 x Accelerators

2 x Accelerators

2 x Reticle Sized GPUs

2 x Reticle Sized GPUs

Packaging

CoWoS-?

CoWoS-?

CoWoS-L

CoWoS-L

FP4 PFLOPs (per Package)

2.517

10

15

FP8/INT6 PFLOPs (per Package)

1299

2.517

5

5

INT8 POPS (per Package)

5

0.33

BF16 PFLOPs (per Package)

0.667

0.671

2.5

2.5

TF32 PFLOPs (per Package)

0.667

0.671

1.15

1.25

FP32 PFLOPs (per Package)

0.181

0.183

0.08

0.08

FP64/FP64 Tensor TFLOPs (per Package)

40

1.3

Memory

96 GB HBM3

144 GB HBM3E

192 GB HBM3E

288 GB HBM3E

Memory Bandwidth

2.9 GB/s

4.9 GB/s

8 TB/s

8 TB/s

HBM Stacks

8

8

8

8

Inter-GPU communications

NeuronLink-v3 1.28 TB/s

NeuronLink-v4 2.56 TB/s

NVLink 5.0, 200 GT/s | 1.8 TB/s bidirectional

NVLink 5.0, 200 GT/s | 1.8 TB/s bidirectional

SerDes speed (Gb/s unidirectional)

?

?

224G

224G

GPU TDP

?

?

1200 W

1400 W

Accompanying CPU

Intel Xeon

AWS Graviton and Intel Xeon

72-core Grace

72-core Grace

Launch Year

2024

2025

2024

2025

Fonte

التعليقات

اترك تعليقاً

لن يتم نشر عنوان بريدك الإلكتروني. الحقول الإلزامية مشار إليها بـ *