Home » 3D-Parallelism

Intel® Gaudi® AI Accelerators Blog

/ 3D-Parallelism

08/31/2023

Training Llama and Bloom 13 Billion Parameter LLMs with 3D Parallelism on Habana® Gaudi2®

One of the main challenges in training Large Language Models (LLMs) is that they are often too large to fit on a single node or even if they fit, the training may be too slow. To address this issue, their training can be parallelized across multiple Gaudi accelerators (HPUs).

3D-Parallelism, DeepSpeed, GenAI, Large Language Models