In training workloads, there are scenarios in which graph re-compilations occur. These re-compilations add latency and slow down the overall training process across many iterations. This blog focuses on detecting these graph re-compilations.
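As a minimal sketch, one way to surface re-compilations is the runtime metrics API exposed under `habana_frameworks.torch.hpu.metrics`; the metric name `"graph_compilation"` and the exact format returned by `stats()` are assumptions based on the Gaudi PyTorch bridge documentation:

```python
import torch
# Assumption: the Gaudi software stack is installed; importing the core
# module registers the "hpu" device with PyTorch.
import habana_frameworks.torch.core  # noqa: F401
from habana_frameworks.torch.hpu.metrics import metric_localcontext

model = torch.nn.Linear(16, 4).to("hpu")

# Collect graph-compilation statistics only for the code inside the context.
with metric_localcontext("graph_compilation") as compilation_metric:
    for step in range(10):
        # Varying the input shape every step forces a new graph each time,
        # which is exactly the re-compilation pattern we want to detect.
        x = torch.randn(step + 1, 16).to("hpu")
        _ = model(x)

# stats() reports counters such as the number of compilations and the
# total time spent compiling (assumed format: (name, value) pairs).
print(dict(compilation_metric.stats()))
```

A steadily growing compilation count during steady-state training is the signal to look for; with static input shapes, compilations should stop after the first few steps.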
In this tutorial, we will learn how to write code that automatically detects which type of AI accelerator is installed on the machine (Gaudi, GPU, or CPU) and makes the changes needed to run the code smoothly.
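A minimal sketch of such detection logic, assuming the standard PyTorch availability checks (`torch.hpu.is_available()` becomes usable once `habana_frameworks` is imported):

```python
import torch

def detect_device() -> torch.device:
    """Pick the best available accelerator: Gaudi (HPU), then GPU, then CPU."""
    try:
        # Importing habana_frameworks registers the "hpu" backend with PyTorch.
        import habana_frameworks.torch.core  # noqa: F401
        if torch.hpu.is_available():
            return torch.device("hpu")
    except ImportError:
        pass  # No Gaudi software stack on this machine.
    if torch.cuda.is_available():
        return torch.device("cuda")
    return torch.device("cpu")

device = detect_device()
model = torch.nn.Linear(8, 2).to(device)
print(f"Running on: {device}")
```

Because the rest of the code only references `device`, the same script runs unmodified on all three hardware targets.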
In this tutorial, we will demonstrate fine-tuning a GPT-2 model on Habana Gaudi AI processors using the Hugging Face optimum-habana library with DeepSpeed.
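A condensed sketch of the training setup, assuming optimum-habana's `GaudiTrainer`/`GaudiTrainingArguments` API and a DeepSpeed config file at `ds_config.json`; the dataset choice and hyperparameters here are placeholders, not the tutorial's exact values:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
)
from optimum.habana import GaudiTrainer, GaudiTrainingArguments

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Placeholder dataset: any causal-LM-ready text corpus works here.
dataset = load_dataset("wikitext", "wikitext-2-raw-v1", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512,
                     padding="max_length")

train_dataset = dataset.map(tokenize, batched=True, remove_columns=["text"])
# mlm=False gives causal-LM labels (inputs shifted inside the model).
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)

args = GaudiTrainingArguments(
    output_dir="gpt2-finetuned",
    use_habana=True,                  # run on Gaudi (HPU)
    use_lazy_mode=True,               # lazy-mode graph execution
    gaudi_config_name="Habana/gpt2",  # Gaudi config from the Hugging Face Hub
    deepspeed="ds_config.json",       # assumption: DeepSpeed config in cwd
    per_device_train_batch_size=4,
    num_train_epochs=1,
)

trainer = GaudiTrainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    tokenizer=tokenizer,
    data_collator=collator,
)
trainer.train()
```

Passing `deepspeed=` through the training arguments is what activates the DeepSpeed engine; ZeRO stage, optimizer, and precision settings all live in the JSON config.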
One of the key challenges in Large Language Model (LLM) training is reducing the memory required for training without sacrificing compute/communication efficiency or model accuracy.
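To make the scale of the problem concrete, the widely cited estimate from the ZeRO paper puts mixed-precision Adam training at 16 bytes of state per parameter. A short back-of-the-envelope calculation (the 1.5B parameter count is chosen to match GPT-2 XL):

```python
def training_state_gb(num_params: int) -> float:
    """Per-replica training state for mixed-precision Adam (ZeRO paper estimate)."""
    bytes_per_param = (
        2 +  # fp16 parameters
        2 +  # fp16 gradients
        4 +  # fp32 master copy of parameters
        4 +  # fp32 Adam momentum
        4    # fp32 Adam variance
    )  # = 16 bytes per parameter, before activations and temporary buffers
    return num_params * bytes_per_param / 1e9

# GPT-2 XL: ~1.5B parameters -> ~24 GB of state on every device
print(f"{training_state_gb(1_500_000_000):.1f} GB")
```

Techniques such as ZeRO attack exactly this term by partitioning the optimizer states, gradients, and parameters across devices instead of replicating them.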