Developer Blog - Intel Gaudi Developers

06/07/2024

We’re excited to introduce the release of Intel® Gaudi® software version 1.16.0

Bringing forth numerous enhancements and updates for an improved GenAI development experience.

Intel Gaudi Software

06/06/2024

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Lean how to use TGI-gaudi and Langchain to build and deploy a RAG application

Gaudi2, Large Language Models, RAG

05/08/2024

Leveraging Intel Gaudi for Distributed Training with FSDP

Learn how to execute scalable model development with Fully sharded data parallel (FSDP) training using PyTorch and Intel Gaudi Accelerators

FDSP, Gaudi2, Intel Gaudi Accelerators

04/16/2024

We’re excited to introduce the release of Intel® Gaudi® software version 1.15.0

Bringing forth numerous enhancements and updates for an improved user experience.

Intel Gaudi Software, synapseai

12/01/2023

Fine-Tuning Llama2-70B with DeepSpeed ZeRO-3 and Low-Rank Adaptation (LoRA) on Intel® Gaudi®2 AI Accelerator

With the Intel Gaudi SynapseAI 1.13.0 release, users can run Fine Tune the Llama2 70B model using only 8 Gaudi2 Accelerators.

DeepSpeed, Fine Tuning, Llama, LoRA

11/22/2023

We’re excited to introduce the release of Habana^® SynapseAI^® Software version 1.13.0

Bringing forth numerous enhancements and updates for an improved user experience.

synapseai

08/31/2023

Training Llama and Bloom 13 Billion Parameter LLMs with 3D Parallelism on Habana® Gaudi2®

One of the main challenges in training Large Language Models (LLMs) is that they are often too large to fit on a single node or even if they fit, the training may be too slow. To address this issue, their training can be parallelized across multiple Gaudi accelerators (HPUs).

3D-Parallelism, DeepSpeed, GenAI, Large Language Models

08/31/2023

Porting a model to Megatron-DeepSpeed with Habana Gaudi

If you want to train a large model using Megatron-DeepSpeed, but the model you want is not included in the implementation, you can port it to the Megatron-DeepSpeed package. Assuming your model is transformer-based, you can add your implementation easily, basing it on existing code.

DeepSpeed, GenAI, Large Language Models

08/16/2023

Optimizing Large Language Model Inference on Gaudi2 with Hugging Face Optimum-Habana

We have optimized additional Large Language Models on Hugging Face using the Optimum Habana library.

DeepSpeed, Hugging Face, Inference

08/09/2023

The Habana team is happy to announce the release of Habana^® SynapseAI^® Software version 1.11.0.

In this release, we’ve upgraded versions of several libraries, including DeepSpeed 0.9.4, PyTorch Lightning 2.0.4 and TensorFlow 2.12.1.

synapseai

07/25/2023

Accelerate Llama 2 with Intel AI Hardware and Software Optimizations

We are excited to see Meta release Llama 2, to help further democratize access to LLMs. Making such models more widely available will facilitate efforts across the AI community to benefit the world at large.

Deep Learning, Gaudi2, Large Language Models

06/30/2023

New MLCommons Results Highlight Impressive Competitive AI Gains for Intel

MLCommons published results of its industry AI performance benchmark, MLPerf Training 3.0, in which both the Habana® Gaudi®2 deep learning accelerator and the 4th Gen Intel® Xeon® Scalable processor delivered impressive training results.

Gaudi2, MLPerf

06/26/2023

Habana^® Gaudi^®2 Powers Deep Learning Instances on Genesis Cloud at Collision 2023

Habana Labs, an Intel company, and Genesis Cloud are collaborating to deliver a new class of cloud instances with Habana® Gaudi®2 accelerators to enable high-performance, high-efficiency deep learning training and inference workloads in the cloud.

Gaudi2

06/06/2023

The Habana team is happy to announce the release of Habana® SynapseAI® Software version 1.10.0.

In the 1.10 release, we’ve upgraded versions of several libraries, including PyTorch 2.0.1, PyTorch Lightning 2.0.0 and TensorFlow 2.12.0. We have added support for EKS 1.25 and OpenShift 4.12

synapseai

05/20/2023

Habana Showcases Gaudi2 Performance on Large Language and Generative AI Models at ISC

We’re excited to participate in this year’s ISC High Performance Compute 2023 event in Hamburg Germany. This year our team will demonstrate the capabilities of our Habana Gaudi2® processors, which deliver high-performance, high-efficiency deep learning training and inference.

Gaudi2, Large Language Models

05/05/2023

Equus Lab-as-a-Service with Habana Gaudi2 Processors Eases Testing and Deployment of Deep Learning Systems

Equus and Habana have teamed up to simplify the process of testing, implementing and deploying AI infrastructure based on Habana Gaudi2 processors.

Gaudi2

04/27/2023

Detecting frequent graph re-compilations

In training workloads, there may occur some scenarios in which graph re-compilations occur. This can create system latency and slow down the overall training process with multiple iterations of graph compilation. This blog focuses on detecting these graph re-compilations.

debugging, Gaudi, performance, pytorch

04/14/2023

New Habana Autonomous Driving Use Case Enabled with Gaudi Processors

Announcing a new End-to-End use case showing Training of a semantic segmentation model for Autonomous Driving

Gaudi2, Semantic Segmentation, Training

03/30/2023

The Habana team is happy to announce the release of SynapseAI® Software version 1.9.0.

In the 1.9 release, we’ve upgraded versions of several libraries, including PyTorch Lightning 1.9.4, DeepSpeed 0.7.7, fairseq 0.12.3, and Horovod v0.27.0.

synapseai

03/28/2023

Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator

In this article, you'll learn how to easily deploy multi-billion parameter language models on Habana Gaudi2 and get a view into the Hugging Face performance evaluation of Gaudi2 and A100 on BLOOMZ.

Gaudi2

03/21/2023

Accelerating Distributed Training Performance using EFA Peer Direct on Gaudi-Based AWS EC2 DL1 Instances

AWS and Habana collaborated to enable EFA Peer Direct support on the Gaudi-based AWS DL1 instances, offering users significant improvement in multi-instance model training performance.

Gaudi2

03/21/2023

New Habana AI Retail Use Case Enables Automated Store Shelf Management with Gaudi Processors

AI is becoming increasingly important for retail use cases. It can provide retailers with advanced capabilities to personalize customer experiences, optimize operations, and increase sales. Habana has published a new Retail use case showing an ...

Gaudi2

03/03/2023