Large Language Models Archives - Intel Gaudi Developers

06/06/2024

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Learn how to use TGI-gaudi and LangChain to build and deploy a RAG application

08/31/2023

Training Llama and Bloom 13 Billion Parameter LLMs with 3D Parallelism on Habana® Gaudi2®

One of the main challenges in training Large Language Models (LLMs) is that they are often too large to fit on a single node, and even when they do fit, training may be too slow. To address this, training can be parallelized across multiple Gaudi accelerators (HPUs).

3D-Parallelism, DeepSpeed, GenAI, Large Language Models

08/31/2023

Porting a model to Megatron-DeepSpeed with Habana Gaudi

If you want to train a large model using Megatron-DeepSpeed, but the model you want is not included in the implementation, you can port it to the Megatron-DeepSpeed package. Assuming your model is transformer-based, you can add your implementation easily, basing it on existing code.

DeepSpeed, GenAI, Large Language Models

07/25/2023

Accelerate Llama 2 with Intel AI Hardware and Software Optimizations

We are excited to see Meta release Llama 2 to help further democratize access to LLMs. Making such models more widely available will facilitate efforts across the AI community to benefit the world at large.

Deep Learning, Gaudi2, Large Language Models

05/20/2023

Habana Showcases Gaudi2 Performance on Large Language and Generative AI Models at ISC

We’re excited to participate in this year’s ISC High Performance Compute 2023 event in Hamburg, Germany. This year our team will demonstrate the capabilities of our Habana Gaudi2® processors, which deliver high-performance, high-efficiency deep learning training and inference.

Gaudi2, Large Language Models

10/11/2022

Memory-Efficient Training on Habana® Gaudi® with DeepSpeed

One of the key challenges in Large Language Model (LLM) training is reducing memory requirements without sacrificing compute/communication efficiency or model accuracy.

DeepSpeed, Gaudi, Large Language Models

Intel® Gaudi® AI Accelerators Blog