Large Language Models
Habana Developer Blog / Large Language Models
10/11/2022
Memory-Efficient Training on Habana® Gaudi® with DeepSpeed
One of the key challenges in Large Language Model (LLM) training is reducing memory requirements without sacrificing compute/communication efficiency or model accuracy.
DeepSpeed, Gaudi, Large Language Models