Build or migrate Gaudi AI training models with these guides, videos and tools.
The Gaudi® platform architecture is purpose-designed for Deep Learning training workloads in data centers. It comprises a fully programmable Tensor Processing Core (TPC) cluster with supporting development tools and libraries. With Gaudi’s programmable architecture, we give the data scientist flexibility and ease of use to migrate or build models with Habana’s SynapseAI® software stack, reference models, extensive kernel libraries and documentation.
Here are resources to help Data Scientists bring up models to run on Gaudi. The assumption is the environment has been configured and ready for model training. If additional system configuration is required, refer to the Setup and Install page to get started.
Detailed instructions for installation of the Habana Gaudi driver and SynapseAI software stack if you need to setup your environment.
Detailed description of how to run your model on TensorFlow.
Detailed description of how to run your model on PyTorch.
Learn how to scale your TensorFlow models training with Horovod and HPUStrategy APIs support and Gaudi NIC or Host NIC for system scaling.
Learn how to scale your PyTorch models training with Distributed Data Parallel APIs support and Gaudi NIC or Host NIC for system scaling.
View repository of all Habana reference models and instructions on how to run and train your models, including TensorFlow and PyTorch.
Gaudi: Model Migration
A how-to tutorial to make minimum changes needed to get your model to run on Gaudi.
Gaudi: Training with TensorFlow
A how-to tutorial to help you run your model on TensorFlow.
Gaudi: Training with PyTorch
A how-to tutorial to help you run your model on PyTorch