BLOOM 176B Inference
HuggingFace BLOOM model for Inference on Gaudi2, using DeepSpeed for Inference
Megatron-DeepSpeed BLOOM 13B
GPT based model with 13B parameters based on Megatron DeepSpeed on Gaudi2
BLOOM 7B
Large Language Model for Inference
Hugging Face GPT2
Optimum Habana GPT2 for PyTorch
Hugging Face T5
Optimum Habana T5 for PyTorch
BERT-5B
BERT model with 5 Billion Parameters using DeepSpeed for NLP
BERT-1.5B
BERT model with 1.5 Billion Parameters using DeepSpeed for NLP