BLOOM 176B Inference
HuggingFace BLOOM model for Inference on Gaudi2, using DeepSpeed for Inference
Megatron-DeepSpeed BLOOM 13B
GPT based model with 13B parameters based on Megatron DeepSpeed on Gaudi2
Hugging Face BERT large model (uncased) whole word masking
Habana Optimum BERT large model (uncased) whole word masking for PyTorch
Hugging Face RoBERTa
Habana Optimum RoBERTa for PyTorch
Hugging Face BERT
Habana Optimum BERT for PyTorch
Hugging Face ALBERT Large
Habana Optimum ALBERT Large for PyTorch
Hugging Face Swin Transformer
Habana Optimum Swin Transformer for PyTorch
Hugging Face GPT2
Habana Optimum GPT2 for PyTorch
Hugging Face ALBERT XXLarge
Habana Optimum ALBERT XXLarge for PyTorch
Hugging Face RoBERTa large
Habana Optimum RoBERTa large for PyTorch
Hugging Face DistilBERT
Habana Optimum DistilBERT for PyTorch
Hugging Face T5
Habana Optimum T5 for PyTorch
BERT-5B
BERT model with 5 Billion Parameters using DeepSpeed for NLP
BERT-1.5B
BERT model with 1.5 Billion Parameters using DeepSpeed for NLP
ELECTRA Fine Tuning
ELECTRA Large Discriminator model for Fine Tuning
DistilBERT
DistilBERT model from Huggingface repository
BERT Fine tuning
BERT Fine Tuning Training and Inference