Models DeepSpeedFP8Gaudi2GenerativeInferenceLarge Language ModelsLLMNatural Language ProcessingNLPPyTorch BLOOM 176B Inference HuggingFace BLOOM model for Inference on Gaudi2, using DeepSpeed for Inference Learn more
Models FP8GaudiGaudi2GenerativeInferenceLarge Language ModelsLLMPyTorch BLOOM 7B Large Language Model for Inference Learn more