Abstract
Industry analysts predict that deep learning (DL) will account for the majority of cloud workloads, and that training DL models will represent the majority of server applications within the next few years. Among DL workloads, foundation models, a new class of AI models with billions of parameters that are trained on broad data (typically via self-supervision), are expected to consume the majority of the infrastructure.