What is it about?
Predicting resource consumption for the distributed training of deep learning models is of paramount importance: it can tell users in advance how long their training will take and help them manage its cost. Yet no such prediction is available today, because resource consumption varies significantly with the "setting" (for example, the GPU type) and with the "workload" (the deep learning model being trained). Previous studies have attempted to model such a prediction, but they fall short of accommodating the many combinations of settings and workloads. This study presents Driple, which uses graph neural networks to predict the resource consumption of diverse workloads. Driple also applies transfer learning so that the graph neural networks can adapt to differences in settings. The evaluation shows that Driple predicts resource consumption effectively across a wide range of workloads and settings. In addition, Driple reduces the time required to tailor the prediction to a different setting by up to 7.3×.
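To give a concrete flavor of the approach, the sketch below shows how a training workload might be expressed as a graph of operations and fed to a small graph neural network that regresses resource-consumption targets such as training time or GPU memory. This is an illustrative toy in plain PyTorch, not Driple's actual architecture; the node features, targets, and layer sizes are assumptions made for the example.

```python
# Minimal sketch (not the authors' code): a toy graph neural network that
# reads a workload expressed as a graph of operations and regresses
# resource-consumption targets. Features and targets are illustrative.
import torch
import torch.nn as nn

class SimpleGNNPredictor(nn.Module):
    def __init__(self, in_dim: int, hidden_dim: int, n_targets: int):
        super().__init__()
        self.encode = nn.Linear(in_dim, hidden_dim)
        self.message = nn.Linear(hidden_dim, hidden_dim)      # per-neighbor message
        self.update = nn.Linear(2 * hidden_dim, hidden_dim)   # combine node state + messages
        self.readout = nn.Linear(hidden_dim, n_targets)       # graph-level regression head

    def forward(self, x, adj):
        # x:   [num_nodes, in_dim]     per-operation features (e.g., FLOPs, tensor sizes)
        # adj: [num_nodes, num_nodes]  adjacency matrix of the computation graph
        h = torch.relu(self.encode(x))
        for _ in range(2):                                     # two rounds of message passing
            agg = adj @ self.message(h)                        # sum messages from neighbors
            h = torch.relu(self.update(torch.cat([h, agg], dim=-1)))
        graph_emb = h.mean(dim=0)                              # mean-pool nodes into one embedding
        return self.readout(graph_emb)                         # e.g., [time, memory, network traffic]

# Toy usage: a 4-operation workload graph with 8-dimensional node features.
x = torch.randn(4, 8)
adj = torch.tensor([[0, 1, 0, 0],
                    [0, 0, 1, 0],
                    [0, 0, 0, 1],
                    [0, 0, 0, 0]], dtype=torch.float)
model = SimpleGNNPredictor(in_dim=8, hidden_dim=16, n_targets=3)
print(model(x, adj))  # predicted resource-consumption values (untrained, random weights)
```

In the same spirit, adapting the predictor to a different setting could amount to reusing the trained weights and fine-tuning only part of the network on a small amount of data from the new setting, which is roughly the intuition behind using transfer learning to cut the tailoring time.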
Why is it important?
This is the first work that predicts the resource consumption and training time of diverse deep learning workloads through graph neural networks and transfer learning.
Read the Original
This page is a summary of: Prediction of the Resource Consumption of Distributed Deep Learning Systems, June 2022, ACM (Association for Computing Machinery), DOI: 10.1145/3489048.3530962.