NORMA eResearch @NCI Library

Dynamic Model Evaluation to Accelerate Distributed Machine Learning

Caton, Simon, Venugopal, Srikumar, Bhushan, Shashi, Velamuri, Vidya Sankar and Katrinis, Kostas (2018) Dynamic Model Evaluation to Accelerate Distributed Machine Learning. In: 2018 IEEE International Congress on Big Data (BigData Congress). IEEE, pp. 150-157. ISBN 9781538672327

Full text not available from this repository.
Official URL: http://dx.doi.org/10.1109/BigDataCongress.2018.000...

Abstract

The increase in the volume and variety of data has increased the reliance of data scientists on shared computational resources, either in-house or obtained via cloud providers, to execute machine learning and artificial intelligence programs. This, in turn, has created challenges of exploiting available resources to execute such "cognitive workloads" quickly and effectively to gather the needed knowledge and data insight. A common challenge in machine learning is knowing when to stop model building. This is often exacerbated in the presence of big data as a trade off between the cost of producing the model (time, volume of training data, resources utilised) and its general performance. Whilst there are many tools and application stacks available to train models over distributed resources, the challenge of knowing when a model is "good enough" or no longer worth pursuing persists. In this paper, we propose a framework for the evaluating the models produced by distributed machine learning algorithms during the training process. This framework integrates with the cluster job scheduler so as to finalise model training under constraints of resource availability or time, or simply because model performance is asymptotic with further training. We present a prototype implementation of this framework using Apache Spark and YARN, and demonstrate the benefits of this approach using sample applications with both supervised and unsupervised learning algorithms.

Item Type: Book Section
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > QA Mathematics > Computer software
T Technology > T Technology (General) > Information Technology > Computer software
Divisions: School of Computing > Staff Research and Publications
Depositing User: Caoimhe Ní Mhaicín
Date Deposited: 02 Jul 2018 09:40
Last Modified: 11 Oct 2018 17:21
URI: https://norma.ncirl.ie/id/eprint/3025

Actions (login required)

View Item View Item