Skip to main navigation Skip to search Skip to main content

DPS: A DSM-based parameter server for machine learning

  • Beihang University
  • Fordham University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

To solve the problem of efficient storing and updating of model parameters in the learning process, the parameter server is concerned as a high-throughput distributed machine learning (ML) architecture with the emergence of big models with billions of parameters. Current parameter servers, such as the Parameter Server and the Petuum, do not address data management and lack high-level data abstraction. Moreover, they have no task scheduling and do not fully utilize the computing resource as well as possibly lead to load imbalance. Their programming interface is too complicated and they do not support data flow operations (e.g. map/reduce) which are very useful for data preprocessing. These drawbacks limit the performance and ease of use of such parameter servers.In this paper, we proposed DPS, a parameter server based on Distributed Shared Memory (DSM) for machine learning. DPS provides flexible consistency models, high-level data abstraction and management that support data flow operations, lightweight task scheduling system and user-friendly programming interface to solve the problems of existing systems mentioned above. The experimental results show that DPS can reduce networking time by about 50%, and achieve up to 1.9x performance compared to Petuum while the algorithms implemented on DPS use less code than those implemented on Petuum. In this paper, we proposed DPS, a parameter server based on Distributed Shared Memory (DSM) for machine learning. DPS provides flexible consistency models, high-level data abstraction and management that support data flow operations, lightweight task scheduling system and user-friendly programming interface to solve the problems of existing systems mentioned above. The experimental results show that DPS can reduce networking time by about 50%, and achieve up to 1.9x performance compared to Petuum while the algorithms implemented on DPS use less code than those implemented on Petuum.

Original languageEnglish
Title of host publicationProceedings - 14th International Symposium on Pervasive Systems, Algorithms and Networks, I-SPAN 2017, 11th International Conference on Frontier of Computer Science and Technology, FCST 2017 and 3rd International Symposium of Creative Computing, ISCC 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages20-27
Number of pages8
ISBN (Electronic)9781538608401
DOIs
StatePublished - 27 Nov 2017
Event14th International Symposium on Pervasive Systems, Algorithms and Networks, I-SPAN 2017, 11th International Conference on Frontier of Computer Science and Technology, FCST 2017 and 3rd International Symposium of Creative Computing, ISCC 2017 - Exeter, Devon, United Kingdom
Duration: 21 Jun 201723 Jun 2017

Publication series

NameProceedings - 14th International Symposium on Pervasive Systems, Algorithms and Networks, I-SPAN 2017, 11th International Conference on Frontier of Computer Science and Technology, FCST 2017 and 3rd International Symposium of Creative Computing, ISCC 2017
Volume2017-November

Conference

Conference14th International Symposium on Pervasive Systems, Algorithms and Networks, I-SPAN 2017, 11th International Conference on Frontier of Computer Science and Technology, FCST 2017 and 3rd International Symposium of Creative Computing, ISCC 2017
Country/TerritoryUnited Kingdom
CityExeter, Devon
Period21/06/1723/06/17

Keywords

  • Big data
  • Machine learning
  • Parameter server

Fingerprint

Dive into the research topics of 'DPS: A DSM-based parameter server for machine learning'. Together they form a unique fingerprint.

Cite this