SPARQ-SGD: Event-Triggered and Compressed Communication in Decentralized Stochastic Optimization

2019-10-31Unverified0· sign in to hype

Navjot Singh, Deepesh Data, Jemin George, Suhas Diggavi

Unverified — Be the first to reproduce this paper.

Abstract

In this paper, we propose and analyze SPARQ-SGD, which is an event-triggered and compressed algorithm for decentralized training of large-scale machine learning models. Each node can locally compute a condition (event) which triggers a communication where quantized and sparsified local model parameters are sent. In SPARQ-SGD each node takes at least a fixed number (H) of local gradient steps and then checks if the model parameters have significantly changed compared to its last update; it communicates further compressed model parameters only when there is a significant change, as specified by a (design) criterion. We prove that the SPARQ-SGD converges as O(1nT) and O(1nT) in the strongly-convex and non-convex settings, respectively, demonstrating that such aggressive compression, including event-triggered communication, model sparsification and quantization does not affect the overall convergence rate as compared to uncompressed decentralized training; thereby theoretically yielding communication efficiency for "free". We evaluate SPARQ-SGD over real datasets to demonstrate significant amount of savings in communication over the state-of-the-art.

Tasks

Quantization Stochastic Optimization

SPARQ-SGD: Event-Triggered and Compressed Communication in Decentralized Stochastic Optimization

Abstract

Tasks

Reproductions