Scaling TensorFlow to 300 million predictions per second

2021-09-20

Jan Hartman, Davorin Kopič

Abstract

We present the process of transitioning machine learning models to the TensorFlow framework at large scale in an online advertising ecosystem. In this talk, we address the key challenges we faced and describe how we tackled them, notably implementing the models in TensorFlow and serving them efficiently with low latency using various optimization techniques.
