Post date: Aug 26, 2018 7:40:55 AM
Summary of https://www.youtube.com/watch?v=AgpmDOsdTIA
Published by Microsoft Research on Sep 18, 2017
How NNs in small devices (gadgets) differ from NNs in datacenters:
Usually safety-critical (except smartphones), whereas datacenter NNs rarely are
Low power is required, rather than merely nice-to-have
Real-time operation is required, rather than merely preferable
Desirable properties of NNs on gadgets:
sufficiently high accuracy
low computational complexity
low energy usage
small model size
Advantages of small models:
Fewer parameters mean bigger opportunities for scaling training, e.g. a 145x speedup on 256 GPUs with FireCaffe (CVPR 2016), and a 47x speedup for GoogLeNet
Enables complete on-chip integration of the CNN model together with its weights, with no need for off-chip memory. This dramatically reduces inference energy, and allows up-close/personal data gathering and integration with the sensor
Enables continuous wireless updates of models if retraining is required
Seven ways to squeeze:
1. Replace FC layers with convolutional layers (first sketch below)
2. Kernel reduction: shrink the height x width of filters, e.g. 3x3 -> 1x1
3. Channel reduction: reduce the number of filters and channels
4. Evenly spaced downsampling: downsample gradually (evenly spaced) rather than all early or all late (second sketch below)
5. Depthwise separable convolutions: apply each spatial convolution to only some of the channels (one channel per filter in the fully depthwise case), followed by a 1x1 pointwise convolution (third sketch below)
6. Shuffle layer: lets the channel groups from ideas 2 and 5 talk to each other for the first time
7. Distillation & compression: refer to the paper on Deep Compression; there are many ways to do this (last sketch below)