Task partitioning

Distributed Inference Acceleration with Adaptive DNN Partitioning and Offloading

Deep neural networks (DNN) are the de-facto solution behind many intelligent applications of today, ranging from machine translation to autonomous driving. DNNs are accurate but resource-intensive, especially for embedded devices such as smartphones …