Efficient inference on IMG Series4 NNAs

Research into neural network architectures generally prioritises accuracy over efficiency. Certain papers have investigated efficiency (Tan and Le 2020).