TY - JOUR
T1 - Rethinking ResNets
T2 - improved stacking strategies with high-order schemes for image classification
AU - Luo, Zhengbo
AU - Sun, Zitang
AU - Zhou, Weilian
AU - Wu, Zizhang
AU - Kamata, Sei ichiro
N1 - Funding Information:
This work was supported by JST SPRING, Grant number JPMJSP2128.
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/8
Y1 - 2022/8
N2 - Abstract: Various deep neural network architectures (DNNs) maintain massive vital records in computer vision. While drawing attention worldwide, the design of the overall structure lacks general guidance. Based on the relationship between DNN design and numerical differential equations, we performed a fair comparison of the residual design with higher order perspectives. We show that the widely used DNN design strategy, constantly stacking a small design (usually, 2–3 layers), could be easily improved, supported by solid theoretical knowledge and with no extra parameters needed. We reorganise the residual design in higher order ways, which is inspired by the observation that many effective networks can be interpreted as different numerical discretisations of differential equations. The design of ResNet follows a relatively simple scheme, which is Euler forward; however, the situation becomes complicated rapidly while stacking. We suppose that stacked ResNet is somehow equalled to a higher order scheme; then, the current method of forwarding propagation might be relatively weak compared with a typical high-order method such as Runge–Kutta. We propose HO-ResNet to verify the hypothesis on widely used CV benchmarks with sufficient experiments. Stable and noticeable increases in performance are observed, and convergence and robustness are also improved. Our stacking strategy improved ResNet-30 by 2.15% and ResNet-58 by 2.35% on CIFAR-10, with the same settings and parameters. The proposed strategy is fundamental and theoretical and can, therefore, be applied to any network as a general guideline. Graphical abstract: [Figure not available: see fulltext.].
AB - Abstract: Various deep neural network architectures (DNNs) maintain massive vital records in computer vision. While drawing attention worldwide, the design of the overall structure lacks general guidance. Based on the relationship between DNN design and numerical differential equations, we performed a fair comparison of the residual design with higher order perspectives. We show that the widely used DNN design strategy, constantly stacking a small design (usually, 2–3 layers), could be easily improved, supported by solid theoretical knowledge and with no extra parameters needed. We reorganise the residual design in higher order ways, which is inspired by the observation that many effective networks can be interpreted as different numerical discretisations of differential equations. The design of ResNet follows a relatively simple scheme, which is Euler forward; however, the situation becomes complicated rapidly while stacking. We suppose that stacked ResNet is somehow equalled to a higher order scheme; then, the current method of forwarding propagation might be relatively weak compared with a typical high-order method such as Runge–Kutta. We propose HO-ResNet to verify the hypothesis on widely used CV benchmarks with sufficient experiments. Stable and noticeable increases in performance are observed, and convergence and robustness are also improved. Our stacking strategy improved ResNet-30 by 2.15% and ResNet-58 by 2.35% on CIFAR-10, with the same settings and parameters. The proposed strategy is fundamental and theoretical and can, therefore, be applied to any network as a general guideline. Graphical abstract: [Figure not available: see fulltext.].
KW - Deep neural networks
KW - Image classification
KW - Image processing
KW - Neural ordinary differential equations
UR - http://www.scopus.com/inward/record.url?scp=85132295414&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85132295414&partnerID=8YFLogxK
U2 - 10.1007/s40747-022-00671-3
DO - 10.1007/s40747-022-00671-3
M3 - Article
AN - SCOPUS:85132295414
SN - 2199-4536
VL - 8
SP - 3395
EP - 3407
JO - Complex and Intelligent Systems
JF - Complex and Intelligent Systems
IS - 4
ER -