TY - GEN
T1 - A Regularization Approach to Maximize Common Sub-Expressions in Neural Network Weights
AU - Kavvousanos, E.
AU - Kouretas, I.
AU - Paliouras, V.
AU - Stouraitis, T.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - This paper modifies the training of artificial neural networks in order to derive weights the binary expression of which is composed of a limited set of sub-expressions only, placed in specified positions. The benefit of the proposed technique is the substantial complexity reduction achieved. The proposed method can be applied as a post-processing step on pre-trained models, further expanding its impact. The main concept is the use of an introduced regularization function that promotes specific sub-expressions which, in turn, are shown to improve the performance of common sub-expression sharing techniques, reducing area, time and power requirements for inference. Synthesis results reported here, quantify the impact of the proposed method onto hardware implementations and demonstrate substantial area, delay and power improvements, over prior art. In certain cases a ×4 area reduction is achieved, combined with corresponding reduction in delay and power dissipation.
AB - This paper modifies the training of artificial neural networks in order to derive weights the binary expression of which is composed of a limited set of sub-expressions only, placed in specified positions. The benefit of the proposed technique is the substantial complexity reduction achieved. The proposed method can be applied as a post-processing step on pre-trained models, further expanding its impact. The main concept is the use of an introduced regularization function that promotes specific sub-expressions which, in turn, are shown to improve the performance of common sub-expression sharing techniques, reducing area, time and power requirements for inference. Synthesis results reported here, quantify the impact of the proposed method onto hardware implementations and demonstrate substantial area, delay and power improvements, over prior art. In certain cases a ×4 area reduction is achieved, combined with corresponding reduction in delay and power dissipation.
KW - common sub-expression sharing
KW - deep neural networks
KW - regularization
KW - training
KW - weight compression
UR - https://www.scopus.com/pages/publications/85183580709
U2 - 10.1109/ICECS58634.2023.10382719
DO - 10.1109/ICECS58634.2023.10382719
M3 - Conference contribution
AN - SCOPUS:85183580709
T3 - ICECS 2023 - 2023 30th IEEE International Conference on Electronics, Circuits and Systems: Technosapiens for Saving Humanity
BT - ICECS 2023 - 2023 30th IEEE International Conference on Electronics, Circuits and Systems
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 30th IEEE International Conference on Electronics, Circuits and Systems, ICECS 2023
Y2 - 4 December 2023 through 7 December 2023
ER -