Efficient CNN Hardware Architecture Based on Linear Approximation and Computation Reuse Technique

Mohammed F. Tolba, Hani Saleh, Mahmoud Al-Qutayri, Ayman Hroub, Thanos Stouraitis

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    2 Scopus citations

    Abstract

    Large deep neural network (DNN) models pose significant computational and memory challenges, particularly when deploying them on edge devices. To address this, techniques such as pruning, quantization, data sparsity, and data reuse have been applied to DNNs, mitigating memory and computational complexity at the cost of some accuracy loss. This paper introduces an efficient hardware accelerator tailored for Convolutional Neural Networks (CNNs). The proposed architecture is the result of a co-optimized approach encompassing both algorithms and hardware. It leverages linear approximation of pre-trained network weights with minimal accuracy loss. A novel computational reuse method is presented to curtail the number of multiplication and addition operations and memory accesses, seamlessly integrated into the dedicated elements within the CNN design. To validate the effectiveness of this architecture, we conducted experiments on a gem5-based RISCV simulator, employing the VGG16 model for the CIFAR 100 dataset and the AlexNet model for the TinyImageNet dataset. The results showcased an impressive speedup of approximately 2× on AlexNet compared to the reference model. Additionally, our proposed CNN design was successfully implemented on the Xilinx Kintex 7 Field Programmable Gate Array (FPGA), achieving a notable reduction in hardware resource utilization compared to prior research efforts. This work serves as a versatile framework for evaluating diverse trade-offs involving accuracy, latency, power consumption, and cost across different CNN architectures.

    Original languageBritish English
    Title of host publication2023 International Conference on Microelectronics, ICM 2023
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages7-10
    Number of pages4
    ISBN (Electronic)9798350380828
    DOIs
    StatePublished - 2023
    Event2023 International Conference on Microelectronics, ICM 2023 - Abu Dhabi, United Arab Emirates
    Duration: 17 Nov 202320 Nov 2023

    Publication series

    NameProceedings of the International Conference on Microelectronics, ICM

    Conference

    Conference2023 International Conference on Microelectronics, ICM 2023
    Country/TerritoryUnited Arab Emirates
    CityAbu Dhabi
    Period17/11/2320/11/23

    Keywords

    • AI accelerator
    • approximate computing
    • computational reuse
    • Deep neural network
    • Hardware acceleration

    Fingerprint

    Dive into the research topics of 'Efficient CNN Hardware Architecture Based on Linear Approximation and Computation Reuse Technique'. Together they form a unique fingerprint.

    Cite this