TY - GEN
T1 - Integrating Features for Recognizing Human Activities through Optimized Parameters in Graph Convolutional Networks and Transformer Architectures
AU - Belal, Mohammad
AU - Hassan, Taimur
AU - Hassan, Abdelfatah
AU - Alsheikh, Nael
AU - Elhendawi, Noureldin
AU - Hussain, Irfan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Human activity recognition is a major field of study that employs computer vision, machine vision, and deep learning techniques to categorize human actions. The field of deep learning has made significant progress, with architectures that are extremely effective at capturing human dynamics. This study emphasizes the influence of feature fusion on the accuracy of activity recognition. This technique addresses the limitation of conventional models, which face difficulties in identifying activities because of their limited capacity to understand spatial and temporal features. The technique employs sensory data obtained from four publicly available datasets: HuGaDB, PKU-MMD, LARa, and TUG. The accuracy and F1-score of two deep learning models, specifically a Transformer model and a Parameter-Optimized Graph Convolutional Network (PO-GCN), were evaluated using these datasets. The feature fusion technique integrated the final layer features from both models and inputted them into a classifier. Empirical evidence demonstrates that PO-GCN outperforms standard models in activity recognition. HuGaDB demonstrated a 2.3% improvement in accuracy and a 2.2% increase in F1-score. TUG showed a 5% increase in accuracy and a 0.5 % rise in F1-score. On the other hand, LARa and PKU-MMD achieved lower accuracies of 64% and 69 % respectively. This indicates that the integration of features enhanced the performance of both the Transformer model and PO-GCN.
AB - Human activity recognition is a major field of study that employs computer vision, machine vision, and deep learning techniques to categorize human actions. The field of deep learning has made significant progress, with architectures that are extremely effective at capturing human dynamics. This study emphasizes the influence of feature fusion on the accuracy of activity recognition. This technique addresses the limitation of conventional models, which face difficulties in identifying activities because of their limited capacity to understand spatial and temporal features. The technique employs sensory data obtained from four publicly available datasets: HuGaDB, PKU-MMD, LARa, and TUG. The accuracy and F1-score of two deep learning models, specifically a Transformer model and a Parameter-Optimized Graph Convolutional Network (PO-GCN), were evaluated using these datasets. The feature fusion technique integrated the final layer features from both models and inputted them into a classifier. Empirical evidence demonstrates that PO-GCN outperforms standard models in activity recognition. HuGaDB demonstrated a 2.3% improvement in accuracy and a 2.2% increase in F1-score. TUG showed a 5% increase in accuracy and a 0.5 % rise in F1-score. On the other hand, LARa and PKU-MMD achieved lower accuracies of 64% and 69 % respectively. This indicates that the integration of features enhanced the performance of both the Transformer model and PO-GCN.
UR - https://www.scopus.com/pages/publications/85219543450
U2 - 10.1109/DICTA63115.2024.00072
DO - 10.1109/DICTA63115.2024.00072
M3 - Conference contribution
AN - SCOPUS:85219543450
T3 - Proceedings - 2024 25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024
SP - 449
EP - 453
BT - Proceedings - 2024 25th International Conference on Digital Image Computing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th International Conference on Digital Image Computing: Techniques and Applications, DICTA 2024
Y2 - 27 November 2024 through 29 November 2024
ER -