TY - JOUR
T1 - Deep learning-based human activity recognition using CNN, ConvLSTM, and LRCN
AU - Uddin, Md Ashraf
AU - Talukder, Md Alamin
AU - Uzzaman, Muhammad Sajib
AU - Debnath, Chandan
AU - Chanda, Moumita
AU - Paul, Souvik
AU - Islam, Md Manowarul
AU - Khraisat, Ansam
AU - Alazab, Ammar
AU - Aryal, Sunil
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/1
Y1 - 2024/1
N2 - Human activity recognition (HAR) plays a crucial role in assisting the elderly and individuals with vascular dementia by providing support and monitoring for their daily activities. This paper presents a deep learning (DL)-based approach to HAR, leveraging convolutional neural network (CNN), convolutional long short-term memory (ConvLSTM), and long-term recurrent convolutional network (LRCN) architectures. These models are designed to extract spatial features and capture temporal dependencies in video data, enhancing the accuracy of activity classification. We conducted experiments on the UCF50 and HMDB51 video datasets, encompassing diverse human activities. Our evaluation demonstrates that the ConvLSTM model achieves an accuracy of 82% on UCF50 and 68% on HMDB51, while the LRCN model gives accuracies of 93.44% and 71.55%, respectively. Finally, the CNN model outperforms with an accuracy rate of 99.58% for the UCF50 and 92.70% for the HMDB51 datasets. These significant improvements showcase the effectiveness of integrating convolutional and recurrent neural networks for HAR tasks. Our research contributes to advancing HAR systems with potential applications in healthcare, assisted living, and surveillance. By accurately recognizing human activities, our models can assist in remote patient monitoring, fall detection, and public safety initiatives. These findings underscore the importance of DL in enhancing the quality of life and safety for individuals in various contexts.
AB - Human activity recognition (HAR) plays a crucial role in assisting the elderly and individuals with vascular dementia by providing support and monitoring for their daily activities. This paper presents a deep learning (DL)-based approach to HAR, leveraging convolutional neural network (CNN), convolutional long short-term memory (ConvLSTM), and long-term recurrent convolutional network (LRCN) architectures. These models are designed to extract spatial features and capture temporal dependencies in video data, enhancing the accuracy of activity classification. We conducted experiments on the UCF50 and HMDB51 video datasets, encompassing diverse human activities. Our evaluation demonstrates that the ConvLSTM model achieves an accuracy of 82% on UCF50 and 68% on HMDB51, while the LRCN model gives accuracies of 93.44% and 71.55%, respectively. Finally, the CNN model outperforms with an accuracy rate of 99.58% for the UCF50 and 92.70% for the HMDB51 datasets. These significant improvements showcase the effectiveness of integrating convolutional and recurrent neural networks for HAR tasks. Our research contributes to advancing HAR systems with potential applications in healthcare, assisted living, and surveillance. By accurately recognizing human activities, our models can assist in remote patient monitoring, fall detection, and public safety initiatives. These findings underscore the importance of DL in enhancing the quality of life and safety for individuals in various contexts.
KW - Convolutional long short-term memory
KW - Convolutional neural network
KW - Deep learning
KW - Human activity recognition
KW - Long-term recurrent convolutional network
UR - http://www.scopus.com/inward/record.url?scp=85197390322&partnerID=8YFLogxK
U2 - 10.1016/j.ijcce.2024.06.004
DO - 10.1016/j.ijcce.2024.06.004
M3 - Article
AN - SCOPUS:85197390322
SN - 2666-3074
VL - 5
SP - 259
EP - 268
JO - International Journal of Cognitive Computing in Engineering
JF - International Journal of Cognitive Computing in Engineering
ER -