Modern architectures convolutional neural networks in human activity recognition


H. Mahmoud

Abstract

In recent years, many researchers have applied convolutional neural networks (CNNs) to human activity recognition, as evidenced by the emergence of architectures such as LeNet-5, AlexNet, and VGG16 and of modern architectures such as ResNet, Inception V3, Inception-ResNet, MobileNetV2, NASNet, and PNASNet. The main characteristic of a CNN is its ability to extract features automatically from input images, which facilitates activity recognition and classification. With every additional layer, a convolutional network derives more relevant and complex features. In addition, CNNs have achieved perfect classification on highly similar activities that were previously extremely difficult to classify. In this paper, the researcher evaluated modern CNN architectures in terms of their human activity recognition accuracy and compared the results with state-of-the-art methods. Two public data sets were used: HMDB (shooting a gun, kicking, falling to the floor, and punching) and Weizmann (walking, running, jumping, bending, one-hand waving, two-hand waving, jumping in place, jumping jack, and skipping). The experimental results indicated that the CNN with the NASNet architecture achieved the best performance of the six CNN architectures on both human activity data sets (HMDB and Weizmann).


Journal Identifiers


eISSN: 2735-5985
print ISSN: 2735-5977