Please use this identifier to cite or link to this item: https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/16523
Full metadata record
DC Field | Value | Language
dc.contributor.author | Bhagawat, Vasanth C | -
dc.contributor.author | Kalaiarasan, C | -
dc.contributor.author | Chinnaiyan, R | -
dc.date.accessioned | 2024-08-29T05:41:24Z | -
dc.date.available | 2024-08-29T05:41:24Z | -
dc.date.issued | 2023 | -
dc.identifier.citation | pp. 1-6 | en_US
dc.identifier.isbn | 9798350317060 | -
dc.identifier.uri | https://doi.org/10.1109/ICCAMS60113.2023.10526025 | -
dc.identifier.uri | https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/16523 | -
dc.description.abstract | Action recognition is an active research field in computer vision that has gained significant attention due to its practical applications. While deep learning approaches have shown promise, many existing methods fail to effectively capture spatiotemporal features from videos, mainly because they do not adequately consider the diversity of temporal scales. In this paper, we propose a novel approach, termed Long-Short-Term Spatiotemporal Features CNN (LSF CNN), to address this limitation. The LSF CNN consists of two subnetworks: the Long-Term Spatiotemporal Features Extraction Network (LT-Net) and the Short-Term Spatiotemporal Features Extraction Network (ST-Net). The LT-Net extracts long-term spatiotemporal features from stacked RGB images, while the ST-Net utilizes optical flow computed from adjacent frames to extract short-term spatiotemporal features. Additionally, we introduce a novel expression for the optical flow field, which has been empirically shown to outperform traditional methods in action recognition tasks. The spatiotemporal features from both scales are combined in a fully connected layer and fed into a linear Support Vector Machine (SVM) for classification. By leveraging the two-stream architecture and the improved optical flow expression, our proposed approach enables comprehensive learning of deep features in both spatial and temporal domains. Extensive experiments conducted on the HMDB51 and UCF101 datasets validate the effectiveness of our approach, as it significantly improves action recognition accuracy by effectively incorporating long-short-term spatiotemporal information. This research contributes to advancing the state-of-the-art in human action recognition, opening up new possibilities for enhanced performance in various real-world applications. © 2023 IEEE. | en_US
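The abstract describes a two-stream architecture: LT-Net over stacked RGB frames, ST-Net over optical flow from adjacent frames, with both feature vectors fused in a fully connected layer ahead of a linear SVM. As a rough illustration only, here is a minimal PyTorch sketch of that fusion topology; the paper's actual backbones, layer widths, and feature dimension are not given in this record, so every concrete number below (channel counts, the 512-d feature size, the 3D-conv blocks) is a placeholder assumption, not the authors' implementation.

```python
# Hypothetical sketch of the LSF CNN two-stream fusion described in the
# abstract. NOT the authors' implementation: backbones and layer sizes
# are assumptions made for illustration only.
import torch
import torch.nn as nn

class StreamBackbone(nn.Module):
    """Placeholder 3D-CNN feature extractor, same shape for both streams."""
    def __init__(self, in_channels: int, feat_dim: int = 512):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv3d(in_channels, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool3d(2),
            nn.Conv3d(64, 128, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool3d(1),  # collapse (T, H, W) into one vector per clip
        )
        self.fc = nn.Linear(128, feat_dim)

    def forward(self, x):  # x: (batch, channels, frames, height, width)
        return self.fc(self.conv(x).flatten(1))

class LSFCNN(nn.Module):
    """LT-Net on stacked RGB, ST-Net on optical flow, fused in one FC layer."""
    def __init__(self, feat_dim: int = 512):
        super().__init__()
        self.lt_net = StreamBackbone(in_channels=3, feat_dim=feat_dim)  # RGB stack
        self.st_net = StreamBackbone(in_channels=2, feat_dim=feat_dim)  # flow (dx, dy)
        self.fuse = nn.Linear(2 * feat_dim, feat_dim)

    def forward(self, rgb, flow):
        f_long = self.lt_net(rgb)    # long-term spatiotemporal features
        f_short = self.st_net(flow)  # short-term spatiotemporal features
        return self.fuse(torch.cat([f_long, f_short], dim=1))
```

Per the abstract, the fused features would then be classified by a linear SVM (e.g. sklearn.svm.LinearSVC) rather than a softmax head. The paper's improved optical-flow expression is not reproduced in this record; a conventional baseline input for the ST-Net would be pairwise flow such as cv2.calcOpticalFlowFarneback over adjacent grayscale frames.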
dc.language.iso | en | en_US
dc.publisher | 2023 International Conference on New Frontiers in Communication, Automation, Management and Security, ICCAMS 2023 | en_US
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US
dc.subject | CNN | en_US
dc.subject | Machine Learning | en_US
dc.title | Motion-Classification Using CNN and LSTM | en_US
dc.type | Article | en_US
Appears in Collections: Conference Papers

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.