Effective Hand Gesture Recognition by Key Frame Selection and 3D Neural Network

This paper presents an approach for dynamic hand gesture recognition by using algorithm based on 3D Convolutional Neural Network (3D_CNN), which is later extended to 3D Residual Networks (3D_ResNet), and the neural network based key frame selection. Typically, 3D deep neural network is used to classify gestures from the input of image frames, randomly sampled from a video data. In this work, to improve the classification performance, we employ key frames which represent the overall video, as the input of the classification network. The key frames are extracted by SegNet instead of conventional clustering algorithms for video summarization (VSUMM) which require heavy computation. By using a deep neural network, key frame selection can be performed in a real-time system. Experiments are conducted using 3D convolutional kernels such as 3D_CNN, Inflated 3D_CNN (I3D) and 3D_ResNet for gesture classification. Our algorithm achieved up to 97.8% of classification accuracy on the Cambridge gesture dataset. The experimental results show that the proposed approach is efficient and outperforms existing methods.

I. INTRODUCTION

II. PROPOSED METHOD

III. EXPERIMENT AND EVALUATION

IV. CONCLUTION

REFERENCES

Effective Hand Gesture Recognition by Key Frame Selection and 3D Neural Network

(0)

(0)

(0)

(0)

Effective Hand Gesture Recognition by Key Frame Selection and 3D Neural Network

(0)

(0) 팝업 열기 팝업 닫기

(0)

(0)

(0)