In this work, we study the ability to use hand gestures for human-machine interaction from wrist-worn sensors. Towards this goal, we design a wrist-worn prototype to capture RGB video stream of hand gestures. Then we built a new wrist-worn gesture dataset (named WiGes) with various subjects in interaction with home appliances in different environments. To the best of our knowledge, this is the first benchmark released for studying hand gestures from a wrist-worn camera. We then evaluate various CNN models for vision-based recognition. Furthermore, we deeply analyze the models that produce the best trade-off between accuracy, memory requirement, and computational cost. We point out that among studied architectures, MoviNet produces the highest accuracy. Then, we introduce a new MoviNet-based two-stream architecture that takes both RGB and optical flow into account. Our proposed architecture increases the Top-1 accuracy by 1.36% and 3.67% according to two evaluation protocols. Our dataset, baselines, and proposed model analysis give instructive recommendations for human-machine interaction using hand-held devices
Bài báo quốc tế
Kho tri thức
/
Bài báo quốc tế
/
Hand gesture recognition from wrist-worn camera for human-machine interaction
Hand gesture recognition from wrist-worn camera for human-machine interaction
Nguyễn Hồng Quân, Lê Trung Hiếu, Trần Trung Kiên, Hoàng Nhật Tân, Trần Thị Thanh Hải, Lê Thị Lan, Vũ Hải, Nguyễn Thanh Phương, Nguyễn Hữu Thanh, Phạm Văn Cường
Xuất bản trên:
Ngày đăng:
2023
Nhà xuất bản:
Institute of Electrical and Electronics Engineers Inc.
Địa điểm:
Từ khoá:
Cameras, Prototypes, Human computer interaction, Gesture recognition, Sensors, Feature extraction, Computer architecture, Convolutional neural networks, Wearable sensors
Bài báo liên quan
Sign Language Recognition With Self-Learning Fusion Model
Vũ Hoài Nam, Phạm Văn Cường, Hoàng Mậu Trung, Trần Tiến CôngPerson re-identification from multiple surveillance cameras combining face and body feature matching
Nguyen Xuan Ha, Hoang Nhu Dong, Nguyen V Thang, Pham D An, Nguyen Duc Toan, Đặng Minh TuấnPerformance Analysis of AV1 video codec
Nguyễn Thị Thu Hiên, Lê Thanh ThủyMeasurement of Bubbly Two-Phase Flow in a Vertical Pipe Using Ultrasonic Velocity Profiler and Digital Optical Imaging
Nguyễn Tất ThắngLearning Binary Codes for Fast Image Retrieval with Sparse Discriminant Analysis and Deep Autoencoders
Đào Thị Thúy Quỳnh, An Hồng Sơn, Nguyễn Hữu Quỳnh, Cù Việt Dũng, Ngô Quốc TạoLearning Adaptive Motion Search for Fast Versatile Video Coding in Visual Surveillance Systems
Hoàng Văn Xiêm, Nguyễn Quang Sang, Bùi Thanh Hương, Vũ Hữu TiếnInformation extraction from Visually Rich Documents using graph convolutional network
Trịnh Thịnh, Nguyễn Trọng Khánh