arXiv Analytics

arXiv:1910.12539 [cs.CV]

Virtual Piano using Computer Vision

Seongjae Kang, Jaeyoon Kim, Sung-eui Yoon

Published 2019-10-28, Version 1

In this research, piano performances are analyzed based on visual information alone. Computer vision algorithms, e.g., the Hough transform and binary thresholding, are applied to locate the keyboard and individual keys. In addition, Convolutional Neural Networks (CNNs) are used to determine whether specific keys are pressed and with how much intensity, again based only on visual information. For intensity detection in particular, a new method using spatial and temporal CNN models is devised. An early fusion technique is applied in the temporal CNN architecture to analyze hand movement. We also create a new dataset for training each model. For estimating the intensity of a pressed key, both video frames and their optical flow images are used to train the models and to evaluate their effectiveness.
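As a rough illustration of two of the building blocks named in the abstract, the sketch below shows binary thresholding on a toy grayscale strip and one common reading of early fusion, in which consecutive frames are stacked along the channel axis before the first convolution. The function names, the threshold value of 128, and the toy data are assumptions made for illustration; the abstract does not specify the authors' parameters or architecture.

```python
import numpy as np


def binary_threshold(gray, thresh=128):
    """Return a 0/1 mask: 1 where a pixel is brighter than `thresh`.

    The threshold of 128 is an illustrative assumption, not a value
    taken from the paper.
    """
    return (gray > thresh).astype(np.uint8)


def early_fusion_stack(frames):
    """Stack T frames of shape (H, W, C) into one (H, W, T*C) array.

    This is one common way to realize early fusion: the first layer of
    a temporal CNN then sees all T frames at once. Whether the paper
    stacks frames exactly this way is an assumption.
    """
    return np.concatenate(frames, axis=-1)


# Toy keyboard strip: bright white-key pixels (200) around dark black-key pixels (30).
strip = np.array([[200, 200, 30, 30, 200, 200]], dtype=np.uint8)
mask = binary_threshold(strip)

# Three toy RGB frames of size 4x6; frame t is filled with the value t.
frames = [np.full((4, 6, 3), t, dtype=np.float32) for t in range(3)]
fused = early_fusion_stack(frames)
```

Here `mask` marks the bright (white-key) pixels, and `fused` holds the channels of frame 0 first, then frame 1, then frame 2. Optical flow images could be stacked the same way, though that pairing is likewise only a sketch.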

Related articles:
arXiv:2012.13581 [cs.CV] (Published 2020-12-25)
Camouflaged Object Detection and Tracking: A Survey
arXiv:1110.2053 [cs.CV] (Published 2011-10-10, updated 2017-12-27)
Steps Towards a Theory of Visual Information: Active Perception, Signal-to-Symbol Conversion and the Interplay Between Sensing and Control
arXiv:1501.02825 [cs.CV] (Published 2015-01-12)
A Survey on Recent Advances of Computer Vision Algorithms for Egocentric Video