• Journal of Internet Computing and Services
    ISSN 2287 - 1136 (Online) / ISSN 1598 - 0170 (Print)
    https://jics.or.kr/

A Speaker Detection System based on Stereo Vision and Audio


Jun-Ho An, Kwang-Seok Hong, Journal of Internet Computing and Services, Vol. 11, No. 6, pp. 21-30, Dec. 2010
Full Text:
Keywords: Source Localization, Stereo vision, Speaker Detection

Abstract

In this paper, we propose the system which detects the speaker, who is speaking currently, among a number of users. A proposed speaker detection system based on stereo vision and audio is mainly composed of the followings: a position estimation of speaker candidates using stereo camara and microphone, a current speaker detection, and a speaker information acquisition based on a mobile device. We use the haar-like features and the adaboost algorithm to detect the faces of speaker candidates with stereo camera, and the position of speaker candidates is estimated by a triangulation method. Next, the Time Delay Of Arrival (TDOA) is estimated by the Cross Power Spectrum Phase (CPSP) analysis to find the direction of source with two microphone. Finally we acquire the information of the speaker including his position, voice, and face by comparing the information of the stereo camera with that of two microphone. Furthermore, the proposed system includes a TCP client/server connection method for mobile service.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
An, J. & Hong, K. (2010). A Speaker Detection System based on Stereo Vision and Audio. Journal of Internet Computing and Services, 11(6), 21-30.

[IEEE Style]
J. An and K. Hong, "A Speaker Detection System based on Stereo Vision and Audio," Journal of Internet Computing and Services, vol. 11, no. 6, pp. 21-30, 2010.

[ACM Style]
Jun-Ho An and Kwang-Seok Hong. 2010. A Speaker Detection System based on Stereo Vision and Audio. Journal of Internet Computing and Services, 11, 6, (2010), 21-30.