Abstract
Audio-visual beamforming involves both an acoustic sensor and an omni-camera to form a composite 3D audio-visual representation of the environment. Information from the respective modalities is combined in the process of acoustic localization taking into account high level cognitive features of the signals, namely the presence of specific sounds - speech and tones - which have characteristic signatures in specific spectral bands. We compare the results from two systems. One is a custom-built architecture based on the MH Acoustics Eigenmike microphone array and a consumer grade omni-camera (Bloggie). The other is the commercially available Visisonics Audio-Visual (AV) camera. We show that the performances of the two systems are comparable.
Original language | English (US) |
---|---|
Title of host publication | 2017 51st Annual Conference on Information Sciences and Systems, CISS 2017 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781509047802 |
DOIs | |
State | Published - May 10 2017 |
Event | 51st Annual Conference on Information Sciences and Systems, CISS 2017 - Baltimore, United States Duration: Mar 22 2017 → Mar 24 2017 |
Other
Other | 51st Annual Conference on Information Sciences and Systems, CISS 2017 |
---|---|
Country/Territory | United States |
City | Baltimore |
Period | 3/22/17 → 3/24/17 |
ASJC Scopus subject areas
- Signal Processing
- Information Systems and Management
- Computer Networks and Communications
- Information Systems