Abstract
We address the problem of 3D audio-visual mouth tracking using a compact platform with co-located audio-visual sensors and no depth camera. In particular, we propose a multi-modal particle filter that combines a face detector with the mapping of 3D hypotheses onto the image plane. The audio likelihood computation, which relies on a GCC-PHAT based acoustic map, is assisted by the video modality. By combining audio and video inputs, the proposed approach copes with reverberant and noisy environments and handles situations in which the person is occluded, outside the Field of View (FoV), or not facing the sensors. Experimental results show that the proposed tracker is accurate both in 3D and on the image plane.
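For reference, the GCC-PHAT underlying such an acoustic map is conventionally defined, for a microphone pair $(i,j)$ with short-time spectra $X_i(f)$ and $X_j(f)$, as (the exact formulation and notation used in the paper may differ):
\[
\hat{R}_{ij}(\tau) \;=\; \int \frac{X_i(f)\,X_j^{*}(f)}{\bigl|X_i(f)\,X_j^{*}(f)\bigr|}\, e^{\,j 2\pi f \tau}\,\mathrm{d}f ,
\]
with the acoustic map typically obtained by evaluating $\hat{R}_{ij}(\tau)$ at the time delays $\tau$ induced by candidate 3D source positions and summing the resulting scores over microphone pairs.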