Abstract
Accurate annotations are fundamental for quantifying the performance of multi-sensor and multi-modal object detectors and trackers. However, generating these annotations automatically requires invasive or expensive instrumentation. To mitigate this problem, we present a multi-modal approach that leverages annotations from reference streams (e.g. individual camera views) and measurements from unannotated additional streams (e.g. audio) to infer 3D trajectories via optimization. The core of our approach is a multi-modal extension of Bundle Adjustment with cross-modal correspondence detection that selectively incorporates measurements into the optimization. We apply the proposed approach to fully annotate a new multi-modal, multi-view dataset for multi-speaker 3D tracking.