Grounding Dialogue History: Strengths and Weaknesses of Pre-trained Transformers

C Greco; A Testoni; Raffaella Bernardi

doi:10.1007/978-3-030-77091-4_17

Back

Grounding Dialogue History: Strengths and Weaknesses of Pre-trained Transformers

Conference proceeding

Peer reviewed

Grounding Dialogue History: Strengths and Weaknesses of Pre-trained Transformers

C Greco, A Testoni and Raffaella Bernardi

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol.12414 LNAI, pp.263-279

12414 LNAI

19th International Conference of the Italian Association for Artiﬁcial Intelligence, AIxIA 2020 (Virtual, Online)

2021

DOI: https://doi.org/10.1007/978-3-030-77091-4_17

Handle:

https://hdl.handle.net/10863/46772

Abstract

We focus on visually grounded dialogue history encoding. We show that GuessWhat?! can be used as a “diagnostic” dataset to understand whether State-of-the-Art encoders manage to capture salient information in the dialogue history. We compare models across several dimensions: the architecture (Recurrent Neural Networks vs. Transformers), the input modalities (only language vs. language and vision), and the model background knowledge (trained from scratch vs. pre-trained and then fine-tuned on the downstream task). We show that pre-trained Transformers, RoBERTa and LXMERT, are able to identify the most salient information independently of the order in which the dialogue history is processed. Moreover, we find that RoBERTa handles the dialogue structure to some extent; instead LXMERT can effectively ground short dialogues, but it fails in processing longer dialogues having a more complex structure. © 2021, Springer Nature Switzerland AG.

Files and links (1)

url

https://doi.org/10.1007/978-3-030-77091-4_17View

Details

Title: Grounding Dialogue History: Strengths and Weaknesses of Pre-trained Transformers
Creators: C Greco - University of Trento
A Testoni - University of Trento
Raffaella Bernardi - University of Trento
Publication Details: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol.12414 LNAI, pp.263-279
ISBN: 9783030770907
Conference: 19th International Conference of the Italian Association for Artiﬁcial Intelligence, AIxIA 2020 (Virtual, Online)
Series / Volume: 12414 LNAI
Publisher: Springer Science and Business Media Deutschland GmbH
Number of pages: 17
Identifiers: 9783030770907
(UNIBZ)89051575
991007045955201241
Web of Science ID: WOS:000886994000017
Scopus ID: 2-s2.0-85111364206
Academic Unit: Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Greco C, Testoni A, Bernardi R
Additional Description: description: Record is part of a bulk validation set

Metrics

1 Record Views