Which turn do neural models exploit the most to solve GuessWhat? diving into the dialogue history encoding in transformers and LSTMs

C Greco; A Testoni; Raffaella Bernardi

Back

Which turn do neural models exploit the most to solve GuessWhat? diving into the dialogue history encoding in transformers and LSTMs

Conference proceeding

Peer reviewed

Which turn do neural models exploit the most to solve GuessWhat? diving into the dialogue history encoding in transformers and LSTMs

C Greco, A Testoni and Raffaella Bernardi

CEUR Workshop Proceedings, Vol.2735, pp.29-43

2735

4th Workshop on Natural Language for Artificial Intelligence, NL4AI 2020 (Virtual, Online)

2020

Handle:

https://hdl.handle.net/10863/46783

Abstract

We focus on visually grounded dialogue history encoding. We show that GuessWhat?! can be used as a "diagnostic"dataset to understand whether State-of-the-Art encoders manage to capture salient information in the dialogue history. We compare models across several dimensions: the architecture (Recurrent Neural Networks vs. Transformers), the input modalities (only language vs. language and vision), and the model background knowledge (trained from scratch vs. pre-trained and then fine-tuned on the downstream task). We show that pre-trained Transformers are able to identify the most salient information independently of the order in which the dialogue history is processed whereas LSTM based models do not. Copyright (c) 2020 for this paper by its authors.

Files and links (1)

url

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85099021948&partnerID=40&md5=e7ba04c3216dc850a1c7978098502997View

Details

Title: Which turn do neural models exploit the most to solve GuessWhat? diving into the dialogue history encoding in transformers and LSTMs
Creators: C Greco
A Testoni
Raffaella Bernardi
Publication Details: CEUR Workshop Proceedings, Vol.2735, pp.29-43
Conference: 4th Workshop on Natural Language for Artificial Intelligence, NL4AI 2020 (Virtual, Online)
Series / Volume: 2735
Publisher: CEUR-WS
Number of pages: 15
Identifiers: (UNIBZ)89053688
991007045055001241
Scopus ID: 2-s2.0-85099021948
Academic Unit: Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Greco C, Testoni A, Bernardi R
Additional Description: description: Record is part of a bulk validation set

Metrics

1 Record Views