Vision and language integration: Moving beyond objects

R Shekhar; S Pezzelle; A Herbelot; M Nabi; E Sangineto; Raffaella Bernardi

Back

Vision and language integration: Moving beyond objects

Conference proceeding

Peer reviewed

Vision and language integration: Moving beyond objects

R Shekhar, S Pezzelle, A Herbelot, M Nabi, E Sangineto and Raffaella Bernardi

12th International Conference on Computational Semantics, IWCS 2017 - Short Papers

12th International Conference on Computational Semantics, IWCS 2017 (Montpellier)

2017

Handle:

https://hdl.handle.net/10863/46802

Abstract

The last years have seen an explosion of work on the integration of vision and language data. New tasks like Image Captioning and Visual Questions Answering have been proposed and impressive results have been achieved. There is now a shared desire to gain an in-depth understanding of the strengths and weaknesses of those models. To this end, several datasets have been proposed to try and challenge the state-of-the-art. Those datasets, however, mostly focus on the interpretation of objects (as denoted by nouns in the corresponding captions). In this paper, we reuse a previously proposed methodology to evaluate the ability of current systems to move beyond objects and deal with attributes (as denoted by adjectives), actions (verbs), manner (adverbs) and spatial relations (prepositions). We show that the coarse representations given by current approaches are not informative enough to interpret attributes or actions, whilst spatial relations somewhat fare better, but only in attention models. © IWCS 2017. All rights reserved.

Files and links (1)

url

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85061739546&partnerID=40&md5=da31897146dabe8ca69561cd11fe3701View

Details

Title: Vision and language integration: Moving beyond objects
Creators: R Shekhar
S Pezzelle
A Herbelot
M Nabi
E Sangineto
Raffaella Bernardi
Publication Details: 12th International Conference on Computational Semantics, IWCS 2017 - Short Papers
Conference: 12th International Conference on Computational Semantics, IWCS 2017 (Montpellier)
Publisher: Association for Computational Linguistics (ACL)
Identifiers: (UNIBZ)89050860
991007045255001241
Scopus ID: 2-s2.0-85061739546
Academic Unit: Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Shekhar R, Pezzelle S, Herbelot A, Nabi M, Sangineto E, Bernardi R
Additional Description: description: Record is part of a bulk validation set

Metrics

1 Record Views