GEESE: Generating and Evaluating Explanations for Semantic Entailment: A CALAMITA Challenge

Andrea Zaninello; B Magnini

Back

GEESE: Generating and Evaluating Explanations for Semantic Entailment: A CALAMITA Challenge

Conference proceeding

Peer reviewed

GEESE: Generating and Evaluating Explanations for Semantic Entailment: A CALAMITA Challenge

Andrea Zaninello and B Magnini

Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it), pp.1209-1216

Tenth Italian Conference on Computational Linguistics (Clic-it 2024) (Pisa, 04/12/2024–06/12/2024)

2024

Handle:

https://hdl.handle.net/10863/50728

Abstract

In the GEESE challenge, we present a pipeline to evaluate generated explanations for the task of Recognizing Textual Entailment (RTE) in Italian. The challenge focuses on evaluating the impact of generated explanations on the predictive performance of language models. Using a dataset enriched with human-written explanations, we employ two large language models (LLMs) to generate and utilize explanations for semantic relationships between sentence pairs. Our methodology assesses the quality of generated explanations by measuring changes in prediction accuracy when explanations are provided. Through reproducible experimentation, we establish benchmarks against various baseline approaches, demonstrating the potential of explanation injection to enhance model interpretability and performance.

Files and links (1)

url

https://aclanthology.org/2024.clicit-1.133.pdfView

Details

Title: GEESE: Generating and Evaluating Explanations for Semantic Entailment: A CALAMITA Challenge
Creators: Andrea Zaninello
B Magnini
Publication Details: Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it), pp.1209-1216
Editor(s): Dell'Orletta F, Lenci A, Montemagni S, Sprugnoli R
ISBN: 979-12-210-7060-6
Conference: Tenth Italian Conference on Computational Linguistics (Clic-it 2024) (Pisa, 04/12/2024–06/12/2024)
Publisher: CEUR Workshop Proceedings
Pisa
Format: Online
Number of pages: 8
Identifiers: 979-12-210-7060-6
(UNIBZ)91611719
991007250242401241
Scopus ID: n.a.
Academic Unit: Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Zaninello A, Magnini B
Additional Description: Editors/Supervisors: Dell'Orletta F, Lenci A, Montemagni S, Sprugnoli R

Metrics

1 Record Views