Are you a Good Assistant? Assessing LLM Trustability in Task-oriented Dialogues

Tiziano Labruna; Sofia Brenna; G Bonetta; B Magnini

Back

Are you a Good Assistant? Assessing LLM Trustability in Task-oriented Dialogues

Conference proceeding

Open access

Peer reviewed

Are you a Good Assistant? Assessing LLM Trustability in Task-oriented Dialogues

Tiziano Labruna, Sofia Brenna, G Bonetta and B Magnini

Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), Pisa, Italy, December 4-6, 2024, Vol.3878, pp.470-477

CEUR Workshop Proceedings, 3878

Tenth Italian Conference on Computational Linguistics (Clic-it 2024) (Pisa, 04/12/2024–06/12/2024)

2024

Handle:

https://hdl.handle.net/10863/51369

Abstract

Constraint satisfaction

Knowledge base coherence

Llama3 8B

task-oriented dialogues

Despite the impressive capabilities of recent Large Language Models (LLMs) to generate human-like text, their ability to produce contextually appropriate content for specific communicative situations is still a matter of debate. This issue is particularly crucial when LLMs are employed as assistants to help solve tasks or achieve goals within a given conversational domain. In such scenarios, the assistant is expected to access specific knowledge (e.g., a database of restaurants, a calendar of appointments) that is not directly accessible to the user and must be consistently utilised to accomplish the task. In this paper, we conduct experiments to evaluate the trustworthiness of automatic assistants in task-oriented dialogues. Our findings indicate that state-of-the-art open-source LLMs still face significant challenges in maintaining logical consistency with a knowledge base of facts, highlighting the need for further advancements in this area.

Files and links (2)

pdf

55_main_long1.04 MBDownload View

Open Access

url

https://ceur-ws.org/Vol-3878/View

Details

Title: Are you a Good Assistant? Assessing LLM Trustability in Task-oriented Dialogues
Creators: Tiziano Labruna
Sofia Brenna
G Bonetta
B Magnini
Publication Details: Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), Pisa, Italy, December 4-6, 2024, Vol.3878, pp.470-477
Editor(s): Dell'Orletta F, Lenci A, Montemagni S, Sprugnoli R
ISSN: 1613-0073
Conference: Tenth Italian Conference on Computational Linguistics (Clic-it 2024) (Pisa, 04/12/2024–06/12/2024)
Series / Volume: CEUR Workshop Proceedings
3878
Publisher: CEUR-WS
Format: Online
Number of pages: 8
Identifiers: (UNIBZ)94007008
991007295252801241
Scopus ID: 2-s2.0-85214398941
Academic Unit: Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Labruna T, Brenna S, Bonetta G, Magnini B
Additional Description: Editors/Supervisors: Dell'Orletta F, Lenci A, Montemagni S, Sprugnoli R

Metrics

1 Record Views