A Digital Humanism perspective on providing language resources to CLARIN in an age of AI commodification: The case of UniTermGPT

Barbara Heinisch


Conference poster


CLARIN Annual Conference 2025 (Vienna, 30/09/2025–02/10/2025)

2025

Handle:

https://hdl.handle.net/10863/51048

Keywords

terminology

language for specific purposes

terminology work

artificial intelligence

research data management

FAIR

CARE

Digital Humanism

Sustainability

Ethics

Abstract

The increasing use of large language models (LLMs) in translation and terminology work raises critical ethical and infrastructural questions, particularly regarding the commodification of open language resources. From a Digital Humanism perspective, this paper presents UniTermGPT, a project that examines how ChatGPT handles university-related terminology across varieties of German (Austrian, German and South Tyrolean) and that contributes FAIR-compliant, annotated corpora and resources to CLARIN. UniTermGPT not only supports LLM benchmarking in specialized translation but also highlights the risk that open language resources will be exploited by commercial LLM providers. By embedding CARE principles and Digital Humanism values such as transparency, inclusivity and epistemic justice into its methodology, this paper argues for stronger safeguards and ethical standards within public infrastructures. These include mechanisms for provenance tracking, responsible licensing, transparent governance and the representation of minority language communities in decisions about their language resources. Ultimately, UniTermGPT illustrates that openness in research data management can be balanced with responsibility, ethical reflection and attention to linguistic diversity. By demonstrating how open resources can support both technological development and broader societal benefits, the project provides a practical example of responsible openness and highlights ways in which infrastructures like CLARIN can facilitate the ethical sharing and use of language data.

Files and links (2)

url

https://www.clarin.eu/event/2025/clarin-annual-conference-2025

url

https://www.clarin.eu/media/9212

Details

Title: A Digital Humanism perspective on providing language resources to CLARIN in an age of AI commodification: The case of UniTermGPT
Creators: Barbara Heinisch
Conference: CLARIN Annual Conference 2025 (Vienna, 30/09/2025–02/10/2025)
Identifiers: (EURAC)30746879; 991007137209201241
Academic Unit: Institute for Applied Linguistics
Language: English
Resource Type: Conference poster
Description coverage: none
Description audience: Scientific
Local Fields: Scientific
Author Names String: Heinisch B
