Knowledge-driven joint posterior revision of named entity classification and linking

Marco Rospocher; Francesco Corcoglioniti

doi:10.1016/J.WEBSEM.2020.100617

Journal article

Peer reviewed

Journal of Web Semantics, Vol.65, pp.1-15

22/10/2020

DOI: https://doi.org/10.1016/J.WEBSEM.2020.100617

Handle:

https://hdl.handle.net/10863/49808

Abstract

Knowledge-based systems

Ontology

Petroleum reservoir evaluation

Quality control

Text processing

Knowledge Representation

In this work we address the problem of extracting quality entity knowledge from natural language text, an important task for the automatic construction of knowledge graphs from unstructured content. In more detail, we investigate the benefit of performing a joint posterior revision, driven by ontological background knowledge, of the annotations resulting from natural language processing (NLP) entity analyses such as named entity recognition and classification (NERC) and entity linking (EL). The revision is performed via a probabilistic model, called jpark, that, given the candidate annotations independently identified by NERC and EL tools on the same textual entity mention, reconsiders the best annotation choice made by the tools in light of the coherence of the candidate annotations with the ontological knowledge. The model can be explicitly instructed to handle the information that an entity can potentially be NIL (i.e., lacking a corresponding referent in the target linking knowledge base), exploiting it for predicting the best NERC and EL annotation combination. We present a comprehensive evaluation of jpark along various dimensions, comparing its performance with and without exploiting NIL information, as well as with three different background knowledge resources (YAGO, DBpedia, and Wikidata) used to build the model. The evaluation, conducted using different tools (the popular Stanford NER and DBpedia Spotlight, as well as the more recent Flair NER and End-to-End Neural EL) with three reference datasets (AIDA, MEANTIME, and TAC-KBP), empirically confirms the capability of the model to improve the quality of the annotations of the given tools, and thus their performance on the tasks they are designed for.
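The idea of a joint posterior revision can be illustrated with a toy example. The sketch below is not the jpark model itself: the mention, the candidate scores, the type table, and the `compatibility` weights are all hypothetical values chosen for illustration, and the scoring rule (product of the two tool confidences and a coherence weight, then normalization) is an assumed simplification. NIL handling is left out.

```python
from itertools import product

# Hypothetical candidates for one mention, e.g. "Paris" (scores are invented).
nerc_candidates = {"PER": 0.55, "LOC": 0.45}                           # NERC tool output
el_candidates = {"dbpedia:Paris_Hilton": 0.40, "dbpedia:Paris": 0.60}  # EL tool output

# Hypothetical background knowledge: coarse ontological types of each entity.
entity_types = {
    "dbpedia:Paris_Hilton": {"PER"},
    "dbpedia:Paris": {"LOC"},
}


def compatibility(nerc_type, entity, coherent=0.9, incoherent=0.1):
    """Assumed coherence weight between a NERC type and a linked entity."""
    return coherent if nerc_type in entity_types.get(entity, set()) else incoherent


def revise(nerc, el):
    """Score every (type, entity) pair jointly and return the best pair."""
    scored = {
        (t, e): p_t * p_e * compatibility(t, e)
        for (t, p_t), (e, p_e) in product(nerc.items(), el.items())
    }
    total = sum(scored.values())
    posterior = {pair: score / total for pair, score in scored.items()}
    best = max(posterior, key=posterior.get)
    return best, posterior


best, posterior = revise(nerc_candidates, el_candidates)
print(best)
```

Taken independently, the two tools would pick `PER` and `dbpedia:Paris`, a combination the type table marks as incoherent. Scoring the pairs jointly favors the coherent pair `LOC` and `dbpedia:Paris`, which is the kind of correction the abstract describes.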

Files and links (1)

url

https://doi.org/10.1016/j.websem.2020.100617

Details

Title: Knowledge-driven joint posterior revision of named entity classification and linking
Creators: Marco Rospocher - University of Verona; Francesco Corcoglioniti - Free University of Bozen-Bolzano
Publication Details: Journal of Web Semantics, Vol.65, pp.1-15
ISSN: 1570-8268
Series / Volume: 65
Publisher: Elsevier
Number of pages: 15
Identifiers: (UNIBZ)86616816; 991006940174301241
Scopus ID: 2-s2.0-85093703848
Academic Unit: Faculty of Computer Science
Language: English
Resource Type: Journal article
Author Names String: Rospocher M, Corcoglioniti F
