Antonella Balestrieri

UniCa About Professors and Researchers Antonella Balestrieri Research Research outcomes (IRIS)

Antonella Balestrieri

Evaluating LLMs for Named Entity Recognition in Scientific Domain with Fine-Tuning and Few-Shot Learning

Buscaldi D.;Dessi D.;Osborne F.;Piras D.;reforgiato Recupero D.

2025-01-01

Abstract

Entity extraction is a crucial step in constructing Knowledge Graphs (KGs) from natural language text. In the scientific domain, Named Entity Recognition (NER) is widely used to analyze research papers and facilitate the generation of knowledge graphs that capture research concepts. Given the vast scale of contemporary research output, this task necessitates automated pipelines to maintain efficiency while ensuring the quality of the extracted knowledge. Large Language Models (LLMs) present a promising solution to this challenge. As such, this paper explores the effectiveness of LLMs for NER in scientific texts, using the SciERC dataset as a benchmark. Specifically, it evaluates different LLM architectures, including encoder-only, decoder-only, and encoder-decoder models, to identify the most effective approach for NER in the computer science domain. By examining the strengths and limitations of each model type, this study aims to provide deeper insights into the applicability of LLMs for entity extraction, ultimately improving the construction of domain-specific KGs.

Short Card

Tab complete

Full Sheet(DC)

         Anno 
       
        2025 
       
         Lingua/e 
       
        Inglese 
       
         Titolo del Volume 
       
        CEUR Workshop Proceedings 
       
         Nome Editore 
       
        CEUR-WS 
       
         Titolo della Collana/serie 
       
        CEUR WORKSHOP PROCEEDINGS 
       
         Volume 
       
        3979 
       
         Numero di pagine 
       
        10 
       
         Codice Scopus 
       
        2-s2.0-105009126483 
       
         Titolo del convegno 
       
        3rd International Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data, SemTech4STLD 2025 
       
         Referee 
       
        Esperti anonimi 
       
         Periodo del Convegno 
       
        2025 
       
         Luogo del Convegno 
       
        svn 
       
         Caratterizzazione prevalente 
       
        scientifica 
       
         Parole chiave 
       
        Knowledge Graph Construction
Large Language Models
Named Entity Recognition
Scholarly Domain 
       
         Presenza di coautori internazionali 
       
        sì 
       
         Tipologia 
       
        4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno 
       
         Tutti gli autori 
       
        Buscaldi, D.; Dessi, D.; Osborne, F.; Piras, D.; Reforgiato Recupero, D.
         
         Tipologia sito docente 
       
        273 
       
         Numero autori 
       
        5 
       
         Tipologia 
       
        4.1 Contributo in Atti di convegno 
       
         Fulltext 
       
        none 
       
         Tipologia 
       
        info:eu-repo/semantics/conferencePaper

Files in This Item:

There are no files associated with this item.

University of Cagliari

University of Cagliari