Marco Giovanni Nieddu


Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark

Lehmann J.; Meloni A.; Motta E.; Osborne F.; Reforgiato Recupero D.; Salatino A. A.; Vahdati S.

2024-01-01

Abstract

The SciQA benchmark for scientific question answering aims to represent a challenging task for next-generation question-answering systems on which vanilla large language models fail. In this article, we provide an analysis of the performance of language models on this benchmark including prompting and fine-tuning techniques to adapt them to the SciQA task. We show that both fine-tuning and prompting techniques with intelligent few-shot selection allow us to obtain excellent results on the SciQA benchmark. We discuss the valuable lessons and common error categories, and outline their implications on how to optimise large language models for question answering over knowledge graphs.


Year: 2024
Language(s): English
Volume title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISBN: 9783031606250; 9783031606267
Publisher: Springer Science and Business Media Deutschland GmbH
Publisher location: Gewerbestrasse 11, Cham, CH-6330, Switzerland
Series title: Lecture Notes in Computer Science
Volume: 14664
First page: 199
Last page: 217
Number of pages: 19
DOI: https://dx.doi.org/10.1007/978-3-031-60626-7_11
Web of Science (UT ISI) code: WOS:001279216400011
Scopus code: 2-s2.0-85194234220
Conference title: 21st European Semantic Web Conference, ESWC 2024
Peer review: anonymous experts
Conference period: 2024
Conference location: grc (Greece)
Main characterisation: scientific
Keywords: Few-shot learning; Fine-tuning; Knowledge graphs; Language models; Question answering
International co-authors: yes
Type: 4 Contribution in conference proceedings (Proceeding)::4.1 Contribution in conference proceedings
All authors: Lehmann, J.; Meloni, A.; Motta, E.; Osborne, F.; Reforgiato Recupero, D.; Salatino, A. A.; Vahdati, S.
Faculty-site type code: 273
Number of authors: 7
Full text: open
Type identifier: info:eu-repo/semantics/conferencePaper

Files in this product:

File	Size	Format
Analysis_of_SciQA_ORKG_QA_Benchmark-3 (1).pdf (Open Access from 21 May 2025; type: post-print version, AAM)	331.31 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Università degli Studi di Cagliari