Marco Giovanni Nieddu

Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark

Meloni A.;reforgiato Recupero D.
;
2024-01-01

Abstract

The SciQA benchmark for scientific question answering aims to represent a challenging task for next-generation question-answering systems on which vanilla large language models fail. In this article, we provide an analysis of the performance of language models on this benchmark including prompting and fine-tuning techniques to adapt them to the SciQA task. We show that both fine-tuning and prompting techniques with intelligent few-shot selection allow us to obtain excellent results on the SciQA benchmark. We discuss the valuable lessons and common error categories, and outline their implications on how to optimise large language models for question answering over knowledge graphs.
2024
Inglese
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
9783031606250
9783031606267
Springer Science and Business Media Deutschland GmbH
GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND
14664
199
217
19
21st European Semantic Web Conference, ESWC 2024
Esperti anonimi
2024
grc
scientifica
Few-shot learning
Fine-tuning
Knowledge graphs
Language models
Question answering
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Lehmann, J.; Meloni, A.; Motta, E.; Osborne, F.; Reforgiato Recupero, D.; Salatino, A. A.; Vahdati, S.
273
7
4.1 Contributo in Atti di convegno
open
info:eu-repo/semantics/conferencePaper
File in questo prodotto:
File Dimensione Formato  
Analysis_of_SciQA_ORKG_QA_Benchmark-3 (1).pdf

Open Access dal 21/05/2025

Tipologia: versione post-print (AAM)
Dimensione 331.31 kB
Formato Adobe PDF
331.31 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Questionario e social

Condividi su:
Impostazioni cookie