Tamara Yuliett Forbes Hernández

UniCa About Professors and Researchers Tamara Yuliett Forbes Hernández Research Research outcomes (IRIS)

Tamara Yuliett Forbes Hernández

Exploring the Dataset Landscape for Automated Propaganda Detection: A Data-Centric Insight

Usai M.;Mura D. A.;Loddo A.;Sanguinetti M.;Zedda L.;Di Ruberto C.;Atzori M.

2025-01-01

Abstract

The increasing spread of propaganda in digital media has intensified research efforts toward the development of automated detection systems. Central to this task is the availability and quality of annotated datasets, which directly impact model performance, generalizability, and real-world applicability. In this paper, we present a data-centric insight into the current landscape of datasets used for automated propaganda detection. We analyze a representative set of publicly available corpora with respect to key factors such as annotation schemes, label granularity, domain coverage, linguistic diversity, and class balance. This work aims to guide researchers toward more robust, inclusive, and scalable approaches to propaganda detection by emphasizing the foundational role of data quality and structure.

Short Card

Tab complete

Full Sheet(DC)

         Anno 
       
        2025 
       
         Lingua/e 
       
        Inglese 
       
         Titolo del Volume 
       
        Italian Conference on Big Data and Data Science 2025. Proceedings of the 4th Italian Conference on Big Data and Data Science (ITADATA 2025). Turin, Italy, September 9-11, 2025 
       
         Nome Editore 
       
        CEUR-WS 
       
         Curatore/i del Volume 
       
        Bena N., Ceci M., Esposito R., Torlone R., Bruna A., Ardagna C., Polato M., Romano L. 
       
         Titolo della Collana/serie 
       
        CEUR WORKSHOP PROCEEDINGS 
       
         Volume 
       
        4152 
       
         Numero di pagine 
       
        8 
       
         Codice Scopus 
       
        2-s2.0-105037319821 
       
         URL 
       
        https://ceur-ws.org/Vol-4152/
https://ceur-ws.org/Vol-4152/short71.pdf 
       
         Titolo del convegno 
       
        4th Italian Conference on Big Data and Data Science, ITADATA 2025 
       
         Relazione 
       
        Contributo 
       
         Referee 
       
        Esperti anonimi 
       
         Periodo del Convegno 
       
        9 September 2025 - 11 September 2025 
       
         Luogo del Convegno 
       
        Turin, Italy 
       
         Rilevanza del Convegno 
       
        nazionale 
       
         Caratterizzazione prevalente 
       
        scientifica 
       
         Parole chiave 
       
        Dataset Benchmarking; Propaganda Detection; Span Identification 
       
         Presenza di coautori internazionali 
       
        no 
       
         Tipologia 
       
        4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno 
       
         Tutti gli autori 
       
        Usai, M.; Mura, D. A.; Loddo, A.; Sanguinetti, M.; Zedda, L.; Di Ruberto, C.; Atzori, M.
         
         Tipologia sito docente 
       
        273 
       
         Numero autori 
       
        7 
       
         Tipologia 
       
        4.1 Contributo in Atti di convegno 
       
         Fulltext 
       
        open 
       
         Tipologia 
       
        info:eu-repo/semantics/conferencePaper 
       
         Type: 
       
        4.1 Contributo in Atti di convegno

Files in This Item:

File	Size	Format
2025_Exploring the Dataset Landscape for Automated Propaganda Detection_A Data-Centric Insight.pdf open access Description: Articolo completo Type: versione editoriale Size 204.86 kB Format Adobe PDF View/Open	204.86 kB	Adobe PDF	View/Open

University of Cagliari

University of Cagliari