Exploring the Dataset Landscape for Automated Propaganda Detection: A Data-Centric Insight

Usai M.;Mura D. A.;Loddo A.;Sanguinetti M.;Zedda L.;Di Ruberto C.;Atzori M.
2025-01-01

Abstract

The increasing spread of propaganda in digital media has intensified research efforts toward the development of automated detection systems. Central to this task is the availability and quality of annotated datasets, which directly impact model performance, generalizability, and real-world applicability. In this paper, we present a data-centric insight into the current landscape of datasets used for automated propaganda detection. We analyze a representative set of publicly available corpora with respect to key factors such as annotation schemes, label granularity, domain coverage, linguistic diversity, and class balance. This work aims to guide researchers toward more robust, inclusive, and scalable approaches to propaganda detection by emphasizing the foundational role of data quality and structure.
2025
Inglese
Italian Conference on Big Data and Data Science 2025. Proceedings of the 4th Italian Conference on Big Data and Data Science (ITADATA 2025). Turin, Italy, September 9-11, 2025
CEUR-WS
Bena N., Ceci M., Esposito R., Torlone R., Bruna A., Ardagna C., Polato M., Romano L.
4152
8
https://ceur-ws.org/Vol-4152/
https://ceur-ws.org/Vol-4152/short71.pdf
4th Italian Conference on Big Data and Data Science, ITADATA 2025
Contributo
Esperti anonimi
9 September 2025 - 11 September 2025
Turin, Italy
nazionale
scientifica
Dataset Benchmarking; Propaganda Detection; Span Identification
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Usai, M.; Mura, D. A.; Loddo, A.; Sanguinetti, M.; Zedda, L.; Di Ruberto, C.; Atzori, M.
273
7
4.1 Contributo in Atti di convegno
open
info:eu-repo/semantics/conferencePaper
Files in This Item:
File Size Format  
2025_Exploring the Dataset Landscape for Automated Propaganda Detection_A Data-Centric Insight.pdf

open access

Description: Articolo completo
Type: versione editoriale
Size 204.86 kB
Format Adobe PDF
204.86 kB Adobe PDF View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie