Daniela Francesca Virdis

Exploring the Dataset Landscape for Automated Propaganda Detection: A Data-Centric Insight

Usai M.;Mura D. A.;Loddo A.;Sanguinetti M.;Zedda L.;Di Ruberto C.;Atzori M.
2025-01-01

Abstract

The increasing spread of propaganda in digital media has intensified research efforts toward the development of automated detection systems. Central to this task is the availability and quality of annotated datasets, which directly impact model performance, generalizability, and real-world applicability. In this paper, we present a data-centric insight into the current landscape of datasets used for automated propaganda detection. We analyze a representative set of publicly available corpora with respect to key factors such as annotation schemes, label granularity, domain coverage, linguistic diversity, and class balance. This work aims to guide researchers toward more robust, inclusive, and scalable approaches to propaganda detection by emphasizing the foundational role of data quality and structure.
2025
Inglese
Italian Conference on Big Data and Data Science 2025. Proceedings of the 4th Italian Conference on Big Data and Data Science (ITADATA 2025). Turin, Italy, September 9-11, 2025
CEUR-WS
Bena N., Ceci M., Esposito R., Torlone R., Bruna A., Ardagna C., Polato M., Romano L.
4152
8
https://ceur-ws.org/Vol-4152/
https://ceur-ws.org/Vol-4152/short71.pdf
4th Italian Conference on Big Data and Data Science, ITADATA 2025
Contributo
Esperti anonimi
9 September 2025 - 11 September 2025
Turin, Italy
nazionale
scientifica
Dataset Benchmarking; Propaganda Detection; Span Identification
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Usai, M.; Mura, D. A.; Loddo, A.; Sanguinetti, M.; Zedda, L.; Di Ruberto, C.; Atzori, M.
273
7
4.1 Contributo in Atti di convegno
open
info:eu-repo/semantics/conferencePaper
File in questo prodotto:
File Dimensione Formato  
2025_Exploring the Dataset Landscape for Automated Propaganda Detection_A Data-Centric Insight.pdf

accesso aperto

Descrizione: Articolo completo
Tipologia: versione editoriale (VoR)
Dimensione 204.86 kB
Formato Adobe PDF
204.86 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Questionario e social

Condividi su:
Impostazioni cookie