Multi-class text classification of news data

Maurizio Romano
First
;
Maria Paola Priola
Last
2024-01-01

Abstract

Several Multi-class text classification (MCC) strategies, Namely One-Vs-Rest (OVA), One-Vs-One (OVO), Best-of-Best (BOB), and Error-Correcting-Output-Codes (ECOC), are compared in terms of accuracy and computational efficiency. Each strategy is implemented utilizing several classifiers such as Naïve Bayes, Random Forest, Logistic Regression, Neural Networks, Linear Discriminant Analysis, Support Vector Machine, and the recently-introduced Threshold-based Naïve Bayes (Tb-NB). We run a horse race involving the analysis of the 20News-Group dataset, well known in the literature for its complexity. Our results highlight the importance of choosing the right classifier whilst pairing it with an optimal strategy, providing valuable insights for optimizing classifier performance in MCC classification tasks considering both environmental implications and the need for accurate predictions.
2024
Inglese
Proceedings of the Statistics and Data Science 2024 Conference. New perspectives on Statistics and Data Science
978-88-5509-645-4
PUP
Palermo
Antonella Plaia, Leonardo Egidi, Antonino Abbruzzo
28
33
6
https://unipapress.com/book/proceedings-of-the-statistics-and-data-science-2024-conference/
Statistics and Data Science 2024 - New perspectives on Statistics and Data Science
Esperti anonimi
11-12 Aprile 2024
Palermo, Italia
internazionale
scientifica
Statistical Learning; One-Vs-Rest; One-Vs-All; Naïve Bayes; Tb-Nb; Accuracy; 20NewsGroup dataset
Goal 13: Climate action
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Romano, Maurizio; Priola, MARIA PAOLA
273
2
4.1 Contributo in Atti di convegno
open
info:eu-repo/semantics/conferencePaper
Files in This Item:
File Size Format  
SDS2024.pdf

open access

Type: versione editoriale
Size 459.12 kB
Format Adobe PDF
459.12 kB Adobe PDF View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie