Reducing Data Volume in News Topic Classification: Deep Learning Framework and Dataset

Serreli, Luigi;Marche, Claudio;Nitti, Michele
2024-01-01

Abstract

With the rise of smart devices and technological advancements, accessing vast amounts of information has become easier than ever before. However, sorting and categorising such an overwhelming volume of content has become increasingly challenging. This paper introduces a new framework for classifying news articles based on a Bidirectional LSTM (BiLSTM) network and an attention mechanism. The paper also presents a new dataset of 60,000 news articles from various global sources. Furthermore, it proposes a methodology for reducing data volume by extracting key sentences using an algorithm resulting in inference times that are, on average, 50% shorter than the original document without compromising the system's accuracy. Experimental evaluations demonstrate that our framework outperforms existing methodologies in terms of accuracy. Our system's accuracy has been compared with various works using two popular datasets, AG News and BBC News, and has achieved excellent results of 99.7% and 94.55% respectively.
2024
Inglese
6
153
164
12
Esperti anonimi
scientifica
Data volume; Deep learning; Natural language processing; Topic classification
no
Serreli, Luigi; Marche, Claudio; Nitti, Michele
1.1 Articolo in rivista
info:eu-repo/semantics/article
1 Contributo su Rivista::1.1 Articolo in rivista
262
3
open
Files in This Item:
File Size Format  
Reducing_Data_Volume_in_News_Topic_Classification_Deep_Learning_Framework_and_Dataset.pdf

open access

Type: versione editoriale
Size 2.27 MB
Format Adobe PDF
2.27 MB Adobe PDF View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie