Pierpaolo Ciccarelli
Data drift in Android malware detection
Minnei, Luca;Eddoubi, Hicham;Sotgiu, Angelo;Pintor, Maura;Demontis, Ambra;Biggio, Battista
2025-01-01
Abstract
Android malware detectors are now widely implemented with machine learning algorithms, trained on large datasets of goodware and malware applications gathered at a fixed moment in time. However, as recent work showed, this domain is not stationary, causing detectors to show degrading performance over time. While recent work pinpoints the presence of such drift, little has been done to isolate its causes and understand the underlying reasons. In this work, we show which features cause the data drift, i.e., new features to appear and old ones that become unreliable. Our experimental evaluation highlights that particular feature groups cause the data drift. However, we also show that removing these highly variable features from the feature set is insufficient to achieve good classification performance.| File | Size | Format | |
|---|---|---|---|
| Data_Drift_in_Android_Malware_Detection.pdf Solo gestori archivio
Type: versione editoriale
Size 366.5 kB
Format Adobe PDF
|
366.5 kB | Adobe PDF | & nbsp; View / Open Request a copy |
| ICMLC-drift-malware.pdf open access
Type: versione pre-print
Size 306.46 kB
Format Adobe PDF
|
306.46 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
University of Cagliari