Workshop

Deep Convolutional Malware Classifiers Can Learn from Raw Executables and Labels Only

Marek Krčál ⋅ Ondřej Švec ⋅ Martin Bálek ⋅ Otakar Jašek

[ PDF]

Abstract

We propose and evaluate a simple convolutional deep neural network architecture detecting malicious \emph{Portable Executables} (Windows executable files) by learning from their raw sequences of bytes and labels only, that is, without any domain-specific feature extraction nor preprocessing. On a dataset of 20 million \emph{unpacked} half megabyte Portable Executables, such end-to-end approach achieves performance almost on par with the traditional machine learning pipeline based on handcrafted features of Avast.

Chat is not available.