A Graph-Representation-Learning Framework for Supporting Android Malware Identification and Polymorphic Evolution

Alfredo Cuzzocrea, Miguel Quebrado, Abderraouf Hafsaoui, Edoardo Serra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Detecting Malware is an interesting research area, however, as the polymorphic nature of the latter makes it difficult to identify, particularly when using Hash-based detection methods. Unlike image-based strategies, in this research, a graph-based technique was used to extract control flow graphs from Android APK binaries. In order to handle the generated graph, we employ an approach that combines a novel graph representation learning method called Inferential SIR- GN for Graph representation, which retains graph structural similarities, with XGBoost, i.e., a typical Machine Learning model. The approach is then applied to MALNET, a publicly accessible cybersecurity database that contains the image and graph-based Android APK binary representations for a total of 1, 262, 024 million Android APK binary files with 47 kinds and 696 families. The experimental results indicate that our graph-based strategy outperforms the image-based approach in terms of detection accuracy.

Original languageEnglish
Title of host publicationProceedings - 2023 10th IEEE Swiss Conference on Data Science, SDS 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages34-41
Number of pages8
ISBN (Electronic)9798350338751
DOIs
StatePublished - 2023
Event10th IEEE Swiss Conference on Data Science, SDS 2023 - Zurich, Switzerland
Duration: 22 Jun 202323 Jun 2023

Publication series

NameProceedings - 2023 10th IEEE Swiss Conference on Data Science, SDS 2023

Conference

Conference10th IEEE Swiss Conference on Data Science, SDS 2023
Country/TerritorySwitzerland
CityZurich
Period22/06/2323/06/23

Keywords

  • Malware Polymorphism
  • Structural Graph Representation Learning

Fingerprint

Dive into the research topics of 'A Graph-Representation-Learning Framework for Supporting Android Malware Identification and Polymorphic Evolution'. Together they form a unique fingerprint.

Cite this