Skip to main navigation Skip to search Skip to main content

Binary Code Modularization Method Based on Graph Embedding

  • Shubin Yuan
  • , Chenyu Liu
  • , Jianheng Shi
  • , Yu Han
  • , Wei Pu
  • , Siwei Zhao
  • , Liqun Yang
  • China National Offshore Oil Corp
  • Ltd.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the field of cybersecurity, software reverse engineering serves as a crucial foundational technology, particularly in the analysis of software vulnerabilities and malicious code. A key challenge within this domain is the partitioning of binary code into functional modules, which aids analysts in swiftly and accurately comprehending software structure and functionality, thereby enhancing analysis efficiency. This paper presents a novel approach for binary code module partitioning based on graph embedding, aiming to address limitations inherent in traditional methodologies employed in software reverse engineering. By abstracting software systems into attribute graphs and utilizing graph embedding-based clustering techniques to embed and cluster function nodes, this method adequately considers both node attributes and similarities, thus improving the accuracy and robustness of module partitioning. Notably, the approach employs a graph embedding clustering method based on multi-head attention mechanisms, effectively facilitating module partitioning of binary files. Experimental results demonstrate the significant effectiveness and performance advantages of the proposed method on binary files.

Original languageEnglish
Title of host publication2024 4th IEEE International Conference on Software Engineering and Artificial Intelligence, SEAI 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages146-150
Number of pages5
ISBN (Electronic)9798350374346
DOIs
StatePublished - 2024
Event4th IEEE International Conference on Software Engineering and Artificial Intelligence, SEAI 2024 - Xiamen, China
Duration: 21 Jun 202423 Jun 2024

Publication series

Name2024 4th IEEE International Conference on Software Engineering and Artificial Intelligence, SEAI 2024

Conference

Conference4th IEEE International Conference on Software Engineering and Artificial Intelligence, SEAI 2024
Country/TerritoryChina
CityXiamen
Period21/06/2423/06/24

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • attention mechanisms
  • binary code modularization
  • clustering
  • graph embedding

Fingerprint

Dive into the research topics of 'Binary Code Modularization Method Based on Graph Embedding'. Together they form a unique fingerprint.

Cite this