CN114117443A

CN114117443A - Automatic vulnerability detection method and device based on vulnerability model

Info

Publication number: CN114117443A
Application number: CN202111287922.1A
Authority: CN
Inventors: 马琪灿; 陆丽; 陈慧娟; 曹亚南; 吴博; 宋玉柱
Original assignee: Zhengzhou Aiwen Computer Technology Co ltd
Current assignee: Zhengzhou Aiwen Computer Technology Co ltd
Priority date: 2021-11-02
Filing date: 2021-11-02
Publication date: 2022-03-01

Abstract

The invention provides an automatic vulnerability detection device and method based on a vulnerability model. First, a target binary file to be detected is uploaded, the vulnerability model is run in the reverse result of the binary file, and the possible vulnerability points are returned by matching the characteristics of the vulnerability model. Location. The invention can detect the position of vulnerable points of various binary programs, provide strong support for the dynamic analysis of subsequent vulnerabilities, and has important guidance and reference significance for the analysis of malicious codes of binary programs.

Description

Automatic vulnerability detection method and device based on vulnerability model

Technical Field

The invention relates to the technical field of automatic vulnerability detection, in particular to an automatic vulnerability detection method and device based on a vulnerability model.

Background

Vulnerability refers to some functional or security logical flaw in the system, including all factors that cause threat and damage to the security of the computer system, and is the flaw and deficiency of the computer system in the specific implementation of hardware, software, protocol or system security policy. For a variety of reasons, the existence of vulnerabilities is unavoidable, and once some more serious vulnerabilities are discovered by an attacker, they may be exploited to gain unauthorized access to or destruction of the computer system. Discovering and timely patching vulnerabilities prior to an attacker can effectively reduce threats from the network. Therefore, the system security vulnerability is actively discovered and analyzed, and the method has important significance for network defense and attack warfare.

The current vulnerability detection technologies have many kinds: the traditional manual detection mode is time-consuming and labor-consuming, and has higher requirements on the safety technical level of detection personnel; the simple static scanning mode is used for simply analyzing the fragile point function of the target source program, and the false alarm rate is high.

Disclosure of Invention

In order to solve the problems in the background art, the invention provides an automatic vulnerability detection method and device based on a vulnerability model.

An automatic vulnerability detection method based on a vulnerability model comprises the following steps:

s1, reversing the binary file into an assembly layer language representation file;

s2, performing vulnerability function matching on the reverse binary file;

s3, searching controllable variables by using taint propagation, and further matching the fragile point function meeting the conditions.

Based on the above, step S1 includes:

s101, creating a binary file storage directory for storing the binary file to be detected.

S102, downloading the target binary file to the storage directory created in the step S101; if the target to be detected is the firmware type, extracting the executable program by using a firmware analysis tool and then storing the executable program into the directory created in S101;

and S103, carrying out IDA reverse analysis on the downloaded file of the S102, and converting the file into an assembly layer code.

Based on the above, step S2 includes:

s201, representing codes converted into assembly level in an assembly level language representation file by using a Hex-Rays Decompiler plug-in, and processing the codes which cannot be decompiled by using a self-programming Python script;

s202, in the decompiling layer code in the step S201, different detection scripts are operated according to different vulnerability detection types of the target program, the detection scripts are divided into a character string detection type, a command injection type, a memory copy type, a formatting character string type and a shaping overflow type, and all fragile point functions of the corresponding types are searched;

and S203, extracting all calling function lists by using the Get _ func _ name () function in the idc for the vulnerability extracted in the step S202.

Based on the above, step S3 includes:

s301, generating a ControlFloworant object for one function in the S203 function list by using a FIDL (flame eye) frame;

s302, searching for a controllable variable by utilizing taint propagation, and firstly tracking and searching for a primary pollution variable at least comprising a default controllable variable and a supplementary controllable variable;

s303, further searching secondary pollution variables in the step S302, wherein the secondary pollution variables at least comprise direct controllable variables and indirect controllable variables;

s304, carrying out secondary matching according to the characteristics of the fragile point functions of different categories, and finding out the fragile point function with the vulnerability;

s305, recording and storing the matching result in the step S304;

and S306, repeating all the steps from S301 to S305 until the residual function number in the function list is empty.

An automatic vulnerability detection device based on vulnerability model comprises

The front-end module is used for interacting with a user, displaying a generated result, uploading a file to be detected, selecting a testing vulnerability type and adding a deletion task;

the scheduling module is configured to communicate and schedule between the front-end module and the rear-end module, temporarily store a task queue created by the front-end module and control the start and stop of tasks of the rear-end module;

and the back-end module is used for realizing the technical scheme, internally configures a virtual environment, and returns a result to the scheduling module after running the target task.

Compared with the prior art, the invention has prominent substantive characteristics and remarkable progress, particularly:

(1) compared with the traditional reverse static analysis tool, the reverse analysis and vulnerability detection capability of the binary program are greatly improved, the vulnerability model is utilized for feature matching, the probability of finding a fragile point function is improved, and the accuracy is improved;

(2) the invention provides a front-end page easy to interactively display and a rear-end automatic task processing flow mechanism for a user, the user only needs to upload a binary file to be detected on the front end, and the rear end automatically carries out a vulnerability analysis process and returns a result to the front end for displaying;

(3) the technology of the invention utilizes the stain propagation principle, so that the controllable variables found aiming at the suspected fragile point function are more comprehensive and wider in coverage, and the efficiency of finding the fragile point function is improved.

Drawings

FIG. 1 is a schematic flow diagram of the present invention.

FIG. 2 is a detailed flow diagram of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without inventive effort based on the embodiments of the present invention, are within the scope of the present invention.

As shown in fig. 1 and fig. 2, an automated vulnerability detection method based on vulnerability model includes a step S1 of reversing binary files into assembly layer language representation; s2, performing vulnerability function matching on the reverse binary file; s3, searching for controllable variables by utilizing taint propagation to further match the qualified fragile point function.

Converting the target program into a form of assembly layer language by using an IDA reverse tool, and converting the assembly language into a disassembling form by using a disassembling plug-in; and automatically processing the target program which cannot be directly disassembled into a disassembling language form by using a self-programming script. And respectively searching all fragile point functions of the corresponding types of the vulnerability models by using the character string detection type, the command injection type, the memory copy type, the formatting character string type and the shaping overflow type vulnerability models according to the types of the target program. And tracking primary pollution variables such as default controllable variables and supplementary controllable variables and secondary pollution variables such as direct controllable variables and indirect controllable variables of the matched fragile point function list by using a taint propagation technology, matching by using different vulnerability models again, and screening out the fragile point function with higher accuracy.

Specifically, step S1 includes step S101, creating a binary file saving directory for storing the binary file to be detected. S102, downloading the target binary file to the directory created in S101; if the target to be detected is a firmware type, extracting the executable program by using a firmware analysis tool such as BinWalk and storing the extracted executable program into the directory created in S101. And S103, carrying out IDA reverse analysis on the downloaded file of the S102, and converting the file into an assembly layer code.

Step S2 includes step S201, representing the code converted to assembly level in S103 in Decompiler form using Hex-Rays Decompiler plug-in, and processing the code that cannot be decompiled using self-programming Python script. S202, in the decompilation layer code of the step S201, different detection scripts are operated according to different vulnerability detection types of the target program, the detection scripts are divided into a character string detection type, a command injection type, a memory copy type, a formatting character string type and a shaping overflow type, and all fragile point functions of the corresponding types are searched. And S203, extracting all calling function lists by using a Get _ func _ name () function in idc for the vulnerability extracted in the S202.

Step S3 includes step S301 of generating a contofloworant object for a function in the list of functions of S203 using the FIDL frame. S302, searching for a controllable variable by utilizing taint propagation, and firstly tracking and searching for primary pollution variables such as a default controllable variable, a supplementary controllable variable and the like. And S303, further searching secondary pollution variables such as direct controllable variables, indirect controllable variables and the like in the step S302. And S304, carrying out matching again according to the characteristics of the fragile point functions of different categories, and finding out the fragile point function possibly having the vulnerability. And S305, recording and storing the matching result in the S304. And S306, repeating all the steps from S301 to S305 until the residual function number in the function list is empty.

An automatic vulnerability detection device based on a vulnerability model implementing the technical scheme comprises a front-end module, a scheduling module and a rear-end module: the front-end module is used for displaying interaction with a user and a generated result, and has the functions of uploading a file to be detected, selecting a testing vulnerability type and adding and deleting tasks; the scheduling module is used for communication and scheduling between the front-end functional module and the rear-end functional module, and has the functions of temporarily storing a task queue created by the front end and controlling starting and stopping of a rear-end task; the back end module is used for realizing the technical scheme, a virtual environment exists in the back end module, and the back end module returns a result to the scheduling module after the target task is operated.

It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims

1. An automatic vulnerability detection method based on a vulnerability model is characterized by comprising the following steps:

s2, performing vulnerability function matching on the reverse binary file;

2. The automated vulnerability detection method based on vulnerability model of claim 1, wherein step S1 comprises:

3. The automated vulnerability detection method based on vulnerability model of claim 1, wherein step S2 comprises:

4. The automated vulnerability detection method based on vulnerability model of claim 3, wherein step S3 comprises:

s305, recording and storing the matching result in the step S304;

5. An automated vulnerability detection apparatus based on vulnerability models using the method of any one of claims 1-4, characterized by: comprises that