CN111104169B

CN111104169B - Instruction list scheduling method, device, computer equipment and storage medium

Info

Publication number: CN111104169B
Application number: CN201911214046.2A
Authority: CN
Inventors: 不公告发明人
Original assignee: Shanghai Cambricon Information Technology Co Ltd
Current assignee: Shanghai Cambricon Information Technology Co Ltd
Priority date: 2017-12-29
Filing date: 2017-12-29
Publication date: 2021-01-12
Anticipated expiration: 2037-12-29
Also published as: CN109992307B; CN111104169A; CN109992307A

Abstract

The invention relates to an instruction list scheduling method, device, computer equipment and storage medium. The method obtains all selected nodes for each instruction selection in the instruction scheduling process by analyzing the data dependency relationship of the instructions to be scheduled, and then according to the order of each order The evaluation result of the corresponding selection node determines the instructions in each order in the scheduled instruction list. This method can ensure that each time an instruction is selected, the selected instruction is the optimal result of the current state, and in the scheduled instruction list obtained by using these optimal results, the arrangement of each instruction is more compact, which is convenient for shortening the instructions in the original instruction list. The execution time of the sequence.

Description

Instruction list scheduling method and device, computer equipment and storage medium

The application is a division of a chinese application 2017114844108 entitled "instruction list scheduling method, apparatus, computer device, and storage medium" filed on 29/12/2017.

Technical Field

The present invention relates to the field of computer technology in information technology, and in particular, to a method and an apparatus for scheduling an instruction list, a computer device, and a storage medium.

Background

With the rapid development of computer technology, a Multi-processor computer System (Multi-processor Computing System) including a plurality of first processors, such as a Multi-core computer System (Multi-processor Computing System) and a Heterogeneous computer System (Heterogeneous Computing System), has appeared. The plurality of first processors of the computer system can process different instructions in parallel according to the instruction lists corresponding to the plurality of first processors, so that the processing efficiency of the computer system is improved.

However, the order of the instructions in the instruction list corresponding to the plurality of first processors of the computer system may not be reasonable, for example, the instructions in the instruction list are not made to be parallel as much as possible, which may not improve the processing efficiency of the computer system or improve the efficiency.

Therefore, it is an urgent technical problem to provide a method, an apparatus, a computer device and a storage medium for scheduling an instruction list, so as to adjust the order of instructions in the instruction list, to make the arrangement between the instructions in the instruction list more compact, and to shorten the execution time of the instruction list.

Disclosure of Invention

Based on this, it is necessary to provide an instruction list scheduling method, apparatus, computer device, and storage medium for solving the problem of unreasonable instruction sequence ordering in an instruction list used by a processor.

An instruction list scheduling method, comprising: acquiring a to-be-scheduled instruction set in a to-be-scheduled instruction list, and performing data dependency analysis on the to-be-scheduled instruction set to obtain a data dependency relationship among instructions in the to-be-scheduled instruction set;

obtaining all selection nodes for instruction selection each time in the instruction scheduling process according to the data dependency relationship among the instructions;

and according to a preset rule, determining the instructions in each order in the scheduled instruction list according to the selected nodes in the corresponding order.

In one embodiment, the step of determining, according to a preset rule, instructions in each order in a scheduled instruction list according to the selection node in the corresponding order includes:

accessing the selection node and acquiring the longest execution time corresponding to the currently accessed selection node;

if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the sequenced instructions of the currently accessed selection node as instructions in the corresponding sequence in a scheduled instruction list;

the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the method comprises:

and if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, updating the initial execution time to the longest execution time corresponding to the currently accessed selection node.

In one embodiment, the step of accessing the selected node and obtaining the longest execution time corresponding to the currently accessed selected node includes:

accessing the selection node in a preset access time period, and acquiring the longest execution time corresponding to the currently accessed selection node;

if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the ordered instructions corresponding to the currently accessed selection node as the instructions in the corresponding order in the scheduled instruction list;

In one embodiment, if the longest execution time corresponding to the currently accessed selection node is not less than the initial execution time, the instruction sequence in the instruction table to be scheduled is used as the instruction sequence in the instruction table after scheduling.

and selecting the selected node to access according to a random priority rule, and acquiring the longest execution time corresponding to the selected node which is selected to access currently.

and selecting the selected node for access according to a breadth-first rule, and acquiring the longest execution time corresponding to the selected node which is selected to be accessed currently.

and selecting the selected node for access according to a depth-first rule, and acquiring the longest execution time corresponding to the selected node which is selected to be accessed currently.

selecting the selected nodes with the sequence less than the preset sequence for access according to a breadth or random priority rule to obtain the longest execution time corresponding to the selected node which is selected for access currently;

and selecting the selected nodes not less than the preset sequence according to a depth-first rule for access to obtain the longest execution time corresponding to the selected nodes selected to be accessed currently.

obtaining the shortest execution time corresponding to the currently accessed selection node;

if the shortest execution time corresponding to the currently accessed selection node is greater than the initial execution time, terminating accessing the selection node associated with the currently accessed selection node;

In one embodiment, according to a preset rule, the step of determining the instructions in each order in the scheduled instruction list according to the selection node in the corresponding order includes:

and evaluating all the selected nodes corresponding to the current order according to the preset priority of the instruction to obtain the evaluation result of each selected joint in the current order, and determining the instruction corresponding to the current order according to the evaluation result.

In one embodiment, the method comprises: the priority of each instruction is set according to the specific content and/or type of the currently selected node.

and determining the instruction corresponding to the current sequence according to the length of the shortest execution time corresponding to all the selected nodes in the current sequence.

An instruction scheduling apparatus comprising: an acquisition module, a data dependence analysis module and an evaluation module,

the obtaining module is used for obtaining a to-be-scheduled instruction set in the to-be-scheduled instruction list and obtaining all selection nodes corresponding to each instruction selection in the instruction scheduling process according to the data dependency relationship among the instructions;

the data dependency analysis module is used for carrying out data dependency analysis on the instruction set to be scheduled to obtain a data dependency relationship among the instructions;

and the evaluation module is used for determining the instructions in each order in the scheduled instruction list according to a preset rule and the selected nodes in the corresponding order.

A computer device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor performing the steps of the above mentioned method.

A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of carrying out the above-mentioned method.

Compared with the prior art, the instruction list scheduling method, the instruction list scheduling device, the computer equipment and the storage medium have the following beneficial effects:

and obtaining all selection nodes corresponding to each instruction selection in the scheduling process by analyzing the data dependency of the instructions to be scheduled, and determining the instructions of each order in the scheduled instruction list according to the evaluation result of the selection nodes corresponding to each order. The method can ensure that the selected instruction is the optimal result of the current state when the instruction is selected every time, the arrangement among the instructions is more compact by using the scheduled instruction list obtained by the optimal results, and the execution time of the instruction sequence in the original instruction list is convenient to shorten.

Drawings

FIG. 1 is a diagram illustrating a computer system according to one embodiment;

FIG. 2 is a flowchart illustrating the steps of a method for scheduling instruction lists according to one embodiment;

FIG. 3 is a diagram of data dependencies for instructions to be scheduled, obtained in one embodiment;

FIG. 4 is a correlation diagram of select nodes obtained in one embodiment;

FIG. 5 is a block diagram of an instruction list scheduler according to an embodiment of the present invention;

fig. 6 is an internal structural diagram of a computer device according to an embodiment.

Detailed Description

In order to make the objects, technical solutions and technical effects of the present invention more apparent, specific embodiments of the present invention are described below with reference to the accompanying drawings. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. It should be clear that "first", "second", etc. in this embodiment are only used to distinguish the described objects, and do not have any order or technical meaning.

As shown in fig. 1, the computer System 100 according to an embodiment of the present invention may be a Multi-processor computer System (Multi-core processor Computing System) including a plurality of processors, such as a Multi-core processor computer System (Multi-core processor Computing System), a Heterogeneous computer System (Heterogeneous Computing System), and the like. Optionally, the computer system may specifically include an instruction list scheduling apparatus 110, a plurality of first processors 120, and a memory 130, the plurality of first processors 120 may be connected to the instruction list scheduling apparatus 110 at the same time, and the instruction list scheduling apparatus 110 may be used for instruction list rescheduling of the plurality of first processors 120. Optionally, the instruction list scheduling device 110 may also include a second processor. Optionally, the second processor may include an obtaining module, a data dependency analysis module, an evaluation module, an operation module, a control module, and the like, where the obtaining module may be a hardware module such as an IO (Input/Output) interface, and the operation module and the control module are both hardware modules.

The plurality of first processors 120 may process different instructions in parallel according to the instruction list to improve the processing efficiency of the computer system. Optionally, the instruction list may include one or more instructions, each instruction includes a set of reference operations for a resource, and the resource referenced by the instruction may be known by reading or executing the instruction. I.e., when the first processor or the like executes the instruction, the resource referenced by the instruction may be called to implement the particular operation. For example, the instruction may be a Load instruction (Load), a computation instruction (computing), a store instruction (store), or the like, but the instruction may also be an N-level computation of a neural network, N >0, and N may be an integer or a non-integer.

Furthermore, the instructions in the instruction list are arranged according to an execution sequence, and the resource referred by each instruction in the instruction list may be a virtual memory object or a physical memory object. The virtual memory object may be a memory block, a register, or other virtual memory space of a storage device capable of storing data in software logic. The instruction scheduling process in this embodiment is a process of reordering the instructions in the instruction list on the premise of ensuring that the semantics of the original instruction list are not changed, which may make the arrangement between the instructions in the instruction list more compact, so as to shorten the execution time of the instruction list and improve the processing efficiency of the system.

For example, the instruction list includes N instructions, where N ≧ 1, N is a positive integer, and the N instructions are labeled as a first instruction, a second instruction, … …, and an Nth instruction according to execution timing. The scheduling process of the instruction list is a process of reordering the N instructions.

Specifically, when scheduling the instruction list, the instruction list scheduling apparatus 110 may first obtain the data dependency relationship of each instruction in the instruction list to be scheduled. Alternatively, the form of the data dependency may include RAW (Read After Write)/WAR (Write After Read)/ww (Write After Write). Alternatively, the Data dependency relationship may be described by a Data dependency Graph DDG (Data dependency Graph). Further, the second processor of the instruction list scheduling apparatus 110 may obtain the instruction list to be scheduled through the obtaining module thereof, and perform data dependency analysis on the instructions in the instruction list to be scheduled through the data dependency analysis module thereof to obtain the data dependency relationship between the instructions. Specifically, the data dependency analysis module may perform resource scanning and tracking on each instruction in the instruction list to be scheduled, so as to analyze the data dependency relationship between the instructions. The data dependency between the instructions in this embodiment refers to whether the execution of the current instruction needs to depend on the execution results of other instructions. For example, if there is instruction A "read data written by instruction B" then instruction A depends on the result of instruction B. Then, the obtaining module may obtain all the selection nodes that perform instruction selection each time in the instruction scheduling process according to the obtained data dependency relationship between the instructions.

Then, the instruction list scheduling device may determine, through the evaluation module, instructions in each order in the scheduled instruction list from all the selected nodes in the corresponding order according to a preset rule. Optionally, the second processor may evaluate the selection node corresponding to the current order through the evaluation module of the second processor, obtain an evaluation result of each selection node of the current order, and determine the instruction corresponding to the current order according to the evaluation result. Each selection node records the ordered instruction and the instruction set to be scheduled corresponding to the selection node. Optionally, the evaluation module evaluates the selected nodes corresponding to the current order according to the priority of each instruction. Optionally, the second processor may also set the priority of the instruction according to the specific content and/or type of the currently selected node.

Optionally, when the instruction list scheduling apparatus 110 performs instruction scheduling, the first processor corresponding to an instruction in the instruction list to be scheduled may be adjusted. For example, the first processor corresponding to the instruction to be scheduled may be determined according to the type of the instruction, or the specific content of the instruction to be scheduled.

Fig. 2 is a flowchart illustrating steps of an instruction list scheduling method according to an embodiment of the present invention, which can be applied to the computer system described above. The computer system may include a memory 130 and a plurality of first processors 120. The instruction list scheduling method is used for realizing the rescheduling of the instructions in the instruction list corresponding to the plurality of first processors in the computer system so as to improve the processing efficiency of the computer. Specifically, the method may include the steps of:

step S100: the method comprises the steps of obtaining an instruction set to be scheduled in an instruction list to be scheduled, and carrying out data dependency analysis on the instruction set to be scheduled to obtain a data dependency relationship among instructions in the instruction set to be scheduled.

Specifically, the second processor may obtain, through the obtaining module, an instruction set to be scheduled of the instruction list to be scheduled, and obtain, through the data dependency analysis module, a data dependency relationship of the instruction. The instruction set to be scheduled in this embodiment is composed of a plurality of instructions to be scheduled in an instruction list to be scheduled. Optionally, the instruction set to be scheduled does not include a semantic-free instruction (e.g., a synchronization instruction) in the instruction list to be scheduled. Further, the step of acquiring the instruction set to be scheduled of the instruction list to be scheduled by the acquiring module includes: and acquiring a list of instructions to be scheduled, and deleting the semantic-free instructions in the list of instructions to be scheduled to obtain an instruction set to be scheduled.

For example, the instruction set to be scheduled acquired by the acquisition module includes six instructions { L1, L2, C1, C2, S1, S2 }. Wherein, L1, C1 and S1 need to be executed in sequence, L2, C2 and S2 need to be executed in sequence, and the rest instructions have no data dependency relationship. L1, L2, S1, S2 are I/O instructions, and C1, C2 are compute instructions. The Data dependency analysis module performs Data dependency analysis on the instruction to be scheduled to obtain a Data dependency relationship among instructions in the instruction set to be scheduled, and the Data dependency relationship is described by using a DDG (Data dependency Graph) as shown in fig. 3.

The resource referred by each instruction to be scheduled in the instruction list to be scheduled may be a virtual memory object or a physical memory object. The virtual memory object may be a memory block, a register, or other virtual memory space of a storage device capable of storing data in software logic.

Step S200: and obtaining all selection nodes for instruction selection each time in the instruction scheduling process according to the data dependency relationship among the instructions.

Each selection node records the ordered instruction and the instruction set to be scheduled corresponding to the selection node. Optionally, all the selected processes may be obtained as: the second processor preferentially obtains all first selection nodes during the first instruction selection through the obtaining module, specifically, obtains the ordered instructions and the instruction set to be scheduled corresponding to the first selection nodes. It should be clear that there are data dependencies for each instruction in these sets of instructions to be scheduled. And then the second processor acquires all second selection nodes associated with each first selection node through the acquisition module according to the data dependency relationship of each first selection node, wherein the second selection nodes correspond to the second instruction selection. And (4) circulating the steps to obtain a third selection node … … and an Nth selection node, wherein N is more than or equal to 3, and N is a positive integer. The sum of the first selected node, … …, and the nth selected node obtained in the above steps constitutes all selected nodes for instruction selection at a time.

For example, the instruction set to be scheduled in the acquired instruction list to be scheduled contains six instructions: { L1, L2, C1, C2, S1, S2}, the data dependency relationship among these six instructions is shown in FIG. 3. It can be clearly seen from fig. 3 that the six instructions L1 and L2 in the instruction set to be scheduled may not depend on the execution of other instructions, and therefore, when the instruction is selected for the first time, the instruction needs to be selected from L1 and L2, that is, the obtained first selection node corresponds to two cases of selecting the instruction L1 or L2. When L1 is selected at the time of the first instruction selection, L1 is an ordered instruction, at which time the first selection node records the ordered instruction L1, and deletes the instruction set to be scheduled { L2, C1, C2, S1, S2} of instruction L1. Similarly, another first selection node is obtained when the first instruction is selected at the time of selecting the L2, and the first selection node records the sorted instruction L2 and deletes the instruction set to be scheduled { L1, C1, C2, S1 and S2} of the instruction L2. The above process is looped to obtain the second selection node … … in the second instruction selection and the sixth selection node in the sixth instruction selection.

In this implementation step, each time instruction selection is performed, an instruction set to be scheduled, which is obtained according to a previous instruction selection, for example, an instruction set to be scheduled corresponding to fig. 3, is required to be selected, when an instruction selected in the first instruction selection is L1 (corresponding to one of the first selection nodes), an instruction set to be scheduled { L2, C1, C2, S1, S2} is obtained, instructions L2 and C1 in the instruction set of the first selection node may not depend on execution of other instructions, and at this time, when a second instruction selection is performed, selection from L2 and C1 is required (two second selection nodes are correspondingly present); when the selected instruction is L2 (corresponding to another first selection node) at the time of first instruction selection, the obtained instruction set to be scheduled { L1, C1, C2, S1, S2}, the instructions L1 and C2 in the scheduling instruction set of the first selection node may not depend on the execution of other instructions, and at this time, the instruction needs to be selected from L1 and C2 (corresponding to the existence of two second selection nodes) at the time of second instruction selection. Therefore, there is an association between all the selected nodes obtained in this embodiment, and such an association between the selected nodes can be represented by fig. 4.

Step S300: and according to a preset rule, determining the instructions in each order in the scheduled instruction list according to the selected nodes in the corresponding order. Optionally, the second processor may evaluate the selection node corresponding to the current order through the evaluation module, and determine the instruction corresponding to the current order according to the evaluation result of each selection node of the current order. For example, if the current order is the second instruction, then, corresponding to the second selection node in fig. 4, four second selection nodes in fig. 4 are evaluated according to a preset rule, and the second instruction in the scheduled instruction list is obtained according to the evaluation result. Optionally, the evaluation module evaluates the selected node (e.g., … … after C1 with the highest priority of L2) corresponding to the current order according to the preset priority of each instruction, so as to obtain an evaluation result. Optionally, the second processor sets the priority of each instruction according to the specific content and/or type of the currently selected node.

Optionally, the evaluation module may determine the instruction corresponding to the current order according to the length of the shortest execution time corresponding to all the selected nodes of the current order. For example, the shortest execution time of the instruction sequence corresponding to the first selected node corresponding to the instruction L1 in fig. 4 is t₁The shortest execution time of the corresponding instruction sequence is t for the first selected node corresponding to the instruction L2₂，t₁＞t₂Then L2 is determined to be the first instruction in the scheduled instruction list. Similarly, the second instruction, … …, sixth instruction of the scheduled instruction list is determined.

In the instruction list scheduling method provided in this embodiment, all the selection nodes that perform instruction selection each time in the instruction scheduling process are obtained by analyzing the data dependency relationship of the instruction to be scheduled, and then the instruction in each order in the scheduled instruction list is determined according to the evaluation result of the selection node corresponding to each order. The method can ensure that the selected instruction is the optimal result of the current state when the instruction is selected every time, the arrangement among the instructions is more compact by using the scheduled instruction list obtained by the optimal results, and the execution time of the instruction sequence in the original instruction list is convenient to shorten.

As an optional implementation manner, the step of determining, by the evaluation module, the instructions in each order in the post-scheduling instruction list according to the preset rule and in the selection node in the corresponding order includes:

step 210: and the evaluation module accesses the selection node and acquires the longest execution time corresponding to the currently accessed selection node. The selection nodes accessed by the evaluation module may be a first selection node, a second selection node, … …, an nth selection node.

Step S220: if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time T₀Then, the ordered instruction of the current access node is determined as the corresponding instruction in the scheduled instruction list. The initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

The longest execution time corresponding to the currently accessed selection node in this implementation step is the execution time when the arrangement of the instruction sequence corresponding to the currently accessed node is least reasonable. For example, the longest execution time corresponding to the first and second selection nodes on the left side in FIG. 4 is T₁＝t₁+t₂+t₃+t₄+t₅Wherein, t₁To be arranged alreadyExecution time, t, of the sequential instructions L1-L2₂Is the execution time of instruction C1; t is t₃Is the execution time, t, of instruction S1₄Is the execution time, t, of instruction C2₅The execution time of the instruction S2 is the case when the unordered instructions C1, C2, S1, and S2 corresponding to the selected node are not parallel at all and are sorted least reasonably. If T₁＜T₀Then L1, L2 are taken as the first instruction and the second instruction in the scheduled list of instructions, respectively.

Because the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, the execution time of the instruction sequence obtained by the instruction list scheduling method provided in this embodiment is not greater than the instruction sequence in the instruction list to be scheduled.

Since the evaluation module of this embodiment accesses the selected node accessed according to the preset rule, instructions in the instruction list are not scheduled only according to the selected node in the current order, and the influence of the determined instruction in the current order on the selection of subsequent instructions can be avoided. The method is particularly suitable for scheduling an instruction list containing instructions with large calculation amount, and optionally an instruction list containing neural network operation instructions. For example, the instruction list includes N instructions, where the N instructions include a weight load instruction a and a neural network convolutional layer calculation instruction B, and if the conventional method is used, the instruction a and the instruction B may not be parallel to achieve the highest processing efficiency of the system, and the instruction list scheduling scheme of this embodiment may implement the parallel of the instruction a and the instruction B in the scheduled instruction list.

In one embodiment, the method may further include: and if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, updating the initial execution time to the longest execution time corresponding to the currently accessed selection node. For example, in the above embodiment, when T is₁＜T₀Then, L1 and L2 are used as the first instruction and the second instruction in the scheduled instruction list, and T is used as the first instruction and the second instruction in the scheduled instruction list₁Updated to the initial execution time.

It should be clear that, when the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, the ordered instruction corresponding to the currently accessed selection node is determined as the instruction in the corresponding order in the scheduled instruction list, and it can be ensured that the execution time of the instruction sequence in the obtained scheduled instruction list is shorter. The scheme for updating the initial execution time is to further optimize the ordering of the instructions and improve the processing efficiency of the system.

As an optional implementation manner, the step of accessing, by the evaluation module, the selected node and obtaining the longest execution time corresponding to the currently accessed selected node includes:

and accessing the selection nodes in a preset access time period to obtain the longest execution time corresponding to each selection node in the preset access time period. The present embodiment needs to determine the instructions in each order of the scheduled instruction list by combining the methods proposed in the above embodiments.

Because a plurality of instructions to be scheduled generally exist in the instruction list, the number of the selection nodes obtained according to the instructions to be scheduled is huge, and sufficient time is difficult to traverse all the selection nodes in actual operation. Based on the above, the object of the present invention can be achieved as long as the execution time is shortened by the new instruction list obtained by the instruction list scheduling method provided by the present invention. Therefore, when the instruction list scheduling method provided by the invention is actually used for instruction reordering, the access time period is generally set according to actual requirements, and the scheduling time of the instruction is controlled.

As an optional implementation manner, if the longest execution time corresponding to the currently accessed selection node is not less than the initial execution time, the instruction sequence in the instruction table to be scheduled is taken as the instruction sequence in the instruction table after scheduling.

In this embodiment, the longest execution time corresponding to the currently accessed selection node is not less than the initial execution time, and taking the instruction sequence in the instruction table to be scheduled as the instruction sequence in the instruction table after scheduling is the optimization of the instruction list scheduling method proposed in the above embodiment. The method can ensure that the obtained instruction sequence in the scheduled instruction list is the optimal result obtained in the preset time period.

As an optional implementation manner, the step of accessing the selection node and obtaining the longest execution time corresponding to the currently accessed selection node includes:

step 230: and the evaluation module acquires the shortest execution time corresponding to the currently accessed selected node.

Step 240: if the shortest execution time corresponding to the currently accessed selection node is larger than the initial execution time T₀Then the access to the selected node associated with the currently accessed selected node is terminated. For example, the shortest execution time of the second selected node corresponding to the instruction L2 is T₂，T₂The unordered instructions C1, C2, S1 and S2 corresponding to the selected node are perfectly parallel, and the sorting is the most reasonable. If T₂＞T₀Then access to the third selection nodes associated with the second selection node and the fourth selection nodes associated with these third selection nodes, … …, sixth selection node, is terminated.

Because the evaluation module consumes time when accessing one selection node, the technical scheme of the embodiment can eliminate invalid access to the selection node, and improve the scheduling efficiency of the instruction list.

As an optional implementation manner, the step of accessing, by the evaluation module, the selected node, and acquiring the longest execution time corresponding to the selected node that is currently selected and accessed includes: the evaluation module selects the selected node for access according to random priority (such as Monte Carlo Tree Search, MCTS, Monte Carlo Tree Search), and obtains the longest execution time corresponding to the selected node selected to be accessed currently.

As an optional implementation manner, the step of accessing, by the evaluation module, the selected node and obtaining the longest execution time corresponding to the currently accessed selected node includes: and the evaluation module selects the selected node for access according to a rule of Breadth First (BFS) and acquires the longest execution time corresponding to the selected node which is selected to be accessed currently. Specifically, breadth-first in this embodiment refers to preferentially selecting a selected node in the same order as the currently accessed selected node for access. For example, if the second selection node is currently accessed, the next accessed selection node preferentially selects the other second selection nodes.

As an optional implementation manner, the step of accessing, by the evaluation module, the selected node and obtaining the longest execution time corresponding to the currently accessed selected node includes: and the evaluation module selects the selected node for access according to a Depth First Search (BFS) rule and acquires the longest execution time corresponding to the selected node which is selected to be accessed currently. Specifically, depth-first in this embodiment refers to preferentially selecting a selection node in the next order associated with a currently accessed selection node for access. For example, if the currently accessed selection node is the second selection node, the next accessed selection node preferentially selects the third selection node associated with the second selection node.

Optionally, the evaluation module may also select the selected node for access by using a rule combining random preference with depth preference, or select the selected node for access by using a rule combining breadth preference with depth preference. Specifically, selecting the selection nodes smaller than a preset sequence for access according to a breadth or random priority rule to obtain the longest execution time corresponding to the selection node selected for access currently; and selecting the selected nodes not less than the preset sequence according to a depth-first rule for access to obtain the longest execution time corresponding to the selected nodes selected to be accessed currently. Optionally, the preset values of the corresponding sequences are determined according to empirical values, or according to pre-experimental results.

When an access time period is set for instruction list scheduling, an evaluation module of the instruction list scheduling apparatus does not have enough time to traverse all the selection nodes, and at this time, if a single rule of depth-first or breadth-first is adopted to select the selection nodes for access, the range of the selection nodes to be accessed finally may be relatively unilateral (for example, only the selection nodes associated with a certain selection node are accessed, or only the selection nodes in the first few orders are accessed), but the randomness of the selection nodes to be accessed finally when the selection nodes are selected for access only by adopting the rule of random preference is too strong, so the scheme that the selection nodes are selected for access by adopting the rule of random preference in combination with depth-first, or the selection nodes are selected for access by adopting the rule of breadth-first in combination with depth-first.

It should be understood that although the various steps in the flowchart of fig. 2 are shown as indicated by the arrows, the steps are not necessarily performed in the order indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in fig. 1 and 3 may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performing the sub-steps or stages is not necessarily performed, but may be performed alternately or alternately with other steps or at least some of the sub-steps or stages of other steps.

Fig. 5 is a schematic structural diagram of an instruction list scheduling apparatus proposed in one embodiment, where the apparatus includes: an acquisition module 510, a data dependency analysis module 520, an evaluation module 530,

the obtaining module 510 is configured to obtain a set of instructions to be scheduled in the instruction list to be scheduled, and obtain all selection nodes for performing instruction selection each time in the instruction scheduling process according to a data dependency relationship between the instructions.

The data dependency analysis module 520 is configured to perform data dependency analysis on the instruction set to be scheduled, so as to obtain a data dependency relationship between instructions in the instruction set to be scheduled.

The evaluation module 530 is configured to determine, according to a preset rule, instructions in each order in the scheduled instruction list according to the selected node in the corresponding order.

In one embodiment, the evaluating module 530 accesses the selected node and obtains the longest execution time corresponding to the currently accessed selected node; if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the sequenced instructions of the currently accessed selection node as instructions in the corresponding sequence in a scheduled instruction list; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the instruction scheduling apparatus further includes an updating module 540, where the updating module is configured to update the initial execution time to the longest execution time corresponding to the currently accessed selected node if the longest execution time corresponding to the currently accessed selected node is less than the initial execution time.

In one embodiment, the evaluating module 530 is configured to access the selection node within a preset access time period, and obtain a longest execution time corresponding to the currently accessed selection node; if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the ordered instructions corresponding to the currently accessed selection node as the instructions in the corresponding order in the scheduled instruction list; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the evaluating module 530 is configured to use the instruction sequence in the instruction table to be scheduled as the instruction sequence in the instruction table after scheduling when the longest execution time corresponding to the currently accessed selection node is not less than the initial execution time.

In one embodiment, the evaluating module 530 is configured to select the selected node for access according to a random priority rule, and obtain the longest execution time corresponding to the selected node that is currently selected for access.

In one embodiment, the evaluation module 530 is configured to select the selected node for access according to a breadth-first rule, and obtain the longest execution time corresponding to the selected node that is currently selected for access.

In one embodiment, the evaluating module 530 is configured to select the selected node for access according to a depth-first rule, and obtain the longest execution time corresponding to the selected node that is currently selected for access.

In one embodiment, the evaluation module 530 is configured to select the selection nodes smaller than the preset order for access according to a breadth or random priority rule, so as to obtain the longest execution time corresponding to the selection node currently selected for access; and selecting the selected nodes not less than the preset sequence according to a depth-first rule for access to obtain the longest execution time corresponding to the selected nodes selected to be accessed currently.

In one embodiment, the evaluating module 530 is configured to obtain a shortest execution time corresponding to a currently accessed selection node; if the shortest execution time corresponding to the currently accessed selection node is larger than the initial execution time, terminating the access of the selection node associated with the currently accessed selection node; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the evaluating module 530 is configured to evaluate all the selected nodes corresponding to the current order according to the preset priority of the instruction, obtain an evaluation result of each selected node in the current order, and determine the instruction corresponding to the current order according to the evaluation result.

In one embodiment, the evaluation module 530 is configured to set the priority of each instruction according to the specific content and/or type of the currently selected node.

In one embodiment, the evaluating module 530 is configured to determine the instruction corresponding to the current order according to the shortest execution time corresponding to all the selection nodes in the current order.

For specific limitations of the instruction list scheduling apparatus, reference may be made to the above limitations of the instruction list scheduling method, which is not described herein again. The modules in the instruction list scheduling device can be wholly or partially implemented by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.

In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 6. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement the verification stimulus generation method and/or the chip verification method mentioned in the above embodiments. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on the shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.

Those skilled in the art will appreciate that the architecture shown in fig. 6 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.

In one embodiment, a computer device is provided, comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor implementing the following steps when executing the computer program: acquiring a to-be-scheduled instruction set in a to-be-scheduled instruction list, and performing data dependency analysis on the to-be-scheduled instruction set to obtain a data dependency relationship among instructions; obtaining all selection nodes for instruction selection each time in the instruction scheduling process according to the data dependency relationship among the instructions; and according to a preset rule, determining the instructions in each order in the scheduled instruction list according to the selected nodes in the corresponding order.

In one embodiment, the processor, when executing the computer program, further performs the steps of: accessing the selection node and acquiring the longest execution time corresponding to the currently accessed selection node; if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the sequenced instructions of the currently accessed selection node as the instructions in the corresponding sequence in the scheduled instruction list; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the processor, when executing the computer program, further performs the steps of: and if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, updating the initial execution time to the longest execution time corresponding to the currently accessed selection node.

In one embodiment, the processor, when executing the computer program, further performs the steps of: and if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, randomly generating an instruction sequence based on the ordered instructions corresponding to the currently accessed selection node, and updating the instruction sequence of the instruction list to be scheduled by using the randomly generated instruction sequence.

In one embodiment, the processor, when executing the computer program, further performs the steps of: accessing the selection node in a preset access time period, and acquiring the longest execution time corresponding to the currently accessed selection node; if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the ordered instructions corresponding to the currently accessed selection node as the instructions in the corresponding order in the scheduled instruction list; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the processor, when executing the computer program, further performs the steps of: and selecting the selected node for access according to a breadth-first rule, and acquiring the longest execution time corresponding to the selected node which is selected to be accessed currently.

In one embodiment, the processor, when executing the computer program, further performs the steps of: and selecting the selected node to access according to a random priority rule, and acquiring the longest execution time corresponding to the selected node which is selected to access currently.

In one embodiment, the processor, when executing the computer program, further performs the steps of: selecting the selected nodes with the sequence less than the preset sequence for access according to a breadth or random priority rule to obtain the longest execution time corresponding to the selected node which is selected for access currently; and selecting the selected nodes not less than the preset sequence according to a depth-first rule for access to obtain the longest execution time corresponding to the selected nodes selected to be accessed currently.

In one embodiment, the processor, when executing the computer program, further performs the steps of: obtaining the shortest execution time corresponding to the currently accessed selection node; if the shortest execution time corresponding to the currently accessed selection node is greater than the initial execution time, terminating accessing the selection node associated with the currently accessed selection node; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the processor, when executing the computer program, further performs the steps of: and evaluating all the selected nodes corresponding to the current order according to the preset priority of the instruction to obtain the evaluation result of each selected joint in the current order, and determining the instruction corresponding to the current order according to the evaluation result.

In one embodiment, the processor, when executing the computer program, further performs the steps of: the priority of each instruction is set according to the specific content and/or type of the currently selected node.

In one embodiment, the processor, when executing the computer program, further performs the steps of: and determining the instruction corresponding to the current sequence according to the length of the shortest execution time corresponding to all the selected nodes in the current sequence.

In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of: acquiring a to-be-scheduled instruction set in a to-be-scheduled instruction list, and performing data dependency analysis on the to-be-scheduled instruction set to obtain a data dependency relationship among instructions; obtaining all selection nodes for instruction selection each time in the instruction scheduling process according to the data dependency relationship among the instructions; and according to a preset rule, determining the instructions in each order in the scheduled instruction list according to the selected nodes in the corresponding order.

In one embodiment, the computer program when executed by the processor further performs the steps of: accessing the selection node and acquiring the longest execution time corresponding to the currently accessed selection node; if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the sequenced instructions of the currently accessed selection node as the instructions in the corresponding sequence in the scheduled instruction list; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the computer program when executed by the processor further performs the steps of: and if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, updating the initial execution time to the longest execution time corresponding to the currently accessed selection node.

In one embodiment, the computer program when executed by the processor further performs the steps of: accessing the selection node in a preset access time period, and acquiring the longest execution time corresponding to the currently accessed selection node; if the longest execution time corresponding to the currently accessed selection node is smaller than the initial execution time, determining the ordered instructions corresponding to the currently accessed selection node as the instructions in the corresponding order in the scheduled instruction list; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the computer program when executed by the processor further performs the steps of: and if the longest execution time corresponding to the currently accessed selection node is not less than the initial execution time, taking the instruction sequence in the instruction list to be scheduled as the instruction sequence in the instruction list after scheduling.

In one embodiment, the computer program when executed by the processor further performs the steps of: and selecting the selected node to access according to a random priority rule, and acquiring the longest execution time corresponding to the selected node which is selected to access currently.

In one embodiment, the computer program when executed by the processor further performs the steps of: and selecting the selected node for access according to a depth-first rule, and acquiring the longest execution time corresponding to the selected node which is selected to be accessed currently.

In one embodiment, the computer program when executed by the processor further performs the steps of: and selecting the selected node for access according to a breadth-first rule, and acquiring the longest execution time corresponding to the selected node which is selected to be accessed currently.

In one embodiment, the computer program when executed by the processor further performs the steps of: selecting the selected nodes with the sequence less than the preset sequence for access according to a breadth or random priority rule to obtain the longest execution time corresponding to the selected node which is selected for access currently; and selecting the selected nodes not less than the preset sequence according to a depth-first rule for access to obtain the longest execution time corresponding to the selected nodes selected to be accessed currently.

In one embodiment, the computer program when executed by the processor further performs the steps of: obtaining the shortest execution time corresponding to the currently accessed selection node; if the shortest execution time corresponding to the currently accessed selection node is greater than the initial execution time, terminating accessing the selection node associated with the currently accessed selection node; the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

In one embodiment, the computer program when executed by the processor further performs the steps of: and evaluating all the selected nodes corresponding to the current order according to the preset priority of the instruction to obtain the evaluation result of each selected joint in the current order, and determining the instruction corresponding to the current order according to the evaluation result.

In one embodiment, the computer program when executed by the processor further performs the steps of: the priority of each instruction is set according to the specific content and/or type of the currently selected node.

In one embodiment, the computer program when executed by the processor further performs the steps of: and determining the instruction corresponding to the current sequence according to the length of the shortest execution time corresponding to all the selected nodes in the current sequence.

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).

The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims

1. An instruction list scheduling method, comprising:

Obtaining the instruction set to be scheduled in the instruction set to be scheduled, and performing data dependency analysis on the instruction set to be scheduled, to obtain the data dependency relationship between the instructions in the instruction set to be scheduled;

According to the data dependencies between the instructions, all selected nodes for each instruction selection in the instruction scheduling process are obtained;

According to the preset rule, according to the selection node of the corresponding order, determine the instructions of each order in the scheduled instruction list;

Wherein, according to the preset rules, according to the selection node in the corresponding order, the instructions in each order in the scheduled instruction list are determined, including:

Get the longest execution time corresponding to the currently accessed selected node;

If the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, then the sorted instructions of the currently accessed selection node are determined as the instructions in the corresponding order in the scheduled instruction list;

2. The method according to claim 1, wherein the step of determining the instructions in each order in the scheduled instruction list according to the selection node in the corresponding order according to a preset rule comprises:

accessing the selection node;

Evaluate the selection nodes corresponding to the current order of access, obtain evaluation results of each selection node in the current order, and determine the instructions corresponding to the current order according to the evaluation results.

3. The method according to claim 1, wherein, according to a preset rule, the step of determining the instructions of each order in the scheduled instruction list according to the selection node of the corresponding order comprises:

All selection nodes corresponding to the current order are evaluated according to the preset priorities of the instructions, an evaluation result of each selection node in the current order is obtained, and an instruction corresponding to the current order is determined according to the evaluation results.

4. The method according to claim 3, wherein the method further comprises:

The priority of each command is set according to the specific content and/or type of the currently selected node.

5. method according to claim 2, is characterized in that, the corresponding selection node of the current order of visit is evaluated, obtains the evaluation result of each selection node of current order, according to the evaluation result, it is determined that the corresponding instruction of current order comprises:

According to the length of the shortest execution time corresponding to all selected nodes in the current order, the instruction corresponding to the current order is determined.

6. The method according to claim 5, wherein the method further comprises:

If the shortest execution time corresponding to the currently accessed selection node is greater than the initial execution time, the access to the selection node associated with the currently accessed selection node is terminated;

7. The method of claim 1, wherein the method comprises:

Visit the selection node, if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, the initial execution time is updated to the longest execution time corresponding to the currently accessed selection node.

8. The method of claim 2, wherein the step of accessing the selected node comprises:

Visit selected nodes within a preset visit time period.

9. The method according to any one of claims 2-8, wherein accessing the selection node comprises:

The selected node is selected for access according to a random priority rule; or,

The selected node is selected for access according to a breadth-first rule; or,

The selected node is selected for access according to a depth-first rule; or,

According to the rule of breadth or random priority, the selection nodes that are smaller than the preset order are selected for access, and the selection nodes that are not smaller than the preset order are selected to be accessed according to the rule of depth first.

10. An instruction scheduling device, comprising: an acquisition module, a data dependency analysis module, and an evaluation module,

The obtaining module is used to obtain the to-be-scheduled instruction set in the to-be-scheduled instruction list, and to obtain all selection nodes corresponding to each instruction selection in the instruction scheduling process according to the data dependencies between the instructions;

The data dependency analysis module is used for performing data dependency analysis on the instruction set to be scheduled to obtain the data dependency relationship between the instructions;

The evaluation module is used to determine, according to preset rules, the instructions of each order in the scheduled instruction list according to the selection nodes of the corresponding order;

Wherein, the evaluation module is specifically used to obtain the longest execution time corresponding to the currently accessed selection node; if the longest execution time corresponding to the currently accessed selection node is less than the initial execution time, the currently accessed selection node The ordering instruction is determined as an instruction in the corresponding order in the scheduled instruction list; wherein, the initial execution time is the execution time of the instruction sequence in the instruction list to be scheduled.

11. A computer device, comprising a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor executes the method according to any one of claims 1-9. steps of the method.

12. A computer-readable storage medium, characterized in that, a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, any one of claims 1-9 is implemented. steps of the method.