CN114812585A

CN114812585A - Path planning method and device

Info

Publication number: CN114812585A
Application number: CN202110115030.7A
Authority: CN
Inventors: 朱正达; 张磊; 冉旭; 王睿
Original assignee: Momenta Suzhou Technology Co Ltd
Current assignee: Momenta Suzhou Technology Co Ltd
Priority date: 2021-01-27
Filing date: 2021-01-27
Publication date: 2022-07-29
Anticipated expiration: 2041-01-27
Also published as: CN114812585B; WO2022160634A1

Abstract

The embodiment of the present invention discloses a path planning method and device. The method includes: obtaining current decision planning related information corresponding to a target object; Planning-related information and preset driving constraints, determine the reachable state set corresponding to the target object; obtain the planned trajectory information corresponding to each reachable state in the reachable state set; based on the planned trajectory information corresponding to each reachable state, each reachable state The level corresponding to the state and the evaluation factor, determine the target state corresponding to the target object from all the reachable states, wherein the level corresponding to the reachable state is the level in the preset hierarchical state machine; control the preset hierarchical state The machine jumps from the current state to the target state, so that the target object travels based on the planning trajectory information corresponding to the target state, so as to improve the rationality of the path planning.

Description

Path planning method and device

技术领域technical field

本发明涉及路径规划技术领域，具体而言，涉及一种路径规划方法及装置。The present invention relates to the technical field of path planning, and in particular, to a path planning method and device.

背景技术Background technique

在高速和城市道路交通中，自动驾驶车辆的路径规划模块需要根据地图任务在满足交规和正常人类司机驾驶习惯的情况下，实时规划出合理轨迹交由底层控制模块执行，并在上游的用户交互界面中更新和维护自动驾驶车辆所处的状态，以向自动驾驶车辆的乘客及其周围有交互的行人和车辆，展现自车的意图和对应的驾驶行为。In high-speed and urban road traffic, the path planning module of the autonomous vehicle needs to plan a reasonable trajectory in real time according to the map task and meet the traffic regulations and the driving habits of normal human drivers, and hand it over to the underlying control module for execution. The state of the self-driving vehicle is updated and maintained in the interface, so as to show the intention of the self-driving vehicle and the corresponding driving behavior to the passengers of the self-driving vehicle and the interacting pedestrians and vehicles around it.

目前，路径规划模块在规划轨迹的过程中，一般是首先基于预设状态机的各状态之间的跳转关系以及当前状态，直接确定出目标状态，进而基于上层信息采集模块所采集的当前环境信息、地图信息和自动驾驶车辆的位姿信息以及所需跳转到的目标状态和预设路径规划算法，规划出合理的轨迹。At present, in the process of planning the trajectory, the path planning module generally first directly determines the target state based on the jump relationship between the states of the preset state machine and the current state, and then directly determines the target state based on the current environment collected by the upper-layer information collection module. Information, map information and the pose information of the autonomous vehicle, as well as the target state to be jumped to and the preset path planning algorithm, plan a reasonable trajectory.

上述过程中，基于预设状态机的各状态之间的跳转关系以及当前状态，直接确定出目标状态，其所确定出的目标状态可能不为最优的目标状态，进而所确定出的轨迹可能不为最合理的轨迹。In the above process, based on the jump relationship between the states of the preset state machine and the current state, the target state is directly determined, and the determined target state may not be the optimal target state, and then the determined trajectory is determined. Probably not the most reasonable trajectory.

发明内容SUMMARY OF THE INVENTION

本发明提供了一种路径规划方法及装置，以实现提高路径规划的合理性。具体的技术方案如下：The present invention provides a path planning method and device to improve the rationality of path planning. The specific technical solutions are as follows:

第一方面，本发明实施例提供了一种路径规划方法，所述方法包括：In a first aspect, an embodiment of the present invention provides a path planning method, the method includes:

获得目标对象对应的当前决策规划相关信息；Obtain the relevant information of the current decision-making plan corresponding to the target object;

基于预设分层状态机的当前状态、各状态之间的跳转关系、所述当前决策规划相关信息以及预设行驶限定条件，确定所述目标对象对应的可达状态集合，所述预设分层状态机包括：所述目标对象对应的各状态之间的跳转关系和层级关系；Based on the current state of the preset hierarchical state machine, the jump relationship between the states, the relevant information of the current decision plan, and the preset driving limit conditions, a set of reachable states corresponding to the target object is determined, and the preset The hierarchical state machine includes: a jump relationship and a hierarchical relationship between states corresponding to the target object;

获得所述可达状态集合中各可达状态对应的规划轨迹信息；obtaining planning trajectory information corresponding to each reachable state in the set of reachable states;

基于各可达状态对应的规划轨迹信息、各可达状态对应的层级及评价因素，从所有可达状态中确定出所述目标对象对应的目标状态，其中，可达状态对应的层级为其在预设分层状态机中的层级；Based on the planning trajectory information corresponding to each reachable state, the level and evaluation factors corresponding to each reachable state, the target state corresponding to the target object is determined from all reachable states, wherein the level corresponding to the reachable state is its Preset levels in a hierarchical state machine;

控制所述预设分层状态机从所述当前状态跳转至所述目标状态，以使目标对象基于所述目标状态对应的规划轨迹信息行驶。The preset hierarchical state machine is controlled to jump from the current state to the target state, so that the target object travels based on the planned trajectory information corresponding to the target state.

可选的，所述当前决策规划相关信息包括：目标对象的当前周围感知信息、当前位姿信息、当前地图信息以及障碍物的预测信息。Optionally, the current decision planning related information includes: current surrounding perception information, current pose information, current map information, and prediction information of obstacles of the target object.

可选的，所述基于预设分层状态机的当前状态、各状态之间的跳转关系、所述当前决策规划相关信息以及预设行驶限定条件，确定所述目标对象对应的可达状态集合的步骤，包括：Optionally, determining the reachable state corresponding to the target object based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision-making planning related information, and the preset driving limit condition. The steps of the collection include:

对所述当前决策规划相关信息中的指定信息进行预处理，并判断所述当前决策规划相关信息中的各信息是否有效；Preprocessing the specified information in the current decision planning related information, and judging whether each information in the current decision planning related information is valid;

在确定所获得的当前决策规划相关信息均有效的情况下，基于预设分层状态机的当前状态、各状态之间的跳转关系、所述当前决策规划相关信息以及预设行驶限定条件，确定所述目标对象对应的可达状态集合。In the case where it is determined that the obtained current decision planning related information is valid, based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision planning related information, and the preset driving limit conditions, A reachable state set corresponding to the target object is determined.

可选的，所述基于各可达状态对应的规划轨迹信息、各可达状态对应的层级及评价因素，从所有可达状态中确定出所述目标对象对应的目标状态的步骤，包括：Optionally, the step of determining the target state corresponding to the target object from all the reachable states based on the planning trajectory information corresponding to each reachable state, the level and evaluation factors corresponding to each reachable state, includes:

判断所述可达状态集合中是否存在所对应执行标签为必须执行标签的可达状态，其中，所述跳转标签为：基于所述当前决策规划相关信息中的当前周围感知信息和/或当前地图信息所确定的标签；Judging whether there is a reachable state in which the corresponding execution label is a must-execute label in the reachable state set, wherein the jump label is: based on the current surrounding perception information and/or the current surrounding perception information in the current decision planning related information Labels as determined by map information;

若确定所述可达状态集合中存在所对应执行标签为必须执行标签的可达状态，基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态；基于待评价状态对应的规划轨迹信息以及待评价状态对应的评价因素，确定出待评价状态对应的规划轨迹信息是否可行的评价结果，其中，必须执行标签对应的可达状态的优先级最高；If it is determined that there is a reachable state in the set of reachable states in which the corresponding execution label is a mandatory label, based on the current priority corresponding to the top-level state corresponding to each reachable state, determine the reachable state corresponding to the highest current priority. state, as the state to be evaluated; based on the planning trajectory information corresponding to the state to be evaluated and the evaluation factors corresponding to the state to be evaluated, determine the evaluation result of whether the planned trajectory information corresponding to the state to be evaluated is feasible. Status has the highest priority;

若所述评价结果表征待评价状态对应的规划轨迹信息可行，则基于可行的规划轨迹信息，从待评价状态中确定出所述目标对象对应的目标状态；If the evaluation result indicates that the planned trajectory information corresponding to the state to be evaluated is feasible, then based on the feasible planned trajectory information, the target state corresponding to the target object is determined from the state to be evaluated;

若所述评价结果表征待评价状态对应的规划轨迹信息不可行，则返回执行所述基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态的步骤。If the evaluation result indicates that the planning trajectory information corresponding to the state to be evaluated is infeasible, return to execute the current priority corresponding to the top-level state corresponding to each reachable state, and determine the reachable state corresponding to the highest current priority , as the step of the state to be evaluated.

可选的，若待评价状态中包含互为有公有父状态且处于同一层级的可达状态；互为有公有父状态且处于同一层级的可达状态之间存在时序阶段过渡关系；Optionally, if the to-be-evaluated state includes reachable states that share a common parent state with each other and are at the same level; there is a temporal phase transition relationship between the reachable states that share a common parent state and are at the same level;

所述基于待评价状态对应的规划轨迹信息以及该待评价状态对应的评价因素，确定出待评价状态对应的规划轨迹信息是否可行的评价结果的步骤，包括：The step of determining whether the planned trajectory information corresponding to the to-be-evaluated state is feasible based on the planned trajectory information corresponding to the to-be-evaluated state and the evaluation factor corresponding to the to-be-evaluated state includes:

针对互为有公有父状态且处于同一层级的待评价状态，基于各待评价状态对应的时序阶段过渡关系的先后顺序，依次确定当前未评价的且所对应时序阶段过渡关系最先的可达状态，为当前待评价状态；For the to-be-evaluated states that share a common parent state and are at the same level, based on the sequence of the transitional relationships of the time-series stages corresponding to each to-be-evaluated state, determine in turn the currently unevaluated and the first reachable state of the corresponding time-series-stage transition relationship , which is the current state to be evaluated;

基于该当前待评价状态对应的规划轨迹信息以及该当前待评价状态对应的评价因素，确定该当前待评价状态对应的评价指标值；Determine the evaluation index value corresponding to the current state to be evaluated based on the planning trajectory information corresponding to the current state to be evaluated and the evaluation factor corresponding to the current state to be evaluated;

若当前待评价状态对应的评级指标值表征：该当前待评价状态对应的规划轨迹信息可行，判断所述待评价状态中是否存在未评价的状态；If the rating index value corresponding to the current to-be-evaluated state represents: the planned trajectory information corresponding to the current to-be-evaluated state is feasible, determine whether there is an unevaluated state in the to-be-evaluated state;

若判断存在，返回所述从待评价状态中，确定出未评价的且所对应时序阶段过渡关系最先的可达状态，作为当前待评价状态的步骤；If it is judged to exist, returning to the state to be evaluated, and determining the unevaluated and the first reachable state of the corresponding sequence stage transition relationship, as the step of the current state to be evaluated;

若判断不存在，或确定出当前待评价状态对应的规划轨迹信息不可行，得到待评价状态对应的规划轨迹信息是否可行的评价结果。If it is judged that it does not exist, or it is determined that the planned trajectory information corresponding to the current state to be evaluated is infeasible, an evaluation result of whether the planned trajectory information corresponding to the state to be evaluated is feasible is obtained.

可选的，所述基于可行的规划轨迹信息，从待评价状态中确定出所述目标对象对应的目标状态的步骤，包括：Optionally, the step of determining the target state corresponding to the target object from the to-be-evaluated state based on feasible planning trajectory information includes:

将可行的规划轨迹信息所对应可达状态中，所对应时序阶段过渡关系最后的可达状态，确定为所述目标对象对应的目标状态。Among the reachable states corresponding to the feasible planning trajectory information, the last reachable state of the corresponding time sequence phase transition relationship is determined as the target state corresponding to the target object.

可选的，所述方法还包括：Optionally, the method further includes:

若确定所述可达状态集合中不存在所对应执行标签为必须执行标签的可达状态，针对每一可达状态，基于该可达状态对应的规划轨迹信息以及该可达状态对应的评价因素，确定该可达状态对应的评价指标值；If it is determined that there is no reachable state in the reachable state set corresponding to the execution label that is a must be executed label, for each reachable state, based on the planning trajectory information corresponding to the reachable state and the evaluation factor corresponding to the reachable state , determine the evaluation index value corresponding to the reachable state;

基于各可达状态对应的评价指标值，从所有可达状态中确定出所述目标对象对应的目标状态。Based on the evaluation index value corresponding to each reachable state, the target state corresponding to the target object is determined from all reachable states.

可选的，所述获得所述可达状态集合中各可达状态对应的规划轨迹信息的步骤，包括：Optionally, the step of obtaining planning trajectory information corresponding to each reachable state in the set of reachable states includes:

基于所述可达状态集合中各可达状态对应的路径规划算法以及所述当前决策规划相关信息中目标对象的当前周围感知信息、当前位姿信息、当前地图信息和障碍物的预测信息，确定所述可达状态集合中各可达状态对应的规划轨迹信息。Based on the path planning algorithm corresponding to each reachable state in the set of reachable states and the current surrounding perception information, current pose information, current map information and obstacle prediction information of the target object in the current decision planning related information, determine Planning trajectory information corresponding to each reachable state in the reachable state set.

可选的，所述方法还包括：Optionally, the method further includes:

基于所述目标状态确定所述目标状态对应的可视化信号以及可视化信息，并输出，以使所述目标对象基于所述可视化信号以及可视化信息进行相应的展示。A visual signal and visual information corresponding to the target state are determined based on the target state, and output, so that the target object is displayed accordingly based on the visual signal and the visual information.

第二方面，本发明实施例提供了一种路径规划装置，所述装置包括：In a second aspect, an embodiment of the present invention provides a path planning device, where the device includes:

第一获得模块，被配置为获得目标对象对应的当前决策规划相关信息；The first obtaining module is configured to obtain the current decision planning related information corresponding to the target object;

第一确定模块，被配置为基于预设分层状态机的当前状态、各状态之间的跳转关系、所述当前决策规划相关信息以及预设行驶限定条件，确定所述目标对象对应的可达状态集合，所述预设分层状态机包括：所述目标对象对应的各状态之间的跳转关系和层级关系；The first determination module is configured to determine, based on the current state of the preset hierarchical state machine, the jump relationship between the states, the relevant information of the current decision-making plan, and the preset driving limit condition, determine the available options corresponding to the target object. reaching a state set, the preset hierarchical state machine includes: the jump relationship and the hierarchical relationship between the states corresponding to the target object;

第二获得模块，被配置为获得所述可达状态集合中各可达状态对应的规划轨迹信息；a second obtaining module, configured to obtain planning trajectory information corresponding to each reachable state in the set of reachable states;

第二确定模块，被配置为基于各可达状态对应的规划轨迹信息、各可达状态对应的层级及评价因素，从所有可达状态中确定出所述目标对象对应的目标状态，其中，可达状态对应的层级为其在预设分层状态机中的层级；The second determination module is configured to determine the target state corresponding to the target object from all the reachable states based on the planning trajectory information corresponding to each reachable state, the level and evaluation factors corresponding to each reachable state, wherein the target state corresponding to the target object can be determined. The level corresponding to the reached state is its level in the preset hierarchical state machine;

控制模块，被配置为控制所述预设分层状态机从所述当前状态跳转至所述目标状态，以使目标对象基于所述目标状态对应的规划轨迹信息行驶。The control module is configured to control the preset hierarchical state machine to jump from the current state to the target state, so that the target object travels based on the planned trajectory information corresponding to the target state.

可选的，所述第一确定模块，被具体配置为对所述当前决策规划相关信息中的指定信息进行预处理，并判断所述当前决策规划相关信息中的各信息是否有效；Optionally, the first determination module is specifically configured to preprocess the specified information in the current decision planning related information, and determine whether each information in the current decision planning related information is valid;

可选的，所述第二确定模块，包括：Optionally, the second determining module includes:

判断单元，被配置为判断所述可达状态集合中是否存在所对应执行标签为必须执行标签的可达状态，其中，所述跳转标签为：基于所述当前决策规划相关信息中的当前周围感知信息和/或当前地图信息所确定的标签；A judging unit, configured to judge whether there is a reachable state in which the corresponding execution label is a must-execute label in the reachable state set, wherein the jump label is: based on the current surrounding information in the current decision planning related information tags determined by sensory information and/or current map information;

第一确定单元，被配置为若确定所述可达状态集合中存在所对应执行标签为必须执行标签的可达状态，基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态；基于待评价状态对应的规划轨迹信息以及待评价状态对应的评价因素，确定出待评价状态对应的规划轨迹信息是否可行的评价结果，其中，必须执行标签对应的可达状态的优先级最高；The first determination unit is configured to, if it is determined that there is a reachable state in the reachable state set in which the corresponding execution label is a must be executed label, based on the current priority corresponding to the top-level state corresponding to each reachable state, determine the reachable state. The reachable state corresponding to the current highest priority is regarded as the state to be evaluated; based on the planning trajectory information corresponding to the state to be evaluated and the evaluation factors corresponding to the state to be evaluated, determine whether the planned trajectory information corresponding to the state to be evaluated is feasible. , the reachable state corresponding to the tag must be executed with the highest priority;

第二确定单元，被配置为若所述评价结果表征待评价状态对应的规划轨迹信息可行，则基于可行的规划轨迹信息，从待评价状态中确定出所述目标对象对应的目标状态；The second determining unit is configured to determine, based on the feasible planning trajectory information, the target state corresponding to the target object from the to-be-evaluated state if the evaluation result indicates that the planned trajectory information corresponding to the to-be-evaluated state is feasible;

若所述评价结果表征待评价状态对应的规划轨迹信息不可行，则返回触发第一确定单元。If the evaluation result indicates that the planned trajectory information corresponding to the state to be evaluated is infeasible, the process returns to trigger the first determination unit.

所述第一确定单元，被具体配置为针对互为有公有父状态且处于同一层级的待评价状态，基于各待评价状态对应的时序阶段过渡关系的先后顺序，依次确定当前未评价的且所对应时序阶段过渡关系最先的可达状态，为当前待评价状态；The first determining unit is specifically configured to, for the to-be-evaluated states that share a common parent state with each other and are at the same level, sequentially determine the currently unevaluated and the The first reachable state corresponding to the transition relationship in the time sequence stage is the current state to be evaluated;

可选的，所述第二确定单元，被具体配置为将可行的规划轨迹信息所对应可达状态中，所对应时序阶段过渡关系最后的可达状态，确定为所述目标对象对应的目标状态。Optionally, the second determining unit is specifically configured to determine, among the reachable states corresponding to the feasible planning trajectory information, the last reachable state of the corresponding time sequence phase transition relationship as the target state corresponding to the target object. .

可选的，所述第二确定模块，还包括：Optionally, the second determining module further includes:

第三确定单元，被配置为若确定所述可达状态集合中不存在所对应执行标签为必须执行标签的可达状态，针对每一可达状态，基于该可达状态对应的规划轨迹信息以及该可达状态对应的评价因素，确定该可达状态对应的评价指标值；The third determining unit is configured to, if it is determined that there is no reachable state in the reachable state set that corresponds to the execution label that is a must-execute label, for each reachable state, based on the planned trajectory information corresponding to the reachable state and The evaluation factor corresponding to the reachable state is determined, and the evaluation index value corresponding to the reachable state is determined;

可选的，所述第二获得模块，被具体配置为基于所述可达状态集合中各可达状态对应的路径规划算法以及所述当前决策规划相关信息中目标对象的当前周围感知信息、当前位姿信息、当前地图信息和障碍物的预测信息，确定所述可达状态集合中各可达状态对应的规划轨迹信息。Optionally, the second obtaining module is specifically configured to be based on the path planning algorithm corresponding to each reachable state in the reachable state set and the current surrounding perception information of the target object in the current decision planning related information, the current The position and attitude information, the current map information and the prediction information of obstacles are used to determine the planned trajectory information corresponding to each reachable state in the set of reachable states.

可选的，所述装置还包括：Optionally, the device further includes:

确定输出模块，被配置为基于所述目标状态确定所述目标状态对应的可视化信号以及可视化信息，并输出，以使所述目标对象基于所述可视化信号以及可视化信息进行相应的展示。A determining and outputting module is configured to determine a visual signal and visual information corresponding to the target state based on the target state, and output the visual signal, so that the target object can be displayed accordingly based on the visual signal and the visual information.

由上述内容可知，本发明实施例提供的一种路径规划方法及装置，获得目标对象对应的当前决策规划相关信息；基于预设分层状态机的当前状态、各状态之间的跳转关系、当前决策规划相关信息以及预设行驶限定条件，确定目标对象对应的可达状态集合，预设分层状态机包括：目标对象的各状态之间的跳转关系和层级关系；获得可达状态集合中各可达状态对应的规划轨迹信息；基于各可达状态对应的规划轨迹信息、各可达状态对应的层级及评价因素，从所有可达状态中确定出目标对象对应的目标状态，其中，可达状态对应的层级为其在预设分层状态机中的层级；控制预设分层状态机从当前状态跳转至目标状态，以使目标对象基于目标状态对应的规划轨迹信息行驶。It can be seen from the above that a path planning method and device provided by the embodiments of the present invention obtain the current decision planning related information corresponding to the target object; based on the current state of the preset hierarchical state machine, the jump relationship between the states, The current decision-making planning related information and the preset driving conditions determine the set of reachable states corresponding to the target object. The preset hierarchical state machine includes: the jump relationship and hierarchical relationship between the states of the target object; obtain the set of reachable states planning trajectory information corresponding to each reachable state in the The level corresponding to the reachable state is the level in the preset hierarchical state machine; the preset hierarchical state machine is controlled to jump from the current state to the target state, so that the target object travels based on the planned trajectory information corresponding to the target state.

应用本发明实施例，可以首先基于预设分层状态机的当前状态、各状态之间的跳转关系、当前决策规划相关信息以及预设行驶限定条件，确定出从预设分层状态机下一状态可跳转至的可达状态集合，即确定出预设分层状态机对应的目标对象的下一时刻的可达状态集合，进而利用各可达状态对应的规划轨迹信息的合理性，从可达状态集合中确定出所对应规划轨迹信息最合理的可达状态，即目标状态，实现对目标对象的下一时刻的状态的合理化且准确化的确定，实现对路径规划的合理性的提高，并且，在规划过程中，首先确定可达状态，进而获得可达状态对应的规划轨迹信息，以通过对规划信息的评价，从可达状态中确定目标状态，该规划框架更具有可泛化、可扩展化以及可解释化，可以针对实际涌现的新的问题和场景快速定义和嵌入新的场景和动作模式，并适配新的路径规划算法。当然，实施本发明的任一产品或方法并不一定需要同时达到以上所述的所有优点。Applying the embodiments of the present invention, it is possible to first determine the current state of the preset hierarchical state machine based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision planning related information, and the preset driving restrictions. A set of reachable states to which a state can be jumped, that is, to determine the set of reachable states at the next moment of the target object corresponding to the preset hierarchical state machine, and then use the rationality of the planning trajectory information corresponding to each reachable state, Determine the most reasonable reachable state corresponding to the planning trajectory information from the set of reachable states, that is, the target state, realize the rational and accurate determination of the state of the target object at the next moment, and improve the rationality of the path planning. , and in the planning process, first determine the reachable state, and then obtain the planning trajectory information corresponding to the reachable state, so as to determine the target state from the reachable state through the evaluation of the planning information, the planning framework is more generalizable , scalable and interpretable, can quickly define and embed new scenarios and action modes for emerging new problems and scenarios, and adapt to new path planning algorithms. Of course, it is not necessary for any product or method of the present invention to achieve all of the advantages described above at the same time.

本发明实施例的创新点包括：The innovative points of the embodiments of the present invention include:

1、首先确定出可达状态集合，进而分析可达状态集合中各可达状态对应的规划轨迹信息的评价结果，从可达状态集合中确定出最合理的可达状态，作为目标状态，实现对目标对象的下一时刻的状态的合理化且准确化的确定，实现对路径规划的合理性的提高。并且，在规划过程中，首先确定可达状态，进而获得可达状态对应的规划轨迹信息，以通过对规划信息的评价，从可达状态中确定目标状态，该规划框架更具有可泛化、可扩展化以及可解释化，可以针对实际涌现的新的问题和场景快速定义和嵌入新的场景和动作模式，并适配新的路径规划算法。1. First determine the reachable state set, and then analyze the evaluation results of the planning trajectory information corresponding to each reachable state in the reachable state set, and determine the most reasonable reachable state from the reachable state set, as the target state, to achieve The rational and accurate determination of the state of the target object at the next moment realizes the improvement of the rationality of the path planning. Moreover, in the planning process, the reachable state is first determined, and then the planning trajectory information corresponding to the reachable state is obtained, so that the target state can be determined from the reachable state through the evaluation of the planning information. The planning framework is more generalizable, Extensible and interpretable, it can quickly define and embed new scenarios and action modes for emerging new problems and scenarios, and adapt to new path planning algorithms.

2、首先确定当前决策规划相关信息中各信息的有效性，在当前决策规划相关信息均有效的情况下，执行后续的路径规划流程，以保证路径规划的安全性以及合理性。2. First, determine the validity of each information in the current decision-making planning related information. When the current decision-making planning related information is valid, execute the subsequent path planning process to ensure the safety and rationality of the path planning.

3、在可达状态集合中存在所对应执行标签为必须执行标签的可达状态的情况下，即其中存在必须执行的可达状态，针对该情况，利用当前优先级优先针对必须执行标签的可达状态进行评价即仲裁，在确定其对应的规划轨迹信息可行的情况下，直接确定该可达状态为目标状态，在确定该必须执行标签的可达状态基所对应优先级最高的可达状态对应的规划轨迹不可行的情况下，依次往下仲裁，以保证所确定出的目标状态更合理，更符合当前环境情况。3. When there is a reachable state in which the corresponding execution label is a must-execute label in the reachable state set, that is, there is a reachable state that must be executed. In this case, the current priority is used to give priority to the reachable state of the must-execute label. The evaluation of the reachable state is called arbitration. When it is determined that the corresponding planning trajectory information is feasible, the reachable state is directly determined as the target state, and the reachable state with the highest priority corresponding to the reachable state base of the must-execute label is determined. When the corresponding planning trajectory is not feasible, the arbitration will be performed in turn to ensure that the determined target state is more reasonable and more in line with the current environmental conditions.

4、在可达状态集合中不存在所对应执行标签为必须执行标签的可达状态的情况下，可以平行确定各可达状态对应的评价指标值，以从中确定出最合理的可达状态作为目标状态。4. In the case that there is no reachable state in the reachable state set whose corresponding execution label is a mandatory label, the evaluation index value corresponding to each reachable state can be determined in parallel, and the most reasonable reachable state can be determined as the target state.

5、针对有公有父状态的子状态，设置时序阶段过渡关系，结合时序阶段过渡关系确定各子状态对应的评价指标值，即确定各子状态对应的规划轨迹信息的可行性，通过时序阶段过渡关系，限制有公有父状态的子状态的执行先后顺序，保证目标对象各状态跳转的最优性以及连贯性。5. For the sub-states that have a common parent state, set the transition relationship of the time sequence stage, and determine the evaluation index value corresponding to each sub-state in combination with the transition relationship of the time sequence stage, that is, determine the feasibility of the planning trajectory information corresponding to each sub-state, and pass the time sequence stage transition. relationship, which restricts the execution sequence of child states with a common parent state, and ensures the optimality and coherence of each state transition of the target object.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单介绍。显而易见地，下面描述中的附图仅仅是本发明的一些实施例。对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the following briefly introduces the accompanying drawings that are required in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only some embodiments of the invention. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative effort.

图1为本发明实施例提供的路径规划方法的一种流程示意图；FIG. 1 is a schematic flowchart of a path planning method provided by an embodiment of the present invention;

图2A和2B为预设分层状态机的状态跳转逻辑的一种示意图；2A and 2B are a schematic diagram of a state jump logic of a preset hierarchical state machine;

图3为本发明实施例提供的路径规划装置的一种结构示意图。FIG. 3 is a schematic structural diagram of a path planning apparatus according to an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整的描述。显然，所描述的实施例仅仅是本发明的一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有付出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

需要说明的是，本发明实施例及附图中的术语“包括”和“具有”以及它们的任何变形，意图在于覆盖不排他的包含。例如包含的一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元，而是可选地还包括没有列出的步骤或单元，或可选地还包括对于这些过程、方法、产品或设备固有的其他步骤或单元。It should be noted that the terms "comprising" and "having" and any modifications thereof in the embodiments of the present invention and the accompanying drawings are intended to cover non-exclusive inclusion. For example, a process, method, system, product or device that includes a series of steps or units is not limited to the steps or units listed, but optionally also includes steps or units not listed, or optionally also includes For other steps or units inherent to these processes, methods, products or devices.

本发明提供了一种路径规划方法及装置，以实现提高路径规划的合理性。下面对本发明实施例进行详细说明。The present invention provides a path planning method and device to improve the rationality of path planning. The embodiments of the present invention will be described in detail below.

图1为本发明实施例提供的路径规划方法的一种流程示意图。该方法可以包括如下步骤：FIG. 1 is a schematic flowchart of a path planning method provided by an embodiment of the present invention. The method may include the following steps:

S101：获得目标对象对应的当前决策规划相关信息。S101: Obtain current decision planning related information corresponding to the target object.

本发明实施例所提供的路径规划方法，可以应用于任一具有计算能力的电子设备，该电子设备可以为终端或者服务器。在一种实现中，实现该路径规划方法的功能软件可以以单独的客户端软件的形式存在，也可以以目前相关的客户端软件的插件的形式存在，例如可以以自动驾驶系统的功能模块的形式存在，这都是可以的。The path planning method provided by the embodiment of the present invention can be applied to any electronic device with computing capability, and the electronic device can be a terminal or a server. In one implementation, the functional software for implementing the path planning method may exist in the form of a separate client software, or may exist in the form of a plug-in of the currently related client software, for example, it may exist in the form of a functional module of an automatic driving system. Form exists, this is all possible.

其中，目标对象可以为自动驾驶车辆以及机器人等需要进行路径规划的对象。在一种实现中，目标对象为自动驾驶车辆的情况下，实现该路径规划方法的电子设备可以为车载设备，设置于该目标对象，可以直接获得该目标对象对应的当前决策规划相关信息。另一种情况中，实现该路径规划方法的电子设备也可以为非车载设备，相应的该电子设备可以与目标对象对应的数据处理设备连接，获得目标对象对应的数据处理设备所发送的目标对象对应的当前决策规划相关信息。The target object may be an object that needs path planning, such as an autonomous vehicle and a robot. In one implementation, when the target object is an autonomous vehicle, the electronic device implementing the path planning method may be an in-vehicle device, set on the target object, and can directly obtain current decision planning related information corresponding to the target object. In another case, the electronic device implementing the path planning method may also be an off-board device, and the corresponding electronic device may be connected to the data processing device corresponding to the target object to obtain the target object sent by the data processing device corresponding to the target object. Corresponding information about the current decision plan.

在本发明的一种实现方式中，该当前决策规划相关信息可以包括但不限于：目标对象的当前周围感知信息、当前位姿信息、当前地图信息以及障碍物的预测信息。其中，当前地图信息可以指目标对象所处环境对应的地图信息。障碍物的预测信息可以包括但不限于：障碍物的预测行驶轨迹以及行驶参数信息。In an implementation manner of the present invention, the current decision planning related information may include, but is not limited to, current surrounding perception information, current pose information, current map information, and obstacle prediction information of the target object. The current map information may refer to map information corresponding to the environment where the target object is located. The predicted information of the obstacle may include, but is not limited to, the predicted travel trajectory of the obstacle and travel parameter information.

该当前决策规划相关信息为用于辅助进行决策规划以及路径规划的信息，其中，决策规划为确定目标对象所需跳转至的状态，进而目标对象基于该所需跳转至的状态对应的路径信息进行行驶，实现路径规划。该当前决策规划相关信息为：基于目标对象所设置的传感器所采集的传感器数据所确定出的信息，目标对象所设置的传感器可以包括但不限于：图像采集设备、轮速传感器、雷达、IMU(Inertial measurement unit，惯性测量单元)、GPS(Global Positioning System，全球定位系统)以及GNSS(Global NavigationSatellite System，全球卫星导航系统/全球导航卫星系统)等。The current decision planning related information is information used to assist decision planning and path planning, wherein the decision planning is to determine the state to which the target object needs to jump to, and then the target object is based on the path corresponding to the state to be jumped to. information for driving and path planning. The current decision-making planning related information is: information determined based on sensor data collected by sensors set by the target object. The sensors set by the target object may include but are not limited to: image acquisition equipment, wheel speed sensors, radar, IMU ( Inertial measurement unit, Inertial Measurement Unit), GPS (Global Positioning System, Global Positioning System) and GNSS (Global Navigation Satellite System, Global Satellite Navigation System/Global Navigation Satellite System) and so on.

S102：基于预设分层状态机的当前状态、各状态之间的跳转关系、所述当前决策规划相关信息以及预设行驶限定条件，确定目标对象对应的可达状态集合。S102: Determine a set of reachable states corresponding to the target object based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision-making planning related information, and the preset driving limit conditions.

预设分层状态机包括：目标对象对应的各状态之间的跳转关系和层级关系。The preset hierarchical state machine includes: a jump relationship and a hierarchical relationship between states corresponding to the target object.

电子设备本地或所连接的存储设备预存有预设分层状态机，该预设分层状态机包含所对应目标对象对应的各种状态，预设分层状态机中包含目标对象对应的各状态之间的跳转关系和层级关系。A preset hierarchical state machine is pre-stored locally on the electronic device or in a connected storage device, the preset hierarchical state machine includes various states corresponding to the corresponding target object, and the preset hierarchical state machine includes each state corresponding to the target object Jump relationship and hierarchical relationship between.

其中，预设分层状态机包含处于同一层级的各状态，也包含处于不同层级的各状态。不同状态可以表征目标对象的不同执行动作，即预设分层状态机中某一层级的某一状态可以显示表征目标对象的某种动作模式。预设分层状态机的状态的跳转可以控制目标对象的相应执行动作即动作模式的转变。The preset hierarchical state machine includes states at the same level, and also includes states at different levels. Different states can represent different execution actions of the target object, that is, a certain state of a certain level in the preset hierarchical state machine can display a certain action mode representing the target object. The jump of the state of the preset hierarchical state machine can control the corresponding execution action of the target object, that is, the transition of the action mode.

某一层级的状态对应的下一层级的状态，可以称为该状态的子状态，状态对应的不同子状态可以认为为：该状态对应的动作模式的拆解或者阶段划分。例如：目标对象为自动驾驶车辆，如图2A所示，预设分层状态机即图2A中所述的“分层状态机”可以包括：车道保持、变道以及借道等处于第一层级也即顶级层级的状态，其中，变道可以包括变道准备和变道执行等处于第二层级的子状态，另外变道还可以包括变道返回子状态。相应的，该预设分层状态机可以包括两层状态。例如：预设分层状态机包括：车道保持以及靠边停车等处于第一层级的状态，其中，靠边停车可以包括减速以及靠边停车执行等处于第二层级的子状态。The state of the next level corresponding to a state of a certain level may be referred to as a sub-state of the state, and the different sub-states corresponding to the state may be considered as the disassembly or stage division of the action mode corresponding to the state. For example, if the target object is an autonomous vehicle, as shown in FIG. 2A , the preset hierarchical state machine, that is, the “hierarchical state machine” described in FIG. 2A may include: lane keeping, lane changing and borrowing are at the first level That is, the state of the top level, wherein the lane change may include sub-states at the second level such as lane change preparation and lane change execution, and the lane change may also include a lane change return sub-state. Correspondingly, the preset hierarchical state machine may include two levels of states. For example, the preset hierarchical state machine includes: lane keeping, pullover and other states in the first level, wherein the pullover may include deceleration and pullover execution and other sub-states in the second level.

根据对目标对象的不同动作模式即执行动作的继续划分，还可以构建出包含更多层级的状态的预设分层状态机。例如：处于第二层级的子状态变道执行还可以包括：例如驶向目标车道、进入目标车道以及行驶于目标车道等更低层级的子状态，等等。According to the different action modes of the target object, that is, the continuous division of the execution action, a preset hierarchical state machine including more levels of states can also be constructed. For example, the lane change execution in the sub-state at the second level may also include lower-level sub-states such as driving to the target lane, entering the target lane, and driving in the target lane, and the like.

可以理解的是，可以认为预设分层状态机中处于最低层级的状态对应有具体的细分场景，例如：基于上述例子，目标对象处于需要进行变道准备的场景，目标对象处于需要执行变道执行的场景等；车道保持不对应下一层级的状态，目标对象处于需要目标对象处于车道保持的场景。其需要进行完整的轨迹规划并得到相应的规划轨迹信息。It can be understood that it can be considered that the state at the lowest level in the preset hierarchical state machine corresponds to a specific subdivision scene. For example, based on the above example, the target object is in a scene that needs to be prepared for lane change, and the target object is in a scene that needs to be changed. The scene of lane execution, etc.; lane keeping does not correspond to the state of the next level, and the target object is in the scene that requires the target object to be in the lane keeping. It needs to perform complete trajectory planning and obtain the corresponding planned trajectory information.

该预设行驶限定条件为限定目标对象行驶方式的条件，例如该预设行驶限定条件可以包括目标对象所在区域所需遵守的交通规则，也可以包括目标对象所执行任务所对应的限定条件，也可以包括人为添加的经验性和通识性的条件，以限制目标对象的行驶行为。在一种情况中，人为添加的经验性和通识性的条件中包括但不限于：用于限定所需计算场景的数目和约束范围的条件，场景的数目通过所确定的可达状态的数据确定。目标对象的执行任务可以包括但不限于：目标对象需行驶到各目的地的任务；目标对象在行驶过程中所需完成的行驶动作的任务等。The preset driving restriction condition is a condition that defines the driving mode of the target object. For example, the preset driving restriction condition may include the traffic rules to be complied with in the area where the target object is located, or may include the restriction condition corresponding to the task performed by the target object, or Artificially added empirical and generalized conditions can be included to limit the driving behavior of the target object. In one case, the artificially added empirical and general conditions include, but are not limited to: conditions for limiting the number of required calculation scenarios and constraints, and the number of scenarios is determined by the data of the reachable state. Sure. The execution tasks of the target object may include, but are not limited to: tasks that the target object needs to drive to various destinations; tasks that the target object needs to complete during the driving process, and the like.

本发明实施例中，可以保留允许人工添加条件的功能，以能够实现在后续的电子设备的路径规划能力边界扩展后可以添加并开放其他条件。In the embodiment of the present invention, the function of allowing manual addition of conditions can be retained, so that other conditions can be added and opened after the path planning capability boundary of the subsequent electronic device is expanded.

为了规划出更好的路径，规划出更好的决策，需要保证所确定出的一系列状态即目标对象的执行动作存在连贯性，例如：处于转弯情况下的目标对象，前一个状态为左转，为了保持动作连贯性，后一个状态应当为左转，以避免出现目标对象在转弯情况下出现一会左转一会右转的问题。相应的，需结合预设分层状态机的当前状态确定可达状态集合。In order to plan a better path and plan a better decision, it is necessary to ensure the continuity of the determined series of states, that is, the execution of the target object. , in order to maintain the continuity of the action, the latter state should be a left turn, so as to avoid the problem that the target object turns left and right for a while when the target object is turning. Correspondingly, the reachable state set needs to be determined in combination with the current state of the preset hierarchical state machine.

本步骤中，电子设备可以检测到预设分层状态的当前状态，进而，基于预设分层状态机的当前状态、各状态之间的跳转关系，从预设分层状态机的各状态中确定出可从当前状态跳转至的状态，作为备用状态，进而，基于当前决策规划相关信息以及预设行驶限定条件，从备用状态中挑选出满足该预设行驶限定条件和/或符合当前决策规划相关信息的状态，作为目标对象对应的可达状态，以得到目标对象对应的可达状态集合。In this step, the electronic device can detect the current state of the preset hierarchical state, and further, based on the current state of the preset hierarchical state machine and the jump relationship between the states, from each state of the preset hierarchical state machine The state that can be jumped from the current state is determined as a backup state, and then, based on the current decision-making planning related information and the preset driving limit The state of the decision-making planning related information is regarded as the reachable state corresponding to the target object, so as to obtain the reachable state set corresponding to the target object.

例如：该预设行驶限定条件中包含：在目标对象进入视线区域禁止从车道保持跳转至变道的条件，基于当前决策规划相关信息确定目标对象行驶前方出现事故车或者从目标对象的盲区区域突然出现快速横穿的行人，该类情况下，为了避免碰撞风险，则需要目标对象压实线违反预设行驶限定条件以避免发生碰撞风险，以保证目标对象的安全即其他车辆及行人的安全，相应的，此时将变道确定为可达状态。For example, the preset driving restrictions include: when the target object enters the line of sight area, it is forbidden to jump from lane keeping to lane change, and based on the current decision-making planning related information, it is determined that there is an accident vehicle in front of the target object or the blind area of the target object. A pedestrian who suddenly crosses quickly. In this case, in order to avoid the risk of collision, the target object's compaction line needs to violate the preset driving conditions to avoid the risk of collision, so as to ensure the safety of the target object, that is, the safety of other vehicles and pedestrians. , correspondingly, the lane change is determined as the reachable state at this time.

S103：获得可达状态集合中各可达状态对应的规划轨迹信息。S103: Obtain planning trajectory information corresponding to each reachable state in the reachable state set.

其中，该规划轨迹信息包括但不限于：跳转至所对应可达状态后目标对象所需行驶的轨迹信息，目标对象的行驶参数信息以及与目标对象存在交互关系的其他对象的相关行驶参数信息。The planned trajectory information includes but is not limited to: trajectory information that the target object needs to travel after jumping to the corresponding reachable state, travel parameter information of the target object, and relevant travel parameter information of other objects that have an interactive relationship with the target object .

目标对象的行驶参数信息可以包括但不限于：平均速度信息、加速度信息以及转角信息。与目标对象存在交互关系的其他对象的相关行驶参数信息可以包括但不限于：平均速度信息、加速度信息以及转角信息。The driving parameter information of the target object may include, but is not limited to, average speed information, acceleration information, and turning angle information. The relevant driving parameter information of other objects that have an interactive relationship with the target object may include, but not limited to, average speed information, acceleration information, and turning angle information.

与目标对象存在交互关系的其他对象可以指：避让目标对象的对象和/或目标对象避让的对象，即受目标对象所执行动作的影响的对象，例如：目标对象变道时，所需变道至的目标车道的目标位置的后方的对象，该目标位置为目标对象变道所插入的位置。Other objects that have an interactive relationship with the target object may refer to: the object that avoids the target object and/or the object that the target object avoids, that is, the object that is affected by the action performed by the target object, for example: when the target object changes lanes, it needs to change lanes To the object behind the target position of the target lane, the target position is the position where the target object's lane change is inserted.

一种情况中，电子设备确定出可达状态集合之后，将可达状态集合、当前状态以及当前决策规划相关信息和预设行驶限定条件发送至其他设备中，以通过其他设备针对每一可达状态确定出其对应的规划轨迹信息，并发送至该电子设备。相应的，该电子设备获得可达状态集合中各可达状态对应的规划轨迹信息。In one case, after the electronic device determines the set of reachable states, it sends the set of reachable states, the current state, and current decision planning related information and preset driving conditions to other devices, so that the other devices can target each reachable state through the other devices. The state determines its corresponding planned trajectory information and sends it to the electronic device. Correspondingly, the electronic device obtains the planned trajectory information corresponding to each reachable state in the reachable state set.

在本发明的一种实现方式中，预设分层状态及中各状态对应有相应的用于规划路径的路径规划算法，电子设备确定出可达状态集合之后，可以针对可达状态集合中各可达状态，基于该可达状态对应的路径规划算法、当前决策规划相关信息中目标对象的当前周围感知信息、当前位姿信息、当前地图信息和障碍物的预测信息，确定该可达状态对应的规划轨迹信息。In an implementation of the present invention, the preset hierarchical state and each state in the preset correspond to a corresponding path planning algorithm for planning a path. After the electronic device determines the reachable state set, it Reachable state, based on the path planning algorithm corresponding to the reachable state, the current surrounding perception information of the target object in the current decision planning related information, the current pose information, the current map information and the prediction information of the obstacles, determine the corresponding state of the reachable state. planning trajectory information.

S104：基于各可达状态对应的规划轨迹信息、各可达状态对应的层级及评价因素，从所有可达状态中确定出目标对象对应的目标状态。S104: Determine the target state corresponding to the target object from all the reachable states based on the planning trajectory information corresponding to each reachable state, the level and evaluation factors corresponding to each reachable state.

其中，可达状态对应的层级为其在预设分层状态机中的层级。The level corresponding to the reachable state is the level in the preset hierarchical state machine.

电子设备针对每一可达状态，基于该可达状态对应的轨迹规划信息、该可达状态对应的层级及评价因素，确定该可达状态对应的评价指标值，进而，基于各可达状态对应的评价指标值，从所有可达状态中确定出表征所对应轨迹规划信息最合理的可达状态，作为目标对象对应的目标状态。For each reachable state, the electronic device determines the evaluation index value corresponding to the reachable state based on the trajectory planning information corresponding to the reachable state, the level and evaluation factors corresponding to the reachable state, and further, based on the corresponding reachable state. The evaluation index value of , determines the most reasonable reachable state representing the corresponding trajectory planning information from all reachable states, as the target state corresponding to the target object.

其中，在评价指标值以惩罚项的形式存在，即评价指标值越低表征可达状态对应的轨迹规划信息越合理的情况下，最合理的可达状态可以指所对应评价指标值的数值最小的可达状态；在评价指标值以非惩罚项的形式存在，即评价指标值越高表征可达状态对应的轨迹规划信息越合理的情况下，最合理的可达状态可以指所对应评价指标值的数值最大的可达状态。Among them, when the evaluation index value exists in the form of a penalty item, that is, the lower the evaluation index value represents, the more reasonable the trajectory planning information corresponding to the reachable state is, the most reasonable reachable state may refer to the minimum value of the corresponding evaluation index value. reachable state; in the case that the evaluation index value exists in the form of a non-penalty item, that is, the higher the evaluation index value is, the more reasonable the trajectory planning information corresponding to the reachable state is, the most reasonable reachable state can refer to the corresponding evaluation index. The numerically largest reachable state of the value.

其中，针对基于各可达状态对应的轨迹规划信息、各可达状态对应的层级及评价因素，确定各可达状态对应的评价指标值的过程，可以理解为针对该可达状态集合构建仲裁树的过程，即对每一可达状态对应的规划轨迹信息进行评价仲裁的过程，在仲裁树构建完成，即各可达状态对应的评价指标值确定之后，可以基于深度优先的原则确定出中才得到的最终的所对应轨迹规划信息最合理的目标状态，并确定其对应的轨迹规划信息。Among them, the process of determining the evaluation index value corresponding to each reachable state based on the trajectory planning information corresponding to each reachable state, the level and evaluation factors corresponding to each reachable state can be understood as constructing an arbitration tree for the set of reachable states The process of evaluating and arbitrating the planning trajectory information corresponding to each reachable state, after the construction of the arbitration tree is completed, that is, after the evaluation index value corresponding to each reachable state is determined, it can be determined based on the principle of depth first. The most reasonable target state corresponding to the final trajectory planning information is obtained, and the corresponding trajectory planning information is determined.

在一种情况中，评价因素可以包括但不限于：轨迹的安全性、舒适性、通行效率以及预设行驶限定条件、目标对象的执行任务等因素。In one case, the evaluation factors may include, but are not limited to, factors such as the safety of the trajectory, comfort, traffic efficiency, preset driving conditions, and tasks performed by the target object.

其中，在对不同层级的状态对应规划轨迹信息的评价仲裁过程中，一般使用不同的评价因素组合，以避免出现局部最优的情况。对于低层级的状态的评价仲裁，需考虑具体障碍物不同避让决策即安全性等因素，可以不考虑目标对象所执行任务和所处场景环境等相关因素；高级别的状态的评价仲裁，需考虑目标对象所执行任务、所处场景环境和所处区域的交通规则的具体要求等因素。Among them, in the evaluation and arbitration process of the planning trajectory information corresponding to the states of different levels, different combinations of evaluation factors are generally used to avoid the situation of local optimization. For the evaluation and arbitration of the low-level state, factors such as different avoidance decisions of specific obstacles, that is, safety, need to be considered, and the relevant factors such as the task performed by the target object and the scene environment in which it is located may not be considered; the evaluation and arbitration of the high-level state needs to be considered. Factors such as the task performed by the target object, the scene environment and the specific requirements of the traffic rules in the area.

一种情况中，评价因素可以包括但不限于：轨迹的安全性、轨迹的舒适性、轨迹的通行效率以及轨迹对应的预设执行惩罚项。In one case, the evaluation factors may include but are not limited to: the safety of the trajectory, the comfort of the trajectory, the traffic efficiency of the trajectory, and the preset execution penalty item corresponding to the trajectory.

其中，轨迹的安全性对应的评价指标值的确定，可以基于可达状态对应的规划轨迹信息中的规划轨迹，与其他和目标对象不存在交互关系的对象的预测轨迹，进行时空维度的碰撞检测确定。需要考虑预测意图的不确定性与其他和目标对象不存在交互关系的对象的预测轨迹的轨迹点的分布，以衡量可达状态对应的规划轨迹信息中的规划轨迹出现碰撞情况的概率信息。其可以在确定场景下尝试障碍物不同避让决策的评价仲裁时使用，例如：在目标对象处于变道阶段时选择进入所要变道的目标道路所需插入的位置，即目标道路中哪两个对象之间的缝隙时使用。Among them, the determination of the evaluation index value corresponding to the safety of the trajectory can be performed based on the planned trajectory in the planned trajectory information corresponding to the reachable state, and the predicted trajectory of other objects that have no interaction relationship with the target object to perform collision detection in the space-time dimension. Sure. It is necessary to consider the uncertainty of the predicted intention and the distribution of the trajectory points of the predicted trajectory of other objects that have no interaction with the target object to measure the probability information of collision of the planned trajectory in the planned trajectory information corresponding to the reachable state. It can be used in determining the evaluation and arbitration of different avoidance decisions for obstacles in determining the scene, for example: when the target object is in the lane-changing stage, select the insertion position to enter the target road to be changed, that is, which two objects in the target road should be inserted. used for gaps between.

此处在碰撞检查中只考虑与自车不存在交互的对象。这里进一步说明一下不存在交互的对象一般指本身行为和轨迹不发生干涉，或者自车行为不会影响到他车行为的车辆(比如车道保持的前车)，需要结合场景综合判断；存在交互的对象是指自车的(为完成任务或者保障安全)行为会对他车产生影响，比如自车变道时目标车道的后方车辆。从算法角度会基于经验规则去筛选出需要交互和不需要交互的目标。Here, only objects that do not interact with the ego vehicle are considered in collision checking. It is further explained here that objects without interaction generally refer to vehicles whose own behavior and trajectory do not interfere, or whose own vehicle behavior does not affect the behavior of other vehicles (such as the vehicle in front of lane keeping), which needs to be comprehensively judged in combination with the scene; The object refers to the behavior of the ego vehicle (to complete the task or ensure safety) that will affect other cars, such as the vehicle behind the target lane when the ego car changes lanes. From an algorithmic point of view, targets that require interaction and those that do not require interaction will be screened based on empirical rules.

在一种实现方式，可以使用粗略模式rough mode或者精确模式precise mode，设定轨迹的安全性阈值来确定规划轨迹是否安全可行，具体如下公式(1)表示：In one implementation, the rough mode or the precise mode can be used to set the safety threshold of the trajectory to determine whether the planned trajectory is safe and feasible, as expressed by the following formula (1):

其中，J_safety表示规划轨迹的安全性对应的评价指标值；p_collide表示精确模式下计算所得的可达状态对应的规划轨迹信息中规划轨迹发生碰撞的概率信息；k_collide表示精确模式下的预设权重系数；n_collide表示粗略模式下的可达状态对应的规划轨迹信息中规划轨迹是否发生碰撞的信息。Among them, J _safety represents the evaluation index value corresponding to the safety of the planned trajectory; p _collide represents the probability information of the collision of the planned trajectory in the planned trajectory information corresponding to the reachable state calculated in the precise mode; k _collide represents the predicted value in the precise mode. Set the weight coefficient; n _collide represents the information about whether the planned trajectory collides in the planned trajectory information corresponding to the reachable state in the rough mode.

p_collide的计算依赖于当前决策规划相关信息中的障碍物的预测信息，其中，障碍物预测信息包括障碍物的预测轨迹，其中，每一障碍物对应的预测轨迹包括多个轨迹点；其计算可以存在两种方式，第一种，针对每一障碍物，对该障碍物对应的预测轨迹中的轨迹点进行采样，采样得到该障碍物对应的碰撞轨迹，障碍物对应的碰撞轨迹包括多个采样轨迹点；针对每一障碍物，利用该障碍物对应的碰撞轨迹中某一时刻的采样轨迹点，与目标对象的可达状态对应的规划轨迹中该时刻的轨迹点，进行逐点碰撞检查，统计该障碍物对应的发生碰撞的采样轨迹点数量；针对每一障碍物，利用该障碍物对应的发生碰撞的采样轨迹点数量，以及轨迹点总数量，计算该障碍物对应的发生碰撞的轨迹点数量与轨迹点总数量的比值，作为该障碍物对应的比值；根据所有障碍物对应的比值，确定该可达状态对应的规划轨迹信息对应的发生碰撞的概率信息。The calculation of p _collide depends on the predicted information of obstacles in the current decision planning related information, wherein the predicted information of obstacles includes predicted trajectories of obstacles, wherein the predicted trajectories corresponding to each obstacle include multiple trajectory points; its calculation There are two ways. The first one is to sample the trajectory points in the predicted trajectory corresponding to the obstacle for each obstacle, and obtain the collision trajectory corresponding to the obstacle. The collision trajectory corresponding to the obstacle includes multiple collision trajectories. Sampling trajectory points; for each obstacle, use the sampled trajectory points at a certain moment in the collision trajectory corresponding to the obstacle and the trajectory points at the moment in the planned trajectory corresponding to the reachable state of the target object to perform point-by-point collision checking , count the number of collided sampling trajectory points corresponding to the obstacle; for each obstacle, use the number of collided sampling trajectory points corresponding to the obstacle and the total number of trajectory points to calculate the collision corresponding to the obstacle. The ratio of the number of trajectory points to the total number of trajectory points is used as the ratio corresponding to the obstacle; according to the ratio corresponding to all obstacles, the probability information of collision corresponding to the planned trajectory information corresponding to the reachable state is determined.

其中，利用该障碍物对应的碰撞轨迹中某一时刻的采样轨迹点，与目标对象的可达状态对应的规划轨迹中该时刻的轨迹点，进行逐点碰撞检查的过程为：根据某一时刻下目标对象的位姿信息和表征目标对象的多边形模型，以及该障碍物的位姿信息和表征该障碍物的多边形模型，确定某一时刻下目标对象与障碍物是否出现位置重叠，出现位置重叠则发生碰撞；反之，不出现位置重叠则不发生碰撞。轨迹点总数量可以指目标对象的可达状态对应的规划轨迹中的轨迹点的总数量，也可以指采用得到的预测轨迹中轨迹点的总数量。Among them, using the sampled trajectory points at a certain moment in the collision trajectory corresponding to the obstacle and the trajectory points at this moment in the planned trajectory corresponding to the reachable state of the target object, the process of performing point-by-point collision checking is as follows: according to a certain moment The pose information of the target object and the polygon model representing the target object, as well as the pose information of the obstacle and the polygon model representing the obstacle, determine whether the target object and the obstacle overlap at a certain moment, and the overlapping positions A collision occurs; otherwise, a collision does not occur if there is no overlapping of positions. The total number of trajectory points may refer to the total number of trajectory points in the planned trajectory corresponding to the reachable state of the target object, or may refer to the total number of trajectory points in the predicted trajectory obtained by using it.

根据所有障碍物对应的比值，确定该可达状态对应的规划轨迹信息对应的发生碰撞的概率信息的过程，可以是：统计所对应比值超过预设比值阈值的障碍物的个数，基于该个数，以及障碍物总数，确定该可达状态对应的规划轨迹信息对应的发生碰撞的概率信息。The process of determining the probability information of collision corresponding to the planned trajectory information corresponding to the reachable state according to the ratios corresponding to all the obstacles may be: counting the number of obstacles whose corresponding ratios exceed the preset ratio threshold, based on the number, and the total number of obstacles, to determine the probability information of collision corresponding to the planned trajectory information corresponding to the reachable state.

第二种：逐点根据目标对象的可达状态对应的规划轨迹中轨迹点与障碍物的预测轨迹信息中的轨迹点之间的相对位置，以及障碍物的预测轨迹信息中的轨迹点的空间概率分布情况，得到轨迹点的碰撞概率；选择其中数值最大的轨迹点的碰撞概率，作为目标对象的可达状态对应的规划轨迹信息对应的碰撞概率。The second: point by point according to the relative position of the trajectory point in the planned trajectory corresponding to the reachable state of the target object and the trajectory point in the predicted trajectory information of the obstacle, and the space of the trajectory point in the predicted trajectory information of the obstacle According to the probability distribution, the collision probability of the trajectory point is obtained; the collision probability of the trajectory point with the largest value is selected as the collision probability corresponding to the planned trajectory information corresponding to the reachable state of the target object.

其中，上述得到轨迹点的碰撞概率的过程，可以参照相关技术中任一可以确定两条轨迹出现碰撞的碰撞概率的方式，例如：在障碍物的预测轨迹信息中的轨迹点的空间概率分布情况为离散化表述的情况下，可以直接使用障碍物的预测轨迹信息中的轨迹点的空间概率分布情况及各对轨迹点对应的权重，与目标对象的规划轨迹中轨迹点进行比较，确定轨迹点的碰撞概率。在障碍物的预测轨迹信息中的轨迹点的空间概率分布情况为连续且可积的情况下，可简化目标对象为圆形模型，确定目标对象的规划轨迹的轨迹点与各障碍物的预测轨迹信息中的轨迹点之间的碰撞，得到轨迹点的碰撞概率，等。Wherein, for the above process of obtaining the collision probability of the trajectory points, any method in the related art can be used to determine the collision probability of two trajectories colliding, for example: the spatial probability distribution of the trajectory points in the predicted trajectory information of the obstacle In the case of a discretized representation, the spatial probability distribution of the trajectory points in the predicted trajectory information of the obstacle and the corresponding weights of each pair of trajectory points can be directly used, and the trajectory points in the planned trajectory of the target object can be compared to determine the trajectory points. collision probability. When the spatial probability distribution of the trajectory points in the predicted trajectory information of the obstacle is continuous and integrable, the target object can be simplified as a circular model, and the trajectory points of the planned trajectory of the target object and the predicted trajectory of each obstacle can be determined. The collision between the trajectory points in the information, get the collision probability of the trajectory points, etc.

其中，选择使用粗略模式或者精确模式，可以根据实际设定，例如场景进行设置，场景与状态存在对应关系。Among them, choosing to use the rough mode or the precise mode can be set according to the actual setting, such as the scene, and there is a corresponding relationship between the scene and the state.

轨迹的安全性对应的评价指标值以惩罚项的形式存在，轨迹的安全性对应的评价指标值越大，表征该可达状态对应的规划轨迹信息所对应轨迹的安全性越低。The evaluation index value corresponding to the safety of the trajectory exists in the form of a penalty item. The larger the evaluation index value corresponding to the safety of the trajectory, the lower the safety of the trajectory corresponding to the planned trajectory information representing the reachable state.

轨迹的通行效率对应的评价指标值的确定，需要考虑目标对象的可达状态对应的规划轨迹信息中规划轨迹的平均时速v，也需考虑与目标对象存在交互关系的对象集合Φ中第i个对象因避让目标对象产生的平均时速变化Δv_i；对于不同优先级的对象，可以通过系数k_i进行区分，为预设值。其中，轨迹的通行效率对应的评价指标值中，与目标对象存在交互关系的对象集合Φ中各对象对应的评价指标值部分的确定过程，可以通过如下公式(2)表示：The determination of the evaluation index value corresponding to the traffic efficiency of the trajectory needs to consider the average speed v of the planned trajectory in the planned trajectory information corresponding to the reachable state of the target object, and also needs to consider the ith object set Φ that has an interactive relationship with the target object. The average hourly speed change Δv _i generated by the object avoiding the target object; for objects with different priorities, it can be distinguished by the coefficient k _i , which is a preset value. Among them, in the evaluation index value corresponding to the traffic efficiency of the trajectory, the determination process of the evaluation index value part corresponding to each object in the object set Φ that has an interactive relationship with the target object can be expressed by the following formula (2):

J_interaction＝∑_i∈Φ-k_iΔv_i； (2)J _interaction =∑ _i∈Φ -k _i Δv _i ; (2)

其中，J_interaction表示轨迹的通行效率对应的评价指标值中，与目标对象存在交互关系的对象集合Φ中各对象对应的评价指标值部分的值。Δv_i可以从当前周围感知信息中确定得到。Among them, J _interaction represents the value of the evaluation index value part corresponding to each object in the object set Φ that has an interactive relationship with the target object in the evaluation index value corresponding to the traffic efficiency of the trajectory. Δv _i can be determined from the current surrounding perception information.

轨迹的通行效率对应的评价指标值以惩罚项的形式存在，轨迹的通行效率对应的评价指标值越大，表征该可达状态对应的规划轨迹信息所对应轨迹的通行效率越低。The evaluation index value corresponding to the traffic efficiency of the trajectory exists in the form of a penalty item. The larger the evaluation index value corresponding to the traffic efficiency of the trajectory, the lower the traffic efficiency of the trajectory corresponding to the planned trajectory information representing the reachable state.

轨迹的舒适性对应的评价指标值的确定，一方面可以考虑目标对象的可达状态对应的规划轨迹信息中规划轨迹对应的加速度a，相应的，轨迹的舒适性对应的评价指标值中的第一部分值确定方式可以通过如下公式(3)表示；In the determination of the evaluation index value corresponding to the comfort of the trajectory, on the one hand, the acceleration a corresponding to the planned trajectory in the planned trajectory information corresponding to the reachable state of the target object can be considered. Part of the value determination method can be expressed by the following formula (3);

J_comfort＝k_comfort*a； (3)J _comfort = k _comfort * a; (3)

其中，J_comfort表示轨迹的舒适性对应的评价指标值中的第一部分值，k_comfort表示加速度a对应的系数值，为预设值。Wherein, J _comfort represents the first part of the evaluation index values corresponding to the comfort of the trajectory, and k _comfort represents the coefficient value corresponding to the acceleration a, which is a preset value.

另一方面，在变道或转弯等情况中，轨迹的舒适性对应的评价指标值的确定还可以考虑所产生的横向加速度a_lat，相应的，轨迹的舒适性对应的评价指标值中的第二部分值确定方式可以通过如下公式(4)表示；On the other hand, in the case of changing lanes or turning, the determination of the evaluation index value corresponding to the comfort of the trajectory may also consider the generated lateral acceleration a _lat . Correspondingly, the No. 1 evaluation index value corresponding to the comfort of the trajectory The two-part value determination method can be expressed by the following formula (4);

J_LCManeuver＝k_LCManeuver*a_lat； (4)J _LCManeuver = k _LCManeuver *a _lat ; (4)

其中，J_LCManeuver表示轨迹的舒适性对应的评价指标值中的第二部分值，k_LCManeuver表示横向加速度a对应的系数值，为预设值。Among them, J _LCManeuver represents the second part of the evaluation index value corresponding to the comfort of the trajectory, and k _LCManeuver represents the coefficient value corresponding to the lateral acceleration a, which is a preset value.

轨迹的舒适性对应的评价指标值以惩罚项的形式存在，轨迹的舒适性对应的评价指标值越大，表征该可达状态对应的规划轨迹信息所对应轨迹的舒适性越低。The evaluation index value corresponding to the comfort of the trajectory exists in the form of a penalty item. The larger the evaluation index value corresponding to the comfort of the trajectory, the lower the comfort of the trajectory corresponding to the planned trajectory information representing the reachable state.

在可达状态对应的规划轨迹信息对应的评价指标值的确定过程中，可以通过n_LCNeed∈{0,1}来表示可达状态对应的执行标签是否为必须执行标签，其中，可以是n_LCNeed为1时表示可达状态对应的执行标签为必须执行标签，n_LCNeed为1时表示可达状态对应的执行标签不为必须执行标签，根据可达状态对应的执行标签是否为必须执行标签来确定是否增加轨迹对应的预设执行惩罚项J_LCNeed。In the process of determining the evaluation index value corresponding to the planning trajectory information corresponding to the reachable state, n _LCNeed ∈ {0,1} can be used to indicate whether the execution label corresponding to the reachable state is a mandatory label, where n _LCNeed can be When it is 1, it means that the execution label corresponding to the reachable state is a must-execute label. When n _LCNeed is 1, it means that the execution label corresponding to the reachable state is not a must-execute label. It is determined according to whether the execution label corresponding to the reachable state is a must-execute label. Whether to increase the preset execution penalty item J _LCNeed corresponding to the trajectory.

S105：控制预设分层状态机从当前状态跳转至目标状态，以使目标对象基于目标状态对应的规划轨迹信息行驶。S105: Control the preset hierarchical state machine to jump from the current state to the target state, so that the target object travels based on the planned trajectory information corresponding to the target state.

电子设备确定目标状态之后，可以控制预设分层状态机从当前状态跳转至目标状态，并输出目标状态对应的规划轨迹信息，以使得目标对象基于目标状态对应的规划轨迹信息行驶。After the electronic device determines the target state, it can control the preset hierarchical state machine to jump from the current state to the target state, and output the planned trajectory information corresponding to the target state, so that the target object travels based on the planned trajectory information corresponding to the target state.

应用本发明实施例，基于预设分层状态机的当前状态、各状态之间的跳转关系、当前决策规划相关信息以及预设行驶限定条件，确定出从预设分层状态机下一状态可跳转至的可达状态集合，即确定出预设分层状态机对应的目标对象的下一时刻的可达状态集合，进而利用各可达状态对应的规划轨迹信息的合理性，从可达状态集合中确定出所对应规划轨迹信息最合理的可达状态，即目标状态，实现对目标对象的下一时刻的状态的合理化且准确化的确定，实现对路径规划的合理性的提高，并且，在规划过程中，首先确定可达状态，进而获得可达状态对应的规划轨迹信息，以通过对规划信息的评价，从可达状态中确定目标状态，该规划框架更具有可泛化、可扩展化以及可解释化，可以针对实际涌现的新的问题和场景快速定义和嵌入新的场景和动作模式，并适配新的路径规划算法。Applying the embodiment of the present invention, based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision planning related information, and the preset driving limit conditions, determine the next state from the preset hierarchical state machine. The set of reachable states that can be jumped to, that is, the set of reachable states at the next moment of the target object corresponding to the preset hierarchical state machine is determined, and then the rationality of the planning trajectory information corresponding to each reachable state is used to determine the set of reachable states corresponding to each reachable state. Determine the most reasonable reachable state of the corresponding planning trajectory information from the reachable state set, that is, the target state, realize the rational and accurate determination of the state of the target object at the next moment, and realize the improvement of the rationality of the path planning, and , in the planning process, first determine the reachable state, and then obtain the planning trajectory information corresponding to the reachable state, so as to determine the target state from the reachable state through the evaluation of the planning information. Extensible and interpretable, new scenarios and action modes can be quickly defined and embedded for emerging new problems and scenarios, and new path planning algorithms can be adapted.

在本发明的另一实施例中，所述S102，可以包括如下步骤011-012：In another embodiment of the present invention, the S102 may include the following steps 011-012:

011：对当前决策规划相关信息中的指定信息进行预处理，并判断当前决策规划相关信息中的各信息是否有效。011: Preprocess the specified information in the current decision planning related information, and judge whether each information in the current decision planning related information is valid.

012：在确定所获得的当前决策规划相关信息均有效的情况下，基于预设分层状态机的当前状态、各状态之间的跳转关系、当前决策规划相关信息以及预设行驶限定条件，确定目标对象对应的可达状态集合。012: In the case where it is determined that the obtained current decision planning related information is valid, based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision planning related information, and the preset driving limit conditions, Determine the set of reachable states corresponding to the target object.

本实现方式中，为了保证后续的决策结果的准确以及路径规划的准确，电子设备在获得当前决策规划相关信息之后，对当前决策规划相关信息中的指定信息进行预处理，例如：利用当前位姿信息以及其之前的历史位姿信息确定目标对象的行驶轨迹；利用当前周围感知信息以及历史周围感知信息，确定当前周围感知信息中各感知目标的行驶轨迹，等。In this implementation manner, in order to ensure the accuracy of subsequent decision-making results and the accuracy of path planning, after obtaining the current decision-making planning related information, the electronic device preprocesses the specified information in the current decision-making planning related information, for example: using the current pose information and its previous historical pose information to determine the driving trajectory of the target object; use the current surrounding perception information and historical surrounding perception information to determine the driving trajectory of each sensing target in the current surrounding perception information, and so on.

进而，基于预处理结果以及当前决策规划相关信息中各信息的具体数据，确定当前决策规划相关信息中各信息是否有效。例如：基于当前决策规划相关信息中的当前位姿信息的具体数值以及所确定的目标对象的行驶轨迹，确定当前位姿信息是否准确；基于当前周围感知信息中各感知目标的位置信息和类型以及各感知目标的行驶轨迹，确定当前周围感知信息是否准确，级确定是否出现感知目标的漏检以及误检，感知目标的位置信息是否准确等。基于障碍物的预测信息的具体数值确定障碍物的预测信息是否准确。Further, based on the preprocessing result and the specific data of each information in the current decision planning related information, it is determined whether each information in the current decision planning related information is valid. For example: determine whether the current pose information is accurate based on the specific value of the current pose information in the current decision-making planning related information and the determined driving trajectory of the target object; The driving trajectory of each sensing target determines whether the current surrounding sensing information is accurate, whether there is a missed detection or false detection of the sensing target, and whether the position information of the sensing target is accurate. Whether the predicted information of the obstacle is accurate is determined based on the specific value of the predicted information of the obstacle.

在确定当前决策规划相关信息各信息均有效的情况下，执行后续的路径规划流程。以保证后续的决策结果的准确以及路径规划的准确。When it is determined that all information related to the current decision planning is valid, the subsequent path planning process is performed. In order to ensure the accuracy of subsequent decision-making results and the accuracy of path planning.

在另一种实现方式中，在确定当前决策规划相关信息各信息未均有效的情况下，电子设备可以基于当前决策规划相关信息中有效的信息确定出减速轨迹信息，以控制目标对象基于该减速轨迹信息进行减速行驶。并且重置预设分层状态机的状态，例如：可以将预设分层状态及的状态重置至其指定状态，以保证目标对象及与其存在交互关系的其他对象的安全。In another implementation manner, when it is determined that the information related to the current decision-making plan is not all valid, the electronic device can determine the deceleration trajectory information based on the valid information in the current decision-making plan-related information, so as to control the target object to decelerate based on the information. Track information to decelerate. And reset the state of the preset hierarchical state machine. For example, the preset hierarchical state and the state of the state machine can be reset to their designated state, so as to ensure the safety of the target object and other objects that have an interactive relationship with it.

在一种情况中，目标对象为自动驾驶车辆的情况下，目标对象的自动驾驶系统启动后，该电子设备可以以固定频率运行，其在获得一帧当前决策规划相关信息之后，即可对当前决策规划相关信息中的指定信息进行预处理，并判断当前决策规划相关信息中的各信息是否有效。In one case, when the target object is an automatic driving vehicle, after the automatic driving system of the target object is activated, the electronic device can run at a fixed frequency. The specified information in the decision planning related information is preprocessed, and it is judged whether each information in the current decision planning related information is valid.

在本发明的另一实施例中，所述S104，可以包括如下步骤021-024：In another embodiment of the present invention, the S104 may include the following steps 021-024:

021：判断可达状态集合中是否存在所对应执行标签为必须执行标签的可达状态。021: Determine whether there is a reachable state in which the corresponding execution label is a must-execute label in the reachable state set.

其中，跳转标签为：基于当前决策规划相关信息中的当前周围感知信息和/或当前地图信息所确定的标签。The jump label is: a label determined based on the current surrounding perception information and/or the current map information in the current decision planning related information.

022：若确定可达状态集合中存在所对应执行标签为必须执行标签的可达状态，基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态；基于待评价状态对应的规划轨迹信息以及待评价状态对应的评价因素，确定出待评价状态对应的规划轨迹信息是否可行的评价结果。022: If it is determined that there is a reachable state in the reachable state set in which the corresponding execution label is a mandatory label, based on the current priority corresponding to the top-level state corresponding to each reachable state, determine the reachable state with the highest current priority. The state is regarded as the state to be evaluated; based on the planned trajectory information corresponding to the to-be-evaluated state and the evaluation factors corresponding to the to-be-evaluated state, determine whether the planned trajectory information corresponding to the to-be-evaluated state is feasible or not.

其中，必须执行标签对应的可达状态的优先级最高。Among them, the reachable state corresponding to the label that must be executed has the highest priority.

023：若评价结果表征待评价状态对应的规划轨迹信息可行，则基于可行的规划轨迹信息，从待评价状态中确定出目标对象对应的目标状态。023: If the evaluation result indicates that the planned trajectory information corresponding to the state to be evaluated is feasible, then based on the feasible planned trajectory information, the target state corresponding to the target object is determined from the state to be evaluated.

024：若评价结果表征待评价状态对应的规划轨迹信息不可行，则返回执行022。024: If the evaluation result indicates that the planning trajectory information corresponding to the state to be evaluated is infeasible, return to 022.

在确定目标对象的下一状态，即确定目标对象的规划轨迹时，需要优先关注目标对象以及与其存在交互关系的其他对象的安全，基于当前决策规划相关信息中的当前周围感知信息、当前位姿信息以及当前地图信息所表征的周围情况，难免存在需要目标对象执行某些动作，以避免目标对象和/或与目标对象存在交互关系的其他对象出现危险的情况。相应的，所确定的可达状态中难免存在必须执行以及非必须执行的区别。为了保证目标对象与其存在交互关系的其他对象的安全，对于存在必须执行和不存在必须执行的可达状态所对应规划轨迹信息的评价仲裁过程存在区别。When determining the next state of the target object, that is, determining the planned trajectory of the target object, priority should be given to the safety of the target object and other objects that have an interactive relationship with it. Information and the surrounding situation represented by the current map information inevitably require the target object to perform certain actions to avoid danger to the target object and/or other objects that have an interactive relationship with the target object. Correspondingly, there is inevitably a distinction between mandatory execution and non-essential execution in the determined reachable state. In order to ensure the security of the target object and other objects with which there is an interactive relationship, there is a difference between the evaluation and arbitration process of the planning trajectory information corresponding to the reachable state that must be executed and the reachable state that does not exist.

其中，可达状态是否必须执行可以通过可达状态对应的执行标签确定。其中，可达状态对应的执行标签为：基于当前决策规划相关信息中的当前周围感知信息和/或当前地图信息所确定的标签。例如：在通过当前周围感知信息确定目标对象行驶前方的车道中存在故障对象或突然出现快速移动的对象的情况下，目标对象需要变道行驶；在通过当前周围感知信息以及当前地图信息确定目标对象行驶前方无路或无车道的情况下，目标对象需要变道行驶或转弯行驶；在通过当前周围感知信息以及当前地图信息，确定目标对象行驶前方为十字路口，且目标对象所执行任务需要目标对象左拐，相应的，目标对象需要转弯行驶等。Wherein, whether the reachable state must be executed can be determined by the execution tag corresponding to the reachable state. Wherein, the execution label corresponding to the reachable state is: a label determined based on the current surrounding perception information and/or the current map information in the current decision planning related information. For example, when it is determined that there is a faulty object or a fast-moving object suddenly appearing in the lane ahead of the target object according to the current surrounding perception information, the target object needs to change lanes; When there is no road or lane in front of the vehicle, the target object needs to change lanes or turn; through the current surrounding perception information and current map information, it is determined that there is an intersection in front of the target object, and the task performed by the target object requires the target object Turn left, correspondingly, the target object needs to turn and drive.

本实现方式中，电子设备在对各可达状态对应的规划轨迹信息进行评价仲裁的过程中，首先判断可达状态集合中是否存在所对应执行标签为必须执行标签的可达状态，若确定可达状态集合中存在所对应执行标签为必须执行标签的可达状态，需要优先评价仲裁，并在必须执行标签对应的可达状态对应的规划轨迹信息安全可行的情况下，优先考虑跳转至必须执行标签的可达状态。In this implementation manner, in the process of evaluating and arbitrating the planned trajectory information corresponding to each reachable state, the electronic device firstly judges whether there is a reachable state in the reachable state set that corresponds to the execution label that must be executed. There is a reachable state in the reachable state set that the corresponding execution label is a must-be-executed label, and arbitration needs to be prioritized. If the planned trajectory information corresponding to the reachable state corresponding to the must-execute label is safe and feasible, priority is given to jumping to must. The reachable state of the execution tag.

可达状态集合中的可达状态可以包含处于顶级层级的状态，也可以包含处于非顶级层级的状态，在确定可达状态集合中存在所对应执行标签为必须执行标签的可达状态的情况下，可以基于可达状态对应的顶级层级的状态所对应当前优先级的优先情况，依次针对可达状态对应的规划轨迹信息进行评价仲裁，可以理解的是，必须执行标签的可达状态的当前优先级为最高。The reachable state in the reachable state set can include the state at the top level or the state at the non-top level. If it is determined that there is a reachable state in the reachable state set with the corresponding execution label as the must-execute label , based on the priority status of the current priority corresponding to the state of the top level corresponding to the reachable state, the evaluation and arbitration can be performed for the planning trajectory information corresponding to the reachable state in turn. It is understandable that the current priority of the reachable state of the label must be executed. level is the highest.

具体的，基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态；基于待评价状态对应的规划轨迹信息以及待评价状态对应的评价因素，确定出待评价状态对应的评价指标值，基于待评价状态对应的评价指标值，确定出待评价状态对应的规划轨迹信息是否可行的评价结果；若评价结果表征待评价状态对应的规划轨迹信息可行，则直接将该当前可达状态确定为目标对象对应的目标状态；无需再针对当前优先级低的其他可达状态对应的规划轨迹信息执行评价仲裁过程。Specifically, based on the current priority corresponding to the top-level state corresponding to each reachable state, the corresponding reachable state with the highest current priority is determined as the to-be-evaluated state; based on the planning trajectory information corresponding to the to-be-evaluated state and the to-be-evaluated state Corresponding evaluation factors, determine the evaluation index value corresponding to the state to be evaluated, and determine the evaluation result of whether the planning trajectory information corresponding to the state to be evaluated is feasible based on the evaluation index value corresponding to the state to be evaluated; If the planned trajectory information is feasible, the current reachable state is directly determined as the target state corresponding to the target object; there is no need to perform the evaluation arbitration process for the planned trajectory information corresponding to other reachable states with low current priority.

若评价结果表征待评价状态对应的规划轨迹信息不可行，则继续针对当前优先级低的其他可达状态对应的规划轨迹信息进行评价仲裁，即返回执行基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态的步骤。If the evaluation result indicates that the planned trajectory information corresponding to the to-be-evaluated state is infeasible, continue to perform evaluation arbitration on the planned trajectory information corresponding to the other reachable states with the current low priority, that is, return to execute the execution based on the top-level state corresponding to each reachable state. For the corresponding current priority, determine the reachable state with the highest corresponding current priority as the step to be evaluated.

一种情况中，待评价状态对应的规划轨迹信息中存在至少一个待评价状态对应的规划轨迹信息可行，则可以确定待评价状态对应的规划轨迹信息可行。In one case, if there is at least one planned trajectory information corresponding to the to-be-evaluated state in the planned trajectory information corresponding to the to-be-evaluated state, it can be determined that the planned trajectory information corresponding to the to-be-evaluated state is feasible.

其中，可达状态对应的顶级层级状态为：可达状态所属的处于预设分层状态机中处于顶级层级的状态。举例而言：预设分层状态机的顶级层级的状态包括车道保持、变道以及借道；其中，顶级层级的状态变道包括变道准备、变道执行和变道返回三个子状态。在一种情况中，电子设备基于预设分层状态机的当前状态、各状态之间的跳转关系、当前决策规划相关信息以及预设行驶限定条件，确定目标对象对应的可达状态集合包括车道保持、变道准备和变道执行三个可达状态。其中，车道保持即处于顶级层级，其对应的顶级层级的状态为其自身；变道准备和变道执行为变道的子状态，相应的，变道准备和变道执行对应的顶级层级的状态为变道。The top-level state corresponding to the reachable state is: the state to which the reachable state belongs is at the top-level in the preset hierarchical state machine. For example, the states of the top level of the preset hierarchical state machine include lane keeping, lane change and lane borrowing; wherein, the state change of the top level includes three sub-states of lane change preparation, lane change execution and lane change return. In one case, the electronic device determines the set of reachable states corresponding to the target object based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision planning related information, and the preset driving restriction conditions, including the following: Lane keeping, lane change preparation and lane change execution are three reachable states. Among them, lane keeping is at the top level, and its corresponding top level state is itself; lane change preparation and lane change execution are sub-states of lane change, and correspondingly, the corresponding top level state of lane change preparation and lane change execution for lane change.

在本发明的另一实施例中，待评价状态可能为一个或多个，例如：承接上述例子，变道准备和变道执行对应的顶级层级的状态均为顶级层级状态变道，相应的，变道准备和变道执行对应的当前优先级相同，变道准备与变道执行对应同一父状态，且处于同一层级，则可以认为变道准备与变道执行互为有公有父状态且处于同一层级的可达状态。In another embodiment of the present invention, there may be one or more states to be evaluated. For example, following the above example, the states of the top level corresponding to lane change preparation and lane change execution are both the top level state lane change, correspondingly, The current priority corresponding to the lane change preparation and the lane change execution are the same. The lane change preparation and the lane change execution correspond to the same parent state and are at the same level. It can be considered that the lane change preparation and the lane change execution have a common parent state and are in the same parent state. The reachability status of the hierarchy.

在行驶过程中，目标对象执行某一系列执行动作时，理论上该一系列执行动作之间存在相应的执行顺序，例如：目标对象需要执行变道时，若其变道至的目标道路存在多个对象，目标对象执行变道时，需要先确定变道所需插入的位置，即目标道路中哪两个对象之间的缝隙或哪个对象的前方或后方，进而在其所在道路上行驶至其确定所需插入的位置附近，在执行变道，即进入目标道路。During the driving process, when the target object performs a certain series of execution actions, theoretically, there is a corresponding execution sequence between the series of execution actions. For example, when the target object needs to perform a lane change, if there are many When the target object performs a lane change, it is necessary to first determine the position where the lane change needs to be inserted, that is, the gap between two objects in the target road or the front or back of which object, and then drive on the road to which it is located. Determine the vicinity of the position to be inserted, and perform a lane change, that is, enter the target road.

为了保证目标对象的执行动作的连贯性和舒适性，若待评价状态中包含互为有公有父状态且处于同一层级的可达状态，该类状态之间存在时序阶段过渡关系，该时序阶段过渡关系用于限制各状态之间的执行顺序。In order to ensure the coherence and comfort of the execution of the target object, if the state to be evaluated includes reachable states that share a common parent state with each other and are at the same level, there is a temporal phase transition relationship between such states, and the temporal phase transition Relationships are used to constrain the order of execution between states.

相应的，在本发明的一种实施例中，电子设备在确定出待评价状态之后，并确定待评价状态包含多个的情况，若待评价状态中包含互为有公有父状态且处于同一层级的可达状态；互为有公有父状态且处于同一层级的可达状态之间存在时序阶段过渡关系。Correspondingly, in an embodiment of the present invention, after determining the state to be evaluated, the electronic device determines that the state to be evaluated includes a plurality of situations, if the state to be evaluated includes mutually shared parent states and is at the same level reachable state; there is a time-series phase transition relationship between reachable states that have a common parent state and are at the same level.

所述022，可以包括如下步骤0221-0225：The 022 may include the following steps 0221-0225:

0221：针对互为有公有父状态且处于同一层级的待评价状态，基于各待评价状态对应的时序阶段过渡关系的先后顺序，依次确定当前未评价的且所对应时序阶段过渡关系最先的可达状态，为当前待评价状态。0221: For the to-be-evaluated states that share a common parent state and are at the same level, based on the sequence of the transitional relationships of the time-series stages corresponding to the states to be evaluated, determine in turn the ones that are not currently evaluated and whose corresponding time-series-stage transition relationships are the first. Arrived status, which is the current status to be evaluated.

0222：基于该当前待评价状态对应的规划轨迹信息以及该当前待评价状态对应的评价因素，确定该当前待评价状态对应的评价指标值。0222: Determine an evaluation index value corresponding to the current state to be evaluated based on the planning trajectory information corresponding to the current state to be evaluated and the evaluation factor corresponding to the current state to be evaluated.

0223：若当前待评价状态对应的评级指标值表征：该当前待评价状态对应的规划轨迹信息可行，判断待评价状态中是否存在未评价的状态。0223: If the rating index value corresponding to the current to-be-evaluated state indicates that the planned trajectory information corresponding to the current to-be-evaluated state is feasible, determine whether there is an unevaluated state in the to-be-evaluated state.

0224：若判断存在，返回0221。0224: If it is judged to exist, return 0221.

0225：若判断不存在，或确定出当前待评价状态对应的规划轨迹信息不可行，得到待评价状态对应的规划轨迹信息是否可行的评价结果。0225: If it is judged that it does not exist, or it is determined that the planned trajectory information corresponding to the current state to be evaluated is infeasible, obtain an evaluation result of whether the planned trajectory information corresponding to the state to be evaluated is feasible.

本实现方式中，电子设备遍历待评价状态中的互为有公有父状态且处于同一层级的待评价状态，基于各待评价状态对应的时序阶段过渡关系的先后顺序，依次确定当前未评价的且所对应时序阶段过渡关系最先的可达状态，为当前待评价状态；基于该当前待评价状态对应的规划轨迹信息以及该当前待评价状态对应的评价因素，确定该当前待评价状态对应的评价指标值。进而，基于该当前待评价状态对应的评价指标值，判断该当前待评价状态对应的规划轨迹信息是否可行，若当前待评价状态对应的评级指标值表征：该当前待评价状态对应的规划轨迹信息可行，判断该待评价状态中是否存在未评价的状态，若存在，则继续执行0224；若判断不存在，或确定出当前待评价状态对应的规划轨迹信息不可行，得到待评价状态对应的规划轨迹信息是否可行的评价结果。In this implementation manner, the electronic device traverses the to-be-evaluated states in the to-be-evaluated states that share common parent states and are at the same level, and sequentially determines the currently unevaluated and The first reachable state of the transition relationship in the corresponding time sequence stage is the current state to be evaluated; based on the planning trajectory information corresponding to the current state to be evaluated and the evaluation factors corresponding to the current state to be evaluated, the evaluation corresponding to the current state to be evaluated is determined Index value. Further, based on the evaluation index value corresponding to the current state to be evaluated, it is judged whether the planning trajectory information corresponding to the current state to be evaluated is feasible, if the rating index value corresponding to the current state to be evaluated represents: the planned trajectory information corresponding to the current state to be evaluated Feasible, judge whether there is an unevaluated state in the state to be evaluated, and if so, continue to execute 0224; if it is judged that it does not exist, or it is determined that the planning trajectory information corresponding to the current state to be evaluated is not feasible, obtain the plan corresponding to the state to be evaluated The evaluation result of whether the trajectory information is feasible.

其中，若确定待评价状态中存在至少一个状态对应的规划轨迹信息可行，则所确定的待评价状态对应的规划轨迹信息是否可行的评价结果包括：表征待评价状态对应的规划轨迹信息可行的信息，并且携带规划轨迹信息可行的状态的标识。若确定待评价状态中所有状态对应的规划轨迹信息均不可行，则所确定的待评价状态对应的规划轨迹信息是否可行的评价结果包括：表征待评价状态对应的规划轨迹信息不可行的信息。Wherein, if it is determined that the planned trajectory information corresponding to at least one state in the to-be-evaluated state is feasible, the determined evaluation result of whether the planned trajectory information corresponding to the to-be-evaluated state is feasible includes: information indicating that the planned trajectory information corresponding to the to-be-evaluated state is feasible , and carries the identifier of the feasible state of the planned trajectory information. If it is determined that the planned trajectory information corresponding to all states in the to-be-evaluated state is infeasible, the determined evaluation result of whether the planned trajectory information corresponding to the to-be-evaluated state is feasible includes: information indicating that the planned trajectory information corresponding to the to-be-evaluated state is infeasible.

其中，顶级层级状态对应的子状态所对应规划轨迹信息为顶级层级状态对应的规划轨迹信息的子信息，即顶级层级状态对应的子状态所对应规划轨迹信息可以组合形成顶级层级状态对应的规划轨迹信息。若顶级层级状态对应的子状态还存在下一层级的状态，该子状态的下一层级的各状态所对应规划轨迹信息可以组合形成该子状态所对应规划轨迹信息。以此类推。互为有公有父状态且处于同一层级的各状态所对应规划轨迹信息可以组合形成其父状态对应的规划轨迹信息。The planned trajectory information corresponding to the sub-state corresponding to the top-level state is the sub-information of the planned trajectory information corresponding to the top-level state, that is, the planned trajectory information corresponding to the sub-state corresponding to the top-level state can be combined to form the planned trajectory corresponding to the top-level state. information. If the sub-state corresponding to the top level state also has a state of the next level, the planned trajectory information corresponding to each state of the next level of the sub-state can be combined to form the planned trajectory information corresponding to the sub-state. And so on. The planned trajectory information corresponding to each state that has a common parent state and is at the same level can be combined to form the planned trajectory information corresponding to its parent state.

举例而言，预设分层状态机的顶级层级的状态包括车道保持、变道以及借道；其中，顶级层级的状态变道包括变道准备、变道执行和变道返回三个子状态。目标对象的当前状态为车道保持或者变道准备，基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态，其中，待评价状态包括变道准备、变道执行和变道返回；其中，如图2B所示，变道准备、变道执行和变道返回之间的时序阶段过渡关系表征：变道准备可过渡至变道执行，变道执行可过渡至变道返回，变道返回可过渡至变道执行，变道执行完成，则变道完成。For example, the states of the top level of the preset hierarchical state machine include lane keeping, lane change and lane borrowing; wherein, the state of the top level lane change includes three sub-states of lane change preparation, lane change execution and lane change return. The current state of the target object is lane keeping or lane change preparation. Based on the current priority corresponding to the top-level state corresponding to each reachable state, the reachable state with the highest current priority is determined as the state to be evaluated. The evaluation state includes lane change preparation, lane change execution and lane change return; among them, as shown in Figure 2B, the transition relationship between the time series stages between lane change preparation, lane change execution and lane change return is characterized: lane change preparation can transition to lane change Lane change execution, lane change execution can transition to lane change return, lane change return can transition to lane change execution, and lane change execution is completed, the lane change is completed.

在目标对象的当前状态为车道保持或者变道准备的情况下，基于各待评价状态对应的时序阶段过渡关系的先后顺序，可以依次评价仲裁变道准备对应的规划轨迹信息的可行性，变道执行对应的规划轨迹信息的可行性以及变道返回对应的规划轨迹信息的可行性，直到存在所评价仲裁的状态对应的规划轨迹信息不可行，确定待评价状态对应的规划轨迹信息是否可行的评价结果。In the case where the current state of the target object is lane keeping or lane change preparation, based on the sequence of the transition relationships of the time series stages corresponding to the states to be evaluated, the feasibility of the planned trajectory information corresponding to the arbitration lane change preparation can be evaluated in turn. The feasibility of executing the corresponding planned trajectory information and the feasibility of returning the corresponding planned trajectory information by changing lanes until the planned trajectory information corresponding to the evaluated arbitration state is infeasible, and determining whether the planned trajectory information corresponding to the state to be evaluated is feasible. result.

在目标对象的当前状态为变道执行的情况下，基于各待评价状态对应的时序阶段过渡关系的先后顺序，可以依次评价仲裁变道执行对应的规划轨迹信息的可行性以及变道返回对应的规划轨迹信息的可行性，直到存在所评价仲裁的状态对应的规划轨迹信息不可行，确定待评价状态对应的规划轨迹信息是否可行的评价结果。When the current state of the target object is lane change execution, based on the sequence of the transition relationships of the time series stages corresponding to the states to be evaluated, the feasibility of the planned trajectory information corresponding to the arbitration lane change execution and the corresponding lane change return can be evaluated in turn. The feasibility of the planned trajectory information is determined until there is an evaluation result of whether the planned trajectory information corresponding to the evaluated arbitration state is infeasible, and whether the planned trajectory information corresponding to the to-be-evaluated state is feasible.

在目标对象的当前状态为变道返回的情况下，基于各待评价状态对应的时序阶段过渡关系的先后顺序，可以依次评价仲裁变道执行和变道返回的可行性，直到存在所评价仲裁的状态对应的规划轨迹信息不可行，确定待评价状态对应的规划轨迹信息是否可行的评价结果。In the case where the current state of the target object is lane change and return, based on the sequence of the transition relationships of the time series stages corresponding to the states to be evaluated, the feasibility of the arbitration lane change execution and lane change return can be evaluated in turn, until there is an evaluated arbitrator. The planned trajectory information corresponding to the state is infeasible, and the evaluation result of determining whether the planned trajectory information corresponding to the state to be evaluated is feasible.

如图2A所示，若预设分层状态机的当前状态即目标对象的当前状态为车道保持，电子设备基于预设分层状态机的当前状态、各状态之间的跳转关系、当前决策规划相关信息以及预设行驶限定条件，确定目标对象对应的可达状态集合，如图2A中所示的“可行目标状态集合”包括：车道保持、变道准备以及变道执行，即预设分层状态机的状态可以由表征车道保持跳转至车道保持，或者由车道保持跳转至变道准备，或者由车道保持跳转至变道执行。电子设备可以获得可达状态集合中的每一可达状态的对应的规划轨迹信息；基于各可达状态对应的规划轨迹信息、各可达状态对应的层级及评价因素，确定可达状态对应的评价指标值，并基于可达状态对应的评价指标值，从可达状态中确定出目标对象对应的目标状态，即构建仲裁树。As shown in FIG. 2A , if the current state of the preset hierarchical state machine, that is, the current state of the target object is lane keeping, the electronic device is based on the current state of the preset hierarchical state machine, the jump relationship between the states, and the current decision-making process. Planning-related information and preset driving constraints determine the set of reachable states corresponding to the target object. The “feasible target state set” shown in FIG. 2A includes: lane keeping, lane change preparation, and lane change execution, that is, preset points. The state of the layer state machine can be represented by lane keeping jump to lane keeping, or lane keeping jumping to lane change preparation, or lane keeping jumping to lane change execution. The electronic device can obtain the corresponding planned trajectory information of each reachable state in the reachable state set; based on the planned trajectory information corresponding to each reachable state, the level and evaluation factors corresponding to each reachable state, determine the The evaluation index value is determined, and based on the evaluation index value corresponding to the reachable state, the target state corresponding to the target object is determined from the reachable state, that is, an arbitration tree is constructed.

电子设备可以自底向上构建仲裁树，即确定每一可达状态对应的评价指标值，相应的，由于变道准备以及变道执行互为有公有父状态且处于同一层级的状态，如图2A所示，首先基于变道准备对应的规划轨迹信息、变道准备对应的层级及评价因素，确定变道准备对应的评价指标值；基于变道准备对应的评价指标值对变道准备对应的规划轨迹信息进行评价仲裁，若变道准备对应的评价指标值表征变道准备对应的规划轨迹信息可行，则基于变道执行对应的规划轨迹信息、变道执行对应的层级及评价因素，确定变道执行对应的评价指标值，基于变道执行对应的评价指标值对变道执行对应的规划轨迹信息进行评价仲裁，若变道执行对应的评价指标值表征变道执行对应的规划轨迹信息不可行，一种情况，若变道对应的执行标签为必须执行标签，则相应的电子设备可以直接将变道准备确定为目标状态。另一种情况，若可达状态集合中不存在所对应执行标签为必须执行标签的状态，则基于车道保持对应的规划轨迹信息、车道保持对应的层级及评价因素，确定车道保持对应的评价指标值；基于车道保持对应的评价指标值和变道准备对应的评价指标值，确定车道保持对应的规划轨迹信息与变道准备对应的规划轨迹信息中最优的规划轨迹信息，将最优的规划轨迹信息对应的可达状态确定为目标状态。The electronic device can build an arbitration tree from the bottom up, that is, determine the evaluation index value corresponding to each reachable state. Correspondingly, since the lane change preparation and the lane change execution have a common parent state and are in the same level state, as shown in Figure 2A As shown, firstly, based on the planning trajectory information corresponding to the lane change preparation, the level and evaluation factors corresponding to the lane change preparation, determine the evaluation index value corresponding to the lane change preparation; based on the evaluation index value corresponding to the lane change preparation, the planning corresponding to the lane change preparation is determined. The trajectory information is evaluated and arbitrated. If the evaluation index value corresponding to the lane change preparation indicates that the planned trajectory information corresponding to the lane change preparation is feasible, then the lane change is determined based on the planned trajectory information corresponding to the lane change execution, the level and evaluation factors corresponding to the lane change execution. Execute the corresponding evaluation index value, and evaluate and arbitrate the planning trajectory information corresponding to the lane change execution based on the evaluation index value corresponding to the lane change execution. In one case, if the execution tag corresponding to the lane change is a mandatory execution tag, the corresponding electronic device may directly determine the lane change preparation as the target state. In another case, if there is no state in the reachable state set that the corresponding execution label is a must-execute label, then the evaluation index corresponding to the lane keeping is determined based on the planning trajectory information corresponding to the lane keeping, the level and evaluation factors corresponding to the lane keeping Based on the evaluation index value corresponding to lane keeping and the evaluation index value corresponding to lane change preparation, determine the optimal planning trajectory information among the planning trajectory information corresponding to lane keeping and the planning trajectory information corresponding to lane changing preparation, and assign the optimal planning trajectory information to the planning trajectory information corresponding to lane changing preparation. The reachable state corresponding to the trajectory information is determined as the target state.

相应的，在本发明的另一实施例中，所述023，包括如下步骤：Correspondingly, in another embodiment of the present invention, the 023 includes the following steps:

将可行的规划轨迹信息所对应可达状态中，所对应时序阶段过渡关系最后的可达状态，确定为目标对象对应的目标状态。Among the reachable states corresponding to the feasible planning trajectory information, the last reachable state of the corresponding time sequence phase transition relationship is determined as the target state corresponding to the target object.

本实现方式中，若待评价状态对应的评价结果表征待评价状态对应的规划轨迹信息可行，则基于待评价状态对应的评价结果确定，可行的规划轨迹信息对应的可达状态的标识，并将可行的规划轨迹信息所对应可达状态中，所对应时序阶段过渡关系最后的可达状态，确定为目标对象对应的目标状态。例如：承接上述例子，若依次针对变道准备对应的规划轨迹信息的可行性，变道执行对应的规划轨迹信息的可行性以及变道返回对应的规划轨迹信息的可行性进行评价仲裁，其中，确定变道准备对应的规划轨迹信息可行，但变道执行对应的规划轨迹信息不可行，则确定变道准备为目标对象对应的目标状态。确定出目标状态之后，电子设备可以基于状态转移函数，控制预设分层状态机从当前状态车道保持，跳转至目标状态，例如变道准备。In this implementation manner, if the evaluation result corresponding to the to-be-evaluated state indicates that the planned trajectory information corresponding to the to-be-evaluated state is feasible, then based on the evaluation result corresponding to the to-be-evaluated state, the identification of the reachable state corresponding to the feasible planned trajectory information is determined, and the Among the reachable states corresponding to the feasible planning trajectory information, the last reachable state of the transition relationship in the corresponding time sequence stage is determined as the target state corresponding to the target object. For example, following the above example, if the feasibility of preparing the corresponding planned trajectory information for lane change, the feasibility of executing the corresponding planned trajectory information in the lane change, and the feasibility of returning the corresponding planned trajectory information from the lane change are evaluated and arbitrated in turn, wherein, If it is determined that the planned trajectory information corresponding to the lane change preparation is feasible, but the planned trajectory information corresponding to the lane change execution is not feasible, the lane change preparation is determined to be the target state corresponding to the target object. After the target state is determined, the electronic device can control the preset hierarchical state machine to maintain lane from the current state and jump to the target state, such as lane change preparation, based on the state transition function.

如图2A所示，预设分层状态机可以支持工作人员的场景配置，以使得预设分层状态机可以根据实际使用场景进行扩展状态和/或修改状态。As shown in FIG. 2A , the preset hierarchical state machine can support the scene configuration of the staff, so that the preset hierarchical state machine can expand the state and/or modify the state according to the actual usage scenario.

在本发明的另一实施例中，所述S104，可以包括如下步骤025-026：In another embodiment of the present invention, the S104 may include the following steps 025-026:

025：若确定可达状态集合中不存在所对应执行标签为必须执行标签的可达状态，针对每一可达状态，基于该可达状态对应的规划轨迹信息以及该可达状态对应的评价因素，确定该可达状态对应的评价指标值。025: If it is determined that there is no reachable state in the reachable state set that corresponds to the execution label and the label must be executed, for each reachable state, based on the planning trajectory information corresponding to the reachable state and the evaluation factor corresponding to the reachable state , and determine the evaluation index value corresponding to the reachable state.

026：基于各可达状态对应的评价指标值，从所有可达状态中确定出目标对象对应的目标状态。026: Based on the evaluation index value corresponding to each reachable state, determine the target state corresponding to the target object from all reachable states.

本实现方式中，电子设备若确定可达状态集合中不存在所对应执行标签为必须执行标签的可达状态，若各可达状态属于同一层级，则可以认为各可达状态之间的选择等级是平等，相应的，电子设备可以针对每一可达状态，基于该可达状态对应的规划轨迹信息以及该可达状态对应的评价因素，确定该可达状态对应的评价指标值；进而，基于各可达状态对应的评价指标值，从所有可达状态中确定出相对安全舒适的合理的可达状态，作为目标对象对应的目标状态。In this implementation manner, if the electronic device determines that there is no reachable state in the reachable state set corresponding to the execution label, which is a reachable state that must execute the label, and if each reachable state belongs to the same level, it can be considered that the selection level between each reachable state is is equal, correspondingly, the electronic device can, for each reachable state, determine the evaluation index value corresponding to the reachable state based on the planning trajectory information corresponding to the reachable state and the evaluation factor corresponding to the reachable state; and further, based on The evaluation index value corresponding to each reachable state determines a relatively safe and comfortable and reasonable reachable state from all reachable states as the target state corresponding to the target object.

在本发明的另一实现方式中，电子设备若确定可达状态集合中不存在所对应执行标签为必须执行标签的可达状态，且各可达状态中存在属于不同层级的状态，其中，难免存在互为有公有父状态且处于同一层级的状态。对于互为有公有父状态且处于同一层级的状态，在确定该互为有公有父状态且处于同一层级的状态对应的规划轨迹信息的可行性，即进行评价仲裁过程时，需要结合考虑该互为有公有父状态且处于同一层级的状态之间的时序阶段过渡关系。In another implementation of the present invention, if the electronic device determines that there is no corresponding execution tag in the reachable state set, it is a reachable state that must execute the tag, and there are states belonging to different levels in each reachable state. There are states that share a common parent state with each other and are at the same level. For the states that share a common parent state and are at the same level, when determining the feasibility of planning trajectory information corresponding to the states that share a common parent state and are at the same level, that is, when evaluating the arbitration process, it is necessary to combine the mutual consideration. It is a temporal phase transition relationship between states that have a common parent state and are at the same level.

若从互为有公有父状态且处于同一层级的状态对应的规划轨迹信息中，确定出存在可行的规划轨迹信息，返回所对应规划轨迹信息可行的且所对应时序阶段过渡关系最后的状态对应的评价指标值，作为第一评价指标值，将该第一评价指标值与该互为有公有父状态且处于同一层级的状态的父状态对应的评价指标值进行比较，以从其中确定出所对应规划轨迹信息最优的状态，作为目标状态。其中，若评价指标值以惩罚项的方式存在，则可达状态对应的评价指标值越大，则其所对应规划轨迹信息越不合理，相应的，所对应规划轨迹信息最优的状态可以为所对应评价指标值最小的状态。If it is determined that there is feasible planning trajectory information from the planning trajectory information corresponding to the states that have common parent states and are at the same level, return the corresponding planning trajectory information that is feasible and that corresponds to the last state of the transition relationship of the corresponding time sequence stage. The evaluation index value is used as the first evaluation index value, and the first evaluation index value is compared with the evaluation index value corresponding to the parent state that has a common parent state and is at the same level, so as to determine the corresponding plan. The optimal state of the trajectory information is taken as the target state. Among them, if the evaluation index value exists in the form of a penalty item, the larger the evaluation index value corresponding to the reachable state is, the more unreasonable the corresponding planned trajectory information is. Correspondingly, the optimal state of the corresponding planned trajectory information can be: The state with the smallest value of the corresponding evaluation index.

在本发明的另一实施例中，所述方法还可以包括：In another embodiment of the present invention, the method may further include:

基于目标状态确定目标状态对应的可视化信号以及可视化信息，并输出，以使目标对象基于可视化信号以及可视化信息进行相应的展示。Based on the target state, a visual signal and visual information corresponding to the target state are determined and output, so that the target object is displayed accordingly based on the visual signal and the visual information.

本实现方式中，考虑到目标对象在执行行驶过程，及执行各状态对应的执行动作的过程中，需要向其对应的使用者以及与其存在交互关系的其他对象展示其将要处于的状态以及将要执行的动作。相应的，电子设备在确定出目标状态之后，可以基于目标状态确定目标状态对应的可视化信号以及可视化信息并输出，以使目标对象基于可视化信号以及可视化信息进行相应的展示。In this implementation, it is considered that the target object needs to show the state it will be in and the state it will be in to its corresponding user and other objects that have an interactive relationship with it in the process of executing the driving process and executing the action corresponding to each state. Actions. Correspondingly, after determining the target state, the electronic device can determine and output the visual signal and visual information corresponding to the target state based on the target state, so that the target object can be displayed accordingly based on the visual signal and visual information.

该可视化信号可以是基于目标状态所确定的用于控制目标对象的指定展示器件进行相应展示的信息，例如：目标状态为变道，该可视化信号为用于控制目标对象的左转或右转指示灯闪烁或持续点亮的信号，其中，具体的左转指示灯亮或者右转指示灯亮基于目标状态对应的规划轨迹信息确定。可视化信息为用于向目标对象的使用者所展示的信息，例如：可以是用于描述目标对象即将跳转的状态即决策结果的信息和/或用于描述目标对象即将执行的动作的信息等。The visual signal may be information that is determined based on the target state and is displayed by a designated display device used to control the target object. For example, the target state is a lane change, and the visual signal is an indication of a left turn or a right turn for controlling the target object. A signal that the light flashes or lights up continuously, wherein the specific left turn indicator light or right turn indicator light is on is determined based on the planned trajectory information corresponding to the target state. Visualization information is the information that is displayed to the user of the target object, for example, it can be information used to describe the state that the target object is about to jump to, that is, the decision result, and/or information used to describe the action to be performed by the target object, etc. .

相应于上述方法实施例，本发明实施例提供了一种路径规划装置，如图3所示，所述装置可以包括：Corresponding to the foregoing method embodiments, an embodiment of the present invention provides a path planning apparatus. As shown in FIG. 3 , the apparatus may include:

第一获得模块310，被配置为获得目标对象对应的当前决策规划相关信息；The first obtaining module 310 is configured to obtain the current decision planning related information corresponding to the target object;

第一确定模块320，被配置为基于预设分层状态机的当前状态、各状态之间的跳转关系、所述当前决策规划相关信息以及预设行驶限定条件，确定所述目标对象对应的可达状态集合，所述预设分层状态机包括：所述目标对象对应的各状态之间的跳转关系和层级关系；The first determination module 320 is configured to determine the corresponding target object based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision planning related information, and the preset driving limit conditions. A set of reachable states, the preset hierarchical state machine includes: a jump relationship and a hierarchical relationship between states corresponding to the target object;

第二获得模块330，被配置为获得所述可达状态集合中各可达状态对应的规划轨迹信息；The second obtaining module 330 is configured to obtain planning trajectory information corresponding to each reachable state in the set of reachable states;

第二确定模块340，被配置为基于各可达状态对应的规划轨迹信息、各可达状态对应的层级及评价因素，从所有可达状态中确定出所述目标对象对应的目标状态，其中，可达状态对应的层级为其在预设分层状态机中的层级；The second determination module 340 is configured to determine the target state corresponding to the target object from all the reachable states based on the planning trajectory information corresponding to each reachable state, the level and evaluation factors corresponding to each reachable state, wherein, The level corresponding to the reachable state is its level in the preset hierarchical state machine;

控制模块350，被配置为控制所述预设分层状态机从所述当前状态跳转至所述目标状态，以使目标对象基于所述目标状态对应的规划轨迹信息行驶。The control module 350 is configured to control the preset hierarchical state machine to jump from the current state to the target state, so that the target object travels based on the planned trajectory information corresponding to the target state.

在本发明的另一实施例中，所述当前决策规划相关信息包括：目标对象的当前周围感知信息、当前位姿信息、当前地图信息以及障碍物的预测信息。In another embodiment of the present invention, the current decision planning related information includes: current surrounding perception information, current pose information, current map information, and prediction information of obstacles of the target object.

在本发明的另一实施例中，所述第一确定模块320，被具体配置为对所述当前决策规划相关信息中的指定信息进行预处理，并判断所述当前决策规划相关信息中的各信息是否有效；In another embodiment of the present invention, the first determining module 320 is specifically configured to preprocess the specified information in the current decision planning related information, and determine each item in the current decision planning related information whether the information is valid;

在本发明的另一实施例中，所述第二确定模块340，包括：In another embodiment of the present invention, the second determining module 340 includes:

判断单元(图中未示出)，被配置为判断所述可达状态集合中是否存在所对应执行标签为必须执行标签的可达状态，其中，所述跳转标签为：基于所述当前决策规划相关信息中的当前周围感知信息和/或当前地图信息所确定的标签；A judging unit (not shown in the figure), configured to judge whether there is a reachable state in which the corresponding execution label is a must-execute label in the reachable state set, wherein the jump label is: based on the current decision The label determined by the current surrounding perception information and/or the current map information in the planning-related information;

第一确定单元(图中未示出)，被配置为若确定所述可达状态集合中存在所对应执行标签为必须执行标签的可达状态，基于各可达状态对应的顶级层级状态所对应的当前优先级，确定所对应当前优先级最高的可达状态，作为待评价状态；基于待评价状态对应的规划轨迹信息以及待评价状态对应的评价因素，确定出待评价状态对应的规划轨迹信息是否可行的评价结果，其中，必须执行标签对应的可达状态的优先级最高；The first determination unit (not shown in the figure) is configured to, if it is determined that there is a reachable state in the reachable state set that corresponds to the execution label as a must-executed label, based on the corresponding top-level state corresponding to each reachable state. The current priority corresponding to the current priority is determined as the state to be evaluated; based on the planning trajectory information corresponding to the state to be evaluated and the evaluation factors corresponding to the state to be evaluated, the planned trajectory information corresponding to the state to be evaluated is determined. Whether the evaluation result is feasible, among which, the reachable state corresponding to the label must be executed with the highest priority;

第二确定单元(图中未示出)，被配置为若所述评价结果表征待评价状态对应的规划轨迹信息可行，则基于可行的规划轨迹信息，从待评价状态中确定出所述目标对象对应的目标状态；The second determination unit (not shown in the figure) is configured to determine the target object from the to-be-evaluated state based on the feasible planned trajectory information if the evaluation result indicates that the planned trajectory information corresponding to the to-be-evaluated state is feasible the corresponding target state;

在本发明的另一实施例中，若待评价状态中包含互为有公有父状态且处于同一层级的可达状态；互为有公有父状态且处于同一层级的可达状态之间存在时序阶段过渡关系；In another embodiment of the present invention, if the states to be evaluated include reachable states that share a common parent state with each other and are at the same level; there is a time sequence stage between the reachable states that share a common parent state and are at the same level transitional relationship;

所述第一确定单元(图中未示出)，被具体配置为针对互为有公有父状态且处于同一层级的待评价状态，基于各待评价状态对应的时序阶段过渡关系的先后顺序，依次确定当前未评价的且所对应时序阶段过渡关系最先的可达状态，为当前待评价状态；The first determination unit (not shown in the figure) is specifically configured to, for the to-be-evaluated states that have a common parent state and are at the same level, based on the sequence of the transitional relationships of the time-series stages corresponding to the to-be-evaluated states, in turn. Determine the reachable state that is not currently evaluated and the first transition relationship of the corresponding time sequence phase is the current state to be evaluated;

若判断存在，返回所述从待评价状态中，确定出未评价的且所对应时序阶段过渡关系最先的可达状态，作为当前待评价状态的步骤；If it is judged to exist, returning to the state to be evaluated, and determining the unevaluated and the first reachable state of the corresponding time sequence phase transition relationship, as the step of the current state to be evaluated;

在本发明的另一实施例中，所述第二确定单元(图中未示出)，被具体配置为将可行的规划轨迹信息所对应可达状态中，所对应时序阶段过渡关系最后的可达状态，确定为所述目标对象对应的目标状态。In another embodiment of the present invention, the second determining unit (not shown in the figure) is specifically configured to determine the last available state of the transition relationship of the corresponding timing stage in the reachable state corresponding to the feasible planning trajectory information. reach state, and determine the target state corresponding to the target object.

在本发明的另一实施例中，所述第二确定模块340，还包括：In another embodiment of the present invention, the second determining module 340 further includes:

在本发明的另一实施例中，所述第二获得模块330，被具体配置为基于所述可达状态集合中各可达状态对应的路径规划算法以及所述当前决策规划相关信息中目标对象的当前周围感知信息、当前位姿信息、当前地图信息和障碍物的预测信息，确定所述可达状态集合中各可达状态对应的规划轨迹信息。In another embodiment of the present invention, the second obtaining module 330 is specifically configured to be based on the path planning algorithm corresponding to each reachable state in the reachable state set and the target object in the current decision planning related information The current surrounding perception information, the current pose information, the current map information and the prediction information of the obstacle are determined, and the planned trajectory information corresponding to each reachable state in the set of reachable states is determined.

在本发明的另一实施例中，所述装置还包括：In another embodiment of the present invention, the device further includes:

确定输出模块(图中未示出)，被配置为基于所述目标状态确定所述目标状态对应的可视化信号以及可视化信息，并输出，以使所述目标对象基于所述可视化信号以及可视化信息进行相应的展示。A determination output module (not shown in the figure) is configured to determine, based on the target state, a visual signal and visual information corresponding to the target state, and output it, so that the target object performs an operation based on the visual signal and the visual information. corresponding display.

上述系统、装置实施例与系统实施例相对应，与该方法实施例具有同样的技术效果，具体说明参见方法实施例。装置实施例是基于方法实施例得到的，具体的说明可以参见方法实施例部分，此处不再赘述。本领域普通技术人员可以理解：附图只是一个实施例的示意图，附图中的模块或流程并不一定是实施本发明所必须的。The foregoing system and device embodiments correspond to the system embodiments, and have the same technical effects as the method embodiments. For specific descriptions, refer to the method embodiments. The apparatus embodiment is obtained based on the method embodiment, and the specific description can refer to the method embodiment section, which will not be repeated here. Those of ordinary skill in the art can understand that the accompanying drawing is only a schematic diagram of an embodiment, and the modules or processes in the accompanying drawing are not necessarily necessary to implement the present invention.

本领域普通技术人员可以理解：实施例中的装置中的模块可以按照实施例描述分布于实施例的装置中，也可以进行相应变化位于不同于本实施例的一个或多个装置中。上述实施例的模块可以合并为一个模块，也可以进一步拆分成多个子模块。Those skilled in the art may understand that: the modules in the apparatus in the embodiment may be distributed in the apparatus in the embodiment according to the description of the embodiment, and may also be located in one or more apparatuses different from this embodiment with corresponding changes. The modules in the foregoing embodiments may be combined into one module, or may be further split into multiple sub-modules.

最后应说明的是：以上实施例仅用以说明本发明的技术方案，而非对其限制；尽管参照前述实施例对本发明进行了详细的说明，本领域的普通技术人员应当理解：其依然可以对前述实施例所记载的技术方案进行修改，或者对其中部分技术特征进行等同替换；而这些修改或者替换，并不使相应技术方案的本质脱离本发明实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that it can still be The technical solutions described in the foregoing embodiments are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions in the embodiments of the present invention.

Claims

1. A method of path planning, the method comprising:

obtaining relevant information of a current decision-making plan corresponding to a target object;

determining a reachable state set corresponding to the target object based on a current state of a preset layered state machine, a jump relation among states, the current decision-making planning related information and a preset driving limiting condition, wherein the preset layered state machine comprises: jump relations and hierarchical relations among the states corresponding to the target object;

acquiring planning track information corresponding to each reachable state in the reachable state set;

determining a target state corresponding to the target object from all reachable states based on planning track information corresponding to each reachable state, a level corresponding to each reachable state and evaluation factors, wherein the level corresponding to each reachable state is the level of the reachable state in a preset layered state machine;

and controlling the preset hierarchical state machine to jump from the current state to the target state so as to enable the target object to run based on the planned track information corresponding to the target state.

2. The method of claim 1, wherein the current decision plan related information comprises: current surrounding perception information, current pose information, current map information, and prediction information of the obstacle of the target object.

3. The method as claimed in claim 1, wherein the step of determining the reachable state set corresponding to the target object based on the current state of the preset hierarchical state machine, the jump relationship between the states, the current decision-making planning related information and the preset driving restriction condition comprises:

preprocessing the specified information in the current decision-making planning related information, and judging whether each information in the current decision-making planning related information is effective or not;

and under the condition that the obtained relevant information of the current decision-making plan is effective, determining a reachable state set corresponding to the target object based on the current state of a preset hierarchical state machine, the jump relation among the states, the relevant information of the current decision-making plan and a preset running limiting condition.

4. The method according to any one of claims 1 to 3, wherein the step of determining the target state corresponding to the target object from all reachable states based on the planned trajectory information corresponding to each reachable state, the hierarchy corresponding to each reachable state, and the evaluation factor comprises:

judging whether the reachable state set has a reachable state in which the corresponding execution tag is the necessary execution tag, wherein the skip tag is: a label determined based on current surrounding awareness information and/or current map information in the current decision plan related information;

if the reachable state set is determined to have the corresponding execution tag as the reachable state of the tag to be executed, determining the reachable state with the highest corresponding current priority as the state to be evaluated based on the current priority corresponding to the top level state corresponding to each reachable state; determining whether the planning track information corresponding to the state to be evaluated is feasible or not based on the planning track information corresponding to the state to be evaluated and the evaluation factor corresponding to the state to be evaluated, wherein the reachable state corresponding to the execution tag has the highest priority;

if the evaluation result represents that the planning track information corresponding to the state to be evaluated is feasible, determining a target state corresponding to the target object from the state to be evaluated based on the feasible planning track information;

and if the evaluation result represents that the planning track information corresponding to the state to be evaluated is not feasible, returning to execute the current priority corresponding to the top level state corresponding to each reachable state, and determining the reachable state with the highest corresponding current priority as the state to be evaluated.

5. The method of claim 4, wherein if the states to be evaluated include reachable states that are common parent states to each other and are in the same level; the reachable states which have public father states and are in the same level have a time sequence stage transition relation;

the step of determining whether the planning track information corresponding to the state to be evaluated is feasible or not based on the planning track information corresponding to the state to be evaluated and the evaluation factor corresponding to the state to be evaluated includes:

aiming at to-be-evaluated states which have public father states and are in the same level, sequentially determining a reachable state which is not evaluated currently and has the first transition relation of the corresponding time sequence stage as the current to-be-evaluated state based on the sequence of the transition relation of the time sequence stage corresponding to each to-be-evaluated state;

determining an evaluation index value corresponding to the current state to be evaluated based on the planning track information corresponding to the current state to be evaluated and the evaluation factor corresponding to the current state to be evaluated;

if the rating index value corresponding to the current state to be evaluated represents: the planning track information corresponding to the current state to be evaluated is feasible, and whether an unevaluated state exists in the state to be evaluated is judged;

if yes, returning to the slave state to be evaluated, and determining an unacevated reachable state with the first transition relation of the corresponding time sequence stage as the current state to be evaluated;

and if the evaluation result does not exist, or the planning track information corresponding to the current state to be evaluated is determined to be not feasible, obtaining the evaluation result whether the planning track information corresponding to the state to be evaluated is feasible or not.

6. The method according to claim 5, wherein the step of determining the target state corresponding to the target object from the states to be evaluated based on the feasible planning trajectory information comprises:

and determining the reachable state of the transition relation of the corresponding time sequence stage in the reachable states corresponding to the feasible planning track information as the target state corresponding to the target object.

7. The method of claim 4, wherein the method further comprises:

if it is determined that the corresponding execution tag does not exist in the reachable state set and is the reachable state in which the tag needs to be executed, determining an evaluation index value corresponding to the reachable state based on the planning track information corresponding to the reachable state and the evaluation factor corresponding to the reachable state for each reachable state;

and determining the target state corresponding to the target object from all reachable states based on the evaluation index value corresponding to each reachable state.

8. The method of claim 1, wherein the step of obtaining planned trajectory information corresponding to each reachable state in the set of reachable states comprises:

and determining planning track information corresponding to each reachable state in the reachable state set based on a path planning algorithm corresponding to each reachable state in the reachable state set and current surrounding perception information, current pose information, current map information and prediction information of the barrier of the target object in the current decision planning related information.

9. The method of any one of claims 1-8, further comprising:

and determining a visualization signal and visualization information corresponding to the target state based on the target state, and outputting the visualization signal and visualization information to enable the target object to be correspondingly displayed based on the visualization signal and visualization information.

10. A path planning apparatus, the apparatus comprising:

a first obtaining module configured to obtain current decision-making planning related information corresponding to a target object;

a first determining module, configured to determine, based on a current state of a preset hierarchical state machine, a skip relationship between states, the current decision-making planning related information, and a preset driving limiting condition, a reachable state set corresponding to the target object, where the preset hierarchical state machine includes: jump relations and hierarchical relations among the states corresponding to the target object;

a second obtaining module configured to obtain planning track information corresponding to each reachable state in the reachable state set;

the second determining module is configured to determine a target state corresponding to the target object from all reachable states based on planning track information corresponding to each reachable state, a level corresponding to each reachable state and evaluation factors, wherein the level corresponding to each reachable state is a level of the reachable state in a preset layered state machine;

and the control module is configured to control the preset hierarchical state machine to jump from the current state to the target state so as to enable a target object to run based on the planned track information corresponding to the target state.