CN117076816B

CN117076816B - Response prediction method, response prediction apparatus, computer device, storage medium, and program product

Info

Publication number: CN117076816B
Application number: CN202310890538.3A
Authority: CN
Inventors: 聂冰冰; 秦德通; 王情帆; 李泉; 卢天乐; 刘斯源; 周青
Original assignee: Tsinghua University
Current assignee: Tsinghua University
Priority date: 2023-07-19
Filing date: 2023-07-19
Publication date: 2024-07-16
Anticipated expiration: 2043-07-19
Also published as: CN117076816A

Abstract

The present application relates to a response prediction method, device, computer equipment, storage medium and program product. The method includes: for a target dangerous traffic scene, starting from a preset stage of the target dangerous traffic scene, calculating the cumulative amount of target evidence information at preset time intervals, the target evidence information is the evidence information based on which the target decision is made; after each calculation, after the cumulative amount is obtained, determining whether the decision response condition is met according to the cumulative amount corresponding to the current calculation moment; if the decision response condition is met, determining the response time required for the driver to make the target decision according to the current calculation moment, in this process, there is no need for the big data-driven deep learning model in traditional technology, so the present application reduces the difficulty of predicting the driver's response time.

Description

Response prediction method, device, computer equipment, storage medium and program product

技术领域Technical Field

本申请涉及预测技术领域，特别是涉及一种响应预测方法、装置、计算机设备、存储介质和程序产品。The present application relates to the field of prediction technology, and in particular to a response prediction method, apparatus, computer equipment, storage medium and program product.

背景技术Background technique

随着自动驾驶技术的发展，驾驶辅助系统越来越智能化，用户对于驾驶辅助系统的要求也越来越高，其中在险态交通场景中，预测驾驶人的决策历程及各类决策的响应时长，能够为驾驶辅助系统预测车辆运动规划和路径提供更多的变量输入。With the development of autonomous driving technology, driving assistance systems are becoming more and more intelligent, and users' requirements for driving assistance systems are also getting higher and higher. In dangerous traffic scenarios, predicting the driver's decision-making process and the response time of various decisions can provide more variable inputs for the driving assistance system to predict vehicle motion planning and paths.

传统技术中，通过深度学习模型来预测正常交通场景中驾驶人的响应时长，缺乏对于险态交通场景中的预测，因为深度学习模型需要大数据驱动，而针对险态交通场景中的数据难以获取，因此，在险态交通场景中，传统技术存在预测驾驶人决策响应的难度较大的问题。In traditional technology, deep learning models are used to predict the driver's response time in normal traffic scenarios, but there is a lack of predictions in dangerous traffic scenarios. This is because deep learning models need to be driven by big data, and data for dangerous traffic scenarios is difficult to obtain. Therefore, in dangerous traffic scenarios, traditional technology has the problem of difficulty in predicting the driver's decision-making response.

发明内容Summary of the invention

基于此，有必要针对上述技术问题，提供一种能够降低预测驾驶人决策响应难度的响应预测方法、装置、计算机设备、存储介质和程序产品。Based on this, it is necessary to provide a response prediction method, device, computer equipment, storage medium and program product that can reduce the difficulty of predicting the driver's decision response in response to the above technical problems.

第一方面，本申请提供了一种响应预测方法。所述方法包括：对于目标险态交通场景，从目标险态交通场景的预设阶段开始，每隔预设时长计算目标证据信息的累积量，目标证据信息为做出目标决策所依据的证据信息；在每次计算得到累积量后，根据当前计算时刻对应的累积量，确定是否满足决策响应条件；若满足决策响应条件，则根据当前计算时刻确定驾驶人做出目标决策所需的响应时长。In the first aspect, the present application provides a response prediction method. The method includes: for a target dangerous traffic scene, starting from a preset stage of the target dangerous traffic scene, calculating the cumulative amount of target evidence information at preset time intervals, where the target evidence information is evidence information based on which a target decision is made; after each calculation, determining whether a decision response condition is met based on the cumulative amount corresponding to the current calculation moment; if the decision response condition is met, determining the response time required for the driver to make the target decision based on the current calculation moment.

在其中一个实施例中，每隔预设时长计算目标证据信息的累积量，包括：每隔预设时长，获取上一计算时刻计算得到目标证据信息的累积量，并根据目标险态交通场景中当前时刻的道路信息确定当前支持目标决策的第一证据信息量以及反对目标决策的第二证据信息量，根据上一计算时刻对应的累积量、第一证据信息量以及第二证据信息量获取当前计算时刻的目标证据信息的累积量。In one of the embodiments, the cumulative amount of target evidence information is calculated at preset time intervals, including: at preset time intervals, obtaining the cumulative amount of target evidence information calculated at the previous calculation moment, and determining the first amount of evidence information that currently supports the target decision and the second amount of evidence information that opposes the target decision based on the road information at the current moment in the target dangerous traffic scene, and obtaining the cumulative amount of the target evidence information at the current calculation moment based on the cumulative amount corresponding to the previous calculation moment, the first amount of evidence information, and the second amount of evidence information.

在其中一个实施例中，根据上一计算时刻对应的累积量、第一证据信息量以及第二证据信息量获取当前计算时刻的目标证据信息的累积量，包括：基于维纳过程获取当前计算时刻的噪声量；根据上一计算时刻对应的累积量、第一证据信息量、第二证据信息量以及噪声量获取当前计算时刻的目标证据信息的累积量。In one embodiment, the cumulative amount of target evidence information at the current calculation moment is obtained based on the cumulative amount corresponding to the previous calculation moment, the first evidence information amount, and the second evidence information amount, including: obtaining the noise amount at the current calculation moment based on the Wiener process; obtaining the cumulative amount of target evidence information at the current calculation moment based on the cumulative amount corresponding to the previous calculation moment, the first evidence information amount, the second evidence information amount, and the noise amount.

在其中一个实施例中，根据当前计算时刻对应的累积量，确定是否满足决策响应条件，包括：根据当前计算时刻对应的累积量、驾驶人参数以及车辆参数，确定是否满足决策响应条件；其中，驾驶人参数与目标险态交通场景中的驾驶人的信息相关，车辆参数与目标险态交通场景中的车辆的状态相关。In one of the embodiments, whether the decision response condition is met is determined based on the cumulative amount corresponding to the current calculation moment, including: determining whether the decision response condition is met based on the cumulative amount corresponding to the current calculation moment, driver parameters and vehicle parameters; wherein the driver parameters are related to the information of the driver in the target dangerous traffic scene, and the vehicle parameters are related to the state of the vehicle in the target dangerous traffic scene.

在其中一个实施例中，根据当前计算时刻对应的累积量、驾驶人参数以及车辆参数，确定是否满足决策响应条件，包括：确定当前计算时刻对应的累积量与驾驶人参数之和是否大于等于车辆参数；若大于等于车辆参数，则确定满足决策响应条件。In one of the embodiments, whether the decision response conditions are met is determined based on the cumulative amount corresponding to the current calculation moment, the driver parameters and the vehicle parameters, including: determining whether the sum of the cumulative amount corresponding to the current calculation moment and the driver parameters is greater than or equal to the vehicle parameters; if it is greater than or equal to the vehicle parameters, it is determined that the decision response conditions are met.

在其中一个实施例中，车辆参数根据目标险态交通场景中的车辆的状态以及目标函数计算得到，目标函数为函数值随着时间逐渐减小的函数。In one of the embodiments, the vehicle parameters are calculated based on the state of the vehicle in the target dangerous traffic scene and the objective function, and the objective function is a function whose function value gradually decreases with time.

在其中一个实施例中，目标决策为决策池中的决策，所述方法还包括：确定决策池中的每一目标决策对应的响应时长；根据每一目标决策对应的响应时长，对决策池包括的多个目标决策进行排序，确定驾驶人在目标险态交通场景中的响应历程。In one of the embodiments, the target decision is a decision in a decision pool, and the method further includes: determining the response time corresponding to each target decision in the decision pool; sorting the multiple target decisions included in the decision pool according to the response time corresponding to each target decision, and determining the driver's response process in the target dangerous traffic scenario.

第二方面，本申请还提供了一种响应预测装置。所述装置包括：计算模块，对于目标险态交通场景，从目标险态交通场景的预设阶段开始，每隔预设时长计算目标证据信息的累积量，目标证据信息为做出目标决策所依据的证据信息；第一确定模块，在每次计算得到累积量后，根据当前计算时刻对应的累积量，确定是否满足决策响应条件；第二确定模块，若满足决策响应条件，则根据当前计算时刻确定驾驶人做出目标决策所需的响应时长。In the second aspect, the present application also provides a response prediction device. The device includes: a calculation module, for a target dangerous traffic scene, starting from a preset stage of the target dangerous traffic scene, calculating the cumulative amount of target evidence information at preset time intervals, the target evidence information being the evidence information based on which the target decision is made; a first determination module, after each calculation, after obtaining the cumulative amount, determining whether the decision response condition is met according to the cumulative amount corresponding to the current calculation moment; a second determination module, if the decision response condition is met, determining the response time required for the driver to make the target decision according to the current calculation moment.

在其中一个实施例中，计算模块，具体用于每隔预设时长，获取上一计算时刻计算得到目标证据信息的累积量，并根据目标险态交通场景中当前时刻的道路信息确定当前支持目标决策的第一证据信息量以及反对目标决策的第二证据信息量，根据上一计算时刻对应的累积量、第一证据信息量以及第二证据信息量获取当前计算时刻的目标证据信息的累积量。In one of the embodiments, the calculation module is specifically used to obtain the cumulative amount of target evidence information calculated at the previous calculation moment every preset time period, and determine the first amount of evidence information that currently supports the target decision and the second amount of evidence information that opposes the target decision based on the road information at the current moment in the target dangerous traffic scene, and obtain the cumulative amount of the target evidence information at the current calculation moment based on the cumulative amount corresponding to the previous calculation moment, the first amount of evidence information, and the second amount of evidence information.

在其中一个实施例中，计算模块，具体用于基于维纳过程获取当前计算时刻的噪声量；根据上一计算时刻对应的累积量、第一证据信息量、第二证据信息量以及噪声量获取当前计算时刻的目标证据信息的累积量。In one of the embodiments, the calculation module is specifically used to obtain the noise amount at the current calculation moment based on the Wiener process; and obtain the cumulative amount of the target evidence information at the current calculation moment according to the cumulative amount corresponding to the previous calculation moment, the first evidence information amount, the second evidence information amount and the noise amount.

在其中一个实施例中，第一确定模块，具体用于根据当前计算时刻对应的累积量、驾驶人参数以及车辆参数，确定是否满足决策响应条件；其中，驾驶人参数与目标险态交通场景中的驾驶人的信息相关，车辆参数与目标险态交通场景中的车辆的状态相关。In one of the embodiments, the first determination module is specifically used to determine whether the decision response conditions are met based on the cumulative amount, driver parameters and vehicle parameters corresponding to the current calculation moment; wherein the driver parameters are related to the information of the driver in the target dangerous traffic scene, and the vehicle parameters are related to the state of the vehicle in the target dangerous traffic scene.

在其中一个实施例中，第一确定模块，具体用于确定当前计算时刻对应的累积量与驾驶人参数之和是否大于等于车辆参数，若大于等于车辆参数，则确定满足决策响应条件。In one of the embodiments, the first determination module is specifically used to determine whether the sum of the cumulative amount corresponding to the current calculation moment and the driver parameter is greater than or equal to the vehicle parameter. If it is greater than or equal to the vehicle parameter, it is determined that the decision response condition is met.

在其中一个实施例中，目标决策为决策池中的决策，所述装置还包括第三确定模块，用于确定决策池中的每一目标决策对应的响应时长；根据每一目标决策对应的响应时长，对决策池包括的多个目标决策进行排序，确定驾驶人在目标险态交通场景中的响应历程。In one embodiment, the target decision is a decision in a decision pool, and the device also includes a third determination module for determining the response time corresponding to each target decision in the decision pool; according to the response time corresponding to each target decision, the multiple target decisions included in the decision pool are sorted to determine the driver's response process in the target dangerous traffic scenario.

第三方面，本申请还提供了一种计算机设备。所述计算机设备包括存储器和处理器，所述存储器存储有计算机程序，所述处理器执行所述计算机程序时实现上述第一方面任一项所述的方法的步骤。In a third aspect, the present application further provides a computer device, wherein the computer device comprises a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps of any one of the methods described in the first aspect are implemented.

第四方面，本申请还提供了一种计算机可读存储介质。所述计算机可读存储介质，其上存储有计算机程序，所述计算机程序被处理器执行时实现上述第一方面任一项所述的方法的步骤。In a fourth aspect, the present application further provides a computer-readable storage medium having a computer program stored thereon, and when the computer program is executed by a processor, the steps of any one of the methods described in the first aspect are implemented.

第五方面，本申请还提供了一种计算机程序产品。所述计算机程序产品，包括计算机程序，该计算机程序被处理器执行时实现上述第一方面任一项所述的方法的步骤。In a fifth aspect, the present application further provides a computer program product, wherein the computer program product comprises a computer program, and when the computer program is executed by a processor, the steps of any one of the methods described in the first aspect are implemented.

上述响应预测方法、装置、计算机设备、存储介质和程序产品，对于目标险态交通场景，从目标险态交通场景的预设阶段开始，通过每隔预设时长计算目标证据信息的累积量，其中，目标证据信息为做出目标决策所依据的证据信息，再在每次计算得到累积量后，根据当前计算时刻对应的累积量，确定是否满足决策响应条件，若满足决策响应条件，则根据当前计算时刻确定驾驶人做出目标决策所需的响应时长，此过程无需传统技术中的大数据驱动深度学习模型，从而本申请降低了预测驾驶人决策响应的难度。The above-mentioned response prediction method, device, computer equipment, storage medium and program product, for the target dangerous traffic scenario, starts from the preset stage of the target dangerous traffic scenario, calculates the cumulative amount of target evidence information at preset time intervals, wherein the target evidence information is the evidence information based on which the target decision is made, and then after each calculation, determines whether the decision response condition is met according to the cumulative amount corresponding to the current calculation moment. If the decision response condition is met, the response time required for the driver to make the target decision is determined according to the current calculation moment. This process does not require the big data-driven deep learning model in traditional technology, so the present application reduces the difficulty of predicting the driver's decision response.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为一个实施例中一种响应预测方法的流程示意图；FIG1 is a schematic flow chart of a response prediction method in one embodiment;

图2为一个实施例中一种证据积累模型示意图；FIG2 is a schematic diagram of an evidence accumulation model in one embodiment;

图3为一个实施例中另一种响应预测方法；FIG3 is another response prediction method according to an embodiment;

图4为一个实施例中一种贝叶斯框架原理图；FIG4 is a schematic diagram of a Bayesian framework in one embodiment;

图5为一个实施例中一种响应预测装置的结构框图；FIG5 is a structural block diagram of a response prediction device in one embodiment;

图6为一个实施例中计算机设备的内部结构图。FIG. 6 is a diagram showing the internal structure of a computer device in one embodiment.

具体实施方式Detailed ways

为了使本申请的目的、技术方案及优点更加清楚明白，以下结合附图及实施例，对本申请进行进一步详细说明。应当理解，此处描述的具体实施例仅仅用以解释本申请，并不用于限定本申请。In order to make the purpose, technical solution and advantages of the present application more clearly understood, the present application is further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application and are not used to limit the present application.

险态交通场景指的是在道路交通环境中，突然出现的危险源或正常行驶的交通参与者转变为危险源，对自车正常行驶造成影响的场景工况。险态交通场景通常需要驾驶人进行主动响应(如松油门、制动和转向等)以规避风险避免碰撞。Dangerous traffic scenarios refer to scenarios where a sudden danger source appears in the road traffic environment or a normal traffic participant turns into a danger source, affecting the normal driving of the vehicle. Dangerous traffic scenarios usually require the driver to actively respond (such as releasing the accelerator, braking, and steering) to avoid risks and avoid collisions.

道路交通环境中的车辆从正常行驶到碰撞发生往往可以划分为四个阶段，第一阶段车辆正常驾驶，面对复杂的交通环境，车辆通过传感器或驾驶人不断与环境进行信息交互实现环境监控，保证车辆的正常行驶。当出现危险工况时，车辆进入第二阶段，车辆传感器或驾驶人感知危险源，车辆主动安全系统介入干预，该阶段往往从碰撞前6秒到碰撞前3秒，主动安全系统成功干预和规避碰撞的概率较大。一旦未规避碰撞，车辆进入第三阶段临界险态，也即是进入险态交通场景，驾驶人发生主动响应行为(又称为主动行为)，对车辆采取最大能力的干预，同时智能约束系统启动，车辆干预可能成功规避了碰撞，也可能不成功但降低了碰撞接触强度，此阶段驾驶人的决策行为及反应时间直接影响到碰撞零时刻驾驶人的状态，进而对损伤风险造成影响。最后进入第四阶段碰撞阶段，在车辆主被动安全一体化防护设备的作用下对车内乘员提供有针对性的保护。Vehicles in a road traffic environment can often be divided into four stages from normal driving to collision. In the first stage, the vehicle is driving normally. Facing a complex traffic environment, the vehicle continuously exchanges information with the environment through sensors or drivers to achieve environmental monitoring and ensure the normal driving of the vehicle. When a dangerous condition occurs, the vehicle enters the second stage, the vehicle sensor or driver perceives the danger source, and the vehicle's active safety system intervenes. This stage often lasts from 6 seconds before the collision to 3 seconds before the collision. The probability of the active safety system successfully intervening and avoiding the collision is relatively high. Once the collision is not avoided, the vehicle enters the third stage of critical danger, that is, entering a dangerous traffic scene. The driver takes active response behavior (also known as active behavior) and intervenes in the vehicle to the maximum extent possible. At the same time, the intelligent restraint system is activated. The vehicle intervention may successfully avoid the collision, or it may not be successful but reduce the collision contact intensity. The driver's decision-making behavior and reaction time at this stage directly affect the driver's state at the zero moment of the collision, and thus affect the risk of injury. Finally, it enters the fourth stage of the collision stage, and the vehicle's active and passive safety integrated protection equipment provides targeted protection for the occupants in the vehicle.

在险态交通场景中，驾驶人如不对车辆采取主动控制，则险态交通场景会演变为严重的交通事故，威胁交通参与者安全。因此驾驶人需要在车辆进入第三阶段临界险态过程中主动响应以规避碰撞或降低碰撞强度。而主动响应主要关注决策类型及不同类型的响应时长(即反应时间)，在时间历程上的不同决策过程就构成了驾驶人的主动响应历程。其中，驾驶人在险态交通场景中的反应时间通常被定义为危险源或危险信息出现(如跟车场景中前导车开始制动)到驾驶人做出主动响应决策以及采取避撞行为(如松油门、制动、转向等)的时间。In dangerous traffic scenarios, if the driver does not take active control of the vehicle, the dangerous traffic scenario will evolve into a serious traffic accident, threatening the safety of traffic participants. Therefore, the driver needs to actively respond to avoid collision or reduce the intensity of collision when the vehicle enters the third stage of critical dangerous state. The active response mainly focuses on the decision type and the response time of different types (i.e. reaction time). The different decision-making processes in the time course constitute the driver's active response process. Among them, the driver's reaction time in dangerous traffic scenarios is usually defined as the time from the appearance of the danger source or danger information (such as the leading vehicle in the following scenario starts braking) to the driver making an active response decision and taking collision avoidance actions (such as releasing the accelerator, braking, steering, etc.).

传统技术中，驾驶人决策类型及反应时间预测算法的研究集中在车辆常规巡航任务(如换道、转向、跟车、疲劳检测，以及自动驾驶车辆上驾驶人第二任务的识别等)下，而缺乏针对险态交通场景中驾驶人决策类型推断及反应时间预测的研究。且传统驾驶人决策类型推断及反应时间预测模型分为分解式和端到端式两种技术方案，其中，分解式方案进一步分为感知、控制、决策三大模块，分工清晰，可解释性强，但系统复杂度高，计算量大；端到端方案基于机器学习模型或深度学习模型模拟拟人化驾驶行为决策，计算量小、系统复杂度低，但对算法要求高，安全性低，可解释性和可靠性差，且机器学习模型和深度学习模型需要大量数据驱动，这是险态交通场景研究的一大局限。In traditional technologies, the research on driver decision type and reaction time prediction algorithms focuses on vehicle routine cruising tasks (such as lane changing, steering, following, fatigue detection, and identification of the driver's second task on autonomous vehicles), but lacks research on driver decision type inference and reaction time prediction in dangerous traffic scenarios. In addition, the traditional driver decision type inference and reaction time prediction models are divided into two technical solutions: decomposition and end-to-end. Among them, the decomposition solution is further divided into three modules: perception, control, and decision-making. It has clear division of labor and strong interpretability, but the system complexity is high and the amount of calculation is large; the end-to-end solution is based on machine learning models or deep learning models to simulate anthropomorphic driving behavior decisions. The amount of calculation is small and the system complexity is low, but the algorithm requirements are high, the safety is low, and the interpretability and reliability are poor. In addition, machine learning models and deep learning models require a lot of data to drive, which is a major limitation of dangerous traffic scene research.

另外，通过决策树、模糊逻辑控制等机器学习模型或深度学习模型进行驾驶人决策类型推断及反应时间预测，大多是基于数据驱动的黑盒化过程，虽然具有较好的拟合精度，但无法解释决策过程中的逻辑关系，且难以解释驾驶人的反应时间分布及其变化形式。In addition, the inference of driver decision type and prediction of reaction time through machine learning models or deep learning models such as decision trees and fuzzy logic control are mostly data-driven black box processes. Although they have good fitting accuracy, they cannot explain the logical relationship in the decision-making process and it is difficult to explain the distribution of driver reaction time and its changing forms.

此外，还有以TTC(Time to Collision，临碰撞时间)、THW(Time Headway，车头时距)等描述场景紧急程度的物理量为参考，来进行驾驶人决策类型推断及反应时间预测。然而，人-车-路构成了异常复杂的道路交通系统，人员是该系统的核心要素，驾驶人在险态交通场景中会感知关键要素、理解交通场景中的潜在危险、根据先验知识和场景特征预测危险源的运动形式、做出避撞决策并最终采取避撞操作行为，因此，在真实险态交通场景中，驾驶人的感知决策、主动响应同时受到驾驶人(年龄、风格、疲劳状态等)、车辆(动力学状态、控制模式等)、交通场景(事故类型、紧急程度、危险源类型等)多种因素的影响。因此，以TTC和THW预测到的决策类型和反应时间存在预测准确度较低的问题。In addition, physical quantities such as TTC (Time to Collision) and THW (Time Headway) that describe the urgency of the scene are used as references to infer the driver's decision type and predict the reaction time. However, people, vehicles, and roads constitute an extremely complex road traffic system, and people are the core elements of the system. In dangerous traffic scenes, drivers will perceive key elements, understand potential dangers in traffic scenes, predict the movement form of dangerous sources based on prior knowledge and scene characteristics, make collision avoidance decisions, and finally take collision avoidance actions. Therefore, in real dangerous traffic scenes, the driver's perception, decision-making, and active response are affected by multiple factors such as the driver (age, style, fatigue status, etc.), the vehicle (dynamic status, control mode, etc.), and the traffic scene (accident type, urgency, type of dangerous source, etc.). Therefore, the decision type and reaction time predicted by TTC and THW have the problem of low prediction accuracy.

基于此，有必要综合多方因素对险态交通场景下驾驶人决策类型和反应时间进行预测。Based on this, it is necessary to comprehensively consider multiple factors to predict the driver's decision type and reaction time in dangerous traffic scenarios.

其中，反应时间是影响险态发展的重要随机变量，在心理学或认知学领域，SDT(Signal Detection Theory，信号探测理论)认为人类在接收到刺激信息并做出决策的过程中，感受器在感知刺激信号和人脑神经元的随机信号处理过程都会产生噪声，导致人类决策行为的不确定性。而且，驾驶人在处理大多数道路冲突时，不需要调动大脑的高水平认知功能区进行长时间的深思熟虑，只需根据感知的信息以及先验知识快速做出反应。Among them, reaction time is an important random variable that affects the development of dangerous situations. In the field of psychology or cognitive science, SDT (Signal Detection Theory) believes that when humans receive stimulus information and make decisions, the receptors in the process of perceiving stimulus signals and the random signal processing process of human brain neurons will generate noise, leading to uncertainty in human decision-making behavior. Moreover, when dealing with most road conflicts, drivers do not need to mobilize the high-level cognitive function areas of the brain for long-term deliberation, but only need to respond quickly based on the perceived information and prior knowledge.

在理论神经科学领域，针对人脑在外界刺激下决策过程建立了不同尺度下的决策模型。细观层的NSP(Neuron Spiking Model，神经元脉冲模型)能模拟神经元在接受刺激信号后累计电位达到阈值产生发放的生理过程；介观层的RRM(Reduced Rate Model，简化脉冲发放率模型)是基于功能神经元群动态稳定特性建立的简化多神经元脉冲发放模型；宏观行为层的DDM(Drift Diffusion Model，漂移扩散决策模型)和LATER(Linear Approachto Threshold with Ergodic Rate，遍历速率的线性阈值决策)模型能表征人在行为认知层所表现出的行为特性。行为层的DDM和LATER模型则能够表征人类决策结果，反应时间与刺激信号变化间的关系。但是相比传统认知行为学研究中的单一静态决策任务，动态交通场景中的驾驶人决策受到众多潜在因素的影响。In the field of theoretical neuroscience, decision-making models at different scales have been established for the decision-making process of the human brain under external stimuli. The NSP (Neuron Spiking Model) at the microscopic level can simulate the physiological process of neurons firing when the accumulated potential reaches the threshold after receiving the stimulation signal; the RRM (Reduced Rate Model) at the mesoscopic level is a simplified multi-neuron pulse firing model based on the dynamic stability characteristics of the functional neuron group; the DDM (Drift Diffusion Model) and LATER (Linear Approach to Threshold with Ergodic Rate) models at the macroscopic behavioral level can characterize the behavioral characteristics of people at the behavioral cognitive level. The DDM and LATER models at the behavioral level can characterize the relationship between human decision results, reaction time and changes in stimulation signals. However, compared with the single static decision-making task in traditional cognitive behavioral research, the driver's decision in dynamic traffic scenes is affected by many potential factors.

基于SDT的模型大多关注在构建阈值模型检测和反应阈值，即关注的信号在某个时刻超过阈值驾驶人就做出反应，忽略了险态交通场景发展的时间历程。证据积累过程强调伴随噪声的不同信号在时间历程上的演变与积累。证据积累模型，如DDM或LATER模型已证明能够拟合不同任务和复杂度的认知决策任务，且有研究证实了对应的神经生理学机制。但未有在现实交通场景中的应用。Most SDT-based models focus on constructing threshold model detection and reaction thresholds, that is, the driver reacts when the signal of interest exceeds the threshold at a certain moment, ignoring the time course of the development of dangerous traffic scenes. The evidence accumulation process emphasizes the evolution and accumulation of different signals accompanied by noise over time. Evidence accumulation models, such as DDM or LATER models, have been shown to fit cognitive decision-making tasks of different tasks and complexity, and studies have confirmed the corresponding neurophysiological mechanisms. However, there is no application in real traffic scenarios.

因此，可以借助证据积累过程，将人-车-路三种影响因素融合到一个模型，来准确预测驾驶人决策类型和反应时间，具体过程如下所述。Therefore, with the help of the evidence accumulation process, the three influencing factors of human-vehicle-road can be integrated into a model to accurately predict the driver's decision type and reaction time. The specific process is described as follows.

在一个实施例中，如图1所示，提供了一种响应预测方法，以该方法应用于终端为例进行说明，包括以下步骤：In one embodiment, as shown in FIG1 , a response prediction method is provided, which is described by taking the method applied to a terminal as an example, and includes the following steps:

步骤101，对于目标险态交通场景，从目标险态交通场景的预设阶段开始，每隔预设时长计算目标证据信息的累积量，目标证据信息为做出目标决策所依据的证据信息。Step 101, for a target dangerous traffic scene, starting from a preset stage of the target dangerous traffic scene, the cumulative amount of target evidence information is calculated at preset time intervals, where the target evidence information is evidence information based on which a target decision is made.

其中，目标险态交通场景指的是上述第三阶段，驾驶人也就是驾驶人主动响应规避碰撞阶段。主动响应指的是根据危险工况，驾驶人做出决策并执行该决策的过程，例如，驾驶人看到前导车开始制动，驾驶人经过短暂的思考后决定右转向，且将方向盘向右打的过程。The target dangerous traffic scenario refers to the third stage mentioned above, the driver's active response to avoid collision. Active response refers to the process in which the driver makes a decision and executes the decision based on the dangerous working conditions. For example, the driver sees the leading vehicle start to brake, and after a short thought, the driver decides to turn right and turns the steering wheel to the right.

预设阶段可以是预先输入的目标险态交通场景的开始时刻。The preset stage may be the start time of a pre-input target dangerous traffic scenario.

目标决策包括松油门、制动和转向等中的至少一种。证据信息指的是视觉变化刺激(视角θ、视角变化率或其组合)、刹车灯等视觉提示信息、距离、速度等三维空间信息等。The target decision includes at least one of releasing the accelerator, braking and turning, etc. The evidence information refers to visual change stimuli (viewing angle θ, viewing angle change rate or a combination thereof), visual prompt information such as brake lights, and three-dimensional spatial information such as distance and speed.

可选的，每隔预设时长，根据上一计算时刻计算得到的目标证据信息的累积量，支持目标决策的第一证据信息量以及反对目标决策的第二证据信息量，计算当前时刻的目标证据信息的累积量。Optionally, at preset time intervals, the cumulative amount of target evidence information at the current moment is calculated based on the cumulative amount of target evidence information calculated at the previous calculation moment, the amount of first evidence information supporting the target decision, and the amount of second evidence information opposing the target decision.

步骤102，在每次计算得到累积量后，根据当前计算时刻对应的累积量，确定是否满足决策响应条件。Step 102, after each calculation to obtain the cumulative amount, determine whether the decision response condition is met according to the cumulative amount corresponding to the current calculation moment.

可选的，确定当前计算时刻对应的累积量是否大于等于决策阈值，若大于等于决策阈值，则确定满足决策响应条件。Optionally, it is determined whether the cumulative amount corresponding to the current calculation moment is greater than or equal to a decision threshold. If it is greater than or equal to the decision threshold, it is determined that the decision response condition is met.

在另一个可选的实施例中，确定当前计算时刻对应的累积量与驾驶人参数之和是否大于等于车辆参数，若大于等于车辆参数，则确定满足决策响应条件，其中，驾驶人参数与目标险态交通场景中的驾驶人的信息相关，车辆参数与目标险态交通场景中的车辆的状态相关。In another optional embodiment, it is determined whether the sum of the cumulative amount corresponding to the current calculation moment and the driver parameters is greater than or equal to the vehicle parameters. If so, it is determined that the decision response conditions are met, wherein the driver parameters are related to the information of the driver in the target dangerous traffic scene, and the vehicle parameters are related to the state of the vehicle in the target dangerous traffic scene.

步骤103，若满足决策响应条件，则根据当前计算时刻确定驾驶人做出目标决策所需的响应时长。Step 103: If the decision response condition is met, the response time required for the driver to make the target decision is determined according to the current calculation time.

可选的，每隔预设时长，都会计算得到做出目标决策的目标证据信息的累积量，并判断累积量是否满足决策响应条件，若不满足，则将在下一计算时刻继续计算目标证据信息的累计量，直到某个计算时刻计算得到的累积量满足决策响应条件，则将该时刻作为驾驶人做出目标决策所需的响应时长，也即是驾驶人的反应时间。Optionally, the cumulative amount of target evidence information for making a target decision will be calculated at preset time intervals, and a determination will be made as to whether the cumulative amount meets the decision response condition. If not, the cumulative amount of the target evidence information will continue to be calculated at the next calculation moment until the cumulative amount calculated at a certain calculation moment meets the decision response condition. This moment will then be taken as the response time required for the driver to make the target decision, that is, the driver's reaction time.

例如，每0.01秒计算一次目标证据信息的累积量，当在1.15秒时计算得到的累积量满足决策条件，则驾驶人做出目标决策所需的响应时长为1.15秒。For example, the cumulative amount of target evidence information is calculated every 0.01 seconds. When the cumulative amount calculated at 1.15 seconds meets the decision condition, the response time required for the driver to make the target decision is 1.15 seconds.

综上所述，对于目标险态交通场景，从目标险态交通场景的预设阶段开始，通过每隔预设时长计算目标证据信息的累积量，其中，目标证据信息为做出目标决策所依据的证据信息，再在每次计算得到累积量后，根据当前计算时刻对应的累积量，确定是否满足决策响应条件，若满足决策响应条件，则根据当前计算时刻确定驾驶人做出目标决策所需的响应时长，在此过程中，无需传统技术中的大数据驱动深度学习模型，从而本申请降低了预测驾驶人决策响应的难度。To summarize, for the target dangerous traffic scenario, starting from the preset stage of the target dangerous traffic scenario, the cumulative amount of target evidence information is calculated at preset time intervals, wherein the target evidence information is the evidence information based on which the target decision is made. After each calculation, the cumulative amount is obtained, and then it is determined whether the decision response conditions are met according to the cumulative amount corresponding to the current calculation moment. If the decision response conditions are met, the response time required for the driver to make the target decision is determined according to the current calculation moment. In this process, there is no need for big data-driven deep learning models in traditional technologies, and thus the present application reduces the difficulty of predicting the driver's decision response.

其中，道路信息也即是道路环境信息，包括险态场景类型(如追尾、交通路口冲突、对向来车等)、危险源类型(如小汽车、卡车、行人、骑行者等)、碰撞角度、碰撞速度。Among them, road information is also road environment information, including dangerous scene types (such as rear-end collisions, conflicts at traffic intersections, oncoming vehicles, etc.), hazard source types (such as cars, trucks, pedestrians, cyclists, etc.), collision angles, and collision speeds.

可选的，终端可以根据道路信息从决策数据库中查询到支持目标决策的第一证据信息和反对目标决策的第二证据信息，并分别统计第一证据信息的数量和第二证据信息的数量，得到第一证据信息量和第二证据信息量。其中，决策数据库中存储有道路信息和目标决策信息对应的支持和反对证据信息的多组对应关系。Optionally, the terminal can query the first evidence information supporting the target decision and the second evidence information opposing the target decision from the decision database according to the road information, and respectively count the number of the first evidence information and the second evidence information to obtain the first evidence information amount and the second evidence information amount. The decision database stores multiple groups of corresponding relationships between the supporting and opposing evidence information corresponding to the road information and the target decision information.

上述实施例可以用如下数学公式表达：The above embodiment can be expressed by the following mathematical formula:

E(t)＝E(t-Δt)+β(t)Δt+ρW(t)Δt (1)E(t)＝E(t-Δt)+β(t)Δt+ρW(t)Δt (1)

其中，E(t)表示t时刻的累积量，E(t-Δt)表示上一计算时刻对应的累积量，Δt表示预设时长。S(t)表示t时刻支持目标决策的第一证据信息量，O(t)表示t时刻反对目标决策的第二证据信息量，R表示道路信息，g(y_i)表示道路信息中包括的第i项对应的信息，如g(y₁)表示险态场景类型，g(y₂)表示危险源类型，g(y₃)表示碰撞角度，g(y₄)表示碰撞速度，S(t)与O(t)的差值β(t)与道路信息相关，还可以与险态交通场景交互过程(刺激强度和证据质量)相关。W(t)表示维纳过程(Wiener Process)或布朗运动(Brownian Motion)，也即是t时刻的噪声量，ρ表示维纳过程的缩放系数，维纳过程中W(t)-W(t-Δt)的分布不依赖时间t，期望为0，可用公式(3)表示。Among them, E(t) represents the cumulative amount at time t, E(t-Δt) represents the cumulative amount corresponding to the previous calculation time, and Δt represents the preset duration. S(t) represents the amount of first evidence information supporting the target decision at time t, O(t) represents the amount of second evidence information opposing the target decision at time t, R represents the road information, g(y _i ) represents the information corresponding to the i-th item included in the road information, such as g(y ₁ ) represents the dangerous scene type, g(y ₂ ) represents the dangerous source type, g(y ₃ ) represents the collision angle, g(y ₄ ) represents the collision speed, and the difference β(t) between S(t) and O(t) is related to the road information and can also be related to the dangerous traffic scene interaction process (stimulus intensity and evidence quality). W(t) represents the Wiener process or Brownian motion, that is, the amount of noise at time t, ρ represents the scaling factor of the Wiener process, and the distribution of W(t)-W(t-Δt) in the Wiener process does not depend on time t, and is expected to be 0, which can be expressed by formula (3).

另外，表示Δt预设时长内噪声证据积累量，表示Δt预设时长内证据信息积累量。in addition, It represents the amount of noise evidence accumulated within the preset time period of Δt, Indicates the amount of evidence information accumulated within the preset time length Δt.

其中，驾驶人的信息包括年龄、性别、驾驶风格、驾驶经验、疲劳状态、分心状态；车辆的状态包括位置、航向角、速度、加速度、控制状态(自动驾驶、手动驾驶、接管临界状态等)。Among them, the driver's information includes age, gender, driving style, driving experience, fatigue status, and distraction status; the vehicle's status includes position, heading angle, speed, acceleration, control status (automatic driving, manual driving, takeover critical state, etc.).

可选的，上述实施例可以用如下数学公式表达：Optionally, the above embodiment can be expressed by the following mathematical formula:

S₀+E(t)≥α(t) (4)S ₀ +E(t)≥α(t) (4)

E₀(t)＝α(t)-S₀ (5)E ₀ (t) = α (t) - S ₀ (5)

其中，公式(4)为决策响应条件，S₀表示驾驶人参数，也即是证据积累的初始值，与不同个体/群体驾驶人特性有关，也即是与驾驶人的信息D相关，f(x_i)表示驾驶人的信息中包括的第i项对应的信息，如f(x₁)表示年龄，f(x₂)表示性别，f(x₃)表示驾驶风格，f(x₄)表示驾驶经验，f(x₅)表示疲劳状态，f(x₆)表示分心状态，S₀假设服从正态分布N(α,σ)。α(t)表示t时刻的车辆参数，也即是t时刻的决策阈值，与车辆的状态V相关和时间压力t有关，G(V,t)表示目标函数，h(z_i)表示车辆的状态中包括的第i项对应的信息，如h(z₁)表示位置，h(z₂)表示航向角，h(z₃)表示速度，h(z₄)表示加速度，h(z₅)表示控制状态。E₀(t)表示t时刻达到决策阈值所需积累的证据积累阈值，由t时刻的决策阈值与初始值的差值决定。Among them, formula (4) is the decision response condition, _S0 represents the driver parameter, that is, the initial value of evidence accumulation, which is related to the characteristics of different individual/group drivers, that is, it is related to the driver's information D, f( _xi ) represents the information corresponding to the i-th item included in the driver's information, such as f( _x1 ) represents age, f( _x2 ) represents gender, f( _x3 ) represents driving style, f( _x4 ) represents driving experience, f( _x5 ) represents fatigue state, f( _x6 ) represents distraction state, and _S0 is assumed to obey the normal distribution N(α,σ). α(t) represents the vehicle parameters at time t, that is, the decision threshold at time t, which is related to the vehicle state V and the time pressure t. G(V,t) represents the objective function, and h(z _i ) represents the information corresponding to the i-th item included in the vehicle state, such as h(z ₁ ) represents the position, h(z ₂ ) represents the heading angle, h(z ₃ ) represents the speed, h(z ₄ ) represents the acceleration, and h(z ₅ ) represents the control state. E ₀ (t) represents the evidence accumulation threshold required to reach the decision threshold at time t, which is determined by the difference between the decision threshold at time t and the initial value.

上述公式(1)-公式(7)可以认为是构建的证据积累模型。如图2所示，提供了一种证据积累模型示意图，可以看出，证据积累模型中的初始值S₀与决策阈值(响应边界)α共同决定了做出决策反应所需要的证据积累阈值E₀(t)，初始值增大或响应边界减小均有利于用少量证据做出反应从而缩短反应时间。The above formulas (1) to (7) can be considered as the constructed evidence accumulation model. As shown in Figure 2, a schematic diagram of the evidence accumulation model is provided. It can be seen that the initial value S ₀ and the decision threshold (response boundary) α in the evidence accumulation model jointly determine the evidence accumulation threshold E ₀ (t) required for making a decision response. Increasing the initial value or reducing the response boundary is conducive to responding with a small amount of evidence, thereby shortening the response time.

初始值S₀与驾驶人的信息有关，驾驶经验丰富能够预测危险场景并提前做出反应、驾驶风格保守的驾驶人会尽快决策响应以保持更安全的距离，因此由驾驶人的信息中的六个变量可求出不同个体或不同群体的初始值，其中初始值越大的驾驶人对相同刺激的险态交通场景能够做出更快响应，正态分布代表群体内的不同个体分布形式及个体响应的随机不确定性。The initial value S ₀ is related to the driver's information. Drivers with rich driving experience can predict dangerous scenes and respond in advance, and drivers with conservative driving style will make decisions and respond as soon as possible to maintain a safer distance. Therefore, the initial values of different individuals or groups can be calculated from the six variables in the driver's information. Drivers with larger initial values can respond faster to dangerous traffic scenes with the same stimulus. The normal distribution represents the distribution forms of different individuals in the group and the random uncertainty of individual responses.

证据积累速率v对应险态交通场景中刺激信号的强弱程度，险态交通场景越紧急，刺激信号(如视觉变化刺激、距离、速度等)越强，证据积累速率越大，证据积累越快。另外，证据积累速率v也与道路信息相关，根据道理信息得到的支持该目标决策的第一证据信息量越大，反对该目标决策的第二证据信息量越小，则证据积累速率越大。The evidence accumulation rate v corresponds to the strength of the stimulus signal in the dangerous traffic scene. The more urgent the dangerous traffic scene is, the stronger the stimulus signal (such as visual change stimulus, distance, speed, etc.), the greater the evidence accumulation rate, and the faster the evidence accumulation. In addition, the evidence accumulation rate v is also related to road information. The greater the amount of first evidence information supporting the target decision obtained based on the reasoning information, and the smaller the amount of second evidence information opposing the target decision, the greater the evidence accumulation rate.

响应边界α与车辆的状态及时间压力有关，险态交通场景初期响应边界表现为恒定边界，驾驶人若在初期不采取主动响应会使险态交通场景恶化，面临更加紧急的情况，从而被迫做出响应决策，在证据积累模型中表现为响应边界随时间的减小(崩塌边界α′)，即触发决策所需的证据量随着决策时间的增加而变小，在宏观上表现为反应时间较长的不充分决策。车辆的状态如自车控制模式(自动驾驶、手动驾驶、接管临界状态等)或是否有刹车灯、车内提示音等也能改变响应边界，从而帮助驾驶人做出快速决策响应。E表示做出目标决策所积累的证据信息的累积量，即做出决策证据的累积后验概率。The response boundary α is related to the state of the vehicle and time pressure. The initial response boundary of the dangerous traffic scene is a constant boundary. If the driver does not take an active response in the early stage, the dangerous traffic scene will deteriorate and face a more urgent situation, thus being forced to make a response decision. In the evidence accumulation model, it is manifested as a decrease in the response boundary over time (collapse boundary α′), that is, the amount of evidence required to trigger the decision becomes smaller as the decision time increases. On a macro level, it is manifested as an insufficient decision with a long reaction time. The state of the vehicle, such as the self-control mode (automatic driving, manual driving, critical state of takeover, etc.) or whether there are brake lights, in-car warning sounds, etc., can also change the response boundary, thereby helping the driver to make a quick decision response. E represents the cumulative amount of evidence information accumulated to make the target decision, that is, the cumulative posterior probability of the decision evidence.

可选的，决策池中包括松油门、制动和转向等目标决策，对于每种目标决策，均可以根据上述方法得到每个目标决策对应的响应时长，例如，松油门对应的响应时长为0.70秒，制动对应的响应时长为0.92秒，转向对应的响应时长为1.13秒，则驾驶人在目标险态交通场景中的响应历程是，驾驶人在0.70秒时进行松油门操作，在0.90秒时进行制动操作，在1.13秒时进行转向操作。Optionally, the decision pool includes target decisions such as releasing the accelerator, braking and steering. For each target decision, the response time corresponding to each target decision can be obtained according to the above method. For example, the response time corresponding to releasing the accelerator is 0.70 seconds, the response time corresponding to braking is 0.92 seconds, and the response time corresponding to steering is 1.13 seconds. The driver's response process in the target dangerous traffic scenario is that the driver releases the accelerator at 0.70 seconds, brakes at 0.90 seconds, and turns at 1.13 seconds.

综上所述，如图3所示，提供了另一种响应预测方法，该响应预测方法包括如下步骤：In summary, as shown in FIG3 , another response prediction method is provided, and the response prediction method includes the following steps:

步骤301，对于目标险态交通场景，从目标险态交通场景的预设阶段开始，每隔预设时长，获取上一计算时刻计算得到目标证据信息的累积量，并根据目标险态交通场景中当前时刻的道路信息确定当前支持目标决策的第一证据信息量以及反对目标决策的第二证据信息量。Step 301, for the target dangerous traffic scenario, starting from the preset stage of the target dangerous traffic scenario, at every preset time interval, the cumulative amount of target evidence information calculated at the previous calculation moment is obtained, and the first amount of evidence information supporting the target decision and the second amount of evidence information opposing the target decision are determined based on the road information at the current moment in the target dangerous traffic scenario.

步骤302，基于维纳过程获取当前计算时刻的噪声量；根据上一计算时刻对应的累积量、第一证据信息量、第二证据信息量以及噪声量获取当前计算时刻的目标证据信息的累积量。Step 302, obtaining the noise amount at the current calculation moment based on the Wiener process; obtaining the cumulative amount of the target evidence information at the current calculation moment according to the cumulative amount corresponding to the previous calculation moment, the first evidence information amount, the second evidence information amount and the noise amount.

步骤303，确定当前计算时刻对应的累积量与驾驶人参数之和是否大于等于车辆参数；若大于等于车辆参数，则确定满足决策响应条件，其中，驾驶人参数与目标险态交通场景中的驾驶人的信息相关，车辆参数与目标险态交通场景中的车辆的状态相关，车辆参数根据目标险态交通场景中的车辆的状态以及目标函数计算得到，目标函数为函数值随着时间逐渐减小的函数。Step 303, determine whether the sum of the cumulative amount corresponding to the current calculation moment and the driver parameters is greater than or equal to the vehicle parameters; if it is greater than or equal to the vehicle parameters, determine that the decision response conditions are met, wherein the driver parameters are related to the information of the driver in the target dangerous traffic scene, and the vehicle parameters are related to the state of the vehicle in the target dangerous traffic scene. The vehicle parameters are calculated based on the state of the vehicle in the target dangerous traffic scene and the objective function, and the objective function is a function whose function value gradually decreases with time.

步骤304，若当前计算时刻对应的累积量与驾驶人参数之和满足决策响应条件，则根据当前计算时刻确定驾驶人做出目标决策所需的响应时长。Step 304: If the sum of the cumulative amount corresponding to the current calculation time and the driver parameter meets the decision response condition, the response time required for the driver to make the target decision is determined according to the current calculation time.

步骤305，目标决策为决策池中的决策，确定决策池中的每一目标决策对应的响应时长；根据每一目标决策对应的响应时长，对决策池包括的多个目标决策进行排序，确定驾驶人在目标险态交通场景中的响应历程。Step 305, the target decision is the decision in the decision pool, and the response time corresponding to each target decision in the decision pool is determined; according to the response time corresponding to each target decision, the multiple target decisions included in the decision pool are sorted to determine the driver's response process in the target dangerous traffic scenario.

本申请还具有如下优点：This application also has the following advantages:

(1)借助认知心理学、认知神经科学等领域的前沿研究现状，基于证据积累过程建立了一种证据积累模型，其是能够反映驾驶人感知决策机制的驾驶人主动响应预测模型。该模型可以综合考虑驾驶人因素(对应上述驾驶人的信息)、道路环境因素(对应上述道路信息)、车辆状态因素(对应上述车辆的状态)推断驾驶人的决策类型及反应时间，能够解释不同因素及参数改变对驾驶人感知决策的影响，提高了算法的可解释性和可靠性，实现驾驶人主动响应的白盒化预测。(1) With the help of cutting-edge research in cognitive psychology, cognitive neuroscience and other fields, an evidence accumulation model is established based on the evidence accumulation process. It is a driver's active response prediction model that can reflect the driver's perception decision-making mechanism. The model can comprehensively consider driver factors (corresponding to the above driver information), road environment factors (corresponding to the above road information), and vehicle state factors (corresponding to the above vehicle state) to infer the driver's decision type and reaction time. It can explain the impact of different factors and parameter changes on the driver's perception decision, improve the interpretability and reliability of the algorithm, and realize the white-box prediction of the driver's active response.

(2)该模型用适量数据即可求得相关模型参数，避免了基于数据驱动方法所需要的大量数据，这对于获取数据难度较大的险态交通场景来说更具有适用性。(2) The model can obtain relevant model parameters with a moderate amount of data, avoiding the large amount of data required by data-driven methods. This is more applicable to dangerous traffic scenarios where it is difficult to obtain data.

(3)该模型可以通过驾驶人因素相关表征构建表征不同群体、个体的参数化模型，从而体现驾驶人的个体差异性，同时能够推断预测驾驶人在险态交通场景中的响应历程(例如刹车时刻、转向时刻等)，从而为不同驾驶人提供个性化的驾驶辅助，提高人机交互过程中的可理解性。(3) The model can construct parameterized models that represent different groups and individuals through the representation of driver factors, thereby reflecting the individual differences of drivers. At the same time, it can infer and predict the driver's response process in dangerous traffic scenarios (such as braking time, turning time, etc.), thereby providing personalized driving assistance for different drivers and improving the comprehensibility of the human-computer interaction process.

另外，对于构建的证据积累模型进行模型验证，具体如下：为了该模型的准确性，提供了两种验证方法。第一种方法通过志愿者分别获取驾驶模拟器试验以及以真实交通视频图像为刺激的试验数据，获取险态工况下驾驶人主动响应及决策行为的数据信息。以模拟器试验、视频刺激试验为真值，比较真值与模型预测的反应时间误差以及决策类型推断的准确率来衡量所建立模型的准确性及有效性，此时应注意将数据训练集和验证集进行区分。使用真实交通视频图像作为刺激的原因是力争在实验室环境还原真实交通场景的刺激程度。第二种利用已发表的试验数据进行驾驶人决策类型推断和反应时间预测，将结果与对应试验数据所建立的已有模型进行对比，验证该发明模型的先进性。In addition, the constructed evidence accumulation model is verified as follows: In order to verify the accuracy of the model, two verification methods are provided. The first method uses volunteers to obtain driving simulator test data and test data using real traffic video images as stimuli, respectively, to obtain data information on the driver's active response and decision-making behavior under dangerous conditions. Using simulator tests and video stimulation tests as true values, the accuracy and effectiveness of the established model are measured by comparing the reaction time error between the true value and the model prediction and the accuracy of decision type inference. At this time, attention should be paid to distinguishing between the data training set and the validation set. The reason for using real traffic video images as stimuli is to strive to restore the degree of stimulation of real traffic scenes in a laboratory environment. The second method uses published test data to infer the driver's decision type and predict the reaction time, and compares the results with the existing model established by the corresponding test data to verify the advanced nature of the invented model.

上述证据积累模型的构建是基于贝叶斯框架，如图4所示，提供了一种贝叶斯框架原理图，叶斯框架的思想如下：The construction of the above-mentioned evidence accumulation model is based on the Bayesian framework. As shown in Figure 4, a Bayesian framework principle diagram is provided. The idea of the Bayesian framework is as follows:

贝叶斯框架是基于不确定数据做出决策的适当数学框架，贝叶斯决策指在此次决策之前的试验结果会影响此次决策过程，即具有先验知识与后验知识的综合影响。要做出一个决定必须进行顺序采样并逐步积累证据，直到证据足够令人信服为止。而逐步采样积累证据的过程，就是贝叶斯框架中不断利用后验更新结果的过程。The Bayesian framework is an appropriate mathematical framework for making decisions based on uncertain data. Bayesian decision-making means that the results of experiments before the decision will affect the decision-making process, that is, it has the combined influence of prior knowledge and posterior knowledge. To make a decision, you must sample sequentially and accumulate evidence step by step until the evidence is convincing enough. The process of gradually sampling and accumulating evidence is the process of continuously using posterior knowledge to update the results in the Bayesian framework.

如果有一个假设H，并且观察到一个事件E，根据贝叶斯定律可得：If there is a hypothesis H and an event E is observed, according to Bayes' theorem:

P(E)P(H|E)＝P(H)P(E|H) (8)P(E)P(H|E)＝P(H)P(E|H) (8)

重新整理可得观察到事件E后将估计概率从旧的值P(H)(先验)更新为新的值P(H|E)(后验)：Rearranging this way, we can see that after observing event E, the estimated probability is updated from the old value P(H) (prior) to the new value P(H|E) (posterior):

等号左边为后验概率(posteriorodds)，等号右边为先验概率(priorodds)与似然比(likelihoodratio)。似然比在重复采样时以乘法的方式不断出现，因此将上式转化为对数形式后：The left side of the equal sign is the posterior odds, and the right side of the equal sign is the prior odds and likelihood ratio. The likelihood ratio continues to appear in a multiplicative manner during repeated sampling, so after converting the above formula into logarithmic form:

log(posteriorodds)＝log(priorodds)+log(likelihoodratio) (10)log(posteriorodds)＝log(priorodds)+log(likelihoodratio) (10)

乘法的形式转变为加法，因此每次采样后支持某一选项证据的似然比不断增加，支持正确假设的对数概率越来越大，以代表每次获得的信息量的速率。因此通过贝叶斯框架可以看出，需要的是一个表示对数概率的决策信号S，该信号初值为S₀，表示对数先验概率；当次数出现时，S开始以速率v逐步上升，速率v由对数似然比决定，代表采样的信息为某一选项提供了支持证据；最终对数概率将达到做出决策的水平S_T。可以看出，由贝叶斯框架推导的决策过程与由反应时间响应过程的证据积累框架一致。The form of multiplication is transformed into addition, so the likelihood ratio of the evidence supporting a certain option increases after each sampling, and the logarithmic probability of supporting the correct hypothesis becomes larger and larger, representing the rate of information obtained each time. Therefore, through the Bayesian framework, it can be seen that what is needed is a decision signal S representing the logarithmic probability, the initial value of which is S ₀ , representing the logarithmic prior probability; when the number of times occurs, S begins to gradually increase at a rate v, which is determined by the logarithmic likelihood ratio, representing that the sampled information provides supporting evidence for a certain option; the final logarithmic probability will reach the level of decision making S _T. It can be seen that the decision process derived by the Bayesian framework is consistent with the evidence accumulation framework of the reaction time response process.

上述驾驶人因素D、道路环境因素R和车辆状态因素V的建模过程如下：The modeling process of the above driver factor D, road environment factor R and vehicle state factor V is as follows:

(1)数据的获取(1) Data acquisition

驾驶人的感知决策影响险态工况下交互过程，直接影响到是否能够避免碰撞以及碰撞时间、碰撞角度，进而对交通参与者在碰撞过程中的损伤风险造成影响。驾驶人因素中的部分信息通过(如年龄、性别、驾驶风格、驾驶经验)可通过驾驶人行为问卷或询问获取，但驾驶人的疲劳状态和分心状态以及道路环境因素、车辆状态因素与险态场景密切相关，需要通过险态场景历程信息及采集的生理信号具体分析。险态场景数据获取途径主要是通过志愿者试验的手段获得。通过试验前的驾驶人行为问卷调查、试验任务设计及状态评估记录志愿者相关信息用于感知决策影响的驾驶人因素分析；实时标记记录试验过程中道路环境参数数据，以及自车的车辆状态数据，能够从不同层面对驾驶人感知决策的影响因素进行量化分析。The driver's perception decision affects the interaction process under dangerous conditions, directly affecting whether a collision can be avoided, the collision time and collision angle, and thus affecting the risk of injury to traffic participants during the collision. Some information about driver factors (such as age, gender, driving style, and driving experience) can be obtained through driver behavior questionnaires or inquiries, but the driver's fatigue and distraction status, as well as road environment factors and vehicle status factors are closely related to dangerous scenarios, and need to be specifically analyzed through dangerous scenario process information and collected physiological signals. The main way to obtain dangerous scenario data is through volunteer experiments. Through the driver behavior questionnaire survey before the test, the test task design and state evaluation to record the relevant information of volunteers for the analysis of driver factors affecting perception decision-making; real-time marking and recording of road environment parameter data during the test, as well as the vehicle status data of the vehicle itself, the factors affecting the driver's perception decision-making can be quantitatively analyzed from different levels.

以上志愿者试验通常是在驾驶模拟器上进行，能够设计不同的险态场景及感知决策任务。虽然实车试验能够为被试提供最真实的驾驶体验，最符合被试的驾驶习惯。但是在真实道路中行驶无法人为设计危险交通场景，效率较低且不具备可重复性，考虑到安全性，无法开展严重的险态场景试验。相关险态场景数据也可以通过对自然驾驶数据集或事故分析数据集进行分析，结合事故调查报告，重现事故场景信息，完成驾驶人感知决策影响因素的统计。但由于信息记录的有限性以及对调查者隐私的保护，难以通过此方法获取所需的全部层面信息，无法为模型的搭建提供足够的数据支撑。The above volunteer tests are usually conducted on driving simulators, which can design different dangerous scenarios and perception decision-making tasks. Although real-car tests can provide the subjects with the most realistic driving experience and are most in line with the subjects' driving habits. However, it is impossible to artificially design dangerous traffic scenarios when driving on real roads, which is inefficient and not repeatable. Considering safety, serious dangerous scenario tests cannot be carried out. Relevant dangerous scenario data can also be obtained by analyzing natural driving data sets or accident analysis data sets, combining accident investigation reports, reproducing accident scene information, and completing the statistics of factors affecting driver perception decisions. However, due to the limited information records and the protection of the privacy of investigators, it is difficult to obtain all the required levels of information through this method, and it is impossible to provide sufficient data support for the construction of the model.

(2)构建模型(2) Model building

建模的前提是有足够的数据支撑，基本的理论方法为控制变量法研究。接下来以驾驶人因素为例进行介绍。驾驶人因素可以细分年龄x₁、性别x₂、驾驶风格x₃、驾驶经验x₄、疲劳状态x₅、分心状态x₆六个方面，其中年龄可以近似为连续变量，而性别、驾驶风格、驾驶经验、疲劳状态、分心状态为离散型变量。The premise of modeling is to have sufficient data support, and the basic theoretical method is the control variable method. Next, we will take the driver factor as an example. The driver factor can be subdivided into six aspects: age x ₁ , gender x ₂ , driving style x ₃ , driving experience x ₄ , fatigue state x ₅ , and distraction state x _6. Among them, age can be approximated as a continuous variable, while gender, driving style, driving experience, fatigue state, and distraction state are discrete variables.

对各变量的分级见表1。可根据研究情况及精细程度进行进一步分级，如将分心状态根据不同的次级任务分为不同等级。各因素分级可通过驾驶人行为问卷调查分数及状态评定具体确定，已有此方面研究基础。运用控制变量法，分别研究分析各个参数对驾驶人因素的具体影响。具体可以分为两个步骤，首先确定各因素对驾驶人反应时间正负影响并确定函数趋势线，进而确定函数系数值。The classification of each variable is shown in Table 1. Further classification can be carried out according to the research situation and degree of sophistication, such as dividing the distraction state into different levels according to different secondary tasks. The classification of each factor can be specifically determined by the driver behavior questionnaire score and status assessment, and there is already a research basis in this regard. Using the control variable method, the specific impact of each parameter on the driver factor is studied and analyzed separately. It can be divided into two steps. First, determine the positive and negative impact of each factor on the driver's reaction time and determine the function trend line, and then determine the function coefficient value.

表1Table 1

变量名称Variable Name 变量符号Variable Symbols 变量分级Variable classification 性别gender x₂ x ₂ 女：0，男1Female: 0, Male: 1 驾驶风格Driving style x₃ x ₃ 保守：0，正常：1，激进2Conservative: 0, Normal: 1, Radical 2 驾驶经验Driving Experience x₄ x ₄ 不丰富：0，丰富：1Not rich: 0, Rich: 1 疲劳状态Fatigue status x₅ x ₅ 清醒：0，疲劳：1Awake: 0, Fatigue: 1 分心状态Distracted state x₆ x ₆ 精力集中：0，分心：1Concentration: 0, Distraction: 1

首先研究年龄x₁对驾驶人因素D的影响，数据统计的结果和事故数据调查报告显示，事故数据量随着驾驶人年龄呈现先增加后减少的趋势，故可以近似认为D与x₁之间近似呈现二次函数的关系。利用已知数据集的结果，求出二次函数的参数a，b，c，得到D与x₁的明确关系式。First, the effect of age x ₁ on driver factor D is studied. The statistical results and accident data investigation report show that the amount of accident data increases first and then decreases with the driver's age. Therefore, it can be approximately considered that there is a quadratic function relationship between D and x _1. Using the results of the known data set, the parameters a, b, and c of the quadratic function are calculated, and a clear relationship between D and x ₁ is obtained.

以疲劳状态x₅为例说明离散变量对驾驶人因素D的影响。已有研究和数据统计结果显示，清醒状态的驾驶人能够更快感知交通场景中的危险并做出决策判断。离散变量对该因素的影响程度用系数来表示，系数大小表示对因素的影响程度，系数正负表示对因素的影响效果(延长反应时间或缩短反应时间),截距代表变量对因素的整体影响漂移。利用已知数据集的结果，求出变量的影响因数d，f，得到D与x₅的明确关系式。Take the fatigue state _x5 as an example to illustrate the influence of discrete variables on driver factor D. Existing research and statistical results show that drivers in a sober state can perceive dangers in traffic scenes more quickly and make decisions. The influence of discrete variables on the factor is expressed by coefficients. The size of the coefficient indicates the degree of influence on the factor. The positive or negative coefficient indicates the effect of the factor (extending or shortening the reaction time). The intercept represents the overall influence drift of the variable on the factor. Using the results of the known data set, the influence factors d and f of the variables are calculated, and a clear relationship between D and _x5 is obtained.

D＝dx₅+fD＝dx ₅ +f

同理，利用类似的方法可以求出驾驶人因素D与性别x₂、驾驶风格x₃、驾驶经验x₄、疲劳状态x₅、分心状态x₆的函数关系式，最终可以对D与年龄、性别、驾驶风格、驾驶经验、疲劳状态、分心状态的函数关系式量化如下：Similarly, a similar method can be used to find the functional relationship between the driver factor D and gender x ₂ , driving style x ₃ , driving experience x ₄ , fatigue state x ₅ , and distraction state x ₆ . Finally, the functional relationship between D and age, gender, driving style, driving experience, fatigue state, and distraction state can be quantified as follows:

通过以上过程和步骤可以推导出道路环境因素R和车辆状态因素V与其相关变量之间的量化影响关系，从而最终完成驾驶人感知决策影响因素的建模，分别从三个层面表征其影响因素及效果：Through the above process and steps, the quantitative influence relationship between the road environment factor R and the vehicle state factor V and their related variables can be derived, so as to finally complete the modeling of the factors affecting the driver's perception decision, and characterize its influencing factors and effects from three levels:

(3)模型验证(3) Model verification

所建立的理论模型有待于实验验证，提出了两种验证所建立的理论模型的方法。第一种方法通过开展志愿者行为试验收集的数据集进行验证，例如为研究驾驶人年龄或驾驶风格对驾驶人感知决策特性的影响，可以有针对性地设计典型险态交通场景，筛选有代表性的被试开展险态场景驾驶人危险感知试验，通过试验记录的结果进行模型效果验证及模型参数修订。为了验证模型在真实交通场景的应用效果，可以将修订的模型应用到自然驾驶数据集中进行有效性验证，此时可以根据获取的少量自然驾驶数据类型有针对性地验证评估模型。需要注意的是，无论是志愿者行为试验数据集还是自然驾驶数据集，模型的训练集和验证集需要进行有效划分，根据在验证集上验证的模型准确率来衡量所建立的驾驶人行为机制模型的准确度。The established theoretical model needs to be verified experimentally, and two methods for verifying the established theoretical model are proposed. The first method is to verify the data set collected by conducting volunteer behavior experiments. For example, to study the impact of driver age or driving style on driver perception and decision-making characteristics, typical dangerous traffic scenarios can be designed in a targeted manner, and representative subjects can be selected to conduct dangerous perception experiments of drivers in dangerous scenarios. The model effect can be verified and the model parameters can be revised based on the results of the test records. In order to verify the application effect of the model in real traffic scenarios, the revised model can be applied to the natural driving data set for effectiveness verification. At this time, the evaluation model can be verified in a targeted manner based on a small amount of natural driving data type obtained. It should be noted that whether it is a volunteer behavior test data set or a natural driving data set, the training set and validation set of the model need to be effectively divided, and the accuracy of the established driver behavior mechanism model is measured according to the model accuracy verified on the validation set.

应该理解的是，虽然如上所述的各实施例所涉及的流程图中的各个步骤按照箭头的指示依次显示，但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明，这些步骤的执行并没有严格的顺序限制，这些步骤可以以其它的顺序执行。而且，如上所述的各实施例所涉及的流程图中的至少一部分步骤可以包括多个步骤或者多个阶段，这些步骤或者阶段并不必然是在同一时刻执行完成，而是可以在不同的时刻执行，这些步骤或者阶段的执行顺序也不必然是依次进行，而是可以与其它步骤或者其它步骤中的步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that, although the various steps in the flowcharts involved in the above-mentioned embodiments are displayed in sequence according to the indication of the arrows, these steps are not necessarily executed in sequence according to the order indicated by the arrows. Unless there is a clear explanation in this article, the execution of these steps does not have a strict order restriction, and these steps can be executed in other orders. Moreover, at least a part of the steps in the flowcharts involved in the above-mentioned embodiments can include multiple steps or multiple stages, and these steps or stages are not necessarily executed at the same time, but can be executed at different times, and the execution order of these steps or stages is not necessarily carried out in sequence, but can be executed in turn or alternately with other steps or at least a part of the steps or stages in other steps.

基于同样的发明构思，本申请实施例还提供了一种用于实现上述所涉及的响应预测方法的响应预测装置。该装置所提供的解决问题的实现方案与上述方法中所记载的实现方案相似，故下面所提供的一个或多个响应预测装置实施例中的具体限定可以参见上文中对于响应预测方法的限定，在此不再赘述。Based on the same inventive concept, the embodiment of the present application also provides a response prediction device for implementing the above-mentioned response prediction method. The implementation scheme for solving the problem provided by the device is similar to the implementation scheme recorded in the above-mentioned method, so the specific limitations in one or more response prediction device embodiments provided below can refer to the limitations of the response prediction method above, and will not be repeated here.

在一个实施例中，如图5所示，提供了一种响应预测装置，响应预测装置500包括：计算模块501、第一确定模块502和第二确定模块503，其中：In one embodiment, as shown in FIG5 , a response prediction device is provided. The response prediction device 500 includes: a calculation module 501, a first determination module 502, and a second determination module 503, wherein:

计算模块501，对于目标险态交通场景，从目标险态交通场景的预设阶段开始，每隔预设时长计算目标证据信息的累积量，目标证据信息为做出目标决策所依据的证据信息；The calculation module 501 calculates the cumulative amount of target evidence information for the target dangerous traffic scene at a preset time interval starting from the preset stage of the target dangerous traffic scene, where the target evidence information is evidence information based on which the target decision is made;

第一确定模块502，在每次计算得到累积量后，根据当前计算时刻对应的累积量，确定是否满足决策响应条件；A first determination module 502 determines whether a decision response condition is satisfied according to the accumulation amount corresponding to the current calculation moment after each calculation.

第二确定模块503，若满足决策响应条件，则根据当前计算时刻确定驾驶人做出目标决策所需的响应时长。The second determination module 503 determines the response time required for the driver to make the target decision according to the current calculation time if the decision response condition is met.

在其中一个实施例中，计算模块501，具体用于每隔预设时长，获取上一计算时刻计算得到目标证据信息的累积量，并根据目标险态交通场景中当前时刻的道路信息确定当前支持目标决策的第一证据信息量以及反对目标决策的第二证据信息量，根据上一计算时刻对应的累积量、第一证据信息量以及第二证据信息量获取当前计算时刻的目标证据信息的累积量。In one embodiment, the calculation module 501 is specifically used to obtain the cumulative amount of target evidence information calculated at the previous calculation moment every preset time period, and determine the first amount of evidence information that currently supports the target decision and the second amount of evidence information that opposes the target decision based on the road information at the current moment in the target dangerous traffic scene, and obtain the cumulative amount of the target evidence information at the current calculation moment based on the cumulative amount corresponding to the previous calculation moment, the first amount of evidence information, and the second amount of evidence information.

在其中一个实施例中，计算模块501，具体用于基于维纳过程获取当前计算时刻的噪声量；根据上一计算时刻对应的累积量、第一证据信息量、第二证据信息量以及噪声量获取当前计算时刻的目标证据信息的累积量。In one embodiment, the calculation module 501 is specifically used to obtain the noise amount at the current calculation moment based on the Wiener process; and obtain the cumulative amount of the target evidence information at the current calculation moment according to the cumulative amount corresponding to the previous calculation moment, the first evidence information amount, the second evidence information amount and the noise amount.

在其中一个实施例中，第一确定模块502，具体用于根据当前计算时刻对应的累积量、驾驶人参数以及车辆参数，确定是否满足决策响应条件；其中，驾驶人参数与目标险态交通场景中的驾驶人的信息相关，车辆参数与目标险态交通场景中的车辆的状态相关。In one embodiment, the first determination module 502 is specifically used to determine whether the decision response conditions are met based on the cumulative amount, driver parameters and vehicle parameters corresponding to the current calculation moment; wherein the driver parameters are related to the information of the driver in the target dangerous traffic scene, and the vehicle parameters are related to the state of the vehicle in the target dangerous traffic scene.

在其中一个实施例中，第一确定模块502，具体用于确定当前计算时刻对应的累积量与驾驶人参数之和是否大于等于车辆参数，若大于等于车辆参数，则确定满足决策响应条件。In one embodiment, the first determination module 502 is specifically used to determine whether the sum of the cumulative amount corresponding to the current calculation moment and the driver parameter is greater than or equal to the vehicle parameter. If it is greater than or equal to the vehicle parameter, it is determined that the decision response condition is met.

上述响应预测装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中，也可以以软件形式存储于计算机设备中的存储器中，以便于处理器调用执行以上各个模块对应的操作。Each module in the above-mentioned response prediction device can be implemented in whole or in part by software, hardware and a combination thereof. Each of the above-mentioned modules can be embedded in or independent of a processor in a computer device in the form of hardware, or can be stored in a memory in a computer device in the form of software, so that the processor can call and execute the operations corresponding to each of the above modules.

在一个实施例中，提供了一种计算机设备，该计算机设备可以是终端，其内部结构图可以如图6所示。该计算机设备包括处理器、存储器、输入/输出接口、通信接口、显示单元和输入装置。其中，处理器、存储器和输入/输出接口通过系统总线连接，通信接口、显示单元和输入装置通过输入/输出接口连接到系统总线。其中，该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质和内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的输入/输出接口用于处理器与外部设备之间交换信息。该计算机设备的通信接口用于与外部的终端进行有线或无线方式的通信，无线方式可通过WIFI、移动蜂窝网络、NFC(近场通信)或其他技术实现。该计算机程序被处理器执行时以实现一种响应预测方法。该计算机设备的显示单元用于形成视觉可见的画面，可以是显示屏、投影装置或虚拟现实成像装置。显示屏可以是液晶显示屏或者电子墨水显示屏，该计算机设备的输入装置可以是显示屏上覆盖的触摸层，也可以是计算机设备外壳上设置的按键、轨迹球或触控板，还可以是外接的键盘、触控板或鼠标等。In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be shown in FIG6. The computer device includes a processor, a memory, an input/output interface, a communication interface, a display unit, and an input device. The processor, the memory, and the input/output interface are connected via a system bus, and the communication interface, the display unit, and the input device are connected to the system bus via the input/output interface. The processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of the operating system and the computer program in the non-volatile storage medium. The input/output interface of the computer device is used to exchange information between the processor and an external device. The communication interface of the computer device is used to communicate with an external terminal in a wired or wireless manner, and the wireless manner may be implemented through WIFI, a mobile cellular network, NFC (near field communication) or other technologies. When the computer program is executed by the processor, a response prediction method is implemented. The display unit of the computer device is used to form a visually visible picture, which may be a display screen, a projection device, or a virtual reality imaging device. The display screen can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer device can be a touch layer covering the display screen, or a button, trackball or touchpad set on the computer device shell, or an external keyboard, touchpad or mouse.

本领域技术人员可以理解，图6中示出的结构，仅仅是与本申请方案相关的部分结构的框图，并不构成对本申请方案所应用于其上的计算机设备的限定，具体的计算机设备可以包括比图中所示更多或更少的部件，或者组合某些部件，或者具有不同的部件布置。Those skilled in the art will understand that the structure shown in FIG. 6 is merely a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. The specific computer device may include more or fewer components than shown in the figure, or combine certain components, or have a different arrangement of components.

在一个实施例中，提供了一种计算机设备，包括存储器和处理器，存储器中存储有计算机程序，该处理器执行计算机程序时实现上述任一方法实施例所述的步骤。In one embodiment, a computer device is provided, including a memory and a processor, wherein a computer program is stored in the memory, and the processor implements the steps described in any of the above method embodiments when executing the computer program.

在一个实施例中，提供了一种计算机可读存储介质，其上存储有计算机程序，计算机程序被处理器执行时实现上述任一方法实施例所述的步骤。In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored. When the computer program is executed by a processor, the steps described in any of the above method embodiments are implemented.

在一个实施例中，提供了一种计算机程序产品，包括计算机程序，该计算机程序被处理器执行时实现上述任一方法实施例所述的步骤。In one embodiment, a computer program product is provided, including a computer program, which implements the steps described in any of the above method embodiments when executed by a processor.

需要说明的是，本申请所涉及的用户信息(包括但不限于用户设备信息、用户个人信息等)和数据(包括但不限于用于分析的数据、存储的数据、展示的数据等)，均为经用户授权或者经过各方充分授权的信息和数据，且相关数据的收集、使用和处理需要遵守相关国家和地区的相关法律法规和标准。It should be noted that the user information (including but not limited to user device information, user personal information, etc.) and data (including but not limited to data used for analysis, stored data, displayed data, etc.) involved in this application are all information and data authorized by the user or fully authorized by all parties, and the collection, use and processing of relevant data must comply with relevant laws, regulations and standards of relevant countries and regions.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程，是可以通过计算机程序来指令相关的硬件来完成，所述的计算机程序可存储于一非易失性计算机可读取存储介质中，该计算机程序在执行时，可包括如上述各方法的实施例的流程。其中，本申请所提供的各实施例中所使用的对存储器、数据库或其它介质的任何引用，均可包括非易失性和易失性存储器中的至少一种。非易失性存储器可包括只读存储器(Read-OnlyMemory，ROM)、磁带、软盘、闪存、光存储器、高密度嵌入式非易失性存储器、阻变存储器(ReRAM)、磁变存储器(Magnetoresistive Random Access Memory，MRAM)、铁电存储器(Ferroelectric Random Access Memory，FRAM)、相变存储器(Phase Change Memory，PCM)、石墨烯存储器等。易失性存储器可包括随机存取存储器(Random Access Memory，RAM)或外部高速缓冲存储器等。作为说明而非局限，RAM可以是多种形式，比如静态随机存取存储器(Static Random Access Memory，SRAM)或动态随机存取存储器(Dynamic RandomAccess Memory，DRAM)等。本申请所提供的各实施例中所涉及的数据库可包括关系型数据库和非关系型数据库中至少一种。非关系型数据库可包括基于区块链的分布式数据库等，不限于此。本申请所提供的各实施例中所涉及的处理器可为通用处理器、中央处理器、图形处理器、数字信号处理器、可编程逻辑器、基于量子计算的数据处理逻辑器等，不限于此。Those of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be completed by instructing the relevant hardware through a computer program, and the computer program can be stored in a non-volatile computer-readable storage medium. When the computer program is executed, it can include the processes of the embodiments of the above-mentioned methods. Among them, any reference to the memory, database or other medium used in the embodiments provided in the present application can include at least one of non-volatile and volatile memory. Non-volatile memory can include read-only memory (ROM), magnetic tape, floppy disk, flash memory, optical memory, high-density embedded non-volatile memory, resistive random access memory (ReRAM), magnetoresistive random access memory (MRAM), ferroelectric random access memory (FRAM), phase change memory (PCM), graphene memory, etc. Volatile memory can include random access memory (RAM) or external cache memory, etc. As an illustration and not limitation, RAM can be in various forms, such as static random access memory (SRAM) or dynamic random access memory (DRAM). The database involved in each embodiment provided in this application may include at least one of a relational database and a non-relational database. Non-relational databases may include distributed databases based on blockchains, etc., but are not limited to this. The processor involved in each embodiment provided in this application may be a general-purpose processor, a central processing unit, a graphics processor, a digital signal processor, a programmable logic device, a data processing logic device based on quantum computing, etc., but are not limited to this.

以上实施例的各技术特征可以进行任意的组合，为使描述简洁，未对上述实施例中的各个技术特征所有可能的组合都进行描述，然而，只要这些技术特征的组合不存在矛盾，都应当认为是本说明书记载的范围。The technical features of the above embodiments may be arbitrarily combined. To make the description concise, not all possible combinations of the technical features in the above embodiments are described. However, as long as there is no contradiction in the combination of these technical features, they should be considered to be within the scope of this specification.

以上所述实施例仅表达了本申请的几种实施方式，其描述较为具体和详细，但并不能因此而理解为对本申请专利范围的限制。应当指出的是，对于本领域的普通技术人员来说，在不脱离本申请构思的前提下，还可以做出若干变形和改进，这些都属于本申请的保护范围。因此，本申请的保护范围应以所附权利要求为准。The above-described embodiments only express several implementation methods of the present application, and the descriptions thereof are relatively specific and detailed, but they cannot be understood as limiting the scope of the present application. It should be pointed out that, for a person of ordinary skill in the art, several variations and improvements can be made without departing from the concept of the present application, and these all belong to the protection scope of the present application. Therefore, the protection scope of the present application shall be subject to the attached claims.

Claims

1. A method of response prediction, the method comprising:

for a target dangerous state traffic scene, starting from a preset stage of the target dangerous state traffic scene, calculating the accumulation amount of target evidence information at intervals of preset time, wherein the target evidence information is evidence information on which a target decision is made;

After the accumulated quantity is obtained through each calculation, determining whether a decision response condition is met or not according to the accumulated quantity corresponding to the current calculation moment;

if the decision response condition is met, determining the response time required by the driver to make the target decision according to the current calculation time;

The calculating the accumulated amount of the target evidence information every preset time length comprises the following steps: and acquiring the accumulated quantity of the target evidence information obtained by calculation at the last calculation time every preset time, determining the first evidence information quantity supporting the target decision currently and the second evidence information quantity opposite to the target decision currently according to the road information at the current time in the target dangerous state traffic scene, and acquiring the accumulated quantity of the target evidence information at the current calculation time according to the accumulated quantity corresponding to the last calculation time, the first evidence information quantity and the second evidence information quantity.

2. The method according to claim 1, wherein the obtaining the accumulated amount of the target evidence information at the current calculation time from the accumulated amount corresponding to the previous calculation time, the first evidence information amount, and the second evidence information amount includes:

Acquiring the noise amount at the current calculation time based on the wiener process;

And acquiring the accumulated quantity of the target evidence information at the current calculation time according to the accumulated quantity corresponding to the last calculation time, the first evidence information quantity, the second evidence information quantity and the noise quantity.

3. The method according to claim 1, wherein determining whether the decision response condition is satisfied according to the accumulated amount corresponding to the current calculation time comprises:

Determining whether the decision response condition is met according to the accumulated quantity, the driver parameter and the vehicle parameter corresponding to the current calculation moment;

Wherein the driver parameter is related to information of a driver in the target dangerous state traffic scene, and the vehicle parameter is related to a state of a vehicle in the target dangerous state traffic scene.

4. A method according to claim 3, wherein said determining whether the decision response condition is satisfied based on the accumulated amount corresponding to the current calculation time, the driver parameter, and the vehicle parameter comprises:

determining whether the sum of the accumulated amount corresponding to the current calculation time and the driver parameter is greater than or equal to the vehicle parameter;

And if the vehicle parameter is greater than or equal to the vehicle parameter, determining that the decision response condition is met.

5. A method according to claim 3, wherein the vehicle parameters are calculated from the state of the vehicle in the target dangerous state traffic scenario and an objective function, the objective function being a function of which the function value gradually decreases with time.

6. The method according to any one of claims 1 to 5, wherein the target decision is a decision in a decision pool, the method further comprising:

determining the response time length corresponding to each target decision in the decision pool;

And sequencing a plurality of target decisions included in the decision pool according to the response time length corresponding to each target decision, and determining the response process of the driver in the target dangerous state traffic scene.

7. A response predicting apparatus, the apparatus comprising:

the calculation module is used for calculating the accumulation of target evidence information for the target dangerous state traffic scene from the preset stage of the target dangerous state traffic scene at intervals of preset time, wherein the target evidence information is evidence information on which a target decision is made;

The first determining module is used for determining whether the decision response condition is met or not according to the accumulated quantity corresponding to the current calculation moment after the accumulated quantity is obtained through each calculation;

The second determining module is used for determining the response time required by the driver to make the target decision according to the current calculation moment if the decision response condition is met;

The calculation module is specifically configured to obtain, at intervals of the preset duration, an accumulated amount of the target evidence information obtained by calculation at a previous calculation time, determine, according to road information at a current time in the target dangerous state traffic scene, a first evidence information amount supporting the target decision currently and a second evidence information amount opposing the target decision, and obtain, according to the accumulated amount corresponding to the previous calculation time, the first evidence information amount and the second evidence information amount, an accumulated amount of the target evidence information at the current calculation time.

8. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor implements the steps of the method of any of claims 1 to 6 when the computer program is executed.

9. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 6.

10. A computer program product comprising a computer program, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 6.