[go: up one dir, main page]

CN104821082B - Short-time traffic flow prediction method based on integrated evaluation - Google Patents

Short-time traffic flow prediction method based on integrated evaluation Download PDF

Info

Publication number
CN104821082B
CN104821082B CN201510212889.4A CN201510212889A CN104821082B CN 104821082 B CN104821082 B CN 104821082B CN 201510212889 A CN201510212889 A CN 201510212889A CN 104821082 B CN104821082 B CN 104821082B
Authority
CN
China
Prior art keywords
traffic flow
data
dttm
evaluation index
historical
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201510212889.4A
Other languages
Chinese (zh)
Other versions
CN104821082A (en
Inventor
冷甦鹏
张泉峰
段景山
张可
刘浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201510212889.4A priority Critical patent/CN104821082B/en
Publication of CN104821082A publication Critical patent/CN104821082A/en
Application granted granted Critical
Publication of CN104821082B publication Critical patent/CN104821082B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data

Landscapes

  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Traffic Control Systems (AREA)

Abstract

本发明公开一种基于综合评价的短时交通流预测方法,通过研究分析结合实际的交通流预测场景,获得待定评价指标;采集待定评价指标的历史数据,得到历史数据库;通过数据预处理,构建基于历史数据库的历史数据矩阵;通过计算各待定评价指标与交通流的相关系数,筛选出主要评价指标,并计算主要评价指标各自的贡献权值;结合实时的交通流数据,通过改进的时间序列相似性度量方法对历史数据库中的交通流历史数据进行分析,得到在每个主要评价指标下的交通流量预测值;根据得到的在每个主要评价指标下交通流量预测值,采用线性加权综合评价方法预测交通流量,本发明的方法综合多维时间序列,得到更加精准的数值预测结果。

The invention discloses a short-term traffic flow prediction method based on comprehensive evaluation. Through research and analysis combined with actual traffic flow prediction scenarios, undetermined evaluation indicators are obtained; historical data of undetermined evaluation indicators is collected to obtain a historical database; through data preprocessing, construction Based on the historical data matrix of the historical database; by calculating the correlation coefficient between the undetermined evaluation indicators and traffic flow, the main evaluation indicators are screened out, and the respective contribution weights of the main evaluation indicators are calculated; combined with real-time traffic flow data, through the improved time series The similarity measurement method analyzes the historical data of traffic flow in the historical database to obtain the predicted value of traffic flow under each main evaluation index; according to the obtained predicted value of traffic flow under each main evaluation index, a linear weighted comprehensive evaluation is used The method predicts traffic flow, and the method of the present invention synthesizes multidimensional time series to obtain more accurate numerical prediction results.

Description

一种基于综合评价的短时交通流预测方法A Short-term Traffic Flow Forecasting Method Based on Comprehensive Evaluation

技术领域technical field

本发明涉及一种交通流预测方法,尤其涉及一种基于综合评价的短时交通客流量预测方法。The invention relates to a traffic flow prediction method, in particular to a short-term traffic passenger flow prediction method based on comprehensive evaluation.

背景技术Background technique

近年来,随着经济不断发展以及各类机动车量爆炸式增长,导致城市的交通问题日益严重。在越来越大的交通压力下,智能交通的概念应运而生。智能交通是未来交通系统的发展方向,通过将先进的信息技术、数据通讯传输技术、电子传感技术、控制技术及计算机技术等有效地集成运用于整个地面交通管理系统而建立的综合交通运输管理系统。通过数据挖掘技术对历史交通流量数据进行研究分析,得出相关的规律。根据所得规律应用于当前的场景,并对未来短期交通流量进行预测是动态交通诱导的前提和基础,有了精度较高的实时交通流量信息,才能进一步运用现代通讯技术、计算机技术等为出行者提供最佳的行驶路线,达到网路畅通、高效运行的目的。In recent years, with the continuous development of the economy and the explosive growth of various types of motor vehicles, the traffic problems in cities have become increasingly serious. Under the increasing traffic pressure, the concept of intelligent transportation came into being. Intelligent transportation is the development direction of the future transportation system. It is a comprehensive transportation management system established by effectively integrating advanced information technology, data communication transmission technology, electronic sensing technology, control technology and computer technology into the entire ground traffic management system. system. Through the research and analysis of historical traffic flow data by data mining technology, relevant rules can be obtained. Applying the obtained rules to the current scene and predicting the short-term traffic flow in the future is the premise and basis of dynamic traffic guidance. Only with high-precision real-time traffic flow information can we further use modern communication technology, computer technology, etc. to serve travelers Provide the best driving route to achieve the goal of smooth and efficient network operation.

交通流可以认为是一组值随着时间推移而不断变化的时间序列。而目前对于时间序列的研究大部分都是局限于一维的时间序列。而在复杂的实际应用场景中研究目标往往受到多重因素的影响,因此若只考虑一维的情况则很有可能会产生较大的偏差。因此必须引入多维时间序列。多维时间序列预测是指通过对按时间顺序取得的多种属性的一系列观测值进行综合分析后得出的对于分析目标的预测结果。目前对多维时间序列的研究方向主要包括分类、聚类、相关规律性、探索式数据分析等基于相似性的模式挖掘研究。通常所采用的时间序列分析模型包括小波分析、神经网络、混沌理论、支持向量机方法的非线性预测模型等。而且对于多维时间序列预测也是侧重于对已有大量数据的模式挖掘基础上的得出的比较宏观的预测分析结果与规律总结,而没有精准的数值预测结果。无法适用于对数值预测精度有较高要求的场景。组合预测模型对数据要求严苛且实现复杂,难以在实际应用场景中实现。Traffic flow can be thought of as a time series of changing values over time. Most of the current research on time series is limited to one-dimensional time series. However, in complex practical application scenarios, research objectives are often affected by multiple factors, so if only one-dimensional situations are considered, it is likely to produce large deviations. Therefore, multidimensional time series must be introduced. Multidimensional time series forecasting refers to the forecasting results of the analysis target obtained after comprehensive analysis of a series of observations of various attributes obtained in time order. At present, the research direction of multidimensional time series mainly includes classification, clustering, correlation regularity, exploratory data analysis and other similarity-based pattern mining research. The commonly used time series analysis models include wavelet analysis, neural network, chaos theory, nonlinear prediction model of support vector machine method, etc. Moreover, the multi-dimensional time series prediction also focuses on the relatively macroscopic prediction analysis results and law summary based on the pattern mining of a large amount of data, but there is no accurate numerical prediction result. It cannot be applied to scenarios that have high requirements for numerical prediction accuracy. The combined forecasting model has strict data requirements and complex implementation, which is difficult to implement in practical application scenarios.

发明内容Contents of the invention

本发明为解决的上述技术问题,提出一种基于综合评价的短时交通流预测方法。In order to solve the above technical problems, the present invention proposes a short-term traffic flow prediction method based on comprehensive evaluation.

本发明采用的技术方案是:一种基于综合评价的短时交通流预测方法,包括以下步骤:The technical solution adopted in the present invention is: a short-term traffic flow prediction method based on comprehensive evaluation, comprising the following steps:

S1、通过研究分析结合实际的交通流预测场景,获得待定评价指标;S1. Obtain undetermined evaluation indicators through research and analysis combined with actual traffic flow prediction scenarios;

S2、采集步骤S1获得的待定评价指标的历史数据,得到历史数据库;S2. Collect the historical data of the undetermined evaluation indicators obtained in step S1 to obtain a historical database;

S3、通过数据预处理,构建基于历史数据库的历史数据矩阵;S3. Through data preprocessing, construct a historical data matrix based on the historical database;

S4、计算各待定评价指标与交通流的相关系数,将相关系数换算为百分比,然后将各相关系数按从大到小的顺序进行排序,并将相关系数按从大到小顺序依次累加,直至累加结果大于或等于第一阈值,则停止累加运算,从而剔除未被累加的相关系数较小的待定评价指标,从而得到主要评价指标,并计算主要评价指标各自的贡献权值;S4. Calculate the correlation coefficient between each undetermined evaluation index and traffic flow, convert the correlation coefficient into a percentage, then sort the correlation coefficients in descending order, and accumulate the correlation coefficients in descending order until If the accumulation result is greater than or equal to the first threshold, the accumulation operation is stopped, thereby eliminating unaccumulated undetermined evaluation indicators with small correlation coefficients, thereby obtaining main evaluation indicators, and calculating the respective contribution weights of the main evaluation indicators;

S5、结合实时的交通流数据,通过改进的欧式距离对历史数据库中的交通流历史数据进行分析,得到在每个主要评价指标下的交通流量预测值;根据得到的在每个主要评价指标下交通流量预测值,采用线性加权综合评价方法预测交通流量。S5. Combined with real-time traffic flow data, the traffic flow historical data in the historical database is analyzed through the improved Euclidean distance, and the traffic flow prediction value under each main evaluation index is obtained; according to the obtained traffic flow under each main evaluation index For the traffic flow forecast value, the linear weighted comprehensive evaluation method is used to predict the traffic flow.

进一步地,所述步骤S2创建的历史数据库为:DB_TABLE=[DTTM,F,r1,r2,…,rp];Further, the historical database created in step S2 is: DB_TABLE=[DTTM,F,r 1 ,r 2 ,...,r p ];

其中,DTTM表示记录采样时间点,F表示对应采样时间点的交通流历史数据。Among them, DTTM represents the record sampling time point, and F represents the historical traffic flow data corresponding to the sampling time point.

进一步地,所述步骤S3具体包括:S31:数据预处理;S32:构建历史数据矩阵;Further, the step S3 specifically includes: S31: data preprocessing; S32: constructing a historical data matrix;

所述步骤S31数据预处理具体包括以下分步骤:The step S31 data preprocessing specifically includes the following sub-steps:

S311:符号信息数值化,将符号信息,转化为数值化信息;S311: Digitizing the symbolic information, transforming the symbolic information into numerical information;

S312:数值缺失与数值错误预处理,对于不连续的交通流数值序列,采用平均插值法对数据补齐;而对于多个时段统计和的数据采用按时段均分的方法补齐各时段的数据;对于明显错误的数据则将该数据点删除,同时采用平均插值法补齐;S312: Numerical missing and numerical error preprocessing, for the discontinuous traffic flow numerical sequence, use the average interpolation method to complete the data; and for the statistical sum data of multiple time periods, use the method of equal division by time period to complete the data of each time period ; For obviously wrong data, the data point is deleted, and the average interpolation method is used to fill it up;

S313:数据预处理,采用数据分段处理对各待定评价指标的取值集合进行数据处理;S313: Data preprocessing, using data segment processing to perform data processing on the value sets of each undetermined evaluation index;

所述步骤S32:构建历史数据矩阵具体为:基于步骤S2的历史数据库构建历史数据矩阵。The step S32: constructing a historical data matrix is specifically: constructing a historical data matrix based on the historical database in step S2.

进一步地,所述步骤S4指标筛选与权值计算具体包括以下分步骤:Further, the step S4 index screening and weight calculation specifically includes the following sub-steps:

S41:数据标准化操作,并根据标准化后的数据得到标准化的历史数据矩阵;S41: Data standardization operation, and obtain a standardized historical data matrix according to the standardized data;

S42:计算标准化历史数据矩阵中各评价指标与交通流的相关系数,对相关系数排序,将相关系数换算为百分比,然后将各相关系数按从大到小的顺序进行排序,并将相关系数按从大到小顺序依次累加,直至累加结果大于或等于第一阈值,则停止累加运算,剔除未被累加的相关系数较小的待定评价指标,从而得到K个主要评价指标;S42: Calculate the correlation coefficients between each evaluation index and traffic flow in the standardized historical data matrix, sort the correlation coefficients, convert the correlation coefficients into percentages, and then sort the correlation coefficients in descending order, and sort the correlation coefficients by Accumulate sequentially from large to small until the accumulation result is greater than or equal to the first threshold, then stop the accumulation operation, and eliminate the undetermined evaluation indicators that have not been accumulated and have a small correlation coefficient, thereby obtaining K main evaluation indicators;

S43:计算主要评价指标各自对于交通流的贡献权。S43: Calculate the respective contribution rights of the main evaluation indicators to the traffic flow.

进一步地,所述步骤S5具体包括以下分步骤:Further, the step S5 specifically includes the following sub-steps:

根据设定的时间窗口的长度为T,历史数据的采样频度t,得到时间窗口内历史数据的记录个数;According to the length of the set time window T and the sampling frequency t of historical data, the number of historical data records in the time window is obtained;

S52:构造交通流矩阵,并计算交通流矩阵对应的相关系数矩阵,根据得到的相关系数矩阵计算时间窗口内不同时段历史数据的占比:S52: Construct the traffic flow matrix, and calculate the correlation coefficient matrix corresponding to the traffic flow matrix, and calculate the proportion of historical data in different time periods in the time window according to the obtained correlation coefficient matrix:

S53:根据改进的欧式距离计算在每个主要评价指标下的交通流量预测值;S53: Calculate the traffic flow prediction value under each main evaluation index according to the improved Euclidean distance;

S54:采用线性加权综合评价的预测方法预测交通流量。S54: Predict the traffic flow using the prediction method of linear weighted comprehensive evaluation.

更进一步地,所述步骤S52具体包括以下分步骤:Further, the step S52 specifically includes the following sub-steps:

S521:构造交通流矩阵,假设交通流F长度为len的时间序列为:f1,f2,…fi,…,flen,满足条件len>>n,则构造如下交通流矩阵TF:S521: Construct a traffic flow matrix, assuming that the time series of traffic flow F with a length of len is: f 1 , f 2 ,...f i ,...,f len , satisfying the condition len>>n, then construct the following traffic flow matrix TF:

其中,矩阵第1列表示预测时间的数据,第2至n+1列表示时间窗口长度内的n个交通流历史数据;Among them, the first column of the matrix represents the data of the forecast time, and the second to n+1 columns represent n traffic flow historical data within the length of the time window;

S522:计算矩阵TF的相关系数矩阵:S522: Calculate the correlation coefficient matrix of the matrix TF:

S523:计算时间窗口内每个历史数据的占比αi′S523: Calculate the proportion α i′ of each historical data in the time window:

αα ii ′′ == rr 1,11,1 ++ ii ′′ ΣΣ ii ′′ == 11 nno rr 1,11,1 ++ ii ′′ ,, ii == 1,21,2 ,, ·&Center Dot; ·· ·· ,, nno ..

更进一步地,所述步骤S53具体包括以下分步骤:Further, the step S53 specifically includes the following sub-steps:

S531:获取当前主要评价指标rk,当前主要评价指标rk的值为Val,则将历史数据库中rk的值为Val的历史数据提取出来组成新的数据集DTS:<DTTM,F>,数据集DTS中记录的个数为dts_count;S531: Obtain the current main evaluation index r k , the value of the current main evaluation index r k is Val, then extract the historical data with the value of r k in the historical database to form a new data set DTS: <DTTM,F>, The number of records in the data set DTS is dts_count;

S532:若dts_count=0则表明主要评价指标rk的值为Val时,历史数据库中无相应的 数据,因此令Fpredict_k=0,同时转至步骤S535,否则转至步骤S533;S532: If dts_count=0, it indicates that when the value of the main evaluation index r k is Val, there is no corresponding data in the historical database, so set F predict_k =0, and go to step S535 at the same time, otherwise go to step S533;

其中,Fpredict_k表示第k个主要评价指标对应的交通流预测值;Among them, F predict_k represents the traffic flow prediction value corresponding to the k-th main evaluation index;

S533:将时刻(DTTMnow-T)至DTTMnow时间段内的长度为m的时间序列记为F_NOW,对数据集DTS中的每一个DTTMa,将时刻(DTTMa-T)至DTTMa时间段内的长度为m的时间序列记为F_HISTORYa,其中a=1,2,…,dts_count;S533: Record the time series of length m from the moment (DTTM now -T) to the time period of DTTM now as F_NOW, and for each DTTM a in the data set DTS, record the time series from the moment (DTTM a -T) to DTTM a The time series with length m in the segment is recorded as F_HISTORY a , where a=1,2,...,dts_count;

其中,DTTMnow表示预测时间点;Among them, DTTM now represents the prediction time point;

S534:在主要评价指标rk下,使用下式预测交通流的具体数值:S534: Under the main evaluation index r k , use the following formula to predict the specific value of traffic flow:

Fpredict_k=Fkey_dttm F predict_k = F key_dttm

其中,Fkey_dttm表示历史数据库中DTTM值为key_dttm所对应的历史记录中的交通流的值;Wherein, F key_dttm represents the value of the traffic flow in the historical record corresponding to key_dttm in the DTTM value in the history database;

S535:判断各主要评价指标是否已完成交通流的预测,是则结束判断,否则转到步骤S531,获取下一个主要评价指标。S535: Judging whether each main evaluation index has completed the traffic flow prediction, if yes, end the judgment; otherwise, go to step S531 to obtain the next main evaluation index.

更进一步地,所述步骤S534中key_dttm的取值应满足如下条件:Furthermore, the value of key_dttm in the step S534 should satisfy the following conditions:

dist(F_NOW,F_HISTORYkey_dttm)dist(F_NOW,F_HISTORY key_dttm )

=min{dist(F_NOW,F_HISTORYa)},a=1,2,…,dts_count,=min{dist(F_NOW,F_HISTORY a )}, a=1,2,...,dts_count,

其中,dist(i),表示根据改进的欧式距离求相似度,min{·}表示取集合内的最小值。Among them, dist(i) means to calculate the similarity based on the improved Euclidean distance, and min{·} means to take the minimum value in the set.

更进一步地,所述步骤S54具体为:Further, the step S54 is specifically:

S541:记各评价指标的累计贡献权值为wtotal,则当评价指标rk产生有效预测结果时,将其对交通流的贡献权值进行累加;S541: record the cumulative contribution weight of each evaluation index as w total , then when the evaluation index r k produces an effective prediction result, accumulate its contribution weight to the traffic flow;

S542:根据步骤S541得到的wtotal,采用线性加权综合评价的预测方法预测交通流量FpredictS542 : According to w total obtained in step S541 , predict the traffic flow F predict using a linear weighted comprehensive evaluation prediction method.

本发明的有益效果:本发明的一种基于综合评价的短时交通流预测方法,通过研究分析结合实际的交通流预测场景,获得待定评价指标;采集待定评价指标的历史数据,得到历史数据库;通过数据预处理,构建基于历史数据库的历史数据矩阵;通过计算各待定评价指标与交通流的相关系数,筛选出主要评价指标,并计算主要评价指标各自的贡献权值;结合实时的交通流数据,通过改进的时间序列相似性度量方法对历史数据库中的交通流历史数据进行分析,得到在每个主要评价指标下的交通流量预测值;根据得到的在每个主要评价指标下交通流量预测值,采用线性加权综合评价方法预测交通流量,本发明的方法综合多维时间序列,得到更加精准的数值预测结果。Beneficial effects of the present invention: a short-term traffic flow prediction method based on comprehensive evaluation of the present invention obtains undetermined evaluation indicators through research and analysis combined with actual traffic flow prediction scenarios; collects historical data of undetermined evaluation indicators to obtain a historical database; Through data preprocessing, construct a historical data matrix based on the historical database; by calculating the correlation coefficient between each undetermined evaluation index and traffic flow, select the main evaluation index, and calculate the respective contribution weight of the main evaluation index; combined with real-time traffic flow data , through the improved time series similarity measurement method to analyze the historical traffic flow data in the historical database, and obtain the traffic flow forecast value under each main evaluation index; according to the obtained traffic flow forecast value under each main evaluation index , using a linear weighted comprehensive evaluation method to predict traffic flow, and the method of the present invention synthesizes multidimensional time series to obtain more accurate numerical prediction results.

附图说明Description of drawings

图1为本发明的方法流程图。Fig. 1 is a flow chart of the method of the present invention.

图2为本发明的数据预处理流程图。Fig. 2 is a flow chart of data preprocessing in the present invention.

图3为本发明的评价指标的筛选与权值计算流程图。Fig. 3 is a flow chart of the screening and weight calculation of the evaluation index in the present invention.

图4为本发明的时间窗口示意图。Fig. 4 is a schematic diagram of the time window of the present invention.

图5为本发明的基于改进欧氏距离的时间序列相似性度量方法。Fig. 5 is the time series similarity measurement method based on the improved Euclidean distance of the present invention.

图6为本发明实施例提供的的时间窗口内历史数据的记录个数示意图。FIG. 6 is a schematic diagram of the number of records of historical data in a time window provided by an embodiment of the present invention.

图7为步骤S52的详细步骤分解图。FIG. 7 is a detailed step-by-step exploded diagram of step S52.

图8为步骤S53的详细步骤分解图。FIG. 8 is a detailed step-by-step exploded diagram of step S53.

具体实施方式detailed description

为便于本领域技术人员理解本发明的技术内容,下面结合附图对本发明内容进一步阐释。In order to facilitate those skilled in the art to understand the technical content of the present invention, the content of the present invention will be further explained below in conjunction with the accompanying drawings.

如图1所示为本发明的方法流程图,本专利所提出的基于综合评价的短时交通流预测算法共分为如下五个步骤:As shown in Figure 1, it is the method flowchart of the present invention, and the short-term traffic flow prediction algorithm based on comprehensive evaluation proposed by this patent is divided into the following five steps:

S1、通过研究分析结合实际的交通流预测场景,选出对交通流有影响的因素r1,r2,…rp,并作为待定的评价指标;S1. Select the factors r 1 , r 2 ,…r p that have an impact on traffic flow through research and analysis combined with the actual traffic flow forecasting scene, and use them as undetermined evaluation indicators;

S2、采集步骤S1获得的待定评价指标的历史数据,得到历史数据库;S2. Collect the historical data of the undetermined evaluation indicators obtained in step S1 to obtain a historical database;

S3、通过数据预处理,构建基于历史数据库的历史数据矩阵;S3. Through data preprocessing, construct a historical data matrix based on the historical database;

S4、指标筛选与权值计算,得出对交通流有较大影响的指标r1,r2,…rK,相应的贡献权值为w1,w2,…,wKS4. Index screening and weight calculation, the indicators r 1 , r 2 ,...r K that have a greater impact on traffic flow are obtained, and the corresponding contribution weights are w 1 , w 2 ,...,w K ;

S5、考虑每个评价指标rk,结合实时的交通流数据,通过改进的时间序列相似性度量方法对历史数据库中的交通流历史数据进行分析,得到在单个评价指标条件下交通流量预测值Fpredict_k;采用线性加权综合评价的预测方法得到未来短时的交通流量FpredictS5. Consider each evaluation index r k , combined with real-time traffic flow data, analyze the traffic flow historical data in the historical database through the improved time series similarity measurement method, and obtain the traffic flow prediction value F under the condition of a single evaluation index predict_k ; The future short-term traffic flow F predict is obtained by using the prediction method of linear weighted comprehensive evaluation.

本发明实施例以机场为例。The embodiment of the present invention takes an airport as an example.

步骤S1中评价指标选取可以通过调研分析结合实际的交通流预测场景,选出共p个对交通流F有影响的影响因子作为交通流预测的待定评价指标,记为r1,r2,…,rp。机场出租车运力受多种因素的影响,包括航班信息r1,天气信息r2,不同时段r3,乘坐出租车的人数r4,当前出租车的运力情况r5等。The selection of evaluation indicators in step S1 can be based on research and analysis combined with actual traffic flow forecasting scenarios, and a total of p factors that have an impact on traffic flow F can be selected as undetermined evaluation indicators for traffic flow forecasting, denoted as r 1 , r 2 ,… , r p . Airport taxi capacity is affected by many factors, including flight information r 1 , weather information r 2 , different time periods r 3 , the number of people taking taxis r 4 , the current capacity of taxis r 5 and so on.

步骤S2采集步骤S1获得的待定评价指标的历史数据,得到历史数据库为:DB_TABLE=[DTTM,F,r1,r2,…,rp];Step S2 collects the historical data of the undetermined evaluation indicators obtained in step S1, and obtains the historical database as follows: DB_TABLE=[DTTM,F,r 1 ,r 2 ,...,r p ];

其中,DTTM表示记录采样时间点,F表示对应采样时间点的交通流历史数据。Among them, DTTM represents the record sampling time point, and F represents the historical traffic flow data corresponding to the sampling time point.

所述步骤S3包括:S31:数据预处理;S32:构建历史数据矩阵;The step S3 includes: S31: data preprocessing; S32: constructing a historical data matrix;

如图2所示,所述步骤S31数据预处理具体为:数据预处理部分考虑到指标r1,r2,…rp的数据形式多样,包括符号、数值等多种数据类型,无法直接进行计算,同时不规整或错误的数据会造成计算结果的较大偏差。因此需要对相应的数据进行预处理,这样不仅能得到较好的预测结果,同时也能有效提升运算速度。如图2所示为本发明的数据预处理流程图,具体的数据预处理的主要步骤如下:As shown in Figure 2, the data preprocessing in step S31 is specifically: the data preprocessing part considers that the indicators r 1 , r 2 ,...r p have various data forms, including symbols, values and other data types, which cannot be directly performed At the same time, irregular or wrong data will cause large deviations in the calculation results. Therefore, it is necessary to preprocess the corresponding data, so that not only better prediction results can be obtained, but also the calculation speed can be effectively improved. As shown in Figure 2, it is the data preprocessing flowchart of the present invention, and the main steps of concrete data preprocessing are as follows:

S311:符号信息数值化。由于评价指标r1,r2,…rp可能包含文字等符号信息,如对交通流有较大影响的天气信息等。因此需要将文字类型的数据数值化便于计算。S311: Digitize the symbol information. Since the evaluation indicators r 1 , r 2 ,...r p may contain text and other symbolic information, such as weather information that has a great impact on traffic flow. Therefore, it is necessary to digitize the data of text type to facilitate calculation.

以天气信息r2为例,其对航班到港造成的影响程度将天气信息r2划分为4类,如表1所示。Taking the weather information r 2 as an example, the degree of its impact on the flight arrival divides the weather information r 2 into four categories, as shown in Table 1.

表1 天气信息分类Table 1 Classification of weather information

天气类型weather type 描述describe 第Ⅰ类天气(无影响)Category I weather (no impact) 晴天、阴天sunny, cloudy 第Ⅱ类天气(微弱影响)Category II weather (weak impact) 阵雨、小到中雨、小雨、小到中雪、小雪Showers, light to moderate rain, light rain, light to moderate snow, light snow 第Ⅲ类天气(中度影响)Category III weather (moderate impact) 中雪、雨夹雪、中雨、中到大雨、大雨、轻雾、霾Moderate snow, sleet, moderate rain, moderate to heavy rain, heavy rain, light fog, haze 第Ⅳ类天气(严重影响)Category IV weather (severe impact) 大雪、浮尘、大到暴雨、雷阵雨、暴雨、雾 Heavy snow, floating dust, torrential rain, thunderstorm, rainstorm, fog

以四种类型天气为基本类型,可得到如表2所示的天气变换趋势及其符号表示,本文设定考察天气变换趋势的时间窗口长度为2小时。Taking the four types of weather as the basic types, the weather change trend and its symbolic representation as shown in Table 2 can be obtained. In this paper, the time window length for investigating the weather change trend is set to 2 hours.

表2 天气变换趋势及其类型表Table 2 Weather change trend and its types

天气变换趋势weather change trend 类型Types of Ⅰ→ⅠⅠ → Ⅰ 11 Ⅰ→ⅡI → II 22 Ⅰ→ⅢI → III 33 Ⅰ→ⅣI → IV 44 Ⅱ→ⅠII→I 55 Ⅱ→ⅡII → II 66 Ⅱ→ⅢII → III 77 Ⅱ→ⅣII → IV 88 Ⅲ→ⅠⅢ→Ⅰ 99 Ⅲ→ⅡⅢ→Ⅱ 1010 Ⅲ→ⅢⅢ→Ⅲ 1111 Ⅲ→ⅣⅢ→Ⅳ 1212 Ⅳ→ⅠIV → I 1313 Ⅳ→ⅡIV → II 1414 Ⅳ→ⅢIV → III 1515 Ⅳ→ⅣIV → IV 16 16

S312:数值缺失与数值错误预处理。对于不连续的交通流数值序列,采用平均插值法对数据补齐。而对于多个时段统计和的数据采用按时段均分的方法补齐各时段的数据。对于明显错误的数据则将该数据点删除,同时采用平均插值法补齐。S312: Numerical missing and numerical error preprocessing. For the discontinuous numerical sequence of traffic flow, the average interpolation method is used to complete the data. For the statistics and data of multiple time periods, the method of evenly dividing by time period is used to complete the data of each time period. For obviously wrong data, the data point is deleted, and the average interpolation method is used to fill it up.

S313:数据预处理。由于任意评价指标rj,j=1,2,…,p其取值范围都比较大,若是将每一个值都视为一个分类,则会导致分类太多而影响预测效率。而对于评价指标rj而言,相同数量级的取值对交通流F往往有相同的影响,因此可以采用数据分段处理的方法对评价指标rj的取值集合{vje|j=1,2,…,p;e=1,2,…s}进行处理,其中s表示评价指标rj的取值个数。数据预处理后评价指标ri的取值集合表示为:S313: data preprocessing. Since any evaluation index r j ,j=1,2,...,p has a relatively large range of values, if each value is regarded as a category, it will lead to too many categories and affect the prediction efficiency. For the evaluation index r j , values of the same order of magnitude often have the same impact on the traffic flow F, so the method of data segment processing can be used to evaluate the value set of the evaluation index r j {v je |j=1, 2,...,p; e=1,2,...s} for processing, where s represents the value number of the evaluation index r j . The value set of evaluation index r i after data preprocessing is expressed as:

rj→{v* je|j=1,2,…,p;e=1,2,…s*}r j → {v * je |j=1,2,…,p; e=1,2,…s * }

其中,v* je表示将评价指标rj进行数据预处理后的取值,s*表示数据预处理后评价指标rj取值的个数。Among them, v * je represents the value of the evaluation index r j after data preprocessing, and s * represents the number of values of the evaluation index r j after data preprocessing.

步骤S32:构建历史数据矩阵具体为:根据步骤S2中得到的历史数据库的数据表,如下所示:Step S32: Constructing the historical data matrix is specifically: according to the data table of the historical database obtained in step S2, as follows:

DB_TABLE=[DTTM,F,r1,r2,…,rp];DB_TABLE=[DTTM,F,r 1 ,r 2 ,...,r p ];

其中,DTTM表示记录采样时间点,F表示对应采样时间点的交通流历史数据。基于上述历史数据表构建历史数据矩阵为:Among them, DTTM represents the record sampling time point, and F represents the historical traffic flow data corresponding to the sampling time point. The historical data matrix is constructed based on the above historical data table as follows:

MTX=[F,r1,r2,…,rp]MTX=[F,r 1 ,r 2 ,...,r p ]

步骤S4评价指标筛选与权值计算主要对步骤S1中所提出的评价指标进行判断与选择。如图3所示,通过计算各指标与交通流的相关度,选择相关性较大的评价指标,去除相关性较小或者无相关性的指标。具体步骤如下:Step S4 evaluation index screening and weight calculation mainly judges and selects the evaluation index proposed in step S1. As shown in Figure 3, by calculating the correlation between each index and traffic flow, the evaluation index with greater correlation is selected, and the index with less correlation or no correlation is removed. Specific steps are as follows:

S41:数据标准化操作,根据下式对数据进行标准化的操作:S41: data standardization operation, standardize the data according to the following formula:

xx igig ** == xx igig -- xx gg &OverBar;&OverBar; varvar (( xx gg )) ,, ii == 1,21,2 ,, &CenterDot;&CenterDot; &CenterDot;&Center Dot; &CenterDot;&Center Dot; ,, nno ;; gg == 1,21,2 ,, &CenterDot;&Center Dot; &CenterDot;&Center Dot; &CenterDot;&Center Dot; ,, qq

其中,分别表示历史数据矩阵中第g项的均值与方差,且q=p+1,得到标准化后的矩阵MTX′=(xig *)n×qin, and Represent the mean value and variance of the gth item in the historical data matrix respectively, and q=p+1, obtain the matrix MTX'=(x ig * ) n×q after standardization;

S42:计算矩阵MTX′各评价指标与交通流的相关系数uFg,如下所示:S42: Calculate the correlation coefficient u Fg between each evaluation index of the matrix MTX′ and the traffic flow, as follows:

uu FgFg == &Sigma;&Sigma; ii == 11 nno (( xx ** iFiF -- xx Ff ** &OverBar;&OverBar; )) (( xx ** igig -- xx gg ** &OverBar;&OverBar; )) &Sigma;&Sigma; ii == 11 nno (( xx ** iFiF -- xx Ff ** &OverBar;&OverBar; )) 22 &Sigma;&Sigma; kk == 11 nno (( xx ** igig -- xx gg ** &OverBar;&OverBar; )) 22 ,, ii == 1,21,2 ,, &CenterDot;&Center Dot; &CenterDot;&Center Dot; &CenterDot;&Center Dot; ,, nno ,, gg == 2,32,3 ,, &CenterDot;&Center Dot; &CenterDot;&Center Dot; &CenterDot;&CenterDot; ,, qq

对相关系数排序,将相关系数换算为百分比,然后将各相关系数按从大到小的顺序进行排序,并将相关系数按从大到小顺序依次累加,直至累加结果大于或等于第一阈值,本实施例中第一阈值取值为80%,则停止累加运算,剔除未被累加的相关系数较小的待定评价指标,从而得到K个主要评价指标r1,r2,…rk,…,rKSort the correlation coefficients, convert the correlation coefficients into percentages, then sort the correlation coefficients in descending order, and accumulate the correlation coefficients in descending order until the cumulative result is greater than or equal to the first threshold, In this embodiment, the value of the first threshold is 80%, then the accumulation operation is stopped, and the undetermined evaluation indicators with small correlation coefficients that have not been accumulated are eliminated, so as to obtain K main evaluation indicators r 1 , r 2 ,...r k ,... , r K ;

S43:计算主要评价指标各自对于交通流的贡献权值wk,公式如下:S43: Calculate the contribution weight w k of the main evaluation indicators to the traffic flow, the formula is as follows:

ww kk == uu FkFk &Sigma;&Sigma; kk == 11 KK uu FkFk ,, kk == 1,21,2 ,, &CenterDot;&CenterDot; &CenterDot;&CenterDot; &CenterDot;&CenterDot; ,, KK ;;

其中,uFk表示第k个主要与交通流的相关系数。Among them, u Fk represents the correlation coefficient between the kth main and traffic flow.

步骤S5中在仅考虑单个评价指标情况下进行交通流预测的过程中,采用时间序列相似性度量方法从历史数据库中查找最相似的交通流数据作为预测值。在考查交通流随时间变化的规律时,可以认为交通流是取值随时间变化的时间序列。而由于交通流的连续性,过去时段的交通流数据会对当前时段的交通流造成影响,如图4所示为时间窗口示意图。考虑到不同的历史时段对当前的交通流影响程度也不同,因此本发明采用改进的欧氏距离作为时间序列相似性度量的方法设计流程如图5所示。详细的设计度量方法包括如下步骤:In step S5, in the process of traffic flow prediction considering only a single evaluation index, the time series similarity measurement method is used to find the most similar traffic flow data from the historical database as the predicted value. When examining the law of traffic flow over time, it can be considered that traffic flow is a time series whose values change over time. Due to the continuity of traffic flow, the traffic flow data in the past period will affect the traffic flow in the current period, as shown in Figure 4, which is a schematic diagram of the time window. Considering that different historical time periods have different influences on the current traffic flow, the design flow of the method using the improved Euclidean distance as time series similarity measure in the present invention is shown in Fig. 5 . The detailed design measurement method includes the following steps:

S51:设定时间窗口的长度,假设时间窗口的长度为T(min),则当历史数据的采样频度为t(min)时,时间窗口内历史数据的记录个数为:S51: Set the length of the time window, assuming that the length of the time window is T (min), then when the sampling frequency of historical data is t (min), the number of historical data records in the time window is:

其中,记当前预测时间点为tpredict,记当前最近的采样时间点为tsample,Δt=tpredict-tsample表示取整。Among them, record the current prediction time point as t predict , record the current latest sampling time point as t sample , Δt=t predict -t sample , Indicates rounding.

S52:构造交通流矩阵,并计算交通流矩阵对应的相关系数矩阵,根据得到的相关系数矩阵计算时间窗口内不同时段历史数据的占比αi′S52: Construct the traffic flow matrix, and calculate the correlation coefficient matrix corresponding to the traffic flow matrix, and calculate the proportion α i′ of historical data in different time periods in the time window according to the obtained correlation coefficient matrix:

S53:计算在每个主要评价指标下的交通流量Fpredict_kS53: Calculate the traffic flow F predict_k under each main evaluation index;

S54:采用线性加权综合评价的预测方法预测交通流量FpredictS54: Predict the traffic flow F predict using a linear weighted comprehensive evaluation prediction method.

所述步骤S51具体为:设定时间窗口的长度为T(min),则假设历史数据的采样频度为t(min),可得时间窗口内历史数据的记录个数为:Described step S51 is specifically: the length of setting time window is T (min), then assumes that the sampling frequency of historical data is t (min), the record number of historical data in available time window is:

记当前预测时间点为tpredict,记当前最近的采样时间点为tsample,则时间窗口内历史数据的记录个数可表示为:Record the current prediction time point as t predict and the latest sampling time point as t sample , then the number of historical data records in the time window can be expressed as:

其中,Δt=tpredict-tsample表示取整。Among them, Δt=t predict -t sample , Indicates rounding.

如图6所示,1)预测点为半小时点,即8:00、8:30、9:00、9:30、10:00,采样时间点为半点,即8:00、8:30、9:00、9:30、10:00,则根据当前预测时间点为tpredict=9:30,当前最近的采样时间点为tsample=9:00,得到Δt=tpredict-tsample=30min<T=130min,则As shown in Figure 6, 1) The prediction point is half an hour, namely 8:00, 8:30, 9:00, 9:30, 10:00, and the sampling time is half an hour, namely 8:00, 8:30 , 9:00, 9:30, 10:00, then according to the current prediction time point is t predict = 9:30, and the current latest sampling time point is t sample = 9:00, then Δt = t predict -t sample = 30min<T=130min, then

2)预测点为半小时点,即8:00、8:30、9:00、9:30、10:00,采样时间点为半点差5分,即7:55、8:25、8:55、9:25、9:55,,则根据当前预测时间点为tpredict=9:30,当前最近的采样时间点为tsample=9:25,得到Δt=tpredict-tsample=5min<T=130min,则2) The prediction point is half an hour, that is, 8:00, 8:30, 9:00, 9:30, 10:00, and the sampling time point is 5 minutes from the half point, that is, 7:55, 8:25, 8:00 55, 9:25, 9:55, then according to the current prediction time point is t predict = 9:30, and the current latest sampling time point is t sample = 9:25, then Δt = t predict -t sample = 5min< T=130min, then

3)预测点为半小时点,即8:00、8:30、9:00、9:30、10:00,采样时间点为半点差15分,即7:45、8:15、8:45、9:15、9:45,,则根据当前预测时间点为tpredict=9:30,当前最近的采样时间点为tsample=9:15,得到Δt=tpredict-tsample=15min<T=130min,则3) The prediction point is half an hour, that is, 8:00, 8:30, 9:00, 9:30, 10:00, and the sampling time is 15 minutes from the half point, that is, 7:45, 8:15, 8:00 45, 9:15, 9:45, then according to the current prediction time point is t predict = 9:30, and the current latest sampling time point is t sample = 9:15, then Δt = t predict -t sample = 15min< T=130min, then

如图7所示,所述步骤S52具体为:在考查交通流随时间变化的规律时,可以认为交通流是取值随时间变化的时间序列。本文利采用以改进的欧氏距离作为时间序列相似性度量的方法,并从历史数据库中查找与当前场景最为相似的记录,并作为预测值。其中改进的欧式距离的表达如下:As shown in FIG. 7 , the step S52 is specifically: when examining the law of traffic flow changing over time, it can be considered that the traffic flow is a time series whose values change over time. This paper uses the method of using the improved Euclidean distance as the time series similarity measure, and finds the record most similar to the current scene from the historical database, and uses it as the predicted value. The expression of the improved Euclidean distance is as follows:

distdist (( PP ,, QQ )) == &Sigma;&Sigma; ii == 11 mm &alpha;&alpha; ii (( pp ii -- qq ii )) 22

其中,P,Q分别表示长度为m的时间序列,pi′,qi′表示对应时间点的序列值,αi′表示时间序列中所占的比重,且满足如图7所示,具体包括以下分步骤:Among them, P and Q represent the time series of length m respectively, p i′ and q i′ represent the sequence values at the corresponding time points, α i′ represents the proportion of the time series, and satisfy As shown in Figure 7, it specifically includes the following sub-steps:

S521:假设交通流F长度为len的时间序列为:f1,f2,…fi,…,flen,满足条件len>>n。则构造如下交通流矩阵TF:S521: Assume that the time series of traffic flow F with a length of len is: f 1 , f 2 ,...f i ,...,f len , and the condition len>>n is satisfied. Then construct the following traffic flow matrix TF:

其中,矩阵第一列即表示预测时间的数据,而后n列则表示时间窗口内的n·个交通流历史数据。Among them, the first column of the matrix represents the data of the forecast time , and the last n columns represent the n · traffic flow historical data in the time window.

S522:计算相关系数矩阵。S522: Calculate a correlation coefficient matrix.

对矩阵TF计算其相关系数矩阵:Calculate the correlation coefficient matrix for the matrix TF:

S523:计算时间窗口内每个历史数据的占比αi′S523: Calculate the proportion α i′ of each historical data in the time window:

&alpha;&alpha; ii &prime;&prime; == rr 1,11,1 ++ ii &prime;&prime; &Sigma;&Sigma; ii &prime;&prime; == 11 nno rr 1,11,1 ++ ii &prime;&prime; ,, ii == 1,21,2 ,, &CenterDot;&CenterDot; &CenterDot;&CenterDot; &CenterDot;&CenterDot; ,, nno

根据改进的欧式距离公式计算相似度 Calculate the similarity according to the improved Euclidean distance formula

其中,P表示长度为m的时间序列,Q表示长度为m的时间序列,pi′,qi′表示对应时间点的序列值,αi′表示每个历史数据在时间序列中所占的比重,且满足Among them, P represents the time series of length m, Q represents the time series of length m, p i′ and q i′ represent the sequence values of corresponding time points, and α i′ represents the proportion of each historical data in the time series specific gravity, and satisfy

&Sigma;&Sigma; ii &prime;&prime; == 11 mm &alpha;&alpha; ii &prime;&prime; == 11

所述步骤S53具体为:假设预测时间点为DTTMnow,其对应评价指标rk的值为Val。如图8所示,具体分步骤如下:The step S53 specifically includes: assuming that the predicted time point is DTTM now , the value of the corresponding evaluation index r k is Val. As shown in Figure 8, the specific steps are as follows:

S531:获取当前主要评价指标rk,当前主要评价指标rk的当前值为Val,则将历史数据库中rk的值为Val的历史数据提取出来组成新的数据集DTS:<DTTM,F>,数据集DTS中记录的个数为dts_count。S531: Obtain the current main evaluation index r k , and the current value of the main evaluation index r k is Val, then extract the historical data whose value of r k is Val in the historical database to form a new data set DTS: <DTTM,F> , the number of records in the dataset DTS is dts_count.

S532:若dts_count则表明主要评价指标rk的值为Val时,历史数据库中无相应的数据,因此令Fpredict_k=0,同时转至步骤S535,否则转至步骤S533。S532: If dts_count indicates that when the value of the main evaluation index r k is Val, there is no corresponding data in the historical database, so set F predict_k =0, and go to step S535, otherwise go to step S533.

S533:将时刻(DTTMnow-T)至DTTMnow时间段内的长度为m的时间序列记为F_NOW,对数据集DTS中的每一个DTTMa,将时刻(DTTMa-T)至DTTMa时间段内的长度为m的时间序列记为F_HISTORYa,其中a=1,2,…,dts_count;S533: Record the time series of length m from the moment (DTTM now -T) to the time period of DTTM now as F_NOW, and for each DTTM a in the data set DTS, record the time series from the moment (DTTM a -T) to DTTM a The time series with length m in the segment is recorded as F_HISTORY a , where a=1,2,...,dts_count;

S534:在主要评价指标rk下,使用下式预测交通流的具体数值:S534: Under the main evaluation index r k , use the following formula to predict the specific value of traffic flow:

Fpredict_k=Fkey_dttm F predict_k = F key_dttm

其中,Fkey_dttm表示历史数据库中DTTM值为key_dttm所对应的历史记录中的交通流的值。key_dttm的取值应满足如下条件:Wherein, F key_dttm represents the value of the traffic flow in the historical record corresponding to the DTTM value in the historical database key_dttm. The value of key_dttm should meet the following conditions:

数据集DTS中DTTM取值为key_dttm的取值应满足如下条件:The value of DTTM in the data set DTS as key_dttm should meet the following conditions:

dist(F_NOW,F_HISTORYkey_dttm)dist(F_NOW,F_HISTORY key_dttm )

=min{dist(F_NOW,F_HISTORYa)},a=1,2,…,dts_count=min{dist(F_NOW,F_HISTORY a )}, a=1,2,...,dts_count

即在仅考虑评价指标rk时,在所有历史记录中,采样时间为key_dttm所对应的历史数据与当前的场景最为相似。因此以Fkey_dttm的值作为在考察主要评价指标rk时的交通流量为Fpredict_kThat is, when only the evaluation index r k is considered, among all historical records, the historical data corresponding to the sampling time key_dttm is most similar to the current scene. Therefore, taking the value of F key_dttm as the traffic flow when examining the main evaluation index r k is F predict_k .

S535:判断各主要评价指标是否已完成交通流的预测,是则结束判断,否则转到步骤S531,执行k=k+1;S535: Judging whether each main evaluation index has completed the traffic flow prediction, if yes, then end the judgment, otherwise go to step S531, execute k=k+1;

其中,k表示第k个主要评价指标,且k=1,2,…,K。Wherein, k represents the kth main evaluation index, and k=1,2,...,K.

所述步骤S54具体包含以下分步骤:The step S54 specifically includes the following sub-steps:

S541:记各评价指标的累计贡献权值为wtotal,则当评价指标rk产生有效预测结果时, 将其对交通流的贡献权值累加至wtotal,如下式所示:S541: Record the cumulative contribution weight of each evaluation index as w total , then when the evaluation index r k produces an effective prediction result, add its contribution weight to the traffic flow to w total , as shown in the following formula:

wtotal=wtotal+wkw total = w total + w k ;

S542:根据步骤S541得到的wtotal,采用线性加权综合评价的预测方法预测交通流量Fpredict,计算公式如下:S542: According to w total obtained in step S541, the traffic flow F predict is predicted by using the prediction method of linear weighted comprehensive evaluation, and the calculation formula is as follows:

Ff predictpredict == &Sigma;&Sigma; kk == 11 KK (( ww kk ww totaltotal &times;&times; Ff predictpredict __ kk ))

其中,wk是主要评价指标rk对交通流Fpredict的贡献度,wtotal是各主要评价指标的累计贡献度,Fpredict_k是在仅考察主要评价指标rk的前提下所得到的交通流预测结果。Among them, w k is the contribution degree of the main evaluation index r k to the traffic flow F predict , w total is the cumulative contribution degree of each main evaluation index, and F predict_k is the traffic flow obtained under the premise of only examining the main evaluation index r k forecast result.

本发明的一种基于综合评价的短时交通流预测方法,通过研究分析结合实际的交通流预测场景,获得待定评价指标;采集待定评价指标的历史数据,得到历史数据库;通过数据预处理,构建基于历史数据库的历史数据矩阵;通过计算各待定评价指标与交通流的相关系数,筛选出主要评价指标,并计算主要评价指标各自的贡献权值;结合实时的交通流数据,通过改进的时间序列相似性度量方法对历史数据库中的交通流历史数据进行分析,得到在每个主要评价指标下的交通流量预测值;根据得到的在每个主要评价指标下交通流量预测值,采用线性加权综合评价方法预测交通流量,本发明的方法综合多维时间序列,得到更加精准的数值预测结果。A short-term traffic flow prediction method based on comprehensive evaluation of the present invention obtains undetermined evaluation indicators through research and analysis combined with actual traffic flow prediction scenarios; collects historical data of undetermined evaluation indicators to obtain a historical database; through data preprocessing, constructs Based on the historical data matrix of the historical database; by calculating the correlation coefficient between the undetermined evaluation indicators and traffic flow, the main evaluation indicators are screened out, and the respective contribution weights of the main evaluation indicators are calculated; combined with real-time traffic flow data, through the improved time series The similarity measurement method analyzes the historical data of traffic flow in the historical database to obtain the predicted value of traffic flow under each main evaluation index; according to the obtained predicted value of traffic flow under each main evaluation index, a linear weighted comprehensive evaluation is used The method predicts traffic flow, and the method of the present invention synthesizes multidimensional time series to obtain more accurate numerical prediction results.

本领域的普通技术人员将会意识到,这里所述的实施例是为了帮助读者理解本发明的原理,应被理解为本发明的保护范围并不局限于这样的特别陈述和实施例。对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的权利要求范围之内。Those skilled in the art will appreciate that the embodiments described here are to help readers understand the principles of the present invention, and it should be understood that the protection scope of the present invention is not limited to such specific statements and embodiments. Various modifications and variations of the present invention will occur to those skilled in the art. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included within the scope of the claims of the present invention.

Claims (8)

1. A short-term traffic flow prediction method based on comprehensive evaluation is characterized by comprising the following steps:
s1, obtaining an evaluation index to be determined by combining research and analysis with an actual traffic flow prediction scene;
s2, acquiring historical data of the to-be-determined evaluation index obtained in the step S1 to obtain a historical database;
s3, constructing a historical data matrix based on a historical database through data preprocessing;
s4, calculating correlation coefficients of each undetermined evaluation index and traffic flow, converting the correlation coefficients into percentages, sequencing the correlation coefficients in descending order, accumulating the correlation coefficients in descending order until the accumulation result is larger than or equal to a first threshold value, stopping accumulation operation, eliminating the undetermined evaluation indexes with smaller correlation coefficients which are not accumulated, obtaining main evaluation indexes, and calculating the contribution weight of each main evaluation index;
s5, analyzing the traffic flow historical data in the historical database through the improved Euclidean distance by combining with the real-time traffic flow data to obtain a traffic flow predicted value under each main evaluation index; and predicting the traffic flow by adopting a linear weighted comprehensive evaluation method according to the obtained traffic flow predicted value under each main evaluation index.
2. The short-term traffic flow prediction method based on comprehensive evaluation according to claim 1, wherein the historical database created in step S2 is: DB _ TABLE [ DTTM, F, r ]1,r2,…,rp];
Wherein, DTTM represents the recording sampling time point, F represents the traffic flow historical data corresponding to the sampling time point; r is1,r2,…,rpIndicates an evaluation index to be determined.
3. The short-term traffic flow prediction method based on comprehensive evaluation according to claim 1, wherein the step S3 specifically includes: s31: preprocessing data; s32: constructing a historical data matrix;
the step S31 of preprocessing data specifically includes the following sub-steps:
s311: digitizing the symbol information, and converting the symbol information into digitized information;
s312: numerical value missing and numerical value error preprocessing, namely, for a discontinuous traffic flow numerical value sequence, filling data by adopting an average interpolation method; the data of each time interval is supplemented by adopting a method of equally dividing the data of the statistical sums of a plurality of time intervals according to the time intervals; deleting the data point for the obviously wrong data, and simultaneously adopting an average interpolation method for completing;
s313: data preprocessing, namely performing data processing on the value set of each to-be-evaluated index by adopting data segmentation processing;
the step S32: the specific steps for constructing the historical data matrix are as follows: a history data matrix is constructed based on the history database of step S2.
4. The short-term traffic flow prediction method based on comprehensive evaluation according to claim 1, wherein the step S4 of index screening and weight calculation specifically comprises the following substeps:
s41: carrying out data standardization operation, and obtaining a standardized historical data matrix according to the standardized data;
s42: calculating correlation coefficients of all evaluation indexes and traffic flow in a standardized historical data matrix, sequencing the correlation coefficients, converting the correlation coefficients into percentages, sequencing the correlation coefficients from large to small, sequentially accumulating the correlation coefficients from large to small until an accumulation result is larger than or equal to a first threshold value, stopping accumulation operation, and eliminating undetermined evaluation indexes with smaller correlation coefficients which are not accumulated, thereby obtaining K main evaluation indexes;
s43: and calculating the contribution of each main evaluation index to the traffic flow.
5. The short-term traffic flow prediction method based on comprehensive evaluation according to claim 1, wherein the step S5 specifically includes the following substeps:
s51: obtaining the record number of the historical data in the time window according to the set length of the time window as T and the sampling frequency T of the historical data;
s52: constructing a traffic flow matrix, calculating a correlation coefficient matrix corresponding to the traffic flow matrix, and calculating the ratio of historical data in different time periods in a time window according to the obtained correlation coefficient matrix;
s53: calculating a traffic flow predicted value under each main evaluation index according to the improved Euclidean distance;
s54: and predicting the traffic flow by adopting a linear weighted comprehensive evaluation prediction method.
6. The short-term traffic flow prediction method based on comprehensive evaluation according to claim 5, wherein the step S53 specifically comprises the following substeps:
s531: obtaining the current main evaluation index rkCurrent main evaluation index rkWhen the value of (a) is Val, r in the history database is setkThe historical data with the value Val is extracted to form a new data set DTS:<DTTM,F>the number of records in the data set DTS is DTS _ count;
s532: if dts _ count is 0, this indicates the main evaluation index rkWhen the value of (A) is Val, there is no corresponding data in the history database, so let Fpredict_kGo to step S535 if it is 0, otherwise go to step S533;
wherein, Fpredict_kRepresenting a traffic flow predicted value corresponding to the kth main evaluation index;
s533: time of Day (DTTM)now-T) to DTTMnowThe time series of length m within the time period is denoted as F _ NOW, DTTM for each of the data sets DTSaTime of Day (DTTM)a-T) to DTTMaThe time series with length m in the time period is recorded as F _ HISTORYaWhere a is 1,2, …, dts _ count;
wherein, DTTMnowRepresenting a predicted time point;
s534: in the main evaluation index rkNext, specific values for traffic flow are predicted using the following formula:
Fpredict_k=Fkey_dttm
wherein, Fkey_dttmThe DTTM value in the historical database is a traffic flow value in the historical record corresponding to the key _ DTTM;
s535: and judging whether each main evaluation index completes the prediction of the traffic flow, if so, ending the judgment, otherwise, turning to the step S531 to obtain the next main evaluation index.
7. The short-term traffic flow prediction method based on comprehensive evaluation according to claim 6, wherein the value of key _ dttm in the step S534 should satisfy the following condition:
d i s t ( F _ N O W , F _ HISTORY k e y _ d t t m ) = min { d i s t ( F _ N O W , F _ HISTORY a ) } , a = 1 , 2 , ... , d t s _ c o u n t ;
wherein, F _ HISTORYkey_dttmRepresenting the historical data corresponding to the key _ dttm of sampling time, dist (·) representing the similarity obtained according to the improved Euclidean distance, and min {. represents the minimum value in the set.
8. The short-term traffic flow prediction method based on comprehensive evaluation according to claim 5, wherein the step S54 is specifically:
s541: the accumulated contribution weight of each evaluation index is recorded as wtotalWhen the evaluation index r is determinedkWhen an effective prediction result is generated, accumulating the contribution weight of the effective prediction result to the traffic flow;
s542: w obtained from step S541totalPredicting the traffic flow F by adopting a prediction method of linear weighted comprehensive evaluationpredict
CN201510212889.4A 2015-04-29 2015-04-29 Short-time traffic flow prediction method based on integrated evaluation Expired - Fee Related CN104821082B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510212889.4A CN104821082B (en) 2015-04-29 2015-04-29 Short-time traffic flow prediction method based on integrated evaluation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510212889.4A CN104821082B (en) 2015-04-29 2015-04-29 Short-time traffic flow prediction method based on integrated evaluation

Publications (2)

Publication Number Publication Date
CN104821082A CN104821082A (en) 2015-08-05
CN104821082B true CN104821082B (en) 2017-01-18

Family

ID=53731362

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510212889.4A Expired - Fee Related CN104821082B (en) 2015-04-29 2015-04-29 Short-time traffic flow prediction method based on integrated evaluation

Country Status (1)

Country Link
CN (1) CN104821082B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110517479A (en) * 2018-05-22 2019-11-29 杭州海康威视系统技术有限公司 A kind of urban highway traffic prediction technique, device and electronic equipment
US12165506B2 (en) 2018-10-18 2024-12-10 Alibaba Group Holding Limited Method and device for predicting traffic flow or travel time period

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105118294B (en) * 2015-09-25 2017-03-29 山东易构软件技术股份有限公司 A kind of Short-time Traffic Flow Forecasting Methods based on state model
CN105957329B (en) * 2016-06-29 2019-04-19 芜湖达成储运有限公司 A kind of highway information intelligence system
CN107844848B (en) * 2016-09-20 2020-12-29 中国移动通信集团湖北有限公司 A method and system for predicting the flow of people in a region
CN106951976B (en) * 2016-10-12 2021-09-21 华南理工大学 Bus passenger flow prediction method based on mode classification
CN108090241B (en) * 2016-11-23 2021-08-17 财团法人工业技术研究院 Method and system for trend variable identification in continuous process
CN107066831B (en) * 2017-05-19 2021-04-23 君晟合众(北京)科技有限公司 Regional comprehensive environment evaluation method, device and system
CN107085942B (en) * 2017-06-26 2021-01-26 广东工业大学 Traffic flow prediction method, device and system based on wolf colony algorithm
CN107610470A (en) * 2017-10-31 2018-01-19 迈锐数据(北京)有限公司 A kind of traffic congestion evaluation method and device
CN108053646B (en) * 2017-11-22 2020-12-08 华中科技大学 Traffic feature acquisition method, prediction method and system based on time-sensitive features
CN108922183A (en) * 2018-07-17 2018-11-30 重庆大学 A kind of highway short term traffic forecasting method of space time correlation degree in conjunction with SVR
CN109241077A (en) * 2018-08-30 2019-01-18 东北大学 Production target variation tendency visual query system and method based on similitude
CN109559512B (en) * 2018-12-05 2021-08-24 北京掌行通信息技术有限公司 Regional traffic flow prediction method and device
CN111862583B (en) * 2019-04-24 2021-12-03 北京嘀嘀无限科技发展有限公司 Traffic flow prediction method and device
CN111862592B (en) * 2020-05-27 2021-12-17 浙江工业大学 A Traffic Flow Prediction Method Based on RGCN
CN117437786B (en) * 2023-12-21 2024-02-27 速度科技股份有限公司 Real-time traffic flow prediction method based on artificial intelligence for traffic network
CN118968748B (en) * 2024-07-16 2025-02-28 重庆高速公路集团有限公司 Highway Section Traffic Flow Prediction Method Based on Historical Data Situation Mining

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101694744A (en) * 2009-10-28 2010-04-14 北京交通大学 Method and system for evaluating road emergency evacuation capacity and method and system for grading road emergency evacuation capacity
CN102693633A (en) * 2012-06-07 2012-09-26 浙江大学 Short-term traffic flow weighted combination prediction method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5017437B2 (en) * 2010-08-27 2012-09-05 日本電信電話株式会社 TRAVEL TIME ESTIMATION METHOD, TRAVEL TIME ESTIMATION DEVICE, TRAVEL TIME ESTIMATION PROGRAM IMPLEMENTING THE METHOD AND MEDIUM CONTAINING THE PROGRAM, TRAVEL TIME PREDICTION METHOD, TRAVEL TIME PREDICTION DEVICE, TRAVEL TIME PREDICTION PROGRAM INCLUDING THE METHOD, AND ITS PROGRAM Stored media
KR101347603B1 (en) * 2012-06-27 2014-01-07 명지대학교 산학협력단 Method for orinin based traffic assign model using shortest path

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101694744A (en) * 2009-10-28 2010-04-14 北京交通大学 Method and system for evaluating road emergency evacuation capacity and method and system for grading road emergency evacuation capacity
CN102693633A (en) * 2012-06-07 2012-09-26 浙江大学 Short-term traffic flow weighted combination prediction method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于卡尔曼滤波的短时交通流量预测模型研究;张一;《中国优秀硕士学位论文全文数据库工程科技Ⅱ辑》;20150315(第3期);正文第25-33页 *
船舶交通流组合预测方法研究;刘敬贤等;《中国航海》;20090930;第32卷(第3期);第80-84页 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110517479A (en) * 2018-05-22 2019-11-29 杭州海康威视系统技术有限公司 A kind of urban highway traffic prediction technique, device and electronic equipment
CN110517479B (en) * 2018-05-22 2020-11-03 杭州海康威视系统技术有限公司 Urban road traffic prediction method and device and electronic equipment
US12165506B2 (en) 2018-10-18 2024-12-10 Alibaba Group Holding Limited Method and device for predicting traffic flow or travel time period

Also Published As

Publication number Publication date
CN104821082A (en) 2015-08-05

Similar Documents

Publication Publication Date Title
CN104821082B (en) Short-time traffic flow prediction method based on integrated evaluation
CN107610469B (en) Day-dimension area traffic index prediction method considering multi-factor influence
CN101216998B (en) An urban traffic flow information amalgamation method of evidence theory based on fuzzy rough sets
CN108416468B (en) Flood advanced early warning and forecasting method
CN102819677B (en) Rainfall site similarity evaluation method on basis of single rainfall type
CN105741549B (en) A kind of traffic flow forecasting method theoretical based on space copula
CN107292417B (en) Regional heavy pollution discrimination and forecast method and device based on heavy pollution sequence case library
CN111027763A (en) A Machine Learning-Based Method for Similarity Analysis of Watershed Flood Responses
CN117558124A (en) Channel running state deduction prediction method based on multi-source information depth fusion
Badhiye et al. Temperature and humidity data analysis for future value prediction using clustering technique: an approach
CN110675626B (en) Traffic accident black point prediction method, device and medium based on multidimensional data
CN108648445B (en) Dynamic traffic situation prediction method based on traffic big data
CN113159374B (en) Data-driven urban traffic flow rate mode identification and real-time prediction early warning method
Badhiye et al. Analysis of Temperature and Humidity Data for Future value prediction
CN106682703A (en) Method for recognizing travel mode of driver based on OBD data
CN114580171B (en) A method for identification of watershed flood types and analysis of their influencing factors
CN112508237A (en) Rain type region division method based on data analysis and real-time rain type prediction method
CN113283634B (en) Expressway accident prediction method and system based on gradient lifting tree model
CN113836808A (en) PM2.5 deep learning prediction method based on heavy pollution feature constraint
CN112669595A (en) Online taxi booking flow prediction method based on deep learning
CN113095597A (en) Novel flood process design method suitable for small watershed without data
CN111145535B (en) Travel time reliability distribution prediction method under complex scene
CN111175852B (en) Airport fog forecast early warning method based on long-time memory algorithm
CA3046873A1 (en) A method of predicting a traffic behaviour in a road system
CN112183597B (en) A non-equilibrium traffic accident severity analysis method based on spectral clustering

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170118

Termination date: 20190429