CN105761488B - Real-time extreme learning machine Short-time Traffic Flow Forecasting Methods based on fusion - Google Patents
Real-time extreme learning machine Short-time Traffic Flow Forecasting Methods based on fusion Download PDFInfo
- Publication number
- CN105761488B CN105761488B CN201610190046.3A CN201610190046A CN105761488B CN 105761488 B CN105761488 B CN 105761488B CN 201610190046 A CN201610190046 A CN 201610190046A CN 105761488 B CN105761488 B CN 105761488B
- Authority
- CN
- China
- Prior art keywords
- traffic flow
- short
- real
- prediction
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000004927 fusion Effects 0.000 title claims abstract description 18
- 238000013277 forecasting method Methods 0.000 title description 6
- 238000000034 method Methods 0.000 claims abstract description 11
- 230000007246 mechanism Effects 0.000 claims abstract description 9
- 230000000694 effects Effects 0.000 claims description 10
- 239000011159 matrix material Substances 0.000 claims description 5
- 238000007781 pre-processing Methods 0.000 claims description 4
- 238000011156 evaluation Methods 0.000 claims description 3
- 230000006870 function Effects 0.000 claims description 3
- 238000012360 testing method Methods 0.000 claims description 3
- 230000004913 activation Effects 0.000 claims description 2
- 239000006185 dispersion Substances 0.000 claims description 2
- 230000007786 learning performance Effects 0.000 claims description 2
- 238000013528 artificial neural network Methods 0.000 abstract description 8
- 238000012549 training Methods 0.000 abstract description 4
- 238000005516 engineering process Methods 0.000 abstract description 2
- 238000001514 detection method Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 4
- 230000002159 abnormal effect Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 206010039203 Road traffic accident Diseases 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000002542 deteriorative effect Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0125—Traffic data processing
Landscapes
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Traffic Control Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明公开了一种基于融合的实时极限学习机短时交通流预测方法,针对短时非平稳交通流场景中的预测技术,基于短时交通流实时性、准确性、可靠性三大特征和融合的实时极限学习机来预测短时交通流。本发明提出的短时交通流预测方法,基于简化的单隐层前馈神经网络结构,能够在交通流峰值期快速的训练历史数据并能增量地更新到达的数据,在保证一定预测精度的同时节省学习时间。此外,采用融合机制保证了短时交通流预测的稳定性和鲁棒性。在数据缺失和波动剧烈时期进行重构,训练阶段时耗短,且预测结果的均方根误差、标准误差百分比均在置信区域内。
The invention discloses a fusion-based real-time extreme learning machine short-term traffic flow prediction method, aiming at the prediction technology in the short-term non-stationary traffic flow scene, based on the three characteristics of short-term traffic flow real-time, accuracy and reliability and Fused real-time extreme learning machine to predict short-term traffic flow. The short-term traffic flow prediction method proposed by the present invention is based on a simplified single hidden layer feed-forward neural network structure, which can quickly train historical data during the peak period of traffic flow and incrementally update the arriving data, while ensuring a certain prediction accuracy Also save study time. In addition, the adoption of fusion mechanism ensures the stability and robustness of short-term traffic flow prediction. Reconstruction is performed during periods of missing data and severe fluctuations, and the training phase takes a short time, and the root mean square error and standard error percentage of the prediction results are all within the confidence area.
Description
技术领域technical field
本发明主要涉及机器学习、交通流预测等智能交通系统领域,尤其是基于融合的实时极限学习机短时交通流预测方法。The invention mainly relates to the fields of intelligent transportation systems such as machine learning and traffic flow forecasting, especially a fusion-based real-time extreme learning machine short-term traffic flow forecasting method.
背景技术Background technique
随着全球经济发展和社会城市化进步,交通运输业的发展变得越来越重要。作为人类社会进步的重要物质基础,交通运输业是整个国民经济发展的命脉。然而近年来,道路车辆逐步增多导致交通运行效率每况愈下,交通堵塞、交通废气排放污染、交通操作繁杂效率低下、交通事故等现象频繁出现,给人们出行等社会活动带来困扰。为了减缓交通问题,智能交通系统(ITS)等一系列先进管理控制系统飞速发展并得到广泛应用,在一定程度上解决部分道路交通问题。交通流预测对于支持交通管理系统的需求预测功能起着重要作用。交通流量数据(也可说交通参数)可直接反映宏观的交通状态,是交通业务的基础数据,同时,交通流量数据也是交通中最易采集的数据,可以通过感应线圈检测、微波检测、视频检测、全球定位系统(Global Position System,简称GPS)、社会媒体设备等多种方法获取,包括交通流、交通速度、交通占比、行程时间等信息。交通流预测实质上是对这些交通流基本参数的预测,根据预测周期长度可以将交通流量预测分为两类:短期预测和中长期预测。交通流数据分布呈现出两个峰值两个低谷特征,类似高斯分布,进行峰值预测,以及提高短时交通流预测实时性和高精度不仅可以及时告知驾乘者交通信息,还能设计和实现移动基础设施。短时交通流预测要求实时性,预测难度较大,得益于车载与道路传感设施不断完善以及ITS和交通控制系统的支持,短时交通预测技术也不断发展。With the development of the global economy and the progress of social urbanization, the development of the transportation industry has become more and more important. As an important material basis for the progress of human society, the transportation industry is the lifeblood of the entire national economic development. However, in recent years, the gradual increase of road vehicles has led to a deteriorating traffic operation efficiency. Traffic congestion, traffic exhaust pollution, complicated and inefficient traffic operations, and frequent traffic accidents have brought troubles to people's travel and other social activities. In order to alleviate traffic problems, a series of advanced management and control systems such as Intelligent Transportation System (ITS) have developed rapidly and been widely used, which can solve some road traffic problems to a certain extent. Traffic flow forecasting plays an important role in supporting the demand forecasting function of traffic management systems. Traffic flow data (also called traffic parameters) can directly reflect the macroscopic traffic status and is the basic data of traffic business. At the same time, traffic flow data is also the most easily collected data in traffic, which can be detected by induction coils, microwaves, and video , Global Positioning System (Global Position System, referred to as GPS), social media equipment and other methods to obtain, including traffic flow, traffic speed, traffic proportion, travel time and other information. Traffic flow prediction is essentially the prediction of these basic parameters of traffic flow. According to the length of the forecast period, traffic flow forecasting can be divided into two categories: short-term forecasting and medium- and long-term forecasting. The distribution of traffic flow data presents two peaks and two troughs, similar to the Gaussian distribution, peak prediction, and improving the real-time and high precision of short-term traffic flow prediction can not only inform drivers and passengers of traffic information in time, but also design and implement mobile infrastructure. Short-term traffic flow forecasting requires real-time performance and is difficult to predict. Thanks to the continuous improvement of vehicle and road sensing facilities and the support of ITS and traffic control systems, short-term traffic forecasting technology is also developing continuously.
早期短时交通流预测方法大多交通流预测方式是在简单平稳均衡交通流数据假设下进行预测的,对于数据集的限制条件比较多,比如等间隔采样、间隔长短适中、历史数据样本数量合适、样本数据无噪声等等,但在现实交通场景中,因设备本身故障或外部因素(如天气恶劣、道路异常等)干扰,交通流数据在采集和传输过程中容易发生缺失、突变;上下班、节假日时期交通流容易激增达到峰值等,这些情况都可导致交通流数据出现非平稳非线性等异质性,这里的异质性值数据分布不均匀和复杂性,再加上设备成本因素的考虑,很多数据监控、采集、处理、传输等设备不可能覆盖全部交通网,更是增加了交通流数据的异质性,所有这给因素都给交通流预测模型建模增加了难度,预测的实时性、准确性、稳定性有待提高。In the early short-term traffic flow forecasting methods, most of the traffic flow forecasting methods were forecasted under the assumption of simple and balanced traffic flow data, and there were many restrictions on the data set, such as equal interval sampling, moderate interval length, appropriate number of historical data samples, The sample data has no noise, etc., but in the real traffic scene, due to the failure of the equipment itself or the interference of external factors (such as bad weather, abnormal roads, etc.), the traffic flow data is prone to loss and mutation during the collection and transmission process; During holidays, the traffic flow tends to surge and reach the peak value. These situations can lead to non-stationary and nonlinear heterogeneity in the traffic flow data. Here, the heterogeneity value data distribution is uneven and complex, coupled with the consideration of equipment cost factors , many data monitoring, collection, processing, transmission and other equipment cannot cover the entire traffic network, which increases the heterogeneity of traffic flow data. All these factors add difficulty to the modeling of traffic flow prediction models. Performance, accuracy, and stability need to be improved.
为了加强短时交通流预测方法的可扩展性和鲁棒性,使其在交通流数据非稳定异质情形下仍能达到一定预测实时性和精度要求,扩展已有预测模型的适用性,进行实时异质时序交通流的预测方法研究很有必要。In order to strengthen the scalability and robustness of the short-term traffic flow forecasting method, so that it can still meet certain real-time and accuracy requirements in the case of unstable and heterogeneous traffic flow data, and expand the applicability of existing forecasting models, the Research on forecasting methods of real-time heterogeneous time-series traffic flow is necessary.
发明内容Contents of the invention
本发明提出了一种基于融合的实时极限学习机短时交通流预测方法,以提高短时交通流数据的预测精度和可靠性,适用于实时交通流预测。The invention proposes a fusion-based real-time extreme learning machine short-term traffic flow prediction method to improve the prediction accuracy and reliability of short-term traffic flow data, and is suitable for real-time traffic flow prediction.
本发明的设计思路为,基于融合的实时极限学习机预测短时交通流,短时交通流数据呈现周期性和实时性、准确性、可靠性特征,随机选取实时交通流数据,将序列学习思想应用于ELM算法并提出了实时序列ELM算法。实时序列ELM算法是在原始ELM算法的基础上,采用在线学习模式而提出的一种新的算法,在该算法中,数据可以一个一个或一块一块地添加到网络中,并且原先的数据学习完成后就会抛弃不再使用。The design idea of the present invention is to predict the short-term traffic flow based on the integrated real-time extreme learning machine. The short-term traffic flow data presents the characteristics of periodicity, real-time, accuracy and reliability, and the real-time traffic flow data is randomly selected, and the sequence learning idea It is applied to the ELM algorithm and a real-time sequence ELM algorithm is proposed. The real-time sequence ELM algorithm is a new algorithm proposed based on the original ELM algorithm and using the online learning mode. In this algorithm, data can be added to the network one by one or piece by piece, and the original data learning is completed. Then it will be discarded and no longer used.
本发明所采用的技术方案为:一种基于融合的实时极限学习机短时交通流预测方法,包括以下步骤:The technical scheme adopted in the present invention is: a kind of fusion-based real-time extreme learning machine short-term traffic flow prediction method, comprising the following steps:
S1、随机选定道路上的探测器,按照预设的时间周期采集短时交通流数据;S1. Randomly select detectors on the road to collect short-term traffic flow data according to the preset time period;
S2、预处理并归一化获得的交通流数据,判断所处的交通场景是平稳情况还是非平稳情况;S2, preprocessing and normalizing the obtained traffic flow data, and judging whether the traffic scene is in a stable situation or a non-stationary situation;
S3、如果是非平稳交通场景,初始化短时交通流预测模型;S3. If it is a non-stationary traffic scene, initialize the short-term traffic flow prediction model;
S4、建立短时交通流预测模型的实时序列学习部分;S4, establishing the real-time sequence learning part of the short-term traffic flow prediction model;
S5、完成短时交通流预测模型中的预测模块;S5. Complete the prediction module in the short-term traffic flow prediction model;
S6、将预测结果进行反归一化处理并进行评估。S6. Denormalize and evaluate the prediction results.
S7、如果是平稳场景,则可直接按照S4到S6步骤进行预测。S7. If it is a stable scene, the prediction can be performed directly according to steps S4 to S6.
进一步的,步骤S3中初始化短时交通流预测模型包括以下步骤:Further, initializing the short-term traffic flow prediction model in step S3 includes the following steps:
S31、初始数据集为随机分配预测模型的输入参数,包括输入结点和隐结点之间的权向量wi、阈值bi,并随机选取隐结点的输入权值ai和阈值bi,其中i=1,2,…,L;S31. The initial data set is Randomly assign the input parameters of the prediction model, including the weight vector wi and the threshold bi between the input node and the hidden node, and randomly select the input weight ai and threshold bi of the hidden node, where i=1,2,..., L;
S32、计算隐层输出矩阵H0:S32. Calculate the hidden layer output matrix H 0 :
S33、计算初始输出权值β(0),为确保极限学习机可以保持同样的学习性能,假设H的秩为有且已证明β=H+T和H+=(HTH)-1HT,则有:S33. Calculate the initial output weight β (0) . In order to ensure that the extreme learning machine can maintain the same learning performance, it is assumed that the rank of H is Have And it has been proved that β=H + T and H+=(H T H) -1 H T , then:
其中P0=(H0 TH0)-1,M0=H0 TH0=P0 -1;Where P 0 =(H 0 T H 0 ) -1 , M 0 =H 0 T H 0 =P 0 -1 ;
其中β、H和T分别指的是集合{β(0),β(1),β(2),…,β(n)}、{H0,H1,H2,…,Hn}、{T0,T1,T2,…,Tn};where β, H and T refer to the sets {β (0) , β (1) , β (2) , ..., β (n) }, {H 0 , H 1 , H 2 , ..., Hn}, {T 0 , T 1 , T 2 ,..., Tn};
S34、设置到达的数据块序列k=0。S34. Set the arrived data block sequence k=0.
进一步的,步骤S4中建立短时交通流预测模型的实时序列学习部分包括以下步骤:Further, the real-time sequence learning part of establishing the short-term traffic flow prediction model in step S4 includes the following steps:
S41、计算新添加数据的隐层输出矩阵Hk+1:S41. Calculate the hidden layer output matrix H k+1 of newly added data:
S42、令 S42. Order
且则可计算输出权值:and Then the output weight can be calculated:
S43、根据公式求得S43, obtained according to the formula
S44、设置到达数据块序列k=k+1,表示滑动窗口向前移动一个位置,即滑动窗口大小为1,返回步骤S41。S44. Set the arriving data block sequence k=k+1, indicating that the sliding window moves forward by one position, that is, the size of the sliding window is 1, and return to step S41.
进一步的,步骤S5中完成短时交通流预测模型中的预测模块包括以下步骤:Further, completing the prediction module in the short-term traffic flow prediction model in step S5 includes the following steps:
S51、每当有新数据块k+1到达时,每个实时序列学习机训练β(k+1)来计算fk+2,其中fk+2表示k+2时刻预测的交通流数值;S51. Whenever a new data block k+1 arrives, each real-time sequence learning machine trains β (k+1) to calculate f k+2 , where f k+2 represents the traffic flow value predicted at k+2 moment;
S52、将fk+2放入测试集来预测下一刻交通流数值;S52, put f k+2 into the test set to predict the traffic flow value at the next moment;
S53、只要还有新的数据块到达,就返回步骤S51;S53, as long as there are new data blocks arriving, return to step S51;
S54、根据公式计算加权平均值,实现加权融合机制;S54, according to the formula Calculate the weighted average to realize the weighted fusion mechanism;
其中,设单个实时极限学习网络个数为L,每个网络有着相同数量的隐层结点和激励函数。Among them, let the number of single real-time extreme learning networks be L, and each network has the same number of hidden layer nodes and activation functions.
进一步的,所述步骤S6的评价指标为:Further, the evaluation index of the step S6 is:
假设观测实际交通流量数据序列为fi或者Yp(t),预测交通流量值结果为ti或者Yr(t)Assume that the observed actual traffic flow data sequence is fi or Yp(t), and the predicted traffic flow value result is ti or Yr(t)
(1)绝对百分比误差(1) Absolute percentage error
(2)平均绝对百分比误差(2) Mean absolute percentage error
(3)均方根误差(3) root mean square error
(4)平均相对误差(4) Average relative error
平均相对误差反映的是交通流量预测值相对于真实值的偏离程度,其值越小表示预测效果越好;The average relative error reflects the degree of deviation of the traffic flow prediction value from the real value, and the smaller the value, the better the prediction effect;
(5)平均绝对误差(5) Mean absolute error
平均相对误差反映的是交通流量预测值与真实值之间误差的绝对值大小,其值越小表示预测效果越好;The average relative error reflects the absolute value of the error between the traffic flow prediction value and the real value, and the smaller the value, the better the prediction effect;
(6)均方误差(6) mean square error
该指标不仅反映了交通流量预测误差的大小,而且还反映了误差的离散分布情况。其值越小,表示误差离散程度越小,预测效果越好;This index not only reflects the magnitude of the traffic flow forecast error, but also reflects the discrete distribution of the error. The smaller the value, the smaller the error dispersion and the better the prediction effect;
(7)拟合度(7) Fitting degree
拟合度从交通流量的几何特征方面反映了交通流量预测曲线是否与实际观测曲线的变化趋势拟合。其值越大,说明交通流量的预测值越接近实际观测值,预测效果越好。The fitting degree reflects whether the traffic flow prediction curve fits the change trend of the actual observation curve from the aspect of the geometric characteristics of traffic flow. The larger the value, the closer the predicted value of traffic flow is to the actual observed value, and the better the prediction effect.
与现有技术相比,本发明的优点在于:Compared with the prior art, the present invention has the advantages of:
基于简化的单隐层前馈神经网络结构,能够在交通流峰值期快速的训练历史数据并能增量地更新到达的数据,在保证一定预测精度的同时节省学习时间。此外,采用融合机制保证了短时交通流预测的稳定性和鲁棒性。本发明在数据缺失和波动剧烈时期进行重构,训练阶段时耗短,且预测结果的均方根误差、标准误差百分比均在置信区域内。Based on the simplified single hidden layer feed-forward neural network structure, it can quickly train historical data and incrementally update the arriving data during the peak period of traffic flow, saving learning time while ensuring a certain prediction accuracy. In addition, the adoption of fusion mechanism ensures the stability and robustness of short-term traffic flow prediction. The invention performs reconstruction in the period of data absence and severe fluctuation, and the time consumption of the training stage is short, and the root mean square error and standard error percentage of the prediction result are all within the confidence region.
附图说明Description of drawings
图1为本发明所述短时交通流预测流程图;Fig. 1 is the flow chart of short-term traffic flow prediction of the present invention;
图2为本发明所述基于融合的实时极限学习机预测算法实现流程图;Fig. 2 is the realization flowchart of the real-time extreme learning machine prediction algorithm based on fusion of the present invention;
图3为本发明所述实例场景一中探测点US101N处在高峰期5:00AM-10:00AM期间无缺失数据情况下的实际交通流和预测交通流值对比图;Fig. 3 is a comparison diagram of the actual traffic flow and the predicted traffic flow value when the detection point US101N is in the peak period 5:00AM-10:00AM in the first example scene of the present invention without missing data;
图4为本发明所述实例场景一中探测点US101N处在高峰期5:00PM-10:00PM期间无缺失数据情况下的实际交通流和预测交通流值对比图;Fig. 4 is a comparison chart of the actual traffic flow and the predicted traffic flow value under the condition that the detection point US101N is in the peak period 5:00PM-10:00PM in the example scene one of the present invention without missing data;
图5为本发明所述实例场景一中探测点US101N处在两个高峰期间无缺失数据情况下的APE值对比图;Fig. 5 is a comparison chart of APE values under the condition that the detection point US101N is in the case of no missing data during the two peak periods in the example scenario 1 of the present invention;
图6为本发明所述实例场景二中探测点SR120E处在高峰期5:00AM-10:00AM期间存在缺失数据情况下的实际交通流和预测交通流值对比图;Fig. 6 is a comparison diagram of the actual traffic flow and the predicted traffic flow value under the condition that the detection point SR120E is in the peak period 5:00AM-10:00AM in the second example scene of the present invention when there is missing data;
图7为本发明所述实例场景二中探测点SR120E处在高峰期5:00PM-10:00PM期间存在缺失数据情况下的实际交通流和预测交通流值对比图;Fig. 7 is a comparison diagram of the actual traffic flow and the predicted traffic flow value under the condition that the detection point SR120E is in the peak period of 5:00PM-10:00PM in the second example scene of the present invention when there is missing data;
图8为本发明所述实例场景二中探测点SR120E处在两个高峰期间存在缺失数据情况下的APE值对比图。Fig. 8 is a comparison chart of APE values in the case of missing data during the two peak periods at the detection point SR120E in the second example scenario of the present invention.
具体实施方式Detailed ways
以下结合说明书附图和具体优选的实施例对本发明作进一步描述,但并不因此而限制本发明的保护范围。The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments, but the protection scope of the present invention is not limited thereby.
参照图1、图2所示,基于融合的实时序列极限学习机短时交通流预测方法主要包括如下步骤:Referring to Figure 1 and Figure 2, the fusion-based real-time sequence extreme learning machine short-term traffic flow prediction method mainly includes the following steps:
步骤1、采集短时交通流数据。本发明通过PeMS系统随机选取了美国加州四个高速公路的探测点来获取交通流历史数据并进行预测分析。并且随机选取了2014-11-24到2014-12-1期间的交通流数据,该数据包括平常工作日和节假日的交通流数据,可以代表平稳和非平稳时期的场景。其中,前7天的数据作为训练集,最后一天的数据作为测试集。采用这些数据分别预测一天内的交通流峰值期5:00-10:00AM和5:00-10:00PM期间的数据完整和数据缺失两种情况下的交通流变化。该发明将采集的数据分为两个部分:场景一,利用采集的数据集,进行学习之后,预测一天当中交通流高峰期的完整数据;场景二,针对交通流数据缺失的情况下,利用改进的实时极限学习机框架进行数据重构预测,加入融合机制并验证,在保证一定预测精度的前提下使得预测结果更加平稳。Step 1. Collect short-term traffic flow data. The present invention uses the PeMS system to randomly select the detection points of four highways in California, USA to obtain the traffic flow historical data and perform prediction analysis. And randomly select the traffic flow data from 2014-11-24 to 2014-12-1, the data includes the traffic flow data of normal working days and holidays, which can represent the scenes of smooth and non-stationary periods. Among them, the data of the first 7 days is used as the training set, and the data of the last day is used as the test set. These data are used to predict the traffic flow changes in the two cases of complete data and missing data during the peak traffic flow period of 5:00-10:00AM and 5:00-10:00PM in a day. The invention divides the collected data into two parts: Scenario 1, using the collected data set to learn and predict the complete data of the traffic flow peak period in a day; Scenario 2, in the case of lack of traffic flow data, use the improved The real-time extreme learning machine framework is used for data reconstruction prediction, and the fusion mechanism is added and verified to make the prediction result more stable under the premise of ensuring a certain prediction accuracy.
步骤2、导入采集的短时交通流数据。如果成功,则继续交通场景的判断,如果失败则退出,重新导入数据。Step 2. Import the collected short-term traffic flow data. If it succeeds, continue to judge the traffic scene, if it fails, exit and re-import the data.
步骤3、预处理采集的短时交通流数据。作为短时交通流预测的基础,数据质量对短时预测模型的有效性有重要作用。判断交通流量数据是否异常,需要进行判断:车辆在道路上的行驶满足一定规律,故采集到的交通流数据必属于以下两种情况之一:(1)若车流量flow>0,速度speed>0以及占有率occupancy>0;(2)若flow=0,则speed=0。不满足以上条件中任何一个的交通流数据则被认为是明显异常数据。预处理后再对数据进行归一化处理,归一化是为了加快预测计算速度,减少时间消耗。Step 3. Preprocessing the collected short-term traffic flow data. As the basis of short-term traffic flow forecasting, data quality plays an important role in the effectiveness of short-term forecasting models. To judge whether the traffic flow data is abnormal, it needs to be judged: the driving of vehicles on the road satisfies certain rules, so the collected traffic flow data must belong to one of the following two situations: (1) If the traffic flow flow>0, speed> 0 and occupancy>0; (2) if flow=0, then speed=0. Traffic flow data that does not meet any of the above conditions is considered to be obviously abnormal data. After preprocessing, the data is normalized. Normalization is to speed up the prediction calculation speed and reduce time consumption.
步骤4、初始化短时交通流预测模型。初始数据集为随机分配预测模型的输入参数,包括输入结点和隐结点之间的权向量wi、阈值bi,并随机选取隐结点的输入权值ai和阈值bi,其中i=1,2,…,L;Step 4. Initialize the short-term traffic flow prediction model. The initial dataset is Randomly assign the input parameters of the prediction model, including the weight vector wi and the threshold bi between the input node and the hidden node, and randomly select the input weight ai and threshold bi of the hidden node, where i=1,2,..., L;
步骤5、建立实时序列学习机制。在该算法中,采用滑动窗口,根据已训练、新到达交通流数据的时空关系,动态滑动,原先的数据在学习完成后就会随着滑动窗口的移动而被抛弃不再使用,而新到达的数据可以一个一个或一块一块地添加到网络中。随着第k+1步所添加的数据块不断到达,其中Nk+1表示第k+1步添加数据的个数,计算新添加数据的隐层输出矩阵Hk+1:Step 5, establishing a real-time sequence learning mechanism. In this algorithm, a sliding window is used to dynamically slide according to the spatio-temporal relationship of the trained and newly arrived traffic flow data. The data can be added to the network one by one or piece by piece. As the data blocks added in step k+1 continue to arrive, where N k+1 represents the number of data added in step k+1, calculate the hidden layer output matrix H k+1 of the newly added data:
再令计算输出权值:Reorder Calculate output weights:
设置到达的数据块序列为k=k+1,返回步骤4。Set the arriving data block sequence as k=k+1, and return to step 4.
步骤6、加入自适应丢弃机制。对于已经训练过的数据,它对新添加的数据影响不大,根据距离目标时间的先后以及数据本身特征分配不同的权值,自适应丢弃部分,再与新训练样本一起进行预测,体现数据异质性的同时保证预测的准确性。Step 6. Add an adaptive discarding mechanism. For the data that has been trained, it has little effect on the newly added data. Different weights are assigned according to the sequence of distance from the target time and the characteristics of the data itself, and the part is discarded adaptively, and then predicted together with the new training samples to reflect the difference in data. Qualitative while ensuring the accuracy of the forecast.
步骤7、加入加权平均融合机制。加权平均融合机制是考虑了多个相同结构的实时序列学习机的影响,将多个将多个预测的结果按照下列公式进行加权平均: Step 7. Add a weighted average fusion mechanism. The weighted average fusion mechanism takes into account the influence of multiple real-time sequence learning machines with the same structure, and performs weighted average of multiple predicted results according to the following formula:
步骤8、实验结果进行反归一化,再进行评估及分析。为了直观地体现峰值期短时交通流数据的预测值和实际观测值,本发明分别选取探测点US101N和探测点SR120EN两处的在2014年12月1日两个交通流峰值期的观测数据和预测数据,对比不同的预测算法,画出图像如下图3至图8。其中,黑色虚线代表实际观测值,其余分别代表多层感知神经网络预测的交通流、小波神经网络的预测值、极限学习机预测的交通流、融合实时极限学习机预测的交通流。场景一中,探测点US101N处交通流预测值和实际值对比,其预测间隔是5分钟,融合实时极限学习机的预测值最贴近实际观测值趋势。而多层感知神经网络在交通流数据波动较大的时候波动也很明显,预测误差较大。从图3可以看出由ERS-ELM计算出来的预测值是最接近真实的交通流趋势,即使在6:10-7:00AM这段波动比较大的时间段内,所以大部分的APE误差是最小的。极限学习机是第二好的算法,但在6:00-7:05AM期间的误差值仍然很大。在6:00-7:30AM期间,小波神经网络得到的预测值是偏离实际值最远的。而5:00-10:00PM期间的交通流数据波动呈锯齿状,给预测带来了很大的困难。图4表示了由ERS-ELM计算出的预测值是最接近实际的交通流趋势,而其他三个算法得到的预测值都不如ERS-ELM。同样,APE值也是最小的。Step 8. The experimental results are denormalized, and then evaluated and analyzed. In order to intuitively reflect the predicted value and actual observed value of short-term traffic flow data in the peak period, the present invention respectively selects the observation data and Forecast data, compare different prediction algorithms, and draw images as shown in Figure 3 to Figure 8. Among them, the black dotted line represents the actual observation value, and the rest represent the traffic flow predicted by the multi-layer perceptual neural network, the predicted value of the wavelet neural network, the traffic flow predicted by the extreme learning machine, and the traffic flow predicted by the fusion of real-time extreme learning machine. In Scenario 1, the predicted value of the traffic flow at the detection point US101N is compared with the actual value. The prediction interval is 5 minutes, and the predicted value integrated with the real-time extreme learning machine is closest to the trend of the actual observed value. However, the multi-layer perceptual neural network fluctuates obviously when the traffic flow data fluctuates greatly, and the prediction error is relatively large. It can be seen from Figure 3 that the predicted value calculated by ERS-ELM is the closest to the real traffic flow trend, even in the time period of 6:10-7:00 AM with relatively large fluctuations, so most of the APE error is the smallest. Extreme learning machine is the second best algorithm, but the error value during 6:00-7:05AM is still very large. During 6:00-7:30AM, the predicted value obtained by the wavelet neural network deviates the farthest from the actual value. However, the fluctuation of traffic flow data during 5:00-10:00PM is jagged, which brings great difficulties to prediction. Figure 4 shows that the predicted value calculated by ERS-ELM is the closest to the actual traffic flow trend, while the predicted values obtained by the other three algorithms are not as good as ERS-ELM. Likewise, the APE value is also the smallest.
场景二中交通流高峰期存在损坏的交通流数据。每次运行算法,该算法都需要通过重构的交通流趋势曲线来预测损坏的数据。从实验结果可以看出,提出的算法可以更快地学习历史数据,通过滑动窗口和增量地添加上一刻预测的数据来获得下一刻预测的数据,且预测精度比较好。图中可以看出,在5:20-6:05AM,6:55-7:10PM,8:45-9:00PM这三个时间段存在损坏,为了更好的比较实际交通流数值和预测的交通流数值之间的误差,本文画出了全部的曲线,包括发生损坏而缺失的交通流数据部分。图中采用框图标出来。从图中可以看出,在趋势平稳的时间段,四种预测方法都能很好的匹配,但在损坏阶段,ERS-ELM是匹配交通流趋势匹配得最好的一种算法。比如,在图7中,在6:55-7:10PM,8:45-9:00PM时间段,只有ERS-ELM预测出了一个转角趋势,而其他三种算法都只是简单地追随已有的趋势。ELM的预测性能仅次于ERS-ELM算法,而小波神经网络和多层感知神经网络没有适应波动点。There is corrupted traffic flow data during the peak traffic flow in Scenario 2. Every time the algorithm is run, the algorithm needs to predict the corrupted data by reconstructing the traffic flow trend curve. It can be seen from the experimental results that the proposed algorithm can learn historical data faster, and obtain the next-moment forecast data by sliding the window and incrementally adding the previous-moment forecast data, and the prediction accuracy is better. It can be seen from the figure that there is damage in the three time periods of 5:20-6:05AM, 6:55-7:10PM, and 8:45-9:00PM. In order to better compare the actual traffic flow value with the predicted The error between the traffic flow values, this paper draws all the curves, including the missing part of the traffic flow data due to damage. The diagram is shown with a box icon. It can be seen from the figure that in the time period when the trend is stable, the four prediction methods can be well matched, but in the damage stage, ERS-ELM is the algorithm that matches the traffic flow trend best. For example, in Figure 7, in the time period of 6:55-7:10PM and 8:45-9:00PM, only ERS-ELM predicts a corner trend, while the other three algorithms simply follow the existing trend. The prediction performance of ELM is second only to the ERS-ELM algorithm, while the wavelet neural network and multi-layer perceptual neural network do not adapt to fluctuation points.
我们还可以得出以下结论,交通流趋势越是平稳,预测精度越高。但数据集并不是影响预测性能的唯一因素。在非平稳场景下,比如存在损坏的数据,算法需要有效的学习历史数据间的关系并分析数据变化趋势。ERS-ELM可以很好地很快地学习历史数据,并且通过可变大小的滑动窗口来保持高预测精度。We can also draw the following conclusion that the smoother the traffic flow trend, the higher the prediction accuracy. But the dataset is not the only factor that affects predictive performance. In non-stationary scenarios, such as corrupted data, the algorithm needs to effectively learn the relationship between historical data and analyze the trend of data changes. ERS-ELM can learn historical data well and quickly, and maintain high prediction accuracy through variable-sized sliding windows.
本发明实施例使用的数据来自开放性能评价系统平台PeMS 14.0。The data used in the embodiment of the present invention comes from the open performance evaluation system platform PeMS 14.0.
上述只是本发明的较佳实施例,并非对本发明作任何形式上的限制。虽然本发明已以较佳实施例揭露如上,然而并非用以限定本发明。任何熟悉本领域的技术人员,在不脱离本发明技术方案范围的情况下,都可利用上述揭示的技术内容对本发明技术方案做出许多可能的变动和修饰,或修改为等同变化的等效实施例。因此,凡是未脱离本发明技术方案的内容,依据本发明技术实质对以上实施例所做的任何简单修改、等同变化及修饰,均应落在本发明技术方案保护的范围内。The above are only preferred embodiments of the present invention, and do not limit the present invention in any form. Although the present invention has been disclosed above with preferred embodiments, it is not intended to limit the present invention. Any person familiar with the art, without departing from the scope of the technical solution of the present invention, can use the technical content disclosed above to make many possible changes and modifications to the technical solution of the present invention, or modify it into an equivalent implementation of equivalent changes example. Therefore, any simple modifications, equivalent changes and modifications made to the above embodiments according to the technical essence of the present invention shall fall within the protection scope of the technical solution of the present invention.
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610190046.3A CN105761488B (en) | 2016-03-30 | 2016-03-30 | Real-time extreme learning machine Short-time Traffic Flow Forecasting Methods based on fusion |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610190046.3A CN105761488B (en) | 2016-03-30 | 2016-03-30 | Real-time extreme learning machine Short-time Traffic Flow Forecasting Methods based on fusion |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105761488A CN105761488A (en) | 2016-07-13 |
CN105761488B true CN105761488B (en) | 2018-11-23 |
Family
ID=56346724
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610190046.3A Active CN105761488B (en) | 2016-03-30 | 2016-03-30 | Real-time extreme learning machine Short-time Traffic Flow Forecasting Methods based on fusion |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105761488B (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106297285B (en) * | 2016-08-17 | 2018-09-21 | 重庆大学 | Freeway traffic operating status fuzzy synthetic appraisement method based on changeable weight |
CN109923595B (en) * | 2016-12-30 | 2021-07-13 | 同济大学 | An urban road traffic anomaly detection method based on floating car data |
CN108674413B (en) * | 2018-05-18 | 2021-02-19 | 广州小鹏汽车科技有限公司 | Vehicle and pedestrian collision prevention method and system |
CN108770010B (en) * | 2018-06-26 | 2021-12-14 | 南京航空航天大学 | A Service-Oriented Intelligent Reconfiguration Method for Networking Mode of Wireless Networks |
CN109377752A (en) * | 2018-10-19 | 2019-02-22 | 桂林电子科技大学 | Short-term traffic flow change prediction method, device, computer equipment and storage medium |
CN110708129B (en) * | 2019-08-30 | 2023-01-31 | 北京邮电大学 | A method for acquiring wireless channel state information |
CN110517494A (en) * | 2019-09-03 | 2019-11-29 | 中国科学院自动化研究所 | Integrated learning-based traffic flow forecasting model, forecasting method, system, and device |
CN110675623B (en) * | 2019-09-06 | 2020-12-01 | 中国科学院自动化研究所 | Method, system and device for short-term traffic flow prediction based on hybrid deep learning |
CN110956807B (en) * | 2019-12-05 | 2021-04-09 | 中通服咨询设计研究院有限公司 | Highway flow prediction method based on combination of multi-source data and sliding window |
CN111179596B (en) * | 2020-01-06 | 2021-09-21 | 南京邮电大学 | Traffic flow prediction method based on group normalization and gridding cooperation |
CN111311905A (en) * | 2020-01-21 | 2020-06-19 | 北京工业大学 | Particle swarm optimization wavelet neural network-based expressway travel time prediction method |
CN111524348A (en) * | 2020-04-14 | 2020-08-11 | 长安大学 | A long-term and short-term traffic flow prediction model and method |
CN112070157B (en) * | 2020-09-07 | 2024-11-01 | 中南大学 | Multi-limit learning machine model construction method and device and computer equipment |
CN113537580B (en) * | 2021-06-28 | 2024-04-09 | 中科领航智能科技(苏州)有限公司 | Public transportation passenger flow prediction method and system based on self-adaptive graph learning |
CN113761020B (en) * | 2021-07-30 | 2023-06-13 | 北京交通大学 | Urban rail transit abnormal large passenger flow real-time prediction method |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101593424B (en) * | 2009-07-10 | 2010-09-29 | 浙江大学 | An Intelligent Combined Forecasting Method for Short-term Traffic Flow |
EP3435262A1 (en) * | 2010-03-15 | 2019-01-30 | Singapore Health Services Pte. Ltd. | A system for the detection of impending acute cardiopulmonary medical events |
CN104134079B (en) * | 2014-07-31 | 2017-06-16 | 中国科学院自动化研究所 | A kind of licence plate recognition method based on extremal region and extreme learning machine |
CN104616030B (en) * | 2015-01-21 | 2019-03-29 | 北京工业大学 | A kind of recognition methods based on extreme learning machine algorithm |
CN104992165A (en) * | 2015-07-24 | 2015-10-21 | 天津大学 | Extreme learning machine based traffic sign recognition method |
-
2016
- 2016-03-30 CN CN201610190046.3A patent/CN105761488B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN105761488A (en) | 2016-07-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105761488B (en) | Real-time extreme learning machine Short-time Traffic Flow Forecasting Methods based on fusion | |
CN110648014B (en) | A regional wind power forecasting method and system based on spatiotemporal quantile regression | |
CN107941537A (en) | A kind of mechanical equipment health state evaluation method | |
CN109215350B (en) | Short-term traffic state prediction method based on RFID electronic license plate data | |
CN110047291B (en) | Short-term traffic flow prediction method considering diffusion process | |
CN115953186B (en) | Network appointment vehicle demand pattern recognition and short-time demand prediction method | |
CN113159403B (en) | Intersection pedestrian track prediction method and device | |
CN103699771B (en) | A kind of sight-clustering method of cooling load prediction | |
CN104298881A (en) | Bayesian network model based public transit environment dynamic change forecasting method | |
CN112700162B (en) | Method and device for evaluating running state of rail transit air conditioner | |
CN104599500A (en) | Grey entropy analysis and Bayes fusion improvement based traffic flow prediction method | |
CN113496314B (en) | Method for predicting road traffic flow by neural network model | |
Li | Predicting short-term traffic flow in urban based on multivariate linear regression model | |
CN109993966A (en) | A method and device for constructing a user portrait | |
CN115756922A (en) | Fault prediction diagnosis method and device, electronic equipment and storage medium | |
CN115600833A (en) | Smart city restriction scheme determination method, internet of things system, device and medium | |
CN106526710A (en) | Haze prediction method and device | |
CN114287023A (en) | Multi-sensor learning system for traffic prediction | |
CN118313638B (en) | Method, device, equipment and medium for predicting long-term travel demand of inter-city network about car based on SRFE-BLP-converter | |
CN116189425A (en) | A method and system for predicting traffic conditions based on Internet of Vehicles big data | |
Srinivasarao et al. | Deep learning based condition monitoring of road traffic for enhanced transportation routing | |
CN110796301B (en) | Passenger flow prediction method and device based on IC card data | |
CN118195296A (en) | Scenic spot crowd gathering risk assessment and early warning method and system based on multiple features | |
KR20210128823A (en) | Crossroads LOS Prediction Method Based on Big Data and AI, and Storage Medium Having the Same | |
CN115620524A (en) | A traffic jam prediction method, system, device and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200805 Address after: 410000 building A4, block a, poly west coast, Yuelu District, Changsha City, Hunan Province Patentee after: OuYang Bo Address before: Yuelu District City, Hunan province 410082 Changsha Lushan Road No. 1 Patentee before: HUNAN University |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20210329 Address after: Room 2303, 23 / F, building F4, luguyuyuan, 27 Wenxuan Road, high tech Development Zone, Changsha, Hunan 410000 Patentee after: HUNAN XIANGJIANG WISDOM TECHNOLOGY Co.,Ltd. Address before: 410000 building A4, area a, Baoli west coast, Yuelu District, Changsha City, Hunan Province Patentee before: OuYang Bo |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Short term traffic flow prediction method based on fusion of real time extreme learning machine Effective date of registration: 20210629 Granted publication date: 20181123 Pledgee: Huarong Xiangjiang Bank Co.,Ltd. Xiangjiang New Area Branch Pledgor: HUNAN XIANGJIANG WISDOM TECHNOLOGY Co.,Ltd. Registration number: Y2021430000027 |