CN110691319B

CN110691319B - A method for realizing high-precision indoor positioning of heterogeneous devices using domain adaptation

Info

Publication number: CN110691319B
Application number: CN201910828131.1A
Authority: CN
Inventors: 刘楠; 刘静; 潘志文
Original assignee: Southeast University
Current assignee: Southeast University
Priority date: 2019-09-03
Filing date: 2019-09-03
Publication date: 2021-06-01
Anticipated expiration: 2039-09-03
Also published as: CN110691319A

Abstract

The present invention provides a method for realizing high-precision indoor positioning of heterogeneous equipment using domain adaptation, which minimizes the difference between the two domains by aligning the fingerprint vector collected by the online positioning terminal and the second-order statistical information of the fingerprint database fingerprint in the offline stage. offset, and does not require any information about the terminal label. Based on the transfer learning framework, domain adaptation and elimination of terminal differences are combined to improve the scalability of the positioning system. Before the classifier is trained, the fingerprint database collected by the fixed terminal in the offline phase is used as the target feature, and the source-target whitening alignment of the fingerprint of any terminal in the online phase can greatly reduce the damage to the localization performance caused by the heterogeneity. The method of the invention realizes on-line adjustment simply and quickly, and achieves ideal performance in actual multi-terminal positioning.

Description

A method for realizing high-precision indoor positioning of heterogeneous devices using domain adaptation

技术领域technical field

本发明属于定位技术领域，涉及基于WLAN的室内定位技术，尤其涉及一种使用领域自适应实现异构设备高精度室内定位的方法。The invention belongs to the technical field of positioning, relates to an indoor positioning technology based on WLAN, and in particular relates to a method for realizing high-precision indoor positioning of heterogeneous equipment by using field adaptation.

背景技术Background technique

基于终端的指纹定位系统的精度取决于终端接收到的来自各个接入点AP的通信包，RSS值是实现定位的基础。不同的数据采集设备配备不同的硬件芯片和天线，可能具有不同的信号感知能力，因而会产生不同的数据分布。当在线阶段信号分布发生变化，需要重新采集更新新的指纹库为保持系统的定位精度，而大型复杂建筑中采集并维护信号强度指纹库极其耗费人力资源。终端版本迭代速度之快使得指纹库的采集设备并不能覆盖市面上所有的智能终端。在实际的定位系统中，用户使用的各种移动设备不同于用于构建无线电地图的设备，严重影响定位系统的推广和使用。此外，在异构设备定位时，易出现终端域偏移，从而造成定位性能下降，定位精度不足。The accuracy of the terminal-based fingerprint positioning system depends on the communication packets received by the terminal from each access point AP, and the RSS value is the basis for positioning. Different data acquisition devices are equipped with different hardware chips and antennas, and may have different signal perception capabilities, thus resulting in different data distributions. When the signal distribution changes in the online phase, it is necessary to re-collect and update a new fingerprint database to maintain the positioning accuracy of the system. In large and complex buildings, collecting and maintaining the signal strength fingerprint database is extremely labor-intensive. The terminal version iteration speed is so fast that the collection device of the fingerprint database cannot cover all smart terminals on the market. In the actual positioning system, various mobile devices used by users are different from those used to construct radio maps, which seriously affects the promotion and use of the positioning system. In addition, during the positioning of heterogeneous devices, terminal domain offset is prone to occur, resulting in a decrease in positioning performance and insufficient positioning accuracy.

发明内容SUMMARY OF THE INVENTION

为解决上述问题，本发明提出了一种使用领域自适应实现异构设备高精度室内定位的方法，，以实现对异构设备在线定位时更好的定位性能。本发明利用相关对齐最小化源数据和目标数据边际布的距离。依照指纹库数据为参照，对离线阶段和在线阶段指纹的二阶统计特性进行变换对齐的操作。在混淆源域和目标域后，分类器无法区分源域和目标域，自然可以缓解设备差异性带来的影响。In order to solve the above problems, the present invention proposes a method for realizing high-precision indoor positioning of heterogeneous devices using field adaptation, so as to achieve better positioning performance when positioning heterogeneous devices online. The present invention utilizes relative alignment to minimize the distance of the source and target data margins. According to the fingerprint database data as a reference, the operation of transforming and aligning the second-order statistical characteristics of the fingerprints in the offline phase and the online phase is performed. After confusing the source domain and the target domain, the classifier cannot distinguish the source domain and the target domain, which can naturally alleviate the impact of device differences.

为了达到上述目的，本发明提供如下技术方案：In order to achieve the above object, the present invention provides the following technical solutions:

一种使用领域自适应实现异构设备高精度室内定位的方法，包括如下步骤：A method for realizing high-precision indoor positioning of heterogeneous devices using domain adaptation, comprising the following steps:

(1)通过常用终端在实际场景中模拟多种设备异构情况的定位过程；(1) Simulate the positioning process of a variety of equipment heterogeneous situations in actual scenarios through common terminals;

(2)对于室内定位系统，需对待测试环境均匀划分采样点，以采集并搭建周边各接入点的无线信号强度的指纹库；随后进行离线集成学习训练，训练阶段通过随机森林回归、多层感知机回归和多层感知机分类构成的集成学习，训练建立各个参考点位置标签与其对应指纹之间的映射关系，保存已训练的位置估计网络模型；最后进行在线位置估计测试，当在线用户接受到周围接入点的指纹时，经过相关对齐在线调整后，利用已训练的算法进行实时定位；(2) For the indoor positioning system, it is necessary to divide the sampling points evenly in the test environment to collect and build a fingerprint database of the wireless signal strength of each surrounding access point; then conduct offline integrated learning and training. The integrated learning composed of perceptron regression and multi-layer perceptron classification, training to establish the mapping relationship between the position labels of each reference point and their corresponding fingerprints, and saving the trained position estimation network model; finally, the online position estimation test is carried out. When the fingerprint of the surrounding access point is reached, after the relevant alignment is adjusted online, the trained algorithm is used for real-time positioning;

(5)在线调整阶段时，先对另一手机接收到的测试源域数据进行白化操作，消除源域数据的特征相关性；再计算指纹库中各训练目标域数据之间的相关性，将相关性作用在白化后的源数据上，重建源数据的特征；通过源域和目标域二阶统计信息之间的不断逼近，使得源域与目标域融合；二阶统计特性保留了数据特征对齐的同时还保留了分布信息，笼统的囊括了设备在芯片、天线方向和安装材料等差异导致的异构性；(5) During the online adjustment stage, first perform a whitening operation on the test source domain data received by another mobile phone to eliminate the feature correlation of the source domain data; and then calculate the correlation between the training target domain data in the fingerprint database. The correlation acts on the whitened source data to reconstruct the features of the source data; through the continuous approximation between the second-order statistical information of the source domain and the target domain, the source domain and the target domain are fused; the second-order statistical characteristics preserve the data feature alignment At the same time, it also retains the distribution information, which generally includes the heterogeneity of the device caused by the difference in the chip, antenna direction and installation material;

(6)实时接收的指纹经过已训练的位置估计网络，该算法可以预计出位置坐标。(6) The fingerprints received in real time pass through the trained position estimation network, and the algorithm can predict the position coordinates.

进一步的，所述步骤(2)中搭建指纹库步骤具体包括如下过程：Further, the step of building a fingerprint database in the step (2) specifically includes the following process:

(11)将待测区域按照面积均匀划分一定间隔的网格参考点，作为该定位区域的采样点；(11) The area to be measured is evenly divided into grid reference points with a certain interval according to the area, as the sampling points of the positioning area;

(12)在每个参考点上以终端A接受记录附近一段时间内所有接入点AP发送的无线信号强度，形成包含参考点位置[x_i,y_i]、各个AP的MAC地址和对应接收到信号强度[rss₁,rss₂,...,rss_n]的指纹向量([rss₁,rss₂,...,rss_n],[x_i,y_i])；(12) On each reference point, terminal A receives and records the wireless signal strengths sent by all access points APs in the vicinity for a period of time, and forms a form including the reference point position [x _i , y _i ], the MAC address of each AP and the corresponding reception to the fingerprint vector ([rss ₁ ,rss ₂ ,...,rss _n ],[x _i ,y _i ]) of signal strengths [rss ₁ ,rss ₂ ,...,rss _n ];

(13)每个参考点的指纹向量即可组合存储为该待测区域的离线指纹库。(13) The fingerprint vector of each reference point can be combined and stored as an offline fingerprint database of the area to be measured.

进一步的，所述步骤(2)中离线集成学习训练步骤包括如下过程：Further, the off-line integrated learning and training step in the step (2) includes the following process:

(11)对指纹库中RSS数据[rss₁,rss₂,...,rss_n]归一化为零均值和标准方差后，将离线数据集根据十折交叉验证法划分指纹库数据；指纹库数据被分为训练集，验证集和测试集；训练的过程中是在训练集上进行训练，最终的定位结果是在测试集上测试；(11) After normalizing the RSS data [rss ₁ , rss ₂ ,..., rss _n ] in the fingerprint database to zero mean and standard deviation, divide the offline data set into the fingerprint database data according to the ten-fold cross-validation method; The database data is divided into training set, validation set and test set; the training process is performed on the training set, and the final positioning result is tested on the test set;

(12)以随机森林回归、多层感知机回归和多层感知机分类模型作为基学习器在训练集和验证集上组合成集成学习，该训练过程的目标函数是编码后的神经元与有标签的位置之间的映射关系的均方误差，最小化目标函数优化参数；(12) Using random forest regression, multi-layer perceptron regression and multi-layer perceptron classification model as the base learner, the training set and the validation set are combined into an ensemble learning. The objective function of the training process is the encoded neurons and the The mean square error of the mapping relationship between the positions of the labels minimizes the objective function optimization parameters;

(13)采用预测位置与实际位置的距离作为精度的批判标准，记待定位区域有L个未知的未知参考点，其中第i个参考点的真实地理位置记为(x_i,y_i)，输入定位系统中估计的未知记为(x_i′,y_i′)，则ALE能够表示为：(13) The distance between the predicted position and the actual position is used as a critical criterion for accuracy, and it is recorded that there are L unknown unknown reference points in the area to be located, and the real geographical position of the i-th reference point is recorded as ( _xi , y _i ), The unknown estimated in the input positioning system is denoted as ( _xi ', y _i '), then ALE can be expressed as:

进一步的，所述步骤(12)包括如下过程：Further, the step (12) includes the following process:

基类学习器随机森林回归学习法，是由多个决策树组成；测量的信号[rss₁,rss₂,...,rss_n]的n个AP可被视为划分条件的特征，作为算法的输入数据；参考点位置标签[x_i,y_i]即相当于决策树的叶节点，即算法的二维输出结果；采用了启发式的节点选择法C4.5算法，先找出信息增益Ent(·)高于平均水平的属性，再从中选择具有最大信息增益率Gain(·)的属性，以构建决策树；决策树的搭建需要网格协调调整一些参数来有效防止过拟合和欠拟合；随机森林以分类结果中估计每个特征的重要性，将决策树输出中出现最多的类作为随机森林分类器的最终输出；The base class learner random forest regression learning method is composed of multiple decision trees; the n APs of the measured signals [rss ₁ , rss ₂ ,..., rss _n ] can be regarded as the characteristics of the division conditions, as the algorithm The input data of ; the reference point position label [x _i , y _i ] is equivalent to the leaf node of the decision tree, that is, the two-dimensional output result of the algorithm; using the heuristic node selection method C4.5 algorithm, first find out the information gain Ent(·) is higher than the average level of attributes, and then select the attributes with the largest information gain rate Gain(·) to build a decision tree; the construction of a decision tree requires grid coordination to adjust some parameters to effectively prevent overfitting and underfitting Fitting; random forest estimates the importance of each feature in the classification result, and takes the class that appears most in the output of the decision tree as the final output of the random forest classifier;

由输入数据集中较小子集组成的递归过程来训练每个决策树，直至所有树节点都达到类似的输出目标；随机森林分类器将基于输入的权值作为与决策树数量相似的参数；代替分类算法，RFR算法被选择来训练指纹与标签的映射，则特征划分标准分裂标准为最小均方差：A recursive process consisting of a smaller subset of the input dataset trains each decision tree until all tree nodes achieve a similar output goal; the random forest classifier takes input-based weights as parameters similar to the number of decision trees; instead The classification algorithm, the RFR algorithm is selected to train the mapping of fingerprints and labels, and the feature division standard splitting standard is the minimum mean square error:

式中，y_L*表示左子树的值，y_R*表示的右子树的值；In the formula, y _L* represents the value of the left subtree, and y _R* represents the value of the right subtree;

多层感知机算法是前馈型的人工神经网络；该算法中隐藏层、神经元个数和非线性的激活函数需要慎重选择；除了输入节点以外的每个节点都是非线性的激活函数作用下的神经元，使用多层感知机时采用了每层有100个神经元的两个隐藏层；在最小化损失函数时采用SGD法来更新参数：The multilayer perceptron algorithm is a feedforward artificial neural network; the hidden layer, the number of neurons and the nonlinear activation function in this algorithm need to be carefully selected; every node except the input node is under the action of the nonlinear activation function. The neurons of the multi-layer perceptron are used, and two hidden layers with 100 neurons in each layer are used; when the loss function is minimized, the SGD method is used to update the parameters:

其中，η是搜寻参数空间中控制步长的学习率，Loss是整个网络的损失函数；Among them, η is the learning rate of the control step size in the search parameter space, and Loss is the loss function of the entire network;

集成学习综合考虑多个基础学习器模型的结果；采用平均法结合策略：Ensemble learning comprehensively considers the results of multiple basic learner models; the average method is used to combine strategies:

其中，位置(x_i,y_i)就是各个基础学习器预测的结果，而W_i是基础学习器预测结果对于最终位置估计的权重。Among them, the position (x _i , y _i ) is the prediction result of each basic learner _, and Wi is the weight of the prediction result of the basic learner to the final position estimation.

进一步的，所述步骤(2)中在线位置估计步骤包括如下过程：Further, the online position estimation step in the step (2) includes the following process:

测试集中第i个测试点处采集到的来自n个AP的指纹

利用已经训练好的位置指纹特征与位置标签的网络模型，针对新输入的在线特征估计其对应的位置。Fingerprints from n APs collected at the i-th test point in the test set

Using the network model of the trained location fingerprint feature and location label, the corresponding location of the newly input online feature is estimated.

进一步的，所述步骤(3)中对在线特征白化和重建过程包括如下初始化阶段、建模阶段和优化阶段：Further, in the step (3), the online feature whitening and reconstruction process includes the following initialization stage, modeling stage and optimization stage:

(11)初始化阶段，对于给定测试集中数据作为源域的数据RSS_S＝(rss₁,…,rss_m)，其中，

希望其对齐的目标域的数据是指纹库中的训练数据R_T＝(r₁,…,r_k)，其中，

源域和目标域的特征都是n维向量；记这两个域特征向量的均值和协方差矩阵分别为μ_S、μ_T和Cov_S、Cov_T，经过中心化标准后，μ_S＝μ_T＝0，但是Cov_S≠Cov_T；(11) In the initialization phase, for the data in the given test set as the source domain data RSS _S =(rss ₁ ,...,rss _m ), where,

The data of the target domain whose alignment is desired is the training data R _T =(r ₁ ,...,r _k ) in the fingerprint library, where,

The features of the source domain and the target domain are all n-dimensional vectors; the mean and covariance matrices of the eigenvectors of these two domains are respectively μ _S , μ _T and Cov _S , Cov _T , after the centralization standard, μ _S = μ _T = 0, but Cov _S ≠ Cov _T ;

(12)建模阶段，为减小源域和目标域特征的二阶统计特性间的差距，即协方差Cov_S与Cov_T的距离，对源域特征作线性变化，最小化变化后源域特征与目标特征协方差的距离；则目标函数为：(12) In the modeling stage, in order to reduce the gap between the second-order statistical characteristics of the source domain and target domain features, that is, the distance between the covariance Cov _S and Cov _T , the source domain features are linearly changed to minimize the source domain after the change. The distance between the feature and the target feature covariance; then the objective function is:

其中，

表示线性变化后源域特征的协方差矩阵。in,

Represents the covariance matrix of the source domain features after linear variation.

(13)优化阶段，对目标函数的协方差矩阵做奇异值分解Cov_T＝U_TΣ_TV_T，其中，U_T[1:r]、Σ_T[1:r]V_T[1:r]分别为Cov_T的左奇异向量，最大的r个奇异值和对应的右奇异向量，那么

的最优解为

其中，

由于协方差矩阵是对称矩阵，对Cov_S特征分解得Cov_S＝U_SΣ_SV_S：(13) In the optimization stage, perform singular value decomposition on the covariance matrix of the objective function Cov _T =U _T Σ _T V _T , where U _T[1:r] and Σ _T[1:r] V _{T[1:r ]} are the left singular vector of Cov _T , the largest r singular values and the corresponding right singular vector, then

The optimal solution of is

in,

Since the covariance matrix is a symmetric matrix, the eigendecomposition of Cov _S can obtain Cov _S =U _S Σ _S V _S :

由

得，

则线性变化重建过程推导为：Depend on

have to,

Then the linear change reconstruction process is deduced as:

上式中，第一部分

为白化过程，第二部分

为对齐过程；In the above formula, the first part

for the whitening process, part 2

for the alignment process;

(14)对于需要在线定位的用户，提供测试点处指纹向量，分别计算目标域指纹库和源域在线测试指纹的协方差矩阵；对于源域协方差矩阵先做白化变换，以协方差矩阵广义逆矩阵的

次幂，消除源域数据的相关性；再以目标函数的二阶统计特性对白化特征作相关性叠加变换；(14) For users who need online positioning, provide the fingerprint vector at the test point, and calculate the covariance matrix of the target domain fingerprint database and the source domain online test fingerprint respectively. inverse matrix

power to eliminate the correlation of the source domain data; then use the second-order statistical characteristics of the objective function to perform correlation stacking transformation on the whitening features;

(15)CORAL对齐后的在线特征，利用原始训练集中数据进行位置估计。(15) The online features after CORAL alignment use the data in the original training set for location estimation.

与现有技术相比，本发明具有如下优点和有益效果：Compared with the prior art, the present invention has the following advantages and beneficial effects:

本发明采用相关对齐混淆训练集和测试集设备特征的方式，将实时指纹在线调整与训练指纹库中指纹对齐。本发明仅需对离线阶段和在线阶段指纹的二阶统计特性进行变换对齐的操作，而且不需要知道两个阶段使用的设备标签。整个定位系统只需要在离线阶段集成回归学习，训练建立位置坐标与多维特征的高维流形图映射关系，针对设备异构性的CORAL对齐在线定位阶段即可快速调整，最大限度地减少了域偏移，很好地补偿了由于终端域偏移而造成的定位性能下降问题。The invention adopts the method of confusing training set and test set equipment features by related alignment, and aligns the real-time fingerprint online with the fingerprint in the training fingerprint database. The invention only needs to perform transformation and alignment operations on the second-order statistical characteristics of the fingerprints in the offline stage and the online stage, and does not need to know the device labels used in the two stages. The entire positioning system only needs to integrate regression learning in the offline stage, training to establish a high-dimensional manifold graph mapping relationship between position coordinates and multi-dimensional features, and CORAL alignment for device heterogeneity can be quickly adjusted in the online positioning stage, which minimizes the number of domains. offset, which well compensates for the degradation of localization performance due to terminal domain offset.

附图说明Description of drawings

图1为本发明训练流程示意图。FIG. 1 is a schematic diagram of the training flow of the present invention.

图2为本发明在线调整阶段CORAL领域自适应过程，其中(a)为初始中心化阶段，(b)为白化源域特征阶段，(c)为重构源域特征阶段。Figure 2 shows the CORAL domain adaptation process in the online adjustment stage of the present invention, wherein (a) is the initial centralization stage, (b) is the whitening source domain feature stage, and (c) is the reconstructed source domain feature stage.

具体实施方式Detailed ways

以下将结合具体实施例对本发明提供的技术方案进行详细说明，应理解下述具体实施方式仅用于说明本发明而不用于限制本发明的范围。The technical solutions provided by the present invention will be described in detail below with reference to specific embodiments. It should be understood that the following specific embodiments are only used to illustrate the present invention and not to limit the scope of the present invention.

本发明提出了一种使用相关性对齐的领域自适应方法，能够补偿由于终端域偏移而造成的性能下降。The present invention proposes a domain adaptation method using correlation alignment, which can compensate for performance degradation due to terminal domain offset.

本发明提供的一种使用领域自适应实现异构设备高精度室内定位的方法，如图1所示，包括如下步骤：A method for realizing high-precision indoor positioning of heterogeneous devices provided by the present invention, as shown in FIG. 1, includes the following steps:

(1)首先考虑用户在线定位过程，机器学习的训练使得定位性能大大提升，因此对于定位系统的泛化要求也提高了。更新迭代速度如此之快的终端市场要求定位系统保持高精度定位的同时能够兼容多种型号的设备。我们通过常用终端在实际场景中模拟多种设备异构情况的定位过程。在现有的定位实验系统中，在离线阶段和在线阶段都使用不同的设备来收集RSS指纹值(例如采用小米手机在线定位匹配的指纹库是由华为手机采集构建)，构造设备异构定位的情况。(1) First, consider the online positioning process of users. The training of machine learning greatly improves the positioning performance, so the generalization requirements for the positioning system are also improved. End markets with such rapid iterations require positioning systems to maintain high-precision positioning while being compatible with multiple models of devices. We use common terminals to simulate the positioning process of various equipment heterogeneous situations in actual scenarios. In the existing positioning experiment system, different devices are used to collect RSS fingerprint values in the offline and online stages (for example, the fingerprint database for online positioning and matching of Xiaomi mobile phones is collected and constructed by Huawei mobile phones), and the heterogeneous positioning of devices is constructed. Happening.

(2)对于室内定位系统而言，我们首先需要对待测试环境均匀划分采样点，本例中，采样点之间间隔1.6m*1.6m，以采集整理周边各接入点的无线信号强度的指纹库。训练阶段，通过随机森林回归、多层感知机回归和多层感知机分类构成的集成学习，训练建立各个参考点位置标签与其对应指纹之间的映射关系，保存已训练的位置估计网络模型。当在线用户接受到周围接入点的指纹时，经过相关对齐在线调整后，可以利用已训练的算法进行实时定位。(2) For the indoor positioning system, we first need to evenly divide the sampling points in the test environment. In this example, the interval between sampling points is 1.6m*1.6m to collect and sort out the fingerprints of the wireless signal strength of the surrounding access points. library. In the training stage, through the ensemble learning consisting of random forest regression, multi-layer perceptron regression and multi-layer perceptron classification, the training establishes the mapping relationship between each reference point position label and its corresponding fingerprint, and saves the trained position estimation network model. When online users receive the fingerprints of surrounding access points, they can use the trained algorithm for real-time positioning after online adjustment of relevant alignment.

具体的说，步骤(2)包括搭建指纹库，离线集成学习训练和在线位置估计测试过程。Specifically, step (2) includes building a fingerprint database, offline ensemble learning and training, and online position estimation testing.

其中，搭建指纹库包括如下步骤：The building of the fingerprint database includes the following steps:

(11)将待测区域按照面积均匀划分1-2m间隔的网格参考点，作为该定位区域的采样点。(11) The area to be measured is evenly divided into grid reference points with an interval of 1-2m according to the area, as the sampling points of the positioning area.

(12)在每个参考点上以终端A接受记录附近1分钟所有接入点AP发送的无线信号强度，形成包含参考点位置[x_i,y_i]、各个AP的MAC地址和对应接收到信号强度[rss₁,rss₂,...,rss_n]的指纹向量([rss₁,rss₂,...,rss_n],[x_i,y_i])。(12) At each reference point, terminal A receives and records the wireless signal strengths sent by all access points APs in the vicinity for 1 minute, and forms a form including the reference point position [x _i , y _i ], the MAC address of each AP and the corresponding received Fingerprint vector ([rss ₁ ,rss ₂ ,...,rss _n ],[x _i ,y _i ]) of signal strengths [rss ₁ ,rss ₂ ,...,rss _n ].

离线集成学习训练包括如下步骤：Offline ensemble learning training includes the following steps:

(11)对指纹库中RSS数据[rss₁,rss₂,...,rss_n]归一化为零均值和标准方差后，将离线数据集根据十折交叉验证法划分指纹库数据。指纹库数据被分为训练集，验证集和测试集。训练的过程中是在训练集上进行训练，在验证集上验证定位精度来提高训练能力，最终的定位结果是在测试集上测试，在保证高精度时还防止过拟合的出现。(11) After normalizing the RSS data [rss ₁ , rss ₂ ,..., rss _n ] in the fingerprint database to zero mean and standard deviation, divide the offline data set into the fingerprint database data according to the ten-fold cross-validation method. The fingerprint database data is divided into training set, validation set and test set. During the training process, training is performed on the training set, and the positioning accuracy is verified on the validation set to improve the training ability. The final positioning result is tested on the test set, which prevents overfitting while ensuring high accuracy.

(12)以随机森林回归、多层感知机回归和多层感知机分类模型作为基学习器在训练集和验证集上组合成集成学习，该训练过程的目标函数是编码后的神经元(即重构的无线信号指纹特征)与有标签的位置之间的映射关系的均方误差，最小化目标函数优化参数。(12) Using random forest regression, multilayer perceptron regression and multilayer perceptron classification model as the base learner to combine the training set and the validation set into ensemble learning, the objective function of the training process is the encoded neurons (ie The mean square error of the mapping relationship between the reconstructed wireless signal fingerprint feature) and the labelled position minimizes the objective function optimization parameters.

基类学习器随机森林回归学习法，是由多个决策树组成。测量的信号[rss₁,rss₂,...,rss_n]的n个AP可被视为划分条件的特征，作为算法的输入数据。参考点位置标签[x_i,y_i]即相当于决策树的叶节点，即算法的二维输出结果。采用了启发式的节点选择法C4.5算法，先找出信息增益Ent(·)高于平均水平的属性，再从中选择具有最大信息增益率Gain(·)的属性，以构建决策树。决策树的搭建也需要网格协调调整一些参数来有效防止过拟合和欠拟合，例如，特征选择标准、决策树最大深度、节点划分最小不纯度等等。参数的选择也是影响定位精度的另一因子。随机森林以分类结果中估计每个特征的重要性，将决策树输出中出现最多的类作为随机森林分类器的最终输出。The base class learner random forest regression learning method is composed of multiple decision trees. The _n APs of the measured signals [rss ₁ , rss ₂ , . The reference point position label [x _i , y _i ] is equivalent to the leaf node of the decision tree, that is, the two-dimensional output result of the algorithm. The heuristic node selection method C4.5 algorithm is adopted. First, find the attributes whose information gain Ent(·) is higher than the average level, and then select the attributes with the largest information gain rate Gain(·) to construct a decision tree. The construction of decision tree also requires grid coordination to adjust some parameters to effectively prevent overfitting and underfitting, such as feature selection criteria, maximum depth of decision tree, minimum impurity of node division, etc. The choice of parameters is also another factor that affects the positioning accuracy. Random forest estimates the importance of each feature in the classification result, and takes the class that appears most in the output of the decision tree as the final output of the random forest classifier.

由输入数据集中较小子集组成的递归过程来训练每个决策树，直至所有树节点都达到类似的输出目标。随机森林分类器将基于输入的权值作为与决策树数量相似的参数。代替分类算法，RFR(Random Forest Regression，随机森林回归学习算法)被选择来训练指纹与标签的映射，则特征划分标准分裂标准为最小均方差：A recursive process consisting of smaller subsets of the input dataset trains each decision tree until all tree nodes achieve a similar output goal. Random forest classifiers take input-based weights as parameters similar to the number of decision trees. Instead of the classification algorithm, RFR (Random Forest Regression, random forest regression learning algorithm) is selected to train the mapping of fingerprints and labels, and the feature division standard splitting standard is the minimum mean square error:

式中，y_L*表示左子树的值，y_R*表示的右子树的值。In the formula, y _L* represents the value of the left subtree, and y _R* represents the value of the right subtree.

多层感知机算法是前馈型的人工神经网络。该算法中隐藏层、神经元个数和非线性的激活函数也需要慎重选择。除了输入节点以外的每个节点都是非线性的激活函数作用下的神经元，我们使用多层感知机时采用了每层有100个神经元的两个隐藏层。为加速学习过程，在最小化损失函数时采用SGD(Stochastic GradientDescent，随机梯度下降法)来更新参数。The multilayer perceptron algorithm is a feedforward artificial neural network. The hidden layer, the number of neurons and the nonlinear activation function in the algorithm also need to be carefully selected. Every node except the input node is a neuron under the action of a non-linear activation function. We use a multilayer perceptron with two hidden layers of 100 neurons each. To speed up the learning process, SGD (Stochastic Gradient Descent) is used to update the parameters when minimizing the loss function.

其中，η是搜寻参数空间中控制步长的学习率，Loss是整个网络的损失函数。where η is the learning rate that controls the step size in the search parameter space, and Loss is the loss function of the entire network.

集成学习综合考虑多个基础学习器模型的结果。平均法结合策略的一个重要优点无需事先对每个基础弱学习器进行调参。平均法有效结合了多种基础学习器的算法来提高系统准确性，稳健性和通用性，是最受欢迎的投票方法之一。它的工作原理如下：虽然每种个体学习器都能估计位置，但是学习器性能不一样，采用平均法结合策略，可以防止基本学习器过度拟合。Ensemble learning comprehensively considers the results of multiple base learner models. An important advantage of the averaging combined strategy does not require prior tuning of each underlying weak learner. The averaging method effectively combines the algorithms of a variety of basic learners to improve the accuracy, robustness and generality of the system, and is one of the most popular voting methods. It works as follows: Although each individual learner can estimate the position, the performance of the learner is not the same, and the average method combined with the strategy can prevent the basic learner from overfitting.

其中，位置(x_i,y_i)就是各个基础学习器预测的结果，而W_i是基础学习器预测结果对于最终位置估计的权重。这里我们从简设置权重都为1。Among them, the position (x _i , y _i ) is the prediction result of each basic learner _, and Wi is the weight of the prediction result of the basic learner to the final position estimation. Here we simply set the weights to 1.

(13)虽然每种个体学习器都能估计位置，但是学习器性能不一样，采用平均法结合策略，可以防止基本学习器过度拟合。对于定位系统的衡量标准，这里采用预测位置与实际位置的距离作为精度的批判标准。记待定位区域有L个未知的未知参考点，其中第i个参考点的真实地理位置记为(x_i,y_i)，输入定位系统中估计的未知记为(x_i′,y_i′)，则ALE(average localization error，平均定位误差)可以表示为：(13) Although each individual learner can estimate the position, the performance of the learner is not the same. Using the average method combined with the strategy can prevent the basic learner from overfitting. For the measurement of the positioning system, the distance between the predicted position and the actual position is used here as a critical criterion for accuracy. Remember that there are L unknown unknown reference points in the area to be located, and the real geographic location of the ith reference point is recorded as ( _xi , y _i ), and the unknown estimated in the input positioning system is recorded as ( _xi ′, y _i ′ ), then ALE (average localization error, average localization error) can be expressed as:

在线位置估计过程包括如下步骤：The online location estimation process includes the following steps:

测试集中第i个测试点处采集到的来自n个AP的指纹

(3)在线调整阶段时，需要先对另一手机接收到的测试源域数据进行白化操作，消除源域数据的特征相关性；再计算指纹库中各训练目标域数据之间的相关性，将相关性作用在白化后的源数据上，重建源数据的特征。通过源域和目标域二阶统计信息之间的不断逼近，使得源域与目标域融合。二阶统计特性保留了数据特征对齐的同时还保留了分布信息，笼统的囊括了设备在芯片、天线方向和安装材料等差异导致的异构性。(3) During the online adjustment stage, it is necessary to perform a whitening operation on the test source domain data received by another mobile phone to eliminate the feature correlation of the source domain data; then calculate the correlation between the training target domain data in the fingerprint database, The correlation is applied to the whitened source data to reconstruct the characteristics of the source data. Through the continuous approximation between the second-order statistical information of the source domain and the target domain, the source domain and the target domain are fused. The second-order statistical properties retain the data feature alignment and distribution information, and generally include the heterogeneity caused by differences in the chip, antenna orientation, and mounting materials of the device.

步骤(3)中对在线特征白化和重建的步骤包括：初始化阶段、建模阶段和优化阶段，具体包括：The steps of whitening and reconstructing the online features in step (3) include: an initialization phase, a modeling phase and an optimization phase, specifically including:

源域和目标域的特征都是n维向量。记这两个域特征向量的均值和协方差矩阵分别为μ_S、μ_T和Cov_S、Cov_T，经过中心化标准后，μ_S＝μ_T＝0，但是Cov_S≠Cov_T。(11) In the initialization phase, for the data in the given test set as the source domain data RSS _S =(rss ₁ ,...,rss _m ), where,

The features of the source and target domains are both n-dimensional vectors. Denote the mean and covariance matrix of these two domain eigenvectors as μ _S , μ _T and Cov _S , Cov _T , respectively. After the centralization standard, μ _S = μ _T = 0, but Cov _S ≠ Cov _T .

(12)建模阶段，为减小源域和目标域特征的二阶统计特性间的差距，即协方差Cov_S与Cov_T的距离，对源域特征作线性变化，最小化变化后源域特征与目标特征协方差的距离即可。(12) In the modeling stage, in order to reduce the difference between the second-order statistical characteristics of the source domain and target domain features, that is, the distance between the covariance Cov _S and Cov _T , the source domain features are linearly changed to minimize the source domain after the change. The distance between the feature and the target feature covariance is sufficient.

则目标函数为：Then the objective function is:

其中，

表示线性变化后源域特征的协方差矩阵。in,

的最优解为

其中，

由于协方差矩阵是对称矩阵，对Cov_S特征分解很容易得Cov_S＝U_SΣ_SV_S：(13) In the optimization stage, perform singular value decomposition on the covariance matrix of the objective function Cov _T =U _T Σ _T V _T , where U _T[1:r] and Σ _T[1:r] V _{T[1:r ]} are the left singular vector of Cov _T , the largest r singular values and the corresponding right singular vector, then

The optimal solution of is

in,

Since the covariance matrix is a symmetric matrix, it is easy to decompose the Cov _S eigendecomposition to Cov _S =U _S Σ _S V _S :

由

得，

则线性变化重建过程可以很容易地推导为Depend on

have to,

Then the linearly varying reconstruction process can be easily deduced as

上式中，第一部分

为白化过程，第二部分

为对齐过程In the above formula, the first part

for the whitening process, part 2

for the alignment process

CORAL并不是简单地执行特征规范化，而是将两个不同的分布对齐。由于相关性对齐只改变特征，因此它可以应用于任何基本分类器。高效率地使得目标领域变化，避免了流形学习中的子空间投影以及维度选择的困难。Instead of simply performing feature normalization, CORAL aligns two different distributions. Since correlation alignment only changes features, it can be applied to any basic classifier. The target domain is changed efficiently, avoiding the difficulty of subspace projection and dimension selection in manifold learning.

(14)对于需要在线定位的用户，提供测试点处指纹向量，分别计算目标域指纹库和源域在线测试指纹的协方差矩阵。对于源域协方差矩阵先做白化变换，以协方差矩阵广义逆矩阵的

次幂，消除源域数据的相关性。再以目标函数的二阶统计特性对白化特征作相关性叠加变换。(14) For users who need online positioning, provide the fingerprint vector at the test point, and calculate the covariance matrix of the target domain fingerprint database and the source domain online test fingerprint respectively. For the source domain covariance matrix, whitening transformation is performed first, and the generalized inverse matrix of the covariance matrix is used.

power to remove the correlation of the source domain data. Then use the second-order statistical characteristics of the objective function to perform correlation stacking transformation on the whitening features.

(15)CORAL对齐后的在线特征，可以利用原始训练集中数据进行位置估计。(15) The online features after CORAL alignment can use the data in the original training set for location estimation.

(4)依赖于目标域融合的新的测试数据，很好的补偿了设备异构性带来的影响，可以经过已训练的位置估计网络得到预计的位置坐标。防止随机性给实验结果带来的误差，我们分别在对多种异构设备在线定位进行模拟。(4) Relying on the new test data fused by the target domain, the influence of equipment heterogeneity is well compensated, and the estimated position coordinates can be obtained through the trained position estimation network. To prevent errors caused by randomness to the experimental results, we are simulating the online positioning of various heterogeneous devices respectively.

本发明通过对齐在线定位终端采集的指纹向量与离线阶段指纹库指纹的二阶统计信息来最小化两个领域之间的偏移，而且不需要任何关于终端标签的信息。基于迁移学习框架，将领域自适应与消除终端差异性结合，以提高定位系统的延展性。在分类器训练之前，以离线阶段固定终端采集的指纹库作为目标特征，对在线时的任意终端指纹的源目标白化对齐，即可大大削减异构性带来的对定位性能的损害。该算法简洁快速的实现了在线调整，在实际多终端定位时取得了理想的性能。The present invention minimizes the offset between the two fields by aligning the fingerprint vector collected by the online positioning terminal with the second-order statistical information of the fingerprint database fingerprint in the offline phase, and does not require any information about the terminal label. Based on the transfer learning framework, domain adaptation and elimination of terminal differences are combined to improve the scalability of the positioning system. Before the classifier is trained, the fingerprint database collected by the fixed terminal in the offline phase is used as the target feature, and the source-target whitening alignment of the fingerprint of any terminal in the online phase can greatly reduce the damage to the localization performance caused by the heterogeneity. The algorithm is simple and fast to realize online adjustment, and achieves ideal performance in actual multi-terminal positioning.

本发明方案所公开的技术手段不仅限于上述实施方式所公开的技术手段，还包括由以上技术特征任意组合所组成的技术方案。应当指出，对于本技术领域的普通技术人员来说，在不脱离本发明原理的前提下，还可以做出若干改进和润饰，这些改进和润饰也视为本发明的保护范围。The technical means disclosed in the solution of the present invention are not limited to the technical means disclosed in the above embodiments, but also include technical solutions composed of any combination of the above technical features. It should be pointed out that for those skilled in the art, without departing from the principle of the present invention, several improvements and modifications can be made, and these improvements and modifications are also regarded as the protection scope of the present invention.

Claims

1. A method for realizing high-precision indoor positioning of heterogeneous equipment in a self-adaptive mode in the field of use is characterized by comprising the following steps:

(1) simulating a positioning process of various equipment heterogeneous conditions in an actual scene through a common terminal;

(2) for an indoor positioning system, sampling points need to be uniformly divided for an environment to be tested so as to collect and build a fingerprint database of wireless signal intensity of each peripheral access point; then off-line ensemble learning training is carried out, in the training stage, the mapping relation between each reference point position label and the corresponding fingerprint is trained and established through ensemble learning formed by random forest regression, multilayer perceptron regression and multilayer perceptron classification, and the trained position estimation network model is stored; finally, carrying out an online position estimation test, and carrying out real-time positioning by using a trained algorithm after related alignment online adjustment when an online user receives fingerprints of surrounding access points;

(3) in the online adjustment stage, whitening operation is firstly carried out on test source domain data received by another mobile phone, and the characteristic correlation of the source domain data is eliminated; then calculating the correlation among the training target domain data in the fingerprint database, acting the correlation on the whitened source data, and reconstructing the characteristics of the source data; fusing the source domain and the target domain through continuous approximation between the second-order statistical information of the source domain and the target domain; the second-order statistical characteristic keeps the data feature alignment and also keeps the distribution information;

(4) the fingerprints received in real time are passed through a trained position estimation network, and the algorithm can predict position coordinates.

2. The method for realizing high-precision indoor positioning of heterogeneous equipment in a use field self-adaption mode according to claim 1, wherein the step of building the fingerprint database in the step (2) specifically comprises the following steps:

(11) uniformly dividing a region to be measured into grid reference points at certain intervals according to the area, and taking the grid reference points as sampling points of a positioning region;

(12) at each reference point, the wireless signal strength sent by all access points AP in a period of time nearby is recorded by the acceptance of the terminal A, and a reference point is formedPosition [ x ]_i,y_i]MAC address of each AP and corresponding received signal strength rss₁,rss₂,...,rss_n]Is given as a fingerprint vector ([ rss ]₁,rss₂,...,rss_n],[x_i,y_i])；

(13) The fingerprint vectors of each reference point can be combined and stored as an off-line fingerprint library of the area to be detected.

3. The method for realizing high-precision indoor positioning of heterogeneous equipment by using domain self-adaptation as claimed in claim 1, wherein the step of off-line integrated learning training in the step (2) comprises the following processes:

(11) for RSS data [ RSS ] in fingerprint database₁,rss₂,...,rss_n]After normalization to zero mean and standard variance, dividing the off-line data set into fingerprint database data according to a cross-over verification method, [ rss [ [ rss ]₁,rss₂,...,rss_n]Is the signal strength; the fingerprint database data is divided into a training set, a verification set and a test set; in the training process, training is carried out on a training set, and the final positioning result is tested on a test set;

(12) combining a random forest regression model, a multilayer perceptron regression model and a multilayer perceptron classification model as a base learner on a training set and a verification set to form integrated learning, wherein an objective function in the training process is the mean square error of a mapping relation between coded neurons and positions with labels, and objective function optimization parameters are minimized;

(13) taking the distance between the predicted position and the actual position as a critical standard of precision, and recording that the to-be-positioned area has L unknown reference points, wherein the real geographic position of the ith reference point is recorded as (x)_i,y_i) The estimated position in the input positioning system is noted as (x)_i′,y_i'), the average positioning error ALE can be expressed as:

4. the method for realizing high-precision indoor positioning of heterogeneous equipment by using field adaptive method according to claim 3, wherein the step (12) comprises the following steps:

the RFR algorithm of the base class learner random forest regression learning method consists of a plurality of decision trees; measured signal [ rss ]₁,rss₂,...,rss_n]Can be regarded as the characteristic of the division condition, as the input data of the algorithm; reference point location label [ x ]_i,y_i]The leaf nodes of the decision tree are equivalent, namely the two-dimensional output result of the algorithm; a heuristic node selection method C4.5 algorithm is adopted, the attribute that the information Gain Ent (-) is higher than the average level is found out, and then the attribute with the maximum information Gain rate Gain (-) is selected to construct a decision tree; the construction of the decision tree requires the coordination and adjustment of some parameters of the grids to effectively prevent over-fitting and under-fitting; estimating the importance of each feature in the classification result by the random forest, and taking the class with the most occurrence in the decision tree output as the final output of the random forest classifier;

training each decision tree by a recursive process consisting of smaller subsets in the input data set until all tree nodes reach similar output targets; the random forest classifier takes the weight value based on input as a parameter similar to the number of the decision trees; instead of a classification algorithm, the RFR algorithm is chosen to train the mapping of fingerprints to labels, and the feature classification criterion is the minimum mean square error:

in the formula, y_L*Value, y, representing the left sub-tree_R*The value of the right subtree represented;

the multi-layer perceptron algorithm is a feedforward type artificial neural network; hidden layers, the number of neurons and nonlinear activation functions in the algorithm need to be carefully selected; except for the input nodes, each node is a neuron under the action of a nonlinear activation function, and when a multilayer perceptron is used, two hidden layers with 100 neurons in each layer are adopted; the parameters are updated using the SGD method when minimizing the loss function:

wherein eta is the learning rate of the control step length in the search parameter space, and Loss is the Loss function of the whole network;

the integrated learning comprehensively considers the results of a plurality of basic learner models; the average method is adopted to combine the strategies:

wherein, the position (x)_i,y_i) Is the result of the prediction of the respective base learner, and W_iIs the weight of the base learner prediction result to the final position estimate.

5. The method for realizing high-precision indoor positioning of heterogeneous equipment by using domain self-adaptation as claimed in claim 1, wherein the online position estimation step in the step (2) comprises the following processes:

fingerprints from n APs collected at the ith test point in the test set

And estimating the corresponding position of the newly input online feature by using the trained network model of the position fingerprint feature and the position label.

6. The method for realizing high-precision indoor positioning of heterogeneous equipment by using domain adaptation as claimed in claim 1, wherein the whitening and rebuilding process of the online features in the step (3) comprises the following initialization stage, modeling stage and optimization stage:

(11) initialization phase, data RSS as source domain for data in given test set_S＝(rss₁,…,rss_m) Wherein the signal strength

The data of the target domain whose alignment is desired is the training data R in the fingerprint library_T＝(r₁,…,r_k) Wherein

the characteristics of the source domain and the target domain are n-dimensional vectors; the mean and covariance matrices of the two domain eigenvectors are recorded as mu respectively_S、μ_TAnd Cov_S、Cov_TAfter centering criterion, μ_S＝μ_T0, but Cov_S≠Cov_T；

(12) A modeling stage for reducing the difference between the second-order statistical properties of the source domain and target domain features, namely covariance Cov_SAnd Cov_TThe source domain features are linearly changed, and the distance between the covariance of the changed source domain features and the covariance of the changed target features is minimized; the objective function is then:

wherein,

a covariance matrix representing the source domain features after linear variation;

(13) in the optimization stage, singular value decomposition Cov is carried out on the covariance matrix of the objective function_T＝U_TΣ_TV_TWherein, U_T[1:r]、Σ_T[1:r]V_T[1:r]Are each Cov_TThe largest r singular values and the corresponding right singular vectors, then

Is optimally solved as

Wherein,

since the covariance matrix is a symmetric matrix, for Cov_SFeature decomposition to get Cov_S＝U_SΣ_SV_S：

By

So as to obtain the compound with the characteristics of,

the linear variation reconstruction process is derived as:

in the above formula, the first part

For the whitening process, the second part

An alignment process;

(14) providing fingerprint vectors at test points for users needing online positioning, and respectively calculating covariance matrixes of online test fingerprints of a target domain fingerprint library and a source domain; the source domain covariance matrix is firstly subjected to whitening transformation, and the covariance matrix is used as a generalized inverse matrix

The power of the next, eliminating the correlation of the source domain data; then using the second order of the objective functionThe statistical characteristics perform correlation superposition transformation on the whitened features;

(15) and carrying out position estimation on the online characteristics after CORAL alignment by using data in the original training set.