CN111757258A

CN111757258A - An adaptive positioning fingerprint database construction method in complex indoor signal environment

Info

Publication number: CN111757258A
Application number: CN202010639947.2A
Authority: CN
Inventors: 秦宁宁; 王超
Original assignee: Jiangnan University
Current assignee: Jiangnan University
Priority date: 2020-07-06
Filing date: 2020-07-06
Publication date: 2020-10-09
Anticipated expiration: 2040-07-06
Also published as: CN111757258B

Abstract

A self-adaptive positioning fingerprint database construction method under a complex indoor signal environment belongs to the technical field of indoor positioning. According to the AP laying position and the space structure, the system adopts a one-to-many support vector machine algorithm to perform partition operation on a target area so as to accurately determine the area range of signal change. A multivariate Gaussian mixture model based on mutual interference between signals is established by using the coupling relation between the signals in the narrow and small subareas so as to improve the reduction of the positioning precision caused by signal fluctuation. When the indoor environment changes, the self-adaptive updating algorithm based on the partition multivariate Gaussian mixture model can judge the credibility of fingerprint data of each partition, and updates the model parameters of the partitions with larger signal fluctuation by the self-adaptive algorithm, so that the coupling degree between the model and the existing environment is improved. The experimental result shows that the method can utilize relatively small amount of sample data to construct a stable and maintainable indoor signal distribution model, and compared with other algorithms, the positioning accuracy is improved to a certain extent.

Description

An adaptive positioning fingerprint database construction method in complex indoor signal environment

技术领域technical field

本发明涉及一种室内定位的离线指纹库构建优化技术，属于室内定位技术领域。The invention relates to an offline fingerprint database construction and optimization technology for indoor positioning, and belongs to the technical field of indoor positioning.

背景技术Background technique

全球卫星导航系统在室外环境下已被广泛应用于为人们提供位置服务，但信号的缺失也导致该系统无法在复杂的室内环境下发挥作用。WiFi设施的广泛铺设和智能手机的普及，使得基于接收信号强度(Received Signal Strength,RSS)值的室内定位系统，得到了大批研究人员的密切关注。然而，由于无线通讯设施的设计初衷并非为人们提供室内导航，因此如何降低环境波动对无线信号的不确定干扰导致的定位影响，已成为现有研究不得不面对和解决的难点。Global satellite navigation systems have been widely used to provide people with location services in outdoor environments, but the lack of signals also makes the system unable to function in complex indoor environments. The widespread deployment of WiFi facilities and the popularization of smartphones have made indoor positioning systems based on Received Signal Strength (RSS) values closely watched by a large number of researchers. However, since the original design of wireless communication facilities is not to provide people with indoor navigation, how to reduce the impact of environmental fluctuations on the positioning caused by the uncertain interference of wireless signals has become a difficulty that existing research has to face and solve.

基于已有无线设施的定位系统，其常见商用装置不具备自主可编辑功能，仅能提供室内通用RSS测量值，这使得传统基于到达时间差和到达距离差等方法，无法直接平移应用。利用测量信号与实际位置间匹配运算的指纹定位算法，弥补了无线设施所发送信号在时间和空间特性上的缺失，可有效实现信号环境与实际场景的映射，为基于RSS值的室内定位提供了可能。The positioning system based on the existing wireless facilities, its common commercial devices do not have the function of self-editing, and can only provide the indoor general RSS measurement value, which makes the traditional method based on the difference of arrival time and the difference of arrival distance, unable to directly translate applications. The fingerprint positioning algorithm using the matching operation between the measured signal and the actual position makes up for the lack of time and space characteristics of the signal sent by the wireless facility, which can effectively realize the mapping between the signal environment and the actual scene, and provide an indoor positioning based on the RSS value. possible.

发明内容SUMMARY OF THE INVENTION

对大型室内场景，上述方法将耗费大量人力物力，且易受环境因素影响，所构建的离线指纹库与实际场景中信号分布的映射关系也会因时间变化而减弱，需不断修正离线指纹库，以降低时间积累所造成的映射误差累计。For large indoor scenes, the above method will consume a lot of manpower and material resources, and is easily affected by environmental factors. The mapping relationship between the constructed offline fingerprint database and the signal distribution in the actual scene will also weaken due to time changes, and the offline fingerprint database needs to be continuously revised. In order to reduce the accumulation of mapping errors caused by time accumulation.

本发明的技术方案：Technical scheme of the present invention:

一种复杂室内信号环境下的自适应定位指纹库构建方法，包括三个阶段：离线阶段、在线定位阶段、在线指纹库更新阶段，具体步骤如下：A method for constructing an adaptive positioning fingerprint database in a complex indoor signal environment, including three stages: an offline stage, an online positioning stage, and an online fingerprint database update stage. The specific steps are as follows:

步骤一、划分目标区域网格及构建离线指纹库Step 1. Divide the target area grid and build an offline fingerprint library

目标区域由K个划分区域Ω_k，k∈{1,2,...,K}组成，将区域Ω_k划分为N_k个网格，取每个网格的几何中心作为参考点

其中n∈{1,2,...,N_k}，

为2×N_k维位置矩阵表示RP位置。对于各RP位置

相应的区域标志为

其中

且

i≠k，表示参考点位于k分区。在

收集到的来自M_k个AP的RSS样本值为

其中

表示在

处收集到的来自第m个AP的RSS样本值，m∈{1,2,...,M_k}。The target area is composed of K divided areas Ω _k , k∈{1,2,...,K}, the area Ω _k is divided into N _k grids, and the geometric center of each grid is taken as the reference point

where _n∈ {1,2,...,Nk},

represents the RP position as a 2×N _k -dimensional position matrix. For each RP location

The corresponding region flags are

in

and

i≠k, indicating that the reference point is located in the k partition. exist

The collected RSS sample values from M _k APs are

in

expressed in

RSS sample values from the mth AP collected at m∈{1,2,..., _Mk }.

步骤二、构建在线定位阶段的SVM分区模型Step 2. Build the SVM partition model in the online positioning stage

以一对多方式设置支持向量机概率分类模型，对于预先设置的各分区，以目标是否位于本分区进行二分类标识，通过训练数据构建各分区SVM模型。对于给定K个分区，设立K个SVM模型，取各分区内所有参考AP的测量信号组成当前观测数据r＝[r₁,r₂,...,r_M]，其中M为目标所接收到区域内AP数量，对于未接收到的信号值取为-100db。针对当前观测数据，各分区SVM模型给出目标是否位于相应区域内的分布概率p(y^k＝1|r)，其中y^k为分区标识，表示目标位于分区Ω_k内，k∈{1,2,...,K}。通过各分区SVM模型给出的分布概率p(y^k＝1|r)，对目标所在分区做初步判断，并作为一级判断依据。The support vector machine probability classification model is set up in a one-to-many manner. For each pre-set partition, the two-class identification is performed according to whether the target is located in this partition, and the SVM model of each partition is constructed through the training data. For a given K partitions, K SVM models are established, and the measurement signals of all reference APs in each partition are taken to form the current observation data r=[r ₁ , r ₂ ,...,r _M ], where M is received by the target The number of APs in the area, and the value of the unreceived signal is -100db. For the current observation data, each partition SVM model gives the distribution probability p(y ^k =1|r) of whether the target is located in the corresponding area, where y ^k is the partition identifier, indicating that the target is located in the partition Ω _k , k∈{1, 2,...,K}. According to the distribution probability p (y ^k =1|r) given by the SVM model of each partition, a preliminary judgment is made on the partition where the target is located, and it is used as the first-level judgment basis.

采用基于概率SVM的分区操作，将步骤一所获取参考点观测数据划分为训练集与测试集，对分区判断模型SVM进行训练。通过K个SVM模型获取目标位于相应分区的概率值，通过设置二级判断依据，以克服对于分区交接附近的测试点误判断问题。选取被判定区域内2个最大概率的分区区域，即p(yⁱ＝1|r)与p(y^j＝1|r)，i,j∈{1,2,...,K}，且p(yⁱ＝1|r)＞p(y^j＝1|r)，其差值表示为：Using the partition operation based on probability SVM, the reference point observation data obtained in step 1 is divided into training set and test set, and the partition judgment model SVM is trained. The probability value that the target is located in the corresponding partition is obtained through K SVM models, and the second-level judgment basis is set to overcome the problem of misjudgment of the test points near the partition handover. Select two partitioned regions with the highest probability in the determined region, namely p(y ⁱ =1|r) and p(y ^j =1|r), i,j∈{1,2,...,K}, And p(y ⁱ =1|r)>p(y ^j =1|r), the difference is expressed as:

Δy_p＝p(yⁱ＝1|r)-p(y^j＝1|r)， (1)Δy _p =p(y ⁱ =1|r)-p(y ^j =1|r), (1)

当Δy_p＞Δy时，说明i分区对测试点影响力远大于j分区，则将参考点判定于i分区，其中Δy为二级判断阈值。对于Δy_p＜Δy，则将两区域均判定为目标区域，分别做相应的区域匹配运算，并将各自分区所得目标位置做概率平均，以求取最终位置估计When Δy _p >Δy, it means that the influence of the i partition on the test point is much greater than that of the j partition, and the reference point is determined in the i partition, where Δy is the secondary judgment threshold. For Δy _p < Δy, both regions are determined as target regions, and the corresponding region matching operations are performed respectively, and the probability average of the target positions obtained by the respective partitions is performed to obtain the final position estimate.

步骤三、构建多元高斯混合模型MVGMMStep 3. Build a multivariate Gaussian mixture model MVGMM

利用不同AP信号的相关性建立多元高斯混合模型MVGMM，通过不断增加高斯元素个数，利用不同参数的概率密度函数加权和近似模拟分区内RP位置与所获取各AP信号间的联合分布情况，弥补传统指纹库构建工作中对AP信号间耦合关系的忽略。多元高斯混合模型的概率分布函数表示为：The multivariate Gaussian mixture model MVGMM is established by using the correlation of different AP signals. By continuously increasing the number of Gaussian elements, the probability density function of different parameters is used to weight and approximate the joint distribution between the RP positions in the partition and the acquired AP signals to make up for The coupling relationship between AP signals is ignored in the construction of traditional fingerprint database. The probability distribution function of the multivariate Gaussian mixture model is expressed as:

其中，C表示组成元素的个数，p_N(x|μ_c,P_c)表示均值为μ_c、协方差为P_c的高斯组成成分，权重w_c的加和为1。Among them, C represents the number of constituent elements, p _N (x|μ _c , P _c ) represents a Gaussian composition with a mean of μ _c and a covariance of P _c , and the sum of the weights w _c is 1.

基于式(2)，利用分区Ω_k内RP位置与RSS信号值联合分布的后验概率将多元高斯混合模型表示为：Based on equation (2), the _multivariate Gaussian mixture model is expressed as:

其中，y^k＝1表示目标处于第k个分区，r表示在参考点x处接收到各AP信号的RSS值，

为多元高斯混合模型的组成元素权重，

为元素均值，

为元素协方差，C_k为组成元素数量。Among them, y ^k =1 indicates that the target is in the kth partition, r indicates the RSS value of each AP signal received at the reference point x,

is the element weight of the multivariate Gaussian mixture model,

is the element mean,

is the element covariance, and C _k is the number of constituent elements.

MVGMM模型采用EM算法估计模型参数。MVGMM模型选用C_k个均值为

协方差为

的多元高斯函数，拟合分区Ω_k内N_k个参考点

与相应分区内所获取各AP信号RSS值

的联合概率分布。利用k-means算法将样本数据聚类为C_k个初始簇，取各簇平均值与协方差用于初始化EM算法参数且各簇初始权重设置相同。联合概率分布的对数似然形式表示为：The MVGMM model uses the EM algorithm to estimate the model parameters. The MVGMM model selects C _k means as

The covariance is

The multivariate Gaussian function is fitted to N _k reference points in the partition Ω _k

and the RSS value of each AP signal obtained in the corresponding partition

The joint probability distribution of . The sample data is clustered into C _k initial clusters by the k-means algorithm, and the mean and covariance of each cluster are taken to initialize the parameters of the EM algorithm, and the initial weights of each cluster are set the same. The log-likelihood form of the joint probability distribution is expressed as:

其中，z^k＝[x^k；R^k]，

隐变量γ_c,n表示第n个样本值属于第c个高斯组成元素的概率。where z ^k =[x ^k ; R ^k ],

The latent variable γc _,n represents the probability that the nth sample value belongs to the cth Gaussian component.

由此确定观测数据

属于第c个高斯组成元素的概率表示为：From this, the observed data are determined

The probability of belonging to the c-th Gaussian component is expressed as:

通过式(5)计算多元高斯函数的权重

均值

和协方差

分别为：Calculate the weight of the multivariate Gaussian function by formula (5)

mean

and covariance

They are:

对不同的C_k值重复聚类与EM估计过程离线指纹库表示为(w^k；μ^k；P^k)，其中k∈{1,2,..,K}。Repeating the clustering and EM estimation process for different values of C _k The offline fingerprint library is denoted as (w ^k ; μ ^k ; P ^k ), where k∈{1,2,..,K}.

步骤四、定位分域内目标Step 4. Locate the target in the sub-domain

通过离线指纹库对区域内目标位置进行估计。若当前时刻目标接受M个AP的RSS值为

通过分区判断模型获得目标在各分区的分布概率p(y_k＝1|r^now)，k∈{1,2,...,K}。The target location in the area is estimated through the offline fingerprint database. If the RSS value of M APs accepted by the target at the current moment is

The distribution probability p(y _k =1|r ^now ) of the target in each partition is obtained through the partition judgment model, k∈{1,2,...,K}.

利用当前测量值r^now选取对应分区Ω_k内对目标定位有效的AP信号测量值

根据多元高斯分布的条件概率准则，获得给定观测数据下目标位置的后验概率分布：Use the current measurement value r ^now to select the AP signal measurement value that is valid for target positioning in the corresponding partition Ω _k

According to the conditional probability criterion of multivariate Gaussian distribution, the posterior probability distribution of the target position under the given observation data is obtained:

基于所得离线指纹库(w^k；μ^k；P^k)，得：Based on the obtained offline fingerprint database (w ^k ; μ ^k ; P ^k ), we have:

联合分区概率与式(9)，获得目标在分区Ω_k内各参考点的后验分布概率为：Combining the partition probability with equation (9), the posterior distribution probability of each reference point of the target in the partition Ω _k is obtained as:

目标位于分区Ω_k内各参考点的分布权重更新为：The distribution weight of each reference point in the partition Ω _k is updated as:

其中，λ为归一化因子，则分区Ω_k内各参考点为目标位置的分布权重为

Among them, λ is the normalization factor, then the distribution weight of each reference point in the partition Ω _k as the target position is:

综上所述，目标的位置估计为In summary, the location of the target is estimated as

步骤五、指纹库更新Step 5. Update the fingerprint database

当分区Ω_k中所得信息熵小于分区阈值时，通过新采集N_k个数据

对分区模型进行参数更新。基于现有多元高斯混合模型，通过采集数据对模型参数进行自适应调整，以推导出与现有环境具有更加紧密耦合关系的MVGMM模型。与EM算法一致，MVGMM模型参数的自适应过程也是一个两步估计。首先，通过k-means算法将新增观测数据

聚类为C_k组簇，并通过聚类所得数据权值

均值

协方差

初始化各高斯组成成分。利用式(5)-(8)分别计算新增观测数据的统计权重

均值

与协方差

When the information entropy obtained in the partition Ω _k is less than the partition threshold, N _k data are newly collected by

Parameter update for the partition model. Based on the existing multivariate Gaussian mixture model, the model parameters are adaptively adjusted by collecting data to derive a MVGMM model with a more tightly coupled relationship with the existing environment. Consistent with the EM algorithm, the adaptation process of the MVGMM model parameters is also a two-step estimation. First, the new observation data will be added through the k-means algorithm.

Clustering into C _k groups of clusters, and data weights obtained by clustering

mean

Covariance

Initialize each Gaussian component. Use equations (5)-(8) to calculate the statistical weights of the newly added observation data respectively

mean

with covariance

利用新增数据的统计参数对分区Ω_k所对应的MVGMM模型参数进行自适应优化，即：The MVGMM model parameters corresponding to the partition Ω _k are adaptively optimized using the statistical parameters of the newly added data, namely:

其中，

ρ∈{w,m,v}用于平衡新旧数据对模型参数的影响程度，λ为归一化因子，r^ρ为参数的固有相关因子。in,

ρ∈{w,m,v} is used to balance the influence of old and new data on model parameters, λ is the normalization factor, and r ^ρ is the inherent correlation factor of the parameters.

步骤六、更新模型参数Step 6. Update model parameters

将自适应过程中更新得到的模型参数属于步骤二和步骤三中，进行实时更新。The model parameters updated in the adaptive process belong to the second and third steps, and are updated in real time.

本发明的有益效果：专利根据AP铺设位置与空间结构，系统采用一对多支持向量机算法对目标区域做分区操作，以精确信号变化的区域范围。利用狭小分区内信号间的耦合关系，建立基于信号间相互干扰的多元高斯混合模型，以改善信号波动所造成的定位精度下降。当室内环境发生变化时，基于分区多元高斯混合模型的自适应更新算法可对各分区指纹数据的可信度做出判断，并以自适应算法更新信号波动较大分区的模型参数，提高模型与现有环境间的耦合程度。Beneficial effects of the present invention: According to the AP laying position and spatial structure, the system adopts a one-to-many support vector machine algorithm to perform a partition operation on the target area, so as to accurately determine the area range of the signal change. Using the coupling relationship between signals in a narrow partition, a multivariate Gaussian mixture model based on mutual interference between signals is established to improve the positioning accuracy drop caused by signal fluctuations. When the indoor environment changes, the adaptive update algorithm based on the partitioned multivariate Gaussian mixture model can judge the reliability of the fingerprint data of each partition, and use the adaptive algorithm to update the model parameters of the partitions with large signal fluctuations, so as to improve the model and the accuracy of the model. The degree of coupling between existing environments.

附图说明Description of drawings

图1为本发明方法的流程图。Figure 1 is a flow chart of the method of the present invention.

图2为实验场景图。Figure 2 shows the experimental scene diagram.

图3为分区效果对比图，其中，(a)为区域划分情况，(b)为练后的区域划分效果图。Figure 3 is a comparison diagram of the effect of partitions, in which (a) is the situation of regional division, and (b) is the effect diagram of regional division after practice.

图4为RSS指纹地图构建效果对比图，其中，(a)为真实测量值，(b)为GP型估计，(c)为MVGMM模型估计。Figure 4 is a comparison diagram of the construction effect of RSS fingerprint map, in which (a) is the real measurement value, (b) is the GP-type estimation, and (c) is the MVGMM model estimation.

图5为分区一内AP3信号的拟合效果对比图。FIG. 5 is a comparison diagram of the fitting effect of AP3 signals in partition one.

图6为目标运动的轨迹预测对比图。FIG. 6 is a comparison diagram of trajectory prediction of target motion.

图7为轨迹估计误差箱型图。Figure 7 is a boxplot of trajectory estimation error.

图8为误差累计函数对比图。Figure 8 is a comparison diagram of the error accumulation function.

图9指纹库更新前后AP3数据拟合效果对比图。其中，(a)为AP3信号强度衰减值，(b)为AP3信号预测误差。Figure 9. Comparison of AP3 data fitting effects before and after the fingerprint database update. Among them, (a) is the AP3 signal strength attenuation value, (b) is the AP3 signal prediction error.

具体实施方式Detailed ways

针对大型室内场景下采样数据量大及维护成本高等问题，方法通过分区操作精确维护区域，并根据分区内信号间的耦合关系提出一种分区多元高斯混合模型(Multivariate Gaussian Mixture Model,MVGMM)以提高对信号分布的拟合程度。模型根据信号接入点(Access Point,AP)位置与物理连通结构对目标区域进行划分，并通过一对多支持向量机模型实现分区操作。在相对狭小的分区区域内，利用信号间存在的相互干扰分别建立多元高斯混合模型，以强化信号的拟合程度，最终达到改善分区定位精度的效果。当环境发生变化时，算法以信息熵作为分区数据更新判据，以及时响应分区变化对指纹库的影响，降低维护成本。从而在室内定位应用中，实现少量数据支撑高效可维护指纹库的构建。In view of the large amount of sampling data and high maintenance cost in large indoor scenes, the method accurately maintains the area through partition operation, and proposes a partitioned Multivariate Gaussian Mixture Model (MVGMM) according to the coupling relationship between signals in the partition to improve the The fit of the signal distribution. The model divides the target area according to the location of the signal access point (AP) and the physical connection structure, and realizes the partition operation through the one-to-many support vector machine model. In a relatively narrow partition area, the multivariate Gaussian mixture model is established by using the mutual interference between the signals to strengthen the fitting degree of the signals, and finally achieve the effect of improving the partition positioning accuracy. When the environment changes, the algorithm uses the information entropy as the partition data update criterion, and responds to the impact of the partition change on the fingerprint database in time to reduce the maintenance cost. Therefore, in indoor positioning applications, a small amount of data can support the construction of an efficient and maintainable fingerprint database.

基于多元高斯混合模型的目标定位系统，能够以较低的采集代价对目标进行精确定位。算法以传统WiFi设施中所获取测量信号作为训练数据，构建目标区域内AP信号的分布状况。与传统目标定位过程相比，系统由三个模块组成，包括离线阶段、在线定位阶段、在线指纹库更新阶段，如图1所示。The target localization system based on the multivariate Gaussian mixture model can accurately locate the target with low acquisition cost. The algorithm uses the measurement signals obtained in traditional WiFi facilities as training data to construct the distribution of AP signals in the target area. Compared with the traditional target positioning process, the system consists of three modules, including an offline stage, an online positioning stage, and an online fingerprint database update stage, as shown in Figure 1.

离线阶段主要是通过定位区域内各参考点(Reference Point,RP)位置与所接收到RSS信号值建立完整指纹数据库，并将其应用于在线阶段的匹配运算。在线定位阶段采用分区模型确定目标所在区域，利用多元高斯函数的条件概率准则，计算以当前观测数据为条件的目标位置后验概率，并通过WKNN估计目标位置。在线指纹库更新阶段则以后验分布概率计算所得信息熵作为指纹库更新的衡量尺度，并建立基于分区多元高斯混合模型的反馈调度指纹库。The offline stage mainly establishes a complete fingerprint database by locating the position of each reference point (Reference Point, RP) in the area and the received RSS signal value, and applies it to the matching operation in the online stage. In the online positioning stage, the partition model is used to determine the target area, and the conditional probability criterion of the multivariate Gaussian function is used to calculate the posterior probability of the target position conditioned on the current observation data, and estimate the target position through WKNN. In the online fingerprint database update stage, the information entropy calculated by the posterior distribution probability is used as the measure of the fingerprint database update, and a feedback scheduling fingerprint database based on the partitioned multivariate Gaussian mixture model is established.

具体如下：details as follows:

构建离线指纹库Build an offline fingerprint library

指纹收集方案Fingerprint Collection Scheme

目标区域由K个划分区域Ω_k，k∈{1,2,...,K}组成，并将区域Ω_k划分为N_k个网格，取每个网格的几何中心作为参考点

其中n∈{1,2,...,N_k}，

为2×N_k维位置矩阵表示RP位置。对于各RP位置

相应的区域标志为

其中

且

i≠k，表示参考点位于k分区。在

收集到的来自M_k个AP的RSS样本值为

其中

表示在

处收集到的来自第m个AP的RSS样本值，m∈{1,2,...,M_k}。The target area consists of K partitioned areas Ω _k , k∈{1,2,...,K}, and the area Ω _k is divided into N _k meshes, and the geometric center of each mesh is taken as the reference point

where _n∈ {1,2,...,Nk},

The corresponding region flags are

in

and

i≠k, indicating that the reference point is located in the k partition. exist

The collected RSS sample values from M _k APs are

in

expressed in

RSS sample values from the mth AP collected at m∈{1,2,..., _Mk }.

分区模型构建Partition Model Construction

综合考量区域分类的精确度与效率，以一对多方式设置支持向量机概率分类模型，可有效解决根据AP位置与物理连通所设置分区的分类问题[14]。对于预先设置的各分区，以目标是否位于本分区进行二分类标识，通过训练数据构建各分区SVM模型。对于给定K个分区，设立K个SVM模型，取各分区内所有参考AP的测量信号组成当前观测数据r＝[r₁,r₂,...,r_M]，其中M为目标所接收到区域内AP数量，对于未接收到的信号值取为-100db。针对当前观测数据，各分区SVM模型可给出目标是否位于相应区域内的分布概率p(y^k＝1|r)，其中y^k为分区标识，表示目标位于分区Ω_k内，k∈{1,2,...,K}。通过各分区SVM模型给出的分布概率p(y^k＝1|r)，可对目标所在分区做初步判断，并作为一级判断依据Considering the accuracy and efficiency of regional classification, setting the support vector machine probability classification model in a one-to-many manner can effectively solve the classification problem of partitions set according to AP location and physical connectivity [14]. For each pre-set partition, the two-class identification is carried out according to whether the target is located in this partition, and the SVM model of each partition is constructed by training data. For a given K partitions, K SVM models are established, and the measurement signals of all reference APs in each partition are taken to form the current observation data r=[r ₁ , r ₂ ,...,r _M ], where M is received by the target The number of APs in the area, and the value of the unreceived signal is -100db. For the current observation data, the SVM model of each partition can give the distribution probability p(y ^k =1|r) of whether the target is located in the corresponding area, where y ^k is the partition identifier, indicating that the target is located in the partition Ω _k , k∈{1 ,2,...,K}. Through the distribution probability p(y ^k =1|r) given by the SVM model of each partition, a preliminary judgment can be made on the partition where the target is located, and it can be used as a first-level judgment basis

算法采用基于概率SVM的分区操作，将离线阶段所获取参考点观测数据划分为训练集与测试集，对分区判断模型进行训练。通过K个SVM模型可获取目标位于相应分区的概率值，但分区交接处信号分布复杂，易造成分区模型的判断失误。由此，通过设置二级判断依据，以克服对于分区交接附近的测试点误判断问题。选取被判定区域内2个最大概率的分区区域，即p(yⁱ＝1|r)与p(y^j＝1|r)，i,j∈{1,2,...,K}，且p(yⁱ＝1|r)＞p(y^j＝1|r)，其差值可表示为The algorithm adopts the partition operation based on probability SVM, divides the observation data of reference points obtained in the offline phase into training set and test set, and trains the partition judgment model. Through K SVM models, the probability value of the target located in the corresponding partition can be obtained, but the signal distribution at the junction of the partition is complex, which is easy to cause errors in the judgment of the partition model. Therefore, by setting a secondary judgment basis, the problem of misjudging the test points near the partition handover can be overcome. Select two partitioned regions with the highest probability in the determined region, namely p(y ⁱ =1|r) and p(y ^j =1|r), i,j∈{1,2,...,K}, And p(y ⁱ =1|r)>p(y ^j =1|r), the difference can be expressed as

当Δy_p＞Δy时，说明i分区对测试点影响力远大于j分区，可将参考点判定于i分区，其中Δy为二级判断阈值。对于Δy_p＜Δy，则将两区域均判定为目标区域，可分别做相应的区域匹配运算，并将各自分区所得目标位置做概率平均，以求取最终位置估计When Δy _p >Δy, it means that the influence of the i partition on the test point is much greater than that of the j partition, and the reference point can be determined in the i partition, where Δy is the secondary judgment threshold. For Δy _p < Δy, both regions are determined as target regions, and corresponding region matching operations can be performed respectively, and the probability average of the target positions obtained by each partition is performed to obtain the final position estimate

多元高斯混合模型构建Multivariate Gaussian Mixture Model Construction

由于狭小区域内墙壁对电磁信号的衰减和反射导致天线的辐射模型并非定向均匀。不同AP在某一时刻的信号强度值也会因墙壁折射、人员遮挡、信道等问题出现相互关联。因此，简单地通过单一AP采样值则会割裂各AP信号间的相关性，导致模型构建中的较大失误。The radiation model of the antenna is not uniform in direction due to the attenuation and reflection of the electromagnetic signal by the walls in the narrow area. The signal strength values of different APs at a certain time are also correlated with each other due to wall refraction, personnel occlusion, and channel problems. Therefore, simply sampling the value of a single AP will split the correlation between the AP signals, resulting in a large error in model construction.

考虑到划分后狭小分区内信号间的相互干扰，可利用不同AP信号的相关性建立多元高斯混合模型，通过不断增加高斯元素个数，以利用不同参数的概率密度函数加权和近似分区内RP位置与所获取各AP信号间的联合概率密度分布，弥补常见工作中对AP信号间耦合关系的忽略。多元高斯混合模型的概率分布函数可表示为Considering the mutual interference between the signals in the narrow partition after division, a multivariate Gaussian mixture model can be established by using the correlation of different AP signals, and by continuously increasing the number of Gaussian elements, the probability density function of different parameters can be used to weight and approximate the RP position in the partition. The joint probability density distribution between the acquired AP signals makes up for the neglect of the coupling relationship between AP signals in common work. The probability distribution function of the multivariate Gaussian mixture model can be expressed as

其中，C表示组成元素的个数，p_N(x|μ_c,P_c)表示均值为μ_c，协方差为P_c的高斯组成成分，权重w_c的加和为1。Among them, C represents the number of constituent elements, p _N (x|μ _c , P _c ) represents a Gaussian composition with a mean value of μ _c , a covariance of P _c , and the sum of the weights w _c is 1.

基于式(2)，利用分区Ω_k内RP位置与RSS信号值联合分布的后验概率可将多元高斯混合模型表示为Based on equation (2), the _multivariate Gaussian mixture model can be expressed as

为多元高斯混合模型的组成元素权重，

为元素均值，

is the element weight of the multivariate Gaussian mixture model,

is the element mean,

is the element covariance, and C _k is the number of constituent elements.

为提高MVGMM模型对于目标区域内所采集样本数据的拟合效果，采用EM算法估计模型参数。MVGMM模型选用C_k个均值为

协方差为

的多元高斯函数，拟合分区Ω_k内N_k个参考点

与所获取各AP信号RSS值

的联合概率分布。利用k-means算法将样本数据聚类为C_k个初始簇，取各簇平均值与协方差用于初始化EM算法参数且各簇初始权重设置相同。联合概率分布的对数似然形式可表示为In order to improve the fitting effect of the MVGMM model to the sample data collected in the target area, the EM algorithm is used to estimate the model parameters. The MVGMM model selects C _k means as

The covariance is

and the obtained RSS value of each AP signal

The joint probability distribution of . The sample data is clustered into C _k initial clusters by the k-means algorithm, and the mean and covariance of each cluster are taken to initialize the parameters of the EM algorithm, and the initial weights of each cluster are set the same. The log-likelihood form of the joint probability distribution can be expressed as

其中，z^k＝[x^k；R^k]，

由此确定观测数据

属于第c个高斯组成元素的概率可表示为From this, the observed data are determined

The probability of belonging to the c-th Gaussian component can be expressed as

通过式(5)可计算多元高斯函数的权重

均值

和协方差

分别为：The weight of the multivariate Gaussian function can be calculated by formula (5)

mean

and covariance

They are:

对不同的C_k值重复聚类与EM估计过程：Repeat the clustering and EM estimation process for different values of _Ck :

(1)输入：高斯组成元素数量C_k，高斯组成元素的初始权值

初始均值

初始协方差

分区Ω_k内参考点位置

与相应采样值

c∈{1,2,..,C_k}(1) Input: the number of Gaussian constituent elements C _k , the initial weight of the Gaussian constituent elements

initial mean

initial covariance

Position of reference point in partition Ω _k

with the corresponding sample value

c∈{1,2,..,C _k }

(2)基于公式(5)计算

(2) Calculated based on formula (5)

(3)基于公式(6)至(8)，计算w_c，

(3) Based on formulae (6) to (8), calculate w _c ,

(4)重复步骤(2)至(3)，直至

(4) Repeat steps (2) to (3) until

(5)输出：高斯组成元素权值

均值

协方差

(5) Output: Gaussian composition element weights

mean

Covariance

基于上述分析可得，离线指纹库可表示为(w^k；μ^k；P^k)，其中k∈{1,2,..,K}。Based on the above analysis, the offline fingerprint library can be expressed as (w ^k ; μ ^k ; P ^k ), where k∈{1,2,..,K}.

在线阶段：目标定位Online Phase: Targeting

利用所采集样本数据构建关于目标区域的离线指纹库，通过该指纹库可对区域内目标位置进行估计。若当前时刻目标接受M个AP的RSS值为

通过分区判断模型可得目标在各分区的分布概率p(y_k＝1|r^now)，k∈{1,2,...,K}。The collected sample data is used to build an offline fingerprint database about the target area, through which the target position in the area can be estimated. If the RSS value of M APs accepted by the target at the current moment is

Through the partition judgment model, the distribution probability p(y _k =1|r ^now ) of the target in each partition can be obtained, k∈{1,2,...,K}.

利用当前测量值r^now可选取对应分区Ω_k内对目标定位有效的AP信号测量值

根据多元高斯分布的条件概率准则，可得给定观测数据下目标位置的后验概率分布Using the current measurement value r ^now , the AP signal measurement value that is valid for target positioning in the corresponding partition Ω _k can be selected

According to the conditional probability criterion of multivariate Gaussian distribution, the posterior probability distribution of the target position under the given observation data can be obtained

其中，基于所得离线指纹库(w^k；μ^k；P^k)，可得Among them, based on the obtained offline fingerprint database (w ^k ; μ ^k ; P ^k ), we can get

联合分区概率与式(9)，可得目标在分区Ω_k内各参考点的后验分布概率为Combining the partition probability with equation (9), the posterior distribution probability of each reference point of the target in the partition Ω _k can be obtained as

目标位于分区Ω_k内各参考点的分布权重可更新为The distribution weight of each reference point in the partition Ω _k can be updated as

在线阶段：指纹库更新Online Phase: Fingerprint Database Update

对分区模型进行参数更新。基于现有多元高斯混合模型，通过采集数据可对模型参数进行自适应调整，以推导出与现有环境具有更加紧密耦合关系的MVGMM模型。与EM算法一致，MVGMM模型参数的自适应过程也是一个两步估计^[13]。首先，通过k-means算法将新增观测数据

聚类为C_k组簇，并通过聚类所得数据权值

均值

协方差

初始化各高斯组成成分。利用式(5)-(8)可分别计算新增观测数据的统计权重

均值

与协方差

Parameter update for the partition model. Based on the existing multivariate Gaussian mixture model, the model parameters can be adaptively adjusted by collecting data to derive the MVGMM model with a more tightly coupled relationship with the existing environment. Consistent with the EM algorithm, the adaptation process of the MVGMM model parameters is also a two-step estimation ^[13] . First, the new observation data will be added through the k-means algorithm.

mean

Covariance

Initialize each Gaussian component. Using equations (5)-(8), the statistical weights of the newly added observation data can be calculated separately

mean

with covariance

其中，

ρ∈{w,m,v}用于平衡新旧数据对模型参数的影响程度，λ为归一化因子，r^ρ为参数的固有相关因子。新增数据更能体现当前环境内信号的分布状况，在自适应更新过程中，如下所示，平衡因子

ρ∈{w,m,v}更依赖于新增观测数据。in,

ρ∈{w,m,v} is used to balance the influence of old and new data on model parameters, λ is the normalization factor, and r ^ρ is the inherent correlation factor of the parameters. The newly added data can better reflect the distribution of signals in the current environment. During the adaptive update process, as shown below, the balance factor

ρ∈{w,m,v} is more dependent on the newly added observation data.

(1)输入：原始高斯组成元素数量C_k，原始高斯组成元素的权值

均值

协方差

新增分区Ω_k内采样数据

c∈{1,2,..,C_k}；(1) Input: the number of original Gaussian constituent elements C _k , the weight of the original Gaussian constituent elements

mean

Covariance

Added sampling data in partition Ω _k

c∈{1,2,.., _Ck };

(2)基于公式(5)，计算

(2) Based on formula (5), calculate

(3)基于公式(6)至(8)计算

基于公式(15)至(17)计算

(3) Calculated based on formulae (6) to (8)

Calculated based on equations (15) to (17)

(4)重复步骤(2)至(3)，直至

(4) Repeat steps (2) to (3) until

(5)输出：更新后高斯组成元素权值

均值

协方差

(5) Output: The updated Gaussian component weights

mean

Covariance

根据权利要求所包含的内容举例说明Illustrate according to what is contained in the claims

实施例1：人员位置检测Example 1: Person Location Detection

对室内人员的位置信息进行实时检测，需为每位目标人员配备智能手环，通过智能手环接收到的AP信号，确定目标位置。但由于室内环境复杂，且AP设备受时间影响发生变化。采用SMVGMM算法对AP信号在室内的分布情况进行拟合，能够提高定位精度、减少环境因素影响，在线更新阶段可通过自适应更新算法更新模型参数，以降低时间变化对离线指纹库的影响。To detect the location information of indoor personnel in real time, it is necessary to equip each target person with a smart bracelet, and determine the target position through the AP signal received by the smart bracelet. However, because the indoor environment is complex, and AP devices are affected by time and change. The SMVGMM algorithm is used to fit the indoor distribution of AP signals, which can improve the positioning accuracy and reduce the impact of environmental factors. In the online update stage, the model parameters can be updated through the adaptive update algorithm to reduce the impact of time changes on the offline fingerprint database.

试验场景为某学院C区某层环形走廊环境，选取移动运营商在学院内均匀铺设的WiFi路由器作为AP信号源。由于AP信号源主要铺设于走廊中，且单侧走廊区域相对开阔，信号受墙壁影响的差异性较小，故将物理结构相对连同且接收AP信号差异较小的单侧走廊区域划分为对应分区，则试验区域可划分为K＝4个分区。RP位置采用网格拓扑，并以走廊宽幅居中形式排列，相邻RP间隔1米，共计368个RP点，AP信号源与RP点排布平面图如图2所示。根据各分区内AP信号源的稳定性，选取区域1-4内AP信号源数量分别为{4,5,4,4}。为降低设备差异性对定位算法的影响，实验使用统一型号智能手机进行信号收集。The test scene is a ring corridor environment on a certain floor in the C area of a college, and the WiFi routers evenly laid by the mobile operator in the college are selected as the AP signal source. Since the AP signal source is mainly laid in the corridor, and the one-sided corridor area is relatively open, the difference of the signal affected by the wall is small, so the one-sided corridor area with relatively combined physical structure and small difference in receiving AP signals is divided into corresponding partitions , the test area can be divided into K=4 partitions. The RP location adopts a grid topology and is arranged in the form of a corridor width and center. The adjacent RPs are separated by 1 meter, with a total of 368 RP points. The layout of AP signal sources and RP points is shown in Figure 2. According to the stability of AP signal sources in each partition, the number of AP signal sources in areas 1-4 is selected to be {4, 5, 4, 4} respectively. In order to reduce the impact of device differences on the positioning algorithm, the experiment uses a uniform model of smart phone for signal collection.

在所有参考点处采集所有分区所选用共计12个AP的信号强度值，采样间隔为1.2s，共采集4.8s(避免设备因频率原因所导致的数据缓存)，根据第3.1小节所述过程构建离线指纹库。测试阶段，实验员手持同款智能手机沿试验区域行走一圈，行进至测试点处通过操作获取实时观测数据，并标记当前位置，测试过程中共获得184个测试点，间隔1米。Collect the signal strength values of a total of 12 APs selected for all partitions at all reference points, the sampling interval is 1.2s, and the total collection is 4.8s (to avoid data buffering caused by the frequency of the device), constructed according to the process described in Section 3.1 Offline fingerprint library. During the test phase, the experimenter walked around the test area with the same smartphone, and then traveled to the test point to obtain real-time observation data through operations, and marked the current position. During the test, a total of 184 test points were obtained, with an interval of 1 meter.

目标区域划分完毕后，可对采集参考点做分区标识，如图3中的(a)所示，通过采样数据与分区标识构建分区模型。基于分区模型，可对测试数据进行分区操作，分区效果如图3中的(b)所示，算法将需要启动二级判别准则的测试点划分为区域5以表示信号复杂区域。After the target area is divided, the collection reference point can be identified as a zone, as shown in (a) in Figure 3, a zone model is constructed by sampling data and zone identification. Based on the partition model, the test data can be partitioned, and the partition effect is shown in (b) in Figure 3. The algorithm divides the test points that need to activate the second-level discrimination criteria into area 5 to represent the complex signal area.

为验证该算法对RSS指纹地图的构建效果，图4给出目标区域内AP3信号的真实数值图与MVGMM模型和GP模型对其的估计效果图。利用MVGMM模型对目标区域内参考点位置与AP信号的联合分布进行拟合后，区域内AP3信号的RSS分布可使用公式(7)和(8)对相应位置进行调整来获得。试验在参考点的观测数据中提取AP3信号的RSS测量值，选取50％测试值作为训练数据，对模型进行训练，利用剩余50％测量值验证RSS分布效果。由于不同分区选用不用AP设施且AP3主要作用于分区一内，故图5给出了分区一中部分参考点处两种模型对AP3信号的拟合效果。通过比较实际测量值与模型估计值，可看出MVGMM模型通过多个高斯组成元素较好的拟合了AP信号在室内环境下的分布状况，尤其体现在分区一内。In order to verify the effect of the algorithm on the construction of the RSS fingerprint map, Figure 4 shows the real numerical map of the AP3 signal in the target area and the estimated effect of the MVGMM model and the GP model. After fitting the joint distribution of the reference point position and the AP signal in the target area by using the MVGMM model, the RSS distribution of the AP3 signal in the area can be obtained by adjusting the corresponding position using formulas (7) and (8). In the experiment, the RSS measurement value of AP3 signal was extracted from the observation data of the reference point, 50% of the test value was selected as the training data, the model was trained, and the remaining 50% of the measurement value was used to verify the RSS distribution effect. Since AP facilities are not used in different partitions and AP3 mainly acts in partition one, Figure 5 shows the fitting effect of the two models on AP3 signals at some reference points in partition one. By comparing the actual measured value and the model estimated value, it can be seen that the MVGMM model can better fit the distribution of AP signals in the indoor environment through multiple Gaussian components, especially in partition one.

基于已获取MVGMM模型与测试数据，将专利(简记为：SMVGMM)分别与传统WKNN算法，GP算法做对比，分析算法的定位精度。用户在目标区域内行进一圈，三种算法的位置估计对比图如图6所示。由图6可知，专利所得目标行进轨迹预测更为平滑，且相较于GP算法，其全程定位精度有所提高。图7则给出了三种算法在各测试点的误差值箱型图。由图可知，对于目标轨迹的预测，相比于GP算法，专利的全程定位精度提高了20％以上，近一步的印证了专利所得目标预测轨迹的平滑性。Based on the obtained MVGMM model and test data, the patent (abbreviated as: SMVGMM) is compared with the traditional WKNN algorithm and GP algorithm respectively, and the positioning accuracy of the algorithm is analyzed. The user travels in a circle in the target area, and the position estimation comparison diagram of the three algorithms is shown in Figure 6. It can be seen from Figure 6 that the target travel trajectory prediction obtained by the patent is smoother, and compared with the GP algorithm, its full-time positioning accuracy is improved. Figure 7 shows the box plots of the error values of the three algorithms at each test point. As can be seen from the figure, for the prediction of target trajectory, compared with the GP algorithm, the patent's full-time positioning accuracy is improved by more than 20%, which further confirms the smoothness of the target predicted trajectory obtained by the patent.

由于WKNN算法与其余两种算法在定位精度的巨大差异，图8仅给出了专利与GP算法位置估计误差的累计概率对比图。由图可以看出，专利初始的误差累计速度相比于GP算法较慢，整体效果优于GP算法，也从另一方面体现出专利通过狭小分区内AP信号间的相关性全面提升了传统算法的定位效果。Due to the huge difference in positioning accuracy between the WKNN algorithm and the other two algorithms, Figure 8 only shows the cumulative probability comparison of the position estimation error between the patent and the GP algorithm. It can be seen from the figure that the initial error accumulation speed of the patent is slower than that of the GP algorithm, and the overall effect is better than that of the GP algorithm. On the other hand, it shows that the patent comprehensively improves the traditional algorithm through the correlation between AP signals in a narrow partition. positioning effect.

为验证指纹库在线更新的效果与价值，实验分两次对目标区域进行数据采集(间隔一周时间)，利用第二次所采集数据对原始数据所构建MVGMM模型参数进行自适应更新。通过两次采集的测试数据，比较参数更新前后MVGMM模型对AP3信号的拟合效果。图9给出了参数更新前后模型对AP3信号的拟合状况，及其拟合误差。从图中可以看出，两次采集的测试数据在区域四存在较大差异，参数更新后模型对最新测试数据的拟合效果优于前次模型的拟合状况，尤其体现于区域四内。In order to verify the effect and value of the online update of the fingerprint database, the data was collected in the target area twice (with a one-week interval), and the parameters of the MVGMM model constructed from the original data were adaptively updated using the data collected for the second time. Through the test data collected twice, the fitting effect of the MVGMM model on the AP3 signal was compared before and after the parameter update. Figure 9 shows the fitting status of the model to the AP3 signal before and after the parameter update, and its fitting error. It can be seen from the figure that the test data collected twice are quite different in area 4, and the fitting effect of the model to the latest test data after parameter update is better than that of the previous model, especially in area 4.

Claims

1. A self-adaptive positioning fingerprint database construction method under a complex indoor signal environment is characterized by comprising three stages: the method comprises the following steps of an off-line stage, an on-line positioning stage and an on-line fingerprint database updating stage, wherein the specific steps are as follows:

step one, dividing target area grids and constructing an offline fingerprint database

The target region is divided into K regions omega_kK ∈ {1, 2.., K } component, which is the region Ω_kDivision into N_kEach grid, the geometric center of each grid is taken as a reference point

Where N ∈ {1, 2., N_k}，

Is 2 × N_kThe dimension position matrix represents the RP position; for each RP location

The corresponding region is marked as

Wherein

And is

i ≠ k, meaning that the reference point is located in the k partition; in that

Collected from M_kRSS sample value of AP

Wherein

Is shown in

The collected RSS sample value from the mth AP, M ∈ {1,2_k}；

Step two, constructing SVM partition model of online positioning stage

Setting a probability classification model of a support vector machine in a one-to-many mode, carrying out two-classification identification on each preset partition according to whether a target is located in the partition, and constructing an SVM model of each partition through training data; for given K partitions, K SVM models are set, and measurement signals of all reference APs in each partition are taken to form current observation data r ═ r₁,r₂,...,r_M]Wherein M is the number of APs in the target received area, and the value of the unreceived signal is-100 db; aiming at the current observation data, each partitioned SVM model gives the distribution probability p (y) of whether the target is positioned in the corresponding region^k1| r), wherein y^kIdentify for partition, indicate that the target is located in partition Ω_kInner, K ∈ {1, 2.. multidot.K }, distribution probability p (y) given by each partitioned SVM model^k1 r), performing primary judgment on a partition where the target is located, and taking the partition as a primary judgment basis;

dividing the reference point observation data obtained in the step one into a training set and a test set by adopting partition operation based on a probability SVM, and training a partition judgment model SVM; the probability value of the target in the corresponding partition is obtained through K SVM models, and the problem of misjudgment of the test point near the partition handover is solved by setting a secondary judgment basis; selecting 2 subarea areas with the maximum probability in the judged area, namely p (y)ⁱ1| r) and p (y)^j1| r), i, j ∈ {1,2ⁱ＝1|r)＞p(y^j1| r), the difference being expressed as:

Δy_p＝p(yⁱ＝1|r)-p(y^j＝1|r)， (1)

when Δ y_pWhen the value is more than delta y, the influence of the partition i on the test point is far larger than that of the partition j, and the reference point is determined in the partition i, wherein the delta y is a secondary determination threshold value; for Δ y_pIf the difference is less than delta y, both the areas are judged as target areas, corresponding area matching operation is respectively carried out, and the target positions obtained by respective subareas are subjected to probability averaging to obtain the final position estimation

Step three, constructing a multivariate Gaussian mixture model (MVGMM)

Establishing a multivariate Gaussian mixture model (MVGMM) by utilizing the correlation of different AP signals, and compensating the neglect of the coupling relation between the AP signals in the traditional fingerprint library construction work by continuously increasing the number of Gaussian elements and utilizing the probability density function weighting and approximate simulation of different parameters to simulate the joint distribution condition between the RP position in the subarea and the obtained AP signals; the probability distribution function of the multivariate gaussian mixture model is expressed as:

wherein C represents the number of constituent elements, p_N(x|μ_c,P_c) Represents the mean value of μ_cCovariance of P_cOf the weight w_cThe sum of (a) and (b) is 1;

based on equation (2), using partition omega_kThe posterior probability of the inner RP location and RSS signal value joint distribution represents the multivariate gaussian mixture model as:

wherein, y^k1 indicates that the target is in the kth partition, r indicates the RSS value of each AP signal received at reference point x,

is the weight of the constituent elements of the multivariate Gaussian mixture model,

is the average value of the elements and is the average value of the elements,

is the element covariance, C_kIs the number of constituent elements;

the MVGMM adopts an EM algorithm to estimate model parameters; MVGMM model selection C_kMean value of

Covariance of

Fitting the partition omega_kInner N_kA reference point

With the RSS value of each AP signal acquired in the corresponding partition

A joint probability distribution of (a); clustering sample data into C by using k-means algorithm_kTaking the mean value and covariance of each cluster to initialize EM algorithm parameters, wherein the initial weight settings of each cluster are the same; the log-likelihood form of the joint probability distribution is expressed as:

wherein z is^k＝[x^k；R^k]，

Hidden variable gamma_c,nRepresenting the probability that the nth sample value belongs to the c-th gaussian component element;

thereby determining observation data

The probability of belonging to the c-th gaussian component element is expressed as:

calculating the height of the multiple element by the formula (5)Weight of a gaussian function

Mean value

Sum covariance

Respectively as follows:

for different C_kThe value repeat clustering and EM estimation process offline fingerprint library is represented as (w)^k；μ^k；P^k) K ∈ {1, 2.., K }, and step four, positioning the target in the subarea

Estimating the target position in the area through an offline fingerprint database; if the target at the current moment accepts the RSS values of M APs as

Obtaining the distribution probability p (y) of the target in each subarea through a subarea judgment model_k＝1|r^now)，k∈{1,2,...,K}；

Using the current measured value r^nowSelecting corresponding sub-region omega_kAP signal measurement value effective for internal target positioning

Obtaining the posterior probability distribution of the target position under given observation data according to the conditional probability criterion of the multivariate Gaussian distribution:

based on the resulting off-line fingerprint library (w)^k；μ^k；P^k) Obtaining:

combining the partition probability with the formula (9) to obtain the target in the partition omega_kThe posterior distribution probability of each internal reference point is as follows:

target is located in partition Ω_kThe distribution weight of each reference point in the image is updated as follows:

wherein, λ is normalization factor, then the division is Ω_kThe distribution weight of the internal reference points as the target positions is

In summary, the position of the target is estimated as

Step five, updating the fingerprint database

When dividing into omega_kWhen the entropy of the obtained information is less than the partition threshold value, newly collecting N_kData of a person

Updating parameters of the partition model; based on the existing multivariate Gaussian mixture model, model parameters are adaptively adjusted through data acquisition so as to deduce an MVGMM model which has a closer coupling relation with the existing environment; consistent with the EM algorithm, the self-adaptive process of the MVGMM model parameters is also a two-step estimation; firstly, newly-added observation data is added through a k-means algorithm

Cluster as C_kClustering, and obtaining data weight by clustering

Mean value

Covariance

Initializing each Gaussian composition; respectively calculating the statistical weight of the newly added observation data by using the formulas (5) to (8)

Mean value

And covariance

Dividing omega by using statistic parameter of new data_kThe corresponding MVGMM model parameters are subjected to self-adaptive optimization, namely:

wherein,

used for balancing the influence degree of new and old data on model parameters, wherein lambda is a normalization factor, and r^ρIs an intrinsic correlation factor of the parameter;

sixthly, updating the model parameters

And (5) carrying out real-time updating on the model parameters obtained by updating in the self-adaptive process in the second step and the third step.