CN111757258A - An adaptive positioning fingerprint database construction method in complex indoor signal environment - Google Patents

An adaptive positioning fingerprint database construction method in complex indoor signal environment Download PDF

Info

Publication number
CN111757258A
CN111757258A CN202010639947.2A CN202010639947A CN111757258A CN 111757258 A CN111757258 A CN 111757258A CN 202010639947 A CN202010639947 A CN 202010639947A CN 111757258 A CN111757258 A CN 111757258A
Authority
CN
China
Prior art keywords
partition
target
model
probability
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010639947.2A
Other languages
Chinese (zh)
Other versions
CN111757258B (en
Inventor
秦宁宁
王超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangnan University
Original Assignee
Jiangnan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangnan University filed Critical Jiangnan University
Priority to CN202010639947.2A priority Critical patent/CN111757258B/en
Publication of CN111757258A publication Critical patent/CN111757258A/en
Application granted granted Critical
Publication of CN111757258B publication Critical patent/CN111757258B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/021Services related to particular areas, e.g. point of interest [POI] services, venue services or geofences
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/023Services making use of location information using mutual or relative location information between multiple location based services [LBS] targets or of distance thresholds
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/30Services specially adapted for particular environments, situations or purposes
    • H04W4/33Services specially adapted for particular environments, situations or purposes for indoor environments, e.g. buildings
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W64/00Locating users or terminals or network equipment for network management purposes, e.g. mobility management

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Remote Sensing (AREA)
  • Image Analysis (AREA)

Abstract

A self-adaptive positioning fingerprint database construction method under a complex indoor signal environment belongs to the technical field of indoor positioning. According to the AP laying position and the space structure, the system adopts a one-to-many support vector machine algorithm to perform partition operation on a target area so as to accurately determine the area range of signal change. A multivariate Gaussian mixture model based on mutual interference between signals is established by using the coupling relation between the signals in the narrow and small subareas so as to improve the reduction of the positioning precision caused by signal fluctuation. When the indoor environment changes, the self-adaptive updating algorithm based on the partition multivariate Gaussian mixture model can judge the credibility of fingerprint data of each partition, and updates the model parameters of the partitions with larger signal fluctuation by the self-adaptive algorithm, so that the coupling degree between the model and the existing environment is improved. The experimental result shows that the method can utilize relatively small amount of sample data to construct a stable and maintainable indoor signal distribution model, and compared with other algorithms, the positioning accuracy is improved to a certain extent.

Description

一种复杂室内信号环境下的自适应定位指纹库构建方法An adaptive positioning fingerprint database construction method in complex indoor signal environment

技术领域technical field

本发明涉及一种室内定位的离线指纹库构建优化技术,属于室内定位技术领域。The invention relates to an offline fingerprint database construction and optimization technology for indoor positioning, and belongs to the technical field of indoor positioning.

背景技术Background technique

全球卫星导航系统在室外环境下已被广泛应用于为人们提供位置服务,但信号的缺失也导致该系统无法在复杂的室内环境下发挥作用。WiFi设施的广泛铺设和智能手机的普及,使得基于接收信号强度(Received Signal Strength,RSS)值的室内定位系统,得到了大批研究人员的密切关注。然而,由于无线通讯设施的设计初衷并非为人们提供室内导航,因此如何降低环境波动对无线信号的不确定干扰导致的定位影响,已成为现有研究不得不面对和解决的难点。Global satellite navigation systems have been widely used to provide people with location services in outdoor environments, but the lack of signals also makes the system unable to function in complex indoor environments. The widespread deployment of WiFi facilities and the popularization of smartphones have made indoor positioning systems based on Received Signal Strength (RSS) values closely watched by a large number of researchers. However, since the original design of wireless communication facilities is not to provide people with indoor navigation, how to reduce the impact of environmental fluctuations on the positioning caused by the uncertain interference of wireless signals has become a difficulty that existing research has to face and solve.

基于已有无线设施的定位系统,其常见商用装置不具备自主可编辑功能,仅能提供室内通用RSS测量值,这使得传统基于到达时间差和到达距离差等方法,无法直接平移应用。利用测量信号与实际位置间匹配运算的指纹定位算法,弥补了无线设施所发送信号在时间和空间特性上的缺失,可有效实现信号环境与实际场景的映射,为基于RSS值的室内定位提供了可能。The positioning system based on the existing wireless facilities, its common commercial devices do not have the function of self-editing, and can only provide the indoor general RSS measurement value, which makes the traditional method based on the difference of arrival time and the difference of arrival distance, unable to directly translate applications. The fingerprint positioning algorithm using the matching operation between the measured signal and the actual position makes up for the lack of time and space characteristics of the signal sent by the wireless facility, which can effectively realize the mapping between the signal environment and the actual scene, and provide an indoor positioning based on the RSS value. possible.

发明内容SUMMARY OF THE INVENTION

对大型室内场景,上述方法将耗费大量人力物力,且易受环境因素影响,所构建的离线指纹库与实际场景中信号分布的映射关系也会因时间变化而减弱,需不断修正离线指纹库,以降低时间积累所造成的映射误差累计。For large indoor scenes, the above method will consume a lot of manpower and material resources, and is easily affected by environmental factors. The mapping relationship between the constructed offline fingerprint database and the signal distribution in the actual scene will also weaken due to time changes, and the offline fingerprint database needs to be continuously revised. In order to reduce the accumulation of mapping errors caused by time accumulation.

本发明的技术方案:Technical scheme of the present invention:

一种复杂室内信号环境下的自适应定位指纹库构建方法,包括三个阶段:离线阶段、在线定位阶段、在线指纹库更新阶段,具体步骤如下:A method for constructing an adaptive positioning fingerprint database in a complex indoor signal environment, including three stages: an offline stage, an online positioning stage, and an online fingerprint database update stage. The specific steps are as follows:

步骤一、划分目标区域网格及构建离线指纹库Step 1. Divide the target area grid and build an offline fingerprint library

目标区域由K个划分区域Ωk,k∈{1,2,...,K}组成,将区域Ωk划分为Nk个网格,取每个网格的几何中心作为参考点

Figure BDA0002571157280000011
其中n∈{1,2,...,Nk},
Figure BDA0002571157280000012
为2×Nk维位置矩阵表示RP位置。对于各RP位置
Figure BDA0002571157280000013
相应的区域标志为
Figure BDA0002571157280000014
其中
Figure BDA0002571157280000015
Figure BDA0002571157280000016
i≠k,表示参考点位于k分区。在
Figure BDA0002571157280000017
收集到的来自Mk个AP的RSS样本值为
Figure BDA0002571157280000018
其中
Figure BDA0002571157280000019
表示在
Figure BDA00025711572800000110
处收集到的来自第m个AP的RSS样本值,m∈{1,2,...,Mk}。The target area is composed of K divided areas Ω k , k∈{1,2,...,K}, the area Ω k is divided into N k grids, and the geometric center of each grid is taken as the reference point
Figure BDA0002571157280000011
where n∈ {1,2,...,Nk},
Figure BDA0002571157280000012
represents the RP position as a 2×N k -dimensional position matrix. For each RP location
Figure BDA0002571157280000013
The corresponding region flags are
Figure BDA0002571157280000014
in
Figure BDA0002571157280000015
and
Figure BDA0002571157280000016
i≠k, indicating that the reference point is located in the k partition. exist
Figure BDA0002571157280000017
The collected RSS sample values from M k APs are
Figure BDA0002571157280000018
in
Figure BDA0002571157280000019
expressed in
Figure BDA00025711572800000110
RSS sample values from the mth AP collected at m∈{1,2,..., Mk }.

步骤二、构建在线定位阶段的SVM分区模型Step 2. Build the SVM partition model in the online positioning stage

以一对多方式设置支持向量机概率分类模型,对于预先设置的各分区,以目标是否位于本分区进行二分类标识,通过训练数据构建各分区SVM模型。对于给定K个分区,设立K个SVM模型,取各分区内所有参考AP的测量信号组成当前观测数据r=[r1,r2,...,rM],其中M为目标所接收到区域内AP数量,对于未接收到的信号值取为-100db。针对当前观测数据,各分区SVM模型给出目标是否位于相应区域内的分布概率p(yk=1|r),其中yk为分区标识,表示目标位于分区Ωk内,k∈{1,2,...,K}。通过各分区SVM模型给出的分布概率p(yk=1|r),对目标所在分区做初步判断,并作为一级判断依据。The support vector machine probability classification model is set up in a one-to-many manner. For each pre-set partition, the two-class identification is performed according to whether the target is located in this partition, and the SVM model of each partition is constructed through the training data. For a given K partitions, K SVM models are established, and the measurement signals of all reference APs in each partition are taken to form the current observation data r=[r 1 , r 2 ,...,r M ], where M is received by the target The number of APs in the area, and the value of the unreceived signal is -100db. For the current observation data, each partition SVM model gives the distribution probability p(y k =1|r) of whether the target is located in the corresponding area, where y k is the partition identifier, indicating that the target is located in the partition Ω k , k∈{1, 2,...,K}. According to the distribution probability p (y k =1|r) given by the SVM model of each partition, a preliminary judgment is made on the partition where the target is located, and it is used as the first-level judgment basis.

采用基于概率SVM的分区操作,将步骤一所获取参考点观测数据划分为训练集与测试集,对分区判断模型SVM进行训练。通过K个SVM模型获取目标位于相应分区的概率值,通过设置二级判断依据,以克服对于分区交接附近的测试点误判断问题。选取被判定区域内2个最大概率的分区区域,即p(yi=1|r)与p(yj=1|r),i,j∈{1,2,...,K},且p(yi=1|r)>p(yj=1|r),其差值表示为:Using the partition operation based on probability SVM, the reference point observation data obtained in step 1 is divided into training set and test set, and the partition judgment model SVM is trained. The probability value that the target is located in the corresponding partition is obtained through K SVM models, and the second-level judgment basis is set to overcome the problem of misjudgment of the test points near the partition handover. Select two partitioned regions with the highest probability in the determined region, namely p(y i =1|r) and p(y j =1|r), i,j∈{1,2,...,K}, And p(y i =1|r)>p(y j =1|r), the difference is expressed as:

Δyp=p(yi=1|r)-p(yj=1|r), (1)Δy p =p(y i =1|r)-p(y j =1|r), (1)

当Δyp>Δy时,说明i分区对测试点影响力远大于j分区,则将参考点判定于i分区,其中Δy为二级判断阈值。对于Δyp<Δy,则将两区域均判定为目标区域,分别做相应的区域匹配运算,并将各自分区所得目标位置做概率平均,以求取最终位置估计When Δy p >Δy, it means that the influence of the i partition on the test point is much greater than that of the j partition, and the reference point is determined in the i partition, where Δy is the secondary judgment threshold. For Δy p < Δy, both regions are determined as target regions, and the corresponding region matching operations are performed respectively, and the probability average of the target positions obtained by the respective partitions is performed to obtain the final position estimate.

步骤三、构建多元高斯混合模型MVGMMStep 3. Build a multivariate Gaussian mixture model MVGMM

利用不同AP信号的相关性建立多元高斯混合模型MVGMM,通过不断增加高斯元素个数,利用不同参数的概率密度函数加权和近似模拟分区内RP位置与所获取各AP信号间的联合分布情况,弥补传统指纹库构建工作中对AP信号间耦合关系的忽略。多元高斯混合模型的概率分布函数表示为:The multivariate Gaussian mixture model MVGMM is established by using the correlation of different AP signals. By continuously increasing the number of Gaussian elements, the probability density function of different parameters is used to weight and approximate the joint distribution between the RP positions in the partition and the acquired AP signals to make up for The coupling relationship between AP signals is ignored in the construction of traditional fingerprint database. The probability distribution function of the multivariate Gaussian mixture model is expressed as:

Figure BDA0002571157280000021
Figure BDA0002571157280000021

其中,C表示组成元素的个数,pN(x|μc,Pc)表示均值为μc、协方差为Pc的高斯组成成分,权重wc的加和为1。Among them, C represents the number of constituent elements, p N (x|μ c , P c ) represents a Gaussian composition with a mean of μ c and a covariance of P c , and the sum of the weights w c is 1.

基于式(2),利用分区Ωk内RP位置与RSS信号值联合分布的后验概率将多元高斯混合模型表示为:Based on equation (2), the multivariate Gaussian mixture model is expressed as:

Figure BDA0002571157280000022
Figure BDA0002571157280000022

其中,yk=1表示目标处于第k个分区,r表示在参考点x处接收到各AP信号的RSS值,

Figure BDA0002571157280000023
为多元高斯混合模型的组成元素权重,
Figure BDA0002571157280000024
为元素均值,
Figure BDA0002571157280000025
为元素协方差,Ck为组成元素数量。Among them, y k =1 indicates that the target is in the kth partition, r indicates the RSS value of each AP signal received at the reference point x,
Figure BDA0002571157280000023
is the element weight of the multivariate Gaussian mixture model,
Figure BDA0002571157280000024
is the element mean,
Figure BDA0002571157280000025
is the element covariance, and C k is the number of constituent elements.

MVGMM模型采用EM算法估计模型参数。MVGMM模型选用Ck个均值为

Figure BDA0002571157280000026
协方差为
Figure BDA0002571157280000027
的多元高斯函数,拟合分区Ωk内Nk个参考点
Figure BDA0002571157280000028
与相应分区内所获取各AP信号RSS值
Figure BDA0002571157280000029
的联合概率分布。利用k-means算法将样本数据聚类为Ck个初始簇,取各簇平均值与协方差用于初始化EM算法参数且各簇初始权重设置相同。联合概率分布的对数似然形式表示为:The MVGMM model uses the EM algorithm to estimate the model parameters. The MVGMM model selects C k means as
Figure BDA0002571157280000026
The covariance is
Figure BDA0002571157280000027
The multivariate Gaussian function is fitted to N k reference points in the partition Ω k
Figure BDA0002571157280000028
and the RSS value of each AP signal obtained in the corresponding partition
Figure BDA0002571157280000029
The joint probability distribution of . The sample data is clustered into C k initial clusters by the k-means algorithm, and the mean and covariance of each cluster are taken to initialize the parameters of the EM algorithm, and the initial weights of each cluster are set the same. The log-likelihood form of the joint probability distribution is expressed as:

Figure BDA00025711572800000210
Figure BDA00025711572800000210

其中,zk=[xk;Rk],

Figure BDA00025711572800000211
隐变量γc,n表示第n个样本值属于第c个高斯组成元素的概率。where z k =[x k ; R k ],
Figure BDA00025711572800000211
The latent variable γc ,n represents the probability that the nth sample value belongs to the cth Gaussian component.

由此确定观测数据

Figure BDA0002571157280000031
属于第c个高斯组成元素的概率表示为:From this, the observed data are determined
Figure BDA0002571157280000031
The probability of belonging to the c-th Gaussian component is expressed as:

Figure BDA0002571157280000032
Figure BDA0002571157280000032

通过式(5)计算多元高斯函数的权重

Figure BDA0002571157280000033
均值
Figure BDA0002571157280000034
和协方差
Figure BDA0002571157280000035
分别为:Calculate the weight of the multivariate Gaussian function by formula (5)
Figure BDA0002571157280000033
mean
Figure BDA0002571157280000034
and covariance
Figure BDA0002571157280000035
They are:

Figure BDA0002571157280000036
Figure BDA0002571157280000036

Figure BDA0002571157280000037
Figure BDA0002571157280000037

Figure BDA0002571157280000038
Figure BDA0002571157280000038

对不同的Ck值重复聚类与EM估计过程离线指纹库表示为(wk;μk;Pk),其中k∈{1,2,..,K}。Repeating the clustering and EM estimation process for different values of C k The offline fingerprint library is denoted as (w k ; μ k ; P k ), where k∈{1,2,..,K}.

步骤四、定位分域内目标Step 4. Locate the target in the sub-domain

通过离线指纹库对区域内目标位置进行估计。若当前时刻目标接受M个AP的RSS值为

Figure BDA0002571157280000039
通过分区判断模型获得目标在各分区的分布概率p(yk=1|rnow),k∈{1,2,...,K}。The target location in the area is estimated through the offline fingerprint database. If the RSS value of M APs accepted by the target at the current moment is
Figure BDA0002571157280000039
The distribution probability p(y k =1|r now ) of the target in each partition is obtained through the partition judgment model, k∈{1,2,...,K}.

利用当前测量值rnow选取对应分区Ωk内对目标定位有效的AP信号测量值

Figure BDA00025711572800000310
根据多元高斯分布的条件概率准则,获得给定观测数据下目标位置的后验概率分布:Use the current measurement value r now to select the AP signal measurement value that is valid for target positioning in the corresponding partition Ω k
Figure BDA00025711572800000310
According to the conditional probability criterion of multivariate Gaussian distribution, the posterior probability distribution of the target position under the given observation data is obtained:

Figure BDA00025711572800000311
Figure BDA00025711572800000311

基于所得离线指纹库(wk;μk;Pk),得:Based on the obtained offline fingerprint database (w k ; μ k ; P k ), we have:

Figure BDA00025711572800000312
Figure BDA00025711572800000312

Figure BDA00025711572800000313
Figure BDA00025711572800000313

联合分区概率与式(9),获得目标在分区Ωk内各参考点的后验分布概率为:Combining the partition probability with equation (9), the posterior distribution probability of each reference point of the target in the partition Ω k is obtained as:

Figure BDA0002571157280000041
Figure BDA0002571157280000041

目标位于分区Ωk内各参考点的分布权重更新为:The distribution weight of each reference point in the partition Ω k is updated as:

Figure BDA0002571157280000042
Figure BDA0002571157280000042

其中,λ为归一化因子,则分区Ωk内各参考点为目标位置的分布权重为

Figure BDA0002571157280000043
Among them, λ is the normalization factor, then the distribution weight of each reference point in the partition Ω k as the target position is:
Figure BDA0002571157280000043

综上所述,目标的位置估计为In summary, the location of the target is estimated as

Figure BDA0002571157280000044
Figure BDA0002571157280000044

步骤五、指纹库更新Step 5. Update the fingerprint database

当分区Ωk中所得信息熵小于分区阈值时,通过新采集Nk个数据

Figure BDA0002571157280000045
对分区模型进行参数更新。基于现有多元高斯混合模型,通过采集数据对模型参数进行自适应调整,以推导出与现有环境具有更加紧密耦合关系的MVGMM模型。与EM算法一致,MVGMM模型参数的自适应过程也是一个两步估计。首先,通过k-means算法将新增观测数据
Figure BDA0002571157280000046
聚类为Ck组簇,并通过聚类所得数据权值
Figure BDA0002571157280000047
均值
Figure BDA0002571157280000048
协方差
Figure BDA0002571157280000049
初始化各高斯组成成分。利用式(5)-(8)分别计算新增观测数据的统计权重
Figure BDA00025711572800000410
均值
Figure BDA00025711572800000411
与协方差
Figure BDA00025711572800000412
When the information entropy obtained in the partition Ω k is less than the partition threshold, N k data are newly collected by
Figure BDA0002571157280000045
Parameter update for the partition model. Based on the existing multivariate Gaussian mixture model, the model parameters are adaptively adjusted by collecting data to derive a MVGMM model with a more tightly coupled relationship with the existing environment. Consistent with the EM algorithm, the adaptation process of the MVGMM model parameters is also a two-step estimation. First, the new observation data will be added through the k-means algorithm.
Figure BDA0002571157280000046
Clustering into C k groups of clusters, and data weights obtained by clustering
Figure BDA0002571157280000047
mean
Figure BDA0002571157280000048
Covariance
Figure BDA0002571157280000049
Initialize each Gaussian component. Use equations (5)-(8) to calculate the statistical weights of the newly added observation data respectively
Figure BDA00025711572800000410
mean
Figure BDA00025711572800000411
with covariance
Figure BDA00025711572800000412

利用新增数据的统计参数对分区Ωk所对应的MVGMM模型参数进行自适应优化,即:The MVGMM model parameters corresponding to the partition Ω k are adaptively optimized using the statistical parameters of the newly added data, namely:

Figure BDA00025711572800000413
Figure BDA00025711572800000413

Figure BDA00025711572800000414
Figure BDA00025711572800000414

Figure BDA00025711572800000415
Figure BDA00025711572800000415

其中,

Figure BDA00025711572800000416
ρ∈{w,m,v}用于平衡新旧数据对模型参数的影响程度,λ为归一化因子,rρ为参数的固有相关因子。in,
Figure BDA00025711572800000416
ρ∈{w,m,v} is used to balance the influence of old and new data on model parameters, λ is the normalization factor, and r ρ is the inherent correlation factor of the parameters.

步骤六、更新模型参数Step 6. Update model parameters

将自适应过程中更新得到的模型参数属于步骤二和步骤三中,进行实时更新。The model parameters updated in the adaptive process belong to the second and third steps, and are updated in real time.

本发明的有益效果:专利根据AP铺设位置与空间结构,系统采用一对多支持向量机算法对目标区域做分区操作,以精确信号变化的区域范围。利用狭小分区内信号间的耦合关系,建立基于信号间相互干扰的多元高斯混合模型,以改善信号波动所造成的定位精度下降。当室内环境发生变化时,基于分区多元高斯混合模型的自适应更新算法可对各分区指纹数据的可信度做出判断,并以自适应算法更新信号波动较大分区的模型参数,提高模型与现有环境间的耦合程度。Beneficial effects of the present invention: According to the AP laying position and spatial structure, the system adopts a one-to-many support vector machine algorithm to perform a partition operation on the target area, so as to accurately determine the area range of the signal change. Using the coupling relationship between signals in a narrow partition, a multivariate Gaussian mixture model based on mutual interference between signals is established to improve the positioning accuracy drop caused by signal fluctuations. When the indoor environment changes, the adaptive update algorithm based on the partitioned multivariate Gaussian mixture model can judge the reliability of the fingerprint data of each partition, and use the adaptive algorithm to update the model parameters of the partitions with large signal fluctuations, so as to improve the model and the accuracy of the model. The degree of coupling between existing environments.

附图说明Description of drawings

图1为本发明方法的流程图。Figure 1 is a flow chart of the method of the present invention.

图2为实验场景图。Figure 2 shows the experimental scene diagram.

图3为分区效果对比图,其中,(a)为区域划分情况,(b)为练后的区域划分效果图。Figure 3 is a comparison diagram of the effect of partitions, in which (a) is the situation of regional division, and (b) is the effect diagram of regional division after practice.

图4为RSS指纹地图构建效果对比图,其中,(a)为真实测量值,(b)为GP型估计,(c)为MVGMM模型估计。Figure 4 is a comparison diagram of the construction effect of RSS fingerprint map, in which (a) is the real measurement value, (b) is the GP-type estimation, and (c) is the MVGMM model estimation.

图5为分区一内AP3信号的拟合效果对比图。FIG. 5 is a comparison diagram of the fitting effect of AP3 signals in partition one.

图6为目标运动的轨迹预测对比图。FIG. 6 is a comparison diagram of trajectory prediction of target motion.

图7为轨迹估计误差箱型图。Figure 7 is a boxplot of trajectory estimation error.

图8为误差累计函数对比图。Figure 8 is a comparison diagram of the error accumulation function.

图9指纹库更新前后AP3数据拟合效果对比图。其中,(a)为AP3信号强度衰减值,(b)为AP3信号预测误差。Figure 9. Comparison of AP3 data fitting effects before and after the fingerprint database update. Among them, (a) is the AP3 signal strength attenuation value, (b) is the AP3 signal prediction error.

具体实施方式Detailed ways

针对大型室内场景下采样数据量大及维护成本高等问题,方法通过分区操作精确维护区域,并根据分区内信号间的耦合关系提出一种分区多元高斯混合模型(Multivariate Gaussian Mixture Model,MVGMM)以提高对信号分布的拟合程度。模型根据信号接入点(Access Point,AP)位置与物理连通结构对目标区域进行划分,并通过一对多支持向量机模型实现分区操作。在相对狭小的分区区域内,利用信号间存在的相互干扰分别建立多元高斯混合模型,以强化信号的拟合程度,最终达到改善分区定位精度的效果。当环境发生变化时,算法以信息熵作为分区数据更新判据,以及时响应分区变化对指纹库的影响,降低维护成本。从而在室内定位应用中,实现少量数据支撑高效可维护指纹库的构建。In view of the large amount of sampling data and high maintenance cost in large indoor scenes, the method accurately maintains the area through partition operation, and proposes a partitioned Multivariate Gaussian Mixture Model (MVGMM) according to the coupling relationship between signals in the partition to improve the The fit of the signal distribution. The model divides the target area according to the location of the signal access point (AP) and the physical connection structure, and realizes the partition operation through the one-to-many support vector machine model. In a relatively narrow partition area, the multivariate Gaussian mixture model is established by using the mutual interference between the signals to strengthen the fitting degree of the signals, and finally achieve the effect of improving the partition positioning accuracy. When the environment changes, the algorithm uses the information entropy as the partition data update criterion, and responds to the impact of the partition change on the fingerprint database in time to reduce the maintenance cost. Therefore, in indoor positioning applications, a small amount of data can support the construction of an efficient and maintainable fingerprint database.

基于多元高斯混合模型的目标定位系统,能够以较低的采集代价对目标进行精确定位。算法以传统WiFi设施中所获取测量信号作为训练数据,构建目标区域内AP信号的分布状况。与传统目标定位过程相比,系统由三个模块组成,包括离线阶段、在线定位阶段、在线指纹库更新阶段,如图1所示。The target localization system based on the multivariate Gaussian mixture model can accurately locate the target with low acquisition cost. The algorithm uses the measurement signals obtained in traditional WiFi facilities as training data to construct the distribution of AP signals in the target area. Compared with the traditional target positioning process, the system consists of three modules, including an offline stage, an online positioning stage, and an online fingerprint database update stage, as shown in Figure 1.

离线阶段主要是通过定位区域内各参考点(Reference Point,RP)位置与所接收到RSS信号值建立完整指纹数据库,并将其应用于在线阶段的匹配运算。在线定位阶段采用分区模型确定目标所在区域,利用多元高斯函数的条件概率准则,计算以当前观测数据为条件的目标位置后验概率,并通过WKNN估计目标位置。在线指纹库更新阶段则以后验分布概率计算所得信息熵作为指纹库更新的衡量尺度,并建立基于分区多元高斯混合模型的反馈调度指纹库。The offline stage mainly establishes a complete fingerprint database by locating the position of each reference point (Reference Point, RP) in the area and the received RSS signal value, and applies it to the matching operation in the online stage. In the online positioning stage, the partition model is used to determine the target area, and the conditional probability criterion of the multivariate Gaussian function is used to calculate the posterior probability of the target position conditioned on the current observation data, and estimate the target position through WKNN. In the online fingerprint database update stage, the information entropy calculated by the posterior distribution probability is used as the measure of the fingerprint database update, and a feedback scheduling fingerprint database based on the partitioned multivariate Gaussian mixture model is established.

具体如下:details as follows:

构建离线指纹库Build an offline fingerprint library

指纹收集方案Fingerprint Collection Scheme

目标区域由K个划分区域Ωk,k∈{1,2,...,K}组成,并将区域Ωk划分为Nk个网格,取每个网格的几何中心作为参考点

Figure BDA0002571157280000051
其中n∈{1,2,...,Nk},
Figure BDA0002571157280000052
为2×Nk维位置矩阵表示RP位置。对于各RP位置
Figure BDA0002571157280000053
相应的区域标志为
Figure BDA0002571157280000054
其中
Figure BDA0002571157280000055
Figure BDA0002571157280000056
i≠k,表示参考点位于k分区。在
Figure BDA0002571157280000057
收集到的来自Mk个AP的RSS样本值为
Figure BDA0002571157280000058
其中
Figure BDA0002571157280000059
表示在
Figure BDA00025711572800000510
处收集到的来自第m个AP的RSS样本值,m∈{1,2,...,Mk}。The target area consists of K partitioned areas Ω k , k∈{1,2,...,K}, and the area Ω k is divided into N k meshes, and the geometric center of each mesh is taken as the reference point
Figure BDA0002571157280000051
where n∈ {1,2,...,Nk},
Figure BDA0002571157280000052
represents the RP position as a 2×N k -dimensional position matrix. For each RP location
Figure BDA0002571157280000053
The corresponding region flags are
Figure BDA0002571157280000054
in
Figure BDA0002571157280000055
and
Figure BDA0002571157280000056
i≠k, indicating that the reference point is located in the k partition. exist
Figure BDA0002571157280000057
The collected RSS sample values from M k APs are
Figure BDA0002571157280000058
in
Figure BDA0002571157280000059
expressed in
Figure BDA00025711572800000510
RSS sample values from the mth AP collected at m∈{1,2,..., Mk }.

分区模型构建Partition Model Construction

综合考量区域分类的精确度与效率,以一对多方式设置支持向量机概率分类模型,可有效解决根据AP位置与物理连通所设置分区的分类问题[14]。对于预先设置的各分区,以目标是否位于本分区进行二分类标识,通过训练数据构建各分区SVM模型。对于给定K个分区,设立K个SVM模型,取各分区内所有参考AP的测量信号组成当前观测数据r=[r1,r2,...,rM],其中M为目标所接收到区域内AP数量,对于未接收到的信号值取为-100db。针对当前观测数据,各分区SVM模型可给出目标是否位于相应区域内的分布概率p(yk=1|r),其中yk为分区标识,表示目标位于分区Ωk内,k∈{1,2,...,K}。通过各分区SVM模型给出的分布概率p(yk=1|r),可对目标所在分区做初步判断,并作为一级判断依据Considering the accuracy and efficiency of regional classification, setting the support vector machine probability classification model in a one-to-many manner can effectively solve the classification problem of partitions set according to AP location and physical connectivity [14]. For each pre-set partition, the two-class identification is carried out according to whether the target is located in this partition, and the SVM model of each partition is constructed by training data. For a given K partitions, K SVM models are established, and the measurement signals of all reference APs in each partition are taken to form the current observation data r=[r 1 , r 2 ,...,r M ], where M is received by the target The number of APs in the area, and the value of the unreceived signal is -100db. For the current observation data, the SVM model of each partition can give the distribution probability p(y k =1|r) of whether the target is located in the corresponding area, where y k is the partition identifier, indicating that the target is located in the partition Ω k , k∈{1 ,2,...,K}. Through the distribution probability p(y k =1|r) given by the SVM model of each partition, a preliminary judgment can be made on the partition where the target is located, and it can be used as a first-level judgment basis

算法采用基于概率SVM的分区操作,将离线阶段所获取参考点观测数据划分为训练集与测试集,对分区判断模型进行训练。通过K个SVM模型可获取目标位于相应分区的概率值,但分区交接处信号分布复杂,易造成分区模型的判断失误。由此,通过设置二级判断依据,以克服对于分区交接附近的测试点误判断问题。选取被判定区域内2个最大概率的分区区域,即p(yi=1|r)与p(yj=1|r),i,j∈{1,2,...,K},且p(yi=1|r)>p(yj=1|r),其差值可表示为The algorithm adopts the partition operation based on probability SVM, divides the observation data of reference points obtained in the offline phase into training set and test set, and trains the partition judgment model. Through K SVM models, the probability value of the target located in the corresponding partition can be obtained, but the signal distribution at the junction of the partition is complex, which is easy to cause errors in the judgment of the partition model. Therefore, by setting a secondary judgment basis, the problem of misjudging the test points near the partition handover can be overcome. Select two partitioned regions with the highest probability in the determined region, namely p(y i =1|r) and p(y j =1|r), i,j∈{1,2,...,K}, And p(y i =1|r)>p(y j =1|r), the difference can be expressed as

Δyp=p(yi=1|r)-p(yj=1|r), (1)Δy p =p(y i =1|r)-p(y j =1|r), (1)

当Δyp>Δy时,说明i分区对测试点影响力远大于j分区,可将参考点判定于i分区,其中Δy为二级判断阈值。对于Δyp<Δy,则将两区域均判定为目标区域,可分别做相应的区域匹配运算,并将各自分区所得目标位置做概率平均,以求取最终位置估计When Δy p >Δy, it means that the influence of the i partition on the test point is much greater than that of the j partition, and the reference point can be determined in the i partition, where Δy is the secondary judgment threshold. For Δy p < Δy, both regions are determined as target regions, and corresponding region matching operations can be performed respectively, and the probability average of the target positions obtained by each partition is performed to obtain the final position estimate

多元高斯混合模型构建Multivariate Gaussian Mixture Model Construction

由于狭小区域内墙壁对电磁信号的衰减和反射导致天线的辐射模型并非定向均匀。不同AP在某一时刻的信号强度值也会因墙壁折射、人员遮挡、信道等问题出现相互关联。因此,简单地通过单一AP采样值则会割裂各AP信号间的相关性,导致模型构建中的较大失误。The radiation model of the antenna is not uniform in direction due to the attenuation and reflection of the electromagnetic signal by the walls in the narrow area. The signal strength values of different APs at a certain time are also correlated with each other due to wall refraction, personnel occlusion, and channel problems. Therefore, simply sampling the value of a single AP will split the correlation between the AP signals, resulting in a large error in model construction.

考虑到划分后狭小分区内信号间的相互干扰,可利用不同AP信号的相关性建立多元高斯混合模型,通过不断增加高斯元素个数,以利用不同参数的概率密度函数加权和近似分区内RP位置与所获取各AP信号间的联合概率密度分布,弥补常见工作中对AP信号间耦合关系的忽略。多元高斯混合模型的概率分布函数可表示为Considering the mutual interference between the signals in the narrow partition after division, a multivariate Gaussian mixture model can be established by using the correlation of different AP signals, and by continuously increasing the number of Gaussian elements, the probability density function of different parameters can be used to weight and approximate the RP position in the partition. The joint probability density distribution between the acquired AP signals makes up for the neglect of the coupling relationship between AP signals in common work. The probability distribution function of the multivariate Gaussian mixture model can be expressed as

Figure BDA0002571157280000061
Figure BDA0002571157280000061

其中,C表示组成元素的个数,pN(x|μc,Pc)表示均值为μc,协方差为Pc的高斯组成成分,权重wc的加和为1。Among them, C represents the number of constituent elements, p N (x|μ c , P c ) represents a Gaussian composition with a mean value of μ c , a covariance of P c , and the sum of the weights w c is 1.

基于式(2),利用分区Ωk内RP位置与RSS信号值联合分布的后验概率可将多元高斯混合模型表示为Based on equation (2), the multivariate Gaussian mixture model can be expressed as

Figure BDA0002571157280000062
Figure BDA0002571157280000062

其中,yk=1表示目标处于第k个分区,r表示在参考点x处接收到各AP信号的RSS值,

Figure BDA0002571157280000063
为多元高斯混合模型的组成元素权重,
Figure BDA0002571157280000064
为元素均值,
Figure BDA0002571157280000065
为元素协方差,Ck为组成元素数量。Among them, y k =1 indicates that the target is in the kth partition, r indicates the RSS value of each AP signal received at the reference point x,
Figure BDA0002571157280000063
is the element weight of the multivariate Gaussian mixture model,
Figure BDA0002571157280000064
is the element mean,
Figure BDA0002571157280000065
is the element covariance, and C k is the number of constituent elements.

为提高MVGMM模型对于目标区域内所采集样本数据的拟合效果,采用EM算法估计模型参数。MVGMM模型选用Ck个均值为

Figure BDA0002571157280000071
协方差为
Figure BDA0002571157280000072
的多元高斯函数,拟合分区Ωk内Nk个参考点
Figure BDA0002571157280000073
与所获取各AP信号RSS值
Figure BDA0002571157280000074
的联合概率分布。利用k-means算法将样本数据聚类为Ck个初始簇,取各簇平均值与协方差用于初始化EM算法参数且各簇初始权重设置相同。联合概率分布的对数似然形式可表示为In order to improve the fitting effect of the MVGMM model to the sample data collected in the target area, the EM algorithm is used to estimate the model parameters. The MVGMM model selects C k means as
Figure BDA0002571157280000071
The covariance is
Figure BDA0002571157280000072
The multivariate Gaussian function is fitted to N k reference points in the partition Ω k
Figure BDA0002571157280000073
and the obtained RSS value of each AP signal
Figure BDA0002571157280000074
The joint probability distribution of . The sample data is clustered into C k initial clusters by the k-means algorithm, and the mean and covariance of each cluster are taken to initialize the parameters of the EM algorithm, and the initial weights of each cluster are set the same. The log-likelihood form of the joint probability distribution can be expressed as

Figure BDA0002571157280000075
Figure BDA0002571157280000075

其中,zk=[xk;Rk],

Figure BDA0002571157280000076
隐变量γc,n表示第n个样本值属于第c个高斯组成元素的概率。where z k =[x k ; R k ],
Figure BDA0002571157280000076
The latent variable γc ,n represents the probability that the nth sample value belongs to the cth Gaussian component.

由此确定观测数据

Figure BDA0002571157280000077
属于第c个高斯组成元素的概率可表示为From this, the observed data are determined
Figure BDA0002571157280000077
The probability of belonging to the c-th Gaussian component can be expressed as

Figure BDA0002571157280000078
Figure BDA0002571157280000078

通过式(5)可计算多元高斯函数的权重

Figure BDA0002571157280000079
均值
Figure BDA00025711572800000710
和协方差
Figure BDA00025711572800000711
分别为:The weight of the multivariate Gaussian function can be calculated by formula (5)
Figure BDA0002571157280000079
mean
Figure BDA00025711572800000710
and covariance
Figure BDA00025711572800000711
They are:

Figure BDA00025711572800000712
Figure BDA00025711572800000712

Figure BDA00025711572800000713
Figure BDA00025711572800000713

Figure BDA00025711572800000714
Figure BDA00025711572800000714

对不同的Ck值重复聚类与EM估计过程:Repeat the clustering and EM estimation process for different values of Ck :

(1)输入:高斯组成元素数量Ck,高斯组成元素的初始权值

Figure BDA00025711572800000715
初始均值
Figure BDA00025711572800000716
初始协方差
Figure BDA00025711572800000717
分区Ωk内参考点位置
Figure BDA00025711572800000718
与相应采样值
Figure BDA00025711572800000719
c∈{1,2,..,Ck}(1) Input: the number of Gaussian constituent elements C k , the initial weight of the Gaussian constituent elements
Figure BDA00025711572800000715
initial mean
Figure BDA00025711572800000716
initial covariance
Figure BDA00025711572800000717
Position of reference point in partition Ω k
Figure BDA00025711572800000718
with the corresponding sample value
Figure BDA00025711572800000719
c∈{1,2,..,C k }

(2)基于公式(5)计算

Figure BDA00025711572800000720
(2) Calculated based on formula (5)
Figure BDA00025711572800000720

(3)基于公式(6)至(8),计算wc

Figure BDA00025711572800000721
(3) Based on formulae (6) to (8), calculate w c ,
Figure BDA00025711572800000721

(4)重复步骤(2)至(3),直至

Figure BDA00025711572800000722
(4) Repeat steps (2) to (3) until
Figure BDA00025711572800000722

(5)输出:高斯组成元素权值

Figure BDA0002571157280000081
均值
Figure BDA0002571157280000082
协方差
Figure BDA0002571157280000083
(5) Output: Gaussian composition element weights
Figure BDA0002571157280000081
mean
Figure BDA0002571157280000082
Covariance
Figure BDA0002571157280000083

基于上述分析可得,离线指纹库可表示为(wk;μk;Pk),其中k∈{1,2,..,K}。Based on the above analysis, the offline fingerprint library can be expressed as (w k ; μ k ; P k ), where k∈{1,2,..,K}.

在线阶段:目标定位Online Phase: Targeting

利用所采集样本数据构建关于目标区域的离线指纹库,通过该指纹库可对区域内目标位置进行估计。若当前时刻目标接受M个AP的RSS值为

Figure BDA0002571157280000084
通过分区判断模型可得目标在各分区的分布概率p(yk=1|rnow),k∈{1,2,...,K}。The collected sample data is used to build an offline fingerprint database about the target area, through which the target position in the area can be estimated. If the RSS value of M APs accepted by the target at the current moment is
Figure BDA0002571157280000084
Through the partition judgment model, the distribution probability p(y k =1|r now ) of the target in each partition can be obtained, k∈{1,2,...,K}.

利用当前测量值rnow可选取对应分区Ωk内对目标定位有效的AP信号测量值

Figure BDA0002571157280000085
根据多元高斯分布的条件概率准则,可得给定观测数据下目标位置的后验概率分布Using the current measurement value r now , the AP signal measurement value that is valid for target positioning in the corresponding partition Ω k can be selected
Figure BDA0002571157280000085
According to the conditional probability criterion of multivariate Gaussian distribution, the posterior probability distribution of the target position under the given observation data can be obtained

Figure BDA0002571157280000086
Figure BDA0002571157280000086

其中,基于所得离线指纹库(wk;μk;Pk),可得Among them, based on the obtained offline fingerprint database (w k ; μ k ; P k ), we can get

Figure BDA0002571157280000087
Figure BDA0002571157280000087

Figure BDA0002571157280000088
Figure BDA0002571157280000088

联合分区概率与式(9),可得目标在分区Ωk内各参考点的后验分布概率为Combining the partition probability with equation (9), the posterior distribution probability of each reference point of the target in the partition Ω k can be obtained as

Figure BDA0002571157280000089
Figure BDA0002571157280000089

目标位于分区Ωk内各参考点的分布权重可更新为The distribution weight of each reference point in the partition Ω k can be updated as

Figure BDA00025711572800000810
Figure BDA00025711572800000810

其中,λ为归一化因子,则分区Ωk内各参考点为目标位置的分布权重为

Figure BDA00025711572800000811
Among them, λ is the normalization factor, then the distribution weight of each reference point in the partition Ω k as the target position is:
Figure BDA00025711572800000811

综上所述,目标的位置估计为In summary, the location of the target is estimated as

Figure BDA00025711572800000812
Figure BDA00025711572800000812

在线阶段:指纹库更新Online Phase: Fingerprint Database Update

当分区Ωk中所得信息熵小于分区阈值时,通过新采集Nk个数据

Figure BDA00025711572800000813
对分区模型进行参数更新。基于现有多元高斯混合模型,通过采集数据可对模型参数进行自适应调整,以推导出与现有环境具有更加紧密耦合关系的MVGMM模型。与EM算法一致,MVGMM模型参数的自适应过程也是一个两步估计[13]。首先,通过k-means算法将新增观测数据
Figure BDA00025711572800000814
聚类为Ck组簇,并通过聚类所得数据权值
Figure BDA00025711572800000815
均值
Figure BDA00025711572800000816
协方差
Figure BDA00025711572800000817
初始化各高斯组成成分。利用式(5)-(8)可分别计算新增观测数据的统计权重
Figure BDA0002571157280000091
均值
Figure BDA0002571157280000092
与协方差
Figure BDA0002571157280000093
When the information entropy obtained in the partition Ω k is less than the partition threshold, N k data are newly collected by
Figure BDA00025711572800000813
Parameter update for the partition model. Based on the existing multivariate Gaussian mixture model, the model parameters can be adaptively adjusted by collecting data to derive the MVGMM model with a more tightly coupled relationship with the existing environment. Consistent with the EM algorithm, the adaptation process of the MVGMM model parameters is also a two-step estimation [13] . First, the new observation data will be added through the k-means algorithm.
Figure BDA00025711572800000814
Clustering into C k groups of clusters, and data weights obtained by clustering
Figure BDA00025711572800000815
mean
Figure BDA00025711572800000816
Covariance
Figure BDA00025711572800000817
Initialize each Gaussian component. Using equations (5)-(8), the statistical weights of the newly added observation data can be calculated separately
Figure BDA0002571157280000091
mean
Figure BDA0002571157280000092
with covariance
Figure BDA0002571157280000093

利用新增数据的统计参数对分区Ωk所对应的MVGMM模型参数进行自适应优化,即:The MVGMM model parameters corresponding to the partition Ω k are adaptively optimized using the statistical parameters of the newly added data, namely:

Figure BDA0002571157280000094
Figure BDA0002571157280000094

Figure BDA0002571157280000095
Figure BDA0002571157280000095

Figure BDA0002571157280000096
Figure BDA0002571157280000096

其中,

Figure BDA0002571157280000097
ρ∈{w,m,v}用于平衡新旧数据对模型参数的影响程度,λ为归一化因子,rρ为参数的固有相关因子。新增数据更能体现当前环境内信号的分布状况,在自适应更新过程中,如下所示,平衡因子
Figure BDA0002571157280000098
ρ∈{w,m,v}更依赖于新增观测数据。in,
Figure BDA0002571157280000097
ρ∈{w,m,v} is used to balance the influence of old and new data on model parameters, λ is the normalization factor, and r ρ is the inherent correlation factor of the parameters. The newly added data can better reflect the distribution of signals in the current environment. During the adaptive update process, as shown below, the balance factor
Figure BDA0002571157280000098
ρ∈{w,m,v} is more dependent on the newly added observation data.

(1)输入:原始高斯组成元素数量Ck,原始高斯组成元素的权值

Figure BDA0002571157280000099
均值
Figure BDA00025711572800000910
协方差
Figure BDA00025711572800000911
新增分区Ωk内采样数据
Figure BDA00025711572800000912
c∈{1,2,..,Ck};(1) Input: the number of original Gaussian constituent elements C k , the weight of the original Gaussian constituent elements
Figure BDA0002571157280000099
mean
Figure BDA00025711572800000910
Covariance
Figure BDA00025711572800000911
Added sampling data in partition Ω k
Figure BDA00025711572800000912
c∈{1,2,.., Ck };

(2)基于公式(5),计算

Figure BDA00025711572800000913
(2) Based on formula (5), calculate
Figure BDA00025711572800000913

(3)基于公式(6)至(8)计算

Figure BDA00025711572800000914
基于公式(15)至(17)计算
Figure BDA00025711572800000915
Figure BDA00025711572800000916
(3) Calculated based on formulae (6) to (8)
Figure BDA00025711572800000914
Calculated based on equations (15) to (17)
Figure BDA00025711572800000915
Figure BDA00025711572800000916

(4)重复步骤(2)至(3),直至

Figure BDA00025711572800000917
(4) Repeat steps (2) to (3) until
Figure BDA00025711572800000917

(5)输出:更新后高斯组成元素权值

Figure BDA00025711572800000918
均值
Figure BDA00025711572800000919
协方差
Figure BDA00025711572800000920
(5) Output: The updated Gaussian component weights
Figure BDA00025711572800000918
mean
Figure BDA00025711572800000919
Covariance
Figure BDA00025711572800000920

根据权利要求所包含的内容举例说明Illustrate according to what is contained in the claims

实施例1:人员位置检测Example 1: Person Location Detection

对室内人员的位置信息进行实时检测,需为每位目标人员配备智能手环,通过智能手环接收到的AP信号,确定目标位置。但由于室内环境复杂,且AP设备受时间影响发生变化。采用SMVGMM算法对AP信号在室内的分布情况进行拟合,能够提高定位精度、减少环境因素影响,在线更新阶段可通过自适应更新算法更新模型参数,以降低时间变化对离线指纹库的影响。To detect the location information of indoor personnel in real time, it is necessary to equip each target person with a smart bracelet, and determine the target position through the AP signal received by the smart bracelet. However, because the indoor environment is complex, and AP devices are affected by time and change. The SMVGMM algorithm is used to fit the indoor distribution of AP signals, which can improve the positioning accuracy and reduce the impact of environmental factors. In the online update stage, the model parameters can be updated through the adaptive update algorithm to reduce the impact of time changes on the offline fingerprint database.

试验场景为某学院C区某层环形走廊环境,选取移动运营商在学院内均匀铺设的WiFi路由器作为AP信号源。由于AP信号源主要铺设于走廊中,且单侧走廊区域相对开阔,信号受墙壁影响的差异性较小,故将物理结构相对连同且接收AP信号差异较小的单侧走廊区域划分为对应分区,则试验区域可划分为K=4个分区。RP位置采用网格拓扑,并以走廊宽幅居中形式排列,相邻RP间隔1米,共计368个RP点,AP信号源与RP点排布平面图如图2所示。根据各分区内AP信号源的稳定性,选取区域1-4内AP信号源数量分别为{4,5,4,4}。为降低设备差异性对定位算法的影响,实验使用统一型号智能手机进行信号收集。The test scene is a ring corridor environment on a certain floor in the C area of a college, and the WiFi routers evenly laid by the mobile operator in the college are selected as the AP signal source. Since the AP signal source is mainly laid in the corridor, and the one-sided corridor area is relatively open, the difference of the signal affected by the wall is small, so the one-sided corridor area with relatively combined physical structure and small difference in receiving AP signals is divided into corresponding partitions , the test area can be divided into K=4 partitions. The RP location adopts a grid topology and is arranged in the form of a corridor width and center. The adjacent RPs are separated by 1 meter, with a total of 368 RP points. The layout of AP signal sources and RP points is shown in Figure 2. According to the stability of AP signal sources in each partition, the number of AP signal sources in areas 1-4 is selected to be {4, 5, 4, 4} respectively. In order to reduce the impact of device differences on the positioning algorithm, the experiment uses a uniform model of smart phone for signal collection.

在所有参考点处采集所有分区所选用共计12个AP的信号强度值,采样间隔为1.2s,共采集4.8s(避免设备因频率原因所导致的数据缓存),根据第3.1小节所述过程构建离线指纹库。测试阶段,实验员手持同款智能手机沿试验区域行走一圈,行进至测试点处通过操作获取实时观测数据,并标记当前位置,测试过程中共获得184个测试点,间隔1米。Collect the signal strength values of a total of 12 APs selected for all partitions at all reference points, the sampling interval is 1.2s, and the total collection is 4.8s (to avoid data buffering caused by the frequency of the device), constructed according to the process described in Section 3.1 Offline fingerprint library. During the test phase, the experimenter walked around the test area with the same smartphone, and then traveled to the test point to obtain real-time observation data through operations, and marked the current position. During the test, a total of 184 test points were obtained, with an interval of 1 meter.

目标区域划分完毕后,可对采集参考点做分区标识,如图3中的(a)所示,通过采样数据与分区标识构建分区模型。基于分区模型,可对测试数据进行分区操作,分区效果如图3中的(b)所示,算法将需要启动二级判别准则的测试点划分为区域5以表示信号复杂区域。After the target area is divided, the collection reference point can be identified as a zone, as shown in (a) in Figure 3, a zone model is constructed by sampling data and zone identification. Based on the partition model, the test data can be partitioned, and the partition effect is shown in (b) in Figure 3. The algorithm divides the test points that need to activate the second-level discrimination criteria into area 5 to represent the complex signal area.

为验证该算法对RSS指纹地图的构建效果,图4给出目标区域内AP3信号的真实数值图与MVGMM模型和GP模型对其的估计效果图。利用MVGMM模型对目标区域内参考点位置与AP信号的联合分布进行拟合后,区域内AP3信号的RSS分布可使用公式(7)和(8)对相应位置进行调整来获得。试验在参考点的观测数据中提取AP3信号的RSS测量值,选取50%测试值作为训练数据,对模型进行训练,利用剩余50%测量值验证RSS分布效果。由于不同分区选用不用AP设施且AP3主要作用于分区一内,故图5给出了分区一中部分参考点处两种模型对AP3信号的拟合效果。通过比较实际测量值与模型估计值,可看出MVGMM模型通过多个高斯组成元素较好的拟合了AP信号在室内环境下的分布状况,尤其体现在分区一内。In order to verify the effect of the algorithm on the construction of the RSS fingerprint map, Figure 4 shows the real numerical map of the AP3 signal in the target area and the estimated effect of the MVGMM model and the GP model. After fitting the joint distribution of the reference point position and the AP signal in the target area by using the MVGMM model, the RSS distribution of the AP3 signal in the area can be obtained by adjusting the corresponding position using formulas (7) and (8). In the experiment, the RSS measurement value of AP3 signal was extracted from the observation data of the reference point, 50% of the test value was selected as the training data, the model was trained, and the remaining 50% of the measurement value was used to verify the RSS distribution effect. Since AP facilities are not used in different partitions and AP3 mainly acts in partition one, Figure 5 shows the fitting effect of the two models on AP3 signals at some reference points in partition one. By comparing the actual measured value and the model estimated value, it can be seen that the MVGMM model can better fit the distribution of AP signals in the indoor environment through multiple Gaussian components, especially in partition one.

基于已获取MVGMM模型与测试数据,将专利(简记为:SMVGMM)分别与传统WKNN算法,GP算法做对比,分析算法的定位精度。用户在目标区域内行进一圈,三种算法的位置估计对比图如图6所示。由图6可知,专利所得目标行进轨迹预测更为平滑,且相较于GP算法,其全程定位精度有所提高。图7则给出了三种算法在各测试点的误差值箱型图。由图可知,对于目标轨迹的预测,相比于GP算法,专利的全程定位精度提高了20%以上,近一步的印证了专利所得目标预测轨迹的平滑性。Based on the obtained MVGMM model and test data, the patent (abbreviated as: SMVGMM) is compared with the traditional WKNN algorithm and GP algorithm respectively, and the positioning accuracy of the algorithm is analyzed. The user travels in a circle in the target area, and the position estimation comparison diagram of the three algorithms is shown in Figure 6. It can be seen from Figure 6 that the target travel trajectory prediction obtained by the patent is smoother, and compared with the GP algorithm, its full-time positioning accuracy is improved. Figure 7 shows the box plots of the error values of the three algorithms at each test point. As can be seen from the figure, for the prediction of target trajectory, compared with the GP algorithm, the patent's full-time positioning accuracy is improved by more than 20%, which further confirms the smoothness of the target predicted trajectory obtained by the patent.

由于WKNN算法与其余两种算法在定位精度的巨大差异,图8仅给出了专利与GP算法位置估计误差的累计概率对比图。由图可以看出,专利初始的误差累计速度相比于GP算法较慢,整体效果优于GP算法,也从另一方面体现出专利通过狭小分区内AP信号间的相关性全面提升了传统算法的定位效果。Due to the huge difference in positioning accuracy between the WKNN algorithm and the other two algorithms, Figure 8 only shows the cumulative probability comparison of the position estimation error between the patent and the GP algorithm. It can be seen from the figure that the initial error accumulation speed of the patent is slower than that of the GP algorithm, and the overall effect is better than that of the GP algorithm. On the other hand, it shows that the patent comprehensively improves the traditional algorithm through the correlation between AP signals in a narrow partition. positioning effect.

为验证指纹库在线更新的效果与价值,实验分两次对目标区域进行数据采集(间隔一周时间),利用第二次所采集数据对原始数据所构建MVGMM模型参数进行自适应更新。通过两次采集的测试数据,比较参数更新前后MVGMM模型对AP3信号的拟合效果。图9给出了参数更新前后模型对AP3信号的拟合状况,及其拟合误差。从图中可以看出,两次采集的测试数据在区域四存在较大差异,参数更新后模型对最新测试数据的拟合效果优于前次模型的拟合状况,尤其体现于区域四内。In order to verify the effect and value of the online update of the fingerprint database, the data was collected in the target area twice (with a one-week interval), and the parameters of the MVGMM model constructed from the original data were adaptively updated using the data collected for the second time. Through the test data collected twice, the fitting effect of the MVGMM model on the AP3 signal was compared before and after the parameter update. Figure 9 shows the fitting status of the model to the AP3 signal before and after the parameter update, and its fitting error. It can be seen from the figure that the test data collected twice are quite different in area 4, and the fitting effect of the model to the latest test data after parameter update is better than that of the previous model, especially in area 4.

Claims (1)

1. A self-adaptive positioning fingerprint database construction method under a complex indoor signal environment is characterized by comprising three stages: the method comprises the following steps of an off-line stage, an on-line positioning stage and an on-line fingerprint database updating stage, wherein the specific steps are as follows:
step one, dividing target area grids and constructing an offline fingerprint database
The target region is divided into K regions omegakK ∈ {1, 2.., K } component, which is the region ΩkDivision into NkEach grid, the geometric center of each grid is taken as a reference point
Figure FDA0002571157270000011
Where N ∈ {1, 2., Nk},
Figure FDA0002571157270000012
Is 2 × NkThe dimension position matrix represents the RP position; for each RP location
Figure FDA0002571157270000013
The corresponding region is marked as
Figure FDA0002571157270000014
Wherein
Figure FDA0002571157270000015
And is
Figure FDA0002571157270000016
i ≠ k, meaning that the reference point is located in the k partition; in that
Figure FDA0002571157270000017
Collected from MkRSS sample value of AP
Figure FDA0002571157270000018
Wherein
Figure FDA0002571157270000019
Is shown in
Figure FDA00025711572700000110
The collected RSS sample value from the mth AP, M ∈ {1,2k};
Step two, constructing SVM partition model of online positioning stage
Setting a probability classification model of a support vector machine in a one-to-many mode, carrying out two-classification identification on each preset partition according to whether a target is located in the partition, and constructing an SVM model of each partition through training data; for given K partitions, K SVM models are set, and measurement signals of all reference APs in each partition are taken to form current observation data r ═ r1,r2,...,rM]Wherein M is the number of APs in the target received area, and the value of the unreceived signal is-100 db; aiming at the current observation data, each partitioned SVM model gives the distribution probability p (y) of whether the target is positioned in the corresponding regionk1| r), wherein ykIdentify for partition, indicate that the target is located in partition ΩkInner, K ∈ {1, 2.. multidot.K }, distribution probability p (y) given by each partitioned SVM modelk1 r), performing primary judgment on a partition where the target is located, and taking the partition as a primary judgment basis;
dividing the reference point observation data obtained in the step one into a training set and a test set by adopting partition operation based on a probability SVM, and training a partition judgment model SVM; the probability value of the target in the corresponding partition is obtained through K SVM models, and the problem of misjudgment of the test point near the partition handover is solved by setting a secondary judgment basis; selecting 2 subarea areas with the maximum probability in the judged area, namely p (y)i1| r) and p (y)j1| r), i, j ∈ {1,2i=1|r)>p(yj1| r), the difference being expressed as:
Δyp=p(yi=1|r)-p(yj=1|r), (1)
when Δ ypWhen the value is more than delta y, the influence of the partition i on the test point is far larger than that of the partition j, and the reference point is determined in the partition i, wherein the delta y is a secondary determination threshold value; for Δ ypIf the difference is less than delta y, both the areas are judged as target areas, corresponding area matching operation is respectively carried out, and the target positions obtained by respective subareas are subjected to probability averaging to obtain the final position estimation
Step three, constructing a multivariate Gaussian mixture model (MVGMM)
Establishing a multivariate Gaussian mixture model (MVGMM) by utilizing the correlation of different AP signals, and compensating the neglect of the coupling relation between the AP signals in the traditional fingerprint library construction work by continuously increasing the number of Gaussian elements and utilizing the probability density function weighting and approximate simulation of different parameters to simulate the joint distribution condition between the RP position in the subarea and the obtained AP signals; the probability distribution function of the multivariate gaussian mixture model is expressed as:
Figure FDA00025711572700000111
wherein C represents the number of constituent elements, pN(x|μc,Pc) Represents the mean value of μcCovariance of PcOf the weight wcThe sum of (a) and (b) is 1;
based on equation (2), using partition omegakThe posterior probability of the inner RP location and RSS signal value joint distribution represents the multivariate gaussian mixture model as:
Figure FDA0002571157270000021
wherein, yk1 indicates that the target is in the kth partition, r indicates the RSS value of each AP signal received at reference point x,
Figure FDA0002571157270000022
Figure FDA0002571157270000023
is the weight of the constituent elements of the multivariate Gaussian mixture model,
Figure FDA0002571157270000024
is the average value of the elements and is the average value of the elements,
Figure FDA0002571157270000025
is the element covariance, CkIs the number of constituent elements;
the MVGMM adopts an EM algorithm to estimate model parameters; MVGMM model selection CkMean value of
Figure FDA0002571157270000026
Covariance of
Figure FDA0002571157270000027
Fitting the partition omegakInner NkA reference point
Figure FDA0002571157270000028
With the RSS value of each AP signal acquired in the corresponding partition
Figure FDA0002571157270000029
A joint probability distribution of (a); clustering sample data into C by using k-means algorithmkTaking the mean value and covariance of each cluster to initialize EM algorithm parameters, wherein the initial weight settings of each cluster are the same; the log-likelihood form of the joint probability distribution is expressed as:
Figure FDA00025711572700000210
wherein z isk=[xk;Rk],
Figure FDA00025711572700000211
Hidden variable gammac,nRepresenting the probability that the nth sample value belongs to the c-th gaussian component element;
thereby determining observation data
Figure FDA00025711572700000212
The probability of belonging to the c-th gaussian component element is expressed as:
Figure FDA00025711572700000213
calculating the height of the multiple element by the formula (5)Weight of a gaussian function
Figure FDA00025711572700000214
Mean value
Figure FDA00025711572700000215
Sum covariance
Figure FDA00025711572700000216
Respectively as follows:
Figure FDA00025711572700000217
Figure FDA00025711572700000218
Figure FDA0002571157270000031
for different CkThe value repeat clustering and EM estimation process offline fingerprint library is represented as (w)k;μk;Pk) K ∈ {1, 2.., K }, and step four, positioning the target in the subarea
Estimating the target position in the area through an offline fingerprint database; if the target at the current moment accepts the RSS values of M APs as
Figure FDA0002571157270000032
Obtaining the distribution probability p (y) of the target in each subarea through a subarea judgment modelk=1|rnow),k∈{1,2,...,K};
Using the current measured value rnowSelecting corresponding sub-region omegakAP signal measurement value effective for internal target positioning
Figure FDA0002571157270000033
Obtaining the posterior probability distribution of the target position under given observation data according to the conditional probability criterion of the multivariate Gaussian distribution:
Figure FDA0002571157270000034
based on the resulting off-line fingerprint library (w)k;μk;Pk) Obtaining:
Figure FDA0002571157270000035
Figure FDA0002571157270000036
combining the partition probability with the formula (9) to obtain the target in the partition omegakThe posterior distribution probability of each internal reference point is as follows:
Figure FDA0002571157270000037
target is located in partition ΩkThe distribution weight of each reference point in the image is updated as follows:
Figure FDA0002571157270000038
wherein, λ is normalization factor, then the division is ΩkThe distribution weight of the internal reference points as the target positions is
Figure FDA0002571157270000039
In summary, the position of the target is estimated as
Figure FDA00025711572700000310
Step five, updating the fingerprint database
When dividing into omegakWhen the entropy of the obtained information is less than the partition threshold value, newly collecting NkData of a person
Figure FDA00025711572700000311
Updating parameters of the partition model; based on the existing multivariate Gaussian mixture model, model parameters are adaptively adjusted through data acquisition so as to deduce an MVGMM model which has a closer coupling relation with the existing environment; consistent with the EM algorithm, the self-adaptive process of the MVGMM model parameters is also a two-step estimation; firstly, newly-added observation data is added through a k-means algorithm
Figure FDA0002571157270000041
Cluster as CkClustering, and obtaining data weight by clustering
Figure FDA0002571157270000042
Mean value
Figure FDA0002571157270000043
Covariance
Figure FDA0002571157270000044
Initializing each Gaussian composition; respectively calculating the statistical weight of the newly added observation data by using the formulas (5) to (8)
Figure FDA0002571157270000045
Mean value
Figure FDA0002571157270000046
And covariance
Figure FDA0002571157270000047
Dividing omega by using statistic parameter of new datakThe corresponding MVGMM model parameters are subjected to self-adaptive optimization, namely:
Figure FDA0002571157270000048
Figure FDA0002571157270000049
Figure FDA00025711572700000410
wherein,
Figure FDA00025711572700000411
used for balancing the influence degree of new and old data on model parameters, wherein lambda is a normalization factor, and rρIs an intrinsic correlation factor of the parameter;
sixthly, updating the model parameters
And (5) carrying out real-time updating on the model parameters obtained by updating in the self-adaptive process in the second step and the third step.
CN202010639947.2A 2020-07-06 2020-07-06 Self-adaptive positioning fingerprint database construction method under complex indoor signal environment Active CN111757258B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010639947.2A CN111757258B (en) 2020-07-06 2020-07-06 Self-adaptive positioning fingerprint database construction method under complex indoor signal environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010639947.2A CN111757258B (en) 2020-07-06 2020-07-06 Self-adaptive positioning fingerprint database construction method under complex indoor signal environment

Publications (2)

Publication Number Publication Date
CN111757258A true CN111757258A (en) 2020-10-09
CN111757258B CN111757258B (en) 2021-05-14

Family

ID=72679412

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010639947.2A Active CN111757258B (en) 2020-07-06 2020-07-06 Self-adaptive positioning fingerprint database construction method under complex indoor signal environment

Country Status (1)

Country Link
CN (1) CN111757258B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112533137A (en) * 2020-11-26 2021-03-19 北京爱笔科技有限公司 Device positioning method and device, electronic device and computer storage medium
CN113518308A (en) * 2021-05-25 2021-10-19 江南大学 The Best AP Screening Method in Indoor Positioning
CN114048783A (en) * 2021-11-17 2022-02-15 东南大学 A Cellular Signal Map Construction Method Based on Mobile Group Awareness

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107087256A (en) * 2017-03-17 2017-08-22 上海斐讯数据通信技术有限公司 A kind of fingerprint cluster method and device based on WiFi indoor positionings
CN107241700A (en) * 2017-04-23 2017-10-10 西安电子科技大学 Indoor orientation method based on CSI Space-frequency characters and reference point locations clustering algorithm
CN109413575A (en) * 2018-12-04 2019-03-01 重庆邮电大学 A kind of adaptive AP layout method of indoor positioning
US10571546B2 (en) * 2015-09-25 2020-02-25 Intel Corporation Vision and radio fusion based precise indoor localization
CN111148030A (en) * 2019-12-31 2020-05-12 泰斗微电子科技有限公司 Fingerprint database updating method and device, server and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10571546B2 (en) * 2015-09-25 2020-02-25 Intel Corporation Vision and radio fusion based precise indoor localization
CN107087256A (en) * 2017-03-17 2017-08-22 上海斐讯数据通信技术有限公司 A kind of fingerprint cluster method and device based on WiFi indoor positionings
CN107241700A (en) * 2017-04-23 2017-10-10 西安电子科技大学 Indoor orientation method based on CSI Space-frequency characters and reference point locations clustering algorithm
CN109413575A (en) * 2018-12-04 2019-03-01 重庆邮电大学 A kind of adaptive AP layout method of indoor positioning
CN111148030A (en) * 2019-12-31 2020-05-12 泰斗微电子科技有限公司 Fingerprint database updating method and device, server and storage medium

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112533137A (en) * 2020-11-26 2021-03-19 北京爱笔科技有限公司 Device positioning method and device, electronic device and computer storage medium
CN112533137B (en) * 2020-11-26 2023-10-17 北京爱笔科技有限公司 Positioning method and device of equipment, electronic equipment and computer storage medium
CN113518308A (en) * 2021-05-25 2021-10-19 江南大学 The Best AP Screening Method in Indoor Positioning
CN113518308B (en) * 2021-05-25 2022-03-25 江南大学 The Best AP Screening Method in Indoor Positioning
CN114048783A (en) * 2021-11-17 2022-02-15 东南大学 A Cellular Signal Map Construction Method Based on Mobile Group Awareness
CN114048783B (en) * 2021-11-17 2024-04-16 东南大学 Cellular signal map construction method based on mobile group perception

Also Published As

Publication number Publication date
CN111757258B (en) 2021-05-14

Similar Documents

Publication Publication Date Title
CN111757258B (en) Self-adaptive positioning fingerprint database construction method under complex indoor signal environment
CN108989976B (en) Fingerprint positioning method and system in intelligent classroom
CN112748397B (en) UWB positioning method based on self-adaptive BP neural network under non-line-of-sight condition
CN106131959B (en) A kind of dual-positioning method divided based on Wi-Fi signal space
Kaemarungsi Efficient design of indoor positioning systems based on location fingerprinting
CN109598936B (en) Short-term traffic prediction method based on dynamic STKNN model
CN110933599B (en) Self-adaptive positioning method fusing UWB and WIFI fingerprints
WO2019062734A1 (en) Indoor positioning method and device based on wi-fi hot spots
CN103533647B (en) A kind of radio frequency map adaptive location method based on sub-clustering mechanism and robustness regression
CN106851573A (en) Joint weighting k nearest neighbor indoor orientation method based on log path loss model
CN109511085B (en) A UWB Fingerprint Location Method Based on MeanShift and Weighted k-Nearest Neighbor Algorithm
CN108810799B (en) Multi-floor indoor positioning method and system based on linear discriminant analysis
CN109348416B (en) Fingerprint indoor positioning method based on bipartite k-means
CN109195104A (en) A kind of indoor orientation method combined based on support vector regression and Kalman filtering
EP3039446A1 (en) Location detection system and method
Yang et al. Research on Wi-Fi indoor positioning in a smart exhibition hall based on received signal strength indication
CN110260855B (en) An indoor pedestrian navigation and positioning method integrating pedestrian dead reckoning, geomagnetic information and indoor map information
CN109814066A (en) RSSI indoor positioning and ranging method and indoor positioning platform based on neural network learning
CN109379711A (en) a positioning method
CN105866732B (en) The mixing indoor orientation method that a kind of improvement MK models and WKNN algorithms are combined
CN109059919A (en) A kind of indoor orientation method based on crowdsourcing sample weighting surface fitting
CN111257830B (en) WIFI positioning algorithm based on preset AP position
Luo et al. An improved wknn algorithm based on flexible k selection strategy and distance compensation for indoor localization
CN108919182B (en) Target positioning method based on support set and expectation maximization in WIFI environment
Zhou et al. IMLours: Indoor mapping and localization using time-stamped WLAN received signal strength

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant