CN117572457A

CN117572457A - A cross-scene multispectral point cloud classification method based on pseudo-label learning

Info

Publication number: CN117572457A
Application number: CN202410061674.6A
Authority: CN
Inventors: 王青旺; 王铭野; 王盼新; 蒋涛; 张梓峰; 沈韬
Original assignee: Kunming University of Science and Technology
Current assignee: Kunming University of Science and Technology
Priority date: 2024-01-16
Filing date: 2024-01-16
Publication date: 2024-02-20
Anticipated expiration: 2044-01-16
Also published as: CN117572457B

Abstract

The invention relates to a cross-scene multispectral point cloud classification method based on pseudo tag learning, and belongs to the technical field of multispectral laser radar point clouds. The method comprises the following steps: 1) Respectively characterizing the multispectral laser radar point clouds of the source domain scene and the target domain scenePre-alignment of row features; 2) Respectively extracting graph features of two scenes; 3) Calculating loss; 4) Iteratively performing 3) updating the source domain-target domain alignment network parameters until the model converges to obtain a pseudo tag of the target domain and the confidence coefficient thereof; 5) Setting threshold value for pseudo tags in descending orderαBefore selectingα% of pseudo tags; 6) Splicing the adjacent matrix and the feature matrix in the target domain to obtain a new feature matrix; 7) Calculating loss according to the pseudo tag obtained in the step 5) and the feature matrix obtained in the step 6); 8) And 7) updating parameters in the step until the model converges, and finally obtaining a target domain multispectral point cloud data classification result. The method can realize high-precision classification of multi-spectrum point cloud of the cross-scene.

Description

A cross-scene multispectral point cloud classification method based on pseudo-label learning

技术领域Technical field

本发明涉及一种基于伪标签学习的跨场景多光谱点云分类方法，属于多光谱激光雷达点云技术领域。The invention relates to a cross-scene multispectral point cloud classification method based on pseudo-label learning, and belongs to the technical field of multispectral laser radar point cloud.

背景技术Background technique

多光谱LiDAR系统能够同步获取场景中的三维空间分布信息和光谱信息，可以为遥感场景解译任务提供更加丰富的特征信息。在多光谱LiDAR的相关处理任务中，目前大多数分类方法，特别是基于深度学习的分类方法，需要大量的训练数据集才能达到最佳性能。然而，收集和标记大量的点云往往是费力和耗时的。另一方面，它们只适用于固定场景，即训练样本和测试样本是独立且同分布的。当应用于陌生场景时性能会显著下降。因此，这些方法不能直接转移到其他场景，也不能在实时收集的未标记数据上进行测试。这已经成为多光谱LiDAR数据解译的主要制约因素。The multispectral LiDAR system can simultaneously obtain the three-dimensional spatial distribution information and spectral information in the scene, and can provide richer feature information for remote sensing scene interpretation tasks. In multispectral LiDAR-related processing tasks, most current classification methods, especially those based on deep learning, require a large number of training data sets to achieve optimal performance. However, collecting and labeling large amounts of point clouds is often laborious and time-consuming. On the other hand, they only apply to fixed scenarios, where training and test samples are independent and identically distributed. Performance drops significantly when applied to unfamiliar scenarios. Therefore, these methods are not directly transferable to other scenarios, nor can they be tested on unlabeled data collected in real time. This has become a major constraint in the interpretation of multispectral LiDAR data.

多光谱LiDAR对遥感场景进行数据采集时，激光脉冲发射角度、地物空间分布、季节和天气变化等多种因素都会影响接收激光脉冲的强度，即产生光谱漂移现象。此外，无论传统方法还是基于深度学习的方法，它们的场景自适应能力较差，当训练样本与测试样本存在分布差异时性能会显著下降。显然，多光谱点云同时具有地物的空间几何信息和光谱信息，通过从源域场景多光谱点云中学习表征地物本质属性的空间几何-光谱一致性信息，指导目标域场景多光谱点云伪标签高精度生成，采用目标域伪标签训练网络，能够提高多光谱点云地物分类网络在目标域场景中的性能，提高网络的场景自适应能力。因此，如何在不同场景中多光谱点云光谱漂移、地物分布不一致等情况下，生成高精度目标域场景点云伪标签，在没有目标域场景真实标签的情况下实现跨场景多光谱点云高精度分类，是目前亟待解决的技术问题。When multispectral LiDAR collects data from remote sensing scenes, various factors such as laser pulse emission angle, spatial distribution of ground objects, seasonal and weather changes will affect the intensity of the received laser pulse, which causes spectral drift. In addition, both traditional methods and methods based on deep learning have poor scene adaptability, and their performance will drop significantly when there is a distribution difference between training samples and test samples. Obviously, multispectral point clouds contain both spatial geometric information and spectral information of ground objects. By learning the spatial geometry-spectral consistency information that represents the essential attributes of ground objects from the multispectral point cloud of the source domain scene, it can guide the multispectral points of the target domain scene. The high-precision generation of cloud pseudo-labels and the use of target domain pseudo-labels to train the network can improve the performance of the multispectral point cloud feature classification network in target domain scenes and improve the network's scene adaptability. Therefore, how to generate high-precision target domain scene point cloud pseudo-labels when the multispectral point cloud spectrum drifts and the distribution of ground objects is inconsistent in different scenes, etc., and how to achieve cross-scene multispectral point cloud without real labels of the target domain scene. High-precision classification is a technical problem that needs to be solved urgently.

发明内容Contents of the invention

本发明要解决的技术问题是提供一种基于伪标签学习的跨场景多光谱点云分类方法，以应对多光谱激光雷达点云在不同场景之间的光谱漂移现象，缓解光谱漂移现象带来的跨场景多光谱激光雷达点云分类困难等问题，在没有目标域场景真实标签的情况下实现跨场景多光谱点云高精度分类。The technical problem to be solved by this invention is to provide a cross-scene multi-spectral point cloud classification method based on pseudo-label learning to cope with the spectral drift phenomenon of multi-spectral lidar point clouds between different scenes and alleviate the problems caused by the spectral drift phenomenon. Cross-scene multispectral lidar point cloud classification is difficult and other problems. High-precision classification of cross-scene multispectral point clouds is achieved without real labels of target domain scenes.

本发明的技术方案是：一种基于伪标签学习的跨场景多光谱点云分类方法，包括如下步骤：The technical solution of the present invention is: a cross-scene multispectral point cloud classification method based on pseudo-label learning, which includes the following steps:

Step1：分别将带标签源域场景和无标签目标域场景多光谱激光雷达点云特征根据L₂范数和拉普拉斯矩阵进行特征预对齐；Step1: Pre-align the multispectral lidar point cloud features of the labeled source domain scene and the unlabeled target domain scene according to the _L2 norm and Laplacian matrix;

Step2：根据预对齐后的特征，采用图卷积神经网络(Graph Convolution NeuralNetworks, GCN)分别提取两个场景的图特征；Step2: Based on the pre-aligned features, use Graph Convolution Neural Networks (GCN) to extract graph features of the two scenes respectively;

Step3：根据提取得到的两个场景的图特征和源域标签计算源域分类损失、最大均值差异(Maximum Mean Discrepancy, MMD)损失、目标域香农熵损失；Step3: Calculate the source domain classification loss, Maximum Mean Discrepancy (MMD) loss, and target domain Shannon entropy loss based on the extracted graph features and source domain labels of the two scenes;

Step4：迭代地进行Step3，更新源域-目标域对齐网络参数，判断模型是否收敛，是则结束，然后进行Step5，否则重复Step3，得到目标域的伪标签及其置信度；Step4: Perform Step3 iteratively, update the source domain-target domain alignment network parameters, determine whether the model has converged, if so, end, and then proceed to Step5, otherwise repeat Step3 to obtain the pseudo label and its confidence of the target domain;

Step5：根据置信度对伪标签降序排列，设置阈值α，选取前α%的目标域伪标签作为目标域分类网络真值输入；Step5: Arrange the pseudo-labels in descending order according to the confidence level, set the threshold α, and select the top α% of the pseudo-labels of the target domain as the true value input of the target domain classification network;

Step6：拼接目标域中的邻接矩阵和特征矩阵得到新的特征矩阵作为目标域分类网络特征输入；Step6: Splice the adjacency matrix and feature matrix in the target domain to obtain a new feature matrix as the feature input of the target domain classification network;

Step7：根据Step5选取出的伪标签和Step6得到的新的特征矩阵计算目标域分类损失；Step7: Calculate the target domain classification loss based on the pseudo labels selected in Step5 and the new feature matrix obtained in Step6;

Step8：迭代地进行Step7，更新目标域分类网络参数，判断模型是否收敛，是则结束，否则重复Step7，最终得到目标域多光谱点云数据分类结果。Step8: Perform Step7 iteratively, update the target domain classification network parameters, and determine whether the model has converged. If so, end it. Otherwise, repeat Step7, and finally obtain the target domain multispectral point cloud data classification results.

具体地，在Step1中，所述带标签源域场景多光谱激光雷达点云数据记为（P_s，Y），无标签目标域场景记为（P_t，），其中表示源域场景包含N_s个有标签多光谱点，表示源域场景中第i个有标签多光谱点，分别表示目标域场景包含N_t个无标签多光谱点，表示目标域场景中第i个无标签多光谱点，表示所有源域场景多光谱点对应的真值标签，表示源域场景中第i个多光谱点对应的真值标签。 Specifically, in Step 1, the multispectral lidar point cloud data of the labeled source domain scene is recorded as (P _s , Y), and the unlabeled target domain scene is recorded as (P _t , ),in Indicates that the source domain scene contains N _s labeled multispectral points, Represents the i-th labeled multispectral point in the source domain scene, Respectively indicating that the target domain scene contains N _t unlabeled multispectral points, Represents the i-th unlabeled multispectral point in the target domain scene, Represents the ground truth labels corresponding to all source domain scene multispectral points, Represents the true value label corresponding to the i-th multispectral point in the source domain scene.

具体地，在Step1中，所述根据L₂范数和拉普拉斯矩阵进行特征预对齐具体步骤为：Specifically, in Step 1, the specific steps for feature pre-alignment based on L ₂ norm and Laplacian matrix are:

（1）通过L₂范数对源域和目标域特征进行特征变换，具体特征变换公式为：(1) Perform feature transformation on the source domain and target domain features through L ₂ norm. The specific feature transformation formula is:

其中x为源域、目标域特征，为特征变换后的源域、目标域特征，为2范数。 where x is the source domain and target domain features, are the source domain and target domain features after feature transformation, is 2 norm.

（2）根据步骤（1）的公式，得到M维度的源域特征和M维度的目标域特征，将和拼接得到M维度的总体特征矩阵，根据K最邻近算法计算总体特征矩阵的邻接矩阵W，进一步计算对角矩阵D，对角矩阵D中的元素，为邻接矩阵W中的元素，则拉普拉斯矩阵L=D-W，所以最终的总体特征矩阵X根据以下公式更新： (2) According to the formula in step (1), obtain the M-dimensional source domain features and M-dimensional target domain features ,Will and Splicing to obtain the overall feature matrix of M dimensions , calculate the overall feature matrix according to the K nearest neighbor algorithm The adjacency matrix W, further calculates the diagonal matrix D, the elements in the diagonal matrix D , is an element in the adjacency matrix W, then the Laplacian matrix L=DW, so the final overall feature matrix X is updated according to the following formula:

其中，为更新后的特征矩阵，^T为矩阵转置操作，N_s为源域场景有标签多光谱点的个数，N_t为目标域场景中无标签多光谱点的个数。 in, is the updated feature matrix, ^T is the matrix transposition operation, N _s is the number of labeled multispectral points in the source domain scene, and N _t is the number of unlabeled multispectral points in the target domain scene.

具体地，所述Step3具体为：Specifically, the Step 3 is:

将Step2中提取得到的源域场景和目标域场景图特征分别记为和，源域分类损失计算公式为：The source domain scene and target domain scene graph features extracted in Step 2 are recorded as and , the source domain classification loss calculation formula is:

其中，是源域场景中第i个点的标签，是源域场景中第i个点的预测标签，是源域场景标签集合，N_s为源域场景有标签多光谱点的个数； in, is the label of the i-th point in the source domain scene, is the predicted label of the i-th point in the source domain scene, is the source domain scene label set, N _s is the number of labeled multispectral points in the source domain scene;

为了衡量提取得到的特征之间的差异，采用最大均值差异(Maximum MeanDiscrepancy, MMD)损失计算两个场景的特征偏差，用以促进GCN提取域不变特征：In order to measure the difference between the extracted features, the Maximum Mean Discrepancy (MMD) loss is used to calculate the feature deviation of the two scenes to promote GCN to extract domain-invariant features:

其中，是将原始变量映射到高维空间的映射函数，是第i个源域多光谱点的图特征，是第j个目标域多光谱点的图特征，N_t为目标域场景中无标签多光谱点的个数； in, is a mapping function that maps original variables to high-dimensional space, is the graph feature of the i-th source domain multispectral point, is the graph feature of the jth target domain multispectral point, and N _t is the number of unlabeled multispectral points in the target domain scene;

采用香农熵损失约束网络以得到更高置信度的目标域场景伪标签，具体香农熵损失公式为：Shannon entropy loss is used to constrain the network to obtain higher confidence target domain scene pseudo-labels. The specific Shannon entropy loss formula is:

其中，H为香农熵矩阵，为H中的元素，具体计算公式如下： Among them, H is the Shannon entropy matrix, is an element in H, specifically Calculated as follows:

其中，P为网络对目标域多光谱激光雷达点云的预测概率矩阵，为预测概率，l 为多光谱点云的特征通道数，为目标域节点预对齐后的特征。 Among them, P is the network’s prediction probability matrix for the multispectral lidar point cloud in the target domain, is the prediction probability, l is the number of characteristic channels of the multispectral point cloud, Pre-aligned features for target domain nodes.

具体地，所述Step4中更新源域-目标域对齐网络参数，具体为：Specifically, the source domain-target domain alignment network parameters are updated in Step 4, specifically as follows:

（1）所有参数都使用标准反向传播算法进行优化；(1) All parameters are optimized using the standard backpropagation algorithm;

（2）在训练中，整体损失为源域分类损失、最大均值差异(Maximum MeanDiscrepancy, MMD)损失、目标域香农熵损失的组合，训练的整体损失为：(2) During training, the overall loss is a combination of source domain classification loss, Maximum Mean Discrepancy (MMD) loss, and target domain Shannon entropy loss. The overall loss of training is:

其中，和是平衡损失的平衡系数。 in, and is the balance coefficient to balance the loss.

具体地，所述Step6中拼接目标域中的邻接矩阵和特征矩阵，具体为：Specifically, the adjacency matrix and feature matrix in the target domain are spliced in Step 6, specifically:

将目标域邻接矩阵记为，则将M维度的目标域特征与拼接得到更新后的目标域特征。 Denote the adjacency matrix of the target domain as , then the M-dimensional target domain features and Splicing to obtain updated target domain features .

具体地，所述Step7中计算目标域分类损失具体公式如下：Specifically, the specific formula for calculating the target domain classification loss in Step 7 is as follows:

其中，是目标域场景中第i个点的伪标签，是目标域场景中第i个点的预测标签，N_t为目标域场景中无标签多光谱点的个数，为Step2中提取得到的目标域场景图特征，为目标域伪标签集合。 in, is the pseudo label of the i-th point in the target domain scene, is the predicted label of the i-th point in the target domain scene, N _t is the number of unlabeled multispectral points in the target domain scene, is the target domain scene graph feature extracted in Step 2, is the pseudo-label set of the target domain.

具体地，所述Step8中更新目标域分类网络参数，具体为：Specifically, the target domain classification network parameters are updated in Step 8, specifically as follows:

（1）所有参数都使用标准反向传播算法进行优化。(1) All parameters are optimized using standard backpropagation algorithm.

（2）在训练中，采用Step7中的目标域分类损失作为训练损失。(2) In training, the target domain classification loss in Step 7 is used as the training loss.

多光谱激光雷达在不同场景中往往会出现同物异谱或者同谱异物的现象，这会导致在目标域场景没有标签可供训练时，仅采用源域点云标签训练得到的网络对目标域点云分类精度较低。本发明通过设计特征预对齐操作对源域和目标域场景的特征进行对齐，采用最大均值差异(Maximum Mean Discrepancy, MMD)损失和香农熵损失促进GCN提取域不变特征，并得到高质量的目标域点云伪标签。根据目标域邻接矩阵对目标域特征进行特征增强，实现利用有标签的源域场景多光谱点云训练图神经网络，对无标签的目标域多光谱点云进行高精度分类。Multispectral lidar often has the same objects with different spectra or different objects with the same spectrum in different scenes. This will lead to the use of only the network trained with source domain point cloud labels to train the target domain when there are no labels for training in the target domain scene. Point cloud classification accuracy is low. This invention aligns the features of the source domain and target domain scenes by designing feature pre-alignment operations, and uses Maximum Mean Discrepancy (MMD) loss and Shannon entropy loss to promote GCN to extract domain-invariant features and obtain high-quality targets. Domain point cloud pseudo-labels. The target domain features are enhanced according to the target domain adjacency matrix, and the labeled source domain scene multispectral point cloud is used to train the graph neural network to perform high-precision classification of the unlabeled target domain multispectral point cloud.

本发明的有益效果是：本发明与现有技术相比，缓解了不同场景间多光谱点云光谱漂移带来的负面影响。通过特征域对齐操作帮助GCN提取域不变特征、采用最大均值差异(Maximum Mean Discrepancy, MMD)损失和香农熵损失保证目标域点云伪标签的准确性。进一步根据邻接矩阵对目标域特征进行增强。在不同场景中多光谱点云光谱漂移、地物分布不一致等情况下，实现了有效和可靠的信息转移以实现对无标签目标域场景进行地物分类。在没有目标域场景真实标签的情况下实现跨场景多光谱点云高精度分类。The beneficial effects of the present invention are: compared with the existing technology, the present invention alleviates the negative impact caused by the spectral drift of multispectral point clouds between different scenes. The feature domain alignment operation helps GCN extract domain invariant features, and maximum mean difference (MMD) loss and Shannon entropy loss are used to ensure the accuracy of pseudo-labeling of target domain point clouds. The target domain features are further enhanced based on the adjacency matrix. In situations such as spectral drift of multispectral point clouds and inconsistent distribution of ground objects in different scenes, effective and reliable information transfer is achieved to classify ground objects in unlabeled target domain scenes. Achieve high-precision classification of cross-scene multispectral point clouds without real labels of target domain scenes.

附图说明Description of the drawings

图1是本发明的基于伪标签学习的跨场景多光谱点云分类方法框架；Figure 1 is the framework of the present invention's cross-scene multispectral point cloud classification method based on pseudo-label learning;

图2是实施例中数据集真实地物分布图，（a）是源场景可视化图、（b）是目标场景可视化图。Figure 2 is a distribution map of real ground objects in the data set in the embodiment, (a) is a source scene visualization map, (b) is a target scene visualization map.

具体实施方式Detailed ways

下面结合附图和具体实施例，对本发明作进一步说明。The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

实施例1：如图1所示，一种基于伪标签学习的跨场景多光谱点云分类方法，包括如下步骤：Embodiment 1: As shown in Figure 1, a cross-scene multispectral point cloud classification method based on pseudo-label learning includes the following steps:

在Step1中，所述带标签源域场景多光谱激光雷达点云数据记为（P_s，Y），无标签目标域场景记为（P_t，），其中表示源域场景包含N_s个有标签多光谱点，表示源域场景中第i个有标签多光谱点，分别表示目标域场景包含N_t个无标签多光谱点，表示目标域场景中第i个无标签多光谱点，表示所有源域场景多光谱点对应的真值标签，表示源域场景中第i个多光谱点对应的真值标签。 In Step 1, the multispectral lidar point cloud data of the labeled source domain scene is recorded as (P _s , Y), and the unlabeled target domain scene is recorded as (P _t , ),in Indicates that the source domain scene contains N _s labeled multispectral points, Represents the i-th labeled multispectral point in the source domain scene, Respectively indicating that the target domain scene contains N _t unlabeled multispectral points, Represents the i-th unlabeled multispectral point in the target domain scene, Represents the ground truth labels corresponding to all source domain scene multispectral points, Represents the true value label corresponding to the i-th multispectral point in the source domain scene.

在Step1中，所述根据L₂范数和拉普拉斯矩阵进行特征预对齐具体步骤为：In Step 1, the specific steps for feature pre-alignment based on L ₂ norm and Laplacian matrix are:

将Step2中提取得到的源域场景和目标域场景图特征分别记为和，源域分类损失计算公式为： The source domain scene and target domain scene graph features extracted in Step 2 are recorded as and , the source domain classification loss calculation formula is:

其中，H为香农熵矩阵，为H中的元素，具体计算公式如下：Among them, H is the Shannon entropy matrix, is an element in H, specifically Calculated as follows:

所述Step4中更新源域-目标域对齐网络参数，具体为：In Step 4, the source domain-target domain alignment network parameters are updated, specifically:

其中，和是平衡损失的平衡系数，在本发明中，和取值为1。 in, and is the balance coefficient to balance the loss. In the present invention, and The value is 1.

使用标准反向传播算法更新源域-目标域对齐网络参数，判断模型是否收敛，是则结束，否则重复步骤S3，直至模型收敛。Use the standard backpropagation algorithm to update the source domain-target domain alignment network parameters and determine whether the model has converged. If so, end. Otherwise, repeat step S3 until the model converges.

Step5：根据置信度对伪标签降序排列，设置阈值α，选取前α%的目标域伪标签作为目标域分类网络真值输入，在本发明中α取值为50；Step5: Arrange the pseudo labels in descending order according to the confidence level, set the threshold α, and select the top α% of the pseudo labels in the target domain as the true value input of the target domain classification network. In the present invention, the value of α is 50;

所述Step6中拼接目标域中的邻接矩阵和特征矩阵，具体为：The adjacency matrix and feature matrix in the splicing target domain in Step 6 are specifically:

所述Step7中计算目标域分类损失具体公式如下：The specific formula for calculating the target domain classification loss in Step 7 is as follows:

Step8：将步骤S7的目标分类损失作为训练损失，使用标准反向传播算法更新目标域分类网络参数，判断模型是否收敛，是则结束，否则重复步骤S7，直至模型收敛。Step8: Use the target classification loss in step S7 as the training loss, use the standard backpropagation algorithm to update the target domain classification network parameters, and determine whether the model has converged. If so, end it. Otherwise, repeat step S7 until the model converges.

下面在具体实施记载的基础上，通过实验的方式来说明本发明是切实可行的：On the basis of the specific implementation records, the feasibility of the present invention will be demonstrated through experiments below:

1、实验数据1. Experimental data

Harbor of Tobermory数据集：该数据集场景是位于英国托伯莫里的一个小型海港，由Optech Titan激光雷达采集的三波段点云数据，波长分别为1550nm、1064nm和532nm，数据集可视化效果如图2所示，其中（a）是源场景可视化图、（b）是目标场景可视化图。根据土地覆盖的高度、材料和语义信息将研究区域划分为7类，分别为裸地、草地、道路、建筑物、树木、电力线和汽车。Harbor of Tobermory data set: The scene of this data set is a small seaport located in Tobermory, UK. The three-band point cloud data collected by Optech Titan lidar, the wavelengths are 1550nm, 1064nm and 532nm respectively. The visualization effect of the data set is as follows 2, where (a) is the source scene visualization diagram and (b) is the target scene visualization diagram. The study area is divided into seven categories according to the height, material and semantic information of land cover, namely bare land, grassland, roads, buildings, trees, power lines and cars.

University of Houston数据集：该数据集场景是休斯顿校园的一部分区域，由Optech Titan激光雷达采集的三波段点云数据，波长分别为1550nm、1064nm和532nm。根据土地覆盖的高度、材料和语义信息将研究区域划分为7类，分别为裸地、汽车、草地、道路、电力线、建筑物和树木。采用F分数作为评价指标。两个数据集的可视化效果如图2所示。University of Houston data set: This data set scene is a part of the Houston campus. It is a three-band point cloud data collected by the Optech Titan lidar. The wavelengths are 1550nm, 1064nm and 532nm respectively. The study area is divided into seven categories according to the height, material and semantic information of land cover, namely bare land, cars, grassland, roads, power lines, buildings and trees. The F score is used as the evaluation index. The visualization of the two data sets is shown in Figure 2.

2、实验内容2. Experimental content

在实验中，采用本发明方法和传统GCN方法对以上数据集进行分类验证。将Harborof Tobermory数据集作为源域场景，将University of Houston数据集作为目标域场景，为节约计算资源，采用超点分割方法将两个场景分别分割为8000个超点作为输入。采用本发明方法进行点云分类，将分类结果采用如下公式中的评价指标进行评价，表1为本发明方法在不同地物中的均交并比（MIoU）。In the experiment, the method of the present invention and the traditional GCN method were used to classify and verify the above data set. The Harborof Tobermory data set is used as the source domain scene, and the University of Houston data set is used as the target domain scene. In order to save computing resources, the super-point segmentation method is used to divide the two scenes into 8000 super-points as input. The method of the present invention is used for point cloud classification, and the classification results are evaluated using the evaluation indicators in the following formula. Table 1 shows the average intersection over union (MIoU) ratio of the method of the present invention in different ground objects.

其中，TP是被分割到正类点中正类点的数量、FP是被分割到正类点中负类点的数量、FN是被分割到负类点中正类点的数量。Among them, TP is the number of positive class points divided into positive class points, FP is the number of negative class points divided into positive class points, and FN is the number of positive class points divided into negative class points.

表1Table 1

本发明能够有效应对多光谱激光雷达点云在不同场景之间的光谱漂移现象，缓解光谱漂移现象带来的跨场景多光谱激光雷达点云分类困难等问题，在没有目标域场景真实标签的情况下实现跨场景多光谱点云高精度分类。This invention can effectively deal with the spectral drift phenomenon of multi-spectral lidar point clouds between different scenes, alleviate problems such as difficulty in classifying cross-scenario multi-spectral lidar point clouds caused by the spectral drift phenomenon, and solve the problem of no real label of the target domain scene. High-precision classification of cross-scenario multispectral point clouds is achieved.

以上结合附图对本发明的具体实施方式作了详细说明，但是本发明并不限于上述实施方式，在本领域普通技术人员所具备的知识范围内，还可以在不脱离本发明宗旨的前提下做出各种变化。The specific embodiments of the present invention have been described in detail above with reference to the accompanying drawings. However, the present invention is not limited to the above-described embodiments. Within the scope of knowledge possessed by those of ordinary skill in the art, other modifications can be made without departing from the spirit of the present invention. various changes.

Claims

1. A cross-scene multispectral point cloud classification method based on pseudo tag learning is characterized in that: the method comprises the following steps:

step1: respectively enabling the multispectral laser radar point cloud characteristics of the tagged source domain scene and the untagged target domain scene to be according to L ₂ Performing feature pre-alignment on the norms and the Laplace matrix;

step2: respectively extracting graph features of two scenes by adopting a graph convolution neural network (GCN) according to the pre-aligned features;

step3: calculating source domain classification loss, maximum mean difference MMD loss and target domain shannon entropy loss according to the extracted graph characteristics and source domain labels of the two scenes;

step4: iteratively performing Step3, updating the source domain-target domain alignment network parameters, judging whether the model is converged, if yes, ending, then performing Step5, otherwise repeating Step3 to obtain a pseudo tag of the target domain and the confidence coefficient thereof;

step5: according to the confidence level, the pseudo labels are arranged in a descending order, a threshold value alpha is set, and the target domain pseudo labels with the alpha percent before are selected to be used as the true value input of the target domain classification network;

step6: splicing the adjacent matrix and the feature matrix in the target domain to obtain a new feature matrix as the feature input of the target domain classification network;

step7: calculating the classification loss of the target domain according to the pseudo tag selected by Step5 and the new feature matrix obtained by Step 6;

step8: and (3) iteratively performing Step7, updating the target domain classification network parameters, judging whether the model is converged, if yes, ending, otherwise repeating Step7, and finally obtaining a target domain multispectral point cloud data classification result.

2. The cross-scene multispectral point cloud classification method based on pseudo tag learning of claim 1, wherein the method comprises the following steps of: in Step1, the labeled source domain scene multispectral lidar point cloud data is denoted as (P _s Y), the unlabeled target domain scene is denoted (P _t ，) Wherein->Representing a source domain scene contains N _s Multiple spectral spots with labels->Representing the i-th labeled multispectral point in the source domain scene,>respectively representing that the target domain scene contains N _t Label-free multispectral spots->Representing the i-th unlabeled multispectral point in the target domain scene,>truth-value label corresponding to multispectral points of all source field scenes>And (5) representing a truth value label corresponding to the ith multispectral point in the source domain scene.

3. The cross-scene multispectral point cloud classification method based on pseudo tag learning of claim 1, wherein the method comprises the following steps of: in Step1, the process is described in terms of L ₂ The specific steps of feature pre-alignment of the norm and the Laplace matrix are as follows:

(1) Through L ₂ The norm carries out characteristic transformation on the characteristics of the source domain and the target domain, and a specific characteristic transformation formula is as follows:

；

where x is the source domain, target domain characteristics,for the source domain, target domain characteristics after characteristic transformation,/->Is 2 norms;

(2) Obtaining source domain features of M dimensions according to the formula of the step (1)And M-dimensional target domain featuresWill->And->Splicing to obtain an overall feature matrix of M dimension +.>Calculating an overall feature matrix according to K nearest neighbor algorithm>Further calculating a diagonal matrix D, the elements in the diagonal matrix D being +.>，For elements in the adjacency matrix W, then the laplace matrix l=d-W, so the final overall feature matrix X is updated according to the following formula:

；

wherein,in order to update the feature matrix after the update, ^T for matrix transposition operationsMake N _s The number of the multispectral points with labels for the source domain scene is N _t The number of unlabeled multispectral points in the target domain scene.

4. The cross-scene multispectral point cloud classification method based on pseudo tag learning of claim 1, wherein the method comprises the following steps of: the Step3 specifically comprises the following steps:

respectively marking the source domain scene and the target domain scene map features extracted from Step2 asAnd->The source domain classification loss calculation formula is:

；

wherein,is the label of the i-th point in the source domain scene,/->Is the predictive label of the i-th point in the source domain scene,>is a source domain scene tag set, N _s The number of the multi-spectrum points is the number of the labeled multi-spectrum points for the source field scene;

in order to measure the difference between the extracted features, the feature deviation of two scenes is calculated by adopting the maximum mean difference MMD loss, so as to promote the GCN extraction domain invariant feature:

；

wherein,is a mapping function mapping the original variable to a high-dimensional space,/->Is a graph characteristic of the ith source domain multispectral point,is the graph characteristic of the j-th target domain multispectral point, N _t The number of unlabeled multispectral points in the target domain scene;

the shannon entropy loss constraint network is adopted to obtain a target domain scene pseudo tag with higher confidence, and a specific shannon entropy loss formula is as follows:

；

wherein H is shannon entropy matrix,is an element in H, in particular +.>The calculation formula is as follows:

；

wherein P is a prediction probability matrix of the network to the target domain multispectral laser radar point cloud,for predicting probability, l is the number of characteristic channels of the multispectral point cloud, +.>Pre-aligned features for the target domain node.

5. The cross-scene multispectral point cloud classification method based on pseudo tag learning of claim 4, wherein the method comprises the following steps: the Step4 updates the source domain-target domain alignment network parameters specifically as follows:

(1) All parameters are optimized using a standard back propagation algorithm;

(2) In training, the overall loss is a combination of source domain classification loss, maximum mean difference MMD loss and target domain shannon entropy loss, and the overall loss of training is as follows:

；

wherein,and->Is the balance coefficient of the balance loss.

6. A cross-scene multispectral point cloud classification method based on pseudo tag learning as claimed in claim 3, wherein: the adjacent matrix and the feature matrix in the splicing target domain in Step6 specifically are:

marking the target domain adjacency matrix asThe target domain feature of M dimension is +.>And->Splicing to obtain updated target domain characteristics +.>。

7. The cross-scene multispectral point cloud classification method based on pseudo tag learning of claim 1, wherein the method comprises the following steps of: the specific formula for calculating the target domain classification loss in Step7 is as follows:

；

wherein,pseudo tag which is the i-th point in the target domain scene,>is the predictive label of the ith point in the target domain scene, N _t For the number of unlabeled multispectral points in the target domain scene, < >>For the target domain scene graph feature extracted from Step2,/a>Is a set of target domain pseudo tags.

8. The cross-scene multispectral point cloud classification method based on pseudo tag learning of claim 1, wherein the method comprises the following steps of: the Step8 updates the target domain classification network parameters specifically as follows:

(1) All parameters are optimized using a standard back propagation algorithm;

(2) In training, the target domain classification loss in Step7 is used as a training loss.