CN114415254B

CN114415254B - Multi-case weak supervision mars surface morphology detection method based on online learning

Info

Publication number: CN114415254B
Application number: CN202210072417.3A
Authority: CN
Inventors: 张永强; 丁明理; 田瑞; 张印; 张子安; 王骢
Original assignee: Harbin Institute of Technology Shenzhen
Current assignee: Harbin Institute of Technology Shenzhen
Priority date: 2022-01-21
Filing date: 2022-01-21
Publication date: 2023-02-07
Anticipated expiration: 2042-01-21
Also published as: CN114415254A

Abstract

A multi-case weak supervision mars surface morphology detection method based on online learning belongs to the field of machine vision object detection. The method aims at the problems that the existing weak supervision object detection method is low in detection precision and converges on a local optimal solution. Generating a plurality of candidate boxes by adopting a selective search algorithm on the Mars long-shot image; extracting image features of the Mars perspective image by adopting a VGG16 network model; further obtaining the full connection characteristic of each candidate frame; in the weak supervision detection network, inputting the full connection characteristics of the candidate frames, judging the category of each candidate frame through classification and detection branches, scoring the position information of the candidate frames, and finally multiplying the scores of the two branches to obtain the score of the candidate frame as a case level label; and the K-grade refined network layer takes the score of each candidate box of the multi-case learning network or the previous-grade branch as supervision information, trains other optimized branches of the network, and performs backward propagation calculation. The method is used for detecting the surface topography and the target of the mars.

Description

Multi-instance Weakly Supervised Mars Surface Morphology Detection Method Based on Online Learning

技术领域technical field

本发明涉及基于在线学习的多事例弱监督火星表面形态检测方法，属于机器视觉物体检测技术领域。The invention relates to a multi-case weakly supervised Mars surface shape detection method based on online learning, and belongs to the technical field of machine vision object detection.

背景技术Background technique

火星是太阳系中紧邻地球的一颗类地行星，也是太阳系中与地球最相似的类地行星；水的发现使火星被认为是最可能孕育生命的星球之一，并成为开展空间探测的主要目标之一。目前航天事业已在地球卫星和载人航天工程中取得举世瞩目的成就，发展深空探测将是后续重点，对科技进步和社会发展具有重大意义。由于火星环境复杂多变，并时常伴有沙尘等天气，目前对火星探测的关键技术在于获取高分辨率的火星图像，并利用高分辨率图像对火星表面物体/地形进行检测，使火星车能平稳、安全地降落在火星表面，进而开展后续相关科研任务。Mars is the terrestrial planet next to Earth in the solar system and the most similar to the Earth in the solar system; the discovery of water has made Mars considered one of the most likely planets to harbor life and a prime target for space exploration one. At present, the aerospace industry has achieved world-renowned achievements in earth satellite and manned spaceflight projects. The development of deep space exploration will be the follow-up focus, which is of great significance to scientific and technological progress and social development. Due to the complex and changeable environment of Mars, which is often accompanied by dust and other weather, the current key technology for Mars exploration is to obtain high-resolution images of Mars, and use the high-resolution images to detect objects/terrains on the surface of Mars. It can land on the surface of Mars smoothly and safely, and then carry out follow-up related scientific research tasks.

视觉任务是火星车实现自主工作的关键环节，目前火星车已搭载了双目视觉环境感知、自主路径规划技术；随着火星探测的推进，更多的视觉技术将被应用于火星车以实现更复杂的探测任务。火星探测器搭载火星巡视车在火星进行行星探测任务，探测器为了更好的使火星巡视车平稳、安全地降落在火星表面，获取火星表面的地形地貌情况至关重要。Visual tasks are the key link for the Mars rover to achieve autonomous work. At present, the Mars rover is equipped with binocular visual environment perception and autonomous path planning technologies; with the advancement of Mars exploration, more visual technologies will be applied to the Mars rover to achieve more complex detection tasks. The Mars rover is equipped with a Mars rover to carry out planetary exploration missions on Mars. In order to better make the Mars rover land on the surface of Mars smoothly and safely, it is very important for the rover to obtain the topography of the Martian surface.

目前，针对物体检测经典的主流框架有基于边界框回归的YOLO算法、基于候选区域生成的RCNN系列算法等，但上述经典的物体检测算法采用的均为全监督的训练方式，需在训练集上进行事例级的标注(事例级标注意味着训练网络时不仅仅给出图像的类别，而且给出目标定位框以中心坐标和边界框的长宽表征准确位置)，通过回归的思想与基于候选区域的思想进行事例级别的训练，进一步得到被检测物体的类别和位置结果。然而，目前采集到的火星地形地貌图像不存在事例级标注信息，若采用人为进行标注的方式，会导致事例级标注的主观性太强，并且人为标注是一个费时又耗力的过程，上述的事例级标注的条件是很难被满足的。At present, the classic mainstream frameworks for object detection include YOLO algorithm based on bounding box regression, RCNN series algorithms based on candidate region generation, etc. Carry out case-level labeling (case-level labeling means that when training the network, not only the category of the image is given, but also the target positioning box is given to represent the exact position with the center coordinates and the length and width of the bounding box), through the idea of regression and based on the candidate area The idea of case-level training is carried out, and the category and position of the detected object are further obtained. However, there is no case-level labeling information in the currently collected images of Mars topography and geomorphology. If artificial labeling is used, the subjectivity of case-level labeling will be too strong, and manual labeling is a time-consuming and labor-intensive process. The above-mentioned The condition for instance-level labeling is difficult to satisfy.

在上述的事例级标注信息缺失的情况下，目前通常采用弱监督学习的方式来实现物体检测任务。在基于弱监督学习的物体检测任务中，只需要物体类别信息的标注(无需位置信息的标注)即可实现物体检测的目的。但是，在弱监督物体检测领域，存在两个主要问题：(1)模型对于初始化比较敏感，检测精度较低；(2)易于收敛于局部最优解，直观表现为只能检测到物体最有特征的区域，而不是全部物体区域，导致物体定位失败。In the absence of the above-mentioned case-level annotation information, weakly supervised learning is usually used to achieve object detection tasks. In the object detection task based on weakly supervised learning, only the labeling of object category information (no labeling of position information) is required to achieve the purpose of object detection. However, in the field of weakly supervised object detection, there are two main problems: (1) the model is sensitive to initialization, and the detection accuracy is low; (2) it is easy to converge to the local optimal solution, and the intuitive performance can only detect the most effective The region of the feature, rather than the entire object region, causes object localization to fail.

发明内容Contents of the invention

针对现有弱监督物体检测方法的检测精度低和收敛于局部最优解的问题，本发明提供一种基于在线学习的多事例弱监督火星表面形态检测方法。Aiming at the problems that the existing weakly supervised object detection method has low detection accuracy and converges to a local optimal solution, the present invention provides a multi-instance weakly supervised Mars surface morphology detection method based on online learning.

本发明的一种基于在线学习的多事例弱监督火星表面形态检测方法，包括，A multi-instance weakly supervised Mars surface form detection method based on online learning of the present invention, comprising:

采用火星远景图像构成训练集；每张火星远景图像配置地形类别标签；Use Mars prospect images to form a training set; each Mars perspective image is configured with a terrain category label;

设置在线网络包括侯选框生成单元、VGG16网络模型和弱监督检测网络；Set up an online network including a candidate frame generation unit, a VGG16 network model and a weakly supervised detection network;

在侯选框生成单元中对每张火星远景图像采用选择性搜索算法生成多个目标物体或目标地形的侯选框；采用在ImageNet上预训练好的VGG16网络模型对每张火星远景图像进行图像特征的提取；结合图像特征和每个侯选框的位置信息得到每个候选框的全连接特征；In the candidate frame generation unit, a selective search algorithm is used to generate candidate frames of multiple target objects or target terrains for each Mars prospect image; each Mars prospect image is imaged by using the VGG16 network model pre-trained on ImageNet Feature extraction; combine the image features and the position information of each candidate frame to obtain the fully connected features of each candidate frame;

采用弱监督检测网络对每个候选框的全连接特征进行检测，由检测结果获得候选框包含目标物体或目标地形的位置检测评分；同时对每个候选框的全连接特征进行分类，并由分类结果得到类别评分；由候选框的位置检测评分与类别评分的点乘结果获得侯选框的初级事例级标签；The weakly supervised detection network is used to detect the fully connected features of each candidate frame, and the position detection score of the candidate frame containing the target object or target terrain is obtained from the detection results; at the same time, the fully connected features of each candidate frame are classified, and classified The result is a category score; the primary case-level label of the candidate box is obtained from the dot product result of the position detection score of the candidate box and the category score;

以初级事例级标签作为监督信息，对每个侯选框的全连接特征采用K级精细化网络层逐级进行优化处理，获得侯选框的最终事例级标签；Using primary case-level labels as supervision information, the fully connected features of each candidate frame are optimized step by step using K-level refined network layers to obtain the final case-level labels of the candidate frames;

将每一级精细化网络层输出的事例级标签与相邻第一级精细化网络层输出的事例级标签或初级事例级标签进行比较，以及将每个候选框的分类结果与地形类别真值进行比较获得损失函数，基于损失函数优化弱监督检测网络，获得最终弱监督检测网络；Compare the case-level labels output by each level of refined network layer with the case-level labels output by the adjacent first-level refined network layer or the primary case-level labels, and compare the classification results of each candidate box with the ground truth of the terrain category The loss function is obtained by comparison, the weakly supervised detection network is optimized based on the loss function, and the final weakly supervised detection network is obtained;

采用最终弱监督检测网络对实时获取的火星远景图像进行火星表面形态检测。The final weakly supervised detection network is used to detect the surface morphology of Mars on the Mars perspective images acquired in real time.

根据本发明的基于在线学习的多事例弱监督火星表面形态检测方法，所述地形类别标签包括山丘、沟壑、石块区与平坦区。According to the online learning-based multi-instance weakly supervised Mars surface morphology detection method of the present invention, the terrain category labels include hills, ravines, rocky areas and flat areas.

根据本发明的基于在线学习的多事例弱监督火星表面形态检测方法，采用K级精细化网络层获得侯选框的最终事例级标签过程包括：According to the online learning-based multi-case weakly supervised Mars surface form detection method of the present invention, the final case-level labeling process of the candidate frame obtained by using a K-level refined network layer includes:

在一级精细化网络层中，采用softmax分类器对每个候选框的全连接特征进行事例级分类，采用事例级分类结果修正初级事例级标签获得一级修正事例级标签；In the first-level refined network layer, the softmax classifier is used to classify the fully connected features of each candidate box at the case level, and the case-level classification results are used to correct the primary case-level labels to obtain the first-level corrected case-level labels;

在二级精细化网络层中，采用softmax分类器对每个候选框的全连接特征进行事例级分类，采用事例级分类结果修正一级修正事例级标签获得二级修正事例级标签；In the second-level refined network layer, the softmax classifier is used to classify the fully connected features of each candidate box at the case level, and the case-level classification results are used to correct the first-level correction case-level label to obtain the second-level correction case-level label;

……...

在K级精细化网络层中，采用softmax分类器对每个候选框的全连接特征进行事例级分类，采用事例级分类结果修正K-1级修正事例级标签获得K级修正事例级标签作为最终事例级标签。In the K-level refined network layer, the softmax classifier is used to classify the fully connected features of each candidate box at the case level, and the case-level classification results are used to correct the K-1 level corrected case-level label to obtain the K-level corrected case-level label as the final Instance-level labels.

根据本发明的基于在线学习的多事例弱监督火星表面形态检测方法，所述VGG16网络模型通过卷积操作获得火星远景图像的图像特征；再结合图像特征对每个侯选框通过RoI池化操作获得侯选框的全连接特征。According to the multi-case weakly supervised Mars surface form detection method based on online learning of the present invention, the VGG16 network model obtains the image features of the Mars prospect image through a convolution operation; then combines the image features to perform an RoI pooling operation on each candidate frame Get the fully connected features of the candidate box.

根据本发明的基于在线学习的多事例弱监督火星表面形态检测方法，每张火星远景图像生成1000个目标物体或目标地形的侯选框。According to the online learning-based multi-case weakly supervised Mars surface morphology detection method of the present invention, each Mars prospect image generates 1000 candidate frames of target objects or target terrains.

根据本发明的基于在线学习的多事例弱监督火星表面形态检测方法，所述VGG16网络模型通过带有16层参数的权重进行反向传播。According to the multi-instance weakly supervised Mars surface form detection method based on online learning of the present invention, the VGG16 network model is backpropagated through weights with 16 layers of parameters.

根据本发明的基于在线学习的多事例弱监督火星表面形态检测方法，所述火星远景图像由NASA的mars32k数据集和GMSRI数据集提供。According to the multi-instance weakly supervised Mars surface morphology detection method based on online learning of the present invention, the Mars perspective image is provided by the mars32k data set and the GMSRI data set of NASA.

根据本发明的基于在线学习的多事例弱监督火星表面形态检测方法，对在线网络完成训练后，还对在线网络采用验证集进行验证，以及采用测试集进行测试；训练集、验证集与测试集的比例为3：1：1。According to the online learning-based multi-case weakly supervised Mars surface morphology detection method of the present invention, after the online network is trained, the online network is also verified using a verification set, and a test set is used for testing; training set, verification set and test set The ratio is 3:1:1.

本发明的有益效果：本发明可用于火星探测相关任务中在训练数据标注信息稀缺情况下的火星表面物体以及地形的检测。它用于执行火星探测器实际探测过程中降落时对地形地貌的检测任务，使火星车能够更好地寻找最适合巡视车降落的位置。Beneficial effects of the present invention: the present invention can be used for detection of Martian surface objects and topography in the case of scarcity of training data labeling information in Mars exploration-related tasks. It is used to perform the detection task of the terrain during the actual exploration of the Mars rover when it lands, so that the Mars rover can better find the most suitable position for the rover to land.

本发明方法仅通过图像级别(image-level)标注信息完成火星的地形地貌的检测，为探测器在火星上着陆并为后续规划任务中选择最有科学价值的星球表面位置提供契机。其中，多事例弱监督网络中，输入为每个候选框的全连接特征，两个并列的分类和检测分支的作用分别为判断每个候选框的类别和对每一个候选框的位置信息进行打分，最后将分类分支和检测分支的得分相乘得到此候选框的得分作为侯选框的事例级标签。在K级精细化网络层中，以多事例学习网络或者前一级分支的每个候选框的得分作为监督信息，对网络其他优化分支进行训练，并进行后向传播计算，进一步提高检测精度。The method of the present invention completes the detection of the topography of Mars only through image-level annotation information, providing an opportunity for the probe to land on Mars and select the most scientifically valuable planetary surface position for subsequent planning tasks. Among them, in the multi-instance weakly supervised network, the input is the fully connected feature of each candidate box, and the functions of the two parallel classification and detection branches are to judge the category of each candidate box and score the position information of each candidate box , and finally multiply the scores of the classification branch and the detection branch to obtain the score of the candidate box as the case-level label of the candidate box. In the K-level refined network layer, the multi-instance learning network or the score of each candidate frame of the previous branch is used as supervision information to train other optimization branches of the network, and perform backward propagation calculations to further improve detection accuracy.

本发明方法突破了全监督物体检测方法需要事例级标注信息的局限，解决了弱监督物体检测方法中检测精度低和收敛于局部最优解的问题，在不需要人为标注事例级标签的情况下利用本发明方法，在仅有图像级别(即给出每张图像中存在的地形、地貌种类)的监督信息下就可以达到火星表面物体以及地形检测的目的。The method of the present invention breaks through the limitation that the full-supervised object detection method requires case-level labeling information, solves the problems of low detection accuracy and convergence to a local optimal solution in the weakly supervised object detection method, and does not need to manually mark case-level labels. Utilizing the method of the invention, the purpose of detecting Martian surface objects and topography can be achieved under the supervision information of only the image level (that is, the topography and landform type existing in each image).

本发明方法作为火星探测任务中的基础性技术研究工作，可在一定程度上推动后续规划任务，并为后续规划任务提供一定的技术支撑。As the basic technical research work in the Mars exploration mission, the method of the invention can promote the follow-up planning task to a certain extent, and provide certain technical support for the follow-up planning task.

附图说明Description of drawings

图1是本发明所述基于在线学习的多事例弱监督火星表面形态检测方法的原理示意图；Fig. 1 is the schematic diagram of the principle of the multi-instance weak supervision Mars surface form detection method based on online learning according to the present invention;

图2是四种地形类别的火星远景图像对比示意图；Figure 2 is a schematic diagram of the comparison of Mars vision images of four terrain categories;

图3是火星远景图像的包与事例示意图；Fig. 3 is a schematic diagram of packages and instances of the Mars vision image;

图4是VGG16网络模型的结构示意图；Figure 4 is a schematic diagram of the structure of the VGG16 network model;

图5是K级精细化网络层的原理示意图；FIG. 5 is a schematic diagram of the principle of a K-level refined network layer;

图6是具体实施例的实验结果对比图。Fig. 6 is a comparison chart of experimental results of specific embodiments.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动的前提下所获得的所有其它实施例，都属于本发明保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

需要说明的是，在不冲突的情况下，本发明中的实施例及实施例中的特征可以相互组合。It should be noted that, in the case of no conflict, the embodiments of the present invention and the features in the embodiments can be combined with each other.

下面结合附图和具体实施例对本发明作进一步说明，但不作为本发明的限定。The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments, but not as a limitation of the present invention.

具体实施方式一、结合图1至图5所示，本发明提供了一种基于在线学习的多事例弱监督火星表面形态检测方法，包括，DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 1. As shown in FIGS. 1 to 5, the present invention provides a multi-instance weakly supervised Mars surface morphology detection method based on online learning, including:

本实施方式可以实现火星图像中火星表面物体和地形检测，尤其是使得火星表面物体和地形检测技术不再依赖于具有标记信息的大型数据库，只需知道图像中简单的物体或地形类别信息，不需要复杂的物体位置信息的标注就可对模型进行训练，进而达到火星表面物体和地形检测的目的。它主要通过火星表面物体(如石头、细沙)的特征识别，实现火星地形地貌的检测，为火星车着陆或火星车后续探测任务中的选择最具有科学价值的着陆点创造条件。This embodiment can realize the detection of Martian surface objects and terrain in Mars images, especially so that the Martian surface object and terrain detection technology no longer depends on a large database with label information, and only needs to know the simple object or terrain category information in the image. The labeling of complex object position information is required to train the model, and then achieve the purpose of detecting objects and terrain on the surface of Mars. It mainly realizes the detection of Martian topography through the feature recognition of Martian surface objects (such as stones and fine sand), and creates conditions for the Mars rover landing or the selection of the most scientifically valuable landing site in the Mars rover’s subsequent exploration missions.

作为示例，结合图2所示，所述地形类别标签包括山丘、沟壑、石块区与平坦区。这四种类别标签将作为每一张火星图像的图像级别标签，来训练在线网络。As an example, as shown in FIG. 2 , the terrain category labels include hills, ravines, stone areas, and flat areas. These four category labels will be used as image-level labels for each Mars image to train the online network.

进一步，结合图1和5所示，采用K级精细化网络层获得侯选框的最终事例级标签过程包括：Further, as shown in Figures 1 and 5, the final instance-level labeling process of the candidate box obtained by using the K-level refined network layer includes:

……...

本实施方式利用多事例学习方法来实现弱监督的物体和地形检测器。由于没有人为标注的位置信息作为物体位置回归的真值，弱监督物体和地形检测器通常会收敛于一个局部最优解，导致局部聚焦的现象，如只能突出物体和地形的局部最有判别力的区域。为了提高识别率，本发明提出一种在线优化的策略，其中包含若干个与多事例学习检测网络相平行的优化分支，如图1所示。训练时以弱监督检测网络或前一精细化网络层的输出结果作为监督信息，来监督训练后续精细化网络层的优化分支，进而进一步实现火星表面物体和地形准确检测的目的。The present embodiment utilizes a multi-instance learning method to implement a weakly supervised object and terrain detector. Since there is no human-labeled position information as the true value of object position regression, weakly supervised object and terrain detectors usually converge to a local optimal solution, leading to local focusing phenomena, such as only highlighting the local most discriminative of objects and terrain area of force. In order to improve the recognition rate, the present invention proposes an online optimization strategy, which includes several optimization branches parallel to the multi-instance learning and detection network, as shown in FIG. 1 . During training, the weakly supervised detection network or the output result of the previous refined network layer is used as supervision information to supervise and train the optimization branch of the subsequent refined network layer, and further achieve the purpose of accurate detection of Martian surface objects and terrain.

综上，本发明方法包括物体检测部分(基于多事例的弱监督检测网络)与优化部分(K级精细化网络层)，物体检测部分实际上是对目标物体进行粗定位，通过检测与分类网络后进行element-wise运算得到以C×R的矩阵，其中C为目标的种类，即为石块区、沟壑、山丘域与平坦区，R为候选框的数量。接着，通过K次精细化网络使目标框减少局部聚焦的情况，使物体检测的输出位置更加的精准且满足实际需要。In summary, the method of the present invention includes an object detection part (weakly supervised detection network based on multiple instances) and an optimization part (K-level refined network layer). Then perform element-wise operation to obtain a C×R matrix, where C is the type of target, that is, rocky areas, ravines, hilly areas, and flat areas, and R is the number of candidate frames. Then, through the K-th refinement network, the local focus of the target frame is reduced, so that the output position of the object detection is more accurate and meets the actual needs.

所述K级精细化网络层并不是交替训练，将训练好的数据重新标上标签，进一步反复迭代多次训练的思想，而是通过与最高得分的候选区域交并比大于某一阈值的区域进行同类化的方法进行反复精细化，进一步修正每个候选区域的事例级的标签，从而可以得到一个基于在线学习的多事例弱监督火星表面物体和地形检测网络，进而解决现有基于多事例弱监督检测器的位置不准确，检测精度低的问题。The K-level refined network layer is not alternately trained, the trained data is re-labeled, and the idea of further iterating multiple trainings is repeated, but the area with the highest score candidate area whose intersection and union ratio is greater than a certain threshold The homogeneous method is repeatedly refined, and the case-level labels of each candidate area are further corrected, so that a multi-instance weakly supervised Martian surface object and terrain detection network based on online learning can be obtained, and then the existing multi-instance weak supervision network can be solved. The position of the supervisory detector is not accurate and the detection accuracy is low.

再进一步，结合图1和4所示，所述VGG16网络模型通过卷积操作获得火星远景图像的图像特征；再结合图像特征对每个侯选框通过RoI池化操作获得侯选框的全连接特征。Further, as shown in Figures 1 and 4, the VGG16 network model obtains the image features of the Mars vision image through the convolution operation; then combines the image features for each candidate frame through the RoI pooling operation to obtain the full connection of the candidate frame feature.

作为示例，结合图1所示，每张火星远景图像生成1000个目标物体或目标地形的侯选框。As an example, as shown in FIG. 1 , each Mars distant image generates 1000 candidate frames of target objects or target terrains.

在弱监督检测网络中，对于每张仅有火星地形地貌类别标签的火星图像数据，首先采用选择性搜索算法生成约1000个目标物体和区域可能出现的位置，每个目标物体和区域可能出现的位置称为候选区域(proposals)，用侯选框标记；接着使用VGG16网络模型得到每个候选区域的全连接特征，这些被提取的全连接特征输入到后面的检测分类网络中，实现火星图像中的物体和地形检测任务。In the weakly supervised detection network, for each piece of Mars image data that only has the label of the Martian terrain and landform category, a selective search algorithm is first used to generate about 1,000 possible locations of target objects and regions, and the possible locations of each target object and region The position is called the candidate area (proposals), marked with the candidate box; then use the VGG16 network model to obtain the fully connected features of each candidate area, and these extracted fully connected features are input into the subsequent detection and classification network to realize the Mars image. object and terrain detection tasks.

所述VGG16网络模型通过带有16层参数的权重进行反向传播。The VGG16 network model is backpropagated through weights with 16 layers of parameters.

作为示例，所述火星远景图像由NASA的mars32k数据集和GMSRI数据集提供。As an example, the Mars distant image is provided by NASA's mars32k dataset and the GMSRI dataset.

本实施方式在实际使用中，具体检测对象的类别可以根据用户的实际问题决定。In actual use of this embodiment, the category of the specific detection object may be determined according to the actual problem of the user.

本实施方式在训练过程中可以采用好奇号火星车拍摄的真实火星图像和利用学习的方法产生的火星图像构成数据集。NASA提供的数据集包含2012年8月至2018年11月期间好奇号(Curiosity)火星车在火星上收集的32368幅彩色图像和生成的约30000幅彩色火星图像。这些图像显示了火星的各种地理和地质特征，如山脉和山谷，火山口，沙丘和岩石地形等，图像分辨率为560×500px。mars32k和GMSRI数据集图像包括火星近景，远景，较大块岩石，细小沙粒以及在光线昏暗时拍摄的一些图像，由于探测器距离火星表面千米级别时需进行降落平坦区域的搜索，因此采用火星远景图像进行作为训练集，图像类别分为山丘、沟壑、石块区域与平坦区域四类。利用这些仅有的类别标签训练在线网络，不需要位置信息的标注，不需要大量的人力物力去标注数据库，可避免人为标注主观性引入的偏差。In this embodiment, during the training process, the real Mars images taken by the Curiosity rover and the Mars images generated by the learning method can be used to form a data set. The dataset provided by NASA contains 32,368 color images collected by the Curiosity rover on Mars from August 2012 to November 2018 and about 30,000 color Martian images generated. These images show various geographical and geological features of Mars, such as mountains and valleys, craters, dunes and rocky terrain, etc. The image resolution is 560×500px. The mars32k and GMSRI data set images include close-up views of Mars, long-range views, larger rocks, fine sand grains, and some images taken in dim light. Since the probe needs to search for flat areas when it is kilometers away from the surface of Mars, it uses Mars prospect images are used as a training set, and the image categories are divided into four categories: hills, ravines, rocky areas, and flat areas. Using these only category labels to train the online network does not require labeling of location information, and does not require a lot of manpower and material resources to label the database, which can avoid the bias introduced by the subjectivity of human labeling.

再进一步，对在线网络完成训练后，还对在线网络采用验证集进行验证，以及采用测试集进行测试；训练集、验证集与测试集的比例为3：1：1。Furthermore, after the online network is trained, the online network is also verified with a verification set and tested with a test set; the ratio of the training set, verification set, and test set is 3:1:1.

综上，本实施方式在不需要位置标注信息的情况下可以得到较高的火星表面物体和地形检测率。可促进基于火星图像物体和地形检测技术的发展，为后续火星探测的相关任务提供一定的技术支撑。To sum up, this embodiment can obtain a higher detection rate of surface objects and terrains on Mars without the need for location labeling information. It can promote the development of object and terrain detection technology based on Mars images, and provide certain technical support for subsequent tasks related to Mars exploration.

具体实施例：Specific examples:

首先根据用户实际需求准备训练样本，然后依据多事例学习(MIL)方法训练一个弱监督物体/地形检测器。之后，利用在线优化的策略，进一步提高弱监督物体和地形的检测精度，得到更为准确的火星表面地形检测结果。下面对每部分进行详细描述：Firstly, training samples are prepared according to the actual needs of users, and then a weakly supervised object/terrain detector is trained according to the multiple instance learning (MIL) method. After that, the online optimization strategy is used to further improve the detection accuracy of weakly supervised objects and terrain, and obtain more accurate detection results of Martian surface terrain. Each part is described in detail below:

首先准备训练样本。First prepare the training samples.

然后训练基于多事例学习的弱监督检测网络。A weakly supervised detection network based on multi-instance learning is then trained.

所述弱监督检测网络为一种端到端的检测方法，其中在多事例学习中，包的标签是已知的，事例的标签是未知的，包的标签仅仅说明这副图像存在什么种类的目标，但是目标的位置是不知道的。以火星图像为例，每一张560×500的火星图像就是一个包，图像中的一些区域(Patches)即为事例，如图3所示。从图3可以看出，该包存在山丘与石块区，不存在沟壑；针对于包只存在类别的标签：The weakly supervised detection network is an end-to-end detection method, wherein in multi-instance learning, the label of the package is known, and the label of the case is unknown, and the label of the package only shows what kind of target exists in this pair of images , but the location of the target is unknown. Taking the Mars image as an example, each 560×500 Mars image is a package, and some areas (Patches) in the image are examples, as shown in Figure 3. It can be seen from Figure 3 that there are hills and rocks in the package, but no ravines; there are only category labels for the package:

y＝[y₁,y₂,y₃,…，y_n]∈R^C×l，y=[y ₁ ,y ₂ ,y ₃ ,...,y _n ]∈RC ^×l ,

其中y_i表示目标是种类i的标签，i＝1，2，3，…，n；n为图像中区域(Patches)的个数，C为类别标签的个数；Among them, y _i indicates that the target is a label of type i, i=1, 2, 3,..., n; n is the number of regions (Patches) in the image, and C is the number of category labels;

y_i取值为+1或-1。The value of y _i is +1 or -1.

若标注的类别标签为{1,-1,1}，表示存在山丘区域事例，不存在沟壑区域事例，存在石块区域事例，在训练样本的标签中不存在山丘区域与石块区域的具体位置信息。接着，对于输入的每张火星图像，采用选择性搜索算法生成1000个目标物体/地形可能出现的位置，每个目标物体/地形可能出现的位置称为候选区域(proposals)。接着，使用预训练好的VGG16网络模型提取特征，最后利用RoI池化的方法获取每一个候选区域的特征，进而得到每个候选区域的全连接特征，这些被提取的全连接特征输入到后面的检测分类网络中，实现火星图像中的物体/地形检测任务。其中，VGG16网络结构通过带有16层参数的权重进行反向传播而命名，网络结构包含以下几部分：特征提取网络，全连接网络，并行的两个分支检测与识别网络，如图4所示为基于多事例学习的弱监督火星表面物体/地形检测器网络的结构。If the labeled category label is {1,-1,1}, it means that there are examples of hilly areas, no examples of gully areas, and examples of rocky areas, and there are no examples of hilly areas and rocky areas in the labels of training samples. specific location information. Next, for each input image of Mars, a selective search algorithm is used to generate 1000 possible locations of target objects/terrains, and each possible location of target objects/terrains is called a candidate area (proposals). Then, use the pre-trained VGG16 network model to extract features, and finally use the RoI pooling method to obtain the features of each candidate area, and then obtain the fully connected features of each candidate area, and these extracted fully connected features are input to the following In the detection and classification network, the task of object/terrain detection in Mars images is realized. Among them, the VGG16 network structure is named after backpropagation with weights of 16 layers of parameters. The network structure includes the following parts: feature extraction network, fully connected network, and two parallel branch detection and recognition networks, as shown in Figure 4 Architecture for Weakly Supervised Martian Surface Object/Terrain Detector Networks Based on Multiple Instance Learning.

设计在线优化策略。为避免边界框输出结果容易出现在局部信息较强的物体区域上，不是准确的完整的位置，本实施例在弱监督检测器的基础上，提出在线学习的K次精细化修正策略，旨在实现高分边界框的小区域向大区域传递标签信息的能力，如图5所示。由于弱监督检测只存在图像级标签不存在事例级的标签，本发明的思想是通过事例集标签与得分矩阵进行交叉熵损失函数的计算，反向传播更新得分矩阵，因此得到事例级标签是尤为重要的。首先{C+1}维的候选区域为每一类的概率，第二个输出的矩阵是R×C的矩阵，通过前一个精细化输出的事例级标签对此矩阵计算损失反向传播优化此得分矩阵，每个候选区域都会产生一个C+1维向量，k表示当前的优化次数，其中k＝{0,1…,K-1}。Design online optimization strategies. In order to prevent the output result of the bounding box from easily appearing in the object area with strong local information, which is not an accurate and complete position, this embodiment proposes a K-time refinement correction strategy for online learning on the basis of a weakly supervised detector, aiming at Realize the ability to transfer label information from a small area of a high-scoring bounding box to a large area, as shown in Figure 5. Since there are only image-level labels and no case-level labels in weakly supervised detection, the idea of the present invention is to calculate the cross-entropy loss function through the case set labels and the score matrix, and update the score matrix through backpropagation, so the case-level labels are obtained. important. First, the {C+1}-dimensional candidate area is the probability of each class, and the second output matrix is an R×C matrix. The loss backpropagation is optimized for this matrix through the case-level labels output by the previous refinement. Score matrix, each candidate area will generate a C+1-dimensional vector, k represents the current optimization times, where k={0,1...,K-1}.

选用NASA的mars32k和GMSRI数据集作为发明的训练、验证以及验证数据使用，数据集的60％作为训练集，通过迭代次数降低损失，数据集的20％作为验证集，在一定的迭代次数后验证此时模型的泛化能力，剩余的20％数据集作为测试集，证明网络结构的泛化能力与算法在火星地形地貌的检测中的有效性。选用预训练好的VGG16对火星地形地貌特征提取，训练弱监督基本检测器时采取20个epoch迭代次数，前10个epoch的学习率为10^-5，后10个epoch的学习率为10^-6。优化器选择Adam，此优化器引入动量法，使得参数更新时脱离局部最优，动量和权重衰退分别设定为0.9和0.0005。引入在线优化策略后，网络训练采用随机梯度下降的优化器，每次优化输入图像的mini-batch为2张(mini-batch＝2)，动量和权重衰退仍然设定为0.9和0.0005，精细化时Iou阈值为It＝0.5。最后，非极大抑制NMS被设定为0.3，去计算最终的评价指标mAP和CorLoc。精细化次数通过试验选取为K＝4。为了增加训练数据样本，采用了数据增广的方式，设置输入火星图像随机水平翻转的概率p＝0.5，将图像的长和宽的比例固定在0.5-2之间，若不满足上述条件，训练集、验证集、测试集将最短的边设置为{480，576，688，864，1200}，最长边不超过2000像素。NASA's mars32k and GMSRI data sets are used as the training, verification and verification data of the invention. 60% of the data set is used as the training set, and the loss is reduced through the number of iterations. 20% of the data set is used as the verification set, which is verified after a certain number of iterations. At this time, the generalization ability of the model, the remaining 20% of the data set is used as the test set, which proves the generalization ability of the network structure and the effectiveness of the algorithm in the detection of Mars topography. Select the pre-trained VGG16 to extract the features of the Martian topography, and use 20 epoch iterations when training the weakly supervised basic detector. The learning rate of the first 10 epochs is 10 ^-5 , and the learning rate of the last 10 epochs is 10 ^-6 . The optimizer chooses Adam, and this optimizer introduces the momentum method, which makes the parameter update deviate from the local optimum, and the momentum and weight decay are set to 0.9 and 0.0005, respectively. After the online optimization strategy is introduced, the network training adopts the optimizer of stochastic gradient descent, and the mini-batch of each optimized input image is 2 (mini-batch=2), the momentum and weight decay are still set to 0.9 and 0.0005, and the refinement When the Iou threshold is It=0.5. Finally, the non-maximum suppression NMS is set to 0.3 to calculate the final evaluation indicators mAP and CorLoc. The number of refinements is selected as K=4 through experiments. In order to increase the training data samples, the method of data augmentation is adopted, the probability of random horizontal flipping of the input Mars image is set to p=0.5, and the ratio of the length and width of the image is fixed between 0.5-2. If the above conditions are not met, the training set, validation set, test set set the shortest side to {480, 576, 688, 864, 1200}, and the longest side does not exceed 2000 pixels.

经上述步骤训练的基于在线学习的多事例弱监督火星表面物体和地形检测网络，在不需要事例级别标注信息的情况下可以实现火星表面物体和地形检测功能，可以根据具体需求应用到火星车着陆点的选择，以及火星探测后续规划任务中的选择最具有科学价值落地点的相关任务中，不受现有物体检测方法需要完全信息标注数据库的限制，不需要花费人力物力去对每一个训练样本图像进行标注。The multi-instance weakly supervised Martian surface object and terrain detection network based on online learning trained by the above steps can realize the Martian surface object and terrain detection function without the need for instance-level labeling information, and can be applied to Mars rover landing according to specific needs The selection of points, as well as the selection of the most scientifically valuable landing points in the follow-up planning tasks of Mars exploration, are not limited by the existing object detection methods that require complete information labeling databases, and do not need to spend manpower and material resources on each training sample. Annotate the image.

实验证明本发明方法对于火星表面地形检测精度高，定位准确，表1和标2为实验结果对比数据：The experiment proves that the method of the present invention has high detection accuracy and accurate positioning for the surface terrain of Mars. Table 1 and Standard 2 are comparative data of experimental results:

表1以mAP(％)为评价指标的实验结果对比数据表Table 1 Comparison data table of experimental results with mAP (%) as evaluation index

表2以CorLoc(％)为评价指标的实验结果对比数据表Table 2 Take CorLoc (%) as the experimental result comparison data table of the evaluation index

其中mAP是平均准确率(mean Average Precision)，是对测试样本进行评估的一个指标，Corloc是正确定位率(Correct Location)，是对训练过程中训练样本的定位效果进行评估的一个指标。从对比数据中可以看出，仅仅进行弱监督检测mAP/Corloc为50.6％/69.4％，相比于引入在线优化的策略将mAP/Corloc提高为58.3％/78.3，明显地改善了弱监督的局部聚焦的问题，证明了在线优化策略的有效性。进一步，以生成伪真值作为训练的事例级标签的全监督检测器，通过充分利用了全监督学习的强回归能力，与弱监督物体检测器的输出结果相比，物体最终的位置坐标信息更为准确，进一步从侧面证明了本发明方法的有效性。图6为实验结果图，测试过程中对火星表面地形地貌的位置进行检测，并且侯选边界框将火星表面的物体和地形地貌框选出来；通过探测器寻找到石块区域、山丘区域、沟壑区域，进一步完成下落任务中平坦区域的选择，从检测区域可以清晰地观测到大型石块、石块密集区域、山丘区域与沟壑区域。Among them, mAP is the mean average precision, which is an index for evaluating the test samples, and Corloc is the correct location rate (Correct Location), which is an index for evaluating the positioning effect of the training samples during the training process. It can be seen from the comparative data that only weak supervision detection mAP/Corloc is 50.6%/69.4%, compared with the introduction of online optimization strategy to increase mAP/Corloc to 58.3%/78.3, which significantly improves the local area of weak supervision Focusing on the problem, the effectiveness of the online optimization strategy is demonstrated. Furthermore, the fully supervised detector that uses the generated false ground truth as the training case-level label fully utilizes the strong regression ability of fully supervised learning. Compared with the output of the weakly supervised object detector, the final position coordinate information of the object is more accurate. For accuracy, the effectiveness of the method of the present invention is further proved from the side. Figure 6 is a diagram of the experimental results. During the test, the position of the topography on the surface of Mars is detected, and the candidate bounding box selects the objects and topography on the surface of Mars; the rock area, hill area, The ravine area further completes the selection of flat areas in the whereabouts task. Large rocks, dense rock areas, hill areas, and ravine areas can be clearly observed from the detection area.

虽然在本文中参照了特定的实施方式来描述本发明，但是应该理解的是，这些实施例仅仅是本发明的原理和应用的示例。因此应该理解的是，可以对示例性的实施例进行许多修改，并且可以设计出其他的布置，只要不偏离所附权利要求所限定的本发明的精神和范围。应该理解的是，可以通过不同于原始权利要求所描述的方式来结合不同的从属权利要求和本文中所述的特征。还可以理解的是，结合单独实施例所描述的特征可以使用在其它所述实施例中。Although the invention is described herein with reference to specific embodiments, it should be understood that these embodiments are merely illustrative of the principles and applications of the invention. It is therefore to be understood that numerous modifications may be made to the exemplary embodiments and that other arrangements may be devised without departing from the spirit and scope of the invention as defined by the appended claims. It shall be understood that different dependent claims and features described herein may be combined in a different way than that described in the original claims. It will also be appreciated that features described in connection with individual embodiments can be used in other described embodiments.

Claims

1. A multi-case weak supervision mars surface morphology detection method based on online learning is characterized by comprising the following steps,

forming a training set by using a Mars perspective image; configuring a terrain category label for each Mars long shot image;

the online network is arranged to comprise a candidate box generation unit, a VGG16 network model and a weak supervision detection network;

a candidate frame generating unit generates a plurality of candidate frames of target objects or target terrains for each Mars long shot image by adopting a selective search algorithm; extracting image features of each Mars perspective image by adopting a VGG16 network model pre-trained on ImageNet; combining the image characteristics and the position information of each candidate frame to obtain the full-connection characteristics of each candidate frame;

detecting the full-connection characteristics of each candidate frame by adopting a weak supervision detection network, and obtaining a position detection score of the candidate frame including a target object or a target terrain according to a detection result; classifying the full-connection characteristics of each candidate frame, and obtaining a classification score according to a classification result; obtaining a primary case-level label of the candidate box according to the dot product result of the position detection score and the category score of the candidate box;

taking the primary case-level label as supervision information, and performing optimization processing on the full connection characteristics of each candidate frame step by adopting a K-grade fine network layer to obtain a final case-level label of the candidate frame;

comparing the case-level label output by each refinement network layer with the case-level label or the primary case-level label output by the adjacent previous refinement network layer, comparing the classification result of each candidate box with the terrain category true value to obtain a loss function, and optimizing the weak supervision detection network based on the loss function to obtain a final weak supervision detection network;

carrying out Mars surface morphology detection on the Mars long shot image acquired in real time by adopting a final weak supervision detection network;

the process of obtaining the final case level label of the candidate box by adopting the K-level refinement network layer comprises the following steps:

in a first-level refinement network layer, performing case-level classification on the full-connection features of each candidate frame by adopting a softmax classifier, and correcting the primary case-level labels by adopting case-level classification results to obtain first-level corrected case-level labels;

in a second-level refinement network layer, performing case-level classification on the full-connection features of each candidate frame by adopting a softmax classifier, and correcting a first-level correction case-level label by adopting a case-level classification result to obtain a second-level correction case-level label;

……

in a K-level refinement network layer, performing case-level classification on the full-connection features of each candidate frame by adopting a softmax classifier, and correcting a K-1-level correction case-level label by adopting a case-level classification result to obtain a K-level correction case-level label as a final case-level label;

generating 1000 candidate frames of target objects or target terrains by each Mars long shot image;

the VGG16 network model performs back propagation through weights with 16-layer parameters;

the Mars perspective image is provided by the mars32k dataset and the GMRI dataset of NASA;

the VGG16 network model adopts 20 epoch iteration times when training the weak supervision detection network, and the learning rate of the first 10 epochs is 10 ^-5 And the learning rate of the last 10 epochs is 10 ^-6 (ii) a The optimizer selects Adam.

2. The method of claim 1, wherein the terrain category labels comprise hills, ravines, stone regions, and flat regions.

3. The multi-case weak supervision mars surface morphology detection method based on online learning of claim 2, wherein the VGG16 network model obtains image features of mars perspective images through convolution operation; and combining the image characteristics to obtain the full connection characteristics of the candidate frames for each candidate frame through the RoI pooling operation.

4. The multi-case weak supervision mars surface morphology detection method based on online learning of claim 3, wherein after the online network is trained, the online network is verified by using a verification set, and is tested by using a test set; the proportion of the training set, the verification set and the test set is 3:1:1.