CN108805029B - A ground-based cloud image recognition method based on significant dual activation coding - Google Patents
A ground-based cloud image recognition method based on significant dual activation coding
- Publication number
- CN108805029B (publication) · CN201810433104.XA / CN201810433104A (application)
- Authority
- CN
- China
- Prior art keywords
- activation
- image
- feature vector
- convolution
- significant
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
- G06V20/13—Satellite images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
Abstract
The embodiment of the present invention discloses a ground-based cloud image recognition method based on salient dual activation coding, comprising: inputting a training ground-based cloud image into a convolutional neural network to obtain convolution activation maps; using the shallow convolution activation maps to obtain salient local image regions, and extracting features from these regions to obtain salient feature vectors; obtaining the corresponding image regions on the deep convolution activation maps and learning the corresponding weights; based on the salient feature vectors and weights, obtaining the set of weighted salient feature vectors of the training ground-based cloud image, and performing salient dual activation coding on this set to obtain the weighted salient feature vector; and obtaining the weighted salient feature vector of a test ground-based cloud image and classifying it to obtain the recognition result. The invention extracts features from both the shallow and the deep convolution activation maps of a convolutional neural network, mining features that carry salient structure and texture information as well as high-level semantic information, and further applies salient dual activation coding to obtain the most representative feature representation of the ground-based cloud image, thereby improving classification accuracy.
Description
Technical Field
The invention belongs to the technical fields of pattern recognition and artificial intelligence, and in particular relates to a ground-based cloud image recognition method based on salient dual activation coding.
Background
In the field of atmospheric science, the formation of clouds, their shape characteristics, and the amount of cloud cover reflect the motion of the atmosphere and are among the important indicators of coming weather changes, playing a crucial role in weather forecasting and early warning. Ground-based cloud observation is an important means of cloud observation, and the automatic classification of ground-based cloud images is of great significance to climate and meteorological research. At present, experts at home and abroad have carried out research in related fields. Isosalo et al. used local texture information, such as Local Binary Patterns (LBP) and Local Edge Patterns (LEP), to distinguish five sky types: altocumulus, cirrus, stratus, cumulus, and clear sky. Calbo et al. extracted Fourier-transform image information and image statistics to describe ground-based cloud images and classified the same five sky types. Heinle et al. described ground-based cloud images with spectral, texture, and color information for ground-based cloud image classification. Xiao et al. further extracted texture, structure, and color information by dense sampling to classify different sky types. Wang et al., building on rotation-invariant LBP, proposed Stable LBPs to classify different cloud types. With the great success of Convolutional Neural Networks (CNNs) in pattern recognition, image processing, and other fields, CNNs have also been applied to ground-based cloud image classification, where they outperform traditional methods based on hand-crafted features. Ye et al. were the first to use CNNs for ground-based cloud image classification, with a marked improvement in accuracy. Zhang et al. improved cross-domain ground-based cloud classification by encoding local features on convolution activation maps. Furthermore, Shi et al. argued that, for representing ground-based cloud images, features from deep convolution activation maps outperform traditional hand-crafted features. However, all of the above CNN-based methods extract features from a single convolutional layer and therefore cannot obtain complete ground-based cloud image information. For feature representation, these methods usually aggregate the extracted features into a single feature vector by max pooling or average pooling, and such feature vectors typically lack discriminative power. Therefore, further innovation in the feature representation of ground-based cloud images is needed to improve classification accuracy.
Summary of the Invention
The object of the present invention is to solve the problem of ground-based cloud image classification. To this end, the present invention provides a ground-based cloud image recognition method based on salient dual activation coding.
To achieve this object, the present invention proposes a ground-based cloud image recognition method based on salient dual activation coding, the method comprising the following steps:
Step S1: preprocess multiple input ground-based cloud images to obtain training ground-based cloud images;
Step S2: input the training ground-based cloud images into a convolutional neural network to obtain convolution activation maps;
Step S3: use the shallow convolution activation maps to obtain salient local image regions of the training ground-based cloud image;
Step S4: extract features from each salient local image region to obtain the corresponding salient feature vector;
Step S5: use the shallow salient local image regions to obtain the corresponding image regions on the deep convolution activation maps, and learn the weights corresponding to those image regions;
Step S6: based on the salient feature vectors and weights, obtain the set of weighted salient feature vectors corresponding to the training ground-based cloud image;
Step S7: perform salient dual activation coding on the set of weighted salient feature vectors to obtain the weighted salient feature vector corresponding to the training ground-based cloud image;
Step S8: obtain the weighted salient feature vector of a test ground-based cloud image, and classify the test ground-based cloud image based on this vector to obtain the recognition result.
Optionally, step S1 includes the following steps:
Step S11: normalize the size of each input ground-based cloud image to H×W to obtain a training ground-based cloud image, where H and W denote the height and width of the training ground-based cloud image, respectively;
Step S12: obtain the category label of each training ground-based cloud image.
Optionally, step S2 includes the following steps:
Step S21: select a convolutional neural network, initialize it, and modify the number of outputs at the end of the network to D, the number of ground-based cloud categories;
Step S22: input the training ground-based cloud images into the initialized convolutional neural network to obtain convolution activation maps.
Optionally, step S3 includes the following steps:
Step S31: obtain the set of shallow convolution activation maps corresponding to a preset shallow convolutional layer; this set can be expressed as a tensor of size Hs×Ws×Ns, where the subscript s denotes the index of the shallow layer, Hs and Ws denote the height and width of that layer's convolution activation maps, and Ns denotes the number of convolution activation maps in that layer;
Step S32: concatenate the activation responses at each identical position across all convolution activation maps of the shallow layer to obtain an Ns-dimensional local feature vector;
Step S33: densely sample all convolution activation maps of the shallow layer with a sliding window, and compute the activation response saliency value Sk of each sliding window from the local feature vectors, where the subscript k denotes the k-th sliding window;
Step S34: sort the saliency values Sk in descending order, and select the sliding windows corresponding to the top K values as salient local image regions, obtaining K salient local image regions of the training ground-based cloud image.
Optionally, the size of the sliding window is a×a, and the stride of the dense sampling is a/2.
Optionally, the activation response saliency value Sk of the k-th sliding window on the shallow convolution activation maps is expressed as:
Sk = ‖mk‖2,
where ‖·‖2 denotes the 2-norm of a vector, xi denotes the local feature vector at the i-th position, a² = a×a denotes the number of local feature vectors within the k-th sliding window, and mk denotes the mean feature vector of the k-th sliding window, i.e., the average of all local feature vectors within the window:
mk = (1/a²) Σ_{i=1..a²} xi.
Optionally, in step S4, each salient local image region is represented by the salient feature vector mk.
Optionally, step S5 includes the following steps:
Step S51: obtain the set of deep convolution activation maps corresponding to a preset deep convolutional layer; this set can be expressed as a tensor of size Hd×Wd×Nd, where the subscript d denotes the index of the deep layer, Hd and Wd denote the height and width of that layer's convolution activation maps, and Nd denotes the number of convolution activation maps in that layer.
The deep convolutional layer may be selected from the convolutional layers in the second half of the convolutional neural network.
Step S52: concatenate the activation responses at each identical position across all convolution activation maps of the deep layer to obtain an Nd-dimensional local feature vector;
Step S53: according to the salient local image regions of the shallow convolutional layer, obtain the K corresponding image regions of size b×b on the deep convolution activation maps;
Step S54: compute the weight corresponding to each image region, expressed as:
wk = (1/b²) Σ_{j=1..b²} ‖yj‖2,
where wk denotes the weight of the k-th image region, yj denotes the local feature vector at the j-th position, and b² = b×b denotes the number of local feature vectors in the k-th image region.
Optionally, in step S6, from the salient feature vectors mk of the K salient local image regions on the shallow convolution activation maps and the weights wk of the K image regions on the deep convolution activation maps, the set χ of weighted salient feature vectors of each training ground-based cloud image is obtained:
χ = {w1m1, w2m2, ..., wKmK}.
Optionally, in step S7, the weighted salient feature vector is expressed as:
h = (u⊙m)((u⊙m)^T(u⊙m))^(-1) C,
where ⊙ denotes element-wise multiplication, u⊙m stacks the weighted salient feature vectors wkmk, and C is a constant vector whose elements all equal c.
Beneficial effects of the invention: the invention extracts features from both the shallow and the deep convolution activation maps of a convolutional neural network, which makes it possible to mine features with salient structure and texture information as well as high-level semantic information; through salient dual activation coding it further obtains the most representative feature representation of the ground-based cloud image, thereby improving the accuracy of ground-based cloud image classification.
It should be noted that this work was supported by the National Natural Science Foundation of China under grants No. 61501327 and No. 61711530240, the Key Project of the Tianjin Natural Science Foundation under grant No. 17JCZDJC30600, the Youth Fund of the Tianjin Applied Basic and Frontier Technology Research Program under grant No. 15JCQNJC01700, the Tianjin Normal University "Young Top Research Talent Cultivation Program" under grant No. 135202RC1703, the Open Projects Program of the National Laboratory of Pattern Recognition under grants No. 201700001 and No. 201800002, the China Scholarship Council under grants No. 201708120040 and No. 201708120039, and the Tianjin Higher Education Creative Team Funds Program.
Brief Description of the Drawings
FIG. 1 is a flowchart of the ground-based cloud image recognition method based on salient dual activation coding according to an embodiment of the present invention.
Detailed Description of the Embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to specific embodiments and the accompanying drawings. It should be understood that these descriptions are exemplary only and are not intended to limit the scope of the invention. In addition, descriptions of well-known structures and techniques are omitted below to avoid unnecessarily obscuring the concepts of the present invention.
FIG. 1 is a flowchart of the ground-based cloud image recognition method based on salient dual activation coding according to an embodiment of the present invention; some specific implementation flows of the present invention are described below taking FIG. 1 as an example. The specific steps of the method are as follows:
Step S1: preprocess multiple input ground-based cloud images to obtain training ground-based cloud images.
The preprocessing includes the following steps:
Step S11: normalize the size of each input ground-based cloud image to H×W to obtain a training ground-based cloud image, where H and W denote the height and width of the training ground-based cloud image, respectively.
In an embodiment of the present invention, H×W = 224×224.
Step S12: obtain the category label of each training ground-based cloud image.
Step S2: input the training ground-based cloud images into a convolutional neural network to obtain convolution activation maps.
Further, step S2 includes the following steps:
Step S21: select a typical deep-learning convolutional neural network, initialize it, and modify the number of outputs at the end of the network to D, the number of ground-based cloud categories.
In an embodiment of the present invention, the convolutional neural network is VGG19 and seven categories of ground-based cloud images are classified, so D = 7.
Step S22: input the training ground-based cloud images into the initialized convolutional neural network to obtain convolution activation maps.
Step S3: use the shallow convolution activation maps to obtain salient local image regions of the training ground-based cloud image.
Further, step S3 includes the following steps:
Step S31: obtain the set of shallow convolution activation maps corresponding to a preset shallow convolutional layer; this set can be expressed as a tensor of size Hs×Ws×Ns, where the subscript s denotes the index of the shallow layer, Hs and Ws denote the height and width of that layer's convolution activation maps, and Ns denotes the number of convolution activation maps in that layer.
The shallow convolutional layer may be selected from the convolutional layers in the first half of the convolutional neural network.
In an embodiment of the present invention, Hs×Ws×Ns = 224×224×64.
Step S32: concatenate the activation responses at each identical position across all convolution activation maps of the shallow layer to obtain an Ns-dimensional local feature vector.
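Concretely, collecting the activation responses at the same spatial position across all Ns shallow maps (step S32) amounts to a reshape of the activation tensor. A minimal numpy sketch, using the embodiment's sizes (224×224×64) with random values as a stand-in for real activations:

```python
import numpy as np

# Step S32 as a reshape: the Ns-dimensional local feature vector at
# spatial position (i, j) is simply A[i, j, :] across all Ns maps.
A = np.random.rand(224, 224, 64)           # Hs x Ws x Ns, as in the embodiment
local_vectors = A.reshape(-1, A.shape[2])  # one row per position: (Hs*Ws) x Ns
print(local_vectors.shape)                 # (50176, 64)
```

Row i*Ws + j of `local_vectors` is exactly the local feature vector at position (i, j).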
Step S33: densely sample all convolution activation maps of the shallow layer with a sliding window, and compute the activation response saliency value Sk of each sliding window from the local feature vectors, where the subscript k denotes the k-th sliding window.
The size of the sliding window is a×a, and the stride of the dense sampling is a/2.
The activation response saliency value Sk of the k-th sliding window on the shallow convolution activation maps is expressed as:
Sk = ‖mk‖2,
where ‖·‖2 denotes the 2-norm of a vector, xi denotes the local feature vector at the i-th position, a² = a×a denotes the number of local feature vectors within the k-th sliding window, and mk denotes the mean feature vector of the k-th sliding window, i.e., the average of all local feature vectors within the window:
mk = (1/a²) Σ_{i=1..a²} xi.
Note that mk is also referred to as the salient feature vector.
In an embodiment of the present invention, a×a = 12×12 and the stride is 6.
Step S34: sort the saliency values Sk in descending order, and select the sliding windows corresponding to the top K values as salient local image regions, obtaining K salient local image regions of the training ground-based cloud image.
In an embodiment of the present invention, K is set to 200.
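Steps S33–S34 can be sketched in numpy as follows. Since the exact saliency formula is an equation image missing from the text, this sketch assumes Sk is the 2-norm of the window's mean feature vector mk; the window size a=12, stride a/2=6, and K=200 follow the embodiment:

```python
import numpy as np

def top_salient_windows(A, a=12, stride=6, K=200):
    """Densely sample a x a windows over shallow activation maps A
    (shape Hs x Ws x Ns) and keep the K windows with the largest
    activation-response saliency S_k.  Assumption: S_k = ||m_k||_2,
    the 2-norm of the window's mean local feature vector m_k."""
    H, W, N = A.shape
    windows = []
    for top in range(0, H - a + 1, stride):
        for left in range(0, W - a + 1, stride):
            patch = A[top:top + a, left:left + a, :].reshape(-1, N)
            m_k = patch.mean(axis=0)          # mean feature vector of the window
            S_k = np.linalg.norm(m_k)         # assumed saliency of the window
            windows.append((S_k, m_k))
    windows.sort(key=lambda t: -t[0])         # descending by S_k
    return windows[:K]

A = np.random.rand(224, 224, 64)
selected = top_salient_windows(A)
print(len(selected))                          # 200
```

Each selected window contributes its mean vector mk as the salient feature vector of step S4.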
Step S4: extract features from each salient local image region to obtain the corresponding salient feature vector.
Each salient local image region is represented by its salient feature vector mk.
Step S5: use the shallow salient local image regions to obtain the corresponding image regions on the deep convolution activation maps, and learn the weights corresponding to those image regions.
Further, step S5 includes the following steps:
Step S51: obtain the set of deep convolution activation maps corresponding to a preset deep convolutional layer; this set can be expressed as a tensor of size Hd×Wd×Nd, where the subscript d denotes the index of the deep layer, Hd and Wd denote the height and width of that layer's convolution activation maps, and Nd denotes the number of convolution activation maps in that layer.
The deep convolutional layer may be selected from the convolutional layers in the second half of the convolutional neural network.
In an embodiment of the present invention, Hd×Wd×Nd = 56×56×256.
Step S52: concatenate the activation responses at each identical position across all convolution activation maps of the deep layer to obtain an Nd-dimensional local feature vector.
Step S53: according to the salient local image regions of the shallow convolutional layer, obtain the K corresponding image regions of size b×b on the deep convolution activation maps.
In an embodiment of the present invention, b×b = 3×3.
Step S54: compute the weight corresponding to each image region, expressed as:
wk = (1/b²) Σ_{j=1..b²} ‖yj‖2,
where wk denotes the weight of the k-th image region, yj denotes the local feature vector at the j-th position, and b² = b×b denotes the number of local feature vectors in the k-th image region.
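A numpy sketch of step S54. The formula itself is an equation image not reproduced in the text, so this assumes the weight is the average 2-norm of the region's local feature vectors, which is consistent with the symbols the step lists (the local feature vectors yj and the count b²):

```python
import numpy as np

def region_weight(R):
    """Weight w_k of a b x b deep image region R (shape b x b x Nd),
    assumed here to be the mean 2-norm of its b*b local feature
    vectors y_j (i.e. the average deep activation magnitude)."""
    b, _, Nd = R.shape
    y = R.reshape(-1, Nd)                     # the b*b local feature vectors y_j
    return np.linalg.norm(y, axis=1).mean()   # average 2-norm over the region

R = np.random.rand(3, 3, 256)                 # b=3, Nd=256 as in the embodiment
print(region_weight(R) > 0)                   # True
```

Regions with strong deep (high-semantic) responses thus receive larger weights.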
Step S6: based on the salient feature vectors and weights, obtain the set of weighted salient feature vectors corresponding to the training ground-based cloud image.
From the salient feature vectors mk of the K salient local image regions on the shallow convolution activation maps and the weights wk of the K image regions on the deep convolution activation maps, the set χ of weighted salient feature vectors of each training ground-based cloud image is obtained:
χ = {w1m1, w2m2, ..., wKmK}.
Step S7: perform salient dual activation coding on the set of weighted salient feature vectors to obtain the weighted salient feature vector corresponding to the training ground-based cloud image.
Further, step S7 includes the following steps:
Step S71: learn a feature vector h as the final representation of the image (i.e., the weighted salient feature vector), based on the constraints:
(wkmk)^T h = c, (k = 1, 2, ..., K),
where c denotes a constant.
The objective can then be written in matrix form as:
(u⊙m)^T h = C,
where ⊙ denotes element-wise multiplication, u⊙m stacks the weighted salient feature vectors wkmk, and C is a constant vector whose elements all equal c.
In an embodiment of the present invention, c is set to 1.
Step S72: to obtain the optimal h, solve:
min_h ‖(u⊙m)^T h − C‖2,
using the pseudo-inverse, which yields the minimum-norm solution, i.e., the optimal h:
h = (u⊙m)((u⊙m)^T(u⊙m))^(-1) C.
Step S8: following steps S1–S7, obtain the weighted salient feature vector of each test ground-based cloud image, classify the test ground-based cloud images based on these vectors, and obtain the ground-based cloud image recognition results.
In an embodiment of the present invention, a nearest-neighbor classifier is used to classify the test ground-based cloud images based on their weighted salient feature vectors.
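The embodiment's nearest-neighbor classifier can be sketched as follows (the two-class toy features and cloud-type names are purely illustrative):

```python
import numpy as np

def nearest_neighbor(h_test, H_train, labels):
    """Assign to the test weighted salient feature vector h_test the
    label of the Euclidean-closest training feature vector."""
    d = np.linalg.norm(H_train - h_test, axis=1)  # distances to all training h
    return labels[int(np.argmin(d))]

H_train = np.array([[0.0, 0.0], [10.0, 10.0]])    # toy training features
labels = ["cumulus", "cirrus"]                     # illustrative class names
print(nearest_neighbor(np.array([1.0, 1.0]), H_train, labels))  # cumulus
```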
Taking the ground-based cloud image database collected by the Chinese Academy of Meteorological Sciences as an example, the recognition accuracy is 91.24%, which demonstrates the effectiveness of the method of the present invention.
It should be understood that the above specific embodiments of the present invention are only used to exemplify or explain the principle of the present invention and do not limit the present invention. Therefore, any modifications, equivalent substitutions, improvements, and the like made without departing from the spirit and scope of the present invention shall fall within the protection scope of the present invention. Furthermore, the appended claims of the present invention are intended to cover all changes and modifications that fall within the scope and boundaries of the appended claims, or the equivalents of such scope and boundaries.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810433104.XA CN108805029B (en) | 2018-05-08 | 2018-05-08 | A ground-based cloud image recognition method based on significant dual activation coding |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108805029A CN108805029A (en) | 2018-11-13 |
CN108805029B true CN108805029B (en) | 2021-08-24 |
Family
ID=64091994
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111310820A (en) * | 2020-02-11 | 2020-06-19 | 山西大学 | Ground-based meteorological cloud image classification method based on cross-validation deep CNN feature integration |
CN112232297B (en) * | 2020-11-09 | 2023-08-22 | 北京理工大学 | Remote Sensing Image Scene Classification Method Based on Deep Joint Convolution Activation |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106295542A (en) * | 2016-08-03 | 2017-01-04 | 江苏大学 | A kind of road target extracting method of based on significance in night vision infrared image |
CN107274419A (en) * | 2017-07-10 | 2017-10-20 | 北京工业大学 | A kind of deep learning conspicuousness detection method based on global priori and local context |
CN107784308A (en) * | 2017-10-09 | 2018-03-09 | 哈尔滨工业大学 | Conspicuousness object detection method based on the multiple dimensioned full convolutional network of chain type |
Non-Patent Citations (4)
Title |
---|
Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection; Pingping Zhang; 2017 IEEE International Conference on Computer Vision; 2017-12-25; pp. 1-10 * |
Deep Convolutional Activations-Based Features for Ground-Based Cloud Classification; Cunzhao Shi et al.; IEEE Geoscience and Remote Sensing Letters; 2017-03-31; pp. 816-820 * |
Multi-region cross-weighted aggregation of deep convolutional features for image retrieval; Dong Rongsheng et al.; Journal of Computer-Aided Design & Computer Graphics; 2018-04-30; pp. 658-665 * |
High-resolution image scene classification combining saliency and multi-layer convolutional neural networks; He Xiaofei et al.; Acta Geodaetica et Cartographica Sinica; 2016-09-30; pp. 1073-1080 * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||