CN113269805B - Rainfall event guided remote sensing rainfall inversion training sample self-adaptive selection method - Google Patents

Rainfall event guided remote sensing rainfall inversion training sample self-adaptive selection method Download PDF

Info

Publication number
CN113269805B
CN113269805B CN202110655233.5A CN202110655233A CN113269805B CN 113269805 B CN113269805 B CN 113269805B CN 202110655233 A CN202110655233 A CN 202110655233A CN 113269805 B CN113269805 B CN 113269805B
Authority
CN
China
Prior art keywords
precipitation
connected domain
samples
pixels
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110655233.5A
Other languages
Chinese (zh)
Other versions
CN113269805A (en
Inventor
马自强
朱思宇
张玉浩
洪阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University filed Critical Peking University
Priority to CN202110655233.5A priority Critical patent/CN113269805B/en
Publication of CN113269805A publication Critical patent/CN113269805A/en
Application granted granted Critical
Publication of CN113269805B publication Critical patent/CN113269805B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/187Segmentation; Edge detection involving region growing; involving region merging; involving connected component labelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/136Segmentation; Edge detection involving thresholding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

本发明涉及一种降水事件引导的遥感降水反演训练样本自适应选取方法。所述方法包括:获取降水二值图,采用分类算法对降水区域进行连通域划分得到多个降水连通域;对于任意一个降水连通域,根据降水连通域的像素数、降水区域的像素数和预设降水样本数量得到降水连通域的取样数量;根据降水连通域的取样数量和降水连通域的像素数对降水连通域的像素进行取样,得到降水连通域的降水样本;对非降水区域的像素进行随机取样得到非降水样本;确定所有降水连通域的降水样本和非降水样本为降水反演训练样本。本发明能够同时满足训练样本的代表性和反演模型的普适性,以提高反演模型的反演精度。

Figure 202110655233

The invention relates to an adaptive selection method of remote sensing precipitation inversion training samples guided by precipitation events. The method includes: obtaining a precipitation binary map, and using a classification algorithm to divide the precipitation area into a connected domain to obtain a plurality of precipitation connected domains; for any precipitation connected domain, according to the number of pixels in the precipitation connected domain, the number of pixels in the precipitation area, and the prediction of the precipitation connected domain. Set the number of precipitation samples to obtain the sampling number of the precipitation connected area; sample the pixels of the precipitation connected area according to the sampling number of the precipitation connected area and the number of pixels of the precipitation connected area to obtain the precipitation samples of the precipitation connected area; Random sampling is used to obtain non-precipitation samples; precipitation samples and non-precipitation samples in all precipitation connected regions are determined as training samples for precipitation inversion. The invention can satisfy the representativeness of the training samples and the universality of the inversion model at the same time, so as to improve the inversion precision of the inversion model.

Figure 202110655233

Description

降水事件引导的遥感降水反演训练样本自适应选取方法An adaptive selection method of training samples for remote sensing precipitation inversion guided by precipitation events

技术领域technical field

本发明涉及降水反演领域,特别是涉及一种降水事件引导的遥感降水反演训练样本自适应选取方法及系统。The invention relates to the field of precipitation inversion, in particular to a method and system for adaptive selection of training samples for remote sensing precipitation inversion guided by precipitation events.

背景技术Background technique

降水在水文学、气象学、生态学以及农业研究等领域担任了重要角色,特别是全球尺度物质能量交换主要驱动力之一。因此利用卫星数据进行降水反演来获取高精度和高时空分辨率的降水数据是十分重要的。降水反演是利用卫星观测数据进行降水值估测的数学过程,该方法常采用模型法,如拟合公式求参,机器学习等。而无论是拟合公式求参还是机器学习,模型法均需要输入实测降水值和卫星所观测的特征值(常为云顶亮温)作为训练数据,通过一定的方法确定了参数之后,训练好的模型则可用来进行降水反演,即输入新观测值,可得到反演的估计值。那么该过程中训练样本的选择对于最终的反演结果则有十分直接的影响,即样本选择的策略能够觉得模型最终的训练结果。Precipitation plays an important role in the fields of hydrology, meteorology, ecology, and agricultural research, and is especially one of the main drivers of global-scale material and energy exchange. Therefore, it is very important to use satellite data for precipitation inversion to obtain precipitation data with high precision and high spatial and temporal resolution. Precipitation inversion is a mathematical process of estimating precipitation values using satellite observation data. This method often uses model methods, such as fitting formulas for parameters, machine learning, etc. Regardless of whether it is a fitting formula for parameters or machine learning, the model method needs to input the measured precipitation value and the eigenvalue (often cloud top brightness temperature) observed by the satellite as the training data. After the parameters are determined by a certain method, the trained Models can then be used for precipitation inversion, ie inputting new observations to obtain inversion estimates. Then the selection of training samples in this process has a very direct impact on the final inversion result, that is, the strategy of sample selection can feel the final training result of the model.

优秀的样本选择策略应该同时满足样本具有代表性和模型具有普适性的特点。所谓代表性,指的是样本选择过程中可以覆盖到各种特殊的情况,并且各类情况均有足够数量的样本,这样模型在训练过程中才可以不缺少有效信息。而普适性,意为模型对于不同的或者新的输入均有较高的稳定性,而非局限于样本集所提供的种类和情况中。An excellent sample selection strategy should satisfy both the representativeness of the sample and the universality of the model. The so-called representativeness means that various special situations can be covered in the sample selection process, and each type of situation has a sufficient number of samples, so that the model can not lack effective information during the training process. Universality means that the model has high stability for different or new inputs, rather than being limited to the types and situations provided by the sample set.

现有的样本选择策略主要包括三种大类型:随机选取法、数值分布控制法以及基于相似度(一致性)判断的样本选择法。The existing sample selection strategies mainly include three types: random selection method, numerical distribution control method and sample selection method based on similarity (consistency) judgment.

随机选取法为对于所有样本,根据一定的随机数原则,或者在打乱顺序的情况下根据一定的间隔步长进行选取,保证样本选取的随机性。随机选取法方法简单,易于实现,选择的样本在数量足够的情况下,具有一定的普适性,即通过选择出来样本进行模型训练,所得到的训练模型能够应对大部分情况,能够识别大部分特征。但缺乏一定的代表性,因为随机性选取往往会选取大量重复特征样本,或者无效特征样本,导致样本集在相同数量的样本下,对整体事件的代表能力大大降低,产生较大的数据冗余。The random selection method is to select all samples according to a certain random number principle, or according to a certain interval step size in the case of shuffling the order, to ensure the randomness of sample selection. The random selection method is simple and easy to implement. The selected samples have certain universality when the number of selected samples is sufficient. That is, by selecting samples for model training, the obtained training model can cope with most situations and can identify most of the feature. However, it lacks a certain degree of representativeness, because random selection often selects a large number of duplicate feature samples or invalid feature samples, resulting in a sample set with the same number of samples, the ability to represent the overall event is greatly reduced, resulting in greater data redundancy. .

数值分布控制法为在整体事件集中根据数值分布规律,通过划分数值区域的方式,在每一个数值区域中选取一定数量的样本加入样本集。这样选取的样本集能够有较强的代表性,由于样本来自不同的规定数值区间,这样对每一个数值区间均有一定样本数量的保证,以此使得样本集能够代表大多数出现的情况。但由于数值区间的划分和名额的分配是先验的,所以此为有监督的样本选取,人为因素不可避免地加入样本集中,则会降低一定的普适性,导致最终训练好的模型仅能反映人为考虑到的样本情况。The numerical distribution control method is to select a certain number of samples in each numerical area and add them to the sample set by dividing the numerical area according to the numerical distribution law in the overall event set. The sample set selected in this way can have a strong representativeness. Since the samples come from different specified numerical ranges, there is a guarantee of a certain number of samples for each numerical range, so that the sample set can represent most of the occurrences. However, since the division of the numerical range and the allocation of the quota are a priori, this is a supervised sample selection, and human factors are inevitably added to the sample set, which will reduce a certain degree of universality, resulting in the final trained model can only be Reflects the artificially considered sample situation.

基于相似度(一致性)判断的样本选择法是一类更加复杂的方法,这类方法通常通过样本间相似度判断,然后通过不断迭代的方法去精进样本集的分类,使得最终的样本间有较高的一致性。这类方法较为复杂,对于相似度的定义也有多种,迭代方法更是有较多备选。但总而言之,其能够增加样本的代表性,即能够通过数量较少的样本集,代表整体事件的出现情况和特征,大大降低无效样本和重复样本的比例。但是由于对于不一致样本的剔除,会导致针对复杂情况样本的丢失,模型训练过程中容易过拟合,最终导致模型的普适性降低。The sample selection method based on similarity (consistency) judgment is a more complex method. This kind of method usually judges the similarity between samples, and then refines the classification of the sample set through continuous iterative methods, so that the final samples have higher consistency. This type of method is more complicated, and there are many definitions of similarity, and there are more options for iterative methods. But all in all, it can increase the representativeness of the sample, that is, it can represent the occurrence and characteristics of the overall event through a smaller number of sample sets, and greatly reduce the proportion of invalid samples and duplicate samples. However, due to the elimination of inconsistent samples, it will lead to the loss of samples for complex situations, and it is easy to overfit in the process of model training, which ultimately leads to a decrease in the universality of the model.

综上所述,现有的样本选择策略无法同时满足样本具有代表性和模型具有普适性。To sum up, the existing sample selection strategies cannot satisfy the representativeness of the sample and the universality of the model at the same time.

发明内容SUMMARY OF THE INVENTION

本发明的目的是提供一种降水事件引导的遥感降水反演训练样本自适应选取方法,能够同时满足训练样本的代表性和反演模型的普适性,以提高反演模型的反演精度。The purpose of the present invention is to provide an adaptive selection method of training samples for remote sensing precipitation inversion guided by precipitation events, which can satisfy the representativeness of training samples and the universality of inversion models at the same time, so as to improve the inversion accuracy of inversion models.

为实现上述目的,本发明提供了如下方案:For achieving the above object, the present invention provides the following scheme:

一种降水事件引导的遥感降水反演训练样本自适应选取方法,包括:An adaptive selection method of training samples for remote sensing precipitation inversion guided by precipitation events, comprising:

获取降水二值图,所述降水二值图包括降水区域和非降水区域;obtaining a precipitation binary map, where the precipitation binary map includes a precipitation area and a non-precipitation area;

采用分类算法对所述降水区域进行连通域划分得到多个降水连通域;The precipitation area is divided into connected domains by using a classification algorithm to obtain multiple precipitation connected domains;

对于任意一个降水连通域,根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量;For any precipitation connected domain, obtain the sampling number of the precipitation connected domain according to the number of pixels of the precipitation connected domain, the pixel number of the precipitation area and the preset number of precipitation samples;

根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本;sampling the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain to obtain the precipitation sample of the precipitation connected domain;

对所述非降水区域的像素进行随机取样得到非降水样本;random sampling of pixels in the non-precipitation area to obtain non-precipitation samples;

确定所有降水连通域的降水样本和所述非降水样本为降水反演训练样本。Determine the precipitation samples of all precipitation connected domains and the non-precipitation samples as training samples of precipitation retrieval.

可选的,所述获取降水二值图,具体包括:Optionally, the obtaining a binary image of precipitation specifically includes:

获取降水情况图,所述降水情况图的像素值为降水量,所述降水情况图的像素位置为像素对应的地理位置;obtaining a precipitation situation map, where the pixel value of the precipitation situation map is the amount of precipitation, and the pixel position of the precipitation situation map is the geographic location corresponding to the pixel;

对所述降水情况图进行阈值分割得到降水二值图。Threshold segmentation is performed on the precipitation situation map to obtain a precipitation binary map.

可选的,所述在所述采用分类算法对所述降水区域进行连通域划分得到多个降水连通域之前还包括:Optionally, before the use of a classification algorithm to divide the precipitation region into connected domains to obtain multiple precipitation connected domains, the method further includes:

对所述降水二值图进行图像掩膜处理得到处理后的降水区域。Perform image mask processing on the precipitation binary image to obtain a processed precipitation area.

可选的,所述根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量,具体为:Optionally, the sampling number of the precipitation connected region is obtained according to the number of pixels of the precipitation connected region, the number of pixels of the precipitation region, and the preset number of precipitation samples, specifically:

根据公式Mi=(Ni/Ntotal)*Mtotal计算第i个降水连通域的取样数量,其中,Mi为第i个降水连通域的取样数量,Mtotal为预设降水样本数量;Ni为第i个降水连通域的像素数,Ntotal为降水区域的像素数。Calculate the sampling quantity of the ith precipitation connected domain according to the formula Mi = (N i / N total )*M total , where Mi is the sampling quantity of the ith precipitation connected domain, and M total is the preset precipitation sample quantity; Ni is the number of pixels in the ith precipitation connected domain, and N total is the number of pixels in the precipitation area.

可选的,所述根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本,具体为:Optionally, sampling the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain to obtain the precipitation sample of the precipitation connected domain, specifically:

根据公式Si={Pk|k=[(t-0.5)/Mi*Ni],t∈N+,1≤t≤Mi}得到第i个降水连通域的降水样本,其中Si为第i个降水连通域的降水样本,Pk为第i个降水连通域的降水样本中第k个降水像素,t为降水连通域的样本序号,Mi为第i个降水连通域的取样数量,Ni为第i个降水连通域的像素数。According to the formula S i ={P k |k=[(t-0.5)/M i *N i ], t∈N + , 1≤t≤M i }, the precipitation samples of the i-th precipitation connected domain are obtained, where S i is the precipitation sample of the i-th precipitation connected domain, P k is the k-th precipitation pixel in the precipitation sample of the i -th precipitation connected domain, t is the sample number of the precipitation Number of samples, Ni is the number of pixels in the i -th precipitation connected domain.

可选的,所述对所述非降水区域的像素进行随机取样得到非降水样本,具体包括:Optionally, the random sampling of pixels in the non-precipitation area to obtain non-precipitation samples specifically includes:

根据所述非降水区域中各像素的位置对所述非降水区域中各像素进行标号得到所述非降水区域各像素的像素序号;Labeling each pixel in the non-precipitation area according to the position of each pixel in the non-precipitation area to obtain the pixel serial number of each pixel in the non-precipitation area;

根据预设非降水样本数量和各像素的像素序号对所述非降水区域的像素进行随机取样得到非降水样本。The non-precipitation samples are obtained by randomly sampling the pixels in the non-precipitation area according to the preset number of non-precipitation samples and the pixel serial number of each pixel.

一种降水事件引导的遥感降水反演训练样本自适应选取系统,包括:An adaptive selection system for remote sensing precipitation retrieval training samples guided by precipitation events, comprising:

获取模块,用于获取降水二值图,所述降水二值图包括降水区域和非降水区域;an obtaining module, configured to obtain a precipitation binary image, wherein the precipitation binary image includes a precipitation area and a non-precipitation area;

降水连通域确定模块,用于采用分类算法对所述降水区域进行连通域划分得到多个降水连通域;A precipitation connected domain determining module, used for dividing the precipitation region into a connected domain by using a classification algorithm to obtain a plurality of precipitation connected domains;

数量确定模块,用于对于任意一个降水连通域,根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量;a quantity determination module, configured to obtain the sampling quantity of the precipitation connected domain according to the pixel count of the precipitation connected domain, the pixel count of the precipitation area and the preset precipitation sample quantity for any precipitation connected domain;

降水样本确定模块,用于根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本;a precipitation sample determination module, configured to sample the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain to obtain the precipitation sample of the precipitation connected domain;

非降水样本确定模块,用于对所述非降水区域的像素进行随机取样得到非降水样本;a non-precipitation sample determination module, configured to randomly sample the pixels in the non-precipitation area to obtain non-precipitation samples;

反演训练样本确定模块,用于确定所有降水连通域的降水样本和所述非降水样本为降水反演训练样本。The inversion training sample determination module is used to determine the precipitation samples of all precipitation connected domains and the non-precipitation samples as precipitation inversion training samples.

可选的,所述获取模块包括:Optionally, the obtaining module includes:

获取单元,用于获取降水情况图,所述降水情况图的像素值为降水量,所述降水情况图的像素位置为像素对应的地理位置;an acquiring unit, configured to acquire a precipitation situation map, where the pixel value of the precipitation situation map is the amount of precipitation, and the pixel position of the precipitation situation map is the geographic location corresponding to the pixel;

二值图确定单元,用于对所述降水情况图进行阈值分割得到降水二值图。The binary map determining unit is configured to perform threshold segmentation on the precipitation situation map to obtain a precipitation binary map.

可选的,所述降水事件引导的遥感降水反演训练样本自适应选取系统还包括:处理模块,用于对所述降水二值图进行图像掩膜处理得到处理后的降水区域。Optionally, the precipitation event-guided remote sensing precipitation inversion training sample adaptive selection system further includes: a processing module configured to perform image mask processing on the precipitation binary image to obtain a processed precipitation area.

可选的,所述非降水样本确定模块包括:Optionally, the non-precipitation sample determination module includes:

序号确定单元,用于根据所述非降水区域中各像素的位置对所述非降水区域中各像素进行标号得到所述非降水区域各像素的像素序号;a serial number determining unit, configured to label each pixel in the non-precipitation area according to the position of each pixel in the non-precipitation area to obtain the pixel serial number of each pixel in the non-precipitation area;

非降水样本确定单元,用于根据预设非降水样本数量和各像素的像素序号对所述非降水区域的像素进行随机取样得到非降水样本。A non-precipitation sample determination unit, configured to randomly sample the pixels in the non-precipitation area according to the preset number of non-precipitation samples and the pixel serial number of each pixel to obtain non-precipitation samples.

根据本发明提供的具体实施例,本发明公开了以下技术效果:本发明根据各降水连通域的像素数、降水区域的像素数和预设降水样本数量得到各降水连通域的取样数量,使得各降水连通域均可以被采样,并且由于覆盖到每个降水连通域,使得提取的样本数量无大量重复和冗余,能够大大弥补随机选取法代表性不足的问题,而且根据降水连通域的取样数量和像素数对降水连通域的像素进行取样,使得样本选择不会过于集中也有足够的变异性,能够有效代表不同情况、不同特征的降水,以此来提高根据其训练出的模型的普适性避免过拟合。According to the specific embodiment provided by the present invention, the present invention discloses the following technical effects: the present invention obtains the sampling number of each precipitation connected region according to the number of pixels in each precipitation connected region, the number of pixels in the precipitation region and the preset number of precipitation samples, so that each All precipitation connected domains can be sampled, and because each precipitation connected domain is covered, the number of samples extracted does not have a lot of repetition and redundancy, which can greatly compensate for the lack of representativeness of the random selection method. and the number of pixels to sample the pixels of the precipitation connected domain, so that the sample selection will not be too concentrated and have sufficient variability, which can effectively represent the precipitation of different situations and different characteristics, so as to improve the universality of the model trained according to it. Avoid overfitting.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the accompanying drawings required in the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some of the present invention. In the embodiments, for those of ordinary skill in the art, other drawings can also be obtained according to these drawings without creative labor.

图1为本发明实施例提供的降水事件引导的遥感降水反演训练样本自适应选取方法的流程图;1 is a flowchart of a method for adaptively selecting training samples for remote sensing precipitation inversion guided by a precipitation event provided by an embodiment of the present invention;

图2为应用降水事件引导的遥感降水反演训练样本自适应选取方法进行实验的具体流程图;Fig. 2 is the specific flow chart of the experiment using the method of self-adaptive selection of training samples for remote sensing precipitation inversion guided by precipitation events;

图3为本发明中不同算法结果的对比图,图3(a)为随机平均样本取样法的取样结果图,图3(b)为采用本发明提出的基于降水事件的样本自适应选取方法的取样结果图;Fig. 3 is a comparison diagram of the results of different algorithms in the present invention, Fig. 3 (a) is a sampling result diagram of the random average sample sampling method, Fig. 3 (b) is a sample adaptive selection method based on precipitation events proposed by the present invention. Sampling result graph;

图4为本发明实施例提供的降水事件引导的遥感降水反演训练样本自适应选取系统的框图;4 is a block diagram of an adaptive selection system for remote sensing precipitation inversion training samples guided by precipitation events provided by an embodiment of the present invention;

图5为本发明实施例提供的更为具体的降水事件引导的遥感降水反演训练样本自适应选取方法的流程图。5 is a flowchart of a more specific method for adaptively selecting training samples for remote sensing precipitation inversion guided by a precipitation event provided by an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

为使本发明的上述目的、特征和优点能够更加明显易懂,下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above objects, features and advantages of the present invention more clearly understood, the present invention will be described in further detail below with reference to the accompanying drawings and specific embodiments.

目前的训练样本选取方法主要有随机法、数值分布控制法以及基于相似度(一致性)判断的样本选择法。不同样本集选取下模型的代表性和普适性各有侧重,其特点罗列如下:The current training sample selection methods mainly include random method, numerical distribution control method and sample selection method based on similarity (consistency) judgment. The representativeness and universality of the models selected from different sample sets have their own emphasis, and their characteristics are listed as follows:

不同样本选择策略下模型的代表性由强至弱为:基于相似度判断的样本选择法、数值分布控制法、随机选择法;而不同样本选择策略下模型的普适性由强至弱为:随机选择法、数值分布控制法、基于相似度判断的样本选择法;而由于先验设定导致的人为干扰对样本集的影响程度由强至弱为:数值分布控制法、基于相似度判断的样本选择法、随机选择法。The representativeness of the models under different sample selection strategies is from strong to weak: the sample selection method based on similarity judgment, the numerical distribution control method, and the random selection method; while the universality of the models under different sample selection strategies is from strong to weak: Random selection method, numerical distribution control method, and sample selection method based on similarity judgment; and the degree of influence of human interference on the sample set due to a priori setting is from strong to weak: numerical distribution control method, similarity judgment-based method Sample selection method, random selection method.

其中随机选择法优点是实现简单,但是缺乏对正样本的代表性;数值分布控制法能有效提升对正样本的代表性,但其中的监督性大大增强,其结果很容易受到先验设定参数的影响;而基于相似度判断的样本选择方法能够显著提升样本集的集内一致性,但该过程需要不断迭代,计算过程较慢,同时由于集内一致性的提高,导致过拟合的可能性显著增强,而过拟合则会显著降低训练模型的普适性,可能导致模型无法使用。代表性强,普适性弱的模型对于样本集中代表的情况模拟较好,但是对于样本集中未出现或者出现较少的情况,会出现奇异值结果,对于整体模拟结果影响较大。The advantage of the random selection method is that it is simple to implement, but it lacks the representativeness of the positive samples; the numerical distribution control method can effectively improve the representativeness of the positive samples, but the supervision is greatly enhanced, and the results are easily affected by the a priori setting parameters. The sample selection method based on similarity judgment can significantly improve the intra-set consistency of the sample set, but the process requires constant iteration, the calculation process is slow, and at the same time, due to the improvement of the intra-set consistency, it may lead to overfitting. The performance is significantly enhanced, while overfitting can significantly reduce the generalizability of the trained model, possibly rendering the model unusable. Models with strong representativeness and weak universality can simulate the situation represented in the sample set better, but for the situation that does not appear in the sample set or occurs less frequently, singular value results will appear, which has a greater impact on the overall simulation results.

本实施例提供的样本确定方法根据实际的降水事件,将样本选取名额分配至各个降水事件中,最终生成具有较高代表性和普适性的降水样本集,供降水反演模型训练和学习,本实施例克服已有样本选择方法存在的明显缺陷,基于物理事实和现象选择合适的样本,最终使得该样本集具有较高的代表性和普适性,同时摆脱对于先验参数的严重依赖,如图1所示,本实施例提供的降水事件引导的遥感降水反演训练样本自适应选取方法过程如下:The sample determination method provided in this embodiment allocates sample selection quotas to each precipitation event according to actual precipitation events, and finally generates a relatively representative and universal precipitation sample set for training and learning of the precipitation inversion model. This embodiment overcomes the obvious defects of existing sample selection methods, selects suitable samples based on physical facts and phenomena, and finally makes the sample set highly representative and universal, and at the same time gets rid of the heavy dependence on prior parameters, As shown in FIG. 1 , the process of the method for adaptively selecting training samples for remote sensing precipitation inversion guided by precipitation events provided in this embodiment is as follows:

步骤101:获取降水二值图。所述降水二值图包括降水区域和非降水区域。Step 101: Obtain a binary image of precipitation. The precipitation binary map includes a precipitation area and a non-precipitation area.

步骤102:采用分类算法对所述降水区域进行连通域划分得到多个降水连通域。Step 102: Use a classification algorithm to divide the precipitation region into connected domains to obtain a plurality of precipitation connected domains.

步骤103:对于任意一个降水连通域,根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量。Step 103: For any precipitation connected domain, obtain the sampling quantity of the precipitation connected domain according to the pixel count of the precipitation connected domain, the pixel count of the precipitation region, and the preset precipitation sample quantity.

步骤104:根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本。Step 104: Sampling the pixels of the precipitation connected region according to the sampling quantity of the precipitation connected region and the number of pixels of the precipitation connected region to obtain a precipitation sample of the precipitation connected region.

步骤105:对所述非降水区域的像素进行随机取样得到非降水样本。对于非降水区域,由于其往往面积较大并且连接成片,所以其方法较为简单,仅需要根据像素位置序号平均选取即可。Step 105: Randomly sample the pixels in the non-precipitation area to obtain non-precipitation samples. For non-precipitation areas, because they are often large in area and connected into pieces, the method is relatively simple, and only needs to be averaged according to the pixel position sequence number.

步骤106:确定所有降水连通域的降水样本和所述非降水样本为降水反演训练样本。Step 106: Determine the precipitation samples and the non-precipitation samples in all precipitation connected domains as training samples for precipitation inversion.

输入的降水情况图需要进行阈值分割以产生降水事件的分布图(降水二值图),因为降水预测往往存在大量接近0的小估计值,这类估计值是否需要被判断成为降水事件需要通过阈值分割来实现。在实际应用中,步骤101具体包括:The input precipitation situation map needs to be thresholded to generate the distribution map of precipitation events (precipitation binary map), because precipitation prediction often has a large number of small estimated values close to 0, whether such estimated values need to be judged as precipitation events need to pass the threshold value split to achieve. In practical applications, step 101 specifically includes:

获取降水情况图,该图为栅格格式,所述降水情况图的像素值为降水量,所述降水情况图的像素位置为像素对应的地理位置。A precipitation situation map is obtained, which is in a raster format, the pixel value of the precipitation situation map is the amount of precipitation, and the pixel position of the precipitation situation map is the geographic location corresponding to the pixel.

对所述降水情况图进行阈值分割得到降水二值图。此处本方法采用国际上较为通用的值(0.1mm/hour)作为阈值,对降水事件进行判断:即大于等于该阈值像素的标记为1,判断为降水,相似的,小于该阈值的像素标记为0,判断为无降水。Threshold segmentation is performed on the precipitation situation map to obtain a precipitation binary map. Here, this method uses the internationally common value (0.1mm/hour) as the threshold value to judge the precipitation event: that is, if the pixel is greater than or equal to the threshold value is marked as 1, it is judged to be precipitation, and similarly, the pixel mark less than the threshold value is marked. If it is 0, it is judged that there is no precipitation.

在实际应用中,所述采用分类算法对所述降水区域进行连通域划分得到多个降水连通域是基于连通域原理,即相邻相近的像素会被判断为同一个连通域下面的子集,这样提取出来的相邻连通域中不会出现像素独立的情况,采用的邻近判断为4-邻近判断,即当某一个像素处于另一个像素的正上,正下,正左,或正右的情况下,这两个像素才会被判断为相邻。因此通过这种方式,一张降水分布图将会被分割为多个连通域,每一个连通域则标志着其为一次地理连续的降水事件。In practical applications, the use of a classification algorithm to divide the precipitation area into a connected domain to obtain multiple precipitation connected domains is based on the connected domain principle, that is, adjacent and similar pixels will be judged as a subset under the same connected domain, There is no pixel independence in the adjacent connected domains extracted in this way, and the adjacent judgment used is 4-proximity judgment, that is, when a pixel is directly above, directly below, directly left, or directly right of another pixel In this case, these two pixels will be judged as adjacent. Therefore, in this way, a precipitation distribution map will be divided into multiple connected domains, and each connected domain will mark it as a geographically continuous precipitation event.

图像的膨胀与腐蚀是图形学操作中一个基本步骤。在实际应用中,在所述采用分类算法对所述降水区域进行连通域划分得到多个降水连通域之前还包括:Dilation and erosion of images is a fundamental step in graphics operations. In practical applications, before the use of a classification algorithm to divide the precipitation region into connected domains to obtain multiple precipitation connected domains, the method further includes:

对所述降水二值图进行图像掩膜处理得到处理后的降水区域,最终能够使得连通域的范围扩大或缩小。此处图形膨胀或腐蚀的阈值选择为4连通域,同时仅操作一次,先腐蚀后膨胀,该步骤的目的是为了减少单个降水像素的影响,减少随机噪声的影响,使得整个算法专注于规模较大的降水事件上。Image mask processing is performed on the precipitation binary image to obtain a processed precipitation area, which can finally expand or narrow the range of the connected domain. Here, the threshold of graph expansion or erosion is selected as 4 connected domains, and only one operation is performed at the same time, first erosion and then expansion. The purpose of this step is to reduce the influence of a single precipitation pixel and the influence of random noise, so that the whole algorithm focuses on the larger scale. large precipitation events.

在实际应用中,所述根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量,具体为:In practical applications, the sampling quantity of the precipitation connected domain is obtained according to the pixel count of the precipitation connected domain, the pixel count of the precipitation region, and the preset precipitation sample quantity, specifically:

根据Mi=(Ni/Ntotal)*Mtotal 公式(1)According to M i =(N i /N total )*M total formula (1)

计算第i个降水连通域的取样数量,其中,Mi为第i个降水连通域的取样数量,Mtotal为预设降水样本数量;Ni为第i个降水连通域的像素数,Ntotal为降水区域的像素数。Calculate the sampling number of the ith precipitation connected domain, where Mi is the sampling number of the ith precipitation connected domain, M total is the preset number of precipitation samples; Ni is the number of pixels of the ith precipitation connected domain, N total is the number of pixels in the precipitation area.

在实际应用中,所述根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本,具体为:In practical applications, sampling the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain to obtain the precipitation sample of the precipitation connected domain, specifically:

根据Si={Pk|k=[(t-0.5)/Mi*Ni],t∈N+,1≤t≤Mi} 公式(2)According to S i ={P k |k=[(t-0.5)/M i *N i ], t∈N + , 1≤t≤M i } Formula (2)

得到第i个降水连通域的降水样本,其中Si为第i个降水连通域的降水样本,Pk为第i个降水连通域的降水样本中第k个降水像素,t为降水连通域的样本序号,Mi为第i个降水连通域的取样数量,Ni为第i个降水连通域的像素数。Obtain the precipitation sample of the ith precipitation connected domain, where S i is the precipitation sample of the ith precipitation connected domain, P k is the kth precipitation pixel in the precipitation sample of the ith precipitation connected domain, and t is the precipitation pixel of the ith precipitation connected domain. Sample serial number, Mi is the sampling number of the ith precipitation connected domain, and Ni is the number of pixels of the ith precipitation connected domain.

在实际应用中,所述对所述非降水区域的像素进行随机取样得到非降水样本,具体包括:In practical applications, the random sampling of pixels in the non-precipitation area to obtain non-precipitation samples specifically includes:

根据所述非降水区域中各像素的位置对所述非降水区域中各像素进行标号得到所述非降水区域各像素的像素序号。A pixel serial number of each pixel in the non-precipitation area is obtained by labeling each pixel in the non-precipitation area according to the position of each pixel in the non-precipitation area.

根据预设非降水样本数量和各像素的像素序号对所述非降水区域的像素进行随机取样得到非降水样本。The non-precipitation samples are obtained by randomly sampling the pixels in the non-precipitation area according to the preset number of non-precipitation samples and the pixel serial number of each pixel.

如图2所示,本实施例还提供了应用上述方法对2018年6月1日经纬度为(105.0° -130.0°E,18.2° -38.0°N)的降雨量进行实验,最终得到与预设名额一致的150个降水样本与450个非降水样本,具体步骤为:As shown in FIG. 2 , this embodiment also provides an experiment on the rainfall with the latitude and longitude of (105.0°-130.0°E, 18.2°-38.0°N) on June 1, 2018 by applying the above method, and finally obtained and preset There are 150 precipitation samples and 450 non-precipitation samples with the same quota. The specific steps are:

步骤S210:对降水图进行阈值分割,得到降水区域与非降水区域的二值图像。Step S210: Perform threshold segmentation on the precipitation map to obtain binary images of the precipitation area and the non-precipitation area.

步骤S220:对于非降水区域进行根据地理位置的平均取样选取方法。Step S220: Perform an average sampling selection method according to geographic location for the non-precipitation area.

步骤S230:对降水区域进行图像腐蚀和图像膨胀操作。Step S230: Perform image erosion and image expansion operations on the precipitation area.

步骤S240:对降水区域进行降水事件连通域划分。Step S240: Divide the precipitation event connected domain for the precipitation area.

步骤S250:对于已经划分好的降水区域,根据其面积进行降水名额分配。具体为根据公式(1)进行降水名额分配根据公式(2)降水样本选取。Step S250: For the precipitation area that has been divided, perform precipitation quota allocation according to its area. Specifically, the precipitation quota is allocated according to formula (1), and the precipitation samples are selected according to formula (2).

步骤S260:将降水事件样本与非降水事件样本整合为最终降水样本集。Step S260: Integrate the precipitation event samples and the non-precipitation event samples into a final precipitation sample set.

其效果对比如图3所示,其中图3(a)为常见的平均随机取样法得到的结果图,而图3(b)则为应用本发明提供的基于降水事件的自适应样本选择方法得到的结果图,通过图3可知,本实施例提供的方法可替代原先常见的平均随机样本取样法,生成具有降水事件特征,同时代表性高普适性高的,可以根据预设名额调整降水与非降水样本数量的降水训练样本集。而后将此数据集输入机器学习框架,即可进行降水反演。The effect comparison is shown in Figure 3, in which Figure 3(a) is the result obtained by the common average random sampling method, and Figure 3(b) is obtained by applying the adaptive sample selection method based on precipitation events provided by the present invention. As can be seen from Figure 3, the method provided in this embodiment can replace the original common average random sample sampling method to generate precipitation event characteristics, high representativeness and high universality, and precipitation and precipitation can be adjusted according to the preset quota. The precipitation training sample set for the non-precipitation sample number. This dataset is then fed into a machine learning framework for precipitation inversion.

如图4所示,本实施例还提供了一种与上述方法对应的降水事件引导的遥感降水反演训练样本自适应选取系统,所述系统包括:As shown in FIG. 4 , the present embodiment also provides a system for adaptive selection of training samples for remote sensing precipitation inversion guided by precipitation events corresponding to the above method, and the system includes:

获取模块A1,用于获取降水二值图,所述降水二值图包括降水区域和非降水区域。The obtaining module A1 is configured to obtain a precipitation binary map, where the precipitation binary map includes a precipitation area and a non-precipitation area.

降水连通域确定模块A2,用于采用分类算法对所述降水区域进行连通域划分得到多个降水连通域。The precipitation connected domain determination module A2 is configured to use a classification algorithm to divide the connected domain of the precipitation area to obtain a plurality of precipitation connected domains.

数量确定模块A3,用于对于任意一个降水连通域,根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量。The quantity determination module A3 is configured to, for any precipitation connected domain, obtain the sampling quantity of the precipitation connected domain according to the pixel count of the precipitation connected domain, the pixel count of the precipitation region and the preset precipitation sample quantity.

降水样本确定模块A4,用于根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本。The precipitation sample determination module A4 is configured to sample the pixels of the precipitation connected area according to the sampling quantity of the precipitation connected area and the number of pixels of the precipitation connected area, so as to obtain the precipitation sample of the precipitation connected area.

非降水样本确定模块A5,用于对所述非降水区域的像素进行随机取样得到非降水样本。The non-precipitation sample determination module A5 is configured to randomly sample the pixels in the non-precipitation area to obtain non-precipitation samples.

反演训练样本确定模块A6,用于确定所有降水连通域的降水样本和所述非降水样本为降水反演训练样本。The inversion training sample determination module A6 is used to determine the precipitation samples of all precipitation connected domains and the non-precipitation samples as precipitation inversion training samples.

作为一种可选的实施方式,所述获取模块包括:As an optional implementation manner, the acquisition module includes:

获取单元,用于获取降水情况图,所述降水情况图的像素值为降水量,所述降水情况图的像素位置为像素对应的地理位置。The obtaining unit is configured to obtain a precipitation situation map, where the pixel value of the precipitation situation map is the amount of precipitation, and the pixel position of the precipitation situation map is the geographic location corresponding to the pixel.

二值图确定单元,用于对所述降水情况图进行阈值分割得到降水二值图。The binary map determining unit is configured to perform threshold segmentation on the precipitation situation map to obtain a precipitation binary map.

作为一种可选的实施方式,所述降水事件引导的遥感降水反演训练样本自适应选取系统还包括:处理模块,用于对所述降水二值图进行图像掩膜处理得到处理后的降水区域。As an optional implementation manner, the precipitation event-guided remote sensing precipitation inversion training sample adaptive selection system further includes: a processing module configured to perform image mask processing on the precipitation binary image to obtain processed precipitation area.

作为一种可选的实施方式,所述非降水样本确定模块包括:As an optional implementation manner, the non-precipitation sample determination module includes:

序号确定单元,用于根据所述非降水区域中各像素的位置对所述非降水区域中各像素进行标号得到所述非降水区域各像素的像素序号。A serial number determination unit, configured to label each pixel in the non-precipitation area according to the position of each pixel in the non-precipitation area to obtain the pixel serial number of each pixel in the non-precipitation area.

非降水样本确定单元,用于根据预设非降水样本数量和各像素的像素序号对所述非降水区域的像素进行随机取样得到非降水样本。A non-precipitation sample determination unit, configured to randomly sample the pixels in the non-precipitation area according to the preset number of non-precipitation samples and the pixel serial number of each pixel to obtain non-precipitation samples.

图5为本实施例提供的更为具体的降水事件引导的遥感降水反演训练样本自适应选取方法的流程图,对于实际降水图,首先通过阈值分割得到二值图,根据二值图中的像素值判断此像素是否有降水,得到降水区域与非降水区域,对非降水区域通过简单的平均取样,对降水区域进行图像腐蚀并膨胀,然后通过降水事件的划分得到彼此独立但内部连通的连通域,然后按分位点选取得到降水样本,将降水样本和非降水样本确定为样本位置集。FIG. 5 is a flowchart of a more specific method for adaptive selection of training samples for remote sensing precipitation inversion guided by precipitation events provided in this embodiment. For an actual precipitation map, a binary map is first obtained through threshold segmentation. The pixel value judges whether there is precipitation in this pixel, and obtains the precipitation area and the non-precipitation area. The non-precipitation area is sampled by simple average, and the image of the precipitation area is corroded and expanded, and then the independent but internally connected connectivity is obtained by dividing the precipitation events. Then, the precipitation samples are obtained by quantile selection, and the precipitation samples and non-precipitation samples are determined as the sample location set.

本发明有以下优点:The present invention has the following advantages:

1、本发明先根据图形学中连通域的概念将整体降水划分为各个降水区域,然后再在每一个降水区域中根据所分配名额和分位数选取样本能够有效平衡不同方法间的劣势,因为样本选择时根据降水事件作为基础,根据名额分配至降水事件中,并无大量重复和冗余,所以能够大大弥补随机选取法代表性不足的问题1. The present invention first divides the overall precipitation into various precipitation areas according to the concept of connected domain in graphics, and then selects samples in each precipitation area according to the assigned quota and quantile, which can effectively balance the disadvantages between different methods, because The sample selection is based on precipitation events and is allocated to precipitation events according to the number of places. There is not a lot of repetition and redundancy, so it can greatly make up for the lack of representativeness of the random selection method.

2、由于该方法所选取的样本集是根据降水名额分配至各个降水事件中,不会使得样本选择过于集中,使其也有足够的变异性,能够有效代表不同情况、不同特征的降水事件,以此来提高其训练出的模型的普适性避免过拟合,优于基于相似度的判别法。2. Since the sample set selected by this method is allocated to each precipitation event according to the precipitation quota, it will not make the sample selection too concentrated, so that it also has sufficient variability, and can effectively represent the precipitation events with different conditions and different characteristics. This improves the universality of the trained model and avoids overfitting, which is better than the similarity-based discriminant method.

3、无太多关键参数需要输入,仅有降水名额需要事先输入,而该参数仅基本影响数据的体量,对于数据的分布和选择策略并无直接影响,所以相对数值分布控制法,对于人为先验参数设定并无严重依赖,是一个鲁棒性较高的样本选择方法。3. There are not too many key parameters to input, only the precipitation quota needs to be input in advance, and this parameter only basically affects the volume of the data, and has no direct impact on the distribution and selection strategy of the data, so the relative numerical distribution control method, for artificial The prior parameter setting does not depend heavily, and it is a robust sample selection method.

4、能够定向寻找降水事件,根据降水事件来划分样本名额并选择,可以有效提高样本集的代表性;并且先验参数对样本选择结果影响不大,具有足够的稳定性;同时由于根据降水团的选择,样本集具有足够的变异性,能够有效防止过拟合,增加样本集的普适性。4. It can search for precipitation events in a targeted manner, and divide and select the number of samples according to the precipitation events, which can effectively improve the representativeness of the sample set; and the prior parameters have little influence on the results of sample selection and have sufficient stability; The selection of the sample set has sufficient variability, which can effectively prevent overfitting and increase the universality of the sample set.

5、本发明能够应用图像腐蚀技术,有效减少噪音点对于最终样本选择的影响。5. The present invention can apply image erosion technology to effectively reduce the influence of noise points on final sample selection.

6、本发明能够根据每个降水事件内的分位数进行样本选择,使得最终选择的降水样本集具有普适性,对于最终的样本生成结果并不敏感,可以摆脱对于先验参数的依赖。6. The present invention can select samples according to the quantile in each precipitation event, so that the final selected precipitation sample set has universality, is not sensitive to the final sample generation result, and can get rid of the dependence on a priori parameters.

本说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似部分互相参见即可。对于实施例公开的系统而言,由于其与实施例公开的方法相对应,所以描述的比较简单,相关之处参见方法部分说明即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same and similar parts between the various embodiments can be referred to each other. For the system disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and the relevant part can be referred to the description of the method.

本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处。综上所述,本说明书内容不应理解为对本发明的限制。In this paper, specific examples are used to illustrate the principles and implementations of the present invention. The descriptions of the above embodiments are only used to help understand the methods and core ideas of the present invention; meanwhile, for those skilled in the art, according to the present invention There will be changes in the specific implementation and application scope. In conclusion, the contents of this specification should not be construed as limiting the present invention.

Claims (8)

1.一种降水事件引导的遥感降水反演训练样本自适应选取方法,其特征在于,包括:1. a remote sensing precipitation inversion training sample adaptive selection method guided by a precipitation event, is characterized in that, comprising: 获取降水二值图,所述降水二值图包括降水区域和非降水区域;obtaining a precipitation binary map, where the precipitation binary map includes a precipitation area and a non-precipitation area; 采用分类算法对所述降水区域进行连通域划分得到多个降水连通域;The precipitation area is divided into connected domains by using a classification algorithm to obtain multiple precipitation connected domains; 对于任意一个降水连通域,根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量;For any precipitation connected domain, obtain the sampling number of the precipitation connected domain according to the number of pixels of the precipitation connected domain, the pixel number of the precipitation area and the preset number of precipitation samples; 根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本;sampling the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain to obtain the precipitation sample of the precipitation connected domain; 对所述非降水区域的像素进行随机取样得到非降水样本;randomly sampling the pixels of the non-precipitation area to obtain non-precipitation samples; 确定所有降水连通域的降水样本和所述非降水样本为降水反演训练样本;Determine the precipitation samples of all precipitation connected domains and the non-precipitation samples as precipitation retrieval training samples; 所述根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量,具体为:The sampling number of the precipitation connected region is obtained according to the number of pixels of the precipitation connected region, the number of pixels of the precipitation region and the preset number of precipitation samples, specifically: 根据公式Mi=(Ni/Ntotal)*Mtotal计算第i个降水连通域的取样数量,其中,Mi为第i个降水连通域的取样数量,Mtotal为预设降水样本数量;Ni为第i个降水连通域的像素数,Ntotal为降水区域的像素数;Calculate the sampling quantity of the ith precipitation connected domain according to the formula Mi = (N i / N total )*M total , where Mi is the sampling quantity of the ith precipitation connected domain, and M total is the preset precipitation sample quantity; Ni is the number of pixels in the i -th precipitation connected domain, and N total is the number of pixels in the precipitation area; 所述根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本,具体为:The sampling of the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain is performed to obtain the precipitation sample of the precipitation connected domain, specifically: 根据公式Si={Pk|k=[(t-0.5)/Mi*Ni],t∈N+,1≤t≤Mi}得到第i个降水连通域的降水样本,其中Si为第i个降水连通域的降水样本,Pk为第i个降水连通域的降水样本中第k个降水像素,t为降水连通域的样本序号,Mi为第i个降水连通域的取样数量,Ni为第i个降水连通域的像素数。According to the formula S i ={P k |k=[(t-0.5)/M i *N i ], t∈N + , 1≤t≤M i }, the precipitation samples of the i-th precipitation connected domain are obtained, where S i is the precipitation sample of the i-th precipitation connected domain, P k is the k-th precipitation pixel in the precipitation sample of the i -th precipitation connected domain, t is the sample number of the precipitation Number of samples, Ni is the number of pixels in the i -th precipitation connected domain. 2.根据权利要求1所述的一种降水事件引导的遥感降水反演训练样本自适应选取方法,其特征在于,所述获取降水二值图,具体包括:2. The method for self-adapting selection of training samples for remote sensing precipitation inversion guided by a precipitation event according to claim 1, wherein the obtaining of a binary map of precipitation specifically comprises: 获取降水情况图,所述降水情况图的像素值为降水量,所述降水情况图的像素位置为像素对应的地理位置;obtaining a precipitation situation map, where the pixel value of the precipitation situation map is the amount of precipitation, and the pixel position of the precipitation situation map is the geographic location corresponding to the pixel; 对所述降水情况图进行阈值分割得到降水二值图。Threshold segmentation is performed on the precipitation situation map to obtain a precipitation binary map. 3.根据权利要求1所述的一种降水事件引导的遥感降水反演训练样本自适应选取方法,其特征在于,在所述采用分类算法对所述降水区域进行连通域划分得到多个降水连通域之前还包括:3. The method for adaptively selecting training samples for remote sensing precipitation inversion guided by a precipitation event according to claim 1, characterized in that, in the described precipitation region using a classification algorithm, a connected domain is divided to obtain a plurality of precipitation connectivity The domain also includes: 对所述降水二值图进行图像掩膜处理得到处理后的降水区域。Perform image mask processing on the precipitation binary image to obtain a processed precipitation area. 4.根据权利要求1所述的一种降水事件引导的遥感降水反演训练样本自适应选取方法,其特征在于,所述对所述非降水区域的像素进行随机取样得到非降水样本,具体包括:4 . The method for adaptively selecting training samples for remote sensing precipitation inversion guided by a precipitation event according to claim 1 , wherein the random sampling of pixels in the non-precipitation area to obtain non-precipitation samples, specifically comprising: 5 . : 根据所述非降水区域中各像素的位置对所述非降水区域中各像素进行标号得到所述非降水区域各像素的像素序号;Labeling each pixel in the non-precipitation area according to the position of each pixel in the non-precipitation area to obtain the pixel serial number of each pixel in the non-precipitation area; 根据预设非降水样本数量和各像素的像素序号对所述非降水区域的像素进行随机取样得到非降水样本。The non-precipitation samples are obtained by randomly sampling the pixels in the non-precipitation area according to the preset number of non-precipitation samples and the pixel serial number of each pixel. 5.一种降水事件引导的遥感降水反演训练样本自适应选取系统,其特征在于,包括:5. A remote sensing precipitation inversion training sample adaptive selection system guided by a precipitation event is characterized in that, comprising: 获取模块,用于获取降水二值图,所述降水二值图包括降水区域和非降水区域;an obtaining module, configured to obtain a precipitation binary image, wherein the precipitation binary image includes a precipitation area and a non-precipitation area; 降水连通域确定模块,用于采用分类算法对所述降水区域进行连通域划分得到多个降水连通域;A precipitation connected domain determining module, used for dividing the precipitation region into a connected domain by using a classification algorithm to obtain a plurality of precipitation connected domains; 数量确定模块,用于对于任意一个降水连通域,根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量;A quantity determination module, configured to obtain the sampling quantity of the precipitation connected domain according to the number of pixels of the precipitation connected domain, the pixel quantity of the precipitation area and the preset precipitation sample quantity for any precipitation connected domain; 降水样本确定模块,用于根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本;a precipitation sample determination module, configured to sample the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain to obtain the precipitation sample of the precipitation connected domain; 非降水样本确定模块,用于对所述非降水区域的像素进行随机取样得到非降水样本;a non-precipitation sample determination module, configured to randomly sample the pixels in the non-precipitation area to obtain non-precipitation samples; 反演训练样本确定模块,用于确定所有降水连通域的降水样本和所述非降水样本为降水反演训练样本;The inversion training sample determination module is used to determine the precipitation samples of all precipitation connected domains and the non-precipitation samples as the precipitation inversion training samples; 所述根据所述降水连通域的像素数、所述降水区域的像素数和预设降水样本数量得到所述降水连通域的取样数量,具体为:The sampling number of the precipitation connected region is obtained according to the number of pixels in the precipitation connected region, the number of pixels in the precipitation region, and the preset number of precipitation samples, specifically: 根据公式Mi=(Ni/Ntotal)*Mtotal计算第i个降水连通域的取样数量,其中,Mi为第i个降水连通域的取样数量,Mtotal为预设降水样本数量;Ni为第i个降水连通域的像素数,Ntotal为降水区域的像素数;Calculate the sampling quantity of the ith precipitation connected domain according to the formula Mi = (N i / N total )*M total , where Mi is the sampling quantity of the ith precipitation connected domain, and M total is the preset precipitation sample quantity; Ni is the number of pixels in the i -th precipitation connected domain, and N total is the number of pixels in the precipitation area; 所述根据所述降水连通域的取样数量和所述降水连通域的像素数对所述降水连通域的像素进行取样,得到所述降水连通域的降水样本,具体为:The sampling of the pixels of the precipitation connected domain according to the sampling quantity of the precipitation connected domain and the number of pixels of the precipitation connected domain is performed to obtain the precipitation sample of the precipitation connected domain, specifically: 根据公式Si={Pk|k=[(t-0.5)/Mi*Ni],t∈N+,1≤t≤Mi}得到第i个降水连通域的降水样本,其中Si为第i个降水连通域的降水样本,Pk为第i个降水连通域的降水样本中第k个降水像素,t为降水连通域的样本序号,Mi为第i个降水连通域的取样数量,Ni为第i个降水连通域的像素数。According to the formula S i ={P k |k=[(t-0.5)/M i *N i ], t∈N + , 1≤t≤M i }, the precipitation samples of the i-th precipitation connected domain are obtained, where S i is the precipitation sample of the i-th precipitation connected domain, P k is the k-th precipitation pixel in the precipitation sample of the i -th precipitation connected domain, t is the sample number of the precipitation Number of samples, Ni is the number of pixels in the i -th precipitation connected domain. 6.根据权利要求5所述的一种降水事件引导的遥感降水反演训练样本自适应选取系统,其特征在于,所述获取模块包括:6. The remote sensing precipitation inversion training sample adaptive selection system guided by a precipitation event according to claim 5, wherein the acquisition module comprises: 获取单元,用于获取降水情况图,所述降水情况图的像素值为降水量,所述降水情况图的像素位置为像素对应的地理位置;an acquiring unit, configured to acquire a precipitation situation map, where the pixel value of the precipitation situation map is the amount of precipitation, and the pixel position of the precipitation situation map is the geographic location corresponding to the pixel; 二值图确定单元,用于对所述降水情况图进行阈值分割得到降水二值图。The binary image determination unit is configured to perform threshold segmentation on the precipitation situation map to obtain a precipitation binary image. 7.根据权利要求5所述的一种降水事件引导的遥感降水反演训练样本自适应选取系统,其特征在于,还包括:处理模块,用于对所述降水二值图进行图像掩膜处理得到处理后的降水区域。7. The precipitation event-guided remote sensing precipitation inversion training sample adaptive selection system according to claim 5, further comprising: a processing module for performing image mask processing on the precipitation binary image Get the processed precipitation area. 8.根据权利要求5所述的一种降水事件引导的遥感降水反演训练样本自适应选取系统,其特征在于,所述非降水样本确定模块包括:8. The remote sensing precipitation inversion training sample adaptive selection system guided by a precipitation event according to claim 5, wherein the non-precipitation sample determination module comprises: 序号确定单元,用于根据所述非降水区域中各像素的位置对所述非降水区域中各像素进行标号得到所述非降水区域各像素的像素序号;a serial number determination unit, configured to label each pixel in the non-precipitation area according to the position of each pixel in the non-precipitation area to obtain the pixel serial number of each pixel in the non-precipitation area; 非降水样本确定单元,用于根据预设非降水样本数量和各像素的像素序号对所述非降水区域的像素进行随机取样得到非降水样本。A non-precipitation sample determination unit, configured to randomly sample the pixels in the non-precipitation area according to the preset number of non-precipitation samples and the pixel serial number of each pixel to obtain non-precipitation samples.
CN202110655233.5A 2021-06-11 2021-06-11 Rainfall event guided remote sensing rainfall inversion training sample self-adaptive selection method Active CN113269805B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110655233.5A CN113269805B (en) 2021-06-11 2021-06-11 Rainfall event guided remote sensing rainfall inversion training sample self-adaptive selection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110655233.5A CN113269805B (en) 2021-06-11 2021-06-11 Rainfall event guided remote sensing rainfall inversion training sample self-adaptive selection method

Publications (2)

Publication Number Publication Date
CN113269805A CN113269805A (en) 2021-08-17
CN113269805B true CN113269805B (en) 2022-06-28

Family

ID=77234884

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110655233.5A Active CN113269805B (en) 2021-06-11 2021-06-11 Rainfall event guided remote sensing rainfall inversion training sample self-adaptive selection method

Country Status (1)

Country Link
CN (1) CN113269805B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115034317B (en) * 2022-06-17 2025-01-03 中国平安人寿保险股份有限公司 Training method and device for insurance policy recognition model, insurance policy recognition method and device
CN116468737B (en) * 2023-04-10 2024-07-26 华能澜沧江水电股份有限公司 Rainfall information acquisition method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112053371A (en) * 2020-09-29 2020-12-08 成都星时代宇航科技有限公司 Water body extraction method and device in remote sensing image
WO2021094753A1 (en) * 2019-11-11 2021-05-20 University Of Newcastle Upon Tyne Configuration of global satellite navigation system infrastructure

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021094753A1 (en) * 2019-11-11 2021-05-20 University Of Newcastle Upon Tyne Configuration of global satellite navigation system infrastructure
CN112053371A (en) * 2020-09-29 2020-12-08 成都星时代宇航科技有限公司 Water body extraction method and device in remote sensing image

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
A Method for Detecting Rainfall From X-Band Marine Radar Images;Yan Zheng et al.;《IEEE Access》;20200122;第08卷;全文 *
Himawari-8/AHI红外光谱资料降水信号识别与反演初步应用研究;王根等;《红外与毫米波学报》;20200415(第02期);全文 *
KNN不同距离度量对FY-4A/ AGRI红外亮温反演降水的影响研究;王根等;《红外》;20200425(第04期);全文 *

Also Published As

Publication number Publication date
CN113269805A (en) 2021-08-17

Similar Documents

Publication Publication Date Title
CN109086811B (en) Multi-label image classification method and device and electronic equipment
CN109919910B (en) SAR image change detection method based on difference map fusion and improved level set
CN111523414A (en) Face recognition method and device, computer equipment and storage medium
CN113269805B (en) Rainfall event guided remote sensing rainfall inversion training sample self-adaptive selection method
CN111046787A (en) A Pedestrian Detection Method Based on Improved YOLO v3 Model
CN109272016A (en) Target detection method, device, terminal equipment and computer readable storage medium
CN112232506B (en) Network model training method, image target recognition method, device and electronic device
CN105549009B (en) A kind of SAR image CFAR object detection methods based on super-pixel
CN107123130B (en) A Kernel Correlation Filtering Target Tracking Method Based on Superpixel and Hybrid Hash
CN114494260A (en) Object defect detection method and device, computer equipment and storage medium
CN106780727B (en) Vehicle head detection model reconstruction method and device
WO2023116632A1 (en) Video instance segmentation method and apparatus based on spatio-temporal memory information
CN107169962A (en) The gray level image fast partition method of Kernel fuzzy clustering is constrained based on space density
CN110533632B (en) Image blurring tampering detection method and device, computer equipment and storage medium
CN110321808B (en) Method, apparatus and storage medium for detecting carry-over and stolen object
CN107240104B (en) Point cloud data segmentation method and terminal
CN108846845B (en) SAR image segmentation method based on thumbnail and hierarchical fuzzy clustering
CN110866872A (en) A method, device and electronic device for intelligent selection of pavement crack image preprocessing
CN110033012A (en) A kind of production method for tracking target based on channel characteristics weighted convolution neural network
CN111738319B (en) Clustering result evaluation method and device based on large-scale samples
CN112949519A (en) Target detection method, device, equipment and storage medium
CN105447488A (en) SAR (synthetic aperture radar) image target detection method based on sketch line segment topological structure
CN113486879B (en) Image area suggestion frame detection method, device, equipment and storage medium
CN111985439B (en) Face detection method, device, equipment and storage medium
CN110580483A (en) indoor and outdoor user distinguishing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant