WO2019129032A1 - 遥感图像识别方法、装置、存储介质以及电子设备 - Google Patents
遥感图像识别方法、装置、存储介质以及电子设备 Download PDFInfo
- Publication number
- WO2019129032A1 WO2019129032A1 PCT/CN2018/123807 CN2018123807W WO2019129032A1 WO 2019129032 A1 WO2019129032 A1 WO 2019129032A1 CN 2018123807 W CN2018123807 W CN 2018123807W WO 2019129032 A1 WO2019129032 A1 WO 2019129032A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- remote sensing
- sensing image
- image block
- pixel
- probability information
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 79
- 238000012545 processing Methods 0.000 claims abstract description 95
- 238000013528 artificial neural network Methods 0.000 claims abstract description 92
- 230000009467 reduction Effects 0.000 claims abstract description 89
- 238000004590 computer program Methods 0.000 claims abstract description 27
- 238000003709 image segmentation Methods 0.000 claims description 14
- 238000012549 training Methods 0.000 claims description 7
- 230000004044 response Effects 0.000 claims description 4
- 230000008569 process Effects 0.000 description 12
- 238000004891 communication Methods 0.000 description 11
- 238000011946 reduction process Methods 0.000 description 11
- 238000004364 calculation method Methods 0.000 description 8
- 238000013527 convolutional neural network Methods 0.000 description 7
- 230000001133 acceleration Effects 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- 210000002569 neuron Anatomy 0.000 description 6
- 230000009286 beneficial effect Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000013507 mapping Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 230000002411 adverse Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/443—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
- G06V10/449—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
- G06V10/451—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
- G06V10/454—Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
- G06V20/13—Satellite images
Definitions
- the present application relates to computer vision technology, and in particular to a remote sensing image recognition method, a remote sensing image recognition device, a computer readable storage medium, and an electronic device.
- the prediction result of one pixel is usually determined by other pixels in a certain area around the pixel, and this area may be called the receptive field of the convolutional neural network neurons.
- the receptive field of convolutional neural network neurons is usually limited (such as a few hundred pixels), while the size of remote sensing images is usually large (such as the length and width of remote sensing images can reach thousands or even tens of thousands of pixels), therefore, neurons are limited. Receptive fields may not be able to obtain enough environmental information, which often leads to false recognition of pixels in remote sensing images by convolutional neural networks.
- the embodiment of the present application provides a technical solution for remote sensing image recognition.
- a remote sensing image recognition method which mainly includes: performing a resolution reduction process on a remote sensing image to be obtained, and obtaining a remote sensing image after the resolution reduction; Separating at least one remote sensing image block from the processed remote sensing image; determining a remote sensing image block to be processed from the at least one remote sensing image block; obtaining classification probability information of pixels in the to-be-processed remote sensing image block via a neural network; Determining, according to the classification probability information of the pixels in the remote sensing image block to be processed, the recognition result of the remote sensing image to be identified.
- a remote sensing image recognition apparatus includes: a resolution reduction module, configured to perform a resolution reduction process on the remote sensing image to be obtained, and obtain a remote sensing image after the resolution reduction processing a remote sensing image segmentation module, configured to slice at least one remote sensing image block from the reduced resolution processed remote sensing image; and select an image block module to determine a remote sensing to be processed from the at least one remote sensing image block a categorization processing module, configured to obtain, by using a neural network, classification probability information of pixels in the to-be-processed remote sensing image block; and determining a recognition result module, configured to perform classification probability information according to pixels in the to-be-processed remote sensing image block Determining the recognition result of the remote sensing image to be identified.
- a resolution reduction module configured to perform a resolution reduction process on the remote sensing image to be obtained, and obtain a remote sensing image after the resolution reduction processing
- a remote sensing image segmentation module configured to slice at least one remote sensing image block from the
- an electronic device comprising: a memory for storing a computer program; a processor for executing a computer program stored in the memory, and when the computer program is executed The method of any of the embodiments of the present application is implemented.
- a computer readable storage medium having stored thereon a computer program, which when executed by a processor, implements the method of any of the embodiments of the present application.
- a computer program is provided that, when executed by a processor in a device, implements the method of any of the embodiments of the present application.
- the present application selects a remote sensing image block from the remote sensing image after the resolution reduction processing, and inputs the neural network. It avoids the phenomenon that all the remote sensing image blocks that are segmented are input into the neural network after each resolution reduction, and the calculation of the neural network is large.
- FIG. 1 is a schematic flow chart of an embodiment of a remote sensing image recognition method according to the present application.
- FIG. 2 is a schematic flow chart of another embodiment of a remote sensing image recognition method according to the present application.
- FIG. 3 is a schematic flow chart of an embodiment of selecting a remote sensing image block according to a current confidence of a pixel of a remote sensing image to be identified according to the present application;
- FIG. 4 is a schematic flowchart of an embodiment of maintaining current classification probability information of at least one pixel of a remote sensing image to be recognized according to the present application;
- FIG. 5 is a schematic diagram of an embodiment of a remote sensing image recognition apparatus according to the present application.
- FIG. 6 is a block diagram of an exemplary device that implements an embodiment of the present application.
- FIG. 7 is a schematic diagram of an application scenario of the present application.
- Embodiments of the present application can be applied to electronic devices such as terminal devices, computer systems, servers, etc., which can operate with numerous other general purpose or special purpose computing system environments or configurations.
- Examples of well-known computing systems, environments, and/or configurations suitable for use with electronic devices such as terminal devices, computer systems, servers, and the like include, but are not limited to, personal computer systems, server computer systems, thin clients, thick clients, handhelds Or laptop devices, microprocessor-based systems, set-top boxes, programmable consumer electronics, networked personal computers, small computer systems, mainframe computer systems, and distributed cloud computing technology environments including any of the above, and the like.
- Electronic devices such as terminal devices, computer systems, servers, etc., can be described in the general context of computer system executable instructions (such as program modules) being executed by a computer system.
- program modules may include routines, programs, target programs, components, logic, and data structures, etc., which perform particular tasks or implement particular abstract data types.
- the computer system/server can be implemented in a distributed cloud computing environment where tasks are performed by remote processing devices that are linked through a communication network.
- program modules may be located on a local or remote computing system storage medium including storage devices.
- the technical solution for realizing remote sensing image recognition may be a single chip microcomputer, an FPGA (Field Programmable Gate Array), a microprocessor, a smart mobile phone, a notebook computer, a tablet computer, a desktop computer or a server.
- An electronic device capable of running a computer program (which may also be referred to as program code) may be implemented, and the computer program may be stored in a computer readable storage medium such as a flash memory, a cache, a hard disk, or an optical disk.
- FIG. 1 is a schematic flow chart of an embodiment of a remote sensing image recognition method according to the present application.
- the remote sensing image recognition method of the present embodiment mainly includes an operation S100, an operation S110, an operation S120, an operation S130, and an operation S140. The respective operations of the present application will be described in detail below.
- the present application may perform a resolution reduction process on the remote sensing image in a plurality of manners.
- the present application is preset with a reduced scale set, and the reduced scale set includes at least two different reduced values.
- An unreduced reduction value may be selected from the reduced scale set, and the remote sensing image is subjected to a resolution reduction process according to the selected reduced value.
- the present application when selecting a reduced value that has not been used, the present application usually preferentially selects the smallest reduced value that has not been used.
- the present application can also perform resolution reduction processing on the remote sensing image to be recognized by means of pooling processing or downsampling processing. The present application does not limit the implementation of the resolution reduction processing of the remote sensing image to be recognized.
- S100 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a down-resolution module 500 that is executed by the processor.
- the present application generally divides the remote sensing image obtained by reducing the resolution processing into a plurality of remote sensing image blocks according to the size requirement of the neural network, and the size of all the remote sensing image blocks that are sliced is usually It's exactly the same.
- Each remote sensing image block that is segmented corresponds to one region in the remote sensing image to be identified, and the corresponding regions of different remote sensing image blocks are different.
- the size of the region corresponding to the remote sensing image block is usually related to the reduction value and the size requirement of the neural network for the input image; in one example, the size of the input image in the neural network is 561 ⁇ 561, and the resolution is reduced by two.
- the size of the remote sensing image block corresponding to the remote sensing image to be identified in the present application is 1122 ⁇ 1122.
- the first remote sensing image block that is segmented corresponds to the remote sensing image to be identified.
- the last remote sensing image block that is segmented corresponds to the area of the lower right corner 1122 ⁇ 1122 in the remote sensing image to be identified.
- the application does not limit the number of segmented remote sensing image blocks and the size of the remote sensing image block.
- S101 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a remote sensing image segmentation module 510 that is executed by the processor.
- the present application can treat all the remote sensing image blocks as the remote sensing image blocks to be processed, and not in the case of performing the resolution reduction processing for the first time.
- the number of remote sensing image blocks to be processed determined by the present application is usually smaller than the number of all remote sensing image blocks currently segmented.
- the present application may select the at least one remote sensing image block that is segmented according to the current confidence of the pixel of the remote sensing image to be identified, and select The remote sensing image block is the remote sensing image block to be processed.
- the present application first determines a current confidence of at least one pixel according to current classification probability information of at least one pixel in the remote sensing image to be identified, for example, calculating a maximum of the pixel for a pixel in the remote sensing image to be identified.
- the ratio of the classification probability value to the sub-classification probability value, and the calculated ratio is used as the current confidence of the pixel, according to which the current confidence of all pixels in the remote sensing image to be identified can be obtained. Then, the number of pixels whose current confidence level of the pixel in the remote sensing image region to be identified corresponding to the remote sensing image block reaches a predetermined confidence level is counted, and the remote sensing image block whose number of pixels reaching the predetermined confidence level does not meet the predetermined requirement is determined as the remote sensing to be processed.
- the remote sensing image block determines whether the ratio of the number of pixels reaching the predetermined confidence level to the number of all pixels in the remote sensing image region to be recognized corresponding to the remote sensing image block is less than a predetermined ratio. If the ratio of the number of pixels reaching the predetermined confidence level to the number of all pixels in the remote sensing image region to be recognized corresponding to the remote sensing image block is less than a predetermined ratio, The remote sensing image block is determined as a remote sensing image block to be processed.
- the present application can obtain current classification probability information of at least one pixel of the remote sensing image to be identified by using the following two methods:
- the classification probability information of at least one pixel in each remote sensing image block to be processed output by the neural network is mapped into the remote sensing image to be recognized, and the remote sensing image to be recognized is obtained.
- Classification probability information for at least one pixel may use the classification probability information of at least one pixel of the currently obtained remote sensing image to be recognized as the current classification probability information of at least one pixel of the remote sensing image to be identified.
- the present application obtains current classification probability information of at least one pixel of the remote sensing image to be identified.
- S102 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a selected image block module 520 executed by the processor.
- the neural network in the present application may be a convolutional neural network, and the present application does not limit the representation of the neural network.
- the present invention reduces the resolution processing of the remote sensing image to be recognized, and inputs the image block to be processed in the remote sensing image after the resolution reduction processing into the neural network, thereby improving the receptive field of the neuron in the neural network, thereby It is helpful to avoid the misidentification of the pixels of the remote sensing image by the neural network.
- the neural network in the present application is trained using remote sensing image samples having a plurality of different resolutions.
- the resolution of each of the remote sensing image samples belongs to a resolution corresponding to each of the reduced values in the reduced scale set.
- the scaling set includes: a first reduced value, a second reduced value, and a third reduced value, the first reduced value corresponds to the first resolution, the second reduced value corresponds to the second resolution, and the third reduced value corresponds to the first
- the present application can use the remote sensing image samples having the first resolution, the remote sensing image samples having the second resolution, and the remote sensing image samples having the third resolution to train the trained neural network, and successfully training.
- Neural networks can be used for remote sensing image recognition of the present application.
- S103 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a classification processing module 530 executed by the processor.
- the application may perform the remote sensing image block according to the to-be-processed image block in response to the non-existing reduced value in the reduced scale set or the undetectable remote sensing image block from the at least one remote sensing image block.
- the operation of the classification probability information of the pixel in the determination of the recognition result of the remote sensing image to be recognized for example, for a pixel in the remote sensing image to be identified, determining the probability value belonging to each category in the current classification probability information of the pixel
- the maximum probability value, the category corresponding to the maximum probability value and the maximum probability value is used as the recognition result of the pixel.
- S104 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a determination recognition result module 540 that is executed by the processor.
- FIG. 2 is a schematic flow chart of another embodiment of a remote sensing image recognition method according to the present application.
- the remote sensing image recognition method of the present embodiment mainly includes an operation S200, an operation S210, an operation S220, and an operation S230.
- S200 Perform a resolution reduction process on the remote sensing image to be recognized according to the minimum reduced value that is not used in the reduced ratio set.
- the reduced scale set is formed by a plurality of reduced values, each of which has a value greater than zero and no more than one.
- the reduced value, 0.25 means that the resolution of the remote sensing image to be identified is reduced to a quarter of the original resolution
- 0.5 means that the resolution of the remote sensing image to be identified is reduced to one-half of the original resolution
- 1 represents the remote sensing to be recognized. The resolution of the image.
- This embodiment takes 1 as a special reduction value, for example, it can be considered as a reduction value that keeps the remote sensing image to be recognized at the original resolution.
- the present application does not limit the magnitude of the reduction value included in the reduced scale set and the number of reduced values included.
- the present application should select a reduction value from the reduced scale set each time according to the arrangement order of the reduction values from small to large, and perform resolution reduction processing on the remote sensing image according to the selected reduced value.
- the selected reduction value becomes the used reduction value.
- S200 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a down-resolution module 500 that is executed by the processor.
- the remote sensing image after the resolution reduction processing is divided into at least one remote sensing image block.
- the present application generally divides the remote sensing image obtained by reducing the resolution processing into a plurality of remote sensing image blocks according to the size requirement of the neural network, and the size of all the remote sensing image blocks that are sliced is usually It's exactly the same.
- Each remote sensing image block that is segmented corresponds to one region in the remote sensing image to be identified, and the corresponding regions of different remote sensing image blocks are different.
- the size of the region corresponding to the remote sensing image block is usually related to the reduction value and the size requirement of the neural network for the input image; in one example, the size of the input image in the neural network is 561 ⁇ 561, and the resolution is reduced by two.
- the size of the remote sensing image block corresponding to the remote sensing image to be identified in the present application is 1122 ⁇ 1122.
- the first remote sensing image block that is segmented corresponds to the remote sensing image to be identified.
- the last remote sensing image block that is segmented corresponds to the area of the lower right corner 1122 ⁇ 1122 in the remote sensing image to be identified.
- the application does not limit the number of segmented remote sensing image blocks and the size of the remote sensing image block.
- S201 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a remote sensing image segmentation module 510 that is executed by the processor.
- S220 Select a remote sensing image block from the at least one remote sensing image block according to a current confidence of the pixel of the remote sensing image to be identified, and obtain, by using a neural network, classification probability information of at least one pixel in each selected remote sensing image block.
- the selected remote sensing image block is the above-mentioned remote sensing image block to be processed.
- the present application in the case of performing the resolution reduction processing for the first time, generally uses all of the segmented remote sensing image blocks as the selected remote sensing image blocks.
- the present application can be considered that in the case of performing the resolution reduction processing for the first time, the current confidence of the pixel of the remote sensing image to be recognized is initialized to 0, and therefore, all the remote sensing image blocks that are segmented are selected, and respectively Input into the neural network, so that the classification probability information of all the pixels of all the remote sensing image blocks that are cut out can be obtained via the neural network, and the classification probability information of all the pixels of all the remote sensing image blocks obtained this time can be used for updating The current confidence of the pixels of the remote sensing image is identified.
- the neural network of the present application can be classified for C categories. After each remote sensing image block is respectively transmitted via a neural network, the present application can obtain all pixels in all remote sensing image blocks for C respectively. Classification probability information for the category.
- the present application generally selects all the remote sensing image blocks that are cut out according to the current confidence of the pixels of the remote sensing image to be identified.
- Remote sensing image block It should be specially noted that the remote sensing image blocks selected this time may be all the remote sensing image blocks that have been segmented this time, and it is also possible to partially segment the remote sensing image blocks in all the remote sensing image blocks; It is also possible that no remote sensing image block can be selected from all the remote sensing image blocks that have been cut out this time.
- the present application reduces the resolution of the remote sensing image to be recognized, and inputs the image block in the remote sensing image after the resolution reduction into a neural network (such as a convolutional neural network), which is beneficial to improving the neural network (such as a convolutional neural network).
- a neural network such as a convolutional neural network
- the receptive field of the neuron is beneficial to avoid the misidentification of the pixels of the remote sensing image by the neural network.
- S202 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a selected image block module 520 and a classification processing module 530 that are executed by the processor.
- FIG. 3 An optional example of selecting a remote sensing image block (ie, a remote sensing image block to be processed) according to the current confidence of the pixel of the remote sensing image to be identified is shown in FIG. 3 .
- 300 determining a current confidence level of at least one pixel according to current classification probability information of at least one pixel of the remote sensing image to be identified.
- the current classification probability information of at least one pixel of the remote sensing image to be identified in the present application is based on a classification of at least one pixel in the remote sensing image block output by the neural network for at least one remote sensing image block input in the past. Formed by probability information.
- the present application needs to maintain the current classification probability information of at least one pixel of the remote sensing image (such as updating the current classification probability information of at least one pixel of the remote sensing image to be recognized).
- the manner in which the present application maintains the current classification probability information of at least one pixel of the remote sensing image can be referred to the following description for FIG. 4.
- the present application determines the pixel according to the current classification probability information of the pixel (i, j) of the remote sensing image to be identified (i, j).
- the current confidence can be set by: the neural network can identify C categories from the remote sensing image, and the current classification probability information of the pixel (i, j) will include C probability values, which can be from this C Among the probability values, the maximum probability value and the second largest probability value (ie, the second largest probability value) are selected, and the ratio of the selected maximum classification probability value to the second largest classification probability value is calculated, and the ratio may be used as the ratio in the present application.
- the current confidence of the pixel (i, j) the maximum probability value and the second largest probability value (ie, the second largest probability value) are selected, and the ratio of the selected maximum classification probability value to the second largest classification probability value is calculated, and the ratio may be used as the ratio in the present application.
- the present application may also determine the current confidence of at least one pixel of the remote sensing image to be identified according to the current classification probability information of the at least one pixel of the remote sensing image to be identified.
- the present application does not limit the current determination of at least one pixel of the remote sensing image to be identified. The way to confidence.
- S300 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a first unit that is executed by the processor.
- the predetermined confidence level in the present application may be set according to actual needs, for example, the predetermined confidence may be set to 4.
- the present application can respectively determine whether the current confidence of 1122 ⁇ 1122 pixels is reached. The judgment of the confidence is predetermined, so that the number of pixels reaching the predetermined confidence can be counted.
- S301 may be executed by a processor invoking a corresponding instruction stored in a memory or by a second unit being executed by the processor.
- a remote sensing image block that does not meet the predetermined requirement for the number of pixels with a predetermined degree of confidence is used as the selected remote sensing image block.
- the remote sensing image block may be input to the neural network as the selected remote sensing image block, thereby obtaining classification probability information of at least one pixel in the remote sensing image block via the neural network; if the ratio reaches a predetermined ratio , the remote sensing image block does not need to be input into the neural network.
- the predetermined ratio in the present application may be set according to actual needs, for example, the predetermined ratio may be set to 0.97 or the like.
- the present application may also use other methods to determine a remote sensing image block that does not meet the predetermined requirement, for example, determining the number of pixels that reach a predetermined confidence level in the remote sensing image region to be identified corresponding to the remote sensing image block and the region. If the difference between the number of pixels that have not reached the predetermined confidence reaches a predetermined difference, the remote sensing image block may be input to the neural network as the selected remote sensing image block. Otherwise, the remote sensing image block is not required.
- the remote sensing image block Inputting into the neural network; for example, determining that the ratio of the number of pixels reaching the predetermined confidence level in the remote sensing image region to be identified corresponding to the remote sensing image block to the number of pixels in the region that does not reach the predetermined confidence level does not reach the predetermined value In the case of the ratio, the remote sensing image block can be input into the neural network as the selected remote sensing image block. Otherwise, the remote sensing image block does not need to be input into the neural network.
- the present application does not limit the implementation of remote sensing image blocks that do not meet the predetermined requirements.
- S302 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a third unit that is executed by the processor.
- FIG. 1 An example of maintaining the current classification probability information of at least one pixel of the remote sensing image to be recognized in the present application is shown in FIG.
- the remote sensing image recognition method of the present application begins.
- the present application may initialize current classification probability information of at least one pixel of the remote sensing image to be recognized when the remote sensing image recognition method starts, such as setting current classification probability information of at least one pixel of the remote sensing image to be identified to 0.
- current classification probability information of at least one pixel of the remote sensing image to be recognized it is completely feasible not to perform an initialization operation on the current classification probability information of at least one pixel of the remote sensing image to be recognized.
- the execution of the operation S440 is triggered.
- any of the remote sensing images to be identified is obtained.
- Classification probability information of one pixel (i, j) It can be expressed as:
- the upper corner mark 1 indicates the first execution of the resolution reduction process
- C indicates the number of categories in which the neural network performs classification. It is indicated that, in the case of performing the resolution reduction processing for the first time, any pixel (i, j) in the remote sensing image to be identified belongs to the classification probability information of the first category, It is indicated that, in the case where the resolution reduction processing is performed for the first time, any pixel (i, j) in the remote sensing image to be identified belongs to the classification probability information of the second category, It is indicated that in the case where the resolution reduction processing is performed for the first time, any pixel (i, j) in the remote sensing image to be identified belongs to the classification probability information of the category C.
- the classification probability information of at least one pixel of the remote sensing image to be identified obtained this time is used as the current classification probability information of the corresponding pixel of the remote sensing image to be identified, for example, the current of any pixel (i, j) of the remote sensing image to be identified.
- S420 and S430 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a fourth unit executed by the processor.
- S450 Mapping classification probability information of at least one pixel in each remote sensing image block output by the neural network to the remote sensing image to be identified, thereby obtaining classification probability information of at least one pixel of the remote sensing image to be identified. Go to operation S460.
- the classification probability information of all the pixels in each remote sensing image block output by the neural network is obtained, and the to-be-identified is obtained.
- Classification probability information of any pixel (i, j) in remote sensing image It can be expressed as:
- n indicates the nth execution of the resolution reduction process
- C indicates the number of categories in which the neural network performs classification. Representing that in the case of performing the down resolution processing for the nth time, any pixel (i, j) in the remote sensing image to be identified belongs to the classification probability information of the first category, Representing that in the case of performing the resolution reduction processing for the nth time, any pixel (i, j) in the remote sensing image to be identified belongs to the classification probability information of the second category, It is indicated that in the case where the resolution reduction processing is performed for the nth time, any pixel (i, j) in the remote sensing image to be identified belongs to the classification probability information of the Cth category.
- the present application can Set to 0.
- S450 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a fifth unit that is executed by the processor.
- the present application is to classify all the pixels in each of the to-be-processed remote sensing image blocks respectively obtained by accumulating the previous times. Probability information to obtain current classification probability information of all pixels of the remote sensing image to be identified.
- the classification probability information of any pixel (i, j) in the remote sensing image to be recognized obtained by the present application is set in the case where the resolution processing is performed for the first time.
- the classification probability information of any pixel (i, j) in the remote sensing image to be recognized obtained by the present application based on the neural network may be expressed as Wherein, if the pixel (i, j) in the remote sensing image to be identified belongs to the pixel of the region in the remote sensing image to be recognized by the remote sensing image block selected by the second resolution reduction process, The classification probability information of the corresponding pixel outputted by the neural network, and if the pixel (i, j) in the remote sensing image to be identified does not belong to the second time resolution processing process, the remote sensing image block is mapped to the remote sensing image to be recognized.
- the classification probability information of any pixel (i, j) in the remote sensing image to be recognized obtained based on the neural network may be expressed as Wherein, if the pixel (i, j) in the remote sensing image to be identified belongs to the pixel of the region in the remote sensing image to be recognized by the remote sensing image block selected by the third resolution reduction processing, The classification probability information of the corresponding pixel outputted by the neural network, and if the pixel (i, j) in the remote sensing image to be identified does not belong to the third time resolution processing process, the remote sensing image block is mapped to the remote sensing image to be recognized.
- the process of updating the current classification probability information of all pixels of the remote sensing image to be recognized is actually performing the previous classification probability information. Fusion, for example, The fusion is performed to form current classification probability information of all pixels of the updated remote sensing image to be identified.
- the current classification probability information of all pixels of the remote sensing image to be identified is the result of fusion of the previous classification probability information, for example, The result of the fusion.
- S460 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a sixth unit executed by the processor.
- S230 may be executed by a processor invoking a corresponding instruction stored in a memory, or may be performed by a determination recognition result module 540 that is executed by the processor.
- the remote sensing image samples used are generally remote sensing image samples including multiple resolutions, and the number of different resolutions is usually smaller than the reduced value in the reduced ratio set.
- the number of correlations is, for example, in the case where the reduced scale set includes three kinds of reduced values, the sample data set should include at least the resolution of the remote sensing image samples corresponding to the three reduced values, and each of the remote sensing image samples has the labeled information (such as mask marking information, etc.).
- the number of remotely sensed image samples of different resolutions is approximately the same in the sample data set.
- the present application trains the neural network by using remote sensing image samples with different resolutions in the sample data set, which is beneficial to improving the accuracy of the neural network outputting the classification probability information for the input remote sensing image block to be processed, thereby facilitating the improvement of the neural network. Accuracy of remote sensing image recognition.
- Remote sensing images are usually different from photos or pictures or video frames in the traditional sense.
- the size of remote sensing images is usually large. For example, the length and width of remote sensing images can reach several thousand or even tens of thousands of pixels.
- the remote sensing image is identified and processed.
- the reliability of the recognition result may be adversely affected due to the limited receptive field of the neuron.
- the computational cost and time cost of the neural network may be higher.
- the present invention performs the resolution reduction processing on the remote sensing image to be identified, and the neural network processes the remote sensing image after the resolution reduction processing, which is beneficial to reducing the calculation cost and the time cost of the neural network.
- the present invention selects a remote sensing image block from the remote sensing image after the resolution reduction processing and inputs the neural network, which is beneficial to reducing the calculation amount of the neural network for identifying the corresponding local region in the remote sensing image after the resolution reduction processing. Since the present application can select a remote sensing image block according to the current confidence of the pixels in the remote sensing image to be identified during the resolution reduction process, the neural network is used for some easily recognizable image regions in the remote sensing image to be identified. The accuracy of the recognition is high. Therefore, after each resolution reduction, the neural network is repeatedly classified for such image regions, and the contribution to the accuracy of the final classification result of the image region is not prominent.
- the neural network is repeatedly classified for such image regions, and the contribution to the accuracy of the final classification result of the image region is prominent. Furthermore, in the process of reducing the resolution multiple times, the resolution of the remote sensing image after the previous resolution reduction is smaller than the resolution of the remote sensing image after the lower resolution processing, so that if the previous resolution is lowered After the accuracy of the recognition result of the image block processed by the rate has been satisfied, the application may not identify the image block in the corresponding region after the subsequent resolution reduction, thereby reducing the nerve to a large extent. The amount of calculation of the network and the processing time. It can be seen that the technical solution provided by the present application is advantageous for reducing the time and calculation cost of the neural network and improving the recognition efficiency of the remote sensing image of the neural network while ensuring the accuracy of the remote sensing image recognition.
- Any remote sensing image recognition provided by the embodiments of the present application may be performed by any suitable device having data processing capabilities, including but not limited to: a terminal device, a server, and the like.
- any remote sensing image recognition method provided by the embodiment of the present application may be executed by a processor.
- the processor performs any remote sensing image recognition method mentioned in the embodiment of the present application by calling a corresponding instruction stored in the memory. This will not be repeated below.
- the foregoing programs may be stored in a computer readable storage medium, and the program is executed when executed.
- the operation of the foregoing method embodiment is included; and the foregoing storage medium includes at least one medium that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.
- FIG. 5 is a schematic structural diagram of an embodiment of a remote sensing image recognition apparatus according to the present application.
- the apparatus of this embodiment mainly includes: a resolution reduction module 500, a remote sensing image segmentation module 510, a selected image block module 520, a classification processing module 530, and a determination recognition result module 540.
- the device may further include: a training module 550.
- the resolution resolution module 500 is mainly used for performing resolution reduction processing on the remote sensing image to be obtained, and obtaining a remote sensing image after the resolution reduction processing.
- the down-resolution module 500 can be configured to perform a resolution reduction process on the remote sensing image to be recognized according to the reduced value that has not been used in the reduced scale set.
- the reduced set of scales includes: at least two different reduced values, and the smaller reduced values will take precedence over the larger reduced values and are used by the reduced resolution module 500.
- For the operations performed by the resolution reduction module 500 reference may be made to the descriptions of S100 in FIG. 1 and S200 in FIG. 2, and the description thereof will not be repeated here.
- the remote sensing image segmentation module 510 is mainly used to segment at least one remote sensing image block from the remote sensing image after the resolution reduction module 500 performs the resolution reduction processing.
- the remote sensing image segmentation module 510 reference may be made to the descriptions of S110 in FIG. 1 and S210 in FIG. 2, and the description thereof will not be repeated here.
- the selected image block module 520 is mainly used to determine the remote sensing image block to be processed from the remote sensing image block segmented by the remote sensing image segmentation module 510.
- the selected image block module 520 can use all the remote sensing image blocks that are segmented by the remote sensing image segmentation module 510 as the remote sensing image block to be processed.
- the number of the remote sensing image blocks to be processed selected by the selected image block module 520 is less than or equal to the remote sensing image block currently segmented by the remote sensing image segmentation module. Quantity.
- the selected image block module 520 may select the remote sensing image block that is segmented from the remote sensing image segmentation module 510 according to the current confidence of the pixel of the remote sensing image to be identified. Remote sensing image block to be processed.
- the selected image block module 520 can include a first unit, a second unit, and a third unit.
- the first unit is mainly configured to determine a current confidence level of the at least one pixel according to the current classification probability information of the at least one of the plurality of pixels included in the remote sensing image to be identified;
- the second unit is mainly used to calculate the corresponding to the remote sensing image block.
- the third unit is mainly configured to determine, as the to-be-processed remote sensing image block, the remote sensing image block that does not meet the predetermined requirement for the number of pixels that reach the predetermined confidence level.
- the third unit will The remote sensing image block is determined as a remote sensing image block to be processed.
- the operations performed by the first unit reference may be made to the description in S300 in FIG. 3, and the operations performed by the second unit may be referred to the description in S310 above for FIG. 3, and the operations performed by the third unit may be referred to the foregoing for FIG. 3.
- the description in S320 is not repeated here.
- the selected image block module 520 may further include: a fourth unit, a fifth unit, and a sixth unit.
- the fourth unit is mainly used for mapping the classification probability information of at least one pixel in each remote sensing image block to be processed outputted by the neural network to the remote sensing image to be identified, and obtaining the to-be-identified image, in the case of performing the resolution-reduction processing for the first time.
- Classification probability information of at least one pixel of the remote sensing image is used as current classification probability information of at least one pixel of the remote sensing image to be identified.
- the fifth unit is mainly used to map the classification probability information of at least one pixel in each remote sensing image block to be processed outputted by the neural network to the remote sensing image to be recognized, without obtaining the first resolution processing.
- the classification probability information of at least one pixel of the remote sensing image is identified.
- the sixth unit is mainly configured to update current classification probability information of the corresponding pixel of the remote sensing image to be recognized according to the classification probability information of the at least one pixel of the remote sensing image to be identified obtained this time.
- the sixth unit may be configured to calculate an average value of the classification probability information of the at least one pixel of the remote sensing image to be identified obtained and the current classification probability information of the corresponding pixel in the remote sensing image to be identified, and update the calculated average value.
- the current classification probability information of the corresponding pixel in the remote sensing image to be identified For the operations performed by the fourth unit, reference may be made to the descriptions in S420 and S430 in FIG. 4, and the operations performed by the fifth unit may be referred to the description in S450 above for FIG. 4, and the operations performed by the sixth unit may be referred to the foregoing. The description in S460 in Fig. 4 will not be repeated here.
- the classification processing module 530 is mainly used to obtain classification probability information of pixels in the remote sensing image block to be processed via the neural network.
- the classification processing module 530 For the operations performed by the classification processing module 530, reference may be made to the related descriptions in S130 in FIG. 1 and S220 in FIG. 2, and the description thereof will not be repeated here.
- the determination recognition result module 540 is mainly used to determine the recognition result of the remote sensing image to be recognized according to the classification probability information of the pixels in the remote sensing image block to be processed. For example, the determination recognition result module 540 may respond to the absence of the unused reduction value in the reduction ratio set or determine that there is no remote sensing image block to be processed in the remote sensing image block segmented by the remote sensing image segmentation module 510, and The classification probability information of the pixels in the remote sensing image block to be processed determines the recognition result of the remote sensing image to be identified.
- the determination recognition result module 540 determines a maximum probability value among the probability values belonging to each category in the current classification probability information of the pixel, and the maximum probability value and the maximum The category corresponding to the probability value is used as the recognition result of the pixel.
- the operation performed by the determination recognition result module 540 can be referred to the above description for S140 in FIG. 1 and S230 in FIG. 2, and the description thereof will not be repeated here.
- the training module 550 is mainly used to train the neural network using remote sensing image samples.
- the resolution of the remote sensing image sample includes: performing resolution reduction processing on the remote sensing image according to each of the reduced values in the reduced scale set, and obtaining the resolution of each remote sensing image.
- the scaling set includes: a first reduced value, a second reduced value, and a third reduced value, the first reduced value corresponds to the first resolution, the second reduced value corresponds to the second resolution, and the third reduced value corresponds to the third resolution
- the training module 550 can use the remote sensing image sample having the first resolution, the remote sensing image sample having the second resolution, and the remote sensing image sample having the third resolution to train the trained neural network, and successfully train the training.
- Neural networks can be used for remote sensing image recognition of the present application.
- FIG. 6 illustrates an exemplary device 600 suitable for implementing the present application, which may be a control system/electronic system configured in a car, a mobile terminal (eg, a smart mobile phone, etc.), a personal computer (PC, eg, a desktop computer Or a notebook computer, etc.), a tablet computer, a server, and the like.
- a mobile terminal eg, a smart mobile phone, etc.
- PC personal computer
- tablet computer eg, a tablet computer, a server, and the like.
- device 600 includes one or more processors, communication units, etc., which may be: one or more central processing units (CPUs) 601, and/or one or more utilized
- the neural network performs an acceleration unit 613 for remote sensing image recognition, etc.
- the acceleration unit 613 may include, but is not limited to, a GPU, an FPGA, other types of dedicated processors, etc.
- the processor may execute executable instructions according to the read-only memory (ROM) 602.
- ROM read-only memory
- RAM random access memory
- the communication unit 612 may include, but is not limited to, a network card, which may include, but is not limited to, an IB (Infiniband) network card.
- the processor can communicate with read only memory 602 and/or random access memory 603 to execute executable instructions, connect to communication portion 612 via bus 604, and communicate with other target devices via communication portion 612 to accomplish the corresponding in this application. operating.
- RAM 603 various programs and data required for the operation of the device can be stored.
- the CPU 601 or the acceleration unit 613, the ROM 602, and the RAM 603 are connected to each other through a bus 604.
- ROM 602 is an optional module.
- the RAM 603 stores executable instructions, or writes executable instructions to the ROM 602 at runtime, the executable instructions causing the processor to perform operations corresponding to the methods described above.
- An input/output (I/O) interface 605 is also coupled to bus 604.
- the communication unit 612 may be integrated, or may be configured to have a plurality of sub-modules (for example, a plurality of IB network cards) and be respectively connected to the bus.
- the following components are connected to the I/O interface 605: an input portion 606 including a keyboard, a mouse, etc.; an output portion 607 including, for example, a cathode ray tube (CRT), a liquid crystal display (LCD), and the like, and a storage portion 608 including a hard disk or the like. And a communication portion 609 including a network interface card such as a LAN card, a modem, or the like. The communication section 609 performs communication processing via a network such as the Internet.
- Driver 610 is also coupled to I/O interface 605 as needed.
- a removable medium 611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory or the like is mounted on the drive 610 as needed so that a computer program read therefrom is installed in the storage portion 608 as needed.
- FIG. 6 is only an optional implementation manner.
- the number and types of the components in FIG. 6 may be selected, deleted, added, or replaced according to actual needs.
- separate implementations such as separate settings or integrated settings may also be employed.
- the acceleration unit 613 and the CPU 601 may be separately configured.
- the acceleration unit 613 may be integrated on the CPU 601, and the communication portion 612 may be separated.
- the settings may also be integrated on the CPU 601 or the acceleration unit 613, and the like.
- embodiments of the present application include a computer program product comprising tangibly embodied on a machine readable medium.
- a computer program comprising program code for performing the operations illustrated by the flowcharts, the program code comprising instructions corresponding to performing the operations provided herein.
- the computer program can be downloaded and installed from the network via communication portion 609, and/or installed from removable media 611.
- the computer program is executed by the central processing unit (CPU) 601, the above-described instructions described in the present application are executed.
- embodiments of the present disclosure also provide a computer program product for storing computer readable instructions that, when executed, cause a computer to perform the operations described in any of the above embodiments Remote sensing image recognition method.
- the computer program product can be implemented by means of hardware, software or a combination thereof.
- the computer program product is embodied as a computer storage medium.
- the computer program product is embodied as a software product, such as a software development kit (SDK), etc. Wait.
- SDK software development kit
- embodiments of the present disclosure further provide another remote sensing image recognition method and corresponding apparatus and electronic device, computer storage medium, computer program, and computer program product, wherein the method includes: The first device transmits a remote sensing image recognition indication to the second device, the indication causing the second device to perform the remote sensing image recognition method in any of the possible embodiments described above; the first device receiving the remote sensing image recognition result transmitted by the second device.
- the remote sensing image recognition indication may be specifically a call instruction
- the first device may instruct the second device to perform a remote sensing image recognition operation by calling, and correspondingly, in response to receiving the call instruction, the second device may perform The operation and/or flow in any of the above remote sensing image recognition methods.
- a plurality may mean two or more, and “at least one” may mean one, two or more.
- the methods and apparatus, electronic devices, and computer readable storage media of the present application are possible in many ways.
- the methods and apparatus, electronic devices, and computer readable storage media of the present application can be implemented in software, hardware, firmware, or any combination of software, hardware, or firmware.
- the above-described sequence of operations for the method is for illustrative purposes only, and the operation of the method of the present application is not limited to the order specifically described above unless otherwise specifically stated.
- the present application can also be implemented as a program recorded in a recording medium, the program including machine readable instructions for implementing the method according to the present application.
- the present application also covers a recording medium storing a program for executing the method according to the present application.
- FIG. 7 an application scenario in which an implementation may be implemented according to an embodiment of the present application is schematically illustrated.
- the neural network 700 is a neural network for identifying a remote sensing image to be recognized (such as forming a snow mask or a cloud snow mask, etc.).
- the sample data set used to train the neural network 700 typically includes a plurality of remote sensing image samples, and the resolution of all remote sensing image samples is not identical.
- the successfully trained neural network 700 can be used to identify the remote sensing image to be recognized from the satellite, and the recognition result of the neural network 700 can form a cloud.
- a mask such as a snow mask or a cloud snow mask.
- the methods and apparatus, electronic devices, computer programs, and computer readable storage media of the present application are possible in many ways.
- the methods and apparatus, electronic devices, computer programs, and computer readable storage media of the present application can be implemented by software, hardware, firmware or any combination of software, hardware, firmware.
- the above-described sequence of operations for the method is for illustrative purposes only, and the operation of the method of the present application is not limited to the order specifically described above unless otherwise specifically stated.
- the present application can also be implemented as a program recorded in a recording medium, the program including machine readable instructions for implementing the method according to the present application.
- the present application also covers a recording medium storing a program for executing the method according to the present application.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Astronomy & Astrophysics (AREA)
- Remote Sensing (AREA)
- Probability & Statistics with Applications (AREA)
- Biodiversity & Conservation Biology (AREA)
- Algebra (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (31)
- 一种遥感图像识别方法,其特征在于,所述方法包括:对待识别遥感图像进行降分辨率处理,得到降分辨率处理后的遥感图像;从所述降分辨率处理后的遥感图像中切分出至少一个遥感图像块;从所述至少一个遥感图像块中确定待处理遥感图像块;经由神经网络获得所述待处理遥感图像块中的像素的分类概率信息;根据所述待处理遥感图像块中的像素的分类概率信息确定所述待识别遥感图像的识别结果。
- 根据权利要求1所述的方法,其特征在于,所述对待识别遥感图像进行降分辨率处理包括:根据缩小比例集合中未被采用过的缩小值对待识别遥感图像进行降分辨率处理。
- 根据权利要求2所述的方法,其特征在于,所述根据所述待处理遥感图像块中的像素的分类概率信息确定所述待识别遥感图像的识别结果包括:响应于缩小比例集合中不存在未被采用过的缩小值或者确定所述至少一个遥感图像块中不存在待处理遥感图像块,根据所述待处理遥感图像块中的像素的分类概率信息确定所述待识别遥感图像的识别结果。
- 根据权利要求2至3中任一项所述的方法,其特征在于,所述缩小比例集合包括:至少两个不同的缩小值,且较小的缩小值相对较大的缩小值先被采用。
- 根据权利要求1至4中任一项所述的方法,其特征在于,所述从所述至少一个遥感图像块中确定待处理遥感图像块包括:在第一次执行降分辨率处理的情况下,将所有遥感图像块均作为待处理遥感图像块。
- 根据权利要求1至5中任一项所述的方法,其特征在于,所述从所述至少一个遥感图像块中确定待处理遥感图像块包括:在非第一次执行降分辨率处理的情况下,待处理遥感图像块的数量小于或等于当前切分出的遥感图像块的数量,和/或,所述待识别遥感图像经当前执行降分辨率处理后的图像分辨率大于经上一次执行降分辨率处理后的图像分辨率。
- 根据权利要求6所述的方法,其特征在于,在非第一次执行降分辨率处理的情况下,根据待识别遥感图像的像素的当前置信度从所述至少一个遥感图像块中确定待处理遥感图像块。
- 根据权利要求7所述的方法,其特征在于,所述根据待识别遥感图像的像素的当前置信度从所述至少一个遥感图像块中确定待处理遥感图像块,包括:根据待识别遥感图像包括的多个像素中至少一个像素的当前分类概率信息确定所述至少一个像素的当前置信度;统计遥感图像块所对应的待识别遥感图像区域中所述至少一个像素的当前置信度达到预定置信度的像素数量;将所述达到预定置信度的像素数量不符合预定要求的遥感图像块确定为待处理遥感图像块。
- 根据权利要求8所述的方法,其特征在于,所述将所述达到预定置信度的像素数量不符合预定要求的遥感图像块确定为待处理遥感图像块包括:针对一遥感图像块而言,在达到预定置信度的像素数量与该遥感图像块所对应的待识别遥感图像区域中所述至少一个像素的数量的比值小于预定比值的情况下,将该遥感图像块确定为待处理遥感图像块。
- 根据权利要求8至9中任一项所述的方法,其特征在于,所述待识别遥感图像的至少一个像素的当前分类概率信息的获得方式包括:在第一次执行降分辨率处理的情况下,将神经网络输出的每个待处理遥感图像块中至少一个像素的分类概率信息映射到待识别遥感图像中,获得待识别遥感图像的至少一个像素的分类概率信息;其中,所述待识别遥感图像的至少一个像素的分类概率信息被作为待识别遥感图像的至少一个像素的当前分类概率信息。
- 根据权利要求8至10中任一项所述的方法,其特征在于,所述待识别遥感图像的至少一个像素的当前分类概率信息的获得方式包括:在非第一次执行降分辨率处理的情况下,将神经网络输出的每个待处理遥感图像块中至少一个像素的分类概率信息映射到待识别遥感图像中,获得待识别遥感图像的至少一个像素的分类概率信息;根据本次获得的待识别遥感图像的至少一个像素的分类概率信息更新所述待识别遥感图像的相应像素的当前分类概率信息。
- 根据权利要求11所述的方法,其特征在于,所述根据本次获得的待识别遥感图像的至少一个像素的分类概率信息更新所述待识别遥感图像的相应像素的当前分类概率信息包括:计算本次获得的待识别遥感图像的至少一个像素的分类概率信息与待识别遥感图像中相应像素的当前分类概率信息的平均值;利用计算出的平均值更新待识别遥感图像中相应像素的当前分类概率信息。
- 根据权利要求12所述的方法,其特征在于,所述根据所述待处理遥感图像块中的像素的分类概率信息确定所述待识别遥感图像的识别结果包括:针对待识别遥感图像中的一像素而言,确定该像素的当前分类概率信息中的属于每个类别的概率值中的最大概率值,将该最大概率值以及该最大概率值对应的类别,作为该像素的识别结果。
- 根据权利要求2至13中任一项所述的方法,其特征在于,对所述神经网络进行训练的遥感图像样本的分辨率包括:根据缩小比例集合中的每个缩小值分别对遥感图像进行降分辨率处理,所获得的每个遥感图像的分辨率。
- 一种遥感图像识别装置,其特征在于,所述装置包括:降分辨率模块,用于对待识别遥感图像进行降分辨率处理,得到降分辨率处理后的遥感图像;遥感图像切分模块,用于从所述降分辨率处理后的遥感图像中切分出至少一个遥感图像块;选取图像块模块,用于从所述至少一个遥感图像块中确定待处理遥感图像块;分类处理模块,用于经由神经网络获得所述待处理遥感图像块中的像素的分类概率信息;确定识别结果模块,用于根据所述待处理遥感图像块中的像素的分类概率信息确定所述待识别遥感图像的识别结果。
- 根据权利要求15所述的装置,其特征在于,所述降分辨率模块用于:根据缩小比例集合中未被采用过的缩小值对待识别遥感图像进行降分辨率处理。
- 根据权利要求16所述的装置,其特征在于,所述确定识别结果模块用于:响应于缩小比例集合中不存在未被采用过的缩小值或者确定所述至少一个遥感图像块中不存在待处理遥感图像块,根据所述待处理遥感图像块中的像素的分类概率信息确定所述待识别遥感图像的识别结果。
- 根据权利要求16至17中任一项所述的装置,其特征在于,所述缩小比例集合包括:至少两个不同的缩小值,且较小的缩小值相对较大的缩小值先被采用。
- 根据权利要求15至18中任一项所述的装置,其特征在于,所述选取图像块模块用于:在第一次执行降分辨率处理的情况下,将所有遥感图像块均作为待处理遥感图像块。
- 根据权利要求15至19中任一项所述的装置,其特征在于:在非第一次执行降分辨率处理的情况下,选取图像块模块确定出的待处理遥感图像块的数量小于或等于遥感图像切分模块当前切分出的遥感图像块的数量,且降分辨率模块当前执行的降分辨率大于上一次执行的降分辨率。
- 根据权利要求20所述的装置,其特征在于,在非第一次执行降分辨率处理的情况下,选取图像块模块根据待识别遥感图像的像素的当前置信度从所述至少一个遥感图像块中确定待处理遥感图像块。
- 根据权利要求21所述的装置,其特征在于,所述选取图像块模块包括:第一单元,用于根据待识别遥感图像包括的多个像素中至少一个像素的当前分类概率信息确定所述至少一个像素的当前置信度;第二单元,用于统计遥感图像块所对应的待识别遥感图像区域中所述至少一个像素的当前置信度达到预定置信度的像素数量;第三单元,用于将所述达到预定置信度的像素数量不符合预定要求的遥感图像块确定为待处理遥感图像块。
- 根据权利要求22所述的装置,其特征在于,所述第三单元用于:针对一遥感图像块而言,在达到预定置信度的像素数量与该遥感图像块所对应的待识别遥感图像区域中所述至少一个像素的数量的比值小于预定比值的情况下,将该遥感图像块确定为待处理遥感图像块。
- 根据权利要求22至23中任一项所述的装置,其特征在于,所述选取图像块模块还包括:第四单元,用于在第一次执行降分辨率处理的情况下,将神经网络输出的每个待处理遥感图像块中至少一个像素的分类概率信息映射到待识别遥感图像中,获得待识别遥感图像的至少一个像素的分类概率信息;其中,所述待识别遥感图像的至少一个像素的分类概率信息被作为待识别遥感图像的至少一个像素的当前分类概率信息。
- 根据权利要求22至23中任一项所述的装置,其特征在于,所述选取图像块模块还包括:第五单元,用于在非第一次执行降分辨率处理的情况下,将神经网络输出的每个待处理遥感图像块中至少一个像素的分类概率信息映射到待识别遥感图像中,获得待识别遥感图像的至少一个像素的分类概率信息;第六单元,用于根据本次获得的待识别遥感图像的至少一个像素的分类概率信息更新所述待识别遥感图像的相应像素的当前分类概率信息。
- 根据权利要求25所述的装置,其特征在于:所述第六单元用于:计算本次获得的待识别遥感图像的至少一个像素的分类概率信息与待识别遥感图像中相应像素的当前分类概率信息的平均值;利用计算出的平均值更新待识别遥感图像中相应像素的当前分类概率信息。
- 根据权利要求26所述的装置,其特征在于,所述确定识别结果模块用于:针对待识别遥感图像中的一像素而言,确定该像素的当前分类概率信息中的属于每个类别的概率值中的最大概率值,将该最大概率值以及该最大概率值对应的类别,作为该像素的识别结果。
- 根据权利要求16至27中任一项所述的装置,其特征在于,所述装置还包括:训练模块,用于利用遥感图像样本对待训练的神经网络进行训练;其中,所述遥感图像样本的分辨率包括:根据缩小比例集合中的每个缩小值分别对遥感图像进行降分辨率处理,所获得的每个遥感图像的分辨率。
- 一种电子设备,包括:存储器,用于存储计算机程序;处理器,用于执行所述存储器中存储的计算机程序,且所述计算机程序被执行时,实现权利要求1-15中任一项所述的方法。
- 一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时,实现上述权利要求1-15中任一项所述的方法。
- 一种计算机程序,包括计算机指令,当所述计算机指令在设备的处理器中运行时,实现上述权利要求1-15中任一项所述的方法。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020207021508A KR20200106909A (ko) | 2017-12-26 | 2018-12-26 | 원격 감지 이미지 인식 방법, 장치, 저장 매체 및 전자 기기 |
SG11202006035RA SG11202006035RA (en) | 2017-12-26 | 2018-12-26 | Remote sensing image recognition method and apparatus, storage medium and electronic device |
JP2020535252A JP7080978B2 (ja) | 2017-12-26 | 2018-12-26 | リモートセンシング画像認識方法、装置、記憶媒体及び電子機器 |
US16/909,291 US11074445B2 (en) | 2017-12-26 | 2020-06-23 | Remote sensing image recognition method and apparatus, storage medium and electronic device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711436470.2 | 2017-12-26 | ||
CN201711436470.2A CN108304775B (zh) | 2017-12-26 | 2017-12-26 | 遥感图像识别方法、装置、存储介质以及电子设备 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/909,291 Continuation US11074445B2 (en) | 2017-12-26 | 2020-06-23 | Remote sensing image recognition method and apparatus, storage medium and electronic device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019129032A1 true WO2019129032A1 (zh) | 2019-07-04 |
Family
ID=62867455
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/123807 WO2019129032A1 (zh) | 2017-12-26 | 2018-12-26 | 遥感图像识别方法、装置、存储介质以及电子设备 |
Country Status (6)
Country | Link |
---|---|
US (1) | US11074445B2 (zh) |
JP (1) | JP7080978B2 (zh) |
KR (1) | KR20200106909A (zh) |
CN (1) | CN108304775B (zh) |
SG (1) | SG11202006035RA (zh) |
WO (1) | WO2019129032A1 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110866493A (zh) * | 2019-11-14 | 2020-03-06 | 生态环境部南京环境科学研究所 | 基于遥感影像的规模化生猪养殖场识别方法 |
CN111866427A (zh) * | 2019-09-06 | 2020-10-30 | 阳光学院 | 一种基于psvm和crnn的遥感影像分类显示装置 |
CN112232249A (zh) * | 2020-10-22 | 2021-01-15 | 中国科学院空天信息创新研究院 | 一种基于深度特征的遥感图像变化检测方法及装置 |
CN113553938A (zh) * | 2021-07-19 | 2021-10-26 | 黑芝麻智能科技(上海)有限公司 | 安全带检测方法、装置、计算机设备和存储介质 |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108304775B (zh) | 2017-12-26 | 2021-06-11 | 北京市商汤科技开发有限公司 | 遥感图像识别方法、装置、存储介质以及电子设备 |
CN108711159B (zh) * | 2018-03-30 | 2019-06-18 | 长安大学 | 基于Landsat8含雪影像的云检测方法 |
CN110189328B (zh) * | 2019-06-11 | 2021-02-23 | 北华航天工业学院 | 一种卫星遥感图像处理系统及其处理方法 |
CN112560577A (zh) * | 2020-11-13 | 2021-03-26 | 空间信息产业发展股份有限公司 | 一种基于语义分割的遥感图像地物分类方法 |
CN112579816B (zh) * | 2020-12-29 | 2022-01-07 | 二十一世纪空间技术应用股份有限公司 | 遥感图像的检索方法、装置、电子设备及存储介质 |
CN112857268B (zh) | 2021-01-21 | 2023-10-31 | 北京百度网讯科技有限公司 | 对象面积测量方法、装置、电子设备和存储介质 |
CN112818840A (zh) * | 2021-01-29 | 2021-05-18 | 中国科学院微小卫星创新研究院 | 无人机在线检测系统和方法 |
CN113570536B (zh) * | 2021-07-31 | 2022-02-01 | 中国人民解放军61646部队 | 基于cpu和gpu协同处理的全色和多光谱影像实时融合方法 |
CN114202691B (zh) * | 2021-12-09 | 2022-10-04 | 广东省科学院广州地理研究所 | 基于遥感图像的水稻识别方法、装置以及设备 |
CN114676907B (zh) * | 2022-01-17 | 2022-09-20 | 中国地质大学(北京) | 一种区域地质灾害预警方法、装置、存储介质及设备 |
CN114663412B (zh) * | 2022-04-01 | 2023-02-10 | 中国科学院地理科学与资源研究所 | 基于陆表水域生态红线的长连续遥感图像处理方法及装置 |
CN115019198B (zh) * | 2022-08-08 | 2022-10-18 | 中国科学院空天信息创新研究院 | 高频次遥感观测影像迭代更新制图方法 |
CN115080925B (zh) * | 2022-08-19 | 2022-11-11 | 北京数慧时空信息技术有限公司 | 基于区块链的遥感成果知识产权管理方法及系统 |
CN115346124B (zh) * | 2022-09-02 | 2023-05-30 | 浙大城市学院 | 一种水稻制图的方法、装置、设备及可读存储介质 |
CN115641445B (zh) * | 2022-12-23 | 2023-03-14 | 西南石油大学 | 一种非对称内卷积与Transformer相融合的遥感图像阴影检测方法 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101826157A (zh) * | 2010-04-28 | 2010-09-08 | 华中科技大学 | 一种地面静止目标实时识别跟踪方法 |
CN102186069A (zh) * | 2011-01-14 | 2011-09-14 | 王慧 | 一种保持量测性能的遥感影像数据压缩方法 |
CN103093182A (zh) * | 2011-11-03 | 2013-05-08 | 北京师范大学 | 一种基于多敏感性策略的遥感影像层次分类识别方法 |
CN106548192A (zh) * | 2016-09-23 | 2017-03-29 | 北京市商汤科技开发有限公司 | 基于神经网络的图像处理方法、装置和电子设备 |
CN108304775A (zh) * | 2017-12-26 | 2018-07-20 | 北京市商汤科技开发有限公司 | 遥感图像识别方法、装置、存储介质以及电子设备 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19955919C1 (de) * | 1999-11-20 | 2001-05-31 | Daimler Chrysler Ag | Verfahren zur Erkennung von Objekten in Bildern auf der Bildpixelebene |
JP2006011967A (ja) * | 2004-06-28 | 2006-01-12 | Fuji Xerox Co Ltd | 文字認識装置、文字認識プログラム |
US7359555B2 (en) * | 2004-10-08 | 2008-04-15 | Mitsubishi Electric Research Laboratories, Inc. | Detecting roads in aerial images using feature-based classifiers |
US8339470B2 (en) * | 2009-11-30 | 2012-12-25 | Indian Institute Of Technology Madras | Method and system for generating a high resolution image |
CN103150561A (zh) * | 2013-03-19 | 2013-06-12 | 华为技术有限公司 | 人脸识别方法和设备 |
CN103310195B (zh) * | 2013-06-09 | 2016-12-28 | 西北工业大学 | 车辆高分遥感图像的基于llc特征的弱监督识别方法 |
CN104463873B (zh) * | 2014-12-10 | 2017-11-07 | 西安电子科技大学 | 一种基于局部均匀缩放的图像目标重定位方法 |
CN105046231A (zh) * | 2015-07-27 | 2015-11-11 | 小米科技有限责任公司 | 人脸检测方法和装置 |
CN106778821B (zh) * | 2016-11-25 | 2019-05-03 | 西安电子科技大学 | 基于slic和改进的cnn的极化sar图像分类方法 |
-
2017
- 2017-12-26 CN CN201711436470.2A patent/CN108304775B/zh active Active
-
2018
- 2018-12-26 JP JP2020535252A patent/JP7080978B2/ja active Active
- 2018-12-26 SG SG11202006035RA patent/SG11202006035RA/en unknown
- 2018-12-26 KR KR1020207021508A patent/KR20200106909A/ko not_active Application Discontinuation
- 2018-12-26 WO PCT/CN2018/123807 patent/WO2019129032A1/zh active Application Filing
-
2020
- 2020-06-23 US US16/909,291 patent/US11074445B2/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101826157A (zh) * | 2010-04-28 | 2010-09-08 | 华中科技大学 | 一种地面静止目标实时识别跟踪方法 |
CN102186069A (zh) * | 2011-01-14 | 2011-09-14 | 王慧 | 一种保持量测性能的遥感影像数据压缩方法 |
CN103093182A (zh) * | 2011-11-03 | 2013-05-08 | 北京师范大学 | 一种基于多敏感性策略的遥感影像层次分类识别方法 |
CN106548192A (zh) * | 2016-09-23 | 2017-03-29 | 北京市商汤科技开发有限公司 | 基于神经网络的图像处理方法、装置和电子设备 |
CN108304775A (zh) * | 2017-12-26 | 2018-07-20 | 北京市商汤科技开发有限公司 | 遥感图像识别方法、装置、存储介质以及电子设备 |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111866427A (zh) * | 2019-09-06 | 2020-10-30 | 阳光学院 | 一种基于psvm和crnn的遥感影像分类显示装置 |
CN110866493A (zh) * | 2019-11-14 | 2020-03-06 | 生态环境部南京环境科学研究所 | 基于遥感影像的规模化生猪养殖场识别方法 |
CN112232249A (zh) * | 2020-10-22 | 2021-01-15 | 中国科学院空天信息创新研究院 | 一种基于深度特征的遥感图像变化检测方法及装置 |
CN112232249B (zh) * | 2020-10-22 | 2023-08-15 | 中国科学院空天信息创新研究院 | 一种基于深度特征的遥感图像变化检测方法及装置 |
CN113553938A (zh) * | 2021-07-19 | 2021-10-26 | 黑芝麻智能科技(上海)有限公司 | 安全带检测方法、装置、计算机设备和存储介质 |
CN113553938B (zh) * | 2021-07-19 | 2024-05-14 | 黑芝麻智能科技(上海)有限公司 | 安全带检测方法、装置、计算机设备和存储介质 |
Also Published As
Publication number | Publication date |
---|---|
KR20200106909A (ko) | 2020-09-15 |
CN108304775B (zh) | 2021-06-11 |
CN108304775A (zh) | 2018-07-20 |
JP7080978B2 (ja) | 2022-06-06 |
US20200320273A1 (en) | 2020-10-08 |
US11074445B2 (en) | 2021-07-27 |
JP2021508123A (ja) | 2021-02-25 |
SG11202006035RA (en) | 2020-07-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019129032A1 (zh) | 遥感图像识别方法、装置、存储介质以及电子设备 | |
US11321593B2 (en) | Method and apparatus for detecting object, method and apparatus for training neural network, and electronic device | |
US11334763B2 (en) | Image processing methods, training methods, apparatuses, devices, media, and programs | |
US11657602B2 (en) | Font identification from imagery | |
US11062453B2 (en) | Method and system for scene parsing and storage medium | |
CN108229322B (zh) | 基于视频的人脸识别方法、装置、电子设备及存储介质 | |
CN108154222B (zh) | 深度神经网络训练方法和系统、电子设备 | |
US20190279014A1 (en) | Method and apparatus for detecting object keypoint, and electronic device | |
CN110431560B (zh) | 目标人物的搜索方法和装置、设备和介质 | |
WO2020020146A1 (zh) | 激光雷达稀疏深度图的处理方法、装置、设备及介质 | |
WO2018010657A1 (zh) | 结构化文本检测方法和系统、计算设备 | |
CN108229591B (zh) | 神经网络自适应训练方法和装置、设备、程序和存储介质 | |
US11538244B2 (en) | Extraction of spatial-temporal feature representation | |
CN109409288B (zh) | 图像处理方法、装置、电子设备和存储介质 | |
US11164004B2 (en) | Keyframe scheduling method and apparatus, electronic device, program and medium | |
CN108154153B (zh) | 场景分析方法和系统、电子设备 | |
CN108229680B (zh) | 神经网络系统、遥感图像识别方法、装置、设备及介质 | |
CN115861400A (zh) | 目标对象检测方法、训练方法、装置以及电子设备 | |
CN113469025B (zh) | 应用于车路协同的目标检测方法、装置、路侧设备和车辆 | |
US11881044B2 (en) | Method and apparatus for processing image, device and storage medium | |
CN115273148B (zh) | 行人重识别模型训练方法、装置、电子设备及存储介质 | |
CN113361536B (zh) | 图像语义分割模型训练、图像语义分割方法及相关装置 | |
CN111311604A (zh) | 用于分割图像的方法和装置 | |
CN115497112B (zh) | 表单识别方法、装置、设备以及存储介质 | |
US11837000B1 (en) | OCR using 3-dimensional interpolation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18897848 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2020535252 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 20207021508 Country of ref document: KR Kind code of ref document: A |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18897848 Country of ref document: EP Kind code of ref document: A1 |