CN115019107A - Method, system and medium for sonar simulation image generation based on style transfer - Google Patents
Method, system and medium for sonar simulation image generation based on style transfer Download PDFInfo
- Publication number
- CN115019107A CN115019107A CN202210752897.8A CN202210752897A CN115019107A CN 115019107 A CN115019107 A CN 115019107A CN 202210752897 A CN202210752897 A CN 202210752897A CN 115019107 A CN115019107 A CN 115019107A
- Authority
- CN
- China
- Prior art keywords
- image
- sonar
- images
- style
- satellite
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012546 transfer Methods 0.000 title claims abstract description 61
- 238000000034 method Methods 0.000 title claims abstract description 48
- 238000004088 simulation Methods 0.000 title claims abstract description 40
- 238000012549 training Methods 0.000 claims abstract description 17
- 238000003709 image segmentation Methods 0.000 claims abstract description 12
- 238000012545 processing Methods 0.000 claims abstract description 7
- 238000013527 convolutional neural network Methods 0.000 claims description 40
- 230000006870 function Effects 0.000 claims description 23
- 239000011159 matrix material Substances 0.000 claims description 10
- 230000003044 adaptive effect Effects 0.000 claims description 8
- 230000011218 segmentation Effects 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 5
- 230000004913 activation Effects 0.000 claims description 3
- 238000004364 calculation method Methods 0.000 claims description 3
- 238000001514 detection method Methods 0.000 claims description 3
- 239000011796 hollow space material Substances 0.000 claims 1
- 238000010586 diagram Methods 0.000 description 7
- 238000010200 validation analysis Methods 0.000 description 6
- 230000009286 beneficial effect Effects 0.000 description 5
- 238000004590 computer program Methods 0.000 description 4
- 230000009466 transformation Effects 0.000 description 3
- 240000004282 Grewia occidentalis Species 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 208000003464 asthenopia Diseases 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 206010013647 Drowning Diseases 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000007635 classification algorithm Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/26—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
- G06V10/267—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
- G06V20/13—Satellite images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- General Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Astronomy & Astrophysics (AREA)
- Remote Sensing (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
Abstract
本发明公开了一种基于风格迁移的声呐仿真图像生成方法、系统及介质,其中方法包括:根据真实声呐图像中所包含的目标物体类别,以卫星遥感图像为输入,对卫星遥感图像进行识别,识别出与目标物体类别相同的识别物,对识别物所在范围进行标记;对经过识别并标记的卫星遥感图像进行图像分割,以目标物体类别为分类,构建卫星子图像数据集;构建风格迁移网络,以真实的声呐图像作为风格图像,对分割后的卫星子图像进行风格迁移,生成声呐仿真图像,以作为训练样本图像。本发明基于遥感卫星图像,使用风格迁移网络,获得模拟的声呐图像,能够有效扩充声呐数据集的数量,为基于声呐图像的后续相关工作提供基础。本发明可广泛应用于图像处理技术领域。
The invention discloses a sonar simulation image generation method, system and medium based on style transfer, wherein the method includes: according to the target object category included in the real sonar image, taking the satellite remote sensing image as input, and recognizing the satellite remote sensing image, Identify the same type of object as the target object, and mark the range of the object; perform image segmentation on the identified and marked satellite remote sensing images, and use the target object category as the classification to construct a satellite sub-image dataset; construct a style transfer network , taking the real sonar image as the style image, and performing the style transfer on the segmented satellite sub-image to generate the sonar simulation image as the training sample image. The present invention is based on remote sensing satellite images, uses a style transfer network to obtain simulated sonar images, can effectively expand the number of sonar data sets, and provides a basis for subsequent related work based on sonar images. The invention can be widely used in the technical field of image processing.
Description
技术领域technical field
本发明涉及图像处理技术领域,尤其涉及一种基于风格迁移的声呐仿真图像生成方法、系统及介质。The invention relates to the technical field of image processing, in particular to a method, system and medium for generating a sonar simulation image based on style transfer.
背景技术Background technique
声呐作为一种高分辨率、多用途、低成本的水下成像设备,广泛应用于海洋、河流、湖泊等水域。它因为能获得高分辨率,连续的海底图像的特点,广泛应用于海洋测绘、近海勘探和水下搜索救援等领域。在水下搜索、救援活动中,声呐可以有效探测飞机残骸、沉船等水下目标。在长时间的水下搜索、救援任务中,救援人员需要连续仔细地检查声呐图像,以确认是否有目标物体。工作一段时间后,工作人员会产生视觉疲劳,容易错过救援目标。为了有效减少工作人员的工作量,减少视觉疲劳引起的错误判断,提高工作效率,对声呐图像进行自动分类具有现实意义。但是,现有技术存在以下缺陷:As a high-resolution, multi-purpose, low-cost underwater imaging device, sonar is widely used in oceans, rivers, lakes and other waters. Because it can obtain high-resolution, continuous seabed images, it is widely used in marine mapping, offshore exploration and underwater search and rescue and other fields. In underwater search and rescue activities, sonar can effectively detect underwater targets such as aircraft wreckage and sunken ships. During long underwater search and rescue missions, rescuers need to continuously and carefully check sonar images to confirm whether there is a target object. After working for a period of time, the staff will experience visual fatigue and easily miss the rescue target. In order to effectively reduce the workload of staff, reduce misjudgment caused by visual fatigue, and improve work efficiency, it is of practical significance to automatically classify sonar images. However, the prior art has the following defects:
近年来准确度较高的水底图像自动分类算法均使用深度神经网络算法,这类算法需要大量的声呐图像数据进行训练。但是由于声呐图像往往与紧急救援或者国防相关,所以获取数据集难度大,现在仍没有公开的、统一的大批量声呐图像。这使得卷积神经网络在声呐领域始终无法获得较大范围的应用。In recent years, the high-accuracy underwater image automatic classification algorithms all use deep neural network algorithms, which require a large amount of sonar image data for training. However, because sonar images are often related to emergency rescue or national defense, it is difficult to obtain data sets, and there is still no public, unified large-scale sonar images. This makes the convolutional neural network unable to obtain a wide range of applications in the field of sonar.
发明内容SUMMARY OF THE INVENTION
为至少一定程度上解决现有技术中存在的技术问题之一,本发明的目的在于提供一种基于风格迁移的声呐仿真图像生成方法、系统及介质。In order to solve one of the technical problems existing in the prior art at least to a certain extent, the purpose of the present invention is to provide a method, system and medium for sonar simulation image generation based on style transfer.
本发明所采用的技术方案是:The technical scheme adopted in the present invention is:
一种基于风格迁移的声呐仿真图像生成方法,包括以下步骤:A method for generating sonar simulation images based on style transfer, comprising the following steps:
根据真实声呐图像中所包含的目标物体类别,以卫星遥感图像为输入,对所述卫星遥感图像进行识别,识别出与所述目标物体类别相同的识别物,对识别物所在范围进行标记;According to the target object category contained in the real sonar image, the satellite remote sensing image is used as the input, the satellite remote sensing image is identified, the identification object of the same category as the target object is identified, and the range of the identification object is marked;
对经过识别并标记的卫星遥感图像进行图像分割,以目标物体类别为分类,构建卫星子图像数据集;Perform image segmentation on the identified and marked satellite remote sensing images, and construct satellite sub-image datasets based on the category of target objects;
构建风格迁移网络,以真实的声呐图像作为风格图像,对分割后的卫星子图像进行风格迁移,生成声呐仿真图像,以作为训练样本图像。A style transfer network is constructed, using real sonar images as style images, and performing style transfer on the segmented satellite sub-images to generate sonar simulation images as training sample images.
进一步地,所述对所述卫星遥感图像进行识别,识别出与所述目标物体类别相同的识别物,对识别物所在范围进行标记,包括:Further, identifying the satellite remote sensing image, identifying the identifier of the same type as the target object, and marking the range where the identifier is located, including:
采用经预设数据集训练过的目标识别网络,对卫星遥感图像进行目标检测,将检测的结果分为n+1类,其中n为目标物体类别数量;The target recognition network trained by the preset data set is used to detect the satellite remote sensing images, and the detection results are divided into n+1 categories, where n is the number of target object categories;
对目标物所在区域进行标记,其中,标记区域的分辨率根据不同的输入图像进行调整。Mark the area where the target is located, wherein the resolution of the marked area is adjusted according to different input images.
进一步地,所述对经过识别并标记的卫星遥感图像进行图像分割,以目标物体类别为分类,构建卫星子图像数据集,包括:Further, image segmentation is performed on the identified and marked satellite remote sensing images, and the target object category is used as a classification to construct a satellite sub-image data set, including:
对识别物进行标记后,获得目标区域,读取目标区域的参数x,y,w,h,其中,x,y为目标区域左上角的横纵坐标,w,h为目标区域横纵方向尺度;After marking the identifier, obtain the target area, read the parameters x, y, w, h of the target area, where x, y are the horizontal and vertical coordinates of the upper left corner of the target area, and w, h are the horizontal and vertical dimensions of the target area ;
根据读取到的参数,对目标区域进行自适应分辨率分割,分割的卫星子图像的区域为:According to the read parameters, adaptive resolution segmentation is performed on the target area, and the segmented satellite sub-image area is:
其中,为由原始图片的横纵方向尺度所决定的自适应分辨率缩放因子;为原始标记的分割范围,为自适应分辨率放缩以后的分割范围。in, is an adaptive resolution scaling factor determined by the horizontal and vertical dimensions of the original image; is the segmentation range of the original mark, Segmentation range after scaling for adaptive resolution.
进一步地,所述自适应分辨率缩放因子λ通过以下方式获得:Further, the adaptive resolution scaling factor λ is obtained in the following manner:
其中,f(w,h)为w,h中较小边。Among them, f(w,h) is the smaller side of w,h.
进一步地,所述风格迁移网络使用空洞空间金字塔结构提取风格特征图。Further, the style transfer network uses an empty space pyramid structure to extract the style feature map.
进一步地,所述构建风格迁移网络,以真实的声呐图像作为风格图像,对分割后的卫星子图像进行风格迁移,生成声呐仿真图像,以作为训练样本图像,包括:Further, the style transfer network is constructed, and the real sonar image is used as the style image to perform style transfer on the segmented satellite sub-images to generate sonar simulation images as training sample images, including:
构建卷积神经网络,利用所述卷积神经网络网络提取真实声呐图像和卫星子图像的特征,以两张图片特征间的欧式距离和格拉姆矩阵的欧氏距离为损失函数进行优化,获得声呐仿真图像。Construct a convolutional neural network, use the convolutional neural network to extract the features of real sonar images and satellite sub-images, and optimize the loss function with the Euclidean distance between the features of the two images and the Euclidean distance of the Gram matrix to obtain sonar. Simulation image.
进一步地,所述构建卷积神经网络,利用所述卷积神经网络网络提取真实声呐图像和卫星子图像的特征,以两张图片特征间的欧式距离和格拉姆矩阵的欧氏距离为损失函数进行优化,获得声呐仿真图像,包括:Further, the described construction convolutional neural network, utilizes the described convolutional neural network to extract the features of real sonar images and satellite sub-images, and takes the Euclidean distance between the two picture features and the Euclidean distance of the Gram matrix as the loss function Optimized for sonar simulation images, including:
以VGG16网络的前17层为基础,在VGG16前17层中添加shortcut连接,构建卷积神经网络;Based on the first 17 layers of the VGG16 network, shortcut connections are added to the first 17 layers of the VGG16 to build a convolutional neural network;
添加shortcut后的卷积神经网络计算表示为:The calculation of the convolutional neural network after adding the shortcut is expressed as:
xl+1=xl+F(xl)x l+1 = x l +F(x l )
式中,xl和xl+1分别表示网络第l层和第l+1层的特征;F(xl)表示网络的卷积和激活操作;添加shortcut后的网络反向传播过程为:In the formula, x l and x l+1 represent the characteristics of the lth layer and the l+1th layer of the network respectively; F(x l ) represents the convolution and activation operations of the network; the network backpropagation process after adding the shortcut is:
使用训练后的VGG16网络前17层作为所述卷积神经网络的主干,以真实的声呐图像作为风格图像,卫星子图像作为内容图像输入带有shortcut的VGG16网络,分别提取风格特征S和内容特征C;The first 17 layers of the trained VGG16 network are used as the backbone of the convolutional neural network, the real sonar image is used as the style image, and the satellite sub-image is used as the content image to input the VGG16 network with shortcut, and the style feature S and content feature are extracted respectively. C;
以风格特征S和内容特征C之间欧氏距离为内容损失,内容损失表示为:Taking the Euclidean distance between the style feature S and the content feature C as the content loss, the content loss is expressed as:
式中i表示特征图中第i个元素;where i represents the i-th element in the feature map;
风格损失函数为:The style loss function is:
式中是风格特征S(也称风格图片S)在CNN第l层第(i,j,k)位置的输出,其中(i,j,k)对应高,宽,通道三个维度;in the formula is the output of the style feature S (also known as the style image S) at the (i, j, k) position of the lth layer of the CNN, where (i, j, k) corresponds to the three dimensions of height, width and channel;
是风格图片在CNN第l层第(i,j,k′)位置的输出,其中(i,j,k′)对应高,宽,通道三个维度; is the output of the style image at the (i, j, k') position of the lth layer of the CNN, where (i, j, k') corresponds to the three dimensions of height, width and channel;
为是网络生成图像G在CNN第l层第(i,j,k)位置的输出,其中(i,j,k)对应高,宽,通道三个维度; It is the output of the image G generated by the network at the (i, j, k) position of the lth layer of the CNN, where (i, j, k) corresponds to the three dimensions of height, width and channel;
为CNN第l层高度维度上的总数; is the total number on the height dimension of the lth layer of CNN;
为CNN第l层宽度维度上的总数; is the total number on the width dimension of the lth layer of CNN;
为CNN第l层通道维度上的总数; is the total number on the channel dimension of the first layer of CNN;
为CNN第l层的风格特征S在通道k′和通道k的格拉姆矩阵; is the Gram matrix of the style feature S of the lth layer of CNN in channel k ′ and channel k;
为CNN第l层的内容特征C在通道k′和通道k的格拉姆矩阵; is the Gram matrix of the content feature C of the lth layer of CNN in channel k ' and channel k;
风格迁移的损失函数为:The loss function for style transfer is:
L=αLc+βLs L=αL c +βL s
式中α、β为权重系数。where α and β are weight coefficients.
本发明所采用的另一技术方案是:Another technical scheme adopted by the present invention is:
一种基于风格迁移的声呐仿真图像生成系统,包括:A sonar simulation image generation system based on style transfer, comprising:
卫星图像处理模块,用于根据真实声呐图像中所包含的目标物体类别,以卫星遥感图像为输入,对所述卫星遥感图像进行识别,识别出与所述目标物体类别相同的识别物,对识别物所在范围进行标记;The satellite image processing module is used to identify the satellite remote sensing image according to the target object category contained in the real sonar image, and use the satellite remote sensing image as input, identify the identification object of the same category as the target object, and identify the target object. mark the area where the object is located;
卫星图像分割模块,用于对经过识别并标记的卫星遥感图像进行图像分割,以目标物体类别为分类,构建卫星子图像数据集;The satellite image segmentation module is used to perform image segmentation on the identified and marked satellite remote sensing images, and use the target object category as a classification to construct a satellite sub-image dataset;
图像风格迁移模块,用于构建风格迁移网络,以真实的声呐图像作为风格图像,对分割后的卫星子图像进行风格迁移,生成声呐仿真图像,以作为训练样本图像。The image style transfer module is used to build a style transfer network, using real sonar images as style images, and performing style transfer on the segmented satellite sub-images to generate sonar simulation images as training sample images.
本发明所采用的另一技术方案是:Another technical scheme adopted by the present invention is:
一种基于风格迁移的声呐仿真图像生成系统,包括:A sonar simulation image generation system based on style transfer, comprising:
至少一个处理器;at least one processor;
至少一个存储器,用于存储至少一个程序;at least one memory for storing at least one program;
当所述至少一个程序被所述至少一个处理器执行,使得所述至少一个处理器实现上所述方法。When the at least one program is executed by the at least one processor, the at least one processor implements the above method.
本发明所采用的另一技术方案是:Another technical scheme adopted by the present invention is:
一种计算机可读存储介质,其中存储有处理器可执行的程序,所述处理器可执行的程序在由处理器执行时用于执行如上所述方法。A computer-readable storage medium in which a processor-executable program is stored, the processor-executable program, when executed by the processor, is used to perform the method as described above.
本发明的有益效果是:本发明通过遥感卫星图像为输入,以真实声呐图像为目标,使用风格迁移网络,获得模拟的声呐图像,能够有效扩充声呐数据集的数量,为基于声呐图像的后续相关工作提供基础。The beneficial effects of the present invention are: the present invention uses remote sensing satellite images as input, takes real sonar images as targets, and uses a style transfer network to obtain simulated sonar images, which can effectively expand the number of sonar data sets, and is a follow-up correlation based on sonar images. work provides the foundation.
附图说明Description of drawings
为了更清楚地说明本发明实施例或者现有技术中的技术方案,下面对本发明实施例或者现有技术中的相关技术方案附图作以下介绍,应当理解的是,下面介绍中的附图仅仅为了方便清晰表述本发明的技术方案中的部分实施例,对于本领域的技术人员而言,在无需付出创造性劳动的前提下,还可以根据这些附图获取到其他附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following descriptions are given to the accompanying drawings of the embodiments of the present invention or the related technical solutions in the prior art. It should be understood that the drawings in the following introduction are only In order to facilitate and clearly express some embodiments of the technical solutions of the present invention, for those skilled in the art, other drawings can also be obtained from these drawings without creative work.
图1是本发明实施例中一种基于风格迁移的声呐仿真图像生成方法的步骤流程图;Fig. 1 is the step flow chart of a kind of sonar simulation image generation method based on style transfer in the embodiment of the present invention;
图2是本发明实施例中一种基于风格迁移的声呐仿真图像生成方法的流程示意图;2 is a schematic flowchart of a method for generating a sonar simulation image based on style transfer in an embodiment of the present invention;
图3是本发明实施例中卷积神经网络的结构图;3 is a structural diagram of a convolutional neural network in an embodiment of the present invention;
图4是本发明实施例中VGG16网络为基础的风格迁移网络的工作流程图;Fig. 4 is the working flow chart of the style transfer network based on VGG16 network in the embodiment of the present invention;
图5是本发明实施例中所使用的卫星遥感图像的示意图;5 is a schematic diagram of a satellite remote sensing image used in an embodiment of the present invention;
图6是本发明实施例中经过目标识别后的卫星遥感图像的示意图;6 is a schematic diagram of a satellite remote sensing image after target identification in an embodiment of the present invention;
图7是本发明实施例中经过图像裁剪后的卫星遥感子图像的示意图;7 is a schematic diagram of a satellite remote sensing sub-image after image cropping in an embodiment of the present invention;
图8是本发明实施例中经过风格迁移获得的模拟声呐图像的示意图。FIG. 8 is a schematic diagram of a simulated sonar image obtained through style transfer in an embodiment of the present invention.
具体实施方式Detailed ways
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本发明,而不能理解为对本发明的限制。对于以下实施例中的步骤编号,其仅为了便于阐述说明而设置,对步骤之间的顺序不做任何限定,实施例中的各步骤的执行顺序均可根据本领域技术人员的理解来进行适应性调整。The following describes in detail the embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary, only used to explain the present invention, and should not be construed as a limitation of the present invention. The numbers of the steps in the following embodiments are only set for the convenience of description, and the sequence between the steps is not limited in any way, and the execution sequence of each step in the embodiments can be adapted according to the understanding of those skilled in the art Sexual adjustment.
在本发明的描述中,需要理解的是,涉及到方位描述,例如上、下、前、后、左、右等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。In the description of the present invention, it should be understood that the azimuth description, such as the azimuth or position relationship indicated by up, down, front, rear, left, right, etc., is based on the azimuth or position relationship shown in the drawings, only In order to facilitate the description of the present invention and simplify the description, it is not indicated or implied that the indicated device or element must have a particular orientation, be constructed and operated in a particular orientation, and therefore should not be construed as limiting the present invention.
在本发明的描述中,若干的含义是一个或者多个,多个的含义是两个以上,大于、小于、超过等理解为不包括本数,以上、以下、以内等理解为包括本数。如果有描述到第一、第二只是用于区分技术特征为目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量或者隐含指明所指示的技术特征的先后关系。In the description of the present invention, the meaning of several is one or more, the meaning of multiple is two or more, greater than, less than, exceeding, etc. are understood as not including this number, above, below, within, etc. are understood as including this number. If it is described that the first and the second are only for the purpose of distinguishing technical features, it cannot be understood as indicating or implying relative importance, or indicating the number of the indicated technical features or the order of the indicated technical features. relation.
本发明的描述中,除非另有明确的限定,设置、安装、连接等词语应做广义理解,所属技术领域技术人员可以结合技术方案的具体内容合理确定上述词语在本发明中的具体含义。In the description of the present invention, unless otherwise clearly defined, words such as setting, installation, connection should be understood in a broad sense, and those skilled in the art can reasonably determine the specific meanings of the above words in the present invention in combination with the specific content of the technical solution.
如图1所示,本实施例提供一种基于风格迁移的声呐仿真图像生成方法,包括以下步骤:As shown in FIG. 1 , this embodiment provides a method for generating a sonar simulation image based on style transfer, which includes the following steps:
S1、根据真实声呐图像中所包含的目标物体类别,以卫星遥感图像为输入,对卫星遥感图像进行识别,识别出与目标物体类别相同的识别物,对识别物所在范围进行标记。S1. According to the target object category contained in the real sonar image, the satellite remote sensing image is used as the input to identify the satellite remote sensing image, identify the identified object with the same category as the target object, and mark the range where the identified object is located.
根据真实声呐图像中所包含的已存在的目标物体类别和需要添加的目标物类别,以卫星遥感图像为输入,识别与目标物类别相同的卫星遥感图像区域。According to the existing target object category contained in the real sonar image and the target object category to be added, the satellite remote sensing image is used as the input to identify the satellite remote sensing image area with the same category as the target object.
(1)真实声呐图像数据集中总共有两类:船舶残骸和飞机残骸,目标物种类较少,且数量分布不均匀,会使基于该真实声呐图像数据集的后续工作无法展开。为使声呐图像数据集种类多样,数量充足,本发明将船舶残骸、飞机残骸、落水人员以及石块作为目标物类别。(1) There are two types of real sonar image datasets: ship wreckage and aircraft wreckage. There are few types of targets, and the number distribution is uneven, which will make the follow-up work based on the real sonar image dataset impossible. In order to make the sonar image datasets diverse and sufficient, the present invention takes ship wrecks, aircraft wrecks, people in the water, and rocks as target object categories.
(2)以卫星遥感图像作为原始图像为输入,识别图像中上述的目标物,得到识别物,并将识别物所在图像区域进行标记,标记的数据为:识别物所属类别,识别物所在区域左上角的坐标x,y,识别物所在区域在图像高、宽两个方向的尺度h,w。(2) Take the satellite remote sensing image as the original image as input, identify the above-mentioned target object in the image, obtain the identified object, and mark the image area where the identified object is located. The coordinates x, y of the corner, and the scale h, w of the area where the recognized object is located in the height and width directions of the image.
S2、对经过识别并标记的卫星遥感图像进行图像分割,以目标物体类别为分类,构建卫星子图像数据集。S2. Perform image segmentation on the identified and marked satellite remote sensing images, and construct a satellite sub-image dataset by classifying the target object category.
依据上述步骤中的识别物标记数据,将标注区域进行图像裁剪,并判断裁剪后的子图像的长、宽、比例,并以不同的长、宽大小和长宽比对图像进行伸缩变换。According to the identifier marking data in the above steps, the marked area is cropped, and the length, width and ratio of the cropped sub-image are determined, and the image is scaled and transformed with different length, width and aspect ratio.
具体的步骤为:The specific steps are:
(1)依据上述步骤得到的x,y,h,w,对原始图像进行裁剪,裁剪的范围为:(1) According to the x, y, h, w obtained in the above steps, the original image is cropped, and the cropping range is:
{(x,y),(x+w,y),(x,y+h),(x+w,y+h)}{(x, y), (x+w, y), (x, y+h), (x+w, y+h)}
上式中的四个坐标为所裁剪图像的四角坐标,以四角坐标为裁剪范围进行图像裁剪,得到子图像。The four coordinates in the above formula are the four-corner coordinates of the cropped image, and the image is cropped with the four-corner coordinates as the cropping range to obtain a sub-image.
(2)依据上述步骤得到的子图像,以416×416像素作为标准进行图像伸缩变换。图像伸缩变换处理公式如下:(2) According to the sub-image obtained in the above steps, image scaling is performed with 416×416 pixels as a standard. The image scaling transformation processing formula is as follows:
S3、构建风格迁移网络,以真实的声呐图像作为风格图像,对分割后的卫星子图像进行风格迁移,生成声呐仿真图像,以作为训练样本图像。S3, constructing a style transfer network, using the real sonar image as the style image, performing style transfer on the segmented satellite sub-images, and generating a sonar simulation image as a training sample image.
构建卷积神经网络,并利用该网络提取真实声呐图像和子图像的特征,以两张图片特征间的欧式距离和格拉姆矩阵的欧氏距离为损失函数进行优化,得到模拟声呐图像。A convolutional neural network is constructed, and the features of real sonar images and sub-images are extracted by this network, and the Euclidean distance between the features of the two images and the Euclidean distance of the Gram matrix are used to optimize the loss function to obtain a simulated sonar image.
具体的步骤为:The specific steps are:
(1)以VGG16网络的前17层为基础,在VGG16前17层中添加shortcut连接,构建卷积神经网络。如图3所示。(1) Based on the first 17 layers of the VGG16 network, shortcut connections are added to the first 17 layers of the VGG16 to construct a convolutional neural network. As shown in Figure 3.
Shortcut连接是将一个输入添加到函数的输出。添加shortcut后的输出可以明确的表示为F(x)和x的线性叠加。将输出表述为输入和输入的一个非线性变换的线性叠加。在网络结构加深的情况下保证网络的训练效果。添加shortcut后的网络计算可以表示为:Shortcut connections are adding an input to the output of a function. The output after adding the shortcut can be explicitly expressed as a linear superposition of F(x) and x. Express the output as a linear superposition of the input and a nonlinear transformation of the input. In the case of deepening the network structure, the training effect of the network is guaranteed. The network calculation after adding shortcuts can be expressed as:
xl+1=xl+F(xl)x l+1 = x l +F(x l )
上式中xl和xl+1分别表示网络第l层和第l+1层的特征。F(xl)表示网络的卷积和激活操作。添加shortcut后的网络反向传播过程为:In the above formula, x l and x l+1 represent the characteristics of the lth layer and the l+1th layer of the network, respectively. F(x l ) represents the convolution and activation operations of the network. The network backpropagation process after adding the shortcut is:
(2)加载上述的卷积神经网络的参数,并将步骤S2得到的子图像以及真实声呐图像输入网络中进行风格迁移。如图4所示。(2) Load the parameters of the above-mentioned convolutional neural network, and input the sub-image obtained in step S2 and the real sonar image into the network for style transfer. As shown in Figure 4.
使用已经训练好的VGG16网络前17层作为上述卷积神经网络的主干,以真实的声呐图像作为风格图像,步骤S2中的到的子图像作为内容图像输入带有shortcut的VGG16网络分别提取风格特征S和内容特征C。The first 17 layers of the trained VGG16 network are used as the backbone of the above-mentioned convolutional neural network, the real sonar image is used as the style image, and the sub-image obtained in step S2 is used as the content image. Input the VGG16 network with shortcut to extract the style features respectively S and content features C.
以风格特征S和内容特征C之间欧氏距离为内容损失,内容损失可以表示为:Taking the Euclidean distance between the style feature S and the content feature C as the content loss, the content loss can be expressed as:
式中i表示特征图中第i个元素,特征图中每一个元素都是[0,1)之间的数值。where i represents the i-th element in the feature map, and each element in the feature map is a value between [0, 1).
风格损失函数为:The style loss function is:
式中是风格图片在CNN第l层第(i,j,k)位置的输出,其中(i,j,k)对应高,宽,通道in the formula is the output of the style image at the (i, j, k) position of the lth layer of CNN, where (i, j, k) corresponds to height, width, channel
风格迁移的损失函数为:The loss function for style transfer is:
L=αLc+βLs L=αL c +βL s
式中α、β为权重系数where α and β are weight coefficients
(3)设置对比实验,验证模拟声呐图像与真实声呐图像之间的相似度。(3) Set up a comparative experiment to verify the similarity between the simulated sonar image and the real sonar image.
实验组数据设置:将真实声呐图像与模拟声呐图像组合为混合数据集,并将数据集按照7:3的比例划分训练集和验证集。对照组数据设置:仅将真实声呐图像设置为数据集,并按照7:3的比例划分训练集和验证集。经图像分类网络检验后,两组实验数据集的准确率相近。Data setting of the experimental group: The real sonar image and the simulated sonar image are combined into a mixed dataset, and the dataset is divided into training set and validation set according to the ratio of 7:3. Control group data setting: only real sonar images were set as the dataset, and the training set and validation set were divided according to the ratio of 7:3. After being tested by the image classification network, the accuracy rates of the two sets of experimental data sets are similar.
以下结合附图和具体实施例对上述方法进行详细解释说明。The above method will be explained in detail below with reference to the accompanying drawings and specific embodiments.
参见图2,图2示出了本发明一个实施例提供的基于风格迁移的声呐仿真图像生成方法的流程图,该方法包括以下步骤:Referring to FIG. 2, FIG. 2 shows a flowchart of a method for generating a sonar simulation image based on style transfer provided by an embodiment of the present invention, and the method includes the following steps:
步骤S101,根据真实声呐图像中所包含的已存在的目标物体类别和需要添加的目标物类别,以卫星遥感图像为输入,识别与目标物类别相同的卫星遥感图像区域。Step S101 , according to the existing target object category and the target object category to be added contained in the real sonar image, using the satellite remote sensing image as input, identify the satellite remote sensing image area with the same target object category.
(1)本实施例中所涉及的真实声呐图像数据集中总共有两类:船舶残骸和飞机残骸,且为使模拟声呐图像目标物种类在救援打捞、水底探测中具有实际应用价值,添加落水人员以及石块也作为目标物类别。(1) There are two types of real sonar image data sets involved in this embodiment: ship wreckage and aircraft wreckage, and in order to make the simulated sonar image target types have practical application value in rescue and salvage and underwater detection, add drowning personnel And stones are also used as target types.
(2)图5为本实例所使用的卫星遥感图像中的一幅,将卫星遥感图像为输入,按照分类标准对遥感图像中的物体进行识别,并将识别到的区域进行标注。图5中包含三架飞机,故经过识别后应标注出三架飞机、以及对应的所在区域。经过识别后的标注图如图6所示。标注的数据分别为:(2) Figure 5 is one of the satellite remote sensing images used in this example. Taking the satellite remote sensing image as input, the objects in the remote sensing image are identified according to the classification criteria, and the identified areas are marked. Figure 5 contains three planes, so after identification, the three planes and their corresponding areas should be marked. The labeled map after identification is shown in Figure 6. The marked data are:
{airplane,(642,187),120,120}{airplane,(642,187),120,120}
{airplane,(151,360),210,180}{airplane,(151,360),210,180}
{airplane,(329,552),174,147}{airplane,(329,552),174,147}
步骤S102,依据上述步骤中的识别物标记数据,将标注区域进行图像裁剪,并判断裁剪后的子图像的长、宽、比例,并以不同的长、宽大小和长宽比对图像进行伸缩变换。Step S102, according to the identifier mark data in the above steps, the marked area is cropped, and the length, width and ratio of the cropped sub-image are judged, and the image is stretched with different length, width and aspect ratio. transform.
(1)依据上述步骤得到的{classes,(x,y),w,h},三幅图像的裁剪的范围为:(1) According to the {classes,(x,y),w,h} obtained by the above steps, the cropping range of the three images is:
{(642,187),(762,187),(642,370),(762,370)}{(642,187),(762,187),(642,370),(762,370)}
{(151,360),(361,360),(151,542),(361,542)}{(151,360),(361,360),(151,542),(361,542)}
{(329,552),(503,552),(329,699),(503,699)}{(329,552),(503,552),(329,699),(503,699)}
以四角坐标为裁剪范围进行图像裁剪,得到子图像如图7所示。The image is cropped with the coordinates of the four corners as the cropping range, and the sub-image is obtained as shown in Figure 7.
(2)依据上述步骤得到的子图像,以416×416像素作为标准进行图像伸缩变换。图像伸缩变换处理公式如下:(2) According to the sub-image obtained in the above steps, image scaling is performed with 416×416 pixels as a standard. The image scaling transformation processing formula is as follows:
放缩后三幅子图像的大小分别为:416×416,416×357,416×351。The sizes of the three sub-images after scaling are: 416×416, 416×357, and 416×351.
S103:构建卷积神经网络,并利用该网络提取真实声呐图像和子图像的特征,以两张图片特征间的欧式距离和格拉姆矩阵的欧氏距离为损失函数进行优化,得到模拟声呐图像。S103: Construct a convolutional neural network, and use the network to extract the features of real sonar images and sub-images, and optimize the loss function with the Euclidean distance between the features of the two images and the Euclidean distance of the Gram matrix to obtain a simulated sonar image.
(1)以VGG16网络的前17层为基础,在VGG16前17层中添加shortcut连接,构建卷积神经网络。(1) Based on the first 17 layers of the VGG16 network, shortcut connections are added to the first 17 layers of the VGG16 to construct a convolutional neural network.
Shortcut结构:除必要的下采样操作外,对输入网络的数据不做任何处理,直接与卷积后的特征图进行融合,提高神经网络提取特征的能力。Shortcut structure: In addition to the necessary downsampling operation, the data input to the network is not processed, and it is directly fused with the feature map after convolution to improve the ability of the neural network to extract features.
(2)加载上述的卷积神经网络的参数,该参数来自ImageNet数据集训练。并将步骤S102得到的子图像以及真实声呐图像输入网络中进行风格迁移。(2) Load the parameters of the above-mentioned convolutional neural network, which are trained from the ImageNet dataset. The sub-image and the real sonar image obtained in step S102 are input into the network for style transfer.
风格迁移后的模拟声呐图像如图8所示。The simulated sonar image after style transfer is shown in Figure 8.
(3)设置对比实验,验证模拟声呐图像与真实声呐图像之间的相似度。(3) Set up a comparative experiment to verify the similarity between the simulated sonar image and the real sonar image.
实验组数据设置:将真实声呐图像与模拟声呐图像组合并为混合数据集并将数据集按照7:3的比例划分训练集和验证集。训练集和验证集的数量分别为84,36。对照组数据设置:仅将真实声呐图像设置为数据集,并按照7:3的比例划分训练集和验证集。训练集和验证集的数量分别为44,18。经图像分类网络推理后,两组数据的准确率分别为:86.11%,83.33%。两组数据相差较劲,说明对于同一个图像分类网络来说,真实数据集和混合数据集具有较强的相似度。Data setting of the experimental group: The real sonar image and the simulated sonar image are combined into a mixed data set and the data set is divided into training set and validation set according to the ratio of 7:3. The numbers of training and validation sets are 84 and 36, respectively. Control group data setting: only real sonar images were set as the dataset, and the training set and validation set were divided according to the ratio of 7:3. The numbers of training and validation sets are 44 and 18, respectively. After inference by the image classification network, the accuracy rates of the two sets of data are: 86.11% and 83.33%, respectively. The two sets of data are quite different, indicating that for the same image classification network, the real data set and the mixed data set have strong similarity.
本实施例通过遥感卫星飞机图像为输入,以真实飞机残骸声呐图像为目标,使用风格迁移网络,获得模拟的声呐图像,有效扩充飞机声呐图像数据集的数量。This embodiment uses the remote sensing satellite aircraft image as the input, takes the real aircraft wreckage sonar image as the target, uses the style transfer network to obtain the simulated sonar image, and effectively expands the number of aircraft sonar image data sets.
综上所述,本实施例方法相对于现有技术,具有如下优点及有益效果:To sum up, the method of this embodiment has the following advantages and beneficial effects compared to the prior art:
本发明通过遥感卫星图像为输入,以真实声呐图像为目标,使用风格迁移网络,获得模拟的声呐图像。另外,本发明从输入图像到最终获得模拟声呐图像的过程中,不需要手动添加图像特征,能够根据不同分辨率的遥感图像快速获得模拟声呐图像,且获得的模拟声呐图像具有真实声呐图像的风格,能够有效扩充声呐数据集的数量,为基于声呐图像的后续相关工作提供基础。The invention takes the remote sensing satellite image as the input, takes the real sonar image as the target, and uses the style transfer network to obtain the simulated sonar image. In addition, in the process from inputting an image to finally obtaining a simulated sonar image, the present invention does not need to manually add image features, can quickly obtain simulated sonar images according to remote sensing images of different resolutions, and the obtained simulated sonar images have the style of real sonar images , which can effectively expand the number of sonar datasets and provide a basis for subsequent related work based on sonar images.
本实施例还提供一种基于风格迁移的声呐仿真图像生成系统,包括:This embodiment also provides a sonar simulation image generation system based on style transfer, including:
卫星图像处理模块,用于根据真实声呐图像中所包含的目标物体类别,以卫星遥感图像为输入,对所述卫星遥感图像进行识别,识别出与所述目标物体类别相同的识别物,对识别物所在范围进行标记;The satellite image processing module is used to identify the satellite remote sensing image according to the target object category contained in the real sonar image, and use the satellite remote sensing image as input, identify the identification object of the same category as the target object, and identify the target object. mark the area where the object is located;
卫星图像分割模块,用于对经过识别并标记的卫星遥感图像进行图像分割,以目标物体类别为分类,构建卫星子图像数据集;The satellite image segmentation module is used to perform image segmentation on the identified and marked satellite remote sensing images, and use the target object category as a classification to construct a satellite sub-image dataset;
图像风格迁移模块,用于构建风格迁移网络,以真实的声呐图像作为风格图像,对分割后的卫星子图像进行风格迁移,生成声呐仿真图像,以作为训练样本图像。The image style transfer module is used to build a style transfer network, using real sonar images as style images, and performing style transfer on the segmented satellite sub-images to generate sonar simulation images as training sample images.
本实施例的一种基于风格迁移的声呐仿真图像生成系统,可执行本发明方法实施例所提供的一种基于风格迁移的声呐仿真图像生成方法,可执行方法实施例的任意组合实施步骤,具备该方法相应的功能和有益效果。The system for generating a sonar simulation image based on style transfer in this embodiment can execute the method for generating a sonar simulation image based on style transfer provided by the method embodiment of the present invention, and can perform any combination of implementation steps of the method embodiment. The corresponding functions and beneficial effects of the method.
本实施例还提供一种基于风格迁移的声呐仿真图像生成系统,包括:This embodiment also provides a sonar simulation image generation system based on style transfer, including:
至少一个处理器;at least one processor;
至少一个存储器,用于存储至少一个程序;at least one memory for storing at least one program;
当所述至少一个程序被所述至少一个处理器执行,使得所述至少一个处理器实现图1所示方法。When the at least one program is executed by the at least one processor, the at least one processor implements the method shown in FIG. 1 .
本实施例的一种基于风格迁移的声呐仿真图像生成系统,可执行本发明方法实施例所提供的一种基于风格迁移的声呐仿真图像生成方法,可执行方法实施例的任意组合实施步骤,具备该方法相应的功能和有益效果。The system for generating a sonar simulation image based on style transfer in this embodiment can execute the method for generating a sonar simulation image based on style transfer provided by the method embodiment of the present invention, and can perform any combination of implementation steps of the method embodiment. The corresponding functions and beneficial effects of the method.
本申请实施例还公开了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存介质中。计算机设备的处理器可以从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行图1所示的方法。Embodiments of the present application further disclose a computer program product or computer program, where the computer program product or computer program includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. A processor of the computer device can read the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions to cause the computer device to perform the method shown in FIG. 1 .
本实施例还提供了一种存储介质,存储有可执行本发明方法实施例所提供的一种基于风格迁移的声呐仿真图像生成方法的指令或程序,当运行该指令或程序时,可执行方法实施例的任意组合实施步骤,具备该方法相应的功能和有益效果。This embodiment also provides a storage medium, which stores an instruction or program for executing a style transfer-based sonar simulation image generation method provided by the method embodiment of the present invention. When the instruction or program is executed, the method can be executed. Any combination of the implementation steps of the embodiments has the corresponding functions and beneficial effects of the method.
在一些可选择的实施例中,在方框图中提到的功能/操作可以不按照操作示图提到的顺序发生。例如,取决于所涉及的功能/操作,连续示出的两个方框实际上可以被大体上同时地执行或所述方框有时能以相反顺序被执行。此外,在本发明的流程图中所呈现和描述的实施例以示例的方式被提供,目的在于提供对技术更全面的理解。所公开的方法不限于本文所呈现的操作和逻辑流程。可选择的实施例是可预期的,其中各种操作的顺序被改变以及其中被描述为较大操作的一部分的子操作被独立地执行。In some alternative implementations, the functions/operations noted in the block diagrams may occur out of the order noted in the operational diagrams. For example, two blocks shown in succession may, in fact, be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/operations involved. Furthermore, the embodiments presented and described in the flowcharts of the present invention are provided by way of example in order to provide a more comprehensive understanding of the technology. The disclosed methods are not limited to the operations and logic flows presented herein. Alternative embodiments are contemplated in which the order of the various operations are altered and in which sub-operations described as part of larger operations are performed independently.
此外,虽然在功能性模块的背景下描述了本发明,但应当理解的是,除非另有相反说明,所述的功能和/或特征中的一个或多个可以被集成在单个物理装置和/或软件模块中,或者一个或多个功能和/或特征可以在单独的物理装置或软件模块中被实现。还可以理解的是,有关每个模块的实际实现的详细讨论对于理解本发明是不必要的。更确切地说,考虑到在本文中公开的装置中各种功能模块的属性、功能和内部关系的情况下,在工程师的常规技术内将会了解该模块的实际实现。因此,本领域技术人员运用普通技术就能够在无需过度试验的情况下实现在权利要求书中所阐明的本发明。还可以理解的是,所公开的特定概念仅仅是说明性的,并不意在限制本发明的范围,本发明的范围由所附权利要求书及其等同方案的全部范围来决定。Furthermore, while the invention is described in the context of functional modules, it is to be understood that, unless stated to the contrary, one or more of the described functions and/or features may be integrated in a single physical device and/or or software modules, or one or more functions and/or features may be implemented in separate physical devices or software modules. It will also be appreciated that a detailed discussion of the actual implementation of each module is not necessary to understand the present invention. Rather, given the attributes, functions, and internal relationships of the various functional modules in the apparatus disclosed herein, the actual implementation of the modules will be within the routine skill of the engineer. Accordingly, those skilled in the art, using ordinary skill, can implement the invention as set forth in the claims without undue experimentation. It is also to be understood that the specific concepts disclosed are illustrative only and are not intended to limit the scope of the invention, which is to be determined by the appended claims along with their full scope of equivalents.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。The functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .
在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。The logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, may be embodied in any computer-readable medium, For use with, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from and execute instructions from an instruction execution system, apparatus, or apparatus) or equipment. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in connection with an instruction execution system, apparatus, or apparatus.
计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wiring (electronic devices), portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as the paper or other medium may be optically scanned, for example, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.
应当理解,本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.
在本说明书的上述描述中,参考术语“一个实施方式/实施例”、“另一实施方式/实施例”或“某些实施方式/实施例”等的描述意指结合实施方式或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施方式或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施方式或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施方式或示例中以合适的方式结合。In the above description of the present specification, reference to the description of the terms "one embodiment/example", "another embodiment/example" or "certain embodiments/examples" etc. means the description in conjunction with the embodiment or example. Particular features, structures, materials, or characteristics are included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
尽管已经示出和描述了本发明的实施方式,本领域的普通技术人员可以理解:在不脱离本发明的原理和宗旨的情况下可以对这些实施方式进行多种变化、修改、替换和变型,本发明的范围由权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, The scope of the invention is defined by the claims and their equivalents.
以上是对本发明的较佳实施进行了具体说明,但本发明并不限于上述实施例,熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换,这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the present invention is not limited to the above-mentioned embodiments, and those skilled in the art can also make various equivalent deformations or replacements on the premise of not violating the spirit of the present invention. Equivalent modifications or substitutions are included within the scope defined by the claims of the present application.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210752897.8A CN115019107B (en) | 2022-06-29 | 2022-06-29 | Sonar simulation image generation method, system and medium based on style migration |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210752897.8A CN115019107B (en) | 2022-06-29 | 2022-06-29 | Sonar simulation image generation method, system and medium based on style migration |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115019107A true CN115019107A (en) | 2022-09-06 |
CN115019107B CN115019107B (en) | 2024-09-06 |
Family
ID=83079038
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210752897.8A Active CN115019107B (en) | 2022-06-29 | 2022-06-29 | Sonar simulation image generation method, system and medium based on style migration |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115019107B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115880390A (en) * | 2022-12-07 | 2023-03-31 | 哈尔滨工程大学 | A method for generating simulated sonar images |
CN116596753A (en) * | 2023-07-20 | 2023-08-15 | 哈尔滨工程大学三亚南海创新发展基地 | Acoustic image dataset expansion method and system based on style migration network |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110991516A (en) * | 2019-11-28 | 2020-04-10 | 哈尔滨工程大学 | A target classification method for side-scan sonar images based on style transfer |
CN111582403A (en) * | 2020-05-18 | 2020-08-25 | 哈尔滨工程大学 | A zero-sample side-scan sonar image target classification method |
CN113052215A (en) * | 2021-03-15 | 2021-06-29 | 浙江大学 | Sonar image automatic target identification method based on neural network visualization |
US20210256304A1 (en) * | 2018-10-10 | 2021-08-19 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. | Method and apparatus for training machine learning model, apparatus for video style transfer |
-
2022
- 2022-06-29 CN CN202210752897.8A patent/CN115019107B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210256304A1 (en) * | 2018-10-10 | 2021-08-19 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. | Method and apparatus for training machine learning model, apparatus for video style transfer |
CN110991516A (en) * | 2019-11-28 | 2020-04-10 | 哈尔滨工程大学 | A target classification method for side-scan sonar images based on style transfer |
CN111582403A (en) * | 2020-05-18 | 2020-08-25 | 哈尔滨工程大学 | A zero-sample side-scan sonar image target classification method |
CN113052215A (en) * | 2021-03-15 | 2021-06-29 | 浙江大学 | Sonar image automatic target identification method based on neural network visualization |
Non-Patent Citations (1)
Title |
---|
朱兆彤;付学志;胡友峰;: "一种利用迁移学习训练卷积神经网络的声呐图像识别方法", 水下无人系统学报, no. 01, 15 February 2020 (2020-02-15) * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115880390A (en) * | 2022-12-07 | 2023-03-31 | 哈尔滨工程大学 | A method for generating simulated sonar images |
CN116596753A (en) * | 2023-07-20 | 2023-08-15 | 哈尔滨工程大学三亚南海创新发展基地 | Acoustic image dataset expansion method and system based on style migration network |
CN116596753B (en) * | 2023-07-20 | 2024-02-02 | 哈尔滨工程大学三亚南海创新发展基地 | Acoustic image dataset expansion method and system based on style migration network |
Also Published As
Publication number | Publication date |
---|---|
CN115019107B (en) | 2024-09-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110503112B (en) | A Small Target Detection and Recognition Method Based on Enhanced Feature Learning | |
CN111738112B (en) | Remote sensing ship image target detection method based on deep neural network and self-attention mechanism | |
CN108734210B (en) | An object detection method based on cross-modal multi-scale feature fusion | |
CN114821357B (en) | A transformer-based optical remote sensing target detection method | |
CN111985376A (en) | A deep learning-based method for extracting ship contours from remote sensing images | |
CN110796048B (en) | Ship target real-time detection method based on deep neural network | |
CN110991513B (en) | A system and method for image target recognition with human-like continuous learning ability | |
CN115147731A (en) | A SAR Image Target Detection Method Based on Full Spatial Coding Attention Module | |
CN110956126A (en) | A Small Object Detection Method Based on Joint Super-Resolution Reconstruction | |
CN111753677A (en) | Multi-angle remote sensing ship image target detection method based on feature pyramid structure | |
Chen et al. | Ship target detection algorithm based on improved YOLOv3 for maritime image | |
CN115620180A (en) | A Target Detection Method for Aerial Images Based on Improved YOLOv5 | |
CN115019107A (en) | Method, system and medium for sonar simulation image generation based on style transfer | |
CN116977872B (en) | CNN+ transducer remote sensing image detection method | |
CN111241970A (en) | SAR image sea surface ship detection method based on yolov3 algorithm and sliding window strategy | |
CN112699967A (en) | Remote airport target detection method based on improved deep neural network | |
CN114594461A (en) | Sonar target detection method based on attention perception and scaling factor pruning | |
CN110097047A (en) | A vehicle detection method using single-line lidar based on deep learning | |
CN114676777A (en) | A self-supervised learning method for fine-grained image classification based on Siamese network | |
CN114359709A (en) | Target detection method and device for remote sensing image | |
Chen et al. | Object detection of optical remote sensing image based on improved faster RCNN | |
CN116823838A (en) | Ocean ship detection method and system with Gaussian prior label distribution and characteristic decoupling | |
CN118570480A (en) | Method for labeling semantic segmentation data set based on public large model | |
CN116630808A (en) | Rotating ship detection method based on remote sensing image feature extraction | |
CN112966558A (en) | Port automatic identification method and system based on optimized SSD target detection model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |