WO2021046951A1 - Image identification method, system, and storage medium - Google Patents

Image identification method, system, and storage medium Download PDF

Info

Publication number
WO2021046951A1
WO2021046951A1 PCT/CN2019/110026 CN2019110026W WO2021046951A1 WO 2021046951 A1 WO2021046951 A1 WO 2021046951A1 CN 2019110026 W CN2019110026 W CN 2019110026W WO 2021046951 A1 WO2021046951 A1 WO 2021046951A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
anchor
target detection
detection module
module
Prior art date
Application number
PCT/CN2019/110026
Other languages
French (fr)
Chinese (zh)
Inventor
徐海青
陈是同
徐唯耀
秦浩
董媛媛
吴立刚
王维佳
余江斌
梁翀
宋杰
王文清
程琳
浦正国
郭庆
吴小华
张彬彬
胡心颖
胡丁丁
Original Assignee
安徽继远软件有限公司
国网信息通信产业集团有限公司
国家电网有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 安徽继远软件有限公司, 国网信息通信产业集团有限公司, 国家电网有限公司 filed Critical 安徽继远软件有限公司
Publication of WO2021046951A1 publication Critical patent/WO2021046951A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image
    • G06T3/60Rotation of a whole image or part thereof

Definitions

  • This application relates to the technical field of equipment maintenance, in particular to an image recognition method, system and storage medium.
  • the radiator bleed plugs currently used generally have the problem of not having a stop, which cannot achieve a good sealing effect.
  • the equipment structure design of some manufacturers is unreasonable, and it is extremely easy.
  • the oil leakage of oil-filled equipment is also related to the size of the load it carries. The higher the equipment load, the higher the equipment oil temperature, and the viscosity of the insulating oil becomes thinner, which is more likely to cause oil leakage.
  • the quality of the components of the oil-filled equipment is also one of the main reasons that cause it to easily lead to oil leakage.
  • the improper lifting, transportation, and installation operation methods adopted during the transportation and installation of the oil-filled equipment also caused oil leakage of the oil-filled equipment. Oil leakage caused by thermal expansion and contraction caused by changes in ambient temperature and load during long-term operation of oil-filled equipment.
  • the oil leakage defects of substation equipment are mainly inspected by manpower, and repairs are carried out after the oil leakage is found.
  • artificial intelligence has been widely used. How to effectively apply deep learning technology to the detection of oil leakage defects in substation equipment is a problem that needs to be solved urgently.
  • the embodiments of the present application provide an image recognition method, system, and storage medium.
  • the embodiment of the present application provides an image recognition method, including:
  • the data augmentation based on the optimized classification for the original image includes:
  • test set is a set of at least two original images, and classify the at least two original images
  • Data augmentation is performed based on the classification with the lowest retrieval accuracy.
  • performing image denoising based on the non-local mean of the adaptive Gaussian kernel includes:
  • the converted image rotation is not performed.
  • the construction of the anchor refinement module and the target detection module to optimize the network structure based on the constructed module includes:
  • the regression operation is performed through the target detection module, and the accurate target position and size of the anchors are obtained.
  • the anchor refinement module is used to adjust the position and size of the anchors
  • the target detection module is used to perform the regression operation to obtain the accurate target position and size of the anchors, including:
  • the anchor boxes are passed to the corresponding feature map of the target detection module to generate the target category and accurate target position and size; where n is a positive integer greater than or equal to 1.
  • the method further includes:
  • the anchor boxes are transferred to the corresponding feature map of the target detection module, and the target detection model is used to generate the target category and the accurate target position and size.
  • the method further includes:
  • the machine training of the deep learning neural network model is performed based on the calculation of the loss function, where the loss function includes the loss of the anchor refinement module and the loss of the target detection module.
  • the importing the image to be diagnosed into the trained neural network model to perform image data processing analysis and defect diagnosis includes:
  • the anchor refinement module is constructed by eliminating the classification layer of the classifier.
  • the target detection module is constructed by transmitting the output of the connection block.
  • the loss function is defined as follows:
  • p i and x i respectively represent the confidence that the predicted anchor i is a target and the coordinate of the refined anchor i in the anchor refinement module;
  • c i and t i respectively represent the object category and coordinates of the bounding box predicted in the target detection module
  • N arm and No odm are the number of anchors of the positive samples in the anchor refinement module and the target detection module, respectively, the classification loss L b is the cross-entropy loss of the two categories, and the multi- class loss L m is the normalized index of the confidence of multiple categories Function loss; use smooth L1 loss as regression loss L r ;
  • the embodiment of the present application also provides an image recognition system, including:
  • One or more processors are One or more processors;
  • Memory used to store one or more programs
  • the one or more processors are caused to execute any one of the image recognition methods described above.
  • the embodiment of the application adopts the above-mentioned technical solution, and has at least the following technical effects compared with the prior art:
  • FIG. 2 is a schematic diagram of a data augmentation process based on optimized classification according to an embodiment of the application
  • FIG. 3 is a schematic diagram of a processing flow of image denoising according to an embodiment of the application.
  • FIG. 4 is a flowchart of similarity comparison of rotating and matching image pieces according to an embodiment of this application.
  • FIG. 5 is a schematic diagram of the weight coefficient distribution of a noise image according to an embodiment of the application.
  • FIG. 6 is a schematic diagram of the adjustment position and size of the anchor refinement module based on the embodiment of the application.
  • an image recognition method which may specifically be an image recognition method for oil leakage of substation equipment based on single-stage target detection, including the following steps:
  • Step 1 Obtain the original image, and perform data augmentation based on the optimized classification for the original image;
  • the steps of data augmentation based on optimized classification include:
  • Step 11 Calculate the classification accuracy rates of all classes in the test set
  • the test set is a set of at least two original images, and at least two original images are classified.
  • the known images include several classifications and when it is known to manually classify and label each original image, calculate the The classification accuracy rate obtained by classifying the original images in the test set;
  • Step 12. retrieve the category with the lowest correct rate in the test set
  • Step 13 Perform data augmentation processing based on the classification with the lowest retrieval accuracy
  • the number of samples k in the training set is countable, and the number of input samples appearing in each category does not reach the number of weight parameters of the actual fully connected layer. K is much smaller than n.
  • equations The number is less than the number of unknowns, the equations are underdetermined equations, converted into matrix form as follows, the underdetermined equations have infinitely many solutions, that is, the equations have multiple solutions;
  • Step 2 For the augmented data, perform image denoising based on the non-local mean of the adaptive Gaussian kernel
  • processing steps of image denoising include the following:
  • Step 22 Set the main directions of the two images N i and N j to be ⁇ i and ⁇ j , respectively, and set ⁇ i, j to be the difference between the direction angles ⁇ i and ⁇ j ;
  • Step 23 If the value of ⁇ i,j is an integer multiple of 90, exchange pixels in the image N j to perform image rotation conversion;
  • Step 24 if the value of ⁇ i,j is not an integer multiple of 90, expand the image size of N j to a new image neighborhood, and perform image rotation conversion;
  • Step 25 if the value of ⁇ i,j does not exceed a preset threshold such as a value of 2, the images N i and N j are matching images, and no image rotation conversion is performed;
  • the construction of the anchor refinement module is realized by removing the classification layer of the classifier.
  • the target detection module is constructed by transmitting the output of the connection block.
  • Image matching based on the rotation of the current image in the image such that N i between the target image as an image N j establish a higher correlation to find more similar pixel, the pixel is similar to more facilitate better denoising effect.
  • the similarity distance is calculated based on the adaptive Gaussian kernel, and then the rotation matching similarity measure and the adaptive Gaussian kernel weight coefficient are associated to obtain the new similarity distance d K (i, j), the specific formula is as follows:
  • v (N i) is a pixel image of the N i, v (N 'j) with image V N j (N i) similar pixel.
  • the weight coefficient of the adaptive Gaussian kernel after normalization that is, the weight function defined as:
  • h(i) is the local adaptive similar weight parameter, that is, the denoising result can be calculated by the following formula:
  • the method for selecting local adaptive similarity weight parameters in this embodiment is based on image residual estimation, and the specific calculation includes the following:
  • R ⁇ r 1 , r 2 ,..., r
  • is the total number of pixels in the image
  • set R′ ⁇ r 1 , r 2 , ..., r
  • is the residual estimation of each point in the search area S(i)
  • the definition of the local adaptive similarity weight parameter h(i) is:
  • the constant ⁇ is used to adjust the parameters
  • is designed to be Functions related to ⁇ i:
  • FIG. 5 is a schematic diagram of the weight coefficient distribution of the non-local mean denoising method based on the adaptive Gaussian kernel and the traditional non-local method used in this embodiment.
  • the left side of the figure is a noisy image, and the middle is a traditional non-local mean image denoising.
  • Weight coefficient distribution, on the right is the non-local mean denoising weight coefficient distribution based on the adaptive Gaussian kernel.
  • the denoising method based on the non-local mean of the adaptive Gaussian kernel can find more similar pixels, and the rotation matching similarity comparison can find more images with similar patterns, ensuring the non-local mean based on the adaptive Gaussian kernel Denoising method has better denoising effect;
  • Step 3 Construct an anchor refinement module and a target detection module, and optimize the network structure based on the constructed module;
  • the network structure includes an anchor refinement module and a target detection module.
  • the anchor refinement module is used to remove negative sample anchors, so as to reduce the search space for the classifier, and at the same time roughly adjust the position and size of the anchors, so as to provide for the subsequent regression Better initialization results;
  • the target detection module is used to return the results to the accurate target position according to the refined anchors, and predict multi-category labels;
  • the anchor refinement module is constructed by removing the classification layer of the classifier; the target detection module is constructed by transmitting the output of the connection block; the transmission connection block is followed by the prediction layer, which generates the score of the target category and the coordinates relative to the refined anchors Offset;
  • the anchor refinement module and the target detection module establish a connection based on the transmission connection block, and convert the functions of different layers from the anchor refinement module into the form required by the target detection module, so that the target detection module can share the features from the anchor refinement module;
  • the transmission connection block is used for feature maps associated with anchors.
  • the transmission connection block also inherits large-scale context by adding advanced features to the transmitted features to improve the accuracy of detection;
  • the dimensions of the contexts of the scale are matched, and the inverse convolution operation is used to increase the advanced features, and then added to the convolutional layer after the summation to ensure the discernibility of the detected features;
  • this embodiment adopts a two-step cascade regression strategy to return to the target position and size, and first adjusts it through the anchor refinement module.
  • the position and size of the anchors are convenient to provide better initialization results for the regression operation of the target detection module.
  • the specific location and size of the anchors are adjusted based on the anchor refinement module, and then the target detection module is used to perform the regression operation.
  • the specific steps include the following:
  • Step 31 Generate n refined anchor boxes (windows) based on the units of the divided image feature map
  • Step 32 Pass the anchor boxes to the corresponding feature map of the target detection module by obtaining the generated n refined anchor boxes to generate the target category and accurate target position and size; n is a positive integer greater than or equal to 1.
  • this embodiment also rejects negative sample anchors that have been accurately classified by designing a negative sample filtering mechanism, and adjusts the problem of sample imbalance;
  • the anchor can be discarded in the process of training ODM; preferably, ⁇ in this implementation Set to 0.99;
  • the anchor will be discarded in the detection process of the target detection module
  • Step 4 Based on the anchor refinement module and the target detection module, perform deep learning neural network model machine training;
  • the loss function is defined as follows:
  • p i and x i respectively represent the confidence that the predicted anchor i is a target and the coordinate of the refined anchor i in the anchor refinement module;
  • N arm and No odm are the number of anchors of the positive samples in the anchor refinement module and the target detection module, respectively, the classification loss L b is the cross-entropy loss of the two categories, and the multi- class loss L m is the normalized index of the confidence of multiple categories Function loss; use smooth L1 loss as regression loss L r ;
  • this embodiment of the application selects four special layers, using VGG-16 as the backbone network, and the step size is 8, 16, 32, and 64 pixels, which are different from the predicted ones.
  • the anchors of the scale are associated;
  • the four selected feature layers are all associated with a specific anchor scale and three aspect ratios (ie 0.5, 1.0, 2.0); in particular, the anchor scale here is four times the corresponding step length;
  • the network model is trained end-to-end; by matching each real frame with the anchor boxes with the best overlap score, and then Match anchor boxes with any real boxes whose overlap degree is higher than 0.5;
  • Step 5 Import the image to be diagnosed into the trained neural network model for image data processing analysis and defect diagnosis
  • the identification Describe whether the image (patrol image) for the substation equipment to be diagnosed is an oil leakage image, to verify the accuracy of the algorithm in this case.
  • the image recognition method provided in the embodiments of this application uses deep learning techniques such as deep learning neural network models to intelligently identify and diagnose substation equipment inspection images, which improves the speed and accuracy of the diagnosis of oil leakage in substation equipment, and avoids the use of manpower
  • deep learning techniques such as deep learning neural network models to intelligently identify and diagnose substation equipment inspection images, which improves the speed and accuracy of the diagnosis of oil leakage in substation equipment, and avoids the use of manpower
  • the inspection of oil leakage has greatly saved manpower and material resources.
  • the original image is processed, and the denoising method based on the adaptive Gaussian kernel is used to eliminate the noise and interference information in the original image, improve the clarity of the original image, and help improve the accuracy of subsequent intelligent recognition and classification.
  • An embodiment of the present application also provides an image recognition system, and the system includes:
  • One or more processors are One or more processors;
  • Memory used to store one or more computer programs
  • the one or more processors are caused to execute the aforementioned image recognition method.
  • An embodiment of the present application also provides a computer-readable storage medium storing a computer program, wherein the program is executed by a processor to implement the aforementioned image recognition method.
  • the image recognition method, system, and storage medium provided by the embodiments of the application use deep learning technologies such as deep learning neural network models to intelligently identify and diagnose substation equipment inspection images, which improves the speed and accuracy of the diagnosis of oil leakage in substation equipment , It avoids the inspection of oil leakage by manpower, which greatly saves manpower and material resources.
  • deep learning technologies such as deep learning neural network models to intelligently identify and diagnose substation equipment inspection images, which improves the speed and accuracy of the diagnosis of oil leakage in substation equipment , It avoids the inspection of oil leakage by manpower, which greatly saves manpower and material resources.

Abstract

Provided in embodiments of the present application are an image identification method, a system, and a computer storage medium. The method comprises: acquiring an original image, and performing data augmentation on the original image on the basis of optimization classification; performing image denoising on the augmented data on the basis of a non-local mean value of an adaptive Gaussian kernel; constructing an anchor refinement module and a target detection module, and performing network structure optimization on the basis of the constructed modules; on the basis of the anchor refinement module and the target detection module, performing machine training on a deep learning neural network model; and importing an image to be diagnosed into the trained neural network model, and performing image data processing analysis and defect diagnosis.

Description

图像识别方法、系统及存储介质Image recognition method, system and storage medium
相关申请的交叉引用Cross-references to related applications
本申请基于申请号为201910846731.0、申请日为2019年09月09日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的内容在此以引入方式并入本申请。This application is filed based on a Chinese patent application with application number 201910846731.0 and an application date of September 9, 2019, and claims the priority of the Chinese patent application. The content of the Chinese patent application is hereby incorporated into this application by way of introduction.
技术领域Technical field
本申请涉及设备维护技术领域,具体涉及一种图像识别方法、系统及存储介质。This application relates to the technical field of equipment maintenance, in particular to an image recognition method, system and storage medium.
背景技术Background technique
基于油绝缘的传统变电站充油设备依赖于绝缘油良好的绝缘特性,能有效防止设备内部短路,但受到设备制造质量、运输、安装以及长期运行等多重复杂因素的影响,运行充油设备产生渗漏油的情况比较常见,并且在设备带电情况下很难进行排查,只能在设备停电时的情况下进行排查。Traditional oil-filled equipment for substations based on oil insulation relies on the good insulation characteristics of insulating oil, which can effectively prevent internal short circuits in the equipment. However, due to multiple complex factors such as equipment manufacturing quality, transportation, installation, and long-term operation, the operation of oil-filled equipment produces leakage. Oil leakage is more common, and it is difficult to check when the equipment is powered on. It can only be checked when the equipment is powered off.
变电站充油设备渗漏油的原因很多,目前使用的散热器放气塞子普遍存在不带止口的问题,无法起到良好的密封效果,加上部分厂家的设备结构设计不合理,也极为容易造成渗漏油缺陷。充油设备渗漏油还与其所承载的负荷大小有关,设备承载负荷越高,则设备油温越高,绝缘油的粘度变得更加稀薄,更加容易导致渗漏油情况的出现。充油设备的组件质量也是造成其容易导致渗漏油的主要原因之一。在充油设备运输、安装过程中采取的起吊、运输、安装操作方法不当也造成的充油设备渗漏油。充油设备长期运行过程中因环境温度、负荷变化等情况发生的热胀冷缩造成的渗漏油问题。There are many reasons for oil leakage of oil-filled equipment in substations. The radiator bleed plugs currently used generally have the problem of not having a stop, which cannot achieve a good sealing effect. In addition, the equipment structure design of some manufacturers is unreasonable, and it is extremely easy. Causes oil leakage defects. The oil leakage of oil-filled equipment is also related to the size of the load it carries. The higher the equipment load, the higher the equipment oil temperature, and the viscosity of the insulating oil becomes thinner, which is more likely to cause oil leakage. The quality of the components of the oil-filled equipment is also one of the main reasons that cause it to easily lead to oil leakage. The improper lifting, transportation, and installation operation methods adopted during the transportation and installation of the oil-filled equipment also caused oil leakage of the oil-filled equipment. Oil leakage caused by thermal expansion and contraction caused by changes in ambient temperature and load during long-term operation of oil-filled equipment.
近年来,我国电力系统也在大力推进变电设备化,可是我国电力系统 中的变电所依然没有完全改造,在这些变电所中使用的变压器等设备,依然是老式变压器居多,如果计划改造所有的变电设备,需要耗费巨大的人力、物力、财力,还会对正常的电网运行有一定影响。In recent years, my country’s power system has also vigorously promoted the transformation of substation equipment, but the substations in my country’s power system have not been completely transformed. The transformers and other equipment used in these substations are still mostly old-fashioned transformers. All substation equipment requires huge manpower, material resources, and financial resources, and will also have a certain impact on the normal operation of the power grid.
目前变电站设备(至少包括变电站充油设备)渗漏油缺陷主要是人力巡检,发现漏油问题再进行维修。随着计算机技术和图像处理技术的发展,人工智能得到广泛的应用,如何有效将深度学习技术应用于变电站设备渗漏油缺陷检测,这是目前亟需解决的问题。At present, the oil leakage defects of substation equipment (including at least the oil filling equipment of the substation) are mainly inspected by manpower, and repairs are carried out after the oil leakage is found. With the development of computer technology and image processing technology, artificial intelligence has been widely used. How to effectively apply deep learning technology to the detection of oil leakage defects in substation equipment is a problem that needs to be solved urgently.
发明内容Summary of the invention
为解决上述现有技术中的不足,本申请实施例在于提供一种图像识别方法、系统及存储介质,通过对变电站设备巡检图像进行智能识别诊断,提升了变电站设备渗漏油的诊断的速度和精度,避免了通过人力进行渗漏油情况的巡检,大大节省了人力物力。In order to solve the above-mentioned shortcomings in the prior art, the embodiments of the present application provide an image recognition method, system, and storage medium. By intelligently identifying and diagnosing the inspection images of substation equipment, the speed of diagnosis of oil leakage in substation equipment is improved. And precision, avoiding the inspection of oil leakage by manpower, greatly saving manpower and material resources.
本申请实施例提供了一种图像识别方法,包括:The embodiment of the present application provides an image recognition method, including:
获取原始图像,针对原始图像基于优化分类进行数据增广;Obtain the original image, and perform data augmentation based on the optimized classification for the original image;
针对增广后的数据,基于自适应高斯核的非局部均值进行图像去噪;For the augmented data, perform image denoising based on the non-local mean of the adaptive Gaussian kernel;
构建锚点anchor细化模块及目标检测模块,基于构建的模块进行网络结构优化;Construct the anchor refinement module and target detection module, and optimize the network structure based on the constructed module;
基于anchor细化模块及目标检测模块,进行深度学习神经网络模型机器训练;Based on the anchor refinement module and the target detection module, perform deep learning neural network model machine training;
导入待诊断图像至训练完成的神经网络模型,进行图像数据处理分析和缺陷诊断。Import the image to be diagnosed into the trained neural network model for image data processing analysis and defect diagnosis.
在上述方案中,所述针对原始图像基于优化分类进行数据增广,包括:In the above solution, the data augmentation based on the optimized classification for the original image includes:
计算测试集合的所有类的分类正确率,所述测试集合为至少两张原始图像的集合,对至少两张原始图像进行分类;Calculate the classification accuracy rates of all classes in a test set, where the test set is a set of at least two original images, and classify the at least two original images;
检索测试集合中的正确率最低的分类;Retrieve the category with the lowest correct rate in the test set;
基于检索的正确率最低的分类进行数据增广处理。Data augmentation is performed based on the classification with the lowest retrieval accuracy.
在上述方案中,所述针对增广后的数据,基于自适应高斯核的非局部均值进行图像去噪,包括:In the above solution, for the augmented data, performing image denoising based on the non-local mean of the adaptive Gaussian kernel includes:
基于自适应高斯核框架,设置θ i方向角作为图像的主方向; Based on the adaptive Gaussian kernel framework, set the θ i direction angle as the main direction of the image;
设定两幅原始图像N i、N j的主方向为θ i、θ j,设定Δθ i,j为方向角θ i与θ j的差值; Set the main directions of the two original images N i and N j as θ i and θ j , and set Δθ i, where j is the difference between the direction angles θ i and θ j;
若Δθ i,j的数值是90的整数倍,交换原始图像N j内像素进行图像的旋转转换; If the value of Δθ i,j is an integer multiple of 90, exchange pixels in the original image N j to perform image rotation conversion;
若Δθ i,j的数值不是90的整数倍,扩大原始图像N j的图像尺寸至可选取到新的图像邻域,进行图像的旋转转换; If the value of Δθ i,j is not an integral multiple of 90, expand the image size of the original image N j to a new image neighborhood, and perform image rotation conversion;
若Δθ i,j的数值不超过预设阈值如数值2时,原始图像N i和原始N j为匹配图像,不执行图像的旋转转换。 If Δθ i, the value j does not exceed the predetermined threshold value, such as 2, and the original image N i N j to match the original image, the converted image rotation is not performed.
在上述方案中,所述构建锚点anchor细化模块及目标检测模块,基于构建的模块进行网络结构优化,包括:In the above solution, the construction of the anchor refinement module and the target detection module to optimize the network structure based on the constructed module includes:
基于anchor细化模块调整anchors的位置和大小;Adjust the position and size of anchors based on the anchor refinement module;
通过目标检测模块进行回归操作,得到anchors准确的目标位置和大小。The regression operation is performed through the target detection module, and the accurate target position and size of the anchors are obtained.
在上述方案中,所述基于anchor细化模块调整anchors的位置和大小,通过目标检测模块进行回归操作,得到anchors准确的目标位置和大小,包括:In the above solution, the anchor refinement module is used to adjust the position and size of the anchors, and the target detection module is used to perform the regression operation to obtain the accurate target position and size of the anchors, including:
基于划分的图像特征图的单元,生成n个细化的anchor窗口boxes;Generate n refined anchor window boxes based on the divided image feature map units;
通过生成的n个细化的anchor boxes,将anchor boxes传递至目标检测模块的对应的特征图,生成目标类别和准确的目标位置和大小;其中,n为大于等于1的正整数。Through the generated n refined anchor boxes, the anchor boxes are passed to the corresponding feature map of the target detection module to generate the target category and accurate target position and size; where n is a positive integer greater than or equal to 1.
在上述方案中,在所述基于划分的图像特征图的单元,生成n个细化的anchor窗口boxes之后,所述方法还包括:In the above solution, after generating n refined anchor window boxes based on the divided image feature map unit, the method further includes:
计算每个anchor boxes的负置信度;Calculate the negative confidence of each anchor box;
删除负置信度大于预先设定的置信度阈值的anchor boxes;Delete anchor boxes with negative confidence greater than the preset confidence threshold;
相应的,所述通过生成的n个细化的anchor boxes,将anchor boxes传递至目标检测模块的对应的特征图,利用目标检测模型生成目标类别和准确的目标位置和大小。Correspondingly, through the generated n refined anchor boxes, the anchor boxes are transferred to the corresponding feature map of the target detection module, and the target detection model is used to generate the target category and the accurate target position and size.
在上述方案中,所述方法还包括:In the above solution, the method further includes:
基于损失函数的计算进行深度学习神经网络模型的机器训练,其中,所述损失函数包括anchor细化模块的损失以及目标检测模块的损失。The machine training of the deep learning neural network model is performed based on the calculation of the loss function, where the loss function includes the loss of the anchor refinement module and the loss of the target detection module.
在上述方案中,所述导入待诊断图像至训练完成的神经网络模型,进行图像数据处理分析和缺陷诊断,包括:In the above solution, the importing the image to be diagnosed into the trained neural network model to perform image data processing analysis and defect diagnosis includes:
将待诊断的图像输入到训练完成的深度神经网络模型,识别所述待诊断的图像是否为渗漏油图像。Input the image to be diagnosed into the trained deep neural network model, and identify whether the image to be diagnosed is an oil leakage image.
所述anchor细化模块的构建通过剔除分类器的分类层。The anchor refinement module is constructed by eliminating the classification layer of the classifier.
在上述方案中,所述目标检测模块通过传输连接块的输出进行构建。In the above solution, the target detection module is constructed by transmitting the output of the connection block.
在上述方案中,所述损失函数定义如下:In the above scheme, the loss function is defined as follows:
设定i为一个小批次里面的anchor索引,
Figure PCTCN2019110026-appb-000001
是anchor i的真实类别标签,
Figure PCTCN2019110026-appb-000002
是anchor  i真实的位置和大小;
Set i to the anchor index in a small batch,
Figure PCTCN2019110026-appb-000001
Is the real category label of anchor i,
Figure PCTCN2019110026-appb-000002
Is the real position and size of anchor i;
p i和x i分别表示预测的anchor  i是一个目标的置信度和anchor细化模块中细化后的anchor  i的坐标; p i and x i respectively represent the confidence that the predicted anchor i is a target and the coordinate of the refined anchor i in the anchor refinement module;
c i和t i分别表示目标检测模块中预测的边界框的物体类别和坐标; c i and t i respectively represent the object category and coordinates of the bounding box predicted in the target detection module;
N arm和N odm分别为anchor细化模块及目标检测模块中正样本的anchors数目,分类损失L b是两个类别的交叉熵损失,多分类损失L m是多个类别置信度的归一化指数函数损失;使用平滑L1损失作为回归损失L rN arm and No odm are the number of anchors of the positive samples in the anchor refinement module and the target detection module, respectively, the classification loss L b is the cross-entropy loss of the two categories, and the multi- class loss L m is the normalized index of the confidence of multiple categories Function loss; use smooth L1 loss as regression loss L r ;
Figure PCTCN2019110026-appb-000003
Figure PCTCN2019110026-appb-000004
Figure PCTCN2019110026-appb-000003
Figure PCTCN2019110026-appb-000004
上式的指示函数
Figure PCTCN2019110026-appb-000005
当条件为真时,输出1;即
Figure PCTCN2019110026-appb-000006
反之输出为0;
Indicator function
Figure PCTCN2019110026-appb-000005
When the condition is true, output 1; that is
Figure PCTCN2019110026-appb-000006
Otherwise, the output is 0;
Figure PCTCN2019110026-appb-000007
表明负样本anchors的回归损失被忽略;
Figure PCTCN2019110026-appb-000007
Shows that the regression loss of negative sample anchors is ignored;
当N arm=0,设置
Figure PCTCN2019110026-appb-000008
Figure PCTCN2019110026-appb-000009
When N arm = 0, set
Figure PCTCN2019110026-appb-000008
with
Figure PCTCN2019110026-appb-000009
当N odm=0,设置
Figure PCTCN2019110026-appb-000010
Figure PCTCN2019110026-appb-000011
When N odm =0, set
Figure PCTCN2019110026-appb-000010
with
Figure PCTCN2019110026-appb-000011
本申请实施例还提供了一种图像识别系统,包括:The embodiment of the present application also provides an image recognition system, including:
一个或多个处理器;One or more processors;
存储器,用于存储一个或多个程序,Memory, used to store one or more programs,
当所述一个或多个程序被所述一个或多个处理器执行时,使得所述一个或多个处理器执行如上任一所述的一种图像识别方法。When the one or more programs are executed by the one or more processors, the one or more processors are caused to execute any one of the image recognition methods described above.
一种存储有计算机程序的计算机可读存储介质,该程序被处理器执行时实现如上任一所述的图像识别方法。A computer-readable storage medium in which a computer program is stored, and when the program is executed by a processor, the image recognition method as described above is realized.
本申请实施例采用上述的技术方案,与现有技术相比至少具有以下技术效果:The embodiment of the application adopts the above-mentioned technical solution, and has at least the following technical effects compared with the prior art:
1)、本申请实施例中,相当于针对对变电站设备采集的实际图像(原始图像)进行预处理,采用基于自适应高斯核的去噪方法消除原始图像中的噪声和干扰信息,提高原始图像的清晰度,有利于提高后续的智能识别分类的准确性;1). In the embodiment of this application, it is equivalent to preprocessing the actual image (original image) collected by the substation equipment, and adopting the denoising method based on the adaptive Gaussian kernel to eliminate the noise and interference information in the original image and improve the original image The clarity of is helpful to improve the accuracy of subsequent intelligent recognition and classification;
2)、本申请实施例中,采用的anchor细化模块和目标检测模块的二级级联结构,anchor细化模块旨在过滤负样本anchors和粗略的调整anchors的位置和大小,提供更好的初始化结果;目标检测模块则将细化的anchors作为输入进行anchors的位置和尺寸的计算,得到准确的目标位和尺寸,基于二级级联结构能够提高对充电站设备的渗漏油情况的检测精度,有效保证检测结果的准确性。2). In the embodiments of this application, the secondary cascade structure of the anchor refinement module and the target detection module is adopted. The anchor refinement module is designed to filter negative sample anchors and roughly adjust the position and size of the anchors to provide better Initialization results; the target detection module uses the refined anchors as input to calculate the position and size of the anchors to obtain accurate target positions and sizes. Based on the two-level cascade structure, it can improve the detection of oil leakage of charging station equipment Accuracy, effectively guarantee the accuracy of the test results.
附图说明Description of the drawings
通过阅读参照以下附图所作的对非限制性实施例所作的详细描述,本申请的其它特征、目的和优点将会变得更明显:By reading the detailed description of the non-limiting embodiments with reference to the following drawings, other features, purposes, and advantages of the present application will become more apparent:
图1为本申请实施例的流程示意图;Figure 1 is a schematic flow diagram of an embodiment of the application;
图2为本申请实施例基于优化分类的数据增广的流程示意图;FIG. 2 is a schematic diagram of a data augmentation process based on optimized classification according to an embodiment of the application;
图3为本申请实施例的图像去噪的处理流程示意图;FIG. 3 is a schematic diagram of a processing flow of image denoising according to an embodiment of the application;
图4为本申请实施例旋转匹配图像片相似性对比流程图;FIG. 4 is a flowchart of similarity comparison of rotating and matching image pieces according to an embodiment of this application;
图5为本申请实施例噪声图像权系数分布示意图;FIG. 5 is a schematic diagram of the weight coefficient distribution of a noise image according to an embodiment of the application;
图6为本申请实施例基于anchor细化模块的调整位置及其大小示意图。FIG. 6 is a schematic diagram of the adjustment position and size of the anchor refinement module based on the embodiment of the application.
具体实施方式detailed description
下面结合附图和实施例对本申请作进一步的详细说明。可以理解的是,此处所描述的具体实施例仅仅用于解释相关发明,而非对该发明的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与发明相关的部分。The application will be further described in detail below with reference to the drawings and embodiments. It can be understood that the specific embodiments described here are only used to explain the related invention, but not to limit the invention. In addition, it should be noted that, for ease of description, only the parts related to the invention are shown in the drawings.
需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。下面将参考附图并结合实施例来详细说明本申请。It should be noted that the embodiments in this application and the features in the embodiments can be combined with each other if there is no conflict. Hereinafter, the application will be described in detail with reference to the drawings and in conjunction with the embodiments.
如图1-6所示,本申请实施例公开了一种图像识别方法,具体可以是一种基于单阶段目标检测的变电站设备渗漏油图像识别方法,包括如下步骤:As shown in Figures 1-6, the embodiment of the present application discloses an image recognition method, which may specifically be an image recognition method for oil leakage of substation equipment based on single-stage target detection, including the following steps:
步骤1,获取原始图像,针对原始图像基于优化分类进行数据增广;Step 1. Obtain the original image, and perform data augmentation based on the optimized classification for the original image;
采集针对变电站设备的至少两张图像并将其作为原始图像,通过图像进行深度学习神经网络模型机器的训练。Collect at least two images for substation equipment and use them as original images, and train deep learning neural network model machines through the images.
其中,基于优化分类的数据增广的步骤具体包括:Among them, the steps of data augmentation based on optimized classification include:
步骤11,计算测试集合的所有类的分类正确率;Step 11. Calculate the classification accuracy rates of all classes in the test set;
所述测试集合为至少两张原始图像的集合,对至少两张原始图像进行图像的分类,在已知图像包括几种分类以及在已知对各个原始图像进行人工分类标注的情况下,计算对测试集合中的原始图像进行分类而得到的分 类正确率;The test set is a set of at least two original images, and at least two original images are classified. When the known images include several classifications and when it is known to manually classify and label each original image, calculate the The classification accuracy rate obtained by classifying the original images in the test set;
步骤12,检索测试集合中的正确率最低的分类;Step 12. Retrieve the category with the lowest correct rate in the test set;
提取属于分类正确率最低的分类结果的原始图像;Extract the original image belonging to the classification result with the lowest classification accuracy rate;
步骤13,基于检索的正确率最低的分类进行数据增广处理;Step 13. Perform data augmentation processing based on the classification with the lowest retrieval accuracy;
更具体的,将提取的属于分类正确率最低的分类结果的原始图像,作为输入的图像样本输入至卷积神经网络。可以理解,卷积神经网络至少包括至少一个卷积层和至少一个全连接层。其中,卷积层用于对输入的图像样本的特征向量进行计算并将计算结果输出至全连接层,卷积神经网络的末层全连接层与输出层连接,针对输入的图像样本的特征向量进行机器训练,在输入样本过程中,获取如下多元方程组:More specifically, the extracted original image belonging to the classification result with the lowest classification accuracy rate is input to the convolutional neural network as the input image sample. It can be understood that the convolutional neural network includes at least one convolutional layer and at least one fully connected layer. Among them, the convolutional layer is used to calculate the feature vector of the input image sample and output the calculation result to the fully connected layer. The final fully connected layer of the convolutional neural network is connected to the output layer, and is directed to the feature vector of the input image sample For machine training, in the process of inputting samples, the following multivariate equations are obtained:
Figure PCTCN2019110026-appb-000012
Figure PCTCN2019110026-appb-000012
上式中,x 1,x 2,…,x n为最后一层全连接层的输入、为图像样本的特征向量;a 1,a 2,…,a m为输出,m表示分类的类别数,w为全连接层的权值参数,b为全连接层的偏置; In the above formula, x 1 , x 2 ,..., x n are the input of the last fully connected layer and the feature vector of the image sample; a 1 , a 2 ,..., a m are the output, and m is the number of categories , W is the weight parameter of the fully connected layer, and b is the bias of the fully connected layer;
针对第i类(m=i)分析,当输入k个样本时,对于每个类别来说,存在一个多元方程组如下式所示,方程的未知数为该类别所对应的权值参数w,输入样本个数即为构成该方程组的方程个数:For the i-th category (m=i) analysis, when k samples are input, for each category, there is a multivariate equation system as shown in the following formula, the unknown of the equation is the weight parameter w corresponding to the category, enter The number of samples is the number of equations that constitute the equation group:
Figure PCTCN2019110026-appb-000013
Figure PCTCN2019110026-appb-000013
在训练网络模型时,训练集的样本数k是可数的,每类出现的输入样 本数量达不到实际全连接层权值参数的数量,k远小于n,针对多元线性方程组,方程组个数小于未知数个数,方程组为欠定方程组,转换成矩阵形式如下,欠定方程组有无穷多解也就是方程组具有多组解;When training the network model, the number of samples k in the training set is countable, and the number of input samples appearing in each category does not reach the number of weight parameters of the actual fully connected layer. K is much smaller than n. For multiple linear equations, equations The number is less than the number of unknowns, the equations are underdetermined equations, converted into matrix form as follows, the underdetermined equations have infinitely many solutions, that is, the equations have multiple solutions;
Figure PCTCN2019110026-appb-000014
Figure PCTCN2019110026-appb-000014
针对样本特征复杂的分类样本,通过增加输入样本个数,增加方程组的方程个数,基础解系所含向量个数增加,则促进网络模型达到全局最优;For classified samples with complex sample characteristics, by increasing the number of input samples, increasing the number of equations in the equation system, and increasing the number of vectors contained in the basic solution system, the network model is promoted to reach the global optimum;
通过基于优化分类的数据增广将分类效果不好的类输入样本增加,即增加该类所对应的方程组的方程个数,针对单类进行数据增广能够提高网络模型对该类的分类正确率进而提高整体的分类正确率;Through data augmentation based on optimized classification, the input samples of classes with poor classification effects are increased, that is, the number of equations corresponding to the class is increased. Data augmentation for a single class can improve the correct classification of the class by the network model And then improve the overall classification accuracy rate;
步骤2,针对增广后的数据,基于自适应高斯核的非局部均值进行图像去噪;Step 2: For the augmented data, perform image denoising based on the non-local mean of the adaptive Gaussian kernel;
具体的,图像去噪的处理步骤包括如下:Specifically, the processing steps of image denoising include the following:
步骤21,基于自适应高斯核框架,设置θ i方向角作为图像的主方向,利用θ i实现图像的旋转匹配; Step 21: Based on the adaptive Gaussian kernel framework, set the θ i direction angle as the main direction of the image, and use θ i to realize the rotation matching of the image;
步骤22,设定两幅图像N i和N j的主方向分别为θ i与θ j,设定Δθ i,j为练个方向角θ i与θ j的差值; Step 22: Set the main directions of the two images N i and N j to be θ i and θ j , respectively, and set Δθ i, j to be the difference between the direction angles θ i and θ j ;
步骤23,若Δθ i,j的数值是90的整数倍,交换图像N j内像素进行图像的旋转转换; Step 23: If the value of Δθ i,j is an integer multiple of 90, exchange pixels in the image N j to perform image rotation conversion;
步骤24,若Δθ i,j的数值不是90的整数倍,扩大N j的图像尺寸至可选取到新的图像邻域,进行图像的旋转转换; Step 24, if the value of Δθ i,j is not an integer multiple of 90, expand the image size of N j to a new image neighborhood, and perform image rotation conversion;
步骤25,若Δθ i,j的数值不超过预设阈值如数值2时,图像N i和N j为匹配图像,不执行图像的旋转转换; Step 25, if the value of Δθ i,j does not exceed a preset threshold such as a value of 2, the images N i and N j are matching images, and no image rotation conversion is performed;
具体的,所述anchor细化模块的构建通过剔除分类器的分类层来实现。目标检测模块通过传输连接块的输出进行构建。Specifically, the construction of the anchor refinement module is realized by removing the classification layer of the classifier. The target detection module is constructed by transmitting the output of the connection block.
基于图像的旋转匹配,使得在当前图像如图像N i与目标图像如图像N j之间建立更高的相关性,从而发现更多相似的像素,较多的相似像素便于获得更好的去噪效果。 Image matching based on the rotation of the current image in the image such that N i between the target image as an image N j establish a higher correlation to find more similar pixel, the pixel is similar to more facilitate better denoising effect.
具体的,在通过计算相似距离进行可靠的相似权系数的获取过程中,基于自适应高斯核来计算相似距离,再关联旋转匹配相似性度量以及自适应高斯核权系数获取新的相似距离d K(i,j),具体公式如下所示: Specifically, in the process of obtaining the reliable similarity weight coefficient by calculating the similarity distance, the similarity distance is calculated based on the adaptive Gaussian kernel, and then the rotation matching similarity measure and the adaptive Gaussian kernel weight coefficient are associated to obtain the new similarity distance d K (i, j), the specific formula is as follows:
Figure PCTCN2019110026-appb-000015
Figure PCTCN2019110026-appb-000015
上式中,v(N i)为图像N i的一个像素点,v(N' j)为图像N j中与v(N i)相似的像素。
Figure PCTCN2019110026-appb-000016
为归一化后自适应高斯核的权系数,即权函数
Figure PCTCN2019110026-appb-000017
定义为:
In the above formula, v (N i) is a pixel image of the N i, v (N 'j) with image V N j (N i) similar pixel.
Figure PCTCN2019110026-appb-000016
Is the weight coefficient of the adaptive Gaussian kernel after normalization, that is, the weight function
Figure PCTCN2019110026-appb-000017
defined as:
Figure PCTCN2019110026-appb-000018
Figure PCTCN2019110026-appb-000018
上式中h(i)为局部自适应相似权重参数,即去噪结果可由下式计算:In the above formula, h(i) is the local adaptive similar weight parameter, that is, the denoising result can be calculated by the following formula:
Figure PCTCN2019110026-appb-000019
Figure PCTCN2019110026-appb-000019
上式中,
Figure PCTCN2019110026-appb-000020
为归一化因子,S(i)为图像的搜索区域;
In the above formula,
Figure PCTCN2019110026-appb-000020
Is the normalization factor, S(i) is the search area of the image;
具体的,本实施例的局部自适应相似权重参数选取方法基于图像残差估计,具体计算包括如下:Specifically, the method for selecting local adaptive similarity weight parameters in this embodiment is based on image residual estimation, and the specific calculation includes the following:
计算含有噪声的样本图像中各个像素的残差r i Calculate the residual r i of each pixel in the sample image with noise:
Figure PCTCN2019110026-appb-000021
Figure PCTCN2019110026-appb-000021
其中
Figure PCTCN2019110026-appb-000022
用于保证在平坦区域
Figure PCTCN2019110026-appb-000023
E(.)表示数学期望;
among them
Figure PCTCN2019110026-appb-000022
Used to ensure that it is in a flat area
Figure PCTCN2019110026-appb-000023
E(.) represents mathematical expectation;
利用一个鲁棒的中值估计器估计图像局部区域的噪声标准σ i,即: Use a robust median estimator to estimate the noise standard σ i of the local area of the image, namely:
Figure PCTCN2019110026-appb-000024
Figure PCTCN2019110026-appb-000024
其中,R={r 1,r 2,...,r |Ω(i)|},|Ω(i)|为图像内像素的总个数;设R′={r 1,r 2,...,r |S(i)|}为搜索区域S(i)内各点的残差估计,搜索区域内的噪声水平定义为σ i=mean{R′} Among them, R={r 1 , r 2 ,..., r |Ω(i)| },|Ω(i)| is the total number of pixels in the image; set R′={r 1 , r 2 , ..., r |S(i)| } is the residual estimation of each point in the search area S(i), and the noise level in the search area is defined as σ i = mean{R′}
局部自适应相似权重参数h(i)的定义为:The definition of the local adaptive similarity weight parameter h(i) is:
h(i)=ησ i  (10) h(i)=ησ i (10)
其中,常量η用于调控参数;Among them, the constant η is used to adjust the parameters;
基于残差估计方法在图像细节部分存在高估噪声水平,在平坦部分噪声会低估噪声水平,将η设计为与
Figure PCTCN2019110026-appb-000025
和σ i有关的函数:
Based on the residual estimation method, there is an overestimation of the noise level in the detail part of the image, and the noise will underestimate the noise level in the flat part, so η is designed to be
Figure PCTCN2019110026-appb-000025
Functions related to σ i:
Figure PCTCN2019110026-appb-000026
Figure PCTCN2019110026-appb-000026
其中,η 1=1.1,η 2=1.6是经验值,通过实验优化得到; Among them, η 1 =1.1 and η 2 =1.6 are empirical values, which are obtained through experimental optimization;
参见图5,为本实施例采用的基于自适应高斯核的非局部均值的去噪方法与传统非局部方法权系数分布示意图,图中左侧是噪声图像,中间是传统非局部均值图像去噪权系数分布,右侧是基于自适应高斯核的非局部均值去噪权系数分布。图像块被噪声标准σ=20的高斯噪声污染,图中越亮的像素表示越大的权值。从图5中可以看出,基于自适应高斯核的非局部均值的去噪方法中具有相似边缘或纹理结构的像素被赋予较大的权值,即本实施例提出的权函数能够更有效地度量像素之间的相似性;Refer to Figure 5, which is a schematic diagram of the weight coefficient distribution of the non-local mean denoising method based on the adaptive Gaussian kernel and the traditional non-local method used in this embodiment. The left side of the figure is a noisy image, and the middle is a traditional non-local mean image denoising. Weight coefficient distribution, on the right is the non-local mean denoising weight coefficient distribution based on the adaptive Gaussian kernel. The image block is contaminated by Gaussian noise with the noise standard σ=20, and the brighter pixels in the figure indicate the greater the weight. It can be seen from Figure 5 that pixels with similar edges or texture structures in the non-local mean denoising method based on the adaptive Gaussian kernel are given larger weights, that is, the weight function proposed in this embodiment can be more effective Measures the similarity between pixels;
在纹理图像中基于自适应高斯核的非局部均值的去噪方法能够发现更多相似的像素,旋转匹配相似对比能够发现更多具有相似模式的图像,保证基于自适应高斯核的非局部均值的去噪方法有更好的去噪效果;In texture images, the denoising method based on the non-local mean of the adaptive Gaussian kernel can find more similar pixels, and the rotation matching similarity comparison can find more images with similar patterns, ensuring the non-local mean based on the adaptive Gaussian kernel Denoising method has better denoising effect;
步骤3,构建锚点anchor细化模块及目标检测模块,基于构建的模块进行网络结构优化;Step 3: Construct an anchor refinement module and a target detection module, and optimize the network structure based on the constructed module;
所述网络结构包括anchor细化模块和目标检测模块,anchor细化模块 用于移除负样本anchors,以便为分类器减少搜索空间,同时粗略调整anchors的位置和大小,以便为随后的回归器提供更好的初始化结果;目标检测模块用于根据细化后的anchors将结果回归到准确的目标位置,并预测多类别标签;The network structure includes an anchor refinement module and a target detection module. The anchor refinement module is used to remove negative sample anchors, so as to reduce the search space for the classifier, and at the same time roughly adjust the position and size of the anchors, so as to provide for the subsequent regression Better initialization results; the target detection module is used to return the results to the accurate target position according to the refined anchors, and predict multi-category labels;
anchor细化模块的构建通过剔除分类器的分类层;目标检测模块通过传输连接块的输出进行构建;传输连接块后面接预测层,其生成目标类别的分数和相对于细化后的anchors的坐标偏移量;The anchor refinement module is constructed by removing the classification layer of the classifier; the target detection module is constructed by transmitting the output of the connection block; the transmission connection block is followed by the prediction layer, which generates the score of the target category and the coordinates relative to the refined anchors Offset;
anchor细化模块和目标检测模块基于传输连接块建立联系,将来自anchor细化模块的不同层的功能转换为目标检测模块所需的形式,以便目标检测模块可以共享来自anchor细化模块的特征;The anchor refinement module and the target detection module establish a connection based on the transmission connection block, and convert the functions of different layers from the anchor refinement module into the form required by the target detection module, so that the target detection module can share the features from the anchor refinement module;
在此特别说明的是,传输连接块用于与anchors相关联的特征图,传输连接块还通过将高级特征添加到传输的特征来继承大规模的上下文,以提高检测的准确性;为了使得大规模的上下文之间的维度相匹配,采用逆卷积操作来增大高级特征,再求和之后添加到卷积层以确保检测的特征的可辩性;It is specifically explained here that the transmission connection block is used for feature maps associated with anchors. The transmission connection block also inherits large-scale context by adding advanced features to the transmitted features to improve the accuracy of detection; The dimensions of the contexts of the scale are matched, and the inverse convolution operation is used to increase the advanced features, and then added to the convolutional layer after the summation to ensure the discernibility of the detected features;
为了提高基于不同尺度下的各个特征层的单步回归预测目标的位置和大小的准确性,本实施例采用两步级联回归策略来回归目标的位置和大小,先通过anchor细化模块首次调整anchors的位置和大小,便于为目标检测模块的回归操作提供更好的初始化结果。In order to improve the accuracy of single-step regression predicting the position and size of the target based on each feature layer at different scales, this embodiment adopts a two-step cascade regression strategy to return to the target position and size, and first adjusts it through the anchor refinement module. The position and size of the anchors are convenient to provide better initialization results for the regression operation of the target detection module.
更具体的,基于anchor细化模块调整anchors的具体位置及其大小,再通过目标检测模块进行回归操作,具体步骤包括如下:More specifically, the specific location and size of the anchors are adjusted based on the anchor refinement module, and then the target detection module is used to perform the regression operation. The specific steps include the following:
步骤31,基于划分的图像特征图的单元,生成n个细化的anchor boxes(窗口);Step 31: Generate n refined anchor boxes (windows) based on the units of the divided image feature map;
步骤32,通过获取生成的n个细化的anchor boxes,将anchor boxes传递至目标检测模块的对应的特征图,生成目标类别和准确的目标位置和大 小;n为大于等于1的正整数。Step 32: Pass the anchor boxes to the corresponding feature map of the target detection module by obtaining the generated n refined anchor boxes to generate the target category and accurate target position and size; n is a positive integer greater than or equal to 1.
本实施例在计算2个类别分数和相对于细化anchor boxes的四个精确的偏移量,为每个细化anchor box产生6个输出以实现检测任务;本实施例使用两步级联回归策略,即anchor细化模块生成细化后的anchor boxes,目标检测模块再将这些细化后的anchor boxes作为输入用于进一步检测,尤其针对小目标而言,使得检测结果更加精确;In this embodiment, two category scores and four precise offsets relative to the refined anchor boxes are calculated, and 6 outputs are generated for each refined anchor box to realize the detection task; this embodiment uses two-step cascade regression The strategy, that is, the anchor refinement module generates refined anchor boxes, and the target detection module uses these refined anchor boxes as input for further detection, especially for small targets, making the detection results more accurate;
具体的,本实施例还通过设计负样本过滤机制,拒绝已经被精准分类的负样本anchors,并调节样本失衡问题;Specifically, this embodiment also rejects negative sample anchors that have been accurately classified by designing a negative sample filtering mechanism, and adjusts the problem of sample imbalance;
针对在训练阶段对于一个细化后的anchor box,若其负置信度大于一个预先设置好的(置信度)阈值θ,即可在训练ODM的过程中舍弃该anchor;优选的,本实施的θ设置为0.99;For a refined anchor box in the training phase, if its negative confidence is greater than a preset (confidence) threshold θ, the anchor can be discarded in the process of training ODM; preferably, θ in this implementation Set to 0.99;
通过传递经过细化后的负样本anchor boxes和正样本anchor boxes来训练目标检测模块;Train the target detection module by passing refined negative sample anchor boxes and positive sample anchor boxes;
针对在推理阶段,若存在一个经过细化后的anchor box被赋予一个大于θ的负置信度,则该anchor将在目标检测模块的检测过程中舍弃;In the inference stage, if there is a refined anchor box that is given a negative confidence greater than θ, the anchor will be discarded in the detection process of the target detection module;
步骤4,基于anchor细化模块及目标检测模块,进行深度学习神经网络模型机器训练;Step 4: Based on the anchor refinement module and the target detection module, perform deep learning neural network model machine training;
具体的,训练基于设计的anchor细化模块和目标检测模块的深度学习神经网络模型,本实施例的损失函数包括anchor细化模块的损失以及目标检测模块的损失,对于anchor细化模块,为每个anchor赋予一个二元类标签,表明是一个目标或者不是一个目标,并且同时回归他的位置和大小,以获得一个细化的anchor;Specifically, training a deep learning neural network model based on the designed anchor refinement module and target detection module. The loss function in this embodiment includes the loss of the anchor refinement module and the loss of the target detection module. For the anchor refinement module, it is every Each anchor is assigned a binary class label, indicating that it is a target or not a target, and returns to its position and size at the same time to obtain a refined anchor;
将具有小于阈值的负置信度的细化后的anchors传递给目标检测模块以便进一步预测目标类别和准确的目标位子和大小;Pass the refined anchors with a negative confidence level less than the threshold to the target detection module to further predict the target category and accurate target position and size;
损失函数定义如下:The loss function is defined as follows:
设定i为一个小批次里面的anchor索引,
Figure PCTCN2019110026-appb-000027
是anchor i的真实类别标签,
Figure PCTCN2019110026-appb-000028
是anchor  i真实的位置和大小;
Set i to the anchor index in a small batch,
Figure PCTCN2019110026-appb-000027
Is the real category label of anchor i,
Figure PCTCN2019110026-appb-000028
Is the real position and size of anchor i;
p i和x i分别表示预测的anchor  i是一个目标的置信度和anchor细化模块中细化后的anchor  i的坐标; p i and x i respectively represent the confidence that the predicted anchor i is a target and the coordinate of the refined anchor i in the anchor refinement module;
c i和t i分别表示目标检测模块中预测的边界框的物体类别和坐标; c i and t i respectively represent the object category and coordinates of the bounding box predicted in the target detection module;
N arm和N odm分别为anchor细化模块及目标检测模块中正样本的anchors数目,分类损失L b是两个类别的交叉熵损失,多分类损失L m是多个类别置信度的归一化指数函数损失;使用平滑L1损失作为回归损失L rN arm and No odm are the number of anchors of the positive samples in the anchor refinement module and the target detection module, respectively, the classification loss L b is the cross-entropy loss of the two categories, and the multi- class loss L m is the normalized index of the confidence of multiple categories Function loss; use smooth L1 loss as regression loss L r ;
Figure PCTCN2019110026-appb-000029
Figure PCTCN2019110026-appb-000029
上式的指示函数
Figure PCTCN2019110026-appb-000030
当条件为真时,输出1;即
Figure PCTCN2019110026-appb-000031
反之输出为0;
Indicator function
Figure PCTCN2019110026-appb-000030
When the condition is true, output 1; that is
Figure PCTCN2019110026-appb-000031
Otherwise, the output is 0;
Figure PCTCN2019110026-appb-000032
表明负样本anchors的回归损失被忽略;
Figure PCTCN2019110026-appb-000032
Shows that the regression loss of negative sample anchors is ignored;
当N arm=0,设置
Figure PCTCN2019110026-appb-000033
Figure PCTCN2019110026-appb-000034
When N arm = 0, set
Figure PCTCN2019110026-appb-000033
with
Figure PCTCN2019110026-appb-000034
当N odm=0,设置
Figure PCTCN2019110026-appb-000035
Figure PCTCN2019110026-appb-000036
When N odm =0, set
Figure PCTCN2019110026-appb-000035
with
Figure PCTCN2019110026-appb-000036
更具体的,为了处理不同尺度的目标,本申请实施例选取四个特种层,通过以VGG-16作为主干网络,步长大小分别为8,16,32和64像素,与预测的几种不同尺度的anchors进行关联;More specifically, in order to deal with targets of different scales, this embodiment of the application selects four special layers, using VGG-16 as the backbone network, and the step size is 8, 16, 32, and 64 pixels, which are different from the predicted ones. The anchors of the scale are associated;
选取的四个特征层均与一个特定的的anchors的尺度和三个纵横比(即0.5,1.0,2.0)相关联;特别说明的是,此处anchors的尺度为对应步长的四倍;The four selected feature layers are all associated with a specific anchor scale and three aspect ratios (ie 0.5, 1.0, 2.0); in particular, the anchor scale here is four times the corresponding step length;
训练过程当中,根据Jaccard重叠觉得anchors和真实框质检的对应关系,针对性的对于网络模型进行端与端的训练;通过将每个真实框与和其最佳重叠分数的anchor boxes进行匹配,再将anchor boxes与和其重叠度高于0.5的任何真实框进行匹配;During the training process, according to the corresponding relationship between the Jaccard overlapped anchors and the real frame quality inspection, the network model is trained end-to-end; by matching each real frame with the anchor boxes with the best overlap score, and then Match anchor boxes with any real boxes whose overlap degree is higher than 0.5;
步骤5,导入待诊断图像至训练完成的神经网络模型,进行图像数据处理分析和缺陷诊断;Step 5: Import the image to be diagnosed into the trained neural network model for image data processing analysis and defect diagnosis;
具体的,通过将待诊断的目标图像导入训练完成的神经网络模型,进行图像数据处理分析和缺陷诊断,基于将待诊断识别的变电站渗漏油图像输入到训练完成的深度神经网络模型,识别所述待诊断的针对变电设备的图像(巡检图像)是否为渗漏油图像,验证本案的算法的准确度。Specifically, by importing the target image to be diagnosed into the trained neural network model, image data processing analysis and defect diagnosis are performed. Based on the input of the substation oil leakage image to be diagnosed and identified into the trained deep neural network model, the identification Describe whether the image (patrol image) for the substation equipment to be diagnosed is an oil leakage image, to verify the accuracy of the algorithm in this case.
本申请实施例提供的图像识别方法,采用深度学习技术如深度学习神经网络模型,对变电站设备巡检图像进行智能识别诊断,提升了变电站设备渗漏油的诊断的速度和精度,避免了通过人力进行渗漏油情况的巡检,大大节省了人力物力。其中,对原始图像进行处理,采用基于自适应高斯核的去噪方法消除原始图像中的噪声和干扰信息,提高原始图像清晰度,有利于提高后续的智能识别分类的准确性。The image recognition method provided in the embodiments of this application uses deep learning techniques such as deep learning neural network models to intelligently identify and diagnose substation equipment inspection images, which improves the speed and accuracy of the diagnosis of oil leakage in substation equipment, and avoids the use of manpower The inspection of oil leakage has greatly saved manpower and material resources. Among them, the original image is processed, and the denoising method based on the adaptive Gaussian kernel is used to eliminate the noise and interference information in the original image, improve the clarity of the original image, and help improve the accuracy of subsequent intelligent recognition and classification.
本申请实施例还一种图像识别系统,所述系统包括:An embodiment of the present application also provides an image recognition system, and the system includes:
一个或多个处理器;One or more processors;
存储器,用于存储一个或多个计算机程序,Memory, used to store one or more computer programs,
当所述一个或多个计算机程序被所述一个或多个处理器执行时,使得所述一个或多个处理器执行前述的图像识别方法。When the one or more computer programs are executed by the one or more processors, the one or more processors are caused to execute the aforementioned image recognition method.
本申请实施例还一种存储有计算机程序的计算机可读存储介质,其中,该程序被处理器执行时实现前述的图像识别方法。An embodiment of the present application also provides a computer-readable storage medium storing a computer program, wherein the program is executed by a processor to implement the aforementioned image recognition method.
以上描述仅为本申请的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本申请中所涉及的发明范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离所述发明构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本申请中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is only a preferred embodiment of the present application and an explanation of the applied technical principles. Those skilled in the art should understand that the scope of the invention involved in this application is not limited to the technical solution formed by the specific combination of the above technical features, and should also cover the above technical features without departing from the inventive concept. Or other technical solutions formed by any combination of its equivalent features. For example, the above-mentioned features and the technical features disclosed in this application (but not limited to) with similar functions are mutually replaced to form a technical solution.
除说明书所述的技术特征外,其余技术特征为本领域技术人员的已知技术,为突出本申请的创新特点,其余技术特征在此不再赘述。Except for the technical features described in the specification, the other technical features are known to those skilled in the art. In order to highlight the innovative features of this application, the rest of the technical features are not repeated here.
工业实用性Industrial applicability
本申请实施例提供的图像识别方法、系统及存储介质,采用深度学习技术如深度学习神经网络模型,对变电站设备巡检图像进行智能识别诊断,提升了变电站设备渗漏油的诊断的速度和精度,避免了通过人力进行渗漏油情况的巡检,大大节省了人力物力。The image recognition method, system, and storage medium provided by the embodiments of the application use deep learning technologies such as deep learning neural network models to intelligently identify and diagnose substation equipment inspection images, which improves the speed and accuracy of the diagnosis of oil leakage in substation equipment , It avoids the inspection of oil leakage by manpower, which greatly saves manpower and material resources.

Claims (10)

  1. 一种图像识别方法,包括如下步骤:An image recognition method includes the following steps:
    获取原始图像,针对原始图像基于优化分类进行数据增广;Obtain the original image, and perform data augmentation based on the optimized classification for the original image;
    针对增广后的数据,基于自适应高斯核的非局部均值进行图像去噪;For the augmented data, perform image denoising based on the non-local mean of the adaptive Gaussian kernel;
    构建锚点anchor细化模块及目标检测模块,基于构建的模块进行网络结构优化;Construct the anchor refinement module and target detection module, and optimize the network structure based on the constructed module;
    基于anchor细化模块及目标检测模块,进行深度学习神经网络模型机器训练;Based on the anchor refinement module and the target detection module, perform deep learning neural network model machine training;
    导入待诊断图像至训练完成的神经网络模型,进行图像数据处理分析和缺陷诊断。Import the image to be diagnosed into the trained neural network model for image data processing analysis and defect diagnosis.
  2. 根据权利要求1所述的方法,其中,所述基于优化分类的数据增广,包括:The method according to claim 1, wherein the data augmentation based on optimized classification comprises:
    计算测试集合的所有类的分类正确率,所述测试集合为至少两张原始图像的集合,对至少两张原始图像进行图像的分类并计算对测试集合中的原始图像进行分类而得到的分类正确率;Calculate the classification accuracy rate of all classes in the test set, the test set is a set of at least two original images, perform image classification on the at least two original images, and calculate the correct classification obtained by classifying the original images in the test set rate;
    检索测试集合中的正确率最低的分类;Retrieve the category with the lowest correct rate in the test set;
    基于检索的正确率最低的分类进行数据增广处理。Data augmentation is performed based on the classification with the lowest retrieval accuracy.
  3. 根据权利要求1所述的方法,其中,所述基于自适应高斯核的非局部均值进行图像去噪,包括如下:The method according to claim 1, wherein the performing image denoising based on the non-local mean of the adaptive Gaussian kernel comprises the following:
    基于自适应高斯核框架,Based on the adaptive Gaussian kernel framework,
    设定两幅原始图像N i、N j的主方向为θ i、θ j,设定Δθ i,j为练个方向角θ i与θ j的差值; Set the main directions of the two original images N i and N j as θ i and θ j , and set Δθ i, j to be the difference between the direction angles θ i and θ j ;
    若Δθ i,j的数值是90的整数倍,交换原始图像N j内像素进行图像的旋转转换; If the value of Δθ i,j is an integer multiple of 90, exchange pixels in the original image N j to perform image rotation conversion;
    若Δθ i,j的数值不是90的整数倍,扩大原始N j的图像尺寸至可选取到新 的图像邻域,进行图像的旋转转换; If the value of Δθ i,j is not an integer multiple of 90, expand the image size of the original N j to a new image neighborhood, and perform image rotation conversion;
    若Δθ i,j的数值不超过预设阈值时,图像N i和N j为匹配,不执行图像的旋转转换。 If the value of Δθ i,j does not exceed the preset threshold, the images N i and N j are matched, and no image rotation conversion is performed.
  4. 根据权利要求1所述的方法,其中,所述构建锚点anchor细化模块及目标检测模块,基于构建的模块进行网络结构优化,包括:The method according to claim 1, wherein the constructing the anchor refinement module and the target detection module to optimize the network structure based on the constructed module comprises:
    基于anchor细化模块调整anchors的位置和大小;Adjust the position and size of anchors based on the anchor refinement module;
    通过目标检测模块进行回归操作,得到anchors准确的目标位置和大小。The regression operation is performed through the target detection module, and the accurate target position and size of the anchors are obtained.
  5. 根据权利要求4所述的方法,其中,所述基于anchor细化模块调整anchors的位置和大小,通过目标检测模块进行回归操作,得到anchors准确的目标位置和大小,包括:The method according to claim 4, wherein the adjusting the position and size of the anchors based on the anchor refinement module, and performing a regression operation through the target detection module to obtain the accurate target position and size of the anchors, comprises:
    基于划分的图像特征图的单元,生成n个细化的anchor窗口boxes;Generate n refined anchor window boxes based on the divided image feature map units;
    通过生成的n个细化的anchor boxes,将anchor boxes传递至目标检测模块的对应的特征图,生成目标类别和准确的目标位置和大小;其中,n为大于等于1的正整数。Through the generated n refined anchor boxes, the anchor boxes are passed to the corresponding feature map of the target detection module to generate the target category and accurate target position and size; where n is a positive integer greater than or equal to 1.
  6. 根据权利要求5所述的方法,其中,在所述基于划分的图像特征图的单元,生成n个细化的anchor窗口boxes之后,所述方法还包括:The method according to claim 5, wherein, after generating the n refined anchor window boxes based on the unit of the divided image feature map, the method further comprises:
    计算每个anchor boxes的负置信度;Calculate the negative confidence of each anchor box;
    删除负置信度大于预先设定的置信度阈值的anchor boxes;Delete anchor boxes with negative confidence greater than the preset confidence threshold;
    相应的,所述通过生成的n个细化的anchor boxes,将anchor boxes传递至目标检测模块的对应的特征图,利用目标检测模型生成目标类别和准确的目标位置和大小。Correspondingly, through the generated n refined anchor boxes, the anchor boxes are transferred to the corresponding feature map of the target detection module, and the target detection model is used to generate the target category and the accurate target position and size.
  7. 根据权利要求1所述的方法,其中,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    基于损失函数的计算进行深度学习神经网络模型的机器训练,其中,所述损失函数包括anchor细化模块的损失以及目标检测模块的损失。The machine training of the deep learning neural network model is performed based on the calculation of the loss function, where the loss function includes the loss of the anchor refinement module and the loss of the target detection module.
  8. 根据权利要求1所述的方法,其中,所述导入待诊断图像至训练完 成的神经网络模型,进行图像数据处理分析和缺陷诊断,包括:The method according to claim 1, wherein the importing the image to be diagnosed into the trained neural network model to perform image data processing analysis and defect diagnosis comprises:
    将待诊断的图像输入到训练完成的深度神经网络模型,识别所述待诊断的图像是否为渗漏油图像。Input the image to be diagnosed into the trained deep neural network model, and identify whether the image to be diagnosed is an oil leakage image.
  9. 一种图像识别系统,其中,所述系统包括:An image recognition system, wherein the system includes:
    一个或多个处理器;One or more processors;
    存储器,用于存储一个或多个程序,Memory, used to store one or more programs,
    当所述一个或多个程序被所述一个或多个处理器执行时,使得所述一个或多个处理器执行如权利要求1-8任一所述的图像识别方法。When the one or more programs are executed by the one or more processors, the one or more processors are caused to execute the image recognition method according to any one of claims 1-8.
  10. 一种存储有计算机程序的计算机可读存储介质,其中,该程序被处理器执行时实现如权利要求1-8任一所述的图像识别方法。A computer-readable storage medium storing a computer program, wherein the program is executed by a processor to implement the image recognition method according to any one of claims 1-8.
PCT/CN2019/110026 2019-09-09 2019-10-08 Image identification method, system, and storage medium WO2021046951A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910846731.0A CN110705601A (en) 2019-09-09 2019-09-09 Transformer substation equipment oil leakage image identification method based on single-stage target detection
CN201910846731.0 2019-09-09

Publications (1)

Publication Number Publication Date
WO2021046951A1 true WO2021046951A1 (en) 2021-03-18

Family

ID=69195096

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/110026 WO2021046951A1 (en) 2019-09-09 2019-10-08 Image identification method, system, and storage medium

Country Status (2)

Country Link
CN (1) CN110705601A (en)
WO (1) WO2021046951A1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113011338A (en) * 2021-03-19 2021-06-22 华南理工大学 Lane line detection method and system
CN113627468A (en) * 2021-07-01 2021-11-09 浙江安防职业技术学院 Non-local neural network image processing method and system based on unsupervised learning
CN113670524A (en) * 2021-07-13 2021-11-19 江铃汽车股份有限公司 Detection method and detection system for fuel leakage in automobile collision
CN114648685A (en) * 2022-03-23 2022-06-21 成都臻识科技发展有限公司 Method and system for converting anchor-free algorithm into anchor-based algorithm
CN114693605A (en) * 2022-03-07 2022-07-01 重庆亲禾智千科技有限公司 Deepstream-based road crack detection method
TWI771010B (en) * 2021-05-20 2022-07-11 鴻海精密工業股份有限公司 Defect detection method, computer device, and storage medium
CN115393639A (en) * 2022-08-16 2022-11-25 广州市玄武无线科技股份有限公司 Intelligent marking method and system for commodities, terminal equipment and readable storage medium
CN115481694A (en) * 2022-09-26 2022-12-16 南京星环智能科技有限公司 Data enhancement method, device, equipment and storage medium for training sample set
CN116150417A (en) * 2023-04-19 2023-05-23 上海维智卓新信息科技有限公司 Multi-scale multi-fusion image retrieval method and device
CN116587327A (en) * 2023-06-20 2023-08-15 广东电网有限责任公司广州供电局 Motion control system, live working robot detection method and related equipment
CN117781661A (en) * 2024-02-27 2024-03-29 广东金湾高景太阳能科技有限公司 Silicon wafer drying improvement method and device based on D-LKA network model
CN117781661B (en) * 2024-02-27 2024-05-14 广东金湾高景太阳能科技有限公司 Silicon wafer drying improvement method and device based on D-LKA network model

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110705601A (en) * 2019-09-09 2020-01-17 安徽继远软件有限公司 Transformer substation equipment oil leakage image identification method based on single-stage target detection
CN111783941B (en) * 2020-06-07 2024-03-29 北京化工大学 Mechanical equipment diagnosis classification method based on probability confidence convolutional neural network
CN112036463A (en) * 2020-08-26 2020-12-04 国家电网有限公司 Power equipment defect detection and identification method based on deep learning
CN113033322A (en) * 2021-03-02 2021-06-25 国网江苏省电力有限公司南通供电分公司 Method for identifying hidden danger of oil leakage of transformer substation oil filling equipment based on deep learning
CN113065416A (en) * 2021-03-16 2021-07-02 深圳供电局有限公司 Leakage monitoring device integrated with transformer substation video monitoring device, method and medium
CN114119594A (en) * 2021-12-06 2022-03-01 华能东莞燃机热电有限责任公司 Oil leakage detection method and device based on deep learning
CN115457297B (en) * 2022-08-23 2023-09-26 中国航空油料集团有限公司 Oil leakage detection method and device for aviation oil depot and aviation oil safety operation and maintenance system
CN116977826B (en) * 2023-08-14 2024-03-22 北京航空航天大学 Reconfigurable neural network target detection method under edge computing architecture

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104200442A (en) * 2014-09-19 2014-12-10 西安电子科技大学 Improved canny edge detection based non-local means MRI (magnetic resonance image) denoising method
US20160180151A1 (en) * 2014-12-17 2016-06-23 Google Inc. Generating numeric embeddings of images
CN107220618A (en) * 2017-05-25 2017-09-29 中国科学院自动化研究所 Method for detecting human face and device, computer-readable recording medium, equipment
CN110705601A (en) * 2019-09-09 2020-01-17 安徽继远软件有限公司 Transformer substation equipment oil leakage image identification method based on single-stage target detection

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106340034B (en) * 2016-08-22 2019-01-29 成都信息工程大学 A kind of transformer Oil Leakage Detecting method
CN207807703U (en) * 2017-10-30 2018-09-04 国网福建省电力有限公司 A kind of equipment permeability detection device combined with crusing robot
CN109325520B (en) * 2018-08-24 2021-06-29 北京航空航天大学 Method, device and system for checking petroleum leakage
CN109780451B (en) * 2018-12-20 2021-01-05 国电大渡河沙坪水电建设有限公司 Power plant speed regulator oil circuit inspection method based on machine vision

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104200442A (en) * 2014-09-19 2014-12-10 西安电子科技大学 Improved canny edge detection based non-local means MRI (magnetic resonance image) denoising method
US20160180151A1 (en) * 2014-12-17 2016-06-23 Google Inc. Generating numeric embeddings of images
CN107220618A (en) * 2017-05-25 2017-09-29 中国科学院自动化研究所 Method for detecting human face and device, computer-readable recording medium, equipment
CN110705601A (en) * 2019-09-09 2020-01-17 安徽继远软件有限公司 Transformer substation equipment oil leakage image identification method based on single-stage target detection

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113011338B (en) * 2021-03-19 2023-08-22 华南理工大学 Lane line detection method and system
CN113011338A (en) * 2021-03-19 2021-06-22 华南理工大学 Lane line detection method and system
TWI771010B (en) * 2021-05-20 2022-07-11 鴻海精密工業股份有限公司 Defect detection method, computer device, and storage medium
CN113627468A (en) * 2021-07-01 2021-11-09 浙江安防职业技术学院 Non-local neural network image processing method and system based on unsupervised learning
CN113670524A (en) * 2021-07-13 2021-11-19 江铃汽车股份有限公司 Detection method and detection system for fuel leakage in automobile collision
CN114693605A (en) * 2022-03-07 2022-07-01 重庆亲禾智千科技有限公司 Deepstream-based road crack detection method
CN114648685A (en) * 2022-03-23 2022-06-21 成都臻识科技发展有限公司 Method and system for converting anchor-free algorithm into anchor-based algorithm
CN115393639A (en) * 2022-08-16 2022-11-25 广州市玄武无线科技股份有限公司 Intelligent marking method and system for commodities, terminal equipment and readable storage medium
CN115393639B (en) * 2022-08-16 2023-08-11 广州市玄武无线科技股份有限公司 Intelligent commodity marking method, intelligent commodity marking system, terminal equipment and readable storage medium
CN115481694A (en) * 2022-09-26 2022-12-16 南京星环智能科技有限公司 Data enhancement method, device, equipment and storage medium for training sample set
CN115481694B (en) * 2022-09-26 2023-09-05 南京星环智能科技有限公司 Data enhancement method, device and equipment for training sample set and storage medium
CN116150417A (en) * 2023-04-19 2023-05-23 上海维智卓新信息科技有限公司 Multi-scale multi-fusion image retrieval method and device
CN116150417B (en) * 2023-04-19 2023-08-04 上海维智卓新信息科技有限公司 Multi-scale multi-fusion image retrieval method and device
CN116587327A (en) * 2023-06-20 2023-08-15 广东电网有限责任公司广州供电局 Motion control system, live working robot detection method and related equipment
CN117781661A (en) * 2024-02-27 2024-03-29 广东金湾高景太阳能科技有限公司 Silicon wafer drying improvement method and device based on D-LKA network model
CN117781661B (en) * 2024-02-27 2024-05-14 广东金湾高景太阳能科技有限公司 Silicon wafer drying improvement method and device based on D-LKA network model

Also Published As

Publication number Publication date
CN110705601A (en) 2020-01-17

Similar Documents

Publication Publication Date Title
WO2021046951A1 (en) Image identification method, system, and storage medium
CN107944396B (en) Knife switch state identification method based on improved deep learning
CN110827251B (en) Power transmission line locking pin defect detection method based on aerial image
TWI742382B (en) Neural network system for vehicle parts recognition executed by computer, method for vehicle part recognition through neural network system, device and computing equipment for vehicle part recognition
CN109977808B (en) Wafer surface defect mode detection and analysis method
TWI729405B (en) Method and device for optimizing damage detection results
CN109118479B (en) Capsule network-based insulator defect identification and positioning device and method
WO2023155069A1 (en) Deep-learning-based surface defect detection method for mobile phone battery
WO2020177432A1 (en) Multi-tag object detection method and system based on target detection network, and apparatuses
CN111444939B (en) Small-scale equipment component detection method based on weak supervision cooperative learning in open scene of power field
CN112200143A (en) Road disease detection method based on candidate area network and machine vision
CN114972213A (en) Two-stage mainboard image defect detection and positioning method based on machine vision
CN106951915A (en) A kind of one-dimensional range profile multiple Classifiers Combination method of identification based on classification confidence level
CN115909093A (en) Power equipment fault detection method based on unmanned aerial vehicle inspection and infrared image semantic segmentation
CN113469950A (en) Method for diagnosing abnormal heating defect of composite insulator based on deep learning
CN116012291A (en) Industrial part image defect detection method and system, electronic equipment and storage medium
CN115439458A (en) Industrial image defect target detection algorithm based on depth map attention
CN111882554B (en) SK-YOLOv 3-based intelligent power line fault detection method
CN115830302A (en) Multi-scale feature extraction and fusion power distribution network equipment positioning identification method
CN115631154A (en) Power equipment state monitoring and analyzing method and system
CN114283280A (en) Water surface floating garbage identification method based on improved convolutional neural network
CN113313678A (en) Automatic sperm morphology analysis method based on multi-scale feature fusion
Xu et al. Research on the Application of Deep Learning Object Detection in Rust Defect Detection of Power Equipment
CN112861670A (en) Transmission line hardware detection method and system
CN111696070A (en) Multispectral image fusion power internet of things fault point detection method based on deep learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19945029

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19945029

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 27.09.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19945029

Country of ref document: EP

Kind code of ref document: A1