CN113256581A - Automatic defect sample labeling method and system based on visual attention modeling fusion - Google Patents

Automatic defect sample labeling method and system based on visual attention modeling fusion

Info

Publication number
CN113256581A
CN113256581A
Authority
CN
China
Prior art keywords
defect
image
saliency map
fusion
labeling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110555658.9A
Other languages
Chinese (zh)
Other versions
CN113256581B (en)
Inventor
孙佳
王鹏
黎万义
罗永康
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CN202110555658.9A priority Critical patent/CN113256581B/en
Publication of CN113256581A publication Critical patent/CN113256581A/en
Application granted granted Critical
Publication of CN113256581B publication Critical patent/CN113256581B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0004Industrial image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration by the use of more than one image, e.g. averaging, subtraction
    • G06T5/94Dynamic range modification of images or parts thereof based on local image properties, e.g. for local contrast enhancement
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention belongs to the technical field of part surface damage detection, and particularly relates to an automatic defect sample labeling method and system based on visual attention modeling fusion, aiming at solving the problem that the prior art cannot automatically label the surface defects of precision part samples. The method performs superpixel segmentation on an input original sample image, automatically dividing it into a plurality of image regions; a saliency map based on visual attention modeling fusion is calculated for each image region, and the salient regions in the saliency map are segmented by a marker array segmentation method to serve as defect candidate regions; features of the defect candidate regions are extracted and clustered to obtain defect samples of a plurality of different categories, each defect sample is labeled according to a small number of labels or known label definition rules, and unknown categories are labeled manually through human-computer interaction. The invention can greatly reduce the workload of manual labeling, with high labeling efficiency and good accuracy.

Description

Automatic defect sample labeling method and system based on visual attention modeling fusion
Technical Field
The invention belongs to the technical field of part surface damage detection, and particularly relates to a method and a system for automatically labeling a defect sample based on visual attention modeling fusion.
Background
Defect detection of precision devices is an important link in device production. Because defects have no fixed form, exhibit low contrast with the background, and slight defects are hard to find, defects have long been detected by the human naked eye in actual industrial production. As a key technology of image processing and a guarantee of the quality of automated production, research on appearance detection methods began as early as the 1980s and has attracted extensive study by scholars in China and abroad. Vision-based defect sample detection techniques can be divided into two categories: traditional defect detection methods, which extract features manually and then classify according to those features; and learning-based defect detection methods, which automatically learn appearance defect characteristics through machine learning and then perform detection and classification. Because appearance defects vary in form and have low contrast, traditional defect detection methods struggle to achieve high detection precision.
An important assumption that must be met for the successful application of machine learning is that the number of training samples is sufficiently large. In practice, however, it is often difficult to collect enough labeled data samples. For this reason, limited-sample learning methods have been proposed in the prior art, mainly of two types: one solves the limited-sample problem from the training data, the other from the model. The data-based limited-sample learning methods expand limited sample data through existing prior knowledge, such as translation, flipping, cropping, scale transformation and mirroring, to meet the requirements of model training on sample quantity.
Collecting a large number of real samples and then labeling the typical defect targets can therefore fundamentally solve the problem of sample shortage; at present, however, no related method exists for automatically labeling defect samples of precision devices.
Disclosure of Invention
In order to solve the above problem, namely that the prior art cannot automatically label the surface defects of precision part samples, the invention provides a method and a system for automatically labeling defect samples based on visual attention modeling fusion.
A first aspect of the invention provides a method for automatically labeling a defect sample based on visual attention modeling fusion, which comprises the following steps: step S100, performing superpixel segmentation on an input original sample image, and automatically segmenting the image into a plurality of image areas with preset characteristics;
step S200, respectively calculating a saliency map based on visual attention modeling fusion for each image region, and segmenting the saliency region in the saliency map by using a marker array segmentation method to serve as a defect candidate region;
and step S300, extracting the characteristics of the defect candidate region, clustering to obtain a plurality of defect samples of different categories, labeling each defect sample according to a small number of labels or known label definition rules, and manually labeling unknown categories through man-machine interaction.
In some preferred embodiments, the superpixel segmentation in step S100 is a SLIC superpixel segmentation algorithm;
the image area is a super pixel; the superpixel is composed of adjacent pixel points with similar characteristics.
In some preferred embodiments, the method for calculating the saliency map in step S200 is specifically: calculating a first saliency map based on background contrast for a superpixel region obtained after each superpixel segmentation;
calculating a second saliency map based on sparse low-rank decomposition for a super-pixel region obtained after each super-pixel is segmented;
and performing fusion reconstruction on the first saliency map and the second saliency map by using a saliency difference modeling method to obtain a final saliency map.
In some preferred embodiments, the fused saliency value corresponding to the saliency map is M_i, i = 1, 2, …, N, obtained by fusing M_i^B, the saliency value corresponding to said first saliency map, with M_i^L, the saliency value corresponding to said second saliency map, where N is the number of superpixel regions; the fusion formula itself appears in the original filing only as embedded formula images.
In some preferred embodiments, the method for obtaining the defect candidate regions by segmentation specifically includes: sorting the obtained N fused saliency values from small to large; if the fused saliency value of a superpixel region is greater than a preset saliency threshold, the corresponding marker array entry is set to Seq_j = 1, otherwise Seq_j = 0, yielding a marker array Seq; the image is mapped into a binary image using the marker array, where the regions with value 1 are the defect candidate regions.
In some preferred embodiments, the clustering in step S300 specifically includes:
performing image enhancement on the defect candidate area image by utilizing histogram statistics;
performing feature extraction on the defect candidate region image after image enhancement;
normalizing the extracted features to construct feature vectors;
and clustering the feature vectors to obtain a plurality of different categories.
A second aspect of the invention provides an automatic defect sample labeling system based on visual attention modeling fusion, which comprises an image super-pixel segmentation module, a defect candidate region segmentation module and a candidate region automatic labeling module, wherein the defect candidate region segmentation module is in communication connection with the image super-pixel segmentation module;
the image super-pixel segmentation module is configured to perform super-pixel segmentation on an input original sample image and automatically segment the image into a plurality of image areas with preset characteristics;
the defect candidate region segmentation module is configured to respectively calculate a saliency map based on visual attention modeling fusion for each image region, and segment the saliency region in the saliency map by using a marker array segmentation method to serve as a defect candidate region;
the automatic candidate region labeling module is configured to extract the characteristics of the defect candidate regions, perform clustering to obtain a plurality of different classes, label the class of samples according to a small number of labels or known label definition rules, and perform manual labeling on unknown classes through man-machine interaction.
In some preferred embodiments, the method for calculating the saliency map specifically includes: calculating a first saliency map based on background contrast for a superpixel region obtained after each superpixel segmentation;
calculating a second saliency map based on sparse low-rank decomposition for a super-pixel region obtained after each super-pixel is segmented;
and performing fusion reconstruction on the first saliency map and the second saliency map by using a saliency difference modeling method to obtain a final saliency map.
A third aspect of the present invention provides an electronic device comprising: at least one processor; and
a memory communicatively coupled to the at least one processor; wherein
the memory stores instructions executable by the processor to implement the method for automatically labeling defect samples based on visual attention modeling fusion as set forth in any one of the above.
In a fourth aspect of the present invention, a computer-readable storage medium is provided, where computer instructions are stored in the computer-readable storage medium for execution by the computer to implement any one of the above methods for automatically labeling a defect sample based on visual attention modeling fusion.
The automatic labeling method and system for the defect sample based on visual attention modeling fusion can greatly reduce the workload of manual labeling, provide enough samples for a machine learning model, play an important role in industrial application and provide effective data support for research of an appearance detection positioning method.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is a schematic diagram illustrating the steps of an embodiment of a method for automatically labeling a defect sample based on visual attention modeling fusion according to the present invention;
fig. 2 is a specific flowchart of the method for automatically labeling a defect sample based on visual attention modeling fusion in the present invention.
FIG. 3 is a block diagram of a computer system of a server for implementing embodiments of the method, system, and apparatus of the present application.
Detailed Description
In order to make the embodiments, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention will be described clearly and completely with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. It should be understood by those skilled in the art that these embodiments are only for explaining the technical principle of the present invention, and are not intended to limit the scope of the present invention.
A first aspect of the invention provides a method for automatically labeling a defect sample based on visual attention modeling fusion, comprising the following steps: step S100, performing superpixel segmentation on an input original sample image, and automatically segmenting the image into a plurality of image areas with preset characteristics; step S200, respectively calculating a saliency map based on visual attention modeling fusion for each image region, and segmenting the salient regions in the saliency map by using a marker array segmentation method to serve as defect candidate regions; and step S300, extracting the characteristics of the defect candidate regions, clustering to obtain a plurality of defect samples of different categories, labeling each defect sample according to a small number of labels or known label definition rules, and manually labeling unknown categories through man-machine interaction.
A second aspect of the invention provides a system for automatically labeling a defect sample based on visual attention modeling fusion, which comprises an image super-pixel segmentation module, a defect candidate region segmentation module and a candidate region automatic labeling module, wherein the defect candidate region segmentation module is in communication connection with the image super-pixel segmentation module; the image super-pixel segmentation module is configured to perform super-pixel segmentation on an input original sample image and automatically segment the image into a plurality of image areas with preset characteristics; the defect candidate region segmentation module is configured to respectively calculate a saliency map based on visual attention modeling fusion for each image region, and segment the salient regions in the saliency map by using a marker array segmentation method to serve as defect candidate regions; the candidate region automatic labeling module is configured to extract the characteristics of the defect candidate regions, perform clustering to obtain a plurality of different classes, label the classes of samples according to a small number of labels or known label definition rules, and perform manual labeling on unknown classes through human-computer interaction.
The automatic defect sample labeling method and system based on visual attention modeling fusion can effectively solve the problems of the traditional manual labeling of precision-device defect samples: low efficiency, labeling standards that cannot be unified because of human subjective factors, and heavy labeling workload. The method performs superpixel segmentation on an input original sample image, automatically dividing the image into a plurality of image regions with a certain consistency; for each image region, it computes saliency detection based on background contrast and saliency detection based on sparse low-rank decomposition to obtain two saliency maps, fuses and reconstructs the two saliency maps by a saliency difference modeling method, and segments the salient regions by the marker array segmentation method; it then extracts features of the salient regions and clusters them to obtain N different classes, and finally labels the samples according to a small number of labels or known label definition rules. The method is applied to automatic sample labeling in the defect detection of precision devices, and has high labeling efficiency and good accuracy.
The invention is further described below through specific embodiments with reference to the accompanying drawings.
Referring to fig. 1 and 2, a first aspect of the present invention provides a method for automatically labeling a defect sample based on visual attention modeling fusion, comprising the following steps. Step S100: perform superpixel segmentation on an input original sample image, automatically segmenting the image into a plurality of image regions with preset characteristics; in this embodiment, the image regions with preset characteristics are image regions with a certain consistency. The superpixel segmentation uses the SLIC superpixel segmentation algorithm; each image region is a superpixel, composed of adjacent pixel points with similar characteristics. For an input image I, the SLIC superpixel segmentation algorithm segments the image into a plurality of superpixels P_i; since each superpixel P_i consists of adjacent pixel points with similar characteristics, the integrity of the salient objects detected subsequently is ensured and the number of image blocks is reduced.
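As an illustration of step S100, the following is a minimal sketch of the superpixel segmentation using scikit-image's SLIC implementation; the segment count, compactness value and file name are illustrative assumptions, not parameters taken from the patent.

```python
import numpy as np
from skimage.io import imread
from skimage.segmentation import slic

image = imread("sample.png")  # hypothetical input sample image
# slic returns an integer label map; pixels sharing a label form one superpixel P_i
labels = slic(image, n_segments=300, compactness=10.0, start_label=0)
n_superpixels = int(labels.max()) + 1  # N, the number of superpixel regions
```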
Step S200: for each image region, calculate a saliency map based on visual attention modeling fusion, and segment the salient regions in the saliency map by the marker array segmentation method to serve as defect candidate regions. Specifically, all superpixels P_j adjacent to the image border are taken to form an image background set B, and for each superpixel P_i outside the background, its saliency M_i^B with respect to the background set B is calculated (the exact formula appears in the filing only as an image; reconstructed from the variable definitions, it takes the form):

M_i^B = Σ_{P_j ∈ B} d(c_i, c_j) · exp(−d(l_i, l_j)² / (2σ_l²))

where d(c_i, c_j) is the Euclidean distance between superpixels P_i and P_j in color space, d(l_i, l_j) is their Euclidean distance in location space, and σ_l controls the weight of the spatial distance in the saliency value calculation. At the same time, for each superpixel P_i obtained in the previous step, a saliency value M_i^L based on sparse low-rank decomposition is computed. The two saliency maps thus obtained are then fused and reconstructed by the saliency difference modeling method.
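A minimal sketch of the background-contrast computation under the reconstructed formula above; the Gaussian spatial weighting and the σ_l default are assumptions, since the filing's exact expression is rendered only as an image.

```python
import numpy as np

def background_contrast_saliency(colors, centers, background_ids, sigma_l=0.25):
    """colors: (N, 3) mean color per superpixel; centers: (N, 2) normalized
    superpixel centroids; background_ids: indices of border superpixels (set B)."""
    bg = np.asarray(background_ids)
    bg_set = set(bg.tolist())
    saliency = np.zeros(len(colors))
    for i in range(len(colors)):
        if i in bg_set:
            continue  # background superpixels keep zero saliency here
        d_color = np.linalg.norm(colors[i] - colors[bg], axis=1)  # d(c_i, c_j)
        d_loc = np.linalg.norm(centers[i] - centers[bg], axis=1)  # d(l_i, l_j)
        w = np.exp(-(d_loc ** 2) / (2.0 * sigma_l ** 2))  # spatial weight via sigma_l
        saliency[i] = np.sum(d_color * w)
    return saliency
```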
The method for calculating the saliency map (that is, the method for fusion reconstruction) specifically comprises: for the superpixel region obtained after each superpixel segmentation, calculating a first saliency map based on background contrast; for the superpixel region obtained after each superpixel segmentation, calculating a second saliency map based on sparse low-rank decomposition; and fusing and reconstructing the first saliency map and the second saliency map by a saliency difference modeling method to obtain the final saliency map.
The fused saliency value corresponding to the saliency map is M_i, i = 1, 2, …, N, obtained by combining M_i^B, the saliency value corresponding to the first saliency map, with M_i^L, the saliency value corresponding to the second saliency map, where N is the number of superpixel regions; the fusion formula itself appears in the original filing only as embedded formula images.
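Because the saliency difference modeling fusion rule is present in the filing only as formula images, the following stand-in is an assumption: both maps are min-max normalized, and superpixels where the two models disagree strongly are down-weighted. It should be read as a placeholder for the patented rule, not the rule itself.

```python
import numpy as np

def fuse_saliency(m_b, m_l):
    """m_b: (N,) background-contrast values M_i^B; m_l: (N,) sparse
    low-rank values M_i^L. Returns assumed fused values M_i."""
    def norm(v):
        return (v - v.min()) / (v.max() - v.min() + 1e-12)
    m_b, m_l = norm(m_b), norm(m_l)
    agreement = 1.0 - np.abs(m_b - m_l)  # penalize inter-model difference
    return norm(0.5 * (m_b + m_l) * agreement)
```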
The salient regions of the final saliency map obtained by fusion reconstruction are segmented by the marker array segmentation method, and each salient region is a defect candidate region.
The method for obtaining the defect candidate regions by segmentation specifically comprises: sorting the obtained N fused saliency values from small to large; if the fused saliency value of a superpixel region is greater than the preset saliency threshold, the corresponding marker array entry is set to Seq_j = 1, otherwise Seq_j = 0, yielding the marker array Seq; the image is mapped into a binary image using the marker array, where the regions with value 1 are defect candidate regions and the regions with value 0 are normal surface regions.
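A sketch of the marker-array segmentation just described; the threshold value is an illustrative assumption, and the small-to-large sorting mentioned in the text can equivalently support a percentile-style threshold.

```python
import numpy as np

def segment_candidates(labels, fused_saliency, threshold=0.6):
    """labels: (H, W) superpixel label map (labels starting at 0, as in the
    SLIC sketch above); fused_saliency: (N,) fused value M_i per superpixel."""
    # marker array Seq: Seq_j = 1 where the fused value exceeds the threshold, else 0
    seq = (fused_saliency > threshold).astype(np.uint8)
    # map each superpixel's mark back onto the pixel grid:
    # 1 = defect candidate region, 0 = normal surface region
    return seq[labels]
```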
And step S300, extracting the characteristics of the defect candidate region, clustering to obtain a plurality of defect samples of different categories, labeling each defect sample according to a small number of labels or known label definition rules, and manually labeling unknown categories through man-machine interaction.
The clustering specifically comprises: performing image enhancement on the defect candidate region image using histogram statistics; extracting features from the enhanced defect candidate region image (in this embodiment, five features are taken: color, gray level, area, size and texture); and normalizing the extracted features to construct a feature vector x_i = (x_i1, x_i2, x_i3, x_i4, x_i5), where i denotes the i-th superpixel.
The feature vectors are clustered by the DBSCAN clustering method to obtain m clusters of different classes C_1, C_2, …, C_m; defect samples within the same cluster obtained by clustering have similar characteristic parameters.
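A sketch of this clustering stage: histogram equalization for enhancement, a five-dimensional feature vector (color, gray level, area, size, texture), min-max normalization, and DBSCAN. The concrete feature estimators and the DBSCAN parameters here are illustrative assumptions.

```python
import cv2
import numpy as np
from sklearn.cluster import DBSCAN

def region_features(region_bgr):
    """Five assumed features for one defect candidate region image (uint8 BGR)."""
    gray = cv2.cvtColor(region_bgr, cv2.COLOR_BGR2GRAY)
    enhanced = cv2.equalizeHist(gray)  # histogram-based image enhancement
    return [
        float(region_bgr.mean()),          # color: mean intensity over channels
        float(enhanced.mean()),            # gray level
        float(gray.size),                  # area in pixels
        float(max(region_bgr.shape[:2])),  # size: longest side
        float(enhanced.std()),             # texture: std-dev as a crude proxy
    ]

def cluster_regions(regions):
    x = np.array([region_features(r) for r in regions], dtype=np.float64)
    x = (x - x.min(axis=0)) / (x.max(axis=0) - x.min(axis=0) + 1e-12)  # normalize
    return DBSCAN(eps=0.3, min_samples=3).fit_predict(x)  # cluster ids; -1 = noise
```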
For a known labeled defect sample set D, the feature vector y_j = (y_j1, y_j2, y_j3, y_j4, y_j5) of each sample is calculated, where j denotes the j-th known sample, whose label is Label_j.
The similarity between the known sample y_j and each cluster C_m obtained by clustering is calculated; if y_j ∈ C_m, all sample labels within C_m are defined as Label_j.
Clusters C_k, C_{k+1}, …, C_{k+N} obtained by clustering that contain no sample from the known sample set D are labeled Unknown_0, Unknown_1, …, Unknown_N.
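The label-propagation rule can be sketched as follows: each known labeled sample adopts its nearest cluster (nearest-centroid assignment is an assumption; the text only states y_j ∈ C_m), all members of that cluster inherit Label_j, and unmatched clusters receive Unknown_k names pending manual naming.

```python
import numpy as np

def propagate_labels(features, cluster_ids, known_feats, known_labels):
    """features: (N, 5) normalized vectors; cluster_ids: (N,) DBSCAN output;
    known_feats / known_labels: feature vectors y_j and labels of set D."""
    valid = sorted(c for c in set(cluster_ids.tolist()) if c != -1)
    centroids = {c: features[cluster_ids == c].mean(axis=0) for c in valid}
    names = {}
    for y, label in zip(known_feats, known_labels):
        nearest = min(centroids, key=lambda c: np.linalg.norm(y - centroids[c]))
        names[nearest] = label  # every sample in that cluster inherits Label_j
    k = 0
    for c in valid:
        if c not in names:
            names[c] = f"Unknown_{k}"  # unmatched clusters await manual naming
            k += 1
    return [names.get(int(c), "noise") for c in cluster_ids]
```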
for the categories, the categories can be named according to a preset naming rule, or after manual naming is completed through human-computer interaction, new labels are uniformly modified.
A second aspect of the invention provides an automatic defect sample labeling system based on visual attention modeling fusion, which comprises an image super-pixel segmentation module, a defect candidate region segmentation module and a candidate region automatic labeling module, wherein the defect candidate region segmentation module is in communication connection with the image super-pixel segmentation module, and the candidate region automatic labeling module is in communication connection with the defect candidate region segmentation module.
The image super-pixel segmentation module is used for carrying out super-pixel segmentation on an input original sample image and automatically segmenting the image into a plurality of image areas with certain consistency.
The defect candidate region segmentation module is used for respectively calculating a saliency map based on visual attention modeling fusion for each image region, and segmenting the saliency regions in the saliency map by using a marker array segmentation method, wherein each saliency region is a defect candidate region.
And the automatic candidate region labeling module is used for extracting the characteristics of the defect candidate regions, clustering the characteristics to obtain a plurality of different classes, labeling the class of samples according to a small number of labels or known label definition rules, and manually labeling unknown classes through man-machine interaction.
Preferably, the image super-pixel segmentation module divides adjacent pixel points with similar characteristics in the input sample image into the same image area through SLIC super-pixel segmentation, so that the integrity of the subsequent detection of the significant object is ensured, and the number of image blocks can be reduced.
Preferably, the calculation method of the saliency map is specifically: calculating a first saliency map based on background contrast for a superpixel region obtained after each superpixel segmentation; calculating a second saliency map based on sparse low-rank decomposition for a super-pixel region obtained after each super-pixel is segmented; and performing fusion reconstruction on the first saliency map and the second saliency map by using a saliency difference modeling method to obtain a final saliency map.
Further, the specific calculation process of the defect candidate region segmentation module for calculating the saliency map based on visual attention modeling fusion for each image region respectively includes:
the fusion significance value corresponding to the significance map is Mi
Figure BDA0003077105920000101
Wherein the content of the first and second substances,
Figure BDA0003077105920000102
Figure BDA0003077105920000105
is the saliency value corresponding to the first saliency map,
Figure BDA0003077105920000103
the corresponding significant value of the second significant map;
Figure BDA0003077105920000104
n is the number of the super pixel areas.
The defect candidate region segmentation module segments the salient regions by the marker array segmentation method to obtain all defect candidate regions, specifically: the fused saliency values M_i calculated above for all superpixel image blocks P_i in the image are sorted from small to large; if the saliency value of an image block is greater than the set saliency threshold T_H, the marker array entry corresponding to that image block is set to Seq_j = 1, otherwise Seq_j = 0, yielding the marker array Seq; the image is then mapped into a binary image using the marker array, where the regions with value 1 are defect candidate regions and the regions with value 0 are normal surface regions.
Further, the candidate region automatic labeling module specifically: performs feature extraction on the defect candidate regions segmented by the previous module, including color, gray level, contour, entropy and the like, to obtain the feature vector of each candidate region; clusters the feature vectors to obtain a plurality of different classes; automatically labels the defect candidate regions according to labeled sample labels or known label definition rules; and for classes different from all known labels, defines a new class by default, which can be named manually or updated later when a definition rule becomes available.
The clustering method specifically comprises the following steps: performing image enhancement on the defect candidate area image by utilizing histogram statistics; and performing feature extraction on the processed candidate region image, including but not limited to: color, gray scale, area, size, texture, etc.; normalizing the extracted features to construct feature vectors; and clustering the feature vectors to obtain a plurality of different categories.
The labeling method specifically comprises the following steps: the candidate defect regions, namely the extracted defect samples can be divided into a plurality of different categories through a clustering algorithm, and the defect samples in the same category have similar characteristic parameters;
calculating a feature vector of a known defect sample image with a label;
classifying the known defect samples with labels according to the clustering rule;
labeling all candidate samples classified as the same category as the known sample as the labels carried by the known sample;
after clustering, the classes whose characteristic parameters match no known sample are labeled Unknown_1, Unknown_2, and so on;
for the clustering categories without known samples, the categories can be named according to a preset naming rule, or manually named through human-computer interaction, and then uniformly modified into new labels.
In practical application, it is difficult for defect detection methods based on deep learning to collect enough labeled data samples for model training. The traditional data expansion method augments limited sample data through prior knowledge, such as translation, flipping, cropping, scale transformation and mirroring, to meet the requirements of model training on sample quantity. Each original sample is augmented by artificially defined transformation rules; since this only applies combined transformations to the original image without increasing the diversity of the samples, it cannot provide new features. The invention provides an automatic labeling method for precision-device defect samples based on visual attention modeling fusion, which rapidly and effectively labels the defects on the surfaces of precision-device samples by collecting a large number of real samples and labeling the typical defect targets, thereby fundamentally solving the problem of sample shortage. Automatic sample labeling can greatly reduce the workload of manual labeling and provide sufficient samples for machine learning models. The method can not only play an important role in industrial application, but also provide effective data support for research on appearance detection and localization methods.
An electronic apparatus according to a third embodiment of the present invention includes: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the processor to implement the method for automatically labeling defect samples based on visual attention modeling fusion as set forth in any one of the above.
A computer-readable storage medium of a fourth embodiment of the present invention stores computer instructions for execution by the computer to implement the method for automatically labeling a defect sample based on visual attention modeling fusion as described in any one of the above.
It can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working processes and related descriptions of the storage system and the processing system described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
Reference is now made to FIG. 3, which illustrates a block diagram of a computer system of a server for implementing embodiments of the method, system, and apparatus of the present application. The server shown in fig. 3 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 3, the computer system includes a Central Processing Unit (CPU)601, which can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 602 or a program loaded from a storage section 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data necessary for system operation are also stored. The CPU 601, ROM 602, and RAM 603 are connected to each other via a bus 604. An Input/Output (I/O) interface 605 is also connected to bus 604.
The following components are connected to the I/O interface 605: an input portion 606 including a keyboard, a mouse, and the like; an output section 607 including a Cathode Ray Tube (CRT) or Liquid Crystal Display (LCD) display, a speaker, and the like; a storage section 608 including a hard disk and the like; and a communication section 609 including a network interface card such as a LAN (Local Area Network) card, a modem, or the like. The communication section 609 performs communication processing via a network such as the Internet. The drive 610 is also connected to the I/O interface 605 as needed. A removable medium 611, such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, is mounted on the drive 610 as necessary, so that a computer program read therefrom is installed into the storage section 608 as necessary.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 609, and/or installed from the removable medium 611. The computer program performs the above-described functions defined in the method of the present application when executed by a Central Processing Unit (CPU) 601. It should be noted that the computer readable medium mentioned above in the present application may be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, or device. In this application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present application may be written in any combination of one or more programming languages, including an object-oriented programming language such as Java, Smalltalk or C++, and conventional procedural programming languages such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The terms "first," "second," and the like are used for distinguishing between similar elements and not necessarily for describing or implying a particular order or sequence.
It should be noted that in the description of the present invention, the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc. indicating the directions or positional relationships are based on the directions or positional relationships shown in the drawings, which are only for convenience of description, and do not indicate or imply that the system or the element must have a specific orientation, be constructed and operated in a specific orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
Furthermore, it should be noted that, in the description of the present invention, unless otherwise explicitly specified or limited, the terms "mounted," "connected," and "coupled" are to be construed broadly; for example, a connection may be fixed, detachable or integral; mechanical or electrical; direct, or indirect through intervening media, or internal to two elements. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to the specific situation.
The terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, article, or apparatus/system that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, article, or apparatus/system.
So far, the technical solutions of the present invention have been described in connection with the preferred embodiments shown in the drawings, but it is easily understood by those skilled in the art that the scope of the present invention is obviously not limited to these specific embodiments. Equivalent changes or substitutions of related technical features can be made by those skilled in the art without departing from the principle of the invention, and the technical scheme after the changes or substitutions can fall into the protection scope of the invention.

Claims (10)

1. A method for automatically labeling a defect sample based on visual attention modeling fusion is characterized by comprising the following steps:
step S100, performing superpixel segmentation on an input original sample image, and automatically segmenting the image into a plurality of image areas with preset characteristics;
step S200, respectively calculating a saliency map based on visual attention modeling fusion for each image region, and segmenting the saliency region in the saliency map by using a marker array segmentation method to serve as a defect candidate region;
and step S300, extracting the characteristics of the defect candidate region, clustering to obtain a plurality of defect samples of different categories, labeling each defect sample according to a small number of labels or known label definition rules, and manually labeling unknown categories through human-computer interaction.
2. The method for automatically labeling defect samples based on visual attention modeling fusion as claimed in claim 1, wherein said superpixel segmentation in step S100 is SLIC superpixel segmentation algorithm;
the image area is a super pixel; the superpixel is composed of adjacent pixel points with similar characteristics.
3. The method for automatically labeling defect samples based on visual attention modeling fusion according to claim 1, wherein the method for calculating the saliency map in step S200 specifically comprises:
calculating a first saliency map based on background contrast for a superpixel region obtained after each superpixel segmentation;
calculating a second saliency map based on sparse low-rank decomposition for a super-pixel region obtained after each super-pixel is segmented;
and performing fusion reconstruction on the first saliency map and the second saliency map by using a saliency difference modeling method to obtain a final saliency map.
4. The method for automatically labeling defect samples based on visual attention modeling fusion as claimed in claim 3, wherein the fused saliency value corresponding to the saliency map is M_i, i = 1, 2, …, N, obtained by fusing M_i^B, the saliency value corresponding to said first saliency map, with M_i^L, the saliency value corresponding to said second saliency map, where N is the number of superpixel regions (the fusion formula appears in the original filing only as embedded formula images).
5. The method for automatically labeling the defect sample based on the visual attention modeling fusion as claimed in claim 1, wherein the method for segmenting and acquiring the defect candidate region specifically comprises:
sorting the obtained N fused saliency values from small to large; if the fused saliency value of a superpixel region is greater than a preset saliency threshold, the corresponding marker array entry is set to Seq_j = 1, otherwise Seq_j = 0, yielding a marker array Seq; the image is mapped into a binary image using the marker array, where the regions with value 1 are the defect candidate regions.
6. The method for automatically labeling defect samples based on visual attention modeling fusion according to claim 1, wherein the clustering in step S300 specifically comprises:
performing image enhancement on the defect candidate area image by utilizing histogram statistics;
performing feature extraction on the defect candidate region image after image enhancement;
normalizing the extracted features to construct feature vectors;
and clustering the feature vectors to obtain a plurality of different categories.
7. A defect sample automatic labeling system based on visual attention modeling fusion is characterized by comprising an image super-pixel segmentation module, a defect candidate region segmentation module and a candidate region automatic labeling module, wherein the defect candidate region segmentation module is in communication connection with the image super-pixel segmentation module;
the image super-pixel segmentation module is configured to perform super-pixel segmentation on an input original sample image and automatically segment the image into a plurality of image areas with preset characteristics;
the defect candidate region segmentation module is configured to respectively calculate a saliency map based on visual attention modeling fusion for each image region, and segment the saliency region in the saliency map by using a marker array segmentation method to serve as a defect candidate region;
the automatic labeling module of the candidate area is configured to extract the characteristics of the defect candidate area, perform clustering to obtain a plurality of different types, and label the type of sample according to a small number of labels or known label definition rules.
8. The system for automatically labeling defect samples based on visual attention modeling fusion as claimed in claim 7, wherein the method for calculating the saliency map is specifically:
calculating a first saliency map based on background contrast for a superpixel region obtained after each superpixel segmentation;
calculating a second saliency map based on sparse low-rank decomposition for a super-pixel region obtained after each super-pixel is segmented;
and performing fusion reconstruction on the first saliency map and the second saliency map by using a saliency difference modeling method to obtain a final saliency map.
9. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to at least one of the processors; wherein the memory stores instructions executable by the processor for implementing the method for automatic labeling of defect samples based on visual attention modeling fusion of any of claims 1-6.
10. A computer-readable storage medium storing computer instructions for execution by the computer to implement the method for automatic labeling of defect samples based on visual attention modeling fusion according to any one of claims 1 to 6.
CN202110555658.9A 2021-05-21 2021-05-21 Automatic defect sample labeling method and system based on visual attention modeling fusion Active CN113256581B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110555658.9A CN113256581B (en) 2021-05-21 2021-05-21 Automatic defect sample labeling method and system based on visual attention modeling fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110555658.9A CN113256581B (en) 2021-05-21 2021-05-21 Automatic defect sample labeling method and system based on visual attention modeling fusion

Publications (2)

Publication Number Publication Date
CN113256581A true CN113256581A (en) 2021-08-13
CN113256581B CN113256581B (en) 2022-09-02

Family

ID=77183465

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110555658.9A Active CN113256581B (en) 2021-05-21 2021-05-21 Automatic defect sample labeling method and system based on visual attention modeling fusion

Country Status (1)

Country Link
CN (1) CN113256581B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140254922A1 (en) * 2013-03-11 2014-09-11 Microsoft Corporation Salient Object Detection in Images via Saliency
CN109522908A (en) * 2018-11-16 2019-03-26 董静 Image significance detection method based on area label fusion
WO2020107717A1 (en) * 2018-11-30 2020-06-04 长沙理工大学 Visual saliency region detection method and apparatus
CN110717896A (en) * 2019-09-24 2020-01-21 东北大学 Plate strip steel surface defect detection method based on saliency label information propagation model

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
RUI HUANG et al.: "Saliency and co-saliency detection by low-rank multiscale fusion", 2015 IEEE International Conference on Multimedia and Expo *
WANG Dong et al.: "Research on surface defect detection of tiny parts based on machine vision", Applied Science and Technology *

Also Published As

Publication number Publication date
CN113256581B (en) 2022-09-02

Similar Documents

Publication Publication Date Title
Xu et al. Voxel-based representation of 3D point clouds: Methods, applications, and its potential use in the construction industry
WO2023083059A1 (en) Road surface defect detection method and apparatus, and electronic device and readable storage medium
CN111460927B (en) Method for extracting structured information of house property evidence image
WO2023202381A1 (en) Three-dimensional model segmentation method and apparatus, computing device, and readable storage medium
US10685443B2 (en) Cloud detection using images
CN112989995B (en) Text detection method and device and electronic equipment
CN112070135A (en) Power equipment image detection method and device, power equipment and storage medium
CN113052295B (en) Training method of neural network, object detection method, device and equipment
CN115376130A (en) Method, device, equipment and medium for generating electric power construction site identification model
CN114120141A (en) All-weather remote sensing monitoring automatic analysis method and system thereof
CN116486296A (en) Target detection method, device and computer readable storage medium
CN115311469A (en) Image labeling method, training method, image processing method and electronic equipment
CN117572457B (en) Cross-scene multispectral point cloud classification method based on pseudo tag learning
CN115019163A (en) City factor identification method based on multi-source big data
CN117437647B (en) Oracle character detection method based on deep learning and computer vision
CN113033386B (en) High-resolution remote sensing image-based transmission line channel hidden danger identification method and system
Sun et al. Fast location and recognition of green apple based on RGB-D image
CN113256581B (en) Automatic defect sample labeling method and system based on visual attention modeling fusion
CN112785548A (en) Pavement crack detection method based on vehicle-mounted laser point cloud
CN117146841A (en) Semantic map construction method based on combination of laser radar and binocular vision
WO2023220913A1 (en) Cell image processing method, electronic device and storage medium
Luo et al. RBD-Net: robust breakage detection algorithm for industrial leather
CN115937492A (en) Transformer equipment infrared image identification method based on feature identification
Jia et al. Polar-Net: Green fruit instance segmentation in complex orchard environment
Zhang et al. A YOLOv3-Based Industrial Instrument Classification and Reading Recognition Method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant