CN116664839A

CN116664839A - Weak supervision semi-supervision defect perception segmentation method and device and electronic equipment

Info

Publication number: CN116664839A
Application number: CN202310641253.6A
Authority: CN
Inventors: 林旭新; 梁延研; 强孙源; 李国钊
Original assignee: Boyan Technology Zhuhai Co ltd
Current assignee: Boyan Technology Zhuhai Co ltd
Priority date: 2023-05-31
Filing date: 2023-05-31
Publication date: 2023-08-29
Anticipated expiration: 2043-05-31
Also published as: CN116664839B

Abstract

The invention discloses a weak supervision semi-supervision defect perception segmentation method, a device and electronic equipment, wherein the method comprises the steps of obtaining original characteristics of a target defect image through a downsampling module and a first residual convolution block of a defect repairing device; the method comprises the steps that an original binary mask corresponding to a defect position in a target defect image is obtained in advance according to original characteristics through a defect activation module of a defect repairing device; and inputting an original binary mask obtained by the defect activation module according to the original characteristics into a residual convolution block normalized and constructed by a defect example to obtain a target non-defect image. And carrying out binary difference on the target defect image and the obtained corresponding target non-defect image to obtain a predicted binary mask for predicting the defect position. The invention can reduce the workload of defect data marking and improve the defect perception performance, and can be widely applied to the field of computer vision.

Description

Weak supervision semi-supervision defect perception segmentation method and device and electronic equipment

Technical Field

The invention relates to the field of computer vision, in particular to a weak supervision semi-supervision defect perception segmentation method and device and electronic equipment.

Background

Defect detection has been the focus of research community attention due to the practical importance of quality assurance and maintenance of industrial products. However, while the development of computer vision has prompted effective resolution of defect detection tasks, the high cost of defect data labeling, particularly in task scenarios of pixel-level defect segmentation, limits the ability of deep learning-based approaches.

Therefore, the above-described problems are to be solved.

Disclosure of Invention

In view of the above, the embodiments of the present invention provide a method, an apparatus, and an electronic device for performing weak supervision and semi-supervision of defect sensing segmentation, which are used for reducing the workload of defect data labeling and improving the performance of defect sensing.

An aspect of an embodiment of the present invention provides a weakly-supervised semi-supervised defect-aware segmentation method, including:

inputting a target defect image into a pre-trained defect repair device to obtain a target non-defect image corresponding to the target defect image;

performing binary difference on the target defect image and the obtained corresponding target non-defect image to obtain a predicted binary mask for predicting the defect position;

the training process of the defect repair device is to perform counterdamage training with a defect generator, and the process of the counterdamage training comprises the following steps:

inputting a first defect image into a defect repairing device to obtain a first non-defect image corresponding to the first defect image; inputting the first non-defect image into a defect generator to obtain a defect image corresponding to the first non-defect image;

taking a new non-defect image as a second non-defect image, and inputting the second non-defect image into the defect generator to obtain a defect image corresponding to the second non-defect image; inputting a defect image corresponding to the second non-defect image into the defect repairing device to obtain a non-defect image corresponding to the second non-defect image;

and performing alternating countermeasure training on the non-defect images corresponding to the second non-defect images according to the defect images corresponding to the first non-defect images to obtain a pre-trained defect repair device.

Optionally, the inputting the target defect image to a pre-trained defect healer to obtain a target non-defect image corresponding to the target defect image includes:

acquiring original characteristics of a target defect image through a downsampling module of the defect healer and a first residual convolution block;

acquiring an original binary mask corresponding to a defect position in the target defect image according to the original characteristic by a defect activation module of the defect repair device;

and inputting the original binary mask obtained by the defect activation module according to the original characteristics into a residual convolution block constructed by normalizing the defect example to obtain a target non-defect image.

Optionally, the obtaining, by the defect activation module of the defect healer, an original binary mask corresponding to a defect position in the target defect image according to the original feature includes:

respectively carrying out global average pooling and global maximum pooling on the original features through a defect activation module to obtain a first classification weight corresponding to the defect repairing device and a second classification weight corresponding to the defect generator;

and acquiring an original binary mask corresponding to the defect position in the target defect image according to the first classification weight and the second classification weight and combining a thermodynamic diagram of the characteristic activation response.

Optionally, the obtaining an original binary mask corresponding to the defect position in the target defect image according to the first classification weight and the second classification weight and combining the thermodynamic diagram of the feature activation response includes:

calculating a thermodynamic activation graph of the original feature according to the first classification weight and the second classification weight;

and acquiring an original binary mask corresponding to the defect position in the target defect image according to the thermal activation diagram.

Optionally, the construction process of the second residual convolution block consisting of defect instance normalization includes:

extracting foreground features and background features through the original binary mask;

calculating the mean and variance of the foreground features and calculating the mean and variance of the background features;

normalizing the mean value and the variance of the foreground features and the mean value and the variance of the background features to obtain normalized features;

and merging the normalized features, and embedding the merged normalized features into a residual convolution block to obtain the second residual convolution block.

Optionally, the defect generator and the defect healer include identifying a data distribution of defect data and non-defect data during the loss fight training process.

Optionally, the penalty functions of the contrast penalty training process include training a contrast penalty function, a cyclic consistency penalty function, an identity mapping penalty function, a binary cross entropy penalty function, and a background consistency penalty function;

wherein the training fight loss function is used to transform the input image from the source domain to the target domain;

the cyclic consistency loss function is used for reconstructing the input image converted to the target domain back to the source domain image;

the identity mapping loss function is used for determining identity mapping output of a defect perception model, and the defect perception model is obtained according to the defect healer and the defect generator;

the binary cross entropy loss function is used for training a preset classifier so that the classifier can identify distinguishing features of an input defect image and a non-defect image;

the background consistency loss function is used for keeping the input defect image consistent with the background of the non-defect image.

Another aspect of the embodiments of the present invention further provides a weakly-supervised semi-supervised defect-aware segmentation apparatus, including:

the original characteristic acquisition unit is used for acquiring the original characteristic of the target defect image through the downsampling module of the defect healer and the first residual convolution block;

an original binary mask obtaining unit, configured to obtain, by using a defect activation module of the defect repair device, an original binary mask corresponding to a defect position in the target defect image according to the original feature;

the defect repairing unit is used for inputting the original binary mask obtained by the defect activating module according to the original characteristics into a residual convolution block constructed by normalizing defect examples to obtain a target non-defect image;

And the predicted binary mask obtaining unit is used for carrying out binary difference on the target defect image and the obtained corresponding target non-defect image to obtain a predicted binary mask of the predicted defect position.

Another aspect of the embodiment of the invention also provides an electronic device, which includes a processor and a memory;

the memory is used for storing programs;

the processor executing the program implements the method of any of the above.

Another aspect of the embodiments of the present invention also provides a computer-readable storage medium storing a program that is executed by a processor to implement the method of any one of the above.

Embodiments of the present invention also disclose a computer program product or computer program comprising computer instructions stored in a computer readable storage medium. The computer instructions may be read from a computer-readable storage medium by a processor of a computer device, and executed by the processor, to cause the computer device to perform the foregoing method.

The training process of the defect repairing device needs to be participated with a defect generator, and the defect repairing device can construct a defect image and non-defect image conversion neural network, namely the conversion from the defect image to the non-defect image and the conversion from the non-defect image to the defect image. After the defect repairing device and the defect generator are trained, the defect image can be directly predicted through the defect repairing device, and a corresponding non-defect image can be obtained. The invention can perform weak supervision and semi-supervision training based on the defect image only, does not need to use a large number of marked images, and meanwhile, the trained defect repairing device has higher prediction performance, can accurately output the non-defect image corresponding to the defect image, and realizes efficient segmentation of defect perception.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a diagram illustrating an exemplary implementation of a defect repair apparatus and a defect generator for performing loss-of-countermeasure training according to an embodiment of the present invention;

fig. 2 is a schematic flow chart of a weak supervision semi-supervised defect perception segmentation method according to an embodiment of the present invention;

FIG. 3 is an exemplary flowchart of a weakly-supervised semi-supervised defect aware segmentation method according to an embodiment of the present invention;

fig. 4 is a block diagram of a weak supervision semi-supervised defect sensing segmentation apparatus according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.

The embodiment of the invention provides a weak supervision semi-supervision defect perception segmentation method, which specifically comprises the following steps:

inputting the target defect image into a pre-trained defect repair device to obtain a target non-defect image corresponding to the target defect image.

And carrying out binary difference on the target defect image and the obtained corresponding target non-defect image to obtain a predicted binary mask for predicting the defect position.

First, describing the above-mentioned training process of the defect repair device, referring to fig. 1, an exemplary diagram of the defect repair device and the defect generator performing counterdamage training is provided in an embodiment of the present invention, which may specifically include the following steps:

s1, inputting a first defect image into a defect repair device to obtain a first non-defect image corresponding to the first defect image; and inputting the first non-defect image into a defect generator to obtain a defect image corresponding to the first non-defect image.

S2, taking a new non-defect image as a second non-defect image, and inputting the second non-defect image into the defect generator to obtain a defect image corresponding to the second non-defect image; and inputting the defect image corresponding to the second non-defect image into the defect repairing device to obtain the non-defect image corresponding to the second non-defect image.

S3, performing alternate countermeasure training on the non-defect image corresponding to the second non-defect image according to the defect image corresponding to the first non-defect image, and obtaining the pre-trained defect repair device.

In particular, the present invention may include a defect healer and a defect generator that construct a two-set image conversion neural network, i.e., a conversion of a defective image Y to a non-defective image X, and a conversion of the non-defective image X to the defective image Y. The defect healer and the defect generator of the invention can identify the data distribution of the defect data and the non-defect data so as to realize the countermeasure training process. After the defect repairing device and the defect generator are trained, the defect repairing network R can be used for directly predicting the defect image and obtaining the corresponding non-defect image X, namely, absolute value difference calculation abs (Y-X) can be directly carried out on the two images to obtain a predicted binary mask so as to extract the position of the defect.

Further, the above-mentioned step of inputting the target defect image into the pre-trained defect repair device to obtain the target non-defect image corresponding to the target defect image will be described in detail with reference to fig. 2.

S100: and acquiring original characteristics of the target defect image through a downsampling module of the defect healer and a first residual convolution block.

Specifically, the target defect image may be first input to a value downsampling module for downsampling, and then residual convolution is performed to obtain the original feature.

S110: and acquiring an original binary mask corresponding to the defect position in the target defect image according to the original characteristic by a defect activation module of the defect repairing device.

Specifically, given the original feature obtained by downsampling, the embodiment of the invention additionally trains a classifier to distinguish whether the original feature corresponds to a non-defective image or a defective image, namely, a defective image and a non-defective image based on the original feature are classified. Specifically, global average pooling and global maximum pooling can be performed on the original features, then corresponding classification weights are obtained through a defect image classifier and a non-defect image classifier respectively, logits are obtained, and the calculation expression of the classification weights is as follows:

F _R-classifier (y)＝[P _GMP ，P _GAP ]，

wherein Feat (y) represents the thermodynamic diagram characteristics after passing through the residual block. sigma represents the sigmoid activation function. Wgmp and Wgap are the classification weights of the two sets of classifiers, respectively. GMP and GAP represent global max pooling and global average pooling, respectively. [, ] represents a con-figuration splice operation.

Through the obtained classification weight, the embodiment of the invention can calculate the thermodynamic activation diagram about the original feature, and further obtain the original binary mask corresponding to the defect position in the target defect image according to the thermodynamic activation diagram, and the specific calculation process is as follows:

A _GMP ＝w _GMP ⊙Feat(y)，

A _GAP ＝w _GAP ⊙ Feat(y)，

A＝[A _GMP ，A _GAP ]，

where a represents features obtained by two sets of classifier weights and thermodynamic diagram features. As indicated by the channel level dot product.

Where Ac represents the characteristic of the c-th channel of characteristic A, and summing is performed according to the channels to obtain the thermodynamic activation diagram.

S120: and inputting the original binary mask obtained by the defect activation module according to the original characteristics into a residual convolution block constructed by normalizing the defect example to obtain a target non-defect image.

Specifically, a second residual convolution block composed of defect instance normalization is first described.

The embodiment of the invention provides a feature normalization method based on a response thermodynamic diagram of a defect. Specifically, according to the binary mask obtained by thermodynamic diagram, extracting the binary mask to obtain the foreground feature and the background feature, wherein the extraction and calculation process is as follows:

where BKGD represents foreground features and FGD represents background features.

Then, calculating the mean and variance of the foreground features, calculating the mean and variance of the background features, normalizing, and calculating the mean and variance of the foreground features as follows:

where H and W represent feature height and width, respectively.

The mean and variance calculation process of the background features is as follows, and the normalization calculation process is as follows:

the normalized features are combined, and the specific calculation process is as follows:

and finally embedding the combined normalized features into a residual convolution block to obtain a second residual convolution block.

Next, the defect healer and defect generator are presented with loss functions during the combat loss training process, including loss functions including training combat loss functions, cyclic consistency loss functions, identity mapping loss functions, binary cross entropy loss functions, and background consistency loss functions.

Specifically, training the contrast loss function is used to transform the input image from the source domain to the target domain, expressed as follows:

where Dx represents a non-defect discriminator and R represents a defect healer. x and y represent a non-defective image and a defective image, respectively.

The cyclic consistency loss function is used to reconstruct the input image converted to the target domain back to the source domain image, expressed as follows:

as above, where R represents the defect healer and y represents the defect image. G represents a defect generator.

According to the embodiment of the invention, a two-class classifier can be additionally added and trained for identifying the distinguishing characteristics of the defect image and the non-defect image, so that the response to the specific defect area by using the activation thermodynamic diagram can be well performed. The binary cross entropy loss function is used to train the classifier, and the expression is as follows:

where F represents the classifier probability output.

Because, in the embodiment of the invention, the background of the defect image and the background of the corresponding non-defect image can be extracted through the binary mask, the background consistency loss function is used for keeping the input defect image consistent with the background of the non-defect image, and only the response of the defect point is concerned, and the background consistency loss function expression is as follows:

where M represents the operation of background extraction.

The identity mapping loss function is used to determine an identity mapping output of a defect perception model, which is derived from the defect healer and the defect generator.

In an alternative embodiment, the above loss functions are weighted and summed to obtain a total loss function of the anti-loss training process, where the expression is as follows:

where λ, β, γ, η represent the weight parameters of the total loss function, respectively.

And after the defect repairing device is trained, giving a target defect image, inputting the target defect image into a residual convolution block which is normalized and constructed by a defect example through the original binary mask obtained by the defect activating module according to the original characteristics, and obtaining a target non-defect image. And carrying out binary difference on the target defect image and the obtained corresponding target non-defect image to obtain a predicted binary mask for predicting the defect position.

The application of the present invention will be described in the following with specific examples.

Referring to fig. 3, an example flow chart of a weakly-supervised semi-supervised defect aware segmentation method is provided by an embodiment of the present invention.

Specifically, the defect healer may convert the defect image into a corresponding healed non-defect image. The architecture of the defect healer may include a defect activation module and a residual convolution block of defect instance normalization construction. First, the original features for the defective image are obtained by convolving the downsampled and residual convolution block. The original features pass through a defect activation module, the defect activation module performs unsupervised response on a defect region of the defect image, and outputs a binary mask of the defect position for a subsequent process of converting the defect image into a non-defect image. Then, the original features are sent to a residual convolution block normalized by the defect instance, further feature extraction and image conversion are carried out, and finally, a non-defect image corresponding to the defect image is obtained through convolution up-sampling, and the non-defect image is output. And carrying out binary difference on the defect image and the output non-defect image to obtain a predicted binary mask for predicting the defect position.

Referring to fig. 4, an embodiment of the present invention provides a weak supervision semi-supervised defect aware segmentation apparatus, including:

an original binary mask obtaining unit, configured to obtain, in advance, an original binary mask corresponding to a defect position in the target defect image according to the original feature by using a defect activation module of the defect repair device;

Embodiments of the present invention also disclose a computer program product or computer program comprising computer instructions stored in a computer readable storage medium. The computer instructions may be read from a computer-readable storage medium by a processor of a computer device, and executed by the processor, to cause the computer device to perform the method shown in fig. 2.

In some alternative embodiments, the functions/acts noted in the block diagrams may occur out of the order noted in the operational illustrations. For example, two blocks shown in succession may in fact be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/acts involved. Furthermore, the embodiments presented and described in the flowcharts of the present invention are provided by way of example in order to provide a more thorough understanding of the technology. The disclosed methods are not limited to the operations and logic flows presented herein. Alternative embodiments are contemplated in which the order of various operations is changed, and in which sub-operations described as part of a larger operation are performed independently.

Furthermore, while the invention is described in the context of functional modules, it should be appreciated that, unless otherwise indicated, one or more of the described functions and/or features may be integrated in a single physical device and/or software module or one or more functions and/or features may be implemented in separate physical devices or software modules. It will also be appreciated that a detailed discussion of the actual implementation of each module is not necessary to an understanding of the present invention. Rather, the actual implementation of the various functional modules in the apparatus disclosed herein will be apparent to those skilled in the art from consideration of their attributes, functions and internal relationships. Accordingly, one of ordinary skill in the art can implement the invention as set forth in the claims without undue experimentation. It is also to be understood that the specific concepts disclosed are merely illustrative and are not intended to be limiting upon the scope of the invention, which is to be defined in the appended claims and their full scope of equivalents.

The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.

Logic and/or steps represented in the flowcharts or otherwise described herein, e.g., a ordered listing of executable instructions for implementing logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.

More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program is printed, as the program may be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory.

It is to be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above-described embodiments, the various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, may be implemented using any one or combination of the following techniques, as is well known in the art: discrete logic circuits having logic gates for implementing logic functions on data signals, application specific integrated circuits having suitable combinational logic gates, programmable Gate Arrays (PGAs), field Programmable Gate Arrays (FPGAs), and the like.

In the description of the present specification, a description referring to terms "one embodiment," "some embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiments or examples. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

While embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that: many changes, modifications, substitutions and variations may be made to the embodiments without departing from the spirit and principles of the invention, the scope of which is defined by the claims and their equivalents.

While the preferred embodiment of the present invention has been described in detail, the present invention is not limited to the embodiments described above, and those skilled in the art can make various equivalent modifications or substitutions without departing from the spirit of the present invention, and these equivalent modifications or substitutions are included in the scope of the present invention as defined in the appended claims.

Claims

1. A weakly-supervised semi-supervised defect-aware segmentation method, comprising:

2. The method of claim 1, wherein the inputting the target defect image to a pre-trained defect healer to obtain a target non-defect image corresponding to the target defect image comprises:

acquiring an original binary mask corresponding to a defect position in the target defect image in advance according to the original characteristics by a defect activation module of the defect repair device;

3. The method of claim 2, wherein the pre-acquiring, by the defect activation module of the defect healer, the original binary mask corresponding to the defect position in the target defect image according to the original feature comprises:

and acquiring an original binary mask corresponding to the defect position in the target defect image according to the first classification weight and the second classification weight and combining original features.

4. A weakly-supervised semi-supervised defect aware segmentation method as set forth in claim 3, wherein the obtaining the original binary mask corresponding to the defect location in the target defect image based on the first classification weight and the second classification weight in combination with the original feature comprises:

5. The method of claim 2, wherein the process of constructing the second residual convolution block comprising normalization of defect instances comprises:

6. The method of claim 1, wherein the defect generator and the defect healer include identifying a data distribution of defect data and non-defect data during the loss-of-immunity training.

7. The method of claim 1, wherein the loss functions of the counterloss training process include training counterloss functions, cyclic consistency loss functions, identity mapping loss functions, binary cross entropy loss functions, and background consistency loss functions;

8. A weakly-supervised semi-supervised defect aware segmentation apparatus, comprising:

an original binary mask obtaining unit, configured to obtain, by using a defect activation module of the defect healer, a binary mask corresponding to a defect position in the target defect image according to the original feature;

according to the defect image corresponding to the first non-defect image, alternating countermeasure training is carried out on the non-defect image corresponding to the second non-defect image, and a pre-trained defect repairing device is obtained;

9. An electronic device comprising a processor and a memory;

the memory is used for storing programs;

the processor executing the program implements the method of any one of claims 1 to 7.

10. A computer-readable storage medium, characterized in that the storage medium stores a program that is executed by a processor to implement the method of any one of claims 1 to 7.