CN113658183B - Workpiece quality inspection method and device and computer equipment - Google Patents
- Publication number: CN113658183B (application CN202111223818.6A)
- Authority: CN (China)
- Prior art keywords: network model, behavior, state, deep, quality inspection
- Prior art date
- Legal status: Active (an assumption, not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Classifications
- G06T7/0002, G06T7/0004: Inspection of images, e.g. flaw detection; industrial image inspection
- G06F18/214: Generating training patterns; bootstrap methods, e.g. bagging or boosting
- G06N3/04, G06N3/08: Neural networks; architecture, e.g. interconnection topology; learning methods
- G06T2207/20081: Training; learning
- G06T2207/20084: Artificial neural networks [ANN]
- G06T2207/30108, G06T2207/30164: Industrial image inspection; workpiece; machine component
- G06T2207/30168: Image quality inspection
Abstract
The invention relates to the technical field of industrial quality inspection and provides a workpiece quality inspection method, a workpiece quality inspection device, and computer equipment. The method comprises the following steps: acquiring a test image corresponding to a workpiece to be inspected and a trained neural network model; acquiring training samples for a deep Q network; constructing the deep Q network and training it on the training samples to obtain a state-behavior deep Q network model corresponding to the neural network model; inputting the test image into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image; performing data enhancement on the test image according to the at least one target enhancement behavior and inputting the enhanced test image into the neural network model to obtain at least one prediction result; and performing quality inspection on the workpiece according to the at least one prediction result. In this way the test effect is preserved while the test time is greatly reduced, improving both the efficiency and the accuracy of workpiece quality inspection.
Description
Technical Field
The invention relates to the technical field of industrial quality inspection, in particular to a workpiece quality inspection method, a workpiece quality inspection device and computer equipment.
Background
Industrial quality control refers to the quality inspection of workpieces or products at the various stages of industrial manufacturing and production, for example detecting tiny scratches, pits, and similar defects on a product surface. Such defects can cause serious accidents: surface defects on an aircraft tire, for instance, directly affect its performance and may bring irreparable loss to passengers. Industrial quality control is therefore very important in modern manufacturing.
In the related art, a picture is usually fed directly into a trained neural network model for quality inspection. Because a picture can arrive in many versions, for example inverted, the trained model suffers from low prediction accuracy, poor effect, and long prediction time, which in turn leads to low accuracy, poor effect, and low efficiency in industrial quality inspection.
Disclosure of Invention
In order to solve one of the above technical problems, the present invention proposes the following technical solutions.
An embodiment of the first aspect of the invention provides a workpiece quality inspection method comprising the following steps: acquiring a test image corresponding to a workpiece to be inspected and a trained neural network model; obtaining training samples for a deep Q network, the training samples comprising a plurality of original images, data-enhanced images corresponding to the original images, and a plurality of data enhancement behaviors; constructing the deep Q network and training it on the training samples to obtain a state-behavior deep Q network model corresponding to the neural network model; inputting the test image into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image; performing data enhancement on the test image according to the at least one target enhancement behavior and inputting the enhanced test image into the neural network model to obtain at least one prediction result; and performing quality inspection on the workpiece according to the at least one prediction result.
In addition, the workpiece quality inspection method according to the above embodiment of the present invention may have the following additional features.
According to an embodiment of the present invention, training the deep Q network on the training samples to obtain a state-behavior deep Q network model corresponding to the neural network model comprises: composing the plurality of data-enhanced images into a state space set S = {S_1, ..., S_t, S_{t+1}, ..., S_T} of an environment, wherein S_t denotes the state of the environment at time t, S_{t+1} the state of the environment at time t+1, and S_T the final state of the environment; composing the plurality of data enhancement behaviors into a behavior space set A = {A_1, ..., A_k, ...} of an agent, wherein A_k denotes the k-th data enhancement behavior; and obtaining the state-behavior deep Q network model based on the interaction between the agent and the environment.
According to an embodiment of the present invention, obtaining the state-behavior deep Q network model based on the interaction between the agent and the environment comprises: obtaining the environmental state S_t at time t; inputting S_t into the agent so that the agent performs a data enhancement behavior A_t at time t according to S_t; after the agent performs A_t, the environment transitions to state S_{t+1} at time t+1 and simultaneously feeds back the reward value at time t+1 to the agent, so that the agent performs the data enhancement behavior at time t+1; obtaining the feedback reward value for each time step so as to compose a feedback reward set R = {R_1, ..., R_t, R_{t+1}, ...}, wherein R_t denotes the feedback reward value at time t and R_{t+1} the feedback reward value at time t+1; determining a value function of the agent from the feedback reward values; and obtaining the state-behavior deep Q network model based on the value function.
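The agent-environment interaction loop described above can be sketched with a minimal tabular Q-learning example. The patent uses a deep Q network; the toy stand-in environment, the state/behavior encodings, and the reward rule below are assumptions chosen purely to illustrate the interaction and the update rule:

```python
import random

# Toy stand-ins: states encode an image "version", actions are enhancement
# behaviors; both encodings and the reward rule are assumptions for illustration.
STATES = [0, 1, 2, 3]        # e.g. 0=upright, 1=inverted, 2=dark, 3=bright
ACTIONS = [0, 1, 2]          # e.g. 0=flip, 1=rotate, 2=adjust brightness
GAMMA, ALPHA, EPSILON = 0.9, 0.5, 0.1

def step(state, action):
    # stand-in environment: reward 1.0 when the behavior "fixes" the state
    reward = 1.0 if action == state % 3 else 0.0
    return random.choice(STATES), reward

random.seed(0)
Q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
state = 0
for _ in range(2000):
    if random.random() < EPSILON:                      # explore
        action = random.choice(ACTIONS)
    else:                                              # exploit
        action = max(ACTIONS, key=lambda a: Q[(state, a)])
    next_state, reward = step(state, action)
    # Q-learning update: Q(s,a) += alpha*(r + gamma*max_a' Q(s',a') - Q(s,a))
    best_next = max(Q[(next_state, a)] for a in ACTIONS)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])
    state = next_state

# greedy policy recovered from the learned Q-values
policy = {s: max(ACTIONS, key=lambda a: Q[(s, a)]) for s in STATES}
```

After enough interaction the greedy policy picks, for each state, the behavior that the stand-in environment rewards, mirroring how the agent learns which enhancement behavior suits which image state.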
According to one embodiment of the invention, determining the value function of the agent according to the respective feedback reward values comprises: acquiring a control strategy π of the agent, together with the state-value function and the behavior-value function corresponding to the control strategy π;
obtaining the state-behavior deep Q network model based on the value function comprises: optimizing the deep Q network model by maximizing the state-value function and the behavior-value function.
According to one embodiment of the present invention, the state-value function corresponding to the control strategy π is:

$$V^{\pi}(s) = \mathbb{E}_{\pi}\left[\sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1} \,\middle|\, S_t = s\right]$$

and the behavior-value function corresponding to the control strategy π is:

$$Q^{\pi}(s, a) = \mathbb{E}_{\pi}\left[\sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1} \,\middle|\, S_t = s, A_t = a\right]$$

wherein $\mathbb{E}_{\pi}$ denotes the expectation of the random variable when the agent follows the strategy π, $\gamma$ denotes the discount factor, $R_{t+k+1}$ denotes the feedback reward value at time t+k+1, s denotes the value given to $S_t$, and a denotes the value given to $A_t$.
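As a quick numeric check of the quantity inside the expectation, the discounted return G_t = sum_k gamma^k R_{t+k+1} can be computed directly; the reward sequence and discount factor below are arbitrary illustrative values:

```python
def discounted_return(rewards, gamma):
    """Discounted return G_t for a finite reward sequence R_{t+1}, R_{t+2}, ..."""
    g = 0.0
    for k, r in enumerate(rewards):
        g += (gamma ** k) * r
    return g

# Rewards R_{t+1}=R_{t+2}=R_{t+3}=1 with discount factor gamma=0.5:
# G_t = 1 + 0.5 + 0.25 = 1.75
g = discounted_return([1.0, 1.0, 1.0], 0.5)
```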
According to an embodiment of the present invention, performing quality inspection on the workpiece according to the at least one prediction result comprises: when there are multiple prediction results, preprocessing the multiple prediction results; and performing quality inspection on the workpiece according to the preprocessed prediction results.
According to one embodiment of the invention, the plurality of data enhancement behaviors comprise: flipping, rotating, zooming, and brightness adjustment.
In a second aspect, an embodiment of the present invention provides a workpiece quality inspection apparatus, comprising: a first acquisition module for acquiring a test image corresponding to a workpiece to be inspected and a trained neural network model; a second acquisition module for obtaining training samples of the deep Q network, the training samples comprising a plurality of original images, data-enhanced images corresponding to the original images, and a plurality of data enhancement behaviors; a training module for constructing the deep Q network and training it on the training samples to obtain a state-behavior deep Q network model corresponding to the neural network model; a first determining module for inputting the test image into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image; a second determining module for performing data enhancement on the test image according to the at least one target enhancement behavior and inputting the enhanced test image into the neural network model to obtain at least one prediction result; and a quality inspection module for performing quality inspection on the workpiece according to the at least one prediction result.
In addition, the workpiece quality inspection apparatus according to the above embodiment of the present invention may have the following additional features.
According to an embodiment of the invention, the training module comprises: a first composing unit for composing the plurality of data-enhanced images into a state space set S = {S_1, ..., S_t, S_{t+1}, ..., S_T} of the environment, wherein S_t denotes the state of the environment at time t, S_{t+1} the state of the environment at time t+1, and S_T the final state of the environment; a second composing unit for composing the plurality of data enhancement behaviors into a behavior space set A = {A_1, ..., A_k, ...} of an agent, wherein A_k denotes the k-th data enhancement behavior; and an interaction unit for obtaining the state-behavior deep Q network model based on the interaction between the agent and the environment.
A third aspect of the present invention provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor executes the computer program to implement the workpiece quality inspection method according to the first aspect of the present invention.
According to the technical scheme of the embodiment of the invention, before the neural network model is used for workpiece quality inspection, a deep Q network model is trained and used to determine the optimal data enhancement behavior; the test image is then data-enhanced with this behavior and input into the neural network model, and quality inspection is performed according to the neural network model's prediction result. The test effect is thereby preserved while the test time is greatly reduced, improving both the efficiency and the accuracy of workpiece quality inspection.
Drawings
Fig. 1 is a flowchart of a workpiece quality inspection method according to an embodiment of the invention.
Fig. 2 is a schematic diagram illustrating a principle of data enhancement of a test image by TTA in the related art.
Fig. 3 is a schematic diagram illustrating a principle of data enhancement of a test image by a deep Q network model according to an embodiment of the present invention.
FIG. 4 is a schematic diagram of the interaction between agents and the environment of a deep Q network, according to one embodiment of the present invention.
Fig. 5 is a block diagram of a workpiece quality inspection apparatus according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the related art, for a neural network model used in industrial quality inspection, a test-time data enhancement (Test Time Augmentation, TTA) technique is usually adopted to perform data enhancement on the model's training or test data, either to increase the training data or to improve test accuracy. TTA performs data enhancement operations on a test image at the test stage, inputs the multiple enhanced versions into the trained model to obtain output results, and then averages those outputs to produce the final prediction result. This approach works because the area shown in the original test image may lack some important features, so it helps to take multiple versions of the input image and average their predictions across the model.
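The TTA averaging just described can be sketched as follows; the stand-in model, the augmented-version names, and the hard-coded class probabilities are assumptions for illustration only:

```python
# Hypothetical stand-in model: returns fixed class probabilities per version.
def model(image):
    return {"orig": [0.6, 0.4], "flip": [0.8, 0.2], "rot": [0.7, 0.3]}[image]

# TTA: predict on every augmented version, then average column-wise.
augmented_versions = ["orig", "flip", "rot"]
outputs = [model(v) for v in augmented_versions]
n = len(outputs)
averaged = [sum(col) / n for col in zip(*outputs)]
```

Note that every augmented version incurs a full forward pass, which is exactly the test-time cost the patent's deep Q network approach avoids.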
However, a problem of the neural network model is that it is not robust to image transformations such as rotation and brightness changes; to enhance the model's generalization ability, a series of data enhancements is therefore usually applied to the images before training.
In summary, TTA adapts the test image to the model through different data enhancements, improving the model's effect at the cost of a large amount of test time. Training-time data enhancement improves the model's test effect by increasing the diversity of the training data, which amounts to adapting the model to the test data; it improves the model's "insight" only to a limited extent, and the model never becomes truly robust to data enhancement.
By habit of human thinking, a rotated picture is recognized not by memorizing its appearance at every rotation angle, but by mentally rotating it back to a normal angle before recognizing it; that is, humans have the ability of rotational thinking, while a convolutional neural network model lacks it.
That is, TTA obtains a prediction by heavily expanding the test image and then screening and combining the many test results, which requires a large amount of test time and is difficult to use in real-world scenarios. Training data enhancement, meanwhile, cannot cover all variations and can only improve the model's effect to a limited extent. For example, a model trained only on non-rotated pictures performs well only on non-rotated data; when rotated pictures are input, its prediction ability drops, reducing prediction accuracy.
Therefore, unlike a person, the convolutional neural network model used in industrial quality inspection does not rotate a test image to a normal or suitable angle before identifying it. In other words, the convolutional neural network has poor robustness, so workpiece quality inspection is inefficient and inaccurate, and the quality inspection effect suffers.
To address these problems, the invention provides a workpiece quality inspection method, a workpiece quality inspection device, and computer equipment.
Specific embodiments of the present invention will be described below with reference to the drawings.
Fig. 1 is a flowchart of a workpiece quality inspection method according to an embodiment of the invention.
It should be noted that the main execution body of the workpiece quality inspection method according to the embodiment of the present invention may be an electronic device in an industrial field, and specifically, the electronic device may be, but is not limited to, an industrial computer and a mobile terminal. The application scenario of the embodiment of the invention can be a scenario that a workpiece needs to be subjected to quality inspection in an industrial production site, wherein the workpiece can be an industrial object, such as an industrial computer, a tire and the like.
As shown in fig. 1, the workpiece quality inspection method includes the following steps S1 to S6.
And S1, obtaining a test image corresponding to the workpiece to be inspected and a trained neural network model.
The trained neural network model can be a neural network model required by each stage of industrial quality inspection, and the trained neural network model has the function of outputting a prediction result corresponding to a test image according to the input test image. For example, the neural network model may be an image recognition model, an object detection model, an instance segmentation model, or a defect detection model.
Specifically, when a workpiece needs to be quality-inspected, a camera can be used to capture a test image of the workpiece, and the trained neural network model needed for quality inspection is obtained, such as a trained target detection model that can detect an object, such as a kitten, in the test image.
S2, obtaining training samples of the deep Q network, the training samples comprising: a plurality of original images, data-enhanced images corresponding to the original images, and a plurality of data enhancement behaviors.
Wherein the original image may be an image of a workpiece, as many images of the workpiece as possible may be acquired in embodiments of the present invention. The type of the acquired workpiece image can be determined according to the function of the trained model, for example, when the function of the trained neural network model is image content identification detection, a workpiece image which can contain image content can be acquired; when the trained neural network model functions as defect detection, a workpiece image with defects can be acquired.
Specifically, after the neural network model is acquired, a plurality of original images are acquired, after the plurality of original images are acquired, as many data enhancement behaviors as possible, such as rotation, inversion, scaling, brightness adjustment, and the like, can be acquired, and each original image is sequentially subjected to data enhancement by the plurality of data enhancement behaviors, so that a data enhancement image corresponding to each original image can be acquired.
For example, given an original image W and three data enhancement behaviors (rotation, scaling, and brightness adjustment): rotating W yields the data-enhanced image W1; scaling W1 yields W2; adjusting the brightness of W2 yields W3; scaling W yields W4; adjusting the brightness of W4 yields W5; and adjusting the brightness of W yields W6. A plurality of data-enhanced images W1, W2, W3, W4, W5, and W6 is thereby obtained.
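The enhancement behaviors and the chained W example can be sketched on a tiny image represented as a 2D list of pixel intensities; the pixel values, the naive zoom-by-repetition, and the additive brightness rule are illustrative assumptions:

```python
def flip(img):          # horizontal flip
    return [row[::-1] for row in img]

def rotate(img):        # rotate 90 degrees clockwise
    return [list(row) for row in zip(*img[::-1])]

def zoom(img):          # naive 2x zoom by pixel repetition
    return [[p for p in row for _ in (0, 1)] for row in img for _ in (0, 1)]

def brightness(img, delta=10):
    return [[p + delta for p in row] for row in img]

W = [[1, 2],
     [3, 4]]
W1 = rotate(W)          # rotate W            -> W1
W2 = zoom(W1)           # then scale W1       -> W2
W3 = brightness(W2)     # then brighten W2    -> W3
```

Chaining in the other orders (scale W -> W4, brighten W4 -> W5, brighten W -> W6) works the same way, each call producing one more data-enhanced image.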
S3, constructing a deep Q network, and performing learning training on the deep Q network based on the training samples to obtain a state-behavior deep Q network model corresponding to the neural network model.
Specifically, after the training sample is obtained, a deep Q network is constructed, and the deep Q network is subjected to learning training based on the training sample to obtain a state-behavior deep Q network model corresponding to the trained neural network model.
The state-behavior deep Q network model outputs a data enhancement behavior corresponding to a test image according to the state or version of the test image input to it; this behavior is the optimal data enhancement behavior for the given trained neural network model and test image. One trained model corresponds to one state-behavior deep Q network model, and one test image corresponds to one or more optimal data enhancement behaviors (referred to in the embodiments of the present invention as target enhancement behaviors).
And S4, inputting the test image into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image.
Specifically, after the deep Q network model corresponding to the neural network model is obtained, the acquired test image corresponding to the workpiece may be input into it, and the deep Q network model outputs at least one target enhancement behavior corresponding to the test image according to the state of the test image.
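How a trained Q model could yield one or more target enhancement behaviors is sketched below. The `q_network` stub, the threshold rule, and the behavior names are assumptions for illustration, not the patent's actual selection rule:

```python
BEHAVIORS = ["flip", "rotate", "zoom", "brightness"]

def q_network(test_image):
    # hypothetical fixed scores; a real deep Q network computes these
    # from the input test image
    return [0.1, 0.9, 0.3, 0.2]

def select_target_behaviors(test_image, threshold=0.5):
    q_values = q_network(test_image)
    # keep every behavior whose Q-value clears the threshold...
    chosen = [b for b, q in zip(BEHAVIORS, q_values) if q >= threshold]
    if not chosen:
        # ...falling back to the single best-scoring behavior
        chosen = [max(zip(BEHAVIORS, q_values), key=lambda bq: bq[1])[0]]
    return chosen
```

With the stub scores above, only "rotate" clears the threshold, so it becomes the sole target enhancement behavior for the image.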
And S5, enhancing the test image according to the at least one target enhancement behavior, and inputting the enhanced test image into the trained neural network model to obtain at least one prediction result.
In the embodiment of the present invention, a result output by the neural network model according to the enhanced test image may be referred to as a prediction result.
Specifically, after the target enhancement behavior corresponding to the test image is determined by the deep Q network model, the test image can be data-enhanced with that behavior to obtain an enhanced test image, which is then input into the trained neural network model; the neural network model then outputs a prediction result. Compared with identifying the test image without data enhancement, or in other versions, the neural network model achieves the highest recognition accuracy, the shortest test time, and the best recognition effect on the enhanced test image.
It should be noted that target enhancement behaviors correspond one-to-one with prediction results: one target enhancement behavior yields one prediction result, and multiple target enhancement behaviors yield multiple prediction results. Because the neural network model identifies the enhanced test image with high accuracy, inaccurate predictions caused by misrecognition are avoided, and the multiple prediction results can be consistent.
And S6, performing quality inspection on the workpiece according to at least one prediction result.
Specifically, in practical application, when the trained neural network model is needed to perform quality inspection on a workpiece, step S1 is first executed to obtain the test image corresponding to the workpiece and the trained neural network model. Steps S2 and S3 are then executed to obtain training samples for the deep Q network and to train the deep Q network on them, yielding the state-behavior deep Q network model corresponding to the trained neural network model. After the deep Q network model is trained, step S4 determines the target enhancement behavior: the test image is input into the deep Q network model to obtain at least one target enhancement behavior corresponding to it. Step S5 then enhances the test image with the target enhancement behavior and inputs the enhanced test image into the trained neural network model, which outputs at least one highly accurate prediction result. Finally, quality inspection of the workpiece is performed according to the at least one prediction result, yielding the quality inspection result.
For example, when the workpiece to be inspected is a vase, the prediction result can be the patterns on its surface and their positions. The deep Q network model and the neural network model together identify the patterns accurately, and quality inspection of the vase can then be performed from pattern characteristics such as style and size; for example, when the patterns are regularly distributed and uniform in size, the vase passes quality inspection and is qualified.
That is to say, in the embodiment of the present invention, a deep Q network model corresponding to the neural network model is trained. When quality inspection is needed, an optimal enhancement strategy is selected before the neural network model is used: the test image is enhanced by the optimal enhancement strategy into the state or version best suited to the neural network model's test, the neural network model then makes its prediction, and quality inspection is performed according to the prediction result, thereby substantially addressing the robustness problem of the neural network model. This method of selecting the optimal enhancement strategy has good universality, real-time performance, and test effect, is closer to the human way of thinking, and can greatly reduce test time while preserving the test effect.
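Steps S4 to S6 can be tied together in a short pipeline sketch; all three model functions below are hypothetical stand-ins for the trained deep Q network model and neural network model:

```python
def deep_q_model(image):
    # S4: decide the target enhancement behavior(s) for this image (stand-in)
    return ["rotate"]

def enhance(image, behavior):
    # S5: apply the chosen data enhancement behavior (stand-in)
    return behavior + "(" + image + ")"

def neural_network_model(image):
    # S5: predict on the enhanced test image (stand-in)
    return "qualified"

def inspect(test_image):
    predictions = [neural_network_model(enhance(test_image, b))
                   for b in deep_q_model(test_image)]   # S4 + S5
    # S6: the (consistent) predictions drive the quality verdict
    return predictions[0]
```

The key contrast with TTA is visible in the loop: only the behaviors chosen by the Q model are evaluated, one forward pass each, instead of every possible augmented version.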
As shown in FIGS. 2 and 3, assume that X is a test image and M represents a trained neural network model, where the training sample images and the test image X share the same data distribution. In the related art, referring to FIG. 2, X is expanded through a series of data enhancements (TTA) into a data set X_1, X_2, …, X_n; all of X_1, X_2, …, X_n are sent into M for prediction, obtaining output results Y_1, Y_2, …, Y_n, which are then merged to obtain the final prediction result Y, and industrial quality inspection is realized according to this prediction result. Referring to FIG. 3, the embodiment of the present invention instead first trains a deep Q network model corresponding to M, determines from the deep Q network model a target enhancement behavior for the test image X, enhances the test image according to that behavior to obtain an enhanced test image X', and inputs X' into M to obtain a prediction result Y, according to which industrial quality inspection is realized.
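The contrast between FIG. 2 (classic TTA, merging many predictions) and FIG. 3 (a single enhancement chosen by the Q network) can be sketched as below. The model, Q network and augmentation functions here are toy stand-ins invented for illustration, not APIs from the patent:

```python
def tta_predict(model, image, augmentations, merge):
    """Classic TTA: run every augmented copy through the model, merge results."""
    outputs = [model(aug(image)) for aug in augmentations]
    return merge(outputs)

def dqn_predict(model, q_network, image, augmentations):
    """Proposed scheme: the Q network picks one best augmentation first,
    so only a single forward pass through the model is needed."""
    q_values = q_network(image)                       # one Q value per behavior
    best = max(range(len(augmentations)), key=lambda i: q_values[i])
    return model(augmentations[best](image))

# Toy demonstration with scalar "images" and an identity model.
augs = [lambda x: x, lambda x: x + 1, lambda x: x * 2]
model = lambda x: x
q_net = lambda x: [0.1, 0.9, 0.3]                     # behavior 1 scores highest

tta = tta_predict(model, 5, augs, merge=lambda ys: sum(ys) / len(ys))
one = dqn_predict(model, q_net, 5, augs)
```

TTA pays n forward passes and a merge step; the DQN route pays one Q-network call plus one model call, which is the source of the speedup claimed here.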
According to the embodiment of the invention, the decision on the data enhancement behavior is made based on deep reinforcement learning before model prediction is carried out. Compared with TTA, the test speed is higher and the enhancement strategy can be decided in a shorter time; on this basis, the poor robustness of convolutional neural network models to rotation and brightness changes can be effectively mitigated, in a way that better matches the thinking habits of the human brain.
For example, suppose a neural network model is trained on images acquired in a factory, but the brightness of the camera platform changes at prediction time. The acquired images then differ from those the neural network model was trained on, and its recognition performance degrades. To avoid this degradation, the deep Q network model of the embodiment of the invention decides the optimal enhancement behavior for the acquired images, adjusts their state or version accordingly, and only then inputs them into the neural network model, thereby improving the model's recognition performance.
According to the workpiece quality inspection method provided by the embodiment of the invention, before the workpiece quality inspection is realized by adopting the neural network model, the deep Q network model is trained, the optimal data enhancement behavior is determined through the deep Q network model, the data enhancement is further carried out on the test image through the optimal data enhancement behavior, the test image is input into the neural network model after the data enhancement, and the quality inspection is realized according to the prediction result of the neural network model. Therefore, the testing effect can be ensured, and meanwhile, the testing time is greatly reduced, so that the workpiece quality inspection efficiency is improved, and the workpiece quality inspection accuracy is improved.
It should be noted that the deep Q network model is trained based on an Agent and an Environment (Environment).
That is, in an embodiment of the present invention, the learning and training of the deep Q network based on the training samples in the step S3 to obtain the state-behavior deep Q network model corresponding to the neural network model may include the following steps S31 to S33.
S31, forming the plurality of data enhancement images into a state space set S of the environment, S = {S_1, …, S_t, S_{t+1}, …, S_T}, wherein S_t represents the state of the environment at time t, S_{t+1} represents the state of the environment at time t+1, and S_T represents the final state of the environment.
S32, forming the plurality of data enhancement behaviors into a behavior space set A of the agent, A = {A_1, A_2, …, A_K}, wherein A_k represents the k-th data enhancement behavior.
And S33, obtaining a state-behavior deep Q network model based on the interaction between the agent and the environment.
Further, the step S33 may include the following steps S331 to S336.
S331, obtaining the environmental state S_t at time t.

S332, inputting the environmental state S_t at time t into the agent, so that the agent performs the data enhancement behavior A_t at time t according to the environmental state S_t.

S333, after the agent performs the data enhancement behavior A_t, the state of the environment changes to the environmental state S_{t+1} at time t+1; meanwhile, the feedback reward value at time t+1 is fed back to the agent, so that the agent performs the data enhancement behavior at time t+1.

S334, obtaining the feedback reward value corresponding to each time, so as to combine the feedback reward values into a feedback reward set R = {R_1, …, R_t, R_{t+1}, …}, wherein R_t represents the feedback reward value at time t and R_{t+1} represents the feedback reward value at time t+1.
And S335, determining the value function of the intelligent agent according to the feedback reward values.
In one example, determining a cost function for the agent based on the respective feedback reward values may include: and acquiring a control strategy pi of the agent, and a state-value function and a behavior-value function corresponding to the control strategy pi.
Wherein, the state-value function corresponding to the control strategy π is:

$$V_\pi(s) = \mathbb{E}_\pi\left[\sum_{k=0}^{\infty} \gamma^k R_{t+k+1} \,\middle|\, S_t = s\right]$$

and the behavior-value function corresponding to the control strategy π is:

$$Q_\pi(s, a) = \mathbb{E}_\pi\left[\sum_{k=0}^{\infty} \gamma^k R_{t+k+1} \,\middle|\, S_t = s, A_t = a\right]$$

wherein $\mathbb{E}_\pi[\cdot]$ denotes the expectation of the random variable when the agent follows policy π, $\gamma$ denotes the discount factor, $R_{t+k+1}$ denotes the feedback reward value at time t+k+1, s denotes the value assigned to $S_t$, and a denotes the value assigned to $A_t$.
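The discounted sum inside both expectations is an ordinary finite computation for any concrete reward sequence; a minimal sketch (the function name is our own):

```python
def discounted_return(rewards, gamma):
    """Compute sum_{k>=0} gamma**k * rewards[k]: the random variable whose
    expectation under policy pi defines both V_pi(s) and Q_pi(s, a)."""
    return sum((gamma ** k) * r for k, r in enumerate(rewards))

# With gamma = 0.5 and three unit rewards: 1 + 0.5 + 0.25 = 1.75
g = discounted_return([1.0, 1.0, 1.0], 0.5)
```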
And S336, obtaining the state-behavior deep Q network model based on the value functions.

Further, obtaining the state-behavior deep Q network model based on the value functions comprises: optimizing the deep Q network model by maximizing the state-value function and the behavior-value function.
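Maximizing the value functions is commonly realized through the Q-learning update rule. The tabular sketch below is an illustrative simplification of what a deep Q network does with a function approximator; names and hyperparameters are our own:

```python
def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step: move Q[s][a] toward the Bellman target
    r + gamma * max_a' Q[s_next][a']."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])
    return Q

Q = [[0.0, 0.0], [1.0, 2.0]]   # 2 states x 2 behaviors
q_update(Q, s=0, a=0, r=1.0, s_next=1)
```

In the deep setting the table Q is replaced by a neural network trained on the same target, typically with an experience replay buffer and a periodically synchronized target network.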
Specifically, after the data enhancement images and data enhancement behaviors corresponding to each original image are acquired in step S2, the data enhancement images are combined into the state space set S of the environment and the data enhancement behaviors are combined into the behavior space set A of the agent, completing the configuration of the agent and environment of the deep Q network. The configured agent then interacts with the environment to obtain the state-behavior deep Q network model corresponding to the trained model.
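As a sketch of the behavior space A, the enhancement behaviors described in this document (flipping, rotating, brightness adjustment) can be written as plain functions over a toy image represented as a list of pixel rows; the function names and the 2x2 image are illustrative only:

```python
def flip(img):
    """Horizontal flip: reverse each pixel row."""
    return [row[::-1] for row in img]

def rotate90(img):
    """Rotate the image 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]

def brighten(img, delta=10):
    """Increase brightness, clamping pixel values at 255."""
    return [[min(255, p + delta) for p in row] for row in img]

# Behavior space A = {A_1, ..., A_K}, indexed by k
ACTIONS = {0: flip, 1: rotate90, 2: brighten}

img = [[1, 2],
       [3, 4]]
```

In practice these would be image-library transforms, but the point is only that each behavior A_k is a deterministic mapping the agent can select by index.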
FIG. 4 is a schematic diagram of the interaction between the agent and the environment of a deep Q network according to one embodiment of the present invention. In FIG. 4, Action represents a (data enhancement) behavior, State represents the state of the environment, and Reward represents the feedback reward. As shown in FIG. 4, during the interaction between the agent and the environment, the initial state is the original test image X. Each time the environment enters a new state S, the new state is input into the trained model to obtain a prediction result; the prediction result is evaluated against the real label of the test image X, and a feedback reward is provided as feedback. The real label is the content the image truly depicts: for an image of a cat, for example, "cat" is the real label. The image is input into the trained model to obtain a prediction result, and the corresponding feedback reward value is calculated according to how well the prediction result matches the real label.
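A feedback reward computed from the match between the prediction and the real label might look like the following. The exact reward shaping (+1/-1 with an optional time penalty) is our assumption; the patent only states that the reward reflects the degree of match:

```python
def reward_from_prediction(pred_label, true_label, pred_time=0.0, time_penalty=0.01):
    """Illustrative reward: +1 for a correct prediction, -1 otherwise,
    minus a small penalty proportional to recognition time."""
    base = 1.0 if pred_label == true_label else -1.0
    return base - time_penalty * pred_time
```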
Then, the control strategy π of the agent and the value functions corresponding to the control strategy π are acquired, the value functions are maximized, the state-behavior deep Q network model is obtained through optimization, and the optimal strategy, namely the target enhancement behavior, is obtained through the deep Q network model. The purpose of the agent is to learn a control strategy π that maximizes the expected reward.
It should be noted that, in the embodiment of the present invention, there may be one or more target enhancement behaviors, and when there is one target enhancement behavior, the target enhancement behavior is used to perform data enhancement on the test image, and the enhanced test image is input into the model to obtain a prediction result correspondingly; and when the target enhancement behaviors are multiple, performing data enhancement on the test image by using the multiple target enhancement behaviors respectively, and inputting the enhanced test image into the model in sequence to correspondingly obtain multiple prediction results.
In an embodiment of the present invention, the step S6 of performing quality inspection on the workpiece according to at least one prediction result may include: when the number of the prediction results is multiple, preprocessing the multiple prediction results; and performing quality inspection on the workpiece according to the prediction result after pretreatment.
Specifically, after obtaining the plurality of prediction results, preprocessing may be performed, for example, accuracy and recognition time corresponding to the plurality of prediction results are determined, the prediction result with the highest accuracy and the shortest recognition time is selected, and other prediction results are deleted, so that the processed prediction results only include the prediction result with the highest accuracy and the shortest recognition time, and industrial quality inspection is implemented according to the prediction results.
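The preprocessing just described (keep the result with the highest accuracy and shortest recognition time, delete the rest) can be sketched as follows; the result-dictionary keys are our own naming, not from the patent:

```python
def select_best(results):
    """Return the single result with the highest accuracy,
    breaking ties by the shortest recognition time."""
    return max(results, key=lambda r: (r["accuracy"], -r["time"]))

results = [
    {"label": "defect", "accuracy": 0.90, "time": 3.0},
    {"label": "ok",     "accuracy": 0.95, "time": 2.0},
    {"label": "ok",     "accuracy": 0.95, "time": 1.0},
]
best = select_best(results)
```

The tuple key makes accuracy the primary criterion and negated time the tiebreaker, matching the two-level selection rule in the text.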
Therefore, on the basis that the model outputs a plurality of prediction results, the prediction results are screened, the accuracy of industrial quality inspection is further ensured, and the efficiency of industrial quality inspection is improved.
In summary, the embodiment of the invention provides a model test mode with high efficiency and good robustness for industrial quality inspection. A deep Q network model is adopted to make a decision on a data enhancement mode, a general idea is provided for training and testing of a deep learning algorithm, and a convenient and efficient model identification mode is provided for industrial quality inspection.
Corresponding to the workpiece quality inspection method of the above embodiment, the invention further provides a workpiece quality inspection device.
Fig. 5 is a block diagram of a workpiece quality inspection apparatus according to an embodiment of the present invention.
As shown in fig. 5, the workpiece quality inspection apparatus 100 includes: the system comprises a first acquisition module 10, a second acquisition module 20, a training module 30, a first determination module 40, a second determination module 50 and a quality inspection module 60.
The first acquisition module 10 is used for acquiring a test image corresponding to a workpiece to be tested and a trained neural network model; the second obtaining module 20 is configured to obtain training samples of the deep Q network, where the training samples include: a plurality of original images, and data enhancement images and data enhancement behaviors corresponding to each original image; the training module 30 is configured to construct a deep Q network, and perform learning training on the deep Q network based on the training samples to obtain a state-behavior deep Q network model corresponding to the neural network model; the first determining module 40 is configured to input the test image into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image; the second determining module 50 is configured to perform data enhancement on the test image according to the at least one target enhancement behavior, and input the enhanced test image into the neural network model to obtain at least one prediction result; the quality inspection module 60 is used for performing quality inspection on the workpiece according to the at least one prediction result.
In one embodiment of the present invention, training module 30 may include: a first composing unit for composing the plurality of data enhancement images into a state space set S of the environment, S = {S_1, …, S_t, S_{t+1}, …, S_T}, wherein S_t represents the state of the environment at time t, S_{t+1} represents the state of the environment at time t+1, and S_T represents the final state of the environment; a second composing unit for composing the plurality of data enhancement behaviors into a behavior space set A of the agent, A = {A_1, A_2, …, A_K}, wherein A_k represents the k-th data enhancement behavior; and an interaction unit for obtaining the state-behavior deep Q network model based on the interaction between the agent and the environment.
In an embodiment of the present invention, the interaction unit may be specifically configured to: obtain the environmental state S_t at time t; input the environmental state S_t at time t into the agent, so that the agent performs the data enhancement behavior A_t at time t according to the environmental state S_t; after the agent performs the data enhancement behavior A_t, change the state of the environment to the environmental state S_{t+1} at time t+1, and meanwhile feed the feedback reward value at time t+1 back to the agent so that the agent performs the data enhancement behavior at time t+1; obtain the feedback reward value corresponding to each time, so as to combine the feedback reward values into a feedback reward set R = {R_1, …, R_t, R_{t+1}, …}, wherein R_t represents the feedback reward value at time t and R_{t+1} represents the feedback reward value at time t+1; determine the value function of the agent according to each feedback reward value; and obtain the state-behavior deep Q network model based on the value function.
In an embodiment of the invention, the interaction unit, when determining the cost function of the agent according to the respective feedback reward values, may be specifically configured to: acquiring a control strategy pi of an agent and a state-value function and a behavior-value function corresponding to the control strategy pi; the interaction unit, when obtaining the state-behavior depth Q network model based on the cost function, may be specifically configured to: the state-behavior deep Q network model is optimized by maximizing the state-value function and the behavior-value function.
In one embodiment of the present invention, the state-value function corresponding to the control strategy π is:

$$V_\pi(s) = \mathbb{E}_\pi\left[\sum_{k=0}^{\infty} \gamma^k R_{t+k+1} \,\middle|\, S_t = s\right]$$

and the behavior-value function corresponding to the control strategy π is:

$$Q_\pi(s, a) = \mathbb{E}_\pi\left[\sum_{k=0}^{\infty} \gamma^k R_{t+k+1} \,\middle|\, S_t = s, A_t = a\right]$$

wherein $\mathbb{E}_\pi[\cdot]$ denotes the expectation of the random variable when the agent follows policy π, $\gamma$ denotes the discount factor, $R_{t+k+1}$ denotes the feedback reward value at time t+k+1, s denotes the value assigned to $S_t$, and a denotes the value assigned to $A_t$.
In one embodiment of the present invention, the quality inspection module 60 includes: the processing unit is used for preprocessing a plurality of prediction results when the number of the prediction results is multiple; and the quality inspection unit is used for performing quality inspection on the workpiece according to the preprocessed prediction result.
In one embodiment of the invention, the plurality of data enhancement behaviors include: flipping, rotating, zooming and brightness adjustment.
It should be noted that, for the specific implementation and implementation principle of the workpiece quality inspection apparatus, reference may be made to the specific implementation of the workpiece quality inspection method, and details are not described here again to avoid redundancy.
According to the workpiece quality inspection device provided by the embodiment of the invention, before the workpiece quality inspection is realized by adopting the neural network model, the deep Q network model is trained, the optimal data enhancement behavior is determined through the deep Q network model, then the data enhancement is carried out on the test image through the optimal data enhancement behavior, the test image is input into the neural network model after the data enhancement, and the quality inspection is realized according to the prediction result of the neural network model. Therefore, the testing effect can be ensured, and meanwhile, the testing time is greatly reduced, so that the workpiece quality inspection efficiency is improved, and the workpiece quality inspection accuracy is improved.
The invention further provides a computer device corresponding to the embodiment.
The computer device of the embodiment of the invention comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, and when the processor executes the computer program, the workpiece quality inspection method of the embodiment of the invention can be realized.
According to the computer device provided by the embodiment of the invention, when the processor executes the computer program stored in the memory, the test image corresponding to the workpiece to be inspected and the trained neural network model are first obtained, and training samples for the deep Q network are acquired. The deep Q network is then constructed and trained on the training samples to obtain the state-behavior deep Q network model corresponding to the neural network model. The test image is input into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image; the test image is then enhanced according to the at least one target enhancement behavior, and the enhanced test image is input into the neural network model to obtain at least one prediction result. Finally, the workpiece is quality-inspected according to the at least one prediction result.
According to the computer equipment provided by the embodiment of the invention, before the neural network model is adopted to realize the workpiece quality inspection, the deep Q network model is trained, the optimal data enhancement behavior is determined through the deep Q network model, the optimal data enhancement behavior is further used for carrying out data enhancement on the test image, the test image is input into the neural network model after the data enhancement, and the quality inspection is realized according to the prediction result of the neural network model. Therefore, the testing effect can be ensured, and meanwhile, the testing time is greatly reduced, so that the workpiece quality inspection efficiency is improved, and the workpiece quality inspection accuracy is improved.
In the description of the present invention, the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implying any number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. The meaning of "plurality" is two or more unless specifically limited otherwise.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.
It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.
It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments. In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.
Claims (10)
1. A method of inspecting a workpiece, comprising:
acquiring a test image corresponding to a workpiece to be tested and a trained neural network model;
obtaining training samples of a deep Q network, the training samples comprising: the method comprises the following steps that a plurality of original images, data enhancement images corresponding to the original images and data enhancement behaviors are obtained;
constructing a deep Q network, and performing learning training on the deep Q network based on the training sample to obtain a state-behavior deep Q network model corresponding to the neural network model;
inputting the test image into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image;
performing data enhancement on the test image according to the at least one target enhancement behavior, and inputting the enhanced test image into the neural network model to obtain at least one prediction result;
and performing quality inspection on the workpiece according to the at least one prediction result.
2. The workpiece quality inspection method according to claim 1, wherein learning and training the deep Q network based on the training samples to obtain a state-behavior deep Q network model corresponding to the neural network model comprises:
forming a plurality of data enhancement images into a state space set S of an environment, S = {S_1, …, S_t, S_{t+1}, …, S_T}, wherein S_t indicates the state of the environment at time t, S_{t+1} indicates the state of the environment at time t+1, and S_T represents the final state of the environment;
composing the plurality of data enhancement behaviors into a behavior space set A of the agent, A = {A_1, A_2, …, A_K}, wherein A_k represents the k-th data enhancement behavior;
and obtaining the state-behavior deep Q network model based on the interaction between the agent and the environment.
3. The method of claim 2, wherein deriving the state-behavior deep Q-network model based on interactions between the agent and the environment comprises:
obtaining the environmental state S_t at the time t;

inputting the environmental state S_t at the time t into the agent, so that the agent performs the data enhancement behavior A_t at the time t according to the environmental state S_t;

after the agent performs the data enhancement behavior A_t at the time t, changing the state of the environment to the environment state S_{t+1} at the time t+1, and meanwhile feeding the feedback reward value at the time t+1 back to the agent so that the agent performs the data enhancement behavior at the time t+1;

obtaining the feedback reward value corresponding to each time, so as to combine the feedback reward values into a feedback reward set R = {R_1, …, R_t, R_{t+1}, …}, wherein R_t represents the feedback reward value at the time t and R_{t+1} represents the feedback reward value at the time t+1;

determining a value function of the agent according to each feedback reward value;

and obtaining the state-behavior deep Q network model based on the value function.
4. The method according to claim 3, wherein determining the value function of the agent according to the respective feedback reward values comprises:

acquiring a control strategy π of the agent, and a state-value function and a behavior-value function corresponding to the control strategy π;

and obtaining the state-behavior deep Q network model based on the value function comprises:

optimizing the state-behavior deep Q network model by maximizing the state-value function and the behavior-value function.
5. The method according to claim 4, wherein the state-value function corresponding to the control strategy π is:

$$V_\pi(s) = \mathbb{E}_\pi\left[\sum_{k=0}^{\infty} \gamma^k R_{t+k+1} \,\middle|\, S_t = s\right]$$

and the behavior-value function corresponding to the control strategy π is:

$$Q_\pi(s, a) = \mathbb{E}_\pi\left[\sum_{k=0}^{\infty} \gamma^k R_{t+k+1} \,\middle|\, S_t = s, A_t = a\right]$$
6. The method of claim 1, wherein inspecting the workpiece based on the at least one predictor comprises:
when the number of the prediction results is multiple, preprocessing the multiple prediction results;
and performing quality inspection on the workpiece according to the prediction result after pretreatment.
7. The method of claim 1, wherein the plurality of data enhancement behaviors comprise: flipping, rotating, zooming and brightness adjustment.
8. A workpiece quality inspection apparatus, comprising:
the first acquisition module is used for acquiring a test image corresponding to a workpiece to be tested and a trained neural network model;
a second obtaining module, configured to obtain a training sample of the deep Q network, where the training sample includes: the method comprises the following steps that a plurality of original images, data enhancement images corresponding to the original images and data enhancement behaviors are obtained;
the training module is used for constructing a deep Q network and carrying out learning training on the deep Q network based on the training samples so as to obtain a state-behavior deep Q network model corresponding to the neural network model;
a first determining module, configured to input the test image into the deep Q network model to obtain at least one target enhancement behavior corresponding to the test image;
the second determination module is used for performing data enhancement on the test image according to the at least one target enhancement behavior and inputting the enhanced test image into the neural network model to obtain at least one prediction result;
and the quality inspection module is used for performing quality inspection on the workpiece according to the at least one prediction result.
9. The workpiece quality inspection apparatus of claim 8, wherein the training module comprises:
a first composing unit for composing the plurality of data enhancement images into a state space set S of the environment, S = {S_1, …, S_t, S_{t+1}, …, S_T}, wherein S_t indicates the state of the environment at time t, S_{t+1} indicates the state of the environment at time t+1, and S_T represents the final state of the environment;

a second composing unit for composing the plurality of data enhancement behaviors into a behavior space set A of the agent, A = {A_1, A_2, …, A_K}, wherein A_k represents the k-th data enhancement behavior;
and the interaction unit is used for obtaining the state-behavior deep Q network model based on the interaction between the agent and the environment.
10. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the computer program, implements the method of workpiece quality inspection according to any one of claims 1-7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111223818.6A CN113658183B (en) | 2021-10-21 | 2021-10-21 | Workpiece quality inspection method and device and computer equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111223818.6A CN113658183B (en) | 2021-10-21 | 2021-10-21 | Workpiece quality inspection method and device and computer equipment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113658183A CN113658183A (en) | 2021-11-16 |
CN113658183B true CN113658183B (en) | 2022-02-08 |
Family
ID=78484343
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111223818.6A Active CN113658183B (en) | 2021-10-21 | 2021-10-21 | Workpiece quality inspection method and device and computer equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113658183B (en) |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107784661B (en) * | 2017-09-08 | 2021-10-08 | 上海电力学院 | Transformer substation equipment infrared image classification and identification method based on region growing method |
CN211839079U (en) * | 2019-11-22 | 2020-11-03 | 深圳信息职业技术学院 | Chip detecting, sorting and counting system |
CN111489334B (en) * | 2020-04-02 | 2023-08-08 | 暖屋信息科技(苏州)有限公司 | Defect workpiece image recognition method based on convolution attention neural network |
CN112633245B (en) * | 2020-12-31 | 2023-01-06 | 西安交通大学 | Planetary gear box fault diagnosis method based on deep reinforcement learning model |
Also Published As
Publication number | Publication date |
---|---|
CN113658183A (en) | 2021-11-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110619618B (en) | Surface defect detection method and device and electronic equipment | |
US10818000B2 (en) | Iterative defect filtering process | |
JP7074460B2 (en) | Image inspection equipment and methods | |
US11715190B2 (en) | Inspection system, image discrimination system, discrimination system, discriminator generation system, and learning data generation device | |
CN110136101B (en) | Tire X-ray defect detection method based on twinning distance comparison | |
JP2021515927A (en) | Lighting condition setting method, devices, systems and programs, and storage media | |
JP2020077326A (en) | Photographing method and photographing device | |
CN114170227B (en) | Product surface defect detection method, device, equipment and storage medium | |
CN114945938A (en) | Method and device for detecting actual area of defect and method and device for detecting display panel | |
CN112070762A (en) | Mura defect detection method and device for liquid crystal panel, storage medium and terminal | |
CN113554645B (en) | Industrial anomaly detection method and device based on WGAN | |
CN118115835A (en) | Light guide plate defect small sample data expansion method, system, equipment and storage medium | |
CN113658183B (en) | Workpiece quality inspection method and device and computer equipment | |
CN113592859A (en) | Deep learning-based classification method for defects of display panel | |
CN115861305A (en) | Flexible circuit board detection method and device, computer equipment and storage medium | |
US20230401670A1 (en) | Multi-scale autoencoder generation method, electronic device and readable storage medium | |
CN116777861A (en) | Marking quality detection method and system for laser engraving machine | |
CN115601341A (en) | Method, system, equipment, medium and product for detecting defects of PCBA (printed circuit board assembly) board | |
CN115222691A (en) | Image defect detection method, system and related device | |
KR102434442B1 (en) | Method of performing defect inspection of inspection object at high speed and apparatuses performing the same | |
JP7118365B2 (en) | Image inspection device | |
CN112862002A (en) | Training method of multi-scale target detection model, target detection method and device | |
KR20220010516A (en) | Inspection Device, Inspection Method and Inspection Program, and Learning Device, Learning Method and Learning Program | |
JP7549736B2 (en) | Visual inspection method and visual inspection system | |
JP7397404B2 (en) | Image processing device, image processing method, and image processing program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||