CN116109982B - Biological sample collection validity checking method based on artificial intelligence

Biological sample collection validity checking method based on artificial intelligence

Info

Publication number: CN116109982B (application number CN202310121257.1A)
Authority: CN (China)
Prior art keywords: input, output, function, conv2d, algorithm
Legal status: Active (granted)
Other languages: Chinese (zh)
Other versions: CN116109982A
Inventors: 刘志岩, 张东
Current Assignee / Original Assignee: Harbin Xingyun Intelligent Manufacturing Technology Co ltd
Application filed by Harbin Xingyun Intelligent Manufacturing Technology Co ltd; priority date and filing date: 2023-02-16
Publication of CN116109982A: 2023-05-12; publication of CN116109982B (grant): 2023-07-28


Classifications

    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06V — IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/41 — Scenes; scene-specific elements in video content; higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • G06V10/764 — Arrangements for image or video recognition or understanding using pattern recognition or machine learning; using classification, e.g. of video objects
    • G06V10/82 — Arrangements for image or video recognition or understanding using pattern recognition or machine learning; using neural networks
    • G06V10/955 — Hardware or software architectures specially adapted for image or video understanding; using specific electronic processors
    • G16H — HEALTHCARE INFORMATICS
    • G16H50/80 — ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
    • Y02A90/10 — Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Abstract

The invention discloses an artificial intelligence-based biological sample collection validity test method, which comprises the following steps: S1, acquiring oral cavity images and pharyngeal swab collection process images of a user through a high-definition video camera; S2, preprocessing the images to obtain video frames; S3, calculating the polygon vertex coordinates of the pharyngeal collection area through a collection-site indication algorithm; S4, receiving an input video frame through a swab-head recognition algorithm, outputting the polygon vertex coordinates of the pharyngeal swab head in the video frame, and recognizing the swab-head portion; S5, comprehensively evaluating the contact degree and contact position, in combination with the original image, through a swab-head/collection-site contact detection algorithm, and giving a corresponding score; S6, detecting and judging a number of consecutive video frames of the video stream through an action completion evaluation algorithm. The invention solves the problem that the prior art lacks a method for recognizing, without a mechanical arm, whether the collection site and the swab head have made effective contact during pharyngeal swab collection and whether the test has been completed.

Description

Biological sample collection validity checking method based on artificial intelligence
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to an artificial intelligence-based biological sample collection validity checking method.
Background
At present, nucleic acid testing is an effective measure for responding to COVID-19 outbreaks, and the nucleic acid testing field generally comprises a collection end and a testing end. At the collection end, manual collection is still dominant, and each sampling booth must be staffed with a data entry clerk and a nurse to collect pharyngeal swabs. To save human resources, unattended automatic nucleic acid collection booths are one direction currently being researched by companies in this field.
Most current unattended automatic collection booths are implemented by having a robot use a mechanical arm and gripper to hold the swab and extend it into the mouth of the person being tested to collect the pharyngeal swab. This approach has three distinct disadvantages: 1) the mechanical arm puts considerable psychological pressure on the person being tested; 2) during the arm's travel, evasion and movement by the person being tested can cause collection failure, resulting in long collection times and low collection efficiency; and 3) the purchase and maintenance costs of the mechanical arm are high, which is unfavorable for wide deployment and upkeep.
The defect of the prior art is the lack of a method for recognizing whether the pharyngeal collection site and the swab head have made effective contact during pharyngeal swab collection and whether the test has been completed, without relying on a mechanical arm.
Disclosure of Invention
To address the problem that the prior art lacks a way to recognize, without a mechanical arm, whether the collection site and the swab head have made effective contact during pharyngeal swab collection and whether the test has been completed, the invention provides an artificial intelligence-based biological sample collection validity test method.
In order to achieve the technical purpose, the invention adopts the following technical scheme:
an artificial intelligence based biological sample collection validity test method comprises the following steps:
s1, acquiring an oral cavity image and a pharyngeal swab acquisition process image of a user through a high-definition video camera;
s2, preprocessing the image to obtain a video frame;
s3, calculating polygon vertex coordinates of a pharyngeal collecting area through a collecting part indication algorithm;
s4, receiving an input video frame through a swab head identification algorithm, outputting polygon vertex coordinate information of a pharyngeal swab head in the video frame, and identifying a pharyngeal swab head part;
s5, comprehensively evaluating the contact degree and the contact position by combining an original image through a swab head and detection position contact detection algorithm, and giving corresponding scores;
s6, detecting and judging a plurality of consecutive video frames of the video stream through an action completion evaluation algorithm, and, when the collection action recorded by the video stream reaches the condition specified by the algorithm, outputting a classification result that the collection action has been completed. The overall flow is illustrated by the sketch below.
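By way of illustration only, the flow of steps S1-S6 can be sketched in Python as follows; the callable parameters, function names and the 0.5 per-frame cutoff are assumptions introduced here for readability and are not part of the claimed method.

```python
from collections import deque
from typing import Any, Callable, Iterable

def check_collection_validity(frames: Iterable[Any],
                              preprocess: Callable[[Any], Any],
                              contact_probability: Callable[[Any], float],
                              fps: int,
                              score_threshold: int,
                              cp_threshold: float = 0.5) -> bool:
    # S1: `frames` are the video frames captured by the high-definition camera.
    # S2-S5 are wrapped inside the two callables passed in by the caller; the callables
    # and the 0.5 per-frame cutoff are assumptions for illustration.
    window = deque(maxlen=fps * 2)                    # decisions J for the frames of the last 2 seconds
    for raw in frames:
        frame = preprocess(raw)                       # S2: e.g. YUV -> BGR and center crop
        cp = contact_probability(frame)               # S3-S5: contact probability in (0, 1)
        window.append(1 if cp >= cp_threshold else 0) # S6: per-frame decision J
        if sum(window) >= score_threshold:            # threshold chosen from (0, fps*2 - 1)
            return True                               # collection action judged complete
    return False
```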
Further, the acquisition part indication algorithm is a UNet image segmentation algorithm, and the definition of the UNet network structure function of the UNet image segmentation algorithm comprises the following steps:
S301, defining a Conv2d two-dimensional image convolution function, the calculation formula being:
output(N_i, C_out_j) = bias(C_out_j) + Σ_{k=0}^{C_in−1} weight(C_out_j, k) ⋆ padding(input(N_i, k), P)
wherein:
the ⋆ symbol is the cross-correlation operator, implemented here using a matrix convolution operation;
N is the number of image frames input in one calculation, i.e., the batch-size value;
C_in is the number of channels of the input image and C_out is the number of channels of the output image; for the original RGB image C = 3, and for an intermediate-layer feature map it is the actual number of feature-map channels;
input represents the input image four-dimensional tensor, whose shape is a four-tuple (image width in pixels, image height in pixels, number of image channels, batch-size value).
output represents the output image four-dimensional tensor, whose shape is a four-tuple (image width in pixels, image height in pixels, number of image channels, batch-size value).
The padding function expands the input four-dimensional image tensor by P pixel values.
weight represents the convolution kernel weight parameter, a four-dimensional tensor whose shape is a four-tuple (convolution kernel width, convolution kernel height, number of output image channels, batch-size value). The Conv2d operation is specified to use a fixed 3×3 convolution kernel, and the stride S is fixed to 1.
bias represents the bias weight tensor of this Conv2d calculation, and its shape is consistent with weight.
The input-output image size of Conv2d is calculated by the following formula:
W_out = (W_in − ks + 2P) / S + 1
H_out = (H_in − ks + 2P) / S + 1
wherein:
W_in is the input image size;
W_out is the output image size;
ks is the convolution kernel size;
P and S are consistent with the Conv2d operation definition;
in the definition of the Conv2d operation we take P = 1, S = 1 and ks = 3, so the output image size of the Conv2d operation is the same as the input: W_out = W_in, H_out = H_in.
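A minimal sketch of this fixed 3×3, stride-1, padding-1 convolution, assuming PyTorch as the implementation framework (the patent does not name one):

```python
import torch
import torch.nn as nn

# With kernel size ks = 3, stride S = 1 and padding P = 1, the output spatial size equals
# the input: W_out = (W_in - 3 + 2*1) / 1 + 1 = W_in.
conv = nn.Conv2d(in_channels=3, out_channels=64, kernel_size=3, stride=1, padding=1)
x = torch.randn(8, 3, 512, 512)   # one batch of N = 8 RGB frames, 512 x 512
y = conv(x)
print(y.shape)                    # torch.Size([8, 64, 512, 512]) - spatial size preserved
```

The Class2d classification function defined next would correspond to the same call with kernel_size=1.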
S302, defining a classification function, which uses a fixed 1×1 convolution kernel and is otherwise completely consistent with Conv2d; it is denoted Class2d;
s303, defining a Maxpool function:
the max operation applies a 2×2 kernel with stride 2 and takes the element-wise maximum, giving the following input-output size relation:
W_out = W_in / 2
H_out = H_in / 2
s304, defining a Relu function as an activation function of a conv2d function:
output(N, k, W, H) = relu(input(N, k, W, H)) = max(0, input(N, k, W, H))
S305, defining an upsampling function upsampled:
wherein weight is the upsampling weight tensor.
S306, defining a channel feature map stacking function:
wherein C_out = C_in1 + C_in2; the + operator simply stacks, along the channel dimension, two feature maps of the same spatial size.
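The Maxpool, Relu, upsampled and channel-cat operations of S303-S306 can be illustrated with the following sketch (PyTorch assumed; the transposed convolution is only one possible realization of the weighted upsampled function, which is not spelled out above):

```python
import torch
import torch.nn as nn

x = torch.randn(8, 64, 512, 512)                 # (N, C, H, W) feature map

pool = nn.MaxPool2d(kernel_size=2, stride=2)
p = pool(x)                                      # (8, 64, 256, 256): W_out = W_in / 2, H_out = H_in / 2

r = torch.relu(p)                                # element-wise max(0, x)

# "upsampled" uses a weight tensor; a learned transposed convolution that doubles the
# spatial size is an assumed realization.
up = nn.ConvTranspose2d(64, 64, kernel_size=2, stride=2)
u = up(r)                                        # (8, 64, 512, 512)

c = torch.cat([x, u], dim=1)                     # channel cat: C_out = 64 + 64 = 128
print(p.shape, u.shape, c.shape)
```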
S307, defining a feature map residual network action judging function resnet:
firstly defining the residual network feature extraction function of a certain layer l:
output_{l+1} = input_l + F(input_l, weight_l)
where input and output are the input and output tensors, whose size is determined by (N, C, W, H).
The F calculation consists of a single sequential combination of an arithmetic normalization of each element of the tensor, a Conv2d calculation and a Relu calculation.
Continuing to define the residual network feature extraction function for any layer L starting from layer 1:
output_L = input_1 + Σ_{l=1}^{L−1} F(input_l, weight_l)
s308 defines a full link layer fc and a softmax layer function fc_softmax,
the softmax function is:
softmax(x_i) = e^{x_i} / Σ_j e^{x_j}
the fc_softmax function is:
output_o = fc_softmax(input_i) = softmax(weight_{i,o} × input_i + bias_{i,o})
where weight is the linear weight matrix and bias is the bias matrix.
To pass through the fully connected layer, the tensor of size (N, C, W, H) is first flattened into a batch of one-dimensional tensors of size (N, C×W×H); these are then calculated with the weight matrix and the bias matrix to obtain the final probability values of meeting and of not meeting the collection-action completion criterion.
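A minimal sketch of the resnet feature-extraction step and the fc_softmax head (PyTorch assumed; BatchNorm is an assumed choice for the "arithmetic normalization", which is not specified further):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    # One residual step: output_{l+1} = input_l + F(input_l, weight_l), where F is
    # normalization -> Conv2d -> Relu.
    def __init__(self, channels: int):
        super().__init__()
        self.norm = nn.BatchNorm2d(channels)
        self.conv = nn.Conv2d(channels, channels, kernel_size=3, stride=1, padding=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + torch.relu(self.conv(self.norm(x)))

class FcSoftmax(nn.Module):
    # Flatten (N, C, W, H) to (N, C*W*H), apply the linear weight and bias, then softmax
    # over the two classes (meets / does not meet the completion criterion).
    def __init__(self, in_features: int, num_classes: int = 2):
        super().__init__()
        self.fc = nn.Linear(in_features, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return F.softmax(self.fc(torch.flatten(x, start_dim=1)), dim=1)
```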
Further, on the basis of the UNet image segmentation algorithm, the steps of constructing the acquisition part indication algorithm are as follows:
S3001, a 1920×1080 YUV video frame input_001 is input through the video acquisition device;
S3002, image preprocessing: the color channels of input_001 are converted to BGR channels, and the image is center-cropped to obtain a 512×512×3 tensor output_002, where the first two 512 values represent the image width and height and the third value 3 represents the blue, green and red color component channels;
S3003, setting the input video frame of each step as input_n = relu(output_{n−1}) or input_n = output_{n−1}, and performing Conv2d, Maxpool, upsampled and channel-cat calculations until the 64-channel feature map output_036 of the collection-site indication area is obtained;
S3004, repeating step S3003 to obtain the 64-channel feature map of the swab-head identification area, denoted output′_036;
S3005, setting input_037 = relu(output_036) and performing a Class2d operation:
output_037(N=8, C_out=2, W=512, H=512) = Class2d(input_037, N=8, C_in=64, C_out=2, W=512, H=512, P=1)
Through the above steps, the pixel classification information of the collection-site indication area and of the swab-head identification area is obtained respectively; one class is the background pixels and the other class is the collection-site or swab-head pixels.
The two classifications are randomly initialized when the model first runs, and the accuracy of the classification information is gradually improved through the training steps below.
Further, cross-entropy evaluation is carried out on the pixel classification information of step S3005 through the training loss function, and an inverse gradient calculation is performed; the training loss function is defined using the cross-entropy function, whose formula is:
H(p, q) = −Σ_x p(x) log q(x)
The two-class calculation result output_037 is evaluated with the cross entropy against the two-class target labels of the annotated input video, and the inverse gradient calculation is performed. The weight and bias values in the Conv2d, Class2d, upsampled and other functions are updated so that H(p, q) tends to a minimum, yielding the collection-site indication and swab-head recognition algorithm model. The collection-site and swab-head pixels indicated by the collection-site indication and swab-head recognition algorithms are blended with the original pixel values in different colors, so that the collection site and the swab head are marked in the video frame, guiding the person being tested through the collection operation.
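A hedged sketch of one such training step (PyTorch assumed; the optimizer is left to the caller, and nn.CrossEntropyLoss plays the role of the cross entropy H(p, q)):

```python
import torch
import torch.nn as nn

def train_step(model: nn.Module, frames: torch.Tensor, labels: torch.Tensor,
               optimizer: torch.optim.Optimizer) -> float:
    # `frames` is the preprocessed batch (N, 3, 512, 512); `labels` holds the annotated
    # per-pixel classes (N, 512, 512), 0 = background, 1 = collection site / swab head.
    criterion = nn.CrossEntropyLoss()       # cross entropy H(p, q) over the two classes
    logits = model(frames)                  # two-channel per-pixel result, cf. output_037
    loss = criterion(logits, labels)
    optimizer.zero_grad()
    loss.backward()                         # inverse (backward) gradient calculation
    optimizer.step()                        # update weight/bias so that H(p, q) decreases
    return loss.item()
```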
Further, the construction of the contact detection algorithm continues on the basis of the result output_036 of step S3003:
S5001, setting input_038 = relu(output_036) and input′_038 = relu(output′_036), and performing a channel cat operation;
S5002, setting input_039 = output_038 and performing a Conv2d operation:
output_039(N=8, C_out=64, W=512, H=512) = Conv2d(input_039, N=8, C_in=128, C_out=64, W=512, H=512, P=1)
S5003, setting input_040 = relu(output_039) and performing a Conv2d operation:
output_040(N=8, C_out=64, W=512, H=512) = Conv2d(input_040, N=8, C_in=64, C_out=64, W=512, H=512, P=1)
S5004, setting input_041 = relu(output_040) and performing a resnet_8 calculation with network depth 8:
output_041(N=8, C_out=64, W=512, H=512) = resnet_8(input_041, N=8, C_in=64, W=512, H=512, P=1)
S5005, setting input_042 = output_041 and performing the fc_softmax feature probability discrimination, the discrimination results being a contact class and a non-contact class:
output_042(N=8, o=2) = fc_softmax(input_042(N=8, i=64))
S5006, through steps S5001-S5005, the contact probability index of the pharyngeal swab with the pharyngeal collection site in the current video frame is obtained, with a value in the range (0, 1).
Further, after the contact detection algorithm provides the contact probability value, an action completion evaluation algorithm is finally defined to make the final decision on whether the sampling action has been completed:
defining a decision function:
defining a scoring function:
wherein fps is the current video-stream frame rate and cp is the confidence probability; the current collection-action completion score is obtained by accumulating the decision values J of the video frames within 2 seconds. A value in the score range (0, fps×2−1) may be set as the action-completion threshold, and collection may be considered complete when score is greater than or equal to this threshold.
Compared with the prior art, the invention has the following beneficial effects:
The algorithm modules provided by the invention can be used in an unattended self-service nucleic acid sampling booth and, together with a monocular high-definition camera, can determine whether the actions in the self-service sampling process of the person being tested have been completed. Compared with the mechanical-arm nucleic acid collection scheme, the psychological pressure on the person being tested is minimal, the one-pass collection success rate and collection efficiency are higher, and the construction cost is lower.
The invention integrates four algorithm modules (the collection-site indication algorithm, the swab-head recognition algorithm, the contact detection algorithm and the action completion evaluation algorithm) into one video processing flow, and this single flow completes, within one algorithm model, the three tasks required by the nucleic acid collection process: collection-site indication, pharyngeal swab-head recognition and indication, and action completion detection and evaluation.
Through hidden-layer sharing and result reuse within the model, the computing-platform resources required by the algorithm modules are saved to the greatest extent. The inference process of the algorithm modules can run on low-end CPUs such as the Intel i3-12100F; since there is no special CPU requirement, algorithm deployment cost is greatly reduced, and the modules can also run on mobile low-power platforms such as Android.
Compared with the common UNet model, by introducing a convolution kernel and padding of specific sizes, the Conv2d calculation does not change the size of the input feature map, which reduces the amount of computation and makes the model more general. It is suitable not only for nucleic acid testing scenarios but also for other self-service sample collection scenarios.
Drawings
FIG. 1 is a general flow chart of an artificial intelligence based biological specimen collection validity test method of the present invention;
FIG. 2 is a flowchart illustrating steps of an acquisition location indication algorithm according to an embodiment of the present invention;
FIG. 3 is a flow chart of a touch detection algorithm constructed in accordance with an embodiment of the present invention.
Detailed Description
The invention will be further described with reference to examples and drawings, to which reference is made, but which are not intended to limit the scope of the invention.
As shown in fig. 1, the present embodiment provides an artificial intelligence based biological sample collection validity test method, which includes the steps of: s1, acquiring an oral cavity image and a pharyngeal swab acquisition process image of a user through a high-definition video camera; s2, preprocessing the image to obtain a video frame; s3, calculating polygon vertex coordinates of a pharyngeal collecting area through a collecting part indication algorithm; s4, receiving an input video frame through a swab head identification algorithm, outputting polygon vertex coordinate information of a pharyngeal swab head in the video frame, and identifying a pharyngeal swab head part; s5, comprehensively evaluating the contact degree and the contact position by combining an original image through a swab head and detection position contact detection algorithm, and giving corresponding scores; s6, detecting and judging a plurality of continuous video frames of the video stream through an action completion evaluation algorithm. And when the acquisition action of the video stream record reaches the algorithm appointed condition, judging a classification result of the completion of the acquisition action.
The acquisition part indication algorithm is optimized on the basis of a UNet image segmentation algorithm, and the definition of the UNet network structure function of the UNet image segmentation algorithm comprises the following steps:
S301, defining a Conv2d two-dimensional image convolution function, the calculation formula being:
output(N_i, C_out_j) = bias(C_out_j) + Σ_{k=0}^{C_in−1} weight(C_out_j, k) ⋆ padding(input(N_i, k), P)
wherein:
the ⋆ symbol is the cross-correlation operator, implemented here using a matrix convolution operation;
N is the number of image frames input in one calculation, i.e., the batch-size value;
C_in is the number of channels of the input image and C_out is the number of channels of the output image; for the original RGB image C = 3, and for an intermediate-layer feature map it is the actual number of feature-map channels;
input represents the input image four-dimensional tensor, whose shape is a four-tuple (image width in pixels, image height in pixels, number of image channels, batch-size value).
output represents the output image four-dimensional tensor, whose shape is a four-tuple (image width in pixels, image height in pixels, number of image channels, batch-size value).
The padding function expands the input four-dimensional image tensor by P pixel values.
weight represents the convolution kernel weight parameter, a four-dimensional tensor whose shape is a four-tuple (convolution kernel width, convolution kernel height, number of output image channels, batch-size value). The Conv2d operation is specified to use a fixed 3×3 convolution kernel, and the stride S is fixed to 1.
bias represents the bias weight tensor of this Conv2d calculation, and its shape is consistent with weight.
The input-output image size of Conv2d is calculated by the following formula:
W_out = (W_in − ks + 2P) / S + 1
H_out = (H_in − ks + 2P) / S + 1
wherein:
W_in is the input image size;
W_out is the output image size;
ks is the convolution kernel size;
P and S are consistent with the Conv2d operation definition;
in the definition of the Conv2d operation we take P = 1, S = 1 and ks = 3, so the output image size of the Conv2d operation is the same as the input: W_out = W_in, H_out = H_in.
S302, defining a classification function, which uses a fixed 1×1 convolution kernel and is otherwise completely consistent with Conv2d; it is denoted Class2d;
s303, defining a Maxpool function:
the max operation applies a 2×2 kernel with stride 2 and takes the element-wise maximum, giving the following input-output size relation:
W_out = W_in / 2
H_out = H_in / 2
s304, defining a Relu function as an activation function of a conv2d function:
output(N, k, W, H) = relu(input(N, k, W, H)) = max(0, input(N, k, W, H))
S305, defining an upsampling function upsampled:
wherein weight is the upsampling weight tensor.
S306, defining a channel feature map stacking function:
wherein C_out = C_in1 + C_in2; the + operator simply stacks, along the channel dimension, two feature maps of the same spatial size.
S307, defining a feature map residual network action judging function resnet:
firstly, defining the residual network feature extraction function of a certain layer l:
output_{l+1} = input_l + F(input_l, weight_l)
where input and output are the input and output tensors, whose size is determined by (N, C, W, H).
The F calculation consists of a single sequential combination of an arithmetic normalization of each element of the tensor, a Conv2d calculation and a Relu calculation.
Continuing to define the residual network feature extraction function for any layer L starting from layer 1:
output_L = input_1 + Σ_{l=1}^{L−1} F(input_l, weight_l)
s308 defines a full link layer fc and a softmax layer function fc_softmax,
the softmax function is:
softmax(x_i) = e^{x_i} / Σ_j e^{x_j}
the fc_softmax function is:
output_o = fc_softmax(input_i) = softmax(weight_{i,o} × input_i + bias_{i,o})
where weight is the linear weight matrix and bias is the bias matrix.
To pass through the fully connected layer, the tensor of size (N, C, W, H) is first flattened into a batch of one-dimensional tensors of size (N, C×W×H); these are then calculated with the weight matrix and the bias matrix to obtain the final probability values of meeting and of not meeting the collection-action completion criterion.
As shown in fig. 2, on the basis of the UNet image segmentation algorithm, the steps of constructing an acquisition part indication algorithm include:
S3001, a 1920×1080 YUV video frame input_001 is input through the video acquisition device;
S3002, preprocessing the image: the color channels of input_001 are converted to BGR channels, and
the image is center-cropped to obtain a 512×512×3 tensor output_002, wherein the first two 512 values represent the width and height of the image, and the third value 3 represents the blue, green and red color component channels.
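By way of illustration, the preprocessing of S3001-S3002 could look as follows with OpenCV; the exact YUV layout (NV12 here) and the crop placement are assumptions not fixed by the description:

```python
import cv2
import numpy as np

def preprocess(yuv_frame: np.ndarray) -> np.ndarray:
    # Convert the 1920x1080 YUV frame (NV12 layout assumed) to BGR and center-crop
    # a 512x512 patch, giving the 512x512x3 tensor output_002.
    bgr = cv2.cvtColor(yuv_frame, cv2.COLOR_YUV2BGR_NV12)
    h, w = bgr.shape[:2]
    top, left = (h - 512) // 2, (w - 512) // 2
    return bgr[top:top + 512, left:left + 512]
```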
S3003, setting the input video frame of each step as input_n = relu(output_{n−1}) or input_n = output_{n−1}, and performing Conv2d, Maxpool, upsampled and channel-cat calculations until the 64-channel feature map output_036 of the collection-site indication area is obtained; the specific steps are as follows:
set input 003 =output 002 And perform Conv2d operation
output 003 (N=8,C out =64,W=512,H=512)=Conv2d(input 003 ,N=8,C in =3,C out =64,W=512,H=512,P=1)
Set input 004 =relu(output 003 ) And perform Conv2d operation
output 004 (N=8,C out =64,W=512,H=512)=Conv2d(input 004 ,N=8,C in =64,C out =64,W=512,H=512,P=1)
Set input 005 =relu(output 004 ) And performing Maxpool operation
output_005(N=8,C=64,W=256,H=256)=maxpool(input_005,N=8,C=64,W=512,H=512)
Set input 006 =output 005 And perform Conv2d operation
output 006 (N=8,C out =128,W=256,H=256)=Conv2d(input 006 ,N=8,C in =64,C out =128,W=256,H=256,P=1)
Set input 007 =relu(output 006 ) And perform Conv2d operation
output 007 (N=8,C out =128,W=256,H=256)=Conv2d(input 007 ,N=8,C in =128,C out =128,W=256,H=256,P=1)
Set input 008 =relu(output 007 ) And performing Maxpool operation
output 008 (N=8,C=128,W=128,H=128)=maxpool(input 008 ,N=8,C=128,W=256,H=256)
Set input 009 =output 008 And perform Conv2d operation
output 009 (N=8,C out =256,W=128,H=128)=Conv2d(input 009 ,N=8,C in =128,C out =256,W=128,H=128,P=1)
Set input 010 =relu(output 009 ) And perform Conv2d operation
output 010 (N=8,C out =256,W=128,H=128)=Conv2d(input 010 ,N=8,C in =256,C out =256,W=128,H=128,P=1)
Set input 011 =relu(output 010 ) And performing Maxpool operation
output 011 (N=8,C=256,W=64,H=64)=maxpool(input 011 ,N=8,C=256,W=128,H=128)
Set input 012 =output 011 And perform Conv2d operation
output 012 (N=8,C out =512,W=64,H=64)=Conv2d(input 012 ,N=8,C in =256,C out =512,W=64,H=64,P=1)
Set input 013 =relu(output 012 ) And perform Conv2d operation
output 013 (N=8,C out =512,W=64,H=64)=Conv2d(input 013 ,N=8,C in =512,C out =512,W=64,H=64,P=1)
Set input 014 =relu(output 013 ) And performing Maxpool operation
output 014 (N=8,C=512,W=32,H=32)=maxpool(input 014 ,N=8,C=512,W=64,H=64)
Set input 015 =output 014 And perform Conv2d operation
output 015 (N=8,C out =1024,W=32,H=32)=Conv2d(input 015 ,N=8,C in =512,C out =1024,W=32,H=32,P=1)
Set input 016 =relu(output 015 ) And perform Conv2d operation
output 016 (N=8,C out =1024,W=32,H=32)=Conv2d(input 016 ,N=8,C in =1024,C out =512,W=32,H=32,P=1)
Set input 017 =relu(output 016 ) And perform the upsampled operation
output 017 (N=8,C=1024,W=64,H=64)=upsampled(input 017 ,N=8,C=1024,W=32,H=32)
Set input 018 =output 017 And perform Conv2d operation
output 018 (N=8,C out =512,W=64,H=64)=Conv2d(input 018 ,N=8,C in =1024,C out =512,W=64,H=64,P=1)
Set input 019 =relu(output 018 ) And perform a channel cat operation
Set input 020 =output 019 And perform Conv2d operation
output 020 (N=8,C out =512,W=64,H=64)=Conv2d(input 020 ,N=8,C in =1024,C out =512,W=64,H=64,P=1)
Set input 021 =relu(output 020 ) And perform Conv2d operation
output 021 (N=8,C out =512,W=64,H=64)=Conv2d(input 021 ,N=8,C in =512,C out =512,W=64,H=64,P=1)
Set input 022 =relu(output 021 ) And perform the upsampled operation
output o22 (N=8,C=512,W=128,H=128)=upsampled(input 022 ,N=8,C=512,W=64,H=64)
Set input 023 =output 022 And perform Conv2d operation
output 023 (N=8,C out =256,W=128,H=128)=Conv2d(input 023 ,N=8,C in =512,C out =256,W=128,H=128,P=1)
Set input 024 =relu(output 023 ) And perform a channel cat operation
Set input 025 =output 024 And perform Conv2d operation
output 025 (N=8,C out =256,W=128,H=128)=Conv2d(input 025 ,N=8,C in =512,C out =256,W=128,H=128,P=1)
Set input 026 =relu(output 025 ) And perform Conv2d operation
output 026 (N=8,C out =256,W=128,H=128)=Conv2d(input 026 ,N=8,C in =256,C out =256,W=128,H=128,P=1)
Set input 027 =relu(output 026 ) And perform the upsampled operation
output 027 (N=8,C=256,W=256,H=256)=upsampled(input 027 ,N=8,C=256,W=128,H=128)
Set input 028 =output 027 And perform Conv2d operation
output 028 (N=8,C out =128,W=256,H=256)=Conv2d(input 028 ,N=8,C in =256,C out =128,W=256,H=256,P=1)
Set input 029 =relu(output 028 ) And perform a channel cat operation
Set input 030 =output 029 And perform Conv2d operation
output 030 (N=8,C out =128,W=256,H=256)=Conv2d(input 030 ,N=8,C in =256,C out =128,W=256,H=256,P=1)
Set input 031 =relu(output 030 ) And perform Conv2d operation
output 031 (N=8,C out =128,W=256,H=256)=Conv2d(input 031 ,N=8,C in =128,C out =128,W=256,H=256,P=1)
Set input 032 =relu(output 031 ) And perform the upsampled operation
output 032 (N=8,C=128,W=512,H=512)=upsampled(input 032 ,N=8,C=128,W=256,H=256)
Set input 033 =output 032 And perform Conv2d operation
output 033 (N=8,C out =64,W=512,H=512)=Conv2d(input 033 ,N=8,C in =128,C out =64,W=512,H=512,P=1)
Set input 034 =relu(output 033 ) And perform a channel cat operation
Set input 035 =output 034 And perform Conv2d operation
output 035 (N=8,C out =64,W=512,H=512)=Conv2d(input 035 ,N=8,C in =128,C out =64,W=512,H=512,P=1)
Setting upinput 036 =relu(output 035 ) And perform Conv2d operation
output 036 (N=8,C out =64,W=512,H=512)=Conv2d(input 036 ,N=8,C in =64,C out =64,W=512,H=512,P=1)
This time, output 036 Namely a 64-channel characteristic map of the acquisition part indication area.
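A compact sketch of the encoder-decoder branch that produces output_036, assuming PyTorch; the channel counts follow steps output_003-output_036 above, while the pairing of the channel-cat skip connections and the nearest-neighbour form of the upsampled operation are assumptions, since they are not reproduced explicitly in the step list:

```python
import torch
import torch.nn as nn

def double_conv(c_in: int, c_out: int) -> nn.Sequential:
    # Two fixed 3x3 / stride-1 / padding-1 convolutions with Relu (spatial size unchanged).
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, 1, 1), nn.ReLU(inplace=True),
        nn.Conv2d(c_out, c_out, 3, 1, 1), nn.ReLU(inplace=True))

class UNetBranch(nn.Module):
    """Sketch of the branch turning a 512x512x3 frame into the 64-channel map output_036."""
    def __init__(self):
        super().__init__()
        self.enc1, self.enc2 = double_conv(3, 64), double_conv(64, 128)
        self.enc3, self.enc4 = double_conv(128, 256), double_conv(256, 512)
        self.bottom = double_conv(512, 1024)
        self.pool = nn.MaxPool2d(2, 2)
        self.up = nn.Upsample(scale_factor=2, mode="nearest")   # assumed form of "upsampled"
        self.red4, self.dec4 = nn.Conv2d(1024, 512, 3, 1, 1), double_conv(1024, 512)
        self.red3, self.dec3 = nn.Conv2d(512, 256, 3, 1, 1), double_conv(512, 256)
        self.red2, self.dec2 = nn.Conv2d(256, 128, 3, 1, 1), double_conv(256, 128)
        self.red1, self.dec1 = nn.Conv2d(128, 64, 3, 1, 1), double_conv(128, 64)

    def forward(self, x: torch.Tensor) -> torch.Tensor:          # x: (N, 3, 512, 512)
        e1 = self.enc1(x)                                         # 64  x 512 x 512
        e2 = self.enc2(self.pool(e1))                             # 128 x 256 x 256
        e3 = self.enc3(self.pool(e2))                             # 256 x 128 x 128
        e4 = self.enc4(self.pool(e3))                             # 512 x 64  x 64
        b  = self.bottom(self.pool(e4))                           # 1024 x 32 x 32
        d4 = self.dec4(torch.cat([torch.relu(self.red4(self.up(b))),  e4], dim=1))
        d3 = self.dec3(torch.cat([torch.relu(self.red3(self.up(d4))), e3], dim=1))
        d2 = self.dec2(torch.cat([torch.relu(self.red2(self.up(d3))), e2], dim=1))
        return self.dec1(torch.cat([torch.relu(self.red1(self.up(d2))), e1], dim=1))  # cf. output_036
```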
S3004, repeating step S3003 to obtain the 64-channel feature map of the swab-head identification area, denoted output′_036;
S3005, setting input_037 = relu(output_036) and performing a Class2d operation:
output_037(N=8, C_out=2, W=512, H=512) = Class2d(input_037, N=8, C_in=64, C_out=2, W=512, H=512, P=1)
Through the above steps, the pixel classification information of the collection-site indication area and of the swab-head identification area is obtained respectively; one class is the background pixels and the other class is the collection-site or swab-head pixels. The two classifications are randomly initialized when the model first runs, and the accuracy of the classification information is gradually improved through the training steps below.
Cross-entropy evaluation is performed on the pixel classification information of step S3005 through the training loss function, and an inverse gradient calculation is performed; the training loss function is defined using the cross-entropy function, whose formula is:
H(p, q) = −Σ_x p(x) log q(x)
The two-class calculation result output_037 is evaluated with the cross entropy against the two-class target labels of the annotated input video, and the inverse gradient calculation is performed. The weight and bias values in the Conv2d, Class2d, upsampled and other functions are updated so that H(p, q) tends to a minimum, yielding the collection-site indication and swab-head recognition algorithm model. The collection-site and swab-head pixels indicated by the collection-site indication and swab-head recognition algorithms are blended with the original pixel values in different colors, so that the collection site and the swab head are marked in the video frame and the person being tested is guided through the collection operation.
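A hedged sketch of the color-blending used to mark the indicated pixels in the video frame (NumPy assumed; the color and blending factor are illustrative choices only):

```python
import numpy as np

def overlay_mask(frame_bgr: np.ndarray, mask: np.ndarray,
                 color: tuple = (0, 255, 0), alpha: float = 0.4) -> np.ndarray:
    # Blend `color` into the pixels where `mask` marks the collection site or swab head,
    # so the region is visibly highlighted in the video frame.
    out = frame_bgr.copy().astype(np.float32)
    sel = mask.astype(bool)                     # (H, W) per-pixel class taken from output_037
    out[sel] = (1 - alpha) * out[sel] + alpha * np.array(color, dtype=np.float32)
    return out.astype(np.uint8)
```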
As shown in fig. 3, the construction of the contact detection algorithm based on the result of step S3003 is continued, including the steps of:
S5001, set input_038 = relu(output_036) and input′_038 = relu(output′_036), and perform a channel cat operation;
S5002, set input_039 = output_038 and perform a Conv2d operation:
output_039(N=8, C_out=64, W=512, H=512) = Conv2d(input_039, N=8, C_in=128, C_out=64, W=512, H=512, P=1)
S5003, set input_040 = relu(output_039) and perform a Conv2d operation:
output_040(N=8, C_out=64, W=512, H=512) = Conv2d(input_040, N=8, C_in=64, C_out=64, W=512, H=512, P=1)
S5004, set input_041 = relu(output_040) and perform a resnet_8 calculation with network depth 8:
output_041(N=8, C_out=64, W=512, H=512) = resnet_8(input_041, N=8, C_in=64, W=512, H=512, P=1)
S5005, set input_042 = output_041 and perform the fc_softmax feature probability discrimination, the discrimination results being a contact class and a non-contact class:
output_042(N=8, o=2) = fc_softmax(input_042(N=8, i=64))
S5006, through steps S5001-S5005, the contact probability index of the pharyngeal swab with the pharyngeal collection site in the current video frame is obtained, with a value in the range (0, 1).
As in step S3005, training over multiple samples is performed using the cross-entropy function as the loss function. After step S5005 is completed, cross-entropy evaluation is performed between the two-class calculation result output_042 and the contact/non-contact two-class target labels of the annotated input pictures, and an inverse gradient calculation is performed to update the weight and bias tensors in resnet and fc, so that the final cross-entropy output tends to 0.
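Steps S5001-S5005 can be sketched as the following PyTorch module (assumed framework); the global average pooling that reduces the map to the 64 features i=64 before fc_softmax, and the BatchNorm inside the residual block, are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    # output_{l+1} = input_l + F(input_l, weight_l); normalization choice assumed.
    def __init__(self, c: int):
        super().__init__()
        self.norm = nn.BatchNorm2d(c)
        self.conv = nn.Conv2d(c, c, kernel_size=3, stride=1, padding=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + torch.relu(self.conv(self.norm(x)))

class ContactHead(nn.Module):
    # Channel-cat the two 64-channel maps (S5001), two 3x3 convolutions (S5002-S5003),
    # eight residual blocks resnet_8 (S5004), then fc_softmax over contact / no contact (S5005).
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(128, 64, kernel_size=3, stride=1, padding=1)
        self.conv2 = nn.Conv2d(64, 64, kernel_size=3, stride=1, padding=1)
        self.res = nn.Sequential(*[ResidualBlock(64) for _ in range(8)])
        self.fc = nn.Linear(64, 2)

    def forward(self, site_feat: torch.Tensor, swab_feat: torch.Tensor) -> torch.Tensor:
        x = torch.cat([torch.relu(site_feat), torch.relu(swab_feat)], dim=1)  # S5001
        x = torch.relu(self.conv1(x))                                         # cf. output_039
        x = self.res(torch.relu(self.conv2(x)))                               # cf. output_040 and resnet_8
        x = F.adaptive_avg_pool2d(x, 1).flatten(1)                            # (N, 64), assumed pooling
        return F.softmax(self.fc(x), dim=1)                                   # contact probability
```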
After the contact detection algorithm provides the contact probability value, an action completion evaluation algorithm is finally defined to make the final determination of whether the sampling action has been completed:
defining a decision function:
defining a scoring function:
wherein fps is the current video-stream frame rate and cp is the confidence probability; the current collection-action completion score is obtained by accumulating the decision values J of the video frames within 2 seconds. A value in the score range (0, fps×2−1) may be set as the action-completion threshold, and collection may be considered complete when score is greater than or equal to this threshold.
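A hedged sketch of the decision value J and the accumulated score (the 0.5 per-frame confidence cutoff is an assumption, since the decision-function formula is not reproduced above):

```python
from typing import Sequence

def decision_j(cp: float, cp_threshold: float = 0.5) -> int:
    # Per-frame decision J derived from the contact probability cp.
    return 1 if cp >= cp_threshold else 0

def collection_completed(contact_probs: Sequence[float], fps: int, score_threshold: int) -> bool:
    # Accumulate J over the frames of the last 2 seconds and compare the score with a
    # threshold chosen from the range (0, fps*2 - 1).
    recent = list(contact_probs)[-fps * 2:]
    score = sum(decision_j(cp) for cp in recent)
    return score >= score_threshold
```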
Compared with the prior art, the invention has the following beneficial effects:
The algorithm modules provided by the invention can be used in an unattended self-service nucleic acid sampling booth and, together with a monocular high-definition camera, can determine whether the actions in the self-service sampling process of the person being tested have been completed. Compared with the mechanical-arm nucleic acid collection scheme, the psychological pressure on the person being tested is minimal, the one-pass collection success rate and collection efficiency are higher, and the construction cost is lower.
The invention integrates four algorithm modules (the collection-site indication algorithm, the swab-head recognition algorithm, the contact detection algorithm and the action completion evaluation algorithm) into one video processing flow, and this single flow completes, within one algorithm model, the three tasks required by the nucleic acid collection process: collection-site indication, pharyngeal swab-head recognition and indication, and action completion detection and evaluation.
Through hidden-layer sharing and result reuse within the model, the computing-platform resources required by the algorithm modules are saved to the greatest extent. The inference process of the algorithm modules can run on low-end CPUs such as the Intel i3-12100F; since there is no special CPU requirement, algorithm deployment cost is greatly reduced, and the modules can also run on mobile low-power platforms such as Android.
Compared with the common UNet model, by introducing a convolution kernel and padding of specific sizes, the Conv2d calculation does not change the size of the input feature map, which reduces the amount of computation and makes the model more general. It is suitable not only for nucleic acid testing scenarios but also for other self-service sample collection scenarios.
The artificial intelligence-based biological sample collection validity test method provided by the present application has been described in detail above. The description of the specific embodiments is only intended to facilitate an understanding of the method of the present application and its core ideas. It should be noted that various improvements and modifications can be made to the present application by those skilled in the art without departing from the principles of the present application, and such improvements and modifications fall within the scope of the claims of the present application.

Claims (4)

1. An artificial intelligence-based biological sample collection validity test method is characterized by comprising the following steps:
s1, acquiring an oral cavity image and a pharyngeal swab acquisition process image of a user through a high-definition video camera;
s2, preprocessing the image to obtain a video frame;
s3, calculating polygon vertex coordinates of a pharyngeal collecting area through a collecting part indication algorithm;
s4, receiving an input video frame through a swab head identification algorithm, outputting polygon vertex coordinate information of a pharyngeal swab head in the video frame, and identifying a pharyngeal swab head part;
s5, comprehensively evaluating the contact degree and the contact position by combining an original image through a swab head and detection position contact detection algorithm, and giving corresponding scores;
s6, detecting and judging a plurality of continuous video frames of the video stream through an action completion evaluation algorithm;
the acquisition part indication algorithm is a UNet image segmentation algorithm, and the definition of a Unet network structure function of the UNet image segmentation algorithm comprises the following steps:
s301, defining a Conv2d two-dimensional graph convolution function, wherein the calculation formula is as follows:
wherein: the ⋆ symbol is the cross-correlation operator, N is the number of image frames input in one calculation,
C_in is the number of channels of the input image, C_out is the number of channels of the output image,
input represents the input image four-dimensional tensor, whose shape is a four-tuple,
output represents the output image four-dimensional tensor,
the padding function expands the input four-dimensional image tensor by P pixel values,
weight represents the convolution kernel weight parameter, a four-dimensional tensor whose shape is a four-tuple,
bias represents the bias weight tensor of this Conv2d calculation, and its shape is consistent with weight;
s302, defining a classification function, which uses a fixed 1×1 convolution kernel and is otherwise completely consistent with Conv2d, namely Class2d;
s303, defining a Maxpool function:
the max operation applies a 2×2 kernel with stride 2 and takes the element-wise maximum, giving the following input-output size relation:
W_out = W_in / 2
H_out = H_in / 2
s304, defining a Relu function as an activation function of a conv2d function:
output(N,k,W,H)=relu(input(N,k,W,H))=max(0,input(N,k,W,H))
s305, defining an up sampling function of the sampled:
wherein weight is the upsampling weight tensor;
s306, defining a channel feature map stacking function:
wherein C_out = C_in1 + C_in2; the + operator simply stacks, along the channel dimension, two feature maps of the same spatial size;
s307, defining a feature map residual network action judging function resnet:
defining the residual network feature extraction function of a certain layer l:
output_{l+1} = input_l + F(input_l, weight_l)
where input and output are the input and output tensors, whose size is determined by (N, C, W, H),
the F calculation comprises a single sequential combination of an arithmetic normalization of each element of the tensor, a Conv2d calculation and a Relu calculation,
continuing to define the residual network feature extraction function of any layer L starting from layer 1:
s308 defines a full link layer fc and a softmax layer function fc_softmax,
the softmax function is:
the fc_softmax function is:
output_o = fc_softmax(input_i) = softmax(weight_{i,o} × input_i + bias_{i,o})
where weight is the linear weight matrix and bias is the bias matrix;
to pass through the fully connected layer, the tensor of size (N, C, W, H) is flattened into a batch of one-dimensional tensors of size (N, C×W×H), which are then calculated with the weight matrix and the bias matrix to obtain the final probability values of meeting and of not meeting the collection-action completion criterion;
the step of constructing the acquisition part indication algorithm comprises the following steps:
S3001, inputting a 1920×1080 YUV video frame input_001 through a video acquisition device;
S3002, performing image preprocessing: converting input_001 to BGR channels and center-cropping the image to obtain a 512×512×3 tensor output_002;
S3003, setting the input video frame of each step as input_n = relu(output_{n−1}) or input_n = output_{n−1}, and performing Conv2d, Maxpool, upsampled and channel-cat calculations until the 64-channel feature map output_036 of the collection-site indication area is obtained;
S3004, repeating step S3003 to obtain the 64-channel feature map of the swab-head identification area, denoted output′_036;
S3005, setting input_037 = relu(output_036) and performing a Class2d operation to obtain the pixel classification information of the collection-site indication area and of the swab-head identification area respectively, wherein one class is the background pixels and the other class is the collection-site or swab-head pixels.
2. The biological sample collection validity checking method based on artificial intelligence according to claim 1, wherein cross-entropy evaluation is performed on the pixel classification information of step S3005 through the training loss function and an inverse gradient calculation is performed;
the cross-entropy function formula is:
the two-class calculation result output_037 is evaluated with the cross entropy against the two-class target labels of the annotated input video, and the inverse gradient calculation is performed.
3. The artificial intelligence based biological specimen collection validity test method of claim 2, wherein the constructing of the contact detection algorithm based on the result of step S3003 includes the steps of:
S5001, setting input_038 = relu(output_036) and input′_038 = relu(output′_036), and carrying out a channel cat operation;
S5002, setting input_039 = output_038 and performing a Conv2d operation;
output_039(N=8, C_out=64, W=512, H=512) = Conv2d(input_039, N=8, C_in=128, C_out=64, W=512, H=512, P=1)
S5003, setting input_040 = relu(output_039) and performing a Conv2d operation;
output_040(N=8, C_out=64, W=512, H=512) = Conv2d(input_040, N=8, C_in=64, C_out=64, W=512, H=512, P=1)
S5004, setting input_041 = relu(output_040) and performing a resnet_8 calculation with network depth 8;
output_041(N=8, C_out=64, W=512, H=512) = resnet_8(input_041, N=8, C_in=64, W=512, H=512, P=1)
S5005, setting input_042 = output_041 and performing the fc_softmax feature probability discrimination, the discrimination result being a contact/non-contact classification result;
output_042(N=8, o=2) = fc_softmax(input_042(N=8, i=64))
S5006, obtaining, through steps S5001-S5005, the contact probability index of the pharyngeal swab with the pharyngeal collection site in the current video frame, with a value in the range (0, 1).
4. The artificial intelligence based biological sample collection validity test method of claim 3, wherein, after the contact detection algorithm provides the contact probability value, an action completion evaluation algorithm is finally defined to make the final determination of whether the sampling action has been completed;
defining a decision function:
defining a scoring function:
wherein fps is the current video-stream frame rate and cp is the confidence probability, and the current collection-action completion score is obtained by accumulating the decision values J of the video frames within 2 seconds.
CN202310121257.1A 2023-02-16 2023-02-16 Biological sample collection validity checking method based on artificial intelligence Active CN116109982B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310121257.1A CN116109982B (en) 2023-02-16 2023-02-16 Biological sample collection validity checking method based on artificial intelligence


Publications (2)

Publication Number Publication Date
CN116109982A CN116109982A (en) 2023-05-12
CN116109982B true CN116109982B (en) 2023-07-28

Family

ID=86261361

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310121257.1A Active CN116109982B (en) 2023-02-16 2023-02-16 Biological sample collection validity checking method based on artificial intelligence

Country Status (1)

Country Link
CN (1) CN116109982B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113378804A (en) * 2021-08-12 2021-09-10 中国科学院深圳先进技术研究院 Self-service sampling detection method and device, terminal equipment and storage medium
WO2022123069A1 (en) * 2020-12-11 2022-06-16 Sensyne Health Group Limited Image classification of diagnostic tests
CN114841990A (en) * 2022-05-26 2022-08-02 长沙云江智科信息技术有限公司 Self-service nucleic acid collection method and device based on artificial intelligence

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160151052A1 (en) * 2014-11-26 2016-06-02 Theranos, Inc. Methods and systems for hybrid oversight of sample collection
WO2019145951A1 (en) * 2018-01-23 2019-08-01 Mobileodt Ltd. Automated monitoring of medical imaging procedures
WO2019239414A1 (en) * 2018-06-13 2019-12-19 Mobileodt, Ltd. Automated detection in cervical imaging
EP3837525A4 (en) * 2018-08-16 2023-03-08 Essenlix Corporation Image-based assay using intelligent monitoring structures
US20230260627A1 (en) * 2020-05-06 2023-08-17 Tyto Care Ltd. A remote medical examination system and method
CN111643125A (en) * 2020-06-23 2020-09-11 桂林医学院 System for automatically collecting nucleic acid sample and collection method thereof
CN111803139A (en) * 2020-08-26 2020-10-23 深圳智慧林网络科技有限公司 Self-service type pharyngeal test seed nucleic acid sampling device and self-service nucleic acid sampling method
CN113143342A (en) * 2021-03-25 2021-07-23 香港中文大学(深圳) Method for determining oral sampling site, sampling robot and computer storage medium
CN114998230A (en) * 2022-05-23 2022-09-02 肇庆学院 Pharynx swab oral cavity nucleic acid sampling area image identification method
CN115100388A (en) * 2022-06-16 2022-09-23 新石器慧通(北京)科技有限公司 Self-service nucleic acid sampling system and method
CN115089223A (en) * 2022-06-16 2022-09-23 国研软件股份有限公司 Throat swab collecting cotton swab and detection method
CN114926772B (en) * 2022-07-14 2022-10-21 河南科技学院 Method for tracking and predicting trajectory of throat swab head
CN114916964B (en) * 2022-07-14 2022-11-04 河南科技学院 Pharynx swab sampling effectiveness detection method and self-service pharynx swab sampling method
CN115337044B (en) * 2022-07-18 2023-06-09 深圳市安保数字感控科技有限公司 Nucleic acid sampling monitoring method, device, system and computer readable storage medium
CN115478004A (en) * 2022-07-25 2022-12-16 上海柯钒智能设备有限公司 Self-service nucleic acid detection device and self-service nucleic acid detection method
CN115229793A (en) * 2022-07-26 2022-10-25 闻泰通讯股份有限公司 Sampling method and device, equipment and storage medium
CN115424296A (en) * 2022-08-08 2022-12-02 山东浪潮超高清智能科技有限公司 Human tonsil region detection system based on target detection and side deployment method
CN115457422A (en) * 2022-08-12 2022-12-09 北京声智科技有限公司 Sampling process verification method and device and augmented reality function glasses
CN115439929A (en) * 2022-08-17 2022-12-06 每日互动股份有限公司 Nasal swab collection action determination method and storage medium in antigen detection process
CN115414072A (en) * 2022-08-31 2022-12-02 美的集团(上海)有限公司 Pharynx swab sampling method and device, sampling equipment and computer program product


Also Published As

Publication number Publication date
CN116109982A (en) 2023-05-12


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant