CN116229333A - Difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment - Google Patents
Difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment Download PDFInfo
- Publication number
- CN116229333A CN116229333A CN202310505377.1A CN202310505377A CN116229333A CN 116229333 A CN116229333 A CN 116229333A CN 202310505377 A CN202310505377 A CN 202310505377A CN 116229333 A CN116229333 A CN 116229333A
- Authority
- CN
- China
- Prior art keywords
- target
- difficult
- decoupling
- difficulty
- score
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- Medical Informatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Image Analysis (AREA)
Abstract
The invention relates to a difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment, which comprises the following steps: constructing a difficult-to-easy sample decoupling convolutional neural network model, wherein the difficult-to-easy sample decoupling convolutional neural network model comprises a feature extraction main network and n+1 difficult-to-easy target decoupling detection heads; collecting a plurality of difficulty samples, and labeling the targets in all the difficulty samples with labels with difficulty grades from 0 to n to obtain a training sample set; calculating an initial prior score of a target, and weighting the loss of the difficult sample decoupling convolutional neural network model so as to train the difficult sample decoupling convolutional neural network model; calculating a posterior confidence score based on the initial prior score of the target; and dynamically adjusting the initial prior score through the posterior confidence score to obtain a normalized score, and updating the initial prior score by using the normalized score until the decoupling convolutional neural network model of the difficult sample is stable. The invention aims to reduce the omission rate and false detection rate of target identification.
Description
Technical Field
The invention relates to the technical field of video target detection, in particular to a difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment.
Background
When identifying the target to be detected in the video, the data needs to be marked manually, and when the target which is difficult to identify or the data which is poor is encountered, the data is generally discarded directly so as to prevent the problem that the training is not converged due to oscillation caused by the data in the model training process.
The traditional problem related to difficult sample detection refers to that a classifier of the neural network is difficult to distinguish whether the sample is a target or a background, and the classifier is easy to distinguish manually, so that the neural network can pay more attention to the samples when training a model, and the samples can be correctly classified.
However, for samples which are difficult to distinguish manually, such as targets which are easy to identify and difficult to identify exist in the video (such as when a bird in the video is relatively close to the video and flies slowly, the targets are relatively clear and easy to identify, and when the bird is relatively far from the video, or the background is complex, or flies relatively fast, the targets are difficult to identify), if no historical prior information exists, the targets cannot be marked correctly. If targets which are not easy to identify are abandoned and do not participate in model training, missed detection is caused during model deployment; if targets which are not easy to identify participate in model training and are forcedly marked as positive samples, the model training is unstable, the model training is caused to oscillate, and false detection occurs during reasoning.
Disclosure of Invention
The invention aims to provide a difficult target decoupling detection method based on adaptive dynamic adjustment of difficult grades, which is used for avoiding the influence of difficult samples on model training and reducing the omission rate and false detection rate of target identification on the premise that the difficult samples are not discarded.
In order to achieve the above object, the embodiment of the present invention provides the following technical solutions:
a difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment comprises the following steps:
step 2, collecting a plurality of difficult samples, and labeling the targets in all the difficult samples with labels with difficulty grades from 0 to n to obtain a training sample set;
step 3, calculating an initial prior score of the target, and weighting the loss of the difficult sample decoupling convolutional neural network model by using the initial prior score so as to train the difficult sample decoupling convolutional neural network model;
step 4, calculating posterior confidence score based on the initial prior score of the target; and (3) dynamically adjusting the initial prior score through the posterior confidence score to obtain a normalized score, updating the initial prior score by using the normalized score, and repeating the step (3) until the decoupling convolutional neural network model of the difficult sample reaches stability.
Compared with the prior art, the invention has the beneficial effects that:
according to the invention, for the targets which are difficult to identify, the processing path is increased, and the detection rate of the targets which are difficult to identify is improved; weighting the model training process by using the difficulty degree of the identification target, preventing the target which is difficult to identify from vibrating in the model training process, and reducing the false detection rate; meanwhile, the model is used for training the prediction score, the difficulty degree of target identification is subjected to self-adaptive dynamic adjustment, and the subjectivity of manual difficulty degree marking is eliminated.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic diagram of a model structure of a convolutional neural network with decoupling of difficult and easy samples;
FIG. 2 is a schematic diagram of a structure of a decoupling detection head for a difficult target according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of weighting the loss of a hard-easy sample decoupled convolutional neural network model according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of dynamic adjustment of an initial prior score using posterior confidence scores in an embodiment of the present invention;
fig. 5 is a schematic diagram illustrating the partitioning of initial prior scores according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. The components of the embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the invention, as presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be made by a person skilled in the art without making any inventive effort, are intended to be within the scope of the present invention.
It should be noted that: like reference numerals and letters denote like items in the following figures, and thus once an item is defined in one figure, no further definition or explanation thereof is necessary in the following figures. Also, in the description of the present invention, the terms "first," "second," and the like are used merely to distinguish one from another, and are not to be construed as indicating or implying a relative importance or implying any actual such relationship or order between such entities or operations. In addition, the terms "connected," "coupled," and the like may be used to denote a direct connection between elements, or an indirect connection via other elements.
Examples:
the invention discloses a difficult target decoupling detection method based on difficult grade self-adaptive dynamic adjustment, which comprises the following steps:
Referring to fig. 1, the feature extraction backbone network is the first subsection of a hard and easy sample decoupling convolutional neural network model, and adopts a network structure such as a DarkNet 53. The second part is formed by sequentially connecting n+1 difficult target decoupling detection heads in series, wherein the 0 th difficult target decoupling detection head has the lowest difficulty level and is connected with a feature extraction backbone network; the difficulty level corresponding to the nth difficult target decoupling detection head is highest. The difficult target decoupling detection head with high difficulty level is arranged behind the target decoupling detection head, the processing path is longer, and the target with higher difficulty level is predicted.
The structure of each difficult and easy target decoupling detection head is the same, the internal network design of the difficult and easy target decoupling detection head adopts a U-shaped network, the U-shaped network comprises z downsampled convolution feature layers and z-1 upsampled convolution feature layers, the upsampled convolution feature layers and the downsampled convolution feature layers with the same dimension are spliced in the channel dimension so as to fully fuse shallow layer positioning information and high layer semantic information, and each difficult and easy target decoupling detection head predicts the confidence level, the category and the position of a target.
In this embodiment, please refer to fig. 2, which includes 4 downsampled convolution feature layers and 3 upsampled convolution feature layers, which are respectively a first convolution feature layer, a second convolution feature layer, a third convolution feature layer, a fourth convolution feature layer, a fifth convolution feature layer, a sixth convolution feature layer, and a seventh convolution feature layer, wherein the first convolution feature layer to the fourth convolution feature layer are downsampled in sequence, the fifth convolution feature layer to the seventh convolution feature layer are upsampled in sequence, and the third convolution feature layer and the fifth convolution feature layer with the same scale are spliced in the channel dimension, and otherwise the same is not repeated.
And 2, collecting a plurality of difficulty samples, and labeling the targets in all the difficulty samples with labels with difficulty grades from 0 to n to obtain a training sample set.
And (3) collecting a plurality of difficulty samples, wherein one difficulty sample possibly comprises 1 or more targets, labeling the labels of categories and positions of the targets in all the difficulty samples, and labeling the difficulty levels of 0-n for each target according to manual experience on the basis, wherein the difficulty level of 0 represents the easiest recognition, and the difficulty level of n represents the least easy recognition. Although the artificial experience is marked with subjective influence, the influence can be eliminated to a certain extent by the follow-up self-adaptive dynamic adjustment of the difficult sample.
Assuming that n=3, there are 4 difficult targets decoupling detection heads, and 4 difficulty levels are also designed, namely, a target with a difficulty level of 0 (a simple and easily identifiable target), a target with a difficulty level of 1 (a general target), a target with a difficulty level of 2 (a predicted target which is slightly difficult to identify), and a target with a difficulty level of 3 (a difficult-to-identify target); it is easy to understand that targets in the same difficulty level are also easily distinguished.
And step 3, calculating an initial prior score of the target, and weighting the loss of the difficult sample decoupling convolutional neural network model by using the initial prior score so as to train the difficult sample decoupling convolutional neural network model.
According to the classified difficulty levels, calculating initial prior scores corresponding to the difficulty levels, wherein the higher the difficulty level is, the lower the corresponding initial prior score is, and vice versa, but the initial prior scores are all between (0 and 1). The initial prior score is used for weighting the loss function during the adaptive dynamic adjustment of the subsequent difficult sample.
First, a range of initial prior scores of different difficulty levels is set, for example, a target difficulty level of which the initial prior score is between (0.75, 1) is 0, a target difficulty level of which the initial prior score is between (0.50, 0.75) is 1, a target difficulty level of which the initial prior score is between (0.25, 0.50) is 2, and a target difficulty level of which the initial prior score is between (0, 0.25) is 3.
Then calculating initial prior scores under different difficulty levels according to the formula, wherein the formula isWhere j represents the jth target in a difficulty sample, the difficulty sample has m targets in total, and the difficulty level of the target j is i (i=0, 1,2, 3), S j Is an initial a priori score of target j. The calculated difficulty level isThe initial prior score of 0 is 0.875, the initial prior score of difficulty level 1 is 0.625, the initial prior score of difficulty level 2 is 0.375, and the initial prior score of difficulty level 3 is 0.175.
Randomly taking out a sample picture x (i.e. a difficult sample) and a corresponding label gt from a training sample set j (Label gt) j The method comprises the steps of classifying the targets j, locating the targets j and the difficulty levels), distributing the targets j to corresponding difficult target decoupling detection heads according to the difficulty levels of the targets in the sample picture x, and distributing the targets j to the difficult target decoupling detection heads 1 if the difficulty level of the targets j is 1.
And finally outputting a predicted target out= HardEasyDeCoupleModel (x) of the sample picture x by the difficult sample decoupling convolutional neural network model, wherein the HardEasyDecoupleModel represents the difficult sample decoupling convolutional neural network model, and out contains all the predicted targets in the sample picture x.
Referring to fig. 3, calculating a target loss L between the output of the difficult and easy target decoupling detection head i and the tag in the difficult and easy sample decoupling convolutional neural network model by using a confidence loss function, a category classification loss function and a positioning regression loss function i,j :
Wherein L is i,j Representing a target loss between the output of the target j assigned to the difficulty target decoupling detection head i and the tag; l (L) conf Representing a confidence loss function, L cls Representing class classification loss function, L loc Representing a locating regression loss function; out of conf Output corresponding to the confidence loss function, out cls Output corresponding to the class classification loss function, out loc Representing the output corresponding to the positioning regression loss function;the label representing the target j in the confidence loss function,tag representing object j in class classification loss function, < ->Representing the labels that locate target j in the regression loss function.
Weighting all target losses distributed to the difficult target decoupling detection head i by using the initial prior score to obtain positive sample losses:
wherein L is i,pos Representing positive sample loss of the difficult target decoupling detection head i, wherein m 'represents the target number distributed to the difficult target decoupling detection head i, and m' is less than or equal to m;
calculating the total loss L of all difficult target decoupling detection heads:
wherein L is i,neg Negative sample loss of the detection head i is difficult to be decoupled from the target; count () represents the total number, i.e., number m.
The total loss L is back propagated to update the weights of the difficulty sample decoupling convolutional neural network model. And randomly taking a plurality of sample pictures x from the training sample set, repeating the training on the difficult sample decoupling convolutional neural network model, if N sample pictures are taken in total, obtaining N total losses L, and adjusting the weight of the difficult sample decoupling convolutional neural network model through a gradient descent optimization algorithm. After a certain step length or loss is trained to be converged to an expected value, fixing weights to obtain the weights of the well-trained difficult and easy sample decoupling convolutional neural network model.
Step 4, calculating posterior confidence score based on the initial prior score of the target; and (3) dynamically adjusting the initial prior score through the posterior confidence score to obtain a normalized score, updating the initial prior score by using the normalized score, and repeating the step (3) until the decoupling convolutional neural network model of the difficult sample reaches stability.
Assuming that N sample pictures are shared in the training sample set, sharing M targets, j epsilon M, and counting the ratio M of the target numbers corresponding to the difficulty level in the labels of the training sample set 0 :M 1 :M 2 :M 3 Ensuring that the target number of each difficulty level is unchanged, wherein M 0 Represents the target number of difficulty level 0, M 0 +M 1 +M 2 +M 3 =M。
Referring to fig. 4, N sample pictures are input into a trained difficulty sample decoupling convolutional neural network model, and the output is decoded to obtain a prediction score F of each target j . Combining an initial prior score S of a target j j Calculating posterior confidence scoresWherein->。
After running all sample pictures, sorting all targets from high to low according to the posterior confidence scores of the targets, and according to the ratio M of the target numbers of the difficulty grades in the training sample set 0 :M 1 :M 2 :M 3 And reclassifying the difficulty level of all targets. Ordered in frontThe difficulty level of the target division of (2) is 0, < th->To->The difficulty level of the target division of (1)/(1)>To the point ofThe difficulty level of the target division of (2) < th->To the point ofThe difficulty level of the target division is 3; count (M) represents the target total number M.
Ranking M targets from high to low according to a posterior confidence score, assuming ranking to the firstTo->Is aimed at [ P ] t ,...,P j ,...,P k ]Wherein P is t Representation ordering to->Posterior confidence score, P, of target t of bit k Representation ordering to the thPosterior confidence score, P, of target k of bits j Representation ordering to the thTo->Posterior confidence score of any target j in (j= [ t, k)]). According to the initial prior score, the initial prior score of the object with the difficulty level of 2 is in the range of (0.25, 0.50]But after recalculation, a posterior confidence score [ P ] t ,...,P j ,...,P k ]Is not necessarily at (0.25, 0.50]Within the scope, the present approach therefore works by scoring the posterior confidence score [ P ] t ,...,P j ,...,P k ]And the classification is within the classification score of the difficulty level 2, so that the self-adaptive dynamic adjustment is realized.
In detail, referring to fig. 5, assume that the initial a priori score of difficulty level 2 is maximum at S i2 Minimum value is S i1 (i=2),[P t ,...,P j ,...,P k ]Confidence score of (a)The number is not necessarily in (0.25, 0.50]The following four cases are within the scope:
1. when P t <S i2 And P is k <S i1 When in use;
setting P max =S i2 ,P min =P k ,S max =S i2 ,S min =S i1 The posterior confidence score P for the target j can be calculated using equation (3) j And normalized score S j Is a function of:
from equation (3), each P is selected j Obtaining a corresponding S j . Then the obtained S j S brought into formula (1) j The loss weighting for the difficulty sample decoupled convolutional neural network model may be updated again. Through multiple pairs of normalized scores S j The adaptive dynamic adjustment of the model can train the decoupling convolutional neural network model of the difficult and easy sample more stably.
2. When S is i1 ≤P t ≤S i2 And S is i1 ≤P k ≤S i2 When in use;
setting P max =P t ,P min =P k ,S max =P t ,S min =P k Each P is selected using equation (3) j Obtaining a corresponding S j The loss weighting of the difficulty sample decoupled convolutional neural network model is updated in the same way.
3. When S is i2 <P t And P is k <S i1 When in use;
setting P max =P t ,P min =P k ,S max =S i2 ,S min =S i1 Each P is selected using equation (3) j Obtaining a corresponding S j Updating the model of the convolution neural network for decoupling the difficult samples in the same wayIs added to the loss weighting of (2).
4. When S is i2 <P t And S is i1 <P k When in use;
setting P max =P t ,P min =P k ,S max =S i2 ,S min =P k Each P is selected using equation (3) j Obtaining a corresponding S j The loss weighting of the difficulty sample decoupled convolutional neural network model is updated in the same way.
To sum up, a normalized score S' is obtained j And (3) repeating the training process in the step until the training of the difficult sample decoupling convolutional neural network model is stable, and accurately identifying the target of the real-time sample by using the difficult sample decoupling convolutional neural network model.
The foregoing is merely illustrative of the present invention, and the present invention is not limited thereto, and any person skilled in the art will readily recognize that variations or substitutions are within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (8)
1. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment is characterized by comprising the following steps of: the method comprises the following steps:
step 1, constructing a difficult and easy sample decoupling convolutional neural network model, wherein the difficult and easy sample decoupling convolutional neural network model comprises a feature extraction main network and n+1 difficult and easy target decoupling detection heads;
step 2, collecting a plurality of difficult samples, and labeling the targets in all the difficult samples with labels with difficulty grades from 0 to n to obtain a training sample set;
step 3, calculating an initial prior score of the target, and weighting the loss of the difficult sample decoupling convolutional neural network model by using the initial prior score so as to train the difficult sample decoupling convolutional neural network model;
step 4, calculating posterior confidence score based on the initial prior score of the target; and (3) dynamically adjusting the initial prior score through the posterior confidence score to obtain a normalized score, updating the initial prior score by using the normalized score, and repeating the step (3) until the decoupling convolutional neural network model of the difficult sample reaches stability.
2. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment according to claim 1, wherein the method is characterized in that: n+1 difficult and easy target decoupling detection heads are sequentially connected in series, the difficulty level corresponding to the front difficult and easy target decoupling detection head is the lowest, and the difficulty level corresponding to the rear difficult and easy target decoupling detection head is the highest.
3. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment according to claim 1, wherein the method is characterized in that: each difficult and easy target decoupling detection head comprises z downsampled convolution feature layers and z-1 upsampled convolution feature layers, and the upsampled convolution feature layers and the downsampled convolution feature layers with the same scale are spliced in the channel dimension.
4. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment according to claim 1, wherein the method is characterized in that: in the step 2, the target with the difficulty level of 0 is the easiest to identify, and the target with the difficulty level of n is the least easy to identify.
5. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment according to claim 1, wherein the method is characterized in that: in the step 3, an initial prior score of the target is calculated, and the loss of the hard and easy sample decoupling convolutional neural network model is weighted by using the initial prior score, so that the step of training the hard and easy sample decoupling convolutional neural network model comprises the following steps:
calculating an initial prior score for the target:
wherein j represents the j-th target in one difficulty sample, and m targets are in total in the difficulty sample; the difficulty rating of target j is i, i=0, 1,.. j An initial prior score for target j;
randomly taking out a sample picture x and a corresponding label gt from a training sample set j According to the difficulty level of each target in the sample picture x, distributing the targets to corresponding difficult target decoupling detection heads;
calculating target loss L between the output of a difficult and easy target decoupling detection head i and a label in the difficult and easy sample decoupling convolution neural network model by using a confidence loss function, a category classification loss function and a positioning regression loss function i,j :
Wherein L is i,j Representing a target loss between the output of the target j assigned to the difficulty target decoupling detection head i and the tag; l (L) conf Representing a confidence loss function, L cls Representing class classification loss function, L loc Representing a locating regression loss function; out of conf Output corresponding to the confidence loss function, out cls Output corresponding to the class classification loss function, out loc Representing the output corresponding to the positioning regression loss function;tag representing object j in confidence loss function,/">Tag representing object j in class classification loss function, < ->A tag representing a target j in the location regression loss function;
weighting all target losses distributed to the difficult target decoupling detection head i by using the initial prior score to obtain positive sample losses:
wherein L is i,pos Representing positive sample loss of the difficult target decoupling detection head i, wherein m 'represents the target number distributed to the difficult target decoupling detection head i, and m' is less than or equal to m;
calculating the total loss L of all difficult target decoupling detection heads:
wherein L is i,neg Negative sample loss of the detection head i is difficult to be decoupled from the target; count () represents the total number, i.e., number m;
and back-propagating the total loss L to update the weight of the difficult and easy sample decoupling convolutional neural network model, and fixing the weight to obtain the weight of the trained difficult and easy sample decoupling convolutional neural network model after training a certain step length or converging the loss to an expected value.
6. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment according to claim 5, wherein the method is characterized in that: in the step 4, the step of calculating the posterior confidence score based on the initial prior score of the target includes:
n difficult samples in the training sample set, M targets in total, j E M, and the ratio M of the target numbers corresponding to the difficulty level in the labels of the training sample set 0 :...:M i :...:M n Wherein M is i Represents the target number of difficulty level i, M 0 +...+M i +...+M n =M;
Inputting N difficult and easy samples into a trained difficult and easy sample decoupling convolutional neural network model, and decoding the output to obtain the prediction score F of each target j The method comprises the steps of carrying out a first treatment on the surface of the Combining an initial prior of target jScore S j Calculating posterior confidence scores:
7. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment according to claim 6, wherein the method is characterized in that: in the step 4, the initial prior score is dynamically adjusted through the posterior confidence score to obtain a normalized score, the normalized score is used for updating the initial prior score, the step 3 is repeated until the decoupling convolutional neural network model of the difficult sample reaches stability, and the method comprises the following steps:
after running all the difficult and easy samples, sorting all the targets from high to low according to the posterior confidence scores of the targets, and according to the ratio M of the target numbers of the difficulty grades in the training sample set 0 :...:M i :...:M n Reclassifying the difficulty level of all targets; ordered in frontThe difficulty level of the target division of (2) is 0, < th->To the point ofThe difficulty level of the target division of (1)/(1)>To the point ofThe difficulty level of the target division of (2) and so on, the>To->The difficulty level of the target division is n; count (M) represents the target total number M;
posterior confidence score [ P ] of object classified into difficulty level i t ,...,P j ,...,P k ]And the degree of difficulty is classified into a classification score of the difficulty grade i.
8. The difficulty target decoupling detection method based on the difficulty level self-adaptive dynamic adjustment according to claim 7, wherein the method is characterized in that: the posterior confidence score P of the object to be classified into the difficulty level i t ,...,P j ,...,P k ]A step of categorizing into a categorization score of a difficulty level i, comprising:
let the initial prior score maximum of the difficulty level i be S i2 Minimum value is S i1 ;
When P t <S i2 And P is k <S i1 At the time, P is set max =S i2 ,P min =P k ,S max =S i2 ,S min =S i1 Calculating by using a formula (3) to obtain a normalized score S j :
When S is i1 ≤P t ≤S i2 And S is i1 ≤P k ≤S i2 At the time, P is set max =P t ,P min =P k ,S max =P t ,S min =P k Obtaining a normalized score S' by using a formula (3) j ;
When S is i2 <P t And P is k <S i1 At the time, P is set max =P t ,P min =P k ,S max =S i2 ,S min =S i1 Obtaining a normalized score S' by using a formula (3) j ;
When S is i2 <P t And S is i1 <P k At the time, P is set max =P t ,P min =P k ,S max =S i2 ,S min =P k Obtaining a normalized score S' by using a formula (3) j ;
The obtained normalized score S j Updating S of equation (1) j And updating the loss weighting of the uncoupling convolutional neural network model of the difficulty sample.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310505377.1A CN116229333B (en) | 2023-05-08 | 2023-05-08 | Difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310505377.1A CN116229333B (en) | 2023-05-08 | 2023-05-08 | Difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116229333A true CN116229333A (en) | 2023-06-06 |
CN116229333B CN116229333B (en) | 2023-07-21 |
Family
ID=86584639
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310505377.1A Active CN116229333B (en) | 2023-05-08 | 2023-05-08 | Difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116229333B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116912290A (en) * | 2023-09-11 | 2023-10-20 | 四川都睿感控科技有限公司 | Memory-enhanced method for detecting small moving targets of difficult and easy videos |
CN116934058A (en) * | 2023-09-18 | 2023-10-24 | 西南交通大学 | Product service decision method based on multi-agent reinforcement learning |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111310850A (en) * | 2020-03-02 | 2020-06-19 | 杭州雄迈集成电路技术股份有限公司 | License plate detection model construction method and system and license plate detection method and system |
CN112560876A (en) * | 2021-02-23 | 2021-03-26 | 中国科学院自动化研究所 | Single-stage small sample target detection method for decoupling measurement |
US20220092366A1 (en) * | 2020-09-24 | 2022-03-24 | Sri International | Region metrics for class balancing in machine learning systems |
CN115049884A (en) * | 2022-08-15 | 2022-09-13 | 菲特(天津)检测技术有限公司 | Broad-sense few-sample target detection method and system based on fast RCNN |
CN115240052A (en) * | 2022-08-18 | 2022-10-25 | 安徽大学 | Method and device for constructing target detection model |
CN115345165A (en) * | 2022-08-18 | 2022-11-15 | 北京邮电大学 | Specific entity identification method oriented to label scarcity or distribution unbalance scene |
WO2023014789A1 (en) * | 2021-08-03 | 2023-02-09 | The Regents Of The University Of California | System and method for pathology image analysis using a trained neural network and active learning framework |
CN115830371A (en) * | 2022-11-24 | 2023-03-21 | 广州松兴电气股份有限公司 | Deep learning-based rail transit subway steering frame rod member classification detection method |
CN115890641A (en) * | 2022-10-31 | 2023-04-04 | 吉林大学 | Decoupling type series elastic electric artificial muscle actuator and control method thereof |
-
2023
- 2023-05-08 CN CN202310505377.1A patent/CN116229333B/en active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111310850A (en) * | 2020-03-02 | 2020-06-19 | 杭州雄迈集成电路技术股份有限公司 | License plate detection model construction method and system and license plate detection method and system |
US20220092366A1 (en) * | 2020-09-24 | 2022-03-24 | Sri International | Region metrics for class balancing in machine learning systems |
CN112560876A (en) * | 2021-02-23 | 2021-03-26 | 中国科学院自动化研究所 | Single-stage small sample target detection method for decoupling measurement |
WO2023014789A1 (en) * | 2021-08-03 | 2023-02-09 | The Regents Of The University Of California | System and method for pathology image analysis using a trained neural network and active learning framework |
CN115049884A (en) * | 2022-08-15 | 2022-09-13 | 菲特(天津)检测技术有限公司 | Broad-sense few-sample target detection method and system based on fast RCNN |
CN115240052A (en) * | 2022-08-18 | 2022-10-25 | 安徽大学 | Method and device for constructing target detection model |
CN115345165A (en) * | 2022-08-18 | 2022-11-15 | 北京邮电大学 | Specific entity identification method oriented to label scarcity or distribution unbalance scene |
CN115890641A (en) * | 2022-10-31 | 2023-04-04 | 吉林大学 | Decoupling type series elastic electric artificial muscle actuator and control method thereof |
CN115830371A (en) * | 2022-11-24 | 2023-03-21 | 广州松兴电气股份有限公司 | Deep learning-based rail transit subway steering frame rod member classification detection method |
Non-Patent Citations (2)
Title |
---|
SHIDA ZHENG等: "Dual Decoupling Training for Semi-supervised Object Detection with Noise-Bypass Head", 《PROCEEDINGS OF THE AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE》, vol. 36, no. 3, pages 3526 - 3534 * |
陈灏然: "基于卷积神经网络的小目标检测算法研究", 《中国优秀硕士学位论文全文数据库 (信息科技辑)》, no. 3, pages 138 - 2639 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116912290A (en) * | 2023-09-11 | 2023-10-20 | 四川都睿感控科技有限公司 | Memory-enhanced method for detecting small moving targets of difficult and easy videos |
CN116912290B (en) * | 2023-09-11 | 2023-12-15 | 四川都睿感控科技有限公司 | Memory-enhanced method for detecting small moving targets of difficult and easy videos |
CN116934058A (en) * | 2023-09-18 | 2023-10-24 | 西南交通大学 | Product service decision method based on multi-agent reinforcement learning |
CN116934058B (en) * | 2023-09-18 | 2023-12-26 | 西南交通大学 | Product service decision method based on multi-agent reinforcement learning |
Also Published As
Publication number | Publication date |
---|---|
CN116229333B (en) | 2023-07-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN116229333B (en) | Difficulty target decoupling detection method based on difficulty level self-adaptive dynamic adjustment | |
CN111414479B (en) | Label extraction method based on short text clustering technology | |
US7362892B2 (en) | Self-optimizing classifier | |
CN110929848B (en) | Training and tracking method based on multi-challenge perception learning model | |
CN106506327B (en) | Junk mail identification method and device | |
US8051027B2 (en) | Method and system for transitioning from a case-based classifier system to a rule-based classifier system | |
CN110674305A (en) | Deep feature fusion model-based commodity information classification method | |
US20110119210A1 (en) | Multiple Category Learning for Training Classifiers | |
CN111832650B (en) | Image classification method based on generation of antagonism network local aggregation coding semi-supervision | |
CN107526805B (en) | ML-kNN multi-tag Chinese text classification method based on weight | |
CN112800232B (en) | Case automatic classification method based on big data | |
CN112861758B (en) | Behavior identification method based on weak supervised learning video segmentation | |
CN111832580B (en) | SAR target recognition method combining less sample learning and target attribute characteristics | |
CN112668698A (en) | Neural network training method and system | |
US8301584B2 (en) | System and method for adaptive pruning | |
JP6988995B2 (en) | Image generator, image generator and image generator | |
WO2020135054A1 (en) | Method, device and apparatus for video recommendation and storage medium | |
CN116662832A (en) | Training sample selection method based on clustering and active learning | |
CN113688867B (en) | Cross-domain image classification method | |
CN112257765B (en) | Zero sample image classification method and system based on unknown similarity class set | |
CN114693997A (en) | Image description generation method, device, equipment and medium based on transfer learning | |
CN115018006A (en) | Dempster-Shafer framework-based classification method | |
US7167587B2 (en) | Sequential classifier for use in pattern recognition system | |
CN111814468B (en) | Self-adaptive architecture semantic distribution text understanding method and system | |
CN114595695A (en) | Self-training model construction method for few-sample intention recognition system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |