CN111612076B - Image fine recognition method based on DCL and Cascade - Google Patents
- Publication number
- CN111612076B
- Authority
- CN
- China
- Prior art keywords
- image
- region
- dcl
- cascade
- loss
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computing Systems (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
The invention discloses a fine image recognition method based on DCL (Destruction and Construction Learning) and Cascade. Following the DCL concept, the original image is divided into blocks and the blocks are shuffled, so that the structural information in the original image is destroyed. A Cascade classifier is then used to train a neural network to recognize images whose local regions have been reordered, which forces the neural network to attend to the key visual regions. Cascading a weak classifier and a strong classifier improves both the fineness and the efficiency of image recognition, refining recognition down to texture information.
Description
Technical Field
The invention relates to the technical field of artificial intelligence image recognition, in particular to a DCL and Cascade-based image fine recognition method.
Background
With the rapid progress of science and technology over the past decade, general object recognition has advanced steadily, driven by large-scale annotated data and complex model design. However, recognizing fine object categories (e.g., birds, butterflies, automobile models, SKU-level merchandise) remains a challenging task. Objects from closely related categories look alike at a cursory glance, but they can be identified correctly from distinctive local details, so learning discriminative feature representations from distinctive local regions plays a key role in fine image recognition. Existing fine recognition methods can be roughly divided into two categories:
(1) The first category locates discriminative local regions of the target and then classifies according to those regions. This two-step approach requires additional bounding-box annotations on the target or its parts, and such annotation is very costly;
(2) The second category tries to locate the discriminative regions automatically, in an unsupervised manner, through an attention mechanism, so no additional annotation is required. However, these approaches typically require additional network structures (e.g., attention mechanisms) and thus introduce additional computational overhead in the training and prediction stages.
Therefore, how to overcome the above problems, improving the accuracy of fine recognition while reducing computational overhead and preserving efficiency, is a problem that currently needs to be solved.
Disclosure of Invention
The invention aims to overcome the high annotation cost of existing fine recognition methods and the additional computational overhead they introduce in the training and prediction stages. In the fine image recognition method based on DCL and Cascade, a DCL (Destruction and Construction Learning) branch is introduced to learn discriminative regions automatically: the original image is divided into blocks and the blocks are shuffled, which "destroys" the structural information in the image. The input image is first destroyed to emphasize discriminative local details, and the semantic correlation between local regions is then modeled to reconstruct the image. On the one hand, DCL locates the discriminative regions automatically, so no additional annotation is needed during training; on the other hand, the DCL structure is used only in the training stage, so no computational overhead is introduced at prediction time. In a shuffled image, irrelevant regions that are unimportant for fine recognition are ignored, and the network is forced to classify the image based on discriminative local details. Although recognition becomes more difficult, an expert can still easily find the differences. The neural network learns this expert knowledge through cascaded classifiers (from weak to strong) to classify and recognize the destroyed images, and, to prevent the network from learning the noise patterns introduced by destroying the global structure, an adversarial loss is used to suppress those patterns. The method therefore has good application prospects.
In order to achieve the purpose, the technical scheme adopted by the invention is as follows:
A fine image recognition method based on DCL and Cascade comprises the following steps:
step (A), after an original image is generated, performing initial analysis on the data of the original image, wherein the original image is a high-definition image;
step (B), based on a DCL image destruction algorithm, destroying the initially analyzed original image to emphasize the details of discriminative local regions, modeling the semantic correlation between the local regions to reconstruct the image, and making the network classify the reconstructed image based on the discriminative local details;
step (C), introducing a noise mode and a region reconstruction loss mode, so that the adversarial loss and the region alignment loss are minimized on the reconstructed image, to obtain a fine recognition image;
step (D), based on the neural network, classifying the destroyed and reconstructed images through cascaded classifiers that learn expert knowledge, proceeding from weak to strong, to form a knowledge classifier;
step (E), recognizing and judging the fine recognition image obtained in step (C) against the knowledge classifier, according to the accuracy required in advance.
In the fine image recognition method based on DCL and Cascade, step (B), in which the initially analyzed original image is destroyed based on the DCL image destruction algorithm to emphasize the details of discriminative local regions, the semantic correlation between the local regions is modeled to reconstruct the image, and the network is made to classify the reconstructed image based on the discriminative local details, comprises the following steps:
(B1) Given an original image $I$, the image is first uniformly divided into $N \times N$ sub-regions, each denoted $R_{i,j}$, where $i$ and $j$ are the horizontal and vertical indexes respectively, with $1 \le i, j \le N$;
(B2) The sub-regions $R_{i,j}$ are shuffled within their 2D neighbourhoods: for the sub-regions of the $j$-th row, a random vector $q_j$ of length $N$ is generated, whose $i$-th element is $q_{j,i} = i + r$, where $r$ is a random variable subject to a uniform distribution and $k$ is an adjustable parameter defining the neighbourhood range ($1 \le k < N$);
(B3) Sorting the random vector $q_j$ yields a new arrangement of the regions in the $j$-th row; in this way the region coordinates in the original image are converted from $(i, j)$ to $\sigma(i, j)$ to reconstruct the image.
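Steps (B1) to (B3) can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the patented implementation: the text only says that $r$ is uniformly distributed, so the range $U(-k, k)$ is assumed here; the function names `neighbourhood_permutation` and `region_confusion` are hypothetical; and blocks are moved first within each row and then within each column, which is one simple way to obtain a valid 2D rearrangement.

```python
import numpy as np

def neighbourhood_permutation(n, k, rng):
    """Permutation of range(n) from sorting i + r, with r ~ U(-k, k) (assumed range)."""
    q = np.arange(n) + rng.uniform(-k, k, size=n)
    return np.argsort(q)

def region_confusion(image, n, k, rng=None):
    """Shuffle the n x n grid of blocks of `image` within a local neighbourhood."""
    rng = np.random.default_rng() if rng is None else rng
    h, w = image.shape[:2]
    ph, pw = h // n, w // n            # block size; any remainder is cropped
    rest = image.shape[2:]             # optional channel axis
    # grid[r, c] is the block in row r, column c of the grid
    grid = image[:ph * n, :pw * n].reshape(n, ph, n, pw, *rest).swapaxes(1, 2).copy()
    for r in range(n):                 # move blocks within each row
        grid[r] = grid[r, neighbourhood_permutation(n, k, rng)]
    for c in range(n):                 # then within each column
        grid[:, c] = grid[neighbourhood_permutation(n, k, rng), c]
    return grid.swapaxes(1, 2).reshape(n * ph, n * pw, *rest)
```

Because every sort key moves by less than $k$, two blocks whose indexes differ by $2k$ or more can never swap order, so each block stays within $2k$ positions of where it started in each pass. A larger $k$ therefore destroys more of the global structure.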
In the fine image recognition method based on DCL and Cascade, step (C), in which a noise mode and a region reconstruction loss mode are introduced so that the adversarial loss and the region alignment loss are minimized on the reconstructed image to obtain a fine recognition image, comprises the following steps:
(C1) Introducing a noise mode so that the adversarial loss is minimized on the reconstructed image.
For an original image $I$, consider the $k$-th feature map of the $m$-th layer. The features of the backbone classification network ResNet-50 are visualized for two cases, with and without the adversarial loss, and the output features of the layer before the last fully connected layer are taken for adversarial learning. The response of the $k$-th convolution kernel of the $m$-th convolutional layer to the true category $c$ is considered,
wherein the weight in question is the weight between the $k$-th feature map and the corresponding class $c$; that is, the response $r^k(I, c)$ equals the weight of the feature map corresponding to the $k$-th convolution kernel multiplied by the weight of the fully connected layer corresponding to $c$. The response thus measures whether the convolution kernel can map the input image to $c$: the larger the response, the higher the confidence of the mapping.
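The response described in (C1) can be illustrated with a small NumPy sketch. It assumes that the "weight of the feature map" is the spatial mean of that feature map, as produced by the global average pooling that precedes the fully connected layer in ResNet-50. That reading is an assumption, and the names `kernel_responses`, `feature_maps` and `fc_weights` are hypothetical.

```python
import numpy as np

def kernel_responses(feature_maps, fc_weights, c):
    """Response of every convolution kernel to class c.

    feature_maps: array of shape (K, H, W), the K feature maps of one layer.
    fc_weights:   array of shape (K, num_classes), fully connected weights.
    Assumption: each feature map is summarised by its spatial mean.
    """
    pooled = feature_maps.mean(axis=(1, 2))   # shape (K,)
    return pooled * fc_weights[:, c]          # shape (K,), one response per kernel
```

Kernels with a large response for the true class are the ones the classifier relies on, which is why comparing these responses with and without the adversarial loss shows which patterns the network has learned.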
In the fine image recognition method based on DCL and Cascade, step (C), in which a noise mode and a region reconstruction loss mode are introduced so that the adversarial loss and the region alignment loss are minimized on the reconstructed image to obtain a fine recognition image, further comprises the following steps:
(C2) Introducing a region reconstruction loss mode so that the region alignment loss is minimized on the reconstructed image:
(C21) Given an original image $I$ and the corresponding reconstructed image $\phi(I)$, the region $R_{i,j}$ located at position $(i, j)$ in the original image $I$ corresponds to the region $R_{\sigma(i,j)}$ in the reconstructed image $\phi(I)$;
(C22) The region alignment network takes the output feature map of the $n$-th convolutional layer of the classification network and applies a convolution to it, obtaining an output with only two channels;
(C23) The output is passed through a ReLU (rectified linear unit) function and average pooling to obtain a feature map of size $2 \times N \times N$. The output of the region alignment network can be written as $M(I)$, whose two channels correspond to the row and column position coordinates respectively; $h$ is the region alignment network and $\theta_{loc}$ denotes its parameters. That is, each spatial position of the two-channel output feature map predicts one region position, with two values that predict the horizontal and vertical coordinates of the region, for a total of $N \times N$ sub-regions. The predicted position of region $R_{\sigma(i,j)}$ is denoted $M_{\sigma(i,j)}(\phi(I))$ and the predicted position of region $R_{i,j}$ is denoted $M_{i,j}(I)$; the true value of both predicted positions is $(i, j)$;
(C24) Calculating the region alignment loss $L_{loc}$, defined as the $L_1$ distance between the predicted coordinates and the original coordinates;
(C25) Minimizing the region alignment loss $L_{loc}$ on the reconstructed image.
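The loss in (C24) can be sketched as follows. The exact formula is not reproduced in this text, so the sketch simply sums the $L_1$ distances described in (C23) and (C24) over all regions of the original and the shuffled image. It uses 0-based indexes, whereas the text uses 1-based ones, and the function name `region_alignment_loss` and the array layout are hypothetical.

```python
import numpy as np

def region_alignment_loss(pred_orig, pred_shuffled, sigma):
    """L1 region alignment loss for one image pair.

    pred_orig:     (2, N, N) predicted coordinates M(I) for the original image.
    pred_shuffled: (2, N, N) predicted coordinates M(phi(I)) for the shuffled image.
    sigma:         (N, N, 2) integers; sigma[i, j] is the position in phi(I)
                   to which region (i, j) of I was moved.
    """
    n = pred_orig.shape[1]
    ii, jj = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    target = np.stack([ii, jj]).astype(float)      # true coordinates (i, j)
    loss_orig = np.abs(pred_orig - target).sum()
    # The prediction read at sigma(i, j) in phi(I) should also equal (i, j).
    si, sj = sigma[..., 0], sigma[..., 1]
    loss_shuffled = np.abs(pred_shuffled[:, si, sj] - target).sum()
    return loss_orig + loss_shuffled
```

The loss is zero only when the network recovers the original position of every region in both images, which is what forces it to model how the local regions relate to one another.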
In the fine image recognition method based on DCL and Cascade, step (D), in which, based on the neural network, the destroyed and reconstructed images are classified through cascaded classifiers that learn expert knowledge, proceeding from weak to strong, to form a knowledge classifier, comprises the following steps:
(D1) Assigning a weight to each sample in the training data;
(D2) Training a weak classifier on the training data and calculating its error rate, then training a weak classifier again on the same data set with updated weights, in which the weights of correctly classified samples are decreased and the weights of misclassified samples are increased; an error rate and a weight value are calculated in each round;
(D3) Calculating a vector D from the above parameters, then repeating steps (D1)-(D2) for the next iteration, continuing to train and adjust the weights until the training error rate is 0 or reaches the target value, thereby forming a strong classifier.
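The error-rate and weight formulas of step (D2) are not reproduced in this text. The procedure resembles standard AdaBoost, so the sketch below assumes the standard AdaBoost error rate, classifier weight and sample-weight update, and reads the "vector D" of step (D3) as the normalised sample-weight distribution. Both are assumptions rather than statements of the patented formulas, and `adaboost_round` is a hypothetical name.

```python
import numpy as np

def adaboost_round(weights, y_true, y_pred):
    """One boosting round: returns (error_rate, alpha, updated_weights).

    Assumes the standard AdaBoost update:
      error rate  eps   = weight of misclassified samples / total weight
      classifier  alpha = 0.5 * ln((1 - eps) / eps)
      sample weights are scaled by exp(+alpha) if misclassified, exp(-alpha) otherwise.
    """
    miss = y_true != y_pred
    eps = weights[miss].sum() / weights.sum()
    eps = np.clip(eps, 1e-12, 1 - 1e-12)          # avoid division by zero / log(0)
    alpha = 0.5 * np.log((1 - eps) / eps)
    new_w = weights * np.exp(np.where(miss, alpha, -alpha))
    return eps, alpha, new_w / new_w.sum()        # normalised distribution D
```

With four equally weighted samples and one mistake, the misclassified sample carries half of the total weight after the update, so the next weak classifier is pushed to get it right.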
In the fine image recognition method based on DCL and Cascade, in step (E), when comparing with the knowledge classifier, the strong classification stage is entered only after the weak classification stage has been executed.
The beneficial effects of the invention are as follows. In the fine image recognition method based on DCL and Cascade, a DCL branch is introduced to learn discriminative regions automatically: the input image is first destroyed to emphasize discriminative local details, and the semantic correlation between local regions is then modeled to reconstruct the image. On the one hand, DCL locates the discriminative regions automatically, so no additional annotation is needed during training; on the other hand, the DCL structure is used only in the training stage, so no computational overhead is introduced at prediction time. For the "destruction", a region confusion mechanism divides the input image into local blocks and then shuffles them randomly. Local details play a more important role in fine recognition than the global structure, since images from different fine categories often share the same global structure or shape and differ only in local details. Giving up the global structure while preserving local details forces the network to focus on discriminative local regions. After shuffling, irrelevant regions that are unimportant for fine recognition are ignored, and the network is forced to classify the image based on discriminative local details. Although recognition becomes more difficult, an expert can still easily find the differences. The neural network learns this expert knowledge through cascaded classifiers (from weak to strong) to classify and recognize the destroyed images, and, to prevent the network from learning the noise patterns introduced by destroying the global structure, an adversarial loss is used to suppress those patterns. The method therefore has good application prospects.
Drawings
FIG. 1 is a flow chart of the image fine recognition method based on DCL and Cascade of the present invention;
FIG. 2 is a flow chart of calculating the vector D in the present invention.
Detailed Description
The invention will be further described with reference to the accompanying drawings.
As shown in FIG. 1, the fine image recognition method based on DCL and Cascade of the present invention comprises the following steps:
step (A), after an original image is generated, performing initial analysis on the data of the original image, wherein the original image is a high-definition image;
step (B), based on the DCL image destruction algorithm, destroying the initially analyzed original image to emphasize the details of discriminative local regions, modeling the semantic correlation between the local regions to reconstruct the image, and making the network classify the reconstructed image based on the discriminative local details, comprising the following steps:
(B1) Given an original image $I$, the image is first uniformly divided into $N \times N$ sub-regions, each denoted $R_{i,j}$, where $i$ and $j$ are the horizontal and vertical indexes respectively, with $1 \le i, j \le N$;
(B2) The sub-regions $R_{i,j}$ are shuffled within their 2D neighbourhoods: for the sub-regions of the $j$-th row, a random vector $q_j$ of length $N$ is generated, whose $i$-th element is $q_{j,i} = i + r$, where $r$ is a random variable subject to a uniform distribution and $k$ is an adjustable parameter defining the neighbourhood range ($1 \le k < N$);
(B3) Sorting the random vector $q_j$ yields a new arrangement of the regions in the $j$-th row; in this way the region coordinates in the original image are converted from $(i, j)$ to $\sigma(i, j)$ to reconstruct the image;
at the same time, it can be verified that:
thus, the region coordinates in the original image are converted from $(i, j)$ to $\sigma(i, j)$:
This shuffling method destroys the global structure while ensuring that each local region moves randomly only within a neighbourhood of adjustable size. During training, the original image $I$, its destroyed version $\phi(I)$, and the one-vs-all label $l$ indicating the true fine category of the original image are combined as $\langle I, \phi(I), l \rangle$. The classification network maps the input image to a probability distribution vector $C(I, \theta_{cls})$, where $\theta_{cls}$ denotes all the learnable parameters of the layers in the classification network. The loss function $L_{cls}$ of the classification network can be written as:
where $\Gamma$ is the set of all training images;
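The formulas referred to in this passage are not reproduced in the text. As a hedged reconstruction, assuming the method follows the published Destruction and Construction Learning formulation, they may read as below. The range $U(-k, k)$, the symbols $\sigma_j^{row}$ and $\sigma_i^{col}$, and the form of $L_{cls}$ are taken from that formulation rather than from this text.

```latex
\begin{align}
  q_{j,i} &= i + r, \qquad r \sim U(-k, k), \quad 1 \le k < N \\
  \forall i \in \{1, \dots, N\}: \quad
  \left| \sigma_j^{row}(i) - i \right| &< 2k \\
  \sigma(i, j) &= \left( \sigma_j^{row}(i),\; \sigma_i^{col}(j) \right) \\
  L_{cls} &= - \sum_{I \in \Gamma} l \cdot \log \left[ C(I)\, C(\phi(I)) \right]
\end{align}
```

The bound of $2k$ follows from the sorting step. Each sort key differs from its index by less than $k$, so two regions whose indexes differ by $2k$ or more keep their relative order.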
step (C), introducing a noise mode and a region reconstruction loss mode, so that the adversarial loss and the region alignment loss are minimized on the reconstructed image, to obtain a fine recognition image, comprising the following steps:
(C1) Introducing a noise mode so that the adversarial loss is minimized on the reconstructed image.
For an original image $I$, consider the $k$-th feature map of the $m$-th layer. The features of the backbone classification network ResNet-50 are visualized for two cases, with and without the adversarial loss, and the output features of the layer before the last fully connected layer are taken for adversarial learning. The response of the $k$-th convolution kernel of the $m$-th convolutional layer to the true category $c$ is considered,
wherein the weight in question is the weight between the $k$-th feature map and the corresponding class $c$; that is, the response $r^k(I, c)$ equals the weight of the feature map corresponding to the $k$-th convolution kernel multiplied by the weight of the fully connected layer corresponding to $c$. The response thus measures whether the convolution kernel can map the input image to $c$: the larger the response, the higher the confidence of the mapping. To minimize the loss, the network can learn neither the coarse features of the overall contour nor edge-type noise patterns, but only the features common to both, thereby enhancing discriminative local details and filtering out irrelevant features;
(C2) Introducing a region reconstruction loss mode so that the region alignment loss is minimized on the reconstructed image:
(C21) Given an original image $I$ and the corresponding reconstructed image $\phi(I)$, the region $R_{i,j}$ located at position $(i, j)$ in the original image $I$ corresponds to the region $R_{\sigma(i,j)}$ in the reconstructed image $\phi(I)$;
(C22) The region alignment network takes the output feature map of the $n$-th convolutional layer of the classification network and applies a convolution to it, obtaining an output with only two channels;
(C23) The output is passed through a ReLU (rectified linear unit) function and average pooling to obtain a feature map of size $2 \times N \times N$. The output of the region alignment network can be written as $M(I)$, whose two channels correspond to the row and column position coordinates respectively; $h$ is the region alignment network and $\theta_{loc}$ denotes its parameters. That is, each spatial position of the two-channel output feature map predicts one region position, with two values that predict the horizontal and vertical coordinates of the region, for a total of $N \times N$ sub-regions. The predicted position of region $R_{\sigma(i,j)}$ is denoted $M_{\sigma(i,j)}(\phi(I))$ and the predicted position of region $R_{i,j}$ is denoted $M_{i,j}(I)$; the true value of both predicted positions is $(i, j)$;
(C24) Calculating the region alignment loss $L_{loc}$, defined as the $L_1$ distance between the predicted coordinates and the original coordinates;
(C25) Minimizing the region alignment loss $L_{loc}$ on the reconstructed image;
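The region alignment network of steps (C22) and (C23) can be sketched as a small forward pass in NumPy. The sketch assumes a 1 × 1 convolution (a per-pixel linear map over channels) for the two-channel projection and assumes that the average pooling reduces the spatial size to $N \times N$. The function name and the parameter names `weight` and `bias` are hypothetical.

```python
import numpy as np

def region_alignment_head(features, weight, bias, n):
    """Forward pass of a two-channel region alignment head.

    features: (C, H, W) output feature map of a convolutional layer.
    weight:   (2, C) weights of the assumed 1 x 1 convolution.
    bias:     (2,) biases of the assumed 1 x 1 convolution.
    Returns an array of shape (2, n, n): predicted row and column coordinates.
    """
    c, h, w = features.shape
    out = np.einsum("oc,chw->ohw", weight, features) + bias[:, None, None]
    out = np.maximum(out, 0.0)                     # ReLU
    ph, pw = h // n, w // n                        # pooling window; remainder cropped
    out = out[:, :ph * n, :pw * n].reshape(2, n, ph, n, pw)
    return out.mean(axis=(2, 4))                   # average pooling to n x n
```

Because the head is only a projection, an activation and a pooling step, it adds little cost during training, and it can be dropped entirely at prediction time along with the rest of the DCL branch.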
step (D), based on the neural network, classifying the destroyed and reconstructed images through cascaded classifiers that learn expert knowledge, proceeding from weak to strong, to form a knowledge classifier, comprising the following steps:
(D1) Assigning a weight to each sample in the training data;
(D2) Training a weak classifier on the training data and calculating its error rate, then training a weak classifier again on the same data set with updated weights, in which the weights of correctly classified samples are decreased and the weights of misclassified samples are increased; an error rate and a weight value are calculated in each round;
(D3) Calculating a vector D from the above parameters according to the process shown in FIG. 2, then repeating steps (D1)-(D2) for the next iteration, continuing to train and adjust the weights until the training error rate is 0 or reaches the target value, thereby forming a strong classifier;
step (E), recognizing and judging the fine recognition image obtained in step (C) against the knowledge classifier, according to the accuracy required in advance, wherein the strong classification stage is entered only after the weak classification stage has been executed.
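Step (E) can be read as the staged evaluation of a conventional cascade: the stages run from weak to strong, and a later stage is reached only after the earlier ones have run. The sketch below additionally assumes that a sample is rejected as soon as one stage scores it below that stage's threshold, which is the usual cascade rule but is not stated in the text. All names are hypothetical.

```python
from typing import Callable, Sequence, Tuple

# A stage is a scoring function paired with its acceptance threshold.
Stage = Tuple[Callable[[float], float], float]

def cascade_predict(x: float, stages: Sequence[Stage]) -> Tuple[bool, int]:
    """Run the stages in order, from weak to strong.

    Stops at the first stage whose score falls below its threshold
    (assumed early-rejection rule). Returns (accepted, stages_evaluated).
    """
    evaluated = 0
    for score_fn, threshold in stages:
        evaluated += 1
        if score_fn(x) < threshold:
            return False, evaluated
    return True, evaluated
```

Easy rejections are made by the cheap weak stages, so the strong stage is evaluated only for samples that survive, which is where a cascade saves computation.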
In the fine image recognition method based on DCL and Cascade, DCL locates the discriminative regions automatically, so no additional annotation is needed during training. The DCL structure is used only in the training stage, so no computational overhead is introduced at prediction time. Giving up the global structure while preserving local details forces the network to focus on discriminative local regions; after shuffling, irrelevant regions that are unimportant for fine recognition are ignored, and the network is forced to classify the image based on discriminative local details.
In summary, in the fine image recognition method based on DCL and Cascade of the present invention, a DCL branch is introduced to learn discriminative regions automatically: the input image is first destroyed to emphasize discriminative local details, and the semantic correlation between local regions is then modeled to reconstruct the image. On the one hand, DCL locates the discriminative regions automatically, so no additional annotation is needed during training; on the other hand, the DCL structure is used only in the training stage, so no computational overhead is introduced at prediction time. For the "destruction", a region confusion mechanism divides the input image into local blocks and then shuffles them randomly. Local details play a more important role in fine recognition than the global structure, since images from different fine categories often share the same global structure or shape and differ only in local details. Although recognition becomes more difficult after shuffling, an expert can still easily find the differences. The neural network learns this expert knowledge through cascaded classifiers (from weak to strong) to classify and recognize the destroyed images, and, to prevent the network from learning the noise patterns introduced by destroying the global structure, an adversarial loss is used to suppress those patterns. The method therefore has good application prospects.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.
Claims (6)
1. A fine image recognition method based on DCL and Cascade, characterized by comprising the following steps:
step (A), after an original image is generated, performing initial analysis on the data of the original image, wherein the original image is a high-definition image;
step (B), based on a DCL image destruction algorithm, destroying the initially analyzed original image to emphasize the details of discriminative local regions, modeling the semantic correlation between the local regions to reconstruct the image, and making the network classify the reconstructed image based on the discriminative local details;
step (C), introducing a noise mode and a region reconstruction loss mode, so that the adversarial loss and the region alignment loss are minimized on the reconstructed image, to obtain a fine recognition image;
step (D), based on the neural network, classifying the destroyed and reconstructed images through cascaded classifiers that learn expert knowledge, proceeding from weak to strong, to form a knowledge classifier;
step (E), recognizing and judging the fine recognition image obtained in step (C) against the knowledge classifier, according to the accuracy required in advance.
2. The fine image recognition method based on DCL and Cascade according to claim 1, wherein step (B), based on the DCL image destruction algorithm, destroying the initially analyzed original image to emphasize the details of discriminative local regions, modeling the semantic correlation between the local regions to reconstruct the image, and making the network classify the reconstructed image based on the discriminative local details, comprises the following steps:
(B1) Given an original image $I$, the image is first uniformly divided into $N \times N$ sub-regions, each denoted $R_{i,j}$, where $i$ and $j$ are the horizontal and vertical indexes respectively, with $1 \le i, j \le N$;
(B2) The sub-regions $R_{i,j}$ are shuffled within their 2D neighbourhoods: for the sub-regions of the $j$-th row, a random vector $q_j$ of length $N$ is generated, whose $i$-th element is $q_{j,i} = i + r$, where $r$ is a random variable subject to a uniform distribution and $k$ is an adjustable parameter defining the neighbourhood range, with $1 \le k < N$.
3. The fine image recognition method based on DCL and Cascade according to claim 1, wherein step (C), introducing a noise mode and a region reconstruction loss mode so that the adversarial loss and the region alignment loss are minimized on the reconstructed image to obtain a fine recognition image, comprises the following steps:
(C1) Introducing a noise mode so that the adversarial loss is minimized on the reconstructed image.
For an original image $I$, consider the $k$-th feature map of the $m$-th layer. The features of the backbone classification network ResNet-50 are visualized for two cases, with and without the adversarial loss, and the output features of the layer before the last fully connected layer are taken for adversarial learning. The response of the $k$-th convolution kernel of the $m$-th convolutional layer to the true category $c$ is considered,
wherein the weight in question is the weight between the $k$-th feature map and the corresponding class $c$; that is, the response $r^k(I, c)$ equals the weight of the feature map corresponding to the $k$-th convolution kernel multiplied by the weight of the fully connected layer corresponding to $c$. The response thus measures whether the convolution kernel can map the input image to $c$: the larger the response, the higher the confidence of the mapping.
4. The fine image recognition method based on DCL and Cascade according to claim 2, wherein: step (C), introducing noise and a region reconstruction loss so that the reconstructed image resolves the adversarial loss and the region alignment loss to obtain a finely recognized image, comprises the following steps,
(C2) Introducing a region reconstruction loss so that the reconstructed image resolves the region alignment loss:
(C21) Given an original image I and the corresponding reconstructed image φ(I), the region R_{i,j} at position (i, j) in the original image I corresponds to the region R_{σ(i,j)} in the reconstructed image φ(I);
(C22) The region alignment network takes the output feature map of the nth convolution layer of the classification network and applies a 1 × 1 convolution to it, producing an output with only two channels;
(C23) The output is passed through a ReLU (rectified linear unit) function and average pooling to obtain a feature map of size 2 × N × N. The output of the region alignment network h is written as M(I), where the two channels of M(I) correspond to the row and column position coordinates and θ_loc is the parameter of the region alignment network. That is, each spatial position of the two-channel output feature map predicts one region position, with two values predicting the horizontal and vertical coordinates of the region, for N × N sub-regions in total. The predicted position of region R_{σ(i,j)} in φ(I) is denoted M_{σ(i,j)}(φ(I)), and the predicted position of region R_{i,j} in I is denoted M_{i,j}(I); the true value for both predicted positions is (i, j);
(C24) Calculating the region alignment loss L_loc, defined as the L_1 distance between the predicted coordinates and the original coordinates (i, j);
(C25) Based on the region alignment loss L_loc, the reconstructed image is made to resolve the region alignment loss.
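Steps (C22) to (C24) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: coordinates are 0-based, the loss is computed for a single image pair, average pooling uses non-overlapping windows, and the names `region_alignment_output`, `region_alignment_loss` and the `sigma` array layout are hypothetical.

```python
import numpy as np

def region_alignment_output(features: np.ndarray, conv_weight: np.ndarray, n: int) -> np.ndarray:
    """features: (C, H, W); conv_weight: (2, C) 1x1 kernel; returns M of shape (2, n, n)."""
    # A 1x1 convolution is a per-pixel linear map over the channel axis.
    two_channel = np.einsum("oc,chw->ohw", conv_weight, features)
    two_channel = np.maximum(two_channel, 0.0)            # ReLU
    _, h, w = two_channel.shape
    assert h % n == 0 and w % n == 0, "assumes H and W are divisible by n"
    # Average pooling over non-overlapping (h/n) x (w/n) windows.
    return two_channel.reshape(2, n, h // n, n, w // n).mean(axis=(2, 4))

def region_alignment_loss(pred_original: np.ndarray, pred_shuffled: np.ndarray,
                          sigma: np.ndarray) -> float:
    """L1 distance between predicted and true region coordinates.

    pred_original: M(I), shape (2, n, n).
    pred_shuffled: M(phi(I)), shape (2, n, n).
    sigma:         (n, n, 2); sigma[i, j] is the (row, col) of region (i, j) after shuffling.
    """
    n = pred_original.shape[1]
    rows, cols = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    target = np.stack([rows, cols]).astype(float)         # target[:, i, j] == (i, j)
    loss_original = np.abs(pred_original - target).sum()
    # Read the prediction for region (i, j) at its shuffled location sigma(i, j).
    moved = pred_shuffled[:, sigma[..., 0], sigma[..., 1]]
    loss_shuffled = np.abs(moved - target).sum()
    return float(loss_original + loss_shuffled)

# Hypothetical toy input: 8 channels, 4 x 4 feature map, N = 2.
rng = np.random.default_rng(0)
feats = rng.standard_normal((8, 4, 4))
kernel = rng.standard_normal((2, 8))
m_original = region_alignment_output(feats, kernel, n=2)
```

The loss is zero only when both the original and the shuffled image predict the true position (i, j) for every region, which is the consistency that step (C21) requires.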
5. The DCL and Cascade-based image fine recognition method according to claim 1, wherein: step (D), classifying the destroyed and reconstructed images by destruction intensity, from weak to strong, using a neural-network-based cascade learning expert knowledge classifier to form the knowledge classifier, comprises the following steps,
(D1) Assigning a weight to each sample in the training data;
(D2) Training a weak classifier on the training data and calculating its error rate, then training the weak classifier again on the same data set with updated weights, in which the weights of correctly classified samples are reduced and the weights of misclassified samples are increased; the error rate and the weight value are calculated at each round;
(D3) Calculating the vector D from these parameters, then repeating steps (D1)-(D2) for the next iteration, continuing to train and adjust the weights until the training error rate is 0 or reaches the target value, thereby forming the strong classifier.
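Steps (D1) to (D3) follow the general pattern of boosting. The sketch below uses the standard AdaBoost definitions as an assumption, since the error-rate and weight formulas are not reproduced in this text: the error rate is the weighted fraction of misclassified samples, the classifier weight is alpha = 0.5 · ln((1 − error) / error), and D is the normalized sample weight vector. The threshold stumps and all function names are hypothetical stand-ins for the weak classifiers.

```python
import numpy as np

def train_stump(x, y, d):
    """Pick the threshold stump with the lowest weighted error. x: (m, f), y in {-1, +1}."""
    best = None
    for feature in range(x.shape[1]):
        for threshold in np.unique(x[:, feature]):
            for sign in (1, -1):
                pred = np.where(x[:, feature] <= threshold, -sign, sign)
                error = d[pred != y].sum()
                if best is None or error < best[0]:
                    best = (error, feature, threshold, sign)
    return best

def stump_predict(x, feature, threshold, sign):
    return np.where(x[:, feature] <= threshold, -sign, sign)

def adaboost(x, y, max_rounds=20, target_error=0.0):
    m = len(y)
    d = np.full(m, 1.0 / m)                       # (D1) one weight per sample
    ensemble, score = [], np.zeros(m)
    for _ in range(max_rounds):
        error, feature, threshold, sign = train_stump(x, y, d)   # (D2)
        error = min(max(error, 1e-10), 1.0 - 1e-10)              # avoid log(0)
        alpha = 0.5 * np.log((1.0 - error) / error)              # assumed AdaBoost weight
        pred = stump_predict(x, feature, threshold, sign)
        d = d * np.exp(-alpha * y * pred)         # shrink correct, grow misclassified
        d /= d.sum()                              # (D3) updated weight vector D
        ensemble.append((alpha, feature, threshold, sign))
        score += alpha * pred
        if np.mean(np.sign(score) != y) <= target_error:
            break                                 # training error reached the target
    return ensemble

def predict(ensemble, x):
    score = sum(a * stump_predict(x, f, t, s) for a, f, t, s in ensemble)
    return np.sign(score)

# Hypothetical toy data: no single threshold separates the two classes.
x_train = np.arange(10).reshape(-1, 1)
y_train = np.array([1, 1, 1, -1, -1, -1, 1, 1, 1, -1])
ensemble = adaboost(x_train, y_train)
```

Each stump alone misclassifies part of the toy data; the weighted vote of several stumps is what plays the role of the strong classifier.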
6. The fine image recognition method based on DCL and Cascade according to claim 1, wherein: step (E) compares the weak classification stage with the knowledge classifier, and if the weak classification stage is not executed, the strong classification stage is not entered.
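One possible reading of this gating behavior is an early-exit cascade, sketched below. The stage functions, their thresholds, and the name `run_cascade` are hypothetical; the claim does not specify how a stage decides, so this only illustrates that a later (stronger) stage is evaluated only after every earlier (weaker) stage has accepted the sample.

```python
from typing import Callable, List, Sequence

def run_cascade(sample: Sequence[float],
                stages: List[Callable[[Sequence[float]], bool]]) -> int:
    """Return how many stages the sample passed, stopping at the first rejection."""
    passed = 0
    for stage in stages:
        if not stage(sample):
            break            # later, stronger stages are never entered
        passed += 1
    return passed

calls: List[str] = []        # records which stages were actually evaluated

def weak_stage(sample: Sequence[float]) -> bool:
    calls.append("weak")
    return sample[0] > 0.2   # hypothetical weak-stage threshold

def strong_stage(sample: Sequence[float]) -> bool:
    calls.append("strong")
    return sample[0] > 0.8   # hypothetical strong-stage threshold

rejected = run_cascade([0.1], [weak_stage, strong_stage])
```

Ordering the stages from weak to strong keeps the cheaper checks first, so samples that fail early never incur the cost of the stronger stage.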
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010444726.XA CN111612076B (en) | 2020-05-23 | 2020-05-23 | Image fine recognition method based on DCL and Cascade |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111612076A (en) | 2020-09-01 |
CN111612076B (en) | 2023-04-18 |
Family
ID=72202381
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010444726.XA Active CN111612076B (en) | 2020-05-23 | 2020-05-23 | Image fine recognition method based on DCL and Cascade |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111612076B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113159171B (en) * | 2021-04-20 | 2022-07-22 | 复旦大学 | Plant leaf image fine classification method based on counterstudy |
CN116129426A (en) * | 2022-11-24 | 2023-05-16 | 广东劢智医疗科技有限公司 | Fine granularity classification method for cervical cell smear 18 category |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108037133A (en) * | 2017-12-27 | 2018-05-15 | 武汉市智勤创亿信息技术股份有限公司 | A kind of power equipments defect intelligent identification Method and its system based on unmanned plane inspection image |
CN111178432A (en) * | 2019-12-30 | 2020-05-19 | 武汉科技大学 | Weak supervision fine-grained image classification method of multi-branch neural network model |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10733334B2 (en) * | 2017-02-22 | 2020-08-04 | Middle Chart, LLC | Building vital conditions monitoring |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right | | Denomination of invention: A Fine Image Recognition Method Based on DCL and Cascade; Granted publication date: 2023-04-18; Pledgee: Bank of China Limited by Share Ltd. Yangjiang branch; Pledgor: NANJING UNIVERSITY SMARTERCITY PROGRAM DESIGN Co.,Ltd.; Registration number: Y2024980001801 |