CN111612076B - Image fine recognition method based on DCL and Cascade - Google Patents

Image fine recognition method based on DCL and Cascade Download PDF

Info

Publication number
CN111612076B
CN111612076B CN202010444726.XA CN202010444726A CN111612076B CN 111612076 B CN111612076 B CN 111612076B CN 202010444726 A CN202010444726 A CN 202010444726A CN 111612076 B CN111612076 B CN 111612076B
Authority
CN
China
Prior art keywords
image
region
dcl
cascade
loss
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010444726.XA
Other languages
Chinese (zh)
Other versions
CN111612076A (en
Inventor
李旻
沈华飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University Smartercity Program Design Co ltd
Original Assignee
Nanjing University Smartercity Program Design Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University Smartercity Program Design Co ltd filed Critical Nanjing University Smartercity Program Design Co ltd
Priority to CN202010444726.XA priority Critical patent/CN111612076B/en
Publication of CN111612076A publication Critical patent/CN111612076A/en
Application granted granted Critical
Publication of CN111612076B publication Critical patent/CN111612076B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses an image fine recognition method based on DCL (digital communication link) and Cascade, which is characterized in that an original image is input according to blocks according to the DCL (Destruction and Construction Learning) concept to be disturbed, so that structural information in the original image is damaged, then a Cascade Cascade classifier is used for training a neural network to recognize images with damaged local area sequences, the neural network is forced to grasp key visual areas, the image recognition fineness and recognition efficiency are improved through a Cascade weak classifier and a Cascade strong classifier, and the images are thinned to texture information.

Description

Image fine recognition method based on DCL and Cascade
Technical Field
The invention relates to the technical field of artificial intelligence image recognition, in particular to a DCL and Cascade-based image fine recognition method.
Background
With the rapid progress of science and technology, in the past decade, general target recognition has made steady progress with large-scale labeling data and complex model design. However, identifying fine object categories (e.g., birds, butterflies, automobile models, SKU-grade merchandise) remains a challenging task. Slightly different objects tend to visually resemble cursory glances, but they can be correctly identified by distinctive local area details, so learning the discriminative feature representation locally from distinctive objects plays a key role in fine image recognition. The existing fine identification methods can be roughly divided into two categories, specifically as follows:
(1) One method is that firstly, a target local area with discriminant is positioned, and then classification is carried out according to the discriminant areas, the two-step method needs to add additional boundary box marks on the target or the target local, and the cost of the marks is very high;
(2) Another is to try to locate the discrimination area automatically by attention mechanism in an unsupervised manner, so no additional comments are required. However, these approaches typically require additional network structures (e.g., attention mechanisms), thus introducing additional computational overhead for the training and prediction phases.
Therefore, how to overcome the above problems, improve the accuracy of the fine recognition method, reduce the calculation overhead, and ensure the efficiency is currently continuously solved.
Disclosure of Invention
The invention aims to overcome the problems of high cost and additional calculation overhead introduced in the training and predicting stages of the existing fine recognition method. According to the fine image identification method based on DCL and Cascade, a DCL (Destruct and Construction Learning) branch is introduced to automatically learn a discriminant region by disordering the input of an original image according to blocks and further 'destroying' structural information in the image, firstly, the input image is destroyed to emphasize discriminant local details, and then, semantic correlation between the local regions is modeled to reconstruct the image; on one hand, the DCL automatically positions the judgment area, so that additional marking is not needed during training; on the other hand, the DCL structure is only adopted in a training stage, so that the calculation overhead is not introduced in the prediction process, the disordered images ignore irrelevant areas which are not important for fine recognition and force the network to classify the images based on the local details of discriminability, although the recognition becomes more difficult, the experts can still easily find differences, the neural network classifies and recognizes the damaged images by cascading knowledge classifiers (from weak to strong) of learning experts, and in order to prevent the noise patterns introduced by the network learning damaged global structures, the adversarial resistance loss is used for inhibiting the introduced noise patterns, so that the application prospect is good.
In order to achieve the purpose, the technical scheme adopted by the invention is as follows:
a DCL and Cascade-based image fine recognition method comprises the following steps,
after an original image is generated, carrying out initial analysis on data of the original image, wherein the original image is a high-definition image;
based on a DCL image destruction algorithm, destroying an original image subjected to initial analysis to emphasize details of local regions with discriminant, modeling semantic correlation between the local regions to reconstruct an image, and enabling a network to classify the reconstructed image based on the local details with discriminant;
step (C), a noise mode and a region reconstruction loss mode are introduced, so that the reconstructed image solves the antagonism loss and the region alignment loss, and a fine identification image is obtained;
classifying the strength of the damaged reconstructed image from weak to strong through a knowledge classifier of a cascade learning expert on the basis of a neural network to form a knowledge classifier;
and (E) identifying and judging the fine identification image obtained in the step (C) according to the accuracy required in advance by comparing the fine identification image with the knowledge classifier.
The fine image recognition method based on DCL and Cascade, step (B), based on DCL image destruction algorithm, destroys the original image after initial analysis to emphasize the detail of the local regions with discriminant, models the semantic correlation between the local regions to reconstruct the image, makes the network classify the reconstructed image based on the local details with discriminant, includes the following steps,
(B1) Given an original image I, the image is first uniformly divided into N sub-regions, each sub-region defined by R i,j Representing, wherein i and j are horizontal and vertical indexes respectively, i is more than or equal to 1, and N is more than or equal to j;
(B2) From the subregion R i,j Mixed in their 2D neighbourhoods, for the sub-area of row j, a random vector number sequence q of length N is generated j Wherein the ith element q j,i =i+r,
Figure BDA0002505370180000041
Is a random variable subject to uniform distribution, and k is an adjustable parameter defining a neighborhood range (k is more than or equal to 1 and less than N);
(B3) By counting the sequence q of random vectors j Reordering to obtain a new arrangement of jth row area
Figure BDA0002505370180000042
This transforms the region coordinates in the original image from (i, j) to σ (i, j) to reconstruct the image.
The fine image recognition method based on DCL and Cascade, step (C), introducing noise mode and region reconstruction loss mode, making the reconstructed image solve the antagonism loss and region alignment loss, obtaining the fine recognition image, includes the following steps,
(C1) Introducing a noise mode to make the reconstructed image solve the antagonism loss
Original image I, use
Figure BDA0002505370180000051
A k characteristic diagram showing the m layer, which is used for visualizing the characteristics of the backbone classification network ResNet-50 and comprises two conditions of using and not using antagonism loss, and taking the output characteristics of the layer in front of the last fully-connected layer to carry out antagonism learning, wherein the k convolution core of the m convolution layer responds to the real category c,
Figure BDA0002505370180000052
wherein,
Figure BDA0002505370180000053
is the weight between the kth feature map and the corresponding class c, i.e. the response r k And (I, c) is equal to the weight of the characteristic graph corresponding to the kth convolution kernel multiplied by the weight of the full-connection layer corresponding to c, so that whether the convolution kernel can map the input image to c is measured, and the higher the response is, the higher the credibility of the mapping is.
The fine image recognition method based on DCL and Cascade, step (C), introducing noise mode and region reconstruction loss mode, making the reconstructed image solve the antagonism loss and region alignment loss, obtaining the fine recognition image, includes the following steps,
(C2) Introducing a region reconstruction loss mode to enable the reconstructed image to solve the region alignment loss
(C21) Given an original image I and a corresponding reconstructed image phi (I), a region R at a position (I, j) in the original image I i,j With the region R in the reconstructed image phi (I) σ(i,j) Consistency;
(C22) The area alignment network isOutput feature map for nth convolutional layer of classification network
Figure BDA0002505370180000054
Performing convolution processing on the operation characteristic diagram to obtain output with only two channels;
(C23) The output is processed by a ReLU linear rectification function and average pooling to obtain a feature map with the size of 2 × N, and the output of the area alignment network can be written as:
Figure BDA0002505370180000061
the two channels in M (I) are respectively corresponding to row and column position coordinates, h is an area alignment network, and theta loc Is a parameter of the area alignment network, namely, each spatial position point of the output characteristic graph of the two channels predicts an area position, each spatial position point has two values to respectively predict the horizontal and vertical coordinates of the area, a total of N multiplied by N sub-areas are recorded in the area R σ(i,j) The predicted position is M σ(i,j) (phi (I)) for the region R i,j Is M i,j (I, j), the true values for both predicted positions are (I, j);
(C24) Calculating the region alignment loss L loc Defined as L of the predicted coordinates and the original coordinates 1 Distance:
Figure BDA0002505370180000062
(C25) According to the region alignment penalty L loc And the reconstructed image is made to solve the region alignment loss.
The fine image recognition method based on DCL and Cascade, step (D), based on neural network, through the knowledge classifier of the Cascade learning expert, from weak to strong to classify the intensity of the damaged reconstructed image, forming the knowledge classifier, includes the following steps,
(D1) Training each sample in the data and giving a weight to the sample;
(D2) Training a weak classifier on the training data and calculating its errorsError rate, then the updated weights on the unified data set again
Figure BDA0002505370180000063
Weak score of training
Figure BDA0002505370180000064
The classifier reduces the weight of the paired samples, improves the weight of the error, and has the error rate as follows:
weight value:
increasing the weight value:
Figure BDA0002505370180000071
reducing the weight value:
Figure BDA0002505370180000072
(D3) And calculating to obtain a vector D through the parameters, repeating the steps (D1) - (D2) to enter the next iteration, and continuously repeating the training and adjusting the weight until the training error rate is 0 or the target value is reached to form the strong classifier.
In the fine image recognition method based on DCL and Cascade, step (E), by comparing with the knowledge classifier, if the weak classification stage is not executed, the strong classification stage is not entered.
The invention has the beneficial effects that: according to the image fine recognition method based on the DCL and the Cascade, a DCL branch is introduced to automatically learn a discriminant region, firstly, an input image is damaged to emphasize local details with discriminant, and then semantic correlation between local regions is modeled to reconstruct an image; on one hand, the DCL automatically positions the discrimination area, so that additional marking is not needed during training; on the other hand, the DCL structure is only used in the training phase, so that no computational overhead is introduced in prediction, and for "breaking", a regional obfuscation mechanism is used to divide the input image into local tiles and then randomly obfuscate. Local details play a more important role in fine recognition than global structures, since images from different fine classes often have the same global structure or shape, differing only in local details. Leaving up the global structure to maintain local details may force the network to focus on local areas that are discriminative. After being scrambled, irrelevant areas that are not important for fine recognition will be ignored and the network will be forced to classify the image based on local details of discriminability. Although recognition becomes more difficult, the expert can easily find the difference. The neural network classifies and identifies the damaged images through a knowledge classifier (from weak to strong) of a cascade learning expert, and in order to prevent noise patterns introduced by network learning damage of a global structure, antagonism loss is used for restraining the introduced noise patterns, so that the neural network has a good application prospect.
Drawings
FIG. 1 is a flow chart of the image fine recognition method based on DCL and Cascade of the present invention;
fig. 2 is a flow chart of the present invention for calculating vector D.
Detailed Description
The invention will be further described with reference to the accompanying drawings.
As shown in fig. 1, the image fine recognition method based on DCL and Cascade of the present invention includes the following steps,
after an original image is generated, carrying out initial analysis on data of the original image, wherein the original image is a high-definition image;
step (B), based on DCL image destruction algorithm, destroying the original image after initial analysis to emphasize the details of local regions with discriminant, modeling semantic correlation between the local regions to reconstruct the image, and making the network classify the reconstructed image based on the local details with discriminant, including the following steps,
(B1) Given an original image I, the image is first uniformly divided into N sub-regions, each sub-region defined by R i,j Representing, wherein i and j are horizontal and vertical indexes respectively, i is more than or equal to 1, and N is more than or equal to j;
(B2) From the subregion R i,j Mixed in their 2D neighbourhoods, for the sub-area of row j, a random vector number sequence q of length N is generated j Wherein, the ith element q j,i =i+r,
Figure BDA0002505370180000091
Is a random variable subject to uniform distribution, and k is an adjustable parameter defining a neighborhood range (k is more than or equal to 1 and less than N);
(B3) By counting the sequence q of random vectors j Reordering to obtain a new arrangement of jth row area
Figure BDA0002505370180000092
Thus, the region coordinates in the original image are converted from (i, j) to sigma (i, j) to reconstruct the image;
at the same time, it can be verified that:
Figure BDA0002505370180000101
similarly, we can use it in the column
Figure BDA0002505370180000102
To rearrange the regions while also verifying that:
Figure BDA0002505370180000103
thus, the region coordinates in the original image are converted from (i, j) to σ (i, j):
Figure BDA0002505370180000104
the scrambling method can destroy the global structure and simultaneously ensure that the local area randomly changes in the neighborhood with adjustable size, the original image I, the destroyed version phi (I) of the original image I and the one-vs-all label indicating the real fine category of the original image I are combined together during training<I,φ(I),l>. The classification network maps the input image to a probability distribution vector C (I, theta) cls ) Wherein θ cls Are learnable parameters that classify all layers in the network. Loss function L of a classification network cls Can be written as:
Figure BDA0002505370180000105
where Γ is all training sets;
step (C), a noise mode and a region reconstruction loss mode are introduced to enable the reconstructed image to solve the antagonism loss and the region alignment loss to obtain a fine identification image,
(C1) Introducing a noise mode to make the reconstructed image solve the antagonism loss
Original image I, use
Figure BDA0002505370180000111
A k characteristic diagram showing the m layer, which is used for visualizing the characteristics of the backbone classification network ResNet-50 and comprises two conditions of using and not using antagonism loss, and taking the output characteristics of the layer in front of the last fully-connected layer to carry out antagonism learning, wherein the k convolution core of the m convolution layer responds to the real category c,
Figure BDA0002505370180000112
wherein,
Figure BDA0002505370180000113
is the weight between the kth feature map and the corresponding class c, i.e. the response r k (I, c) is equal to the weight of the characteristic graph corresponding to the kth convolution kernel multiplied by the weight of the c corresponding to the full connection layer, so that whether the input image can be mapped to the c by the convolution kernel is measured, and the higher the response is, the higher the reliability of the mapping is; to minimize the loss, neither rough features of the overall contour nor edge-type noise patterns can be learned, but features common to both are learned, thus enhancing local detail with discriminability and filtering out irrelevant features;
(C2) Lead toEntering a region reconstruction loss mode, so that the reconstructed image solves the region alignment loss (C21), and the region R located at the position (I, j) in the original image I is given the original image I and the corresponding reconstructed image phi (I) i,j With the region R in the reconstructed image phi (I) σ(i,j) Consistency;
(C22) The area alignment network is the output characteristic diagram of the nth convolution layer of the classification network
Figure BDA0002505370180000114
Performing convolution processing on the operation characteristic diagram to obtain output of only two channels;
(C23) The output is processed by a ReLU linear rectification function and average pooling to obtain a feature map with the size of 2 × N, and the output of the area alignment network can be written as:
Figure BDA0002505370180000121
the two channels in M (I) respectively correspond to the position coordinates of a row and a column, h is an area alignment network, and theta is loc Is a parameter of the area alignment network, namely, each spatial position point of the output characteristic graph of the two channels predicts an area position, each spatial position point has two values to respectively predict the horizontal and vertical coordinates of the area, a total of N multiplied by N sub-areas are recorded in the area R σ(i,j) The predicted position is M σ(i,j) (phi (I)) for the region R i,j Is M i,j (I, j), the true values for both predicted positions are (I, j);
(C24) Calculating the region alignment loss L loc Defined as L of the predicted coordinates and the original coordinates 1 Distance:
Figure BDA0002505370180000122
(C25) According to the region alignment penalty L loc Enabling the reconstructed image to solve the problem of region alignment loss;
step (D) of classifying the intensity of the damaged reconstructed image from weak to strong by a knowledge classifier of a cascade learning expert based on a neural network to form the knowledge classifier, comprising the steps of,
(D1) Training each sample in the data and giving a weight to the sample;
(D2) Training a weak classifier on the training data and calculating its error rate, then updating the weight again on the unified data set
Figure BDA0002505370180000123
Training weak classifier, paired->
Figure BDA0002505370180000124
The sample weight is reduced, the error weight is improved, and the error rate is as follows:
weight value:
increasing the weight value:
Figure BDA0002505370180000131
reducing the weight value:
Figure BDA0002505370180000132
(D3) According to the parameters, a vector D is obtained through calculation according to the process shown in FIG. 2, the steps (D1) - (D2) are repeated to enter the next iteration, and the training and the weight adjustment are repeated continuously until the training error rate is 0 or the target value is reached, so that a strong classifier is formed.
And (E) identifying and judging the fine identification image obtained in the step (C) according to the accuracy required in advance by comparing the fine identification image with a knowledge classifier, and if the weak classification stage is not executed, the strong classification stage is not started by comparing the fine identification image with the knowledge classifier.
According to the image fine recognition method based on the DCL and the Cascade, the judgment area is automatically positioned in the DCL, so that additional marking is not needed during training; the DCL structure is only adopted in a training stage, so that the calculation cost is not introduced during prediction, the global structure is abandoned to keep local details, the network can be forced to focus on local areas with discriminability, after the local areas are disturbed, irrelevant areas which are not important for fine recognition can be ignored, and the network is forced to classify the images based on the discriminability local details.
In summary, in the image fine recognition method based on the DCL and the Cascade of the present invention, the DCL branch is introduced to automatically learn the discriminant region, firstly, the input image is destroyed to emphasize the discriminant local details, and then the semantic correlation between the local regions is modeled to reconstruct the image; on one hand, the DCL automatically positions the discrimination area, so that additional marking is not needed during training; on the other hand, the DCL structure is only used in the training phase, so that no computational overhead is introduced in prediction, and for "breaking", a regional obfuscation mechanism is used to divide the input image into local tiles and then randomly obfuscate. Local details play a more important role in fine recognition than global structures, since images from different fine classes often have the same global structure or shape, differing only in local details. Leaving up the global structure to maintain local details may force the network to focus on local areas that are discriminative. After being scrambled, irrelevant areas that are not important for fine recognition will be ignored and the network will be forced to classify the image based on local details of discriminability. Although recognition becomes more difficult, the expert can still easily find the differences. The neural network classifies and identifies the damaged images through a knowledge classifier (from weak to strong) of a cascade learning expert, and in order to prevent noise patterns introduced by network learning damage of a global structure, antagonism loss is used for restraining the introduced noise patterns, so that the neural network has a good application prospect.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (6)

1. The fine image recognition method based on DCL and Cascade is characterized by comprising the following steps: comprises the following steps of (a) carrying out,
after an original image is generated, carrying out initial analysis on data of the original image, wherein the original image is a high-definition image;
based on a DCL image destruction algorithm, destroying an original image after initial analysis to emphasize details of discriminant local regions, modeling semantic correlation between the local regions to reconstruct the image, and enabling a network to classify the reconstructed image based on the discriminant local details;
step (C), a noise mode and a region reconstruction loss mode are introduced, so that the reconstructed image solves the antagonism loss and the region alignment loss, and a fine identification image is obtained;
classifying the strength of the damaged reconstructed image from weak to strong through a knowledge classifier of a cascade learning expert on the basis of a neural network to form a knowledge classifier;
and (E) identifying and judging the fine identification image obtained in the step (C) according to the accuracy required in advance by comparing the fine identification image with the knowledge classifier.
2. The fine image recognition method based on DCL and Cascade according to claim 1, wherein: step (B), based on DCL image destruction algorithm, destroying the original image after initial analysis to emphasize the details of local regions with discriminant, modeling semantic correlation between the local regions to reconstruct the image, and making the network classify the reconstructed image based on the local details with discriminant, including the following steps,
(B1) Given an original image I, the image is first uniformly divided into N sub-regions, each sub-region defined by R i,j Representing, wherein i and j are respectively horizontal and vertical indexes, i is more than or equal to 1, and N is more than or equal to j;
(B2) From the subregion R i,j Mixed in their 2D neighbourhoods, for the sub-area of row j, a random vector number sequence q of length N is generated j Wherein the ith element q j,i =i+r,
Figure QLYQS_1
Is a random variable subject to uniform distribution, k is an adjustable parameter defining a neighborhood range, and k is more than or equal to 1 and is less than N;
(B3) By counting the sequence q of random vectors j Reordering to obtain a new arrangement of jth row area
Figure QLYQS_2
This transforms the region coordinates in the original image from (i, j) to σ (i, j) to reconstruct the image.
3. The fine image recognition method based on DCL and Cascade according to claim 1, wherein: step (C), introducing a noise mode and a region reconstruction loss mode to enable the reconstructed image to solve the antagonism loss and the region alignment loss to obtain a fine identification image, comprising the following steps,
(C1) Introducing a noise pattern to allow the reconstructed image to account for the loss of antagonism
Original image I, use
Figure QLYQS_3
A k characteristic diagram showing the m layer, which is used for visualizing the characteristics of the backbone classification network ResNet-50 and comprises two conditions of using and not using antagonism loss, and taking the output characteristics of the layer in front of the last fully-connected layer to carry out antagonism learning, wherein the k convolution core of the m convolution layer responds to the real category c,
Figure QLYQS_4
wherein,
Figure QLYQS_5
is the weight between the kth feature map and the corresponding class c, i.e. the response r k (I, c) is equal to the weight of the characteristic graph corresponding to the kth convolution kernel multiplied by the weight of the c corresponding to the full connection layer, so as to measure whether the convolution kernel can map the input image to the c, and the larger the response isIndicating a higher confidence in the mapping.
4. The fine image recognition method based on DCL and Cascade according to claim 2, wherein: step (C), introducing a noise mode and a region reconstruction loss mode to enable the reconstructed image to solve the antagonism loss and the region alignment loss to obtain a fine identification image, comprising the following steps,
(C2) Introducing a region reconstruction loss mode to enable the reconstructed image to solve the region alignment loss
(C21) Given an original image I and a corresponding reconstructed image phi (I), a region R at a position (I, j) in the original image I i,j With the region R in the reconstructed image phi (I) σ(i,j) The consistency is achieved;
(C22) The area alignment network is the output characteristic winter of the nth convolution layer of the classification network
Figure QLYQS_6
Performing I multiplied by I convolution processing on the operation characteristic diagram to obtain output with only two channels;
(C23) And obtaining a characteristic diagram with the size of 2 multiplied by N by the output through a ReLU linear rectification function and average pooling, wherein the output of the area alignment network is written as follows:
Figure QLYQS_7
the two channels in M (I) are respectively corresponding to row and column position coordinates, h is an area alignment network, and theta loc Is the parameter of the area alignment network, namely, each spatial position point of the output characteristic graph of the two channels predicts an area position, each spatial position point has two values to respectively predict the horizontal and vertical coordinates of the area, and has N multiplied by N sub-areas in total, and the area R is recorded σ(i,j) The predicted position is M σ(i,j) (phi (I)) is different for each region i,j Is M i,j (I, j), the true values for both predicted positions are (I, j);
(C24) Calculating the region alignment loss L loc Defined as L of the predicted coordinates and the original coordinates 1 Distance:
Figure QLYQS_8
(C25) According to the region alignment penalty L loc And the reconstructed image is enabled to solve the problem of region alignment loss.
5. The DCL and Cascade-based image fine recognition method according to claim 1, wherein: step (D) of classifying the intensity of the damaged reconstructed image from weak to strong by a knowledge classifier of a cascade learning expert based on a neural network to form the knowledge classifier, comprising the steps of,
(D1) Training each sample in the data and giving a weight to the sample;
(D2) Training a weak classifier on the training data and calculating its error rate, then updating the weight again on the unified data set
Figure QLYQS_9
Training weak classifier, paired->
Figure QLYQS_10
The sample weight is reduced, the error weight is improved, and the error rate is as follows:
weight value:
increasing the weight value:
Figure QLYQS_11
reducing the weight value:
Figure QLYQS_12
(D3) And calculating to obtain a vector D through the parameters, repeating the steps (D1) - (D2) to enter the next iteration, and continuously repeating the training and adjusting the weight until the training error rate is 0 or the target value is reached to form the strong classifier.
6. The fine image recognition method based on DCL and Cascade according to claim 1, wherein: and (E) comparing the weak classification stage with the knowledge classifier, and if the weak classification stage is not executed, not entering the strong classification stage.
CN202010444726.XA 2020-05-23 2020-05-23 Image fine recognition method based on DCL and Cascade Active CN111612076B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010444726.XA CN111612076B (en) 2020-05-23 2020-05-23 Image fine recognition method based on DCL and Cascade

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010444726.XA CN111612076B (en) 2020-05-23 2020-05-23 Image fine recognition method based on DCL and Cascade

Publications (2)

Publication Number Publication Date
CN111612076A CN111612076A (en) 2020-09-01
CN111612076B true CN111612076B (en) 2023-04-18

Family

ID=72202381

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010444726.XA Active CN111612076B (en) 2020-05-23 2020-05-23 Image fine recognition method based on DCL and Cascade

Country Status (1)

Country Link
CN (1) CN111612076B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113159171B (en) * 2021-04-20 2022-07-22 复旦大学 Plant leaf image fine classification method based on counterstudy
CN116129426A (en) * 2022-11-24 2023-05-16 广东劢智医疗科技有限公司 Fine granularity classification method for cervical cell smear 18 category

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108037133A (en) * 2017-12-27 2018-05-15 武汉市智勤创亿信息技术股份有限公司 A kind of power equipments defect intelligent identification Method and its system based on unmanned plane inspection image
CN111178432A (en) * 2019-12-30 2020-05-19 武汉科技大学 Weak supervision fine-grained image classification method of multi-branch neural network model

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10733334B2 (en) * 2017-02-22 2020-08-04 Middle Chart, LLC Building vital conditions monitoring

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108037133A (en) * 2017-12-27 2018-05-15 武汉市智勤创亿信息技术股份有限公司 A kind of power equipments defect intelligent identification Method and its system based on unmanned plane inspection image
CN111178432A (en) * 2019-12-30 2020-05-19 武汉科技大学 Weak supervision fine-grained image classification method of multi-branch neural network model

Also Published As

Publication number Publication date
CN111612076A (en) 2020-09-01

Similar Documents

Publication Publication Date Title
Dong et al. Deep metric learning-based for multi-target few-shot pavement distress classification
CN107609601B (en) Ship target identification method based on multilayer convolutional neural network
CN107133974B (en) Gaussian Background models the vehicle type classification method combined with Recognition with Recurrent Neural Network
CN103605972B (en) Non-restricted environment face verification method based on block depth neural network
CN110276248B (en) Facial expression recognition method based on sample weight distribution and deep learning
CN110619369A (en) Fine-grained image classification method based on feature pyramid and global average pooling
CN106778854A (en) Activity recognition method based on track and convolutional neural networks feature extraction
Badawi et al. A hybrid memetic algorithm (genetic algorithm and great deluge local search) with back-propagation classifier for fish recognition
CN102054176B (en) Method used for establishing semantic scene models for scene images of moving targets by utilizing computer
Chuang et al. Deep-learning based joint iris and sclera recognition with yolo network for identity identification
CN107133569A (en) The many granularity mask methods of monitor video based on extensive Multi-label learning
CN115170805A (en) Image segmentation method combining super-pixel and multi-scale hierarchical feature recognition
CN105139004A (en) Face expression identification method based on video sequences
CN105825212A (en) Distributed license plate recognition method based on Hadoop
CN111612076B (en) Image fine recognition method based on DCL and Cascade
CN112990282B (en) Classification method and device for fine-granularity small sample images
CN106127228A (en) Remote sensing image ship detection candidate area identification method based on decision template classifier fusion
CN103035239B (en) Speaker recognition method based on partial learning
CN112686242B (en) Fine-grained image classification method based on multilayer focusing attention network
CN114549834A (en) Semi-supervised hybrid training semantic segmentation method and system
Anwer et al. Accident vehicle types classification: a comparative study between different deep learning models
CN111310820A (en) Foundation meteorological cloud chart classification method based on cross validation depth CNN feature integration
Liu et al. Remote sensing image vehicle detection based on pre-training and random-initialized fusion network
Dorrani Traffic Scene Analysis and Classification using Deep Learning
CN112613474A (en) Pedestrian re-identification method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: A Fine Image Recognition Method Based on DCL and Cascade

Granted publication date: 20230418

Pledgee: Bank of China Limited by Share Ltd. Yangjiang branch

Pledgor: NANJING UNIVERSITY SMARTERCITY PROGRAM DESIGN Co.,Ltd.

Registration number: Y2024980001801