CN109816022A

CN109816022A - A kind of image-recognizing method based on three decisions and CNN

Info

Publication number: CN109816022A
Application number: CN201910084149.5A
Authority: CN
Inventors: 邵帅; 梁星; 余静; 贾敦新; 张泽烈; 杨航; 赵翔宇; 余洋; 程宇翔; 曾诚
Original assignee: CHONGQING GEOGRAPHICAL INFORMATION CENTER
Current assignee: CHONGQING GEOGRAPHICAL INFORMATION CENTER
Priority date: 2019-01-29
Filing date: 2019-01-29
Publication date: 2019-05-28

Abstract

The invention discloses a kind of image-recognizing methods based on three decisions and CNN, including input sample data images training CNN classifier；Three decision thoughts are introduced in the image segmentation stage, the forward data collection image classified using CNN classifier and negative sense data images obtain three Decision Classfication devices；Positive region, negative region and Delayed Decision region are divided the image into using three Decision Classfication devices；Classification processing is iterated to the Delayed Decision region that the classifier is partitioned into；Remaining Delayed Decision region is judged, if reach critical value；When Delayed Decision region reaches critical value, illustrate that images to be recognized can not be divided again, CNN image recognition then is carried out to all positive regions；The present invention combines CNN image recognition technology and three decision theories, can make full use of useful information in image, makes to reach higher discrimination.

Description

A kind of image-recognizing method based on three decisions and CNN

Technical field

The present invention relates to Intelligent Information Processing and high-performance image identification technology field, and in particular to one kind is based on three The image-recognizing method of decision and CNN.

Background technique

Image recognition is allowed to handle image using computer, analyzed and understood, to identify different target and right The technology of elephant.The bottom semanteme of the existing eventful surplus image of image-recognizing method is studied, due to computer self performance The problem of, can only sequencing the single operational order of execution, cause conventional method low to the discrimination of image.Therefore, someone It proposes based on three decisions (Three-way Decision, abbreviation 3WD), convolutional neural networks (Convolutional Neural Network, CNN) the methods of image recognition technology.

Wherein, three decisions have been developed in recent years a kind of method for handling unascertained decision, are a kind of compound " three points and control " (Trisecting and Acting) model of human cognitive.Three decisions are a kind of to be recognized based on meeting the mankind The decision-making mode known, it considers that: for people in practice decision process, the things accepted or rejected for tool beyond reasonable doubt can Quick judgement is made immediately；For the things which cannot make a policy immediately, people often postpone the judgement to event, That is Delayed Decision.In more practice decision process, there are the complexity of policy setting, the incompleteness of acquisition of information, groups Each expert opinion inconsistency in decision, the uncertain conditions such as the finiteness and thinking ambiguity of the had knowledge of policymaker, certainly Plan person is in most cases difficult accurately to provide the specific value of loss function, also results in its accuracy of identification and often reaches not To necessary requirement.

In addition, convolutional neural networks are to grow up in recent years and a kind of widely used efficient neural network machine Learning method.The main advantage of CNN can directly input original image in avoiding image pretreated complex process early period. In general, CNN basic structure includes two layers: feature extraction layer and Feature Mapping layer.Its conventional machines learning method it is main Difference is that CNN includes a feature extractor being made of a convolutional layer and sub-sampling layer.Sub-sampling is also referred to as pond (Pooling), usually there are two kinds of forms of mean value sub-sampling (Mean Pooling) and maximum value sub-sampling (Max Pooling). Sub-sampling is considered as a kind of special convolution process.Convolution sum sub-sampling not only simplifies model complexity, but also reduces The parameter of model.But the treatment process of convolutional neural networks identification is single, inflexible, the characteristic value needed compares tool Body causes the feature obtained very fixed；In addition, convolution process is easy to cause the missing of some useful informations in image, so that figure The accuracy of identification of picture is not satisfactory.

Therefore, how three decision-making techniques to be combined with CNN image-recognizing method, realizes that more preferably image is known Other technology becomes urgent problem to be solved.

Summary of the invention

In view of the deficiencies of the prior art, the object of the present invention is to provide a kind of image recognitions based on three decisions and CNN Method, by the way that three decision theories are introduced into CNN image recognition, and the conceptual design classifier based on three decisions, it builds Formwork erection type carries out image recognition, can be improved image recognition accuracy rate.

In order to achieve the above objectives, The technical solution adopted by the invention is as follows:

A kind of image-recognizing method based on three decisions and CNN, key be the following steps are included:

S1, input images to be recognized；

S2, CNN classifier to be trained is trained using sample image, and has been trained CNN classifier, forward data collection figure Picture and negative sense data images, the training process of the CNN classifier to be trained are as follows:

S201, CNN classifier to be trained is established, and sets the initial parameter value of filter in CNN classifier to be trained；

S202, input sample image, respectively pre-process each image, and calculate in each image positive area with Output probability corresponding to negative area；

S203, the overall error for calculating separately output layer；

S204, gradient of the error relative to all weights is calculated using back-propagation algorithm, and updated with gradient descent method The weight and parameter value of all filters, so that output error minimizes；

S205, setting value of the smallest parameter value of output error as filter is selected, CNN classifier has been trained in acquisition；

S3, three Decision Classfication devices are obtained using the forward data collection image and negative sense data images；

S4, according to the class condition of three Decision Classfication devices, images to be recognized is divided into positive region, negative region, is prolonged Slow decision region；

S5, using having trained CNN classifier to carry out image recognition processing to the positive region that splits；

S6, output image recognition result.

Further, described to obtain the tool of three Decision Classfication devices using forward data collection image and negative sense data images Steps are as follows for body:

S301, the input forward data collection image and negative sense data images；

S302, according to the attribute of three decisions, forward data collection image and negative sense data images are respectively trained, It include: by forward data collection image or negative sense data images respectively according to forward data collection image or negative sense data images Characteristics of image is split, and the characteristics of image attribute value of forward data collection image or negative sense data images is obtained, according to forward direction Attribute value λ needed for the characteristics of image attribute value of data images obtains the training of forward data collection image_αp1、λ_αn1、λ_βp1、λ_βn1、 λ_ξp1、λ_ξn1, attribute value needed for obtaining the training of negative sense data images according to the characteristics of image attribute value of negative sense data images λ_αp2、λ_αn2、λ_βp2、λ_βn2、λ_ξp2、λ_ξn2；Wherein, λ_αp1、λ_αp2It respectively indicates and meets forward data collection image, negative sense data images Decision condition under take the cost of acceptance decision, λ_αn1、λ_αn2It respectively indicates and is unsatisfactory for forward data collection image, negative sense data set The cost of acceptance decision, λ are taken under the decision condition of image_βp1、λ_βp2It respectively indicates and meets forward data collection image, negative sense data Collect the cost that refusal decision is taken under the decision condition of image, λ_βn1、λ_βn2It respectively indicates and is unsatisfactory for forward data collection image, negative sense The cost for taking refusal decision under the decision condition of data images, λ_ξp1、λ_ξp2Respectively indicate meet forward data collection image, The cost for not promising to undertake decision, λ are taken under the decision condition of negative sense data images_ξn1、λ_ξn2It respectively indicates and is unsatisfactory for forward data Collect and takes the cost for not promising to undertake decision under the decision condition of image, negative sense data images；

S303, using the decision problem cost matrix of three decisions, calculate separately out the positive region of forward data collection image Optimal value α₁With negative region optimal value β₁And negative sense data images positive region optimal value α₂With negative region optimal value β₂:

S304, the positive region optimal value α according to forward data collection image₁With negative region optimal value β₁, obtain forward data collection The Delayed Decision regional value ξ of image₁: β₁< ξ₁< α₁；According to negative sense data images positive region optimal value α₂It is optimal with negative region Value β₂, obtain the Delayed Decision regional value ξ of forward data collection image₂: β₂< ξ₂< α₂；

S305, according to the α₁、α₂、β₁、β₂、ξ₂And ξ₂, respectively obtain three Decision Classfication devices of forward data collection image With three Decision Classfication devices of negative sense data images.

Further, the characteristics of image of the forward data collection image or negative sense data images include image outline, it is bright One of degree, color, gray scale are a variety of.

Further, according to the class condition of three Decision Classfication devices, the images to be recognized is divided into three areas Domain includes: that forward data collection image and negative sense data images are quantified as the region that size is 1, is 1 by the size The positive region optimal value and negative region optimal value of the forward data collection image that region is classified according to three decision-making devices or The positive region optimal value and negative region optimal value of negative sense data images are respectively divided into three regions, according to risk function, selection The decision of least risk, obtains evaluation of risk；According to the evaluation of risk, the images to be recognized is taken into three areas The decision in some region in domain；

Wherein, three regions specifically: [α₁,1]、[α₂, 1] and it is respectively forward data collection image, negative sense data set figure The positive region of picture, [0, β₁]、[0,β₂] be respectively forward data collection image, negative sense data images negative region, (α₁,β₁)、 (α₂,β₂) be respectively forward data collection image, negative sense data images Delayed Decision region；Wherein, α₁、α₂Indicate positive number According to collection image, the optimal value of negative sense data images, β₁、β₂Indicate the negative region of forward data collection image, negative sense data images Optimal value；Corresponding three decisions, use α₁、α₂It respectively indicates and takes receiving to determine forward data collection image, negative sense data images Plan, β₁、β₂It respectively indicates and refusal decision, ξ is taken to forward data collection image, negative sense data images₁、ξ₂It respectively indicates to just Delayed Decision is taken to data images, negative sense data images；

The evaluation of risk includes:

Receive risk:

R(α₁| y)=λ_αp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(α₂| y)=λ_αp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

Refuse risk:

R(β₁| y)=λ_βp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(β₂| y)=λ_βp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

Delayed Decision risk:

R(ξ₁| y)=λ_ξp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(ξ₂| y)=λ_ξp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

According to the evaluation of risk, the images to be recognized takes determining for some region into three regions Plan includes:

When meeting condition R (α₁|y)≤R(r₁|y)∧R(α₁|y)≤R(n₁| when y), images to be recognized is taken to positive number According to the decision of the receiving of collection image；

When meeting condition R (α₂|y)≤R(r₂|y)∧R(α₂|y)≤R(n₂| when y), images to be recognized is taken to negative sense number According to the decision of the receiving of collection image；

When meeting condition R (r₁|y)≤R(α₁|y)∧R(r₁|y)≤R(n₁| when y), images to be recognized is taken to positive number According to the decision of the refusal of collection image；

When meeting condition R (r₂|y)≤R(α₂|y)∧R(r₂|y)≤R(n₂| when y), images to be recognized is taken to negative sense number According to the decision of the refusal of collection image；

When meeting condition R (n₁|y)≤R(α₁|y)∧R(n₁|y)≤R(r₁| when y), images to be recognized is taken to positive number According to the decision that do not promise to undertake of collection image；

When meeting condition R (n₂|y)≤R(α₂|y)∧R(n₂|y)≤R(r₂| when y), images to be recognized is taken to negative sense number According to the decision that do not promise to undertake of collection image；

Wherein, evaluation function is defined as P_r(Y | [y]), risk function are R (Δ | y), wherein Y indicates to act y decision, Δ indicates to act the decision of y, and y indicates to determine, R (α₁|y)、R(α₂| y) indicate to forward data collection image, negative sense data set Receive the risk function of state, R (β in image decision process₁|y)、R(β₂| y) indicate to forward data collection image, negative sense number According to the risk function of disarmed state in collection image decision process, R (ξ₁|y)、R(ξ₂| it y) indicates to forward data collection image, negative The risk function of Delayed Decision state into data images decision process；P_β1(Y|[y])、P_β2(Y | [y]) indicate to forward direction The evaluation function of disarmed state in data images, negative sense data images decision process；Risk function R (r₁|y)、R(r₂|y) Indicate that forward data collection image, negative sense data images decision act r_iRisk mathematic expectaion, risk function R (n₁|y)、R (n₂| y) indicate forward data collection image, in negative sense data images the risk of decision movement n mathematic expectaion, ∧ indicates simultaneously Meet.

Further, described according to the evaluation of risk, the images to be recognized is taken into three regions The decision in some region further include:

For forward data collection image:

Ensure

WhenImages to be recognized takes the decision of the receiving to forward data collection image；

WhenImages to be recognized takes the decision of the refusal to forward data collection image；

WhenImages to be recognized takes the decision that do not promise to undertake to forward data collection image；

Wherein,It indicates acting r to the decision in forward data collection image decision process₁Risk Evaluation function, λ_αp1Expression meets the cost that acceptance decision is taken under decision condition, λ_αn1Foot forward data collection figure with thumb down The cost of acceptance decision, λ are taken under the decision condition of picture_βp1It indicates to meet to take under the decision condition of forward data collection image and refuse The cost of exhausted decision, λ_βn1The cost for taking refusal decision under the decision condition of foot forward data collection image with thumb down, λ_ξp1 It indicates to meet under the decision condition of forward data collection image and takes the cost for not promising to undertake decision, λ_ξn1Foot forward data with thumb down Collect and takes the cost for not promising to undertake decision under the decision condition of image；

For negative sense data images:

Ensure

WhenImages to be recognized takes the decision of the receiving to negative sense data images；

WhenImages to be recognized takes the decision of the refusal to negative sense data images；

WhenImages to be recognized takes the decision that do not promise to undertake to negative sense data images；

Wherein,It indicates acting r to the decision in negative sense data images decision process₂Risk Evaluation function, λ_αp2Expression meets the cost that acceptance decision is taken under decision condition, λ_αn2Foot negative sense data set figure with thumb down The cost of acceptance decision, λ are taken under the decision condition of picture_βp2It indicates to meet to take under the decision conditions of negative sense data images and refuse The cost of exhausted decision, λ_βn2The cost for taking refusal decision under the decision condition of foot negative sense data images with thumb down, λ_ξp2 It indicates to meet under the decision conditions of negative sense data images and takes the cost for not promising to undertake decision, λ_ξn2Foot negative sense data with thumb down Collect and takes the cost for not promising to undertake decision under the decision condition of image.

Further, the image to be identified is divided into three by the class condition according to three Decision Classfication devices Region further include: introduce a pair of of threshold value (α_i,β_i), then set U is divided for following three regions:

To forward data collection image:

Positive region: POS (α₁,β₁)={ u ∈ ∪ | v (u) >=α₁}；

Negative region: NEG (α₁,β₁)={ u ∈ ∪ | v (u)≤β₁}；

Boundary Region:

Three decision rules can be constructed by above three region: positive region is corresponding to be received, the corresponding refusal of negative region, side Boundary domain corresponds to Delayed Decision, and Boundary Region is also Delayed Decision region, wherein and u ∈ ∪ | v (u) >=α₁Indicate to meet v (u) >=α₁ And when u ∈ ∪, the value of image u；{u∈∪|v(u)≤β₁Indicate to meet v (u)≤β₁And when u ∈ ∪, the value of image u；It indicates to meetAnd when u ∈ ∪, the value of image u；Corresponding ordering relation ≤,Show stringent ordering relation；That is:And if only if β₁≤v(u)≤α₁And β₁≠v(u)≠α₁；α₁Table Show and acceptance decision, β are taken to forward data collection image₁It indicates to take refusal decision, ξ to forward data collection image₁Expression pair Decision is not promised to undertake in taking for forward data collection image；U indicates that forward data image, v (u) indicate to quantify forward data image u Value afterwards；

To negative sense data images:

Positive region: POS (α₂,β₂)={ u ∈ ∪ | v (u) >=α₂}；

Negative region: NEG (α₂,β₂)={ u ∈ ∪ | v (u)≤β₂}；

Boundary Region:

Three decision rules can be constructed by above three region: positive region is corresponding to be received, the corresponding refusal of negative region, side Boundary domain corresponds to Delayed Decision, and Boundary Region is also Delayed Decision region, wherein and u ∈ ∪ | v (u) >=α₂Indicate to meet v (u) >=α₂ And when u ∈ ∪ image u value；{u∈∪|v(u)≤β₂Indicate to meet v (u)≤β₂And when u ∈ ∪ image u value；It indicates to meetAnd when u ∈ ∪ image u value；Corresponding ordering relation ≤,Indicate stringent ordering relation；That is:And if only if β₂≤v(u)≤α₂And β₂≠v(u)≠α₂；α₂ Expression takes acceptance decision, β to negative sense data images₂It indicates to take refusal decision, ξ to negative sense data images₂It indicates Decision is not promised to undertake to taking for negative sense data images；U indicates that negative sense data image, v (u) are indicated to after negative sense data u quantization Value.

Further, the class condition according to three Decision Classfication devices, is divided into the image to be identified Behind three regions, to Delayed Decision region subseries again, comprising: using the positive region sorted out as new positive sample, The negative region sorted out is training condition as new negative sample, is trained again to three Decision Classfication devices, directly It can not divide again to Delayed Decision region part, reach critical value.

Further, the formula for having trained CNN classifier to carry out image recognition processing to the positive region split Are as follows:

Wherein, x_iIndicate input vector, i.e., R, G, B eigenmatrix of every positive region image, i ∈ (1,2,3)；W_iIt indicates Input vector x_iCorresponding weight, i.e. convolution kernel；B indicates biasing；h_W,bIt (x) is output characteristic value.

Further, the establishment process of the CNN classifier to be trained includes the sample number strong point according to images to be recognized Obtained three classes sample makes classifier.

Remarkable result of the invention is: for image recognition technology in the prior art adaptivity it is poor the problems such as, pass through The principle of three decision theories and CNN image recognition is analyzed, it is creative that three decision theories have been introduced into CNN figure In identifying, and the conceptual design based on three decisions establishes CNN classifier, and has carried out adaptive instruction to CNN classifier Practice, more accurate image feature information can be got, three Decision Classfication devices pair are generated according to training the data obtained collection image later Images to be recognized is iterated segmentation, is finally known using the CNN classifier after training to resulting positive region image is divided Other places reason, effectively raises the accurate rate of image recognition.

Detailed description of the invention

Fig. 1 is flow chart of the method for the present invention；

Fig. 2 is the training flow diagram of the CNN classifier to be trained；

Fig. 3 is the flow diagram of three Decision Classfication devices in the present invention.

Specific embodiment

Specific embodiment and working principle of the present invention will be described in further detail with reference to the accompanying drawing.

As shown in Figure 1, a kind of image-recognizing method based on three decisions and CNN, the specific steps are as follows:

S1, input images to be recognized；

S2, CNN classifier to be trained is trained using sample image, and has been trained CNN classifier, forward data collection figure Picture and negative sense data images；

As shown in Fig. 2, the training process of CNN classifier to be trained described in this example are as follows:

S203, the overall error for calculating separately output layer；

S3, three Decision Classfication devices are obtained using the forward data collection image and negative sense data images；Referring to attached drawing 3, described using forward data collection image and negative sense data images to obtain three Decision Classfication devices specific step is as follows:

S301, the input forward data collection image and negative sense data images；

1 two states decision problem cost matrix of table

S4, according to the class condition of three Decision Classfication devices, images to be recognized is divided into positive region, negative region, is prolonged Slow decision region, and iterate to Delayed Decision region and can not divide again；

Preferably, alternatively, according to the class condition of three Decision Classfication devices, described to be identified It includes: that forward data collection image and negative sense data images are quantified as the region that size is 1 that image, which is divided into three regions, will The positive region optimal value for the forward data collection image that the region that the size is 1 is classified according to three decision-making devices and negative The positive region optimal value and negative region optimal value of region optimal value or negative sense data images are respectively divided into three regions, according to wind Dangerous function, the smallest decision of risk of selection, obtains evaluation of risk；According to the evaluation of risk, the images to be recognized take into Enter the decision in some region in three regions；

The evaluation of risk includes:

Receive risk:

R(α₁| y)=λ_αp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(α₂| y)=λ_αp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

Refuse risk:

R(β₁| y)=λ_βp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(β₂| y)=λ_βp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

Delayed Decision risk:

R(ξ₁| y)=λ_ξp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(ξ₂| y)=λ_ξp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

In the specific implementation process, described according to the evaluation of risk, the images to be recognized is taken into described three The decision in some region in region further include:

For forward data collection image:

Ensure

For negative sense data images:

Ensure

To forward data collection image:

Positive region: POS (α₁,β₁)={ u ∈ ∪ | v (u) >=α₁}；

Negative region: NEG (α₁,β₁)={ u ∈ ∪ | v (u)≤β₁}；

Boundary Region:

Three decision rules can be constructed by above three region: positive region is corresponding to be received, the corresponding refusal of negative region, side Boundary domain corresponds to Delayed Decision, and Boundary Region is also Delayed Decision region, in which:

{u∈∪|v(u)≥α₁Indicate to meet v (u) >=α₁And when u ∈ ∪, the value of image u；

{u∈∪|v(u)≤β₁Indicate to meet v (u)≤β₁And when u ∈ ∪, the value of image u；

It indicates to meetAnd when u ∈ ∪, the value of image u；

Corresponding ordering relation≤,Indicate stringent ordering relation；That is:And if only if β₁≤v(u) ≤α₁And β₁≠v(u)≠α₁；α₁Expression takes acceptance decision, β to forward data collection image₁It indicates to forward data collection image Take refusal decision, ξ₁Expression does not promise to undertake decision to taking for forward data collection image；U indicates forward data image, v (u) It indicates to the value after forward data image u quantization；

To negative sense data images:

Positive region: POS (α₂,β₂)={ u ∈ ∪ | v (u) >=α₂}；

Negative region: NEG (α₂,β₂)={ u ∈ ∪ | v (u)≤β₂}；

Boundary Region:

Finally, the image to be identified, is divided into three by the class condition according to three Decision Classfication devices Behind region, to Delayed Decision region subseries again, comprising: using the positive region sorted out as new positive sample, divided The negative region that class goes out is training condition as new negative sample, is trained again to three Decision Classfication devices, Zhi Daosuo Stating Delayed Decision region part can not divide again, reach critical value.

S5, using CNN classifier has been trained, to the positive region progress image recognition processing split, detailed process is such as Under:

In the convolutional layer of convolutional neural networks, a neuron is only connect with part adjacent bed neuron.Usually CNN's It can include several characteristic planes (Feature Map), wherein being made of the neuron of some rectangular arrangeds in one convolutional layer Each characteristic plane, and the neuron of same characteristic plane shares weight, shared weight is exactly convolution kernel here.General convolution Core is initialized by the form of random decimal matrix, and then convolution kernel obtains study reasonably in the training process of network Weight.Reducing the most direct mode of connection between each layer of network is shared weight (convolution kernel), but reduces over-fitting simultaneously Risk.Sub-sampling is also referred to as pond (Pooling), usually there is mean value sub-sampling (Mean Pooling) and maximum value sub-sampling (Max Pooling) two kinds of forms.Sub-sampling is considered as a kind of special convolution process.Convolution sum sub-sampling not only simplifies Model complexity, and also reduce the parameter of model.

According to the theory of neural network, it is as follows to correspond to formula:

The CNN image recognition processes include:

Neural network model is multiple units to combine, and contain layered structure, wherein corresponding formula is as follows:

Wherein, x₁, x₂, x₃Indicate input vector；W indicates weight；B indicates biasing；h_W,bIt (x) is output； Indicate activation primitive, activate Neuron characteristics, feature is retained by function and is mapped out come.Numerous neurons receive a large amount of non- Linear shape inputs information, this information is known as input vector；Hidden layer is numerous neurons and link-group between input layer, output layer At every aspect；Output layer is expressed as information and transmits in neuron link, analyzes, weighs obtained output result.

Corresponding three decisions and CNN image recognition are nominal variables since CNN is suitable for dependent variable, and by three or three The case where a above classification composition, according to the process of three decisions, we take the state set to beIt respectively indicates The object belongs to C and is not belonging to C.Action collection A={ a_P,a_B,a_N, wherein a_P, a_B, a_NRespectively indicate can take when x classification three Kind is taken action, and determines x ∈ POS (C) respectively, x ∈ BND (C), generated risk under three kinds of states of x ∈ NEG (C), thus structure A loss matrix is made, as shown in table 2.

The loss matrix of 2 two states decision problem of table

In table, λ_PP, λ_BP, λ_NPIt respectively indicates as x ∈ C, take action a_P, a_B, a_NWhen loss；λ_PN, λ_BN, λ_NNPoint Biao Shi it not work asWhen, take action a_P, a_B, a_NWhen loss.If P_r(C | [x]) is the conditional probability of object x ∈ C, Object x will be described by its equivalence class [x] when calculating.For an object, expected loss R when three kinds of different action is taken (a_i| [x]) it may be expressed as:

R(a_P| [x])=λ_PPP_r(C|[x])+λ_PNP_r(C/[x])

R(a_B| [x])=λ_BPP_r(C|[x])+λ_BNP_r(C|[x])

R(a_N| [x])=λ_NPP_r(C|[x])+λ_NNP_r(C|[x])

S6, output image recognition result.

The characteristics of image of the forward data collection image or negative sense data images includes image outline, brightness, color, ash One of degree is a variety of.

Training block includes these sample data set images in a random order, but some trained blocks may be wrapped than other classification Containing more images.Training block clock is every a kind of comprising 5000 images.Raw data set is all monocular logo image, in order to verify three 4 images in original sample data set image are synthesized one by the accuracy rate of branch Decision Classfication device, the present invention, i.e., each survey Examination item has 4 targets to be identified.

Further, alternatively, 4000 picture conducts provided in sample data set image are chosen Image to be identified, wherein every class random selection picture 400 is opened, and experiment is 100 pictures every time, and final result is averaged. Statistical data can obtain table 3, and as shown in table 2, the image recognition based on three decisions can be achieved on the identification of multi-Target Image.

According to the data of table 3 it is found that method of the invention accuracy rate average in data set cifar-10 is 90.85%, average F value is 91.25%, it can thus be concluded that the image recognition based on three Decision Classfication devices can obtain higher identification Rate and accuracy rate, cifar-10 are that Canadian government is taken the lead one of investment advanced science project research collected one and is used for The data set of pervasive object identification.

Table 3 is based on three decision cifar-10 multi-targets recognition rates

Further, as another optional way, using Car detection on UIUC data set as to be identified Image: the multi-targets recognition of Car detection on UIUC data set is compared, in order to verify three Decision Classfication devices pair In the accuracy rate of multi-targets recognition, present invention uses data set Car detection on UIUC as a comparison.Wherein, Car Detection on UIUC data set is University of Illinois's automotive check image data base, wherein including 1050 training figures This 1050 training images are split according to characteristics of image, obtain positive sample 550, negative sample 500 by picture；There are two Test data set, one has 200 targets to be identified comprising 170 pictures, another include 108 pictures have 139 wait know Other target.

As shown in table 4, for UIUC data set, the identification target of characteristics of image compares zero in sample data set image It dissipating, is easier to ignore useful information, wherein CNN-3WD is method proposed by the invention, and FD is quick detection framework, Precision is accuracy rate, and Recall is recall rate, and F-Measure is F value, and all data are all figures in data set in table As data are averaged, analysis can obtain the image recognition accurate rate of the invention based on three decisions and CNN and be higher than quickly detection Frame FD.The method of the present invention discrimination reaches 99.41%, F value and is slightly above FD method for 99.26%.

Table 4 is compared based on UIUC discrimination

Further, as a kind of arbitrary way, using KITTI data set to the image of the invention based on three decisions Recognition methods is verified.Wherein, KITTI data set is by the Karlsruhe, Germany Institute of Technology and American Institute of Technology, Toyota Joint is established, and is the computer vision algorithms make evaluation and test data set under current automatic Pilot scene maximum in the world.The evaluation and test number Real image data according to collection comprising the scenes acquisition such as urban district, rural area and highway, in every image for up to 15 vehicles and 30 pedestrians are blocked and are truncated there are also various degrees of.

Wherein, CS-AdaBoost is that Masnadishirazi H is proposed in paper Cost-sensitive boosting Method, AdaBoost+LDA be Wu J, Brubaker S C, Mullin M D, et al is in paper Fast Asymmetric The method proposed in Learning for Cascade Face Detection, AdaBoost are Viola P, and Jones M is being discussed The method proposed in literary Robust real-time face detection, Pruning are that Paisitkriangkrai S is being discussed The method proposed in literary Asymmetric Pruning for Learning Cascade Detectors, FD are Hu Q, Paisitkriangkrai S, Shen C, et al are in paper Fast Detection of Multiple Objects in The method proposed in Traffic Scenes With a Common Detection Framework；

KITTI data set is divided into three classifications, is vehicle, pedestrian and bicycle respectively, but due to pedestrian and bicycle sample There is the case where interfering with each other in this image itself, and its criterion is not known yet, so the present invention uses the image of vehicle Tested and compared, the method made comparisons be also use image in information of vehicles as object of experiment.

As shown in Table 5, information of vehicles in data set KITTI is identified, the discrimination of 3WD method is 89.27%, F value It is 88.86%, is higher than FD due to the particularity of the data set and only has trained the image of vehicle, acquired results and its other party herein The information of vehicles of method compares, and as shown in table 5, the method applied in the present invention has higher accuracy rate and F compared to other methods Value.

Table 5 is compared based on KITTI data set identify rate

Wherein, mBoW is Behley J in paper Laser-based segment classification using a The method proposed in mixture of bag-of-words, MDPM are Forsyth D in paper Object Detection The method proposed in Discriminatively Trained Part-Based Models, DPM-C8B1 are Yebes J J is in paper Supervised learning and evaluation of KITTI's cars detector with DPM The method of proposition, OC-DPM are Pepikj B in paper Occlusion Patterns for Object Class The method proposed in Detection, AOG are Li B, and Wu T, Zhu S C is in paper Integrating Context and The method proposed in Occlusion for Car Detection by Hierarchical And-Or Model, SubCat are Ohn-Bar E is in paper Fast and Robust Object Detection Using Visual Subcategories The method of proposition, Regionlets are Wang X in paper Regionlets for Generic Object Detection The method of proposition.

Technical solution provided by the present invention is described in detail above.Specific case used herein is to this hair Bright principle and embodiment is expounded, method of the invention that the above embodiments are only used to help understand and its Core concept.It should be pointed out that for those skilled in the art, in the premise for not departing from the principle of the invention Under, it can be with several improvements and modifications are made to the present invention, these improvement and modification also fall into the protection of the claims in the present invention In range.

Claims

1. a kind of image-recognizing method based on three decisions and CNN, which comprises the following steps:

S1, input images to be recognized；

S2, using sample image training CNN classifier train, and trained CNN classifier, forward data collection image and Negative sense data images, the training process of the CNN classifier to be trained are as follows:

S202, input sample image, respectively pre-process each image, and calculate positive area and negative sense in each image Output probability corresponding to region；

S203, the overall error for calculating separately output layer；

S204, gradient of the error relative to all weights is calculated using back-propagation algorithm, and updated and owned with gradient descent method Filter weight and parameter value so that output error minimize；

S4, according to the class condition of three Decision Classfication devices, images to be recognized is divided into positive region, negative region, delay are determined Plan region；

S6, output image recognition result.

2. the image-recognizing method according to claim 1 based on three decisions and CNN, it is characterised in that: the utilization Forward data collection image and negative sense data images obtain three Decision Classfication devices, and specific step is as follows:

S301, the input forward data collection image and negative sense data images；

S302, according to the attribute of three decisions, forward data collection image and negative sense data images are respectively trained, wrap It includes: by forward data collection image or negative sense data images respectively according to forward data collection image or the figure of negative sense data images As feature is split, the characteristics of image attribute value of forward data collection image or negative sense data images is obtained, according to positive number Attribute value λ needed for obtaining the training of forward data collection image according to the characteristics of image attribute value of collection image_αp1、λ_αn1、λ_βp1、λ_βn1、λ_ξp1、 λ_ξn1, attribute value λ needed for obtaining the training of negative sense data images according to the characteristics of image attribute value of negative sense data images_αp2、 λ_αn2、λ_βp2、λ_βn2、λ_ξp2、λ_ξn2；Wherein, λ_αp1、λ_αp2Respectively indicate meet forward data collection image, negative sense data images certainly The cost of acceptance decision, λ are taken under the conditions of plan_αn1、λ_αn2It respectively indicates and is unsatisfactory for forward data collection image, negative sense data images Decision condition under take the cost of acceptance decision, λ_βp1、λ_βp2It respectively indicates and meets forward data collection image, negative sense data set figure The cost of refusal decision, λ are taken under the decision condition of picture_βn1、λ_βn2It respectively indicates and is unsatisfactory for forward data collection image, negative sense data Collect the cost for taking refusal decision under the decision condition of image, λ_ξp1、λ_ξp2It respectively indicates and meets forward data collection image, negative sense The cost for not promising to undertake decision, λ are taken under the decision condition of data images_ξn1、λ_ξn2It respectively indicates and is unsatisfactory for forward data collection figure As, negative sense data images decision condition under take the cost for not promising to undertake decision；

S303, using the decision problem cost matrix of three decisions, the positive region for calculating separately out forward data collection image is optimal Value α₁With negative region optimal value β₁And negative sense data images positive region optimal value α₂With negative region optimal value β₂:

S304, the positive region optimal value α according to forward data collection image₁With negative region optimal value β₁, obtain forward data collection image Delayed Decision regional value ξ₁: β₁< ξ₁< α₁；According to negative sense data images positive region optimal value α₂With negative region optimal value β₂, Obtain the Delayed Decision regional value ξ of forward data collection image₂: β₂< ξ₂< α₂；

S305, according to the α₁、α₂、β₁、β₂、ξ₂And ξ₂, respectively obtain three Decision Classfication devices of forward data collection image and bear To three Decision Classfication devices of data images.

3. the image-recognizing method according to claim 1 based on three decisions and CNN, it is characterised in that: the forward direction The characteristics of image of data images or negative sense data images includes one of image outline, brightness, color, gray scale or more Kind.

4. the image-recognizing method according to claim 1 based on three decisions and CNN, it is characterised in that: according to described The class condition of three Decision Classfication devices, the images to be recognized be divided into three regions include: by forward data collection image with Negative sense data images are quantified as the region that size is 1, and the region that the size is 1 is classified according to three decision-making devices The positive region optimal value and negative region optimal value of obtained forward data collection image or the positive region of negative sense data images are optimal Value and negative region optimal value are respectively divided into three regions, and according to risk function, the smallest decision of risk of selection obtains risk and estimates Meter；According to the evaluation of risk, the images to be recognized takes the decision in some region into three regions；

Wherein, three regions specifically: [α₁,1]、[α₂, 1] and it is respectively forward data collection image, negative sense data images Positive region, [0, β₁]、[0,β₂] be respectively forward data collection image, negative sense data images negative region, (α₁,β₁)、(α₂,β₂) The respectively Delayed Decision region of forward data collection image, negative sense data images；Wherein, α₁、α₂Indicate forward data collection figure Picture, the optimal value of negative sense data images, β₁、β₂Indicate the optimal of the negative region of forward data collection image, negative sense data images Value；Corresponding three decisions, use α₁、α₂It respectively indicates and acceptance decision, β is taken to forward data collection image, negative sense data images₁、 β₂It respectively indicates and refusal decision, ξ is taken to forward data collection image, negative sense data images₁、ξ₂It respectively indicates to forward data Collection image, negative sense data images take Delayed Decision；

The evaluation of risk includes:

Receive risk:

R(α₁| y)=λ_αp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(α₂| y)=λ_αp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

Refuse risk:

R(β₁| y)=λ_βp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(β₂| y)=λ_βp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

Delayed Decision risk:

R(ξ₁| y)=λ_ξp1·P_β1(Y|[y])+(1-P_β1(Y|[y]))；

R(ξ₂| y)=λ_ξp2·P_β2(Y|[y])+(1-P_β2(Y|[y]))；

According to the evaluation of risk, the images to be recognized takes the decision package in some region into three regions It includes:

When meeting condition R (α₁|y)≤R(r₁|y)∧R(α₁|y)≤R(n₁| when y), images to be recognized is taken to forward data collection figure The decision of the receiving of picture；

When meeting condition R (α₂|y)≤R(r₂|y)∧R(α₂|y)≤R(n₂| when y), images to be recognized is taken to negative sense data set figure The decision of the receiving of picture；

When meeting condition R (r₁|y)≤R(α₁|y)∧R(r₁|y)≤R(n₁| when y), images to be recognized is taken to forward data collection figure The decision of the refusal of picture；

When meeting condition R (r₂|y)≤R(α₂|y)∧R(r₂|y)≤R(n₂| when y), images to be recognized is taken to negative sense data set figure The decision of the refusal of picture；

When meeting condition R (n₁|y)≤R(α₁|y)∧R(n₁|y)≤R(r₁| when y), images to be recognized is taken to forward data collection figure The decision that do not promise to undertake of picture；

When meeting condition R (n₂|y)≤R(α₂|y)∧R(n₂|y)≤R(r₂| when y), images to be recognized is taken to negative sense data set figure The decision that do not promise to undertake of picture；

Wherein, evaluation function is defined as P_r(Y | [y]), risk function are R (Δ | y), wherein Y indicates to act y decision, Δ table Show the decision movement to y, y indicates to determine, R (α₁|y)、R(α₂| y) indicate to forward data collection image, negative sense data images Receive the risk function of state, R (β in decision process₁|y)、R(β₂| y) indicate to forward data collection image, negative sense data set The risk function of disarmed state in image decision process, R (ξ₁|y)、R(ξ₂| y) indicate to forward data collection image, negative sense number According to the risk function of Delayed Decision state in collection image decision process；P_β1(Y|[y])、P_β2(Y | [y]) indicate to forward data The evaluation function of disarmed state in collection image, negative sense data images decision process；Risk function R (r₁|y)、R(r₂| y) indicate Forward data collection image, negative sense data images decision act r_iRisk mathematic expectaion, risk function R (n₁|y)、R(n₂| Y) indicate forward data collection image, in negative sense data images the risk of decision movement n mathematic expectaion, ∧ indicates to expire simultaneously Foot.

5. the image-recognizing method according to claim 4 based on three decisions and CNN, it is characterised in that: the basis The evaluation of risk, the images to be recognized take the decision in some region into three regions further include:

For forward data collection image:

Ensure

Wherein,It indicates acting r to the decision in forward data collection image decision process₁Risk comment Valence function, λ_αp1Expression meets the cost that acceptance decision is taken under decision condition, λ_αn1Foot forward data collection image with thumb down The cost of acceptance decision, λ are taken under decision condition_βp1Indicating to meet under the decision condition of forward data collection image takes refusal to determine The cost of plan, λ_βn1The cost for taking refusal decision under the decision condition of foot forward data collection image with thumb down, λ_ξp1It indicates Meet the cost taken under the decision condition of forward data collection image and do not promise to undertake decision, λ_ξn1Foot forward data collection figure with thumb down The cost for not promising to undertake decision is taken under the decision condition of picture；

For negative sense data images:

Ensure

Wherein,It indicates acting r to the decision in negative sense data images decision process₂Risk comment Valence function, λ_αp2Expression meets the cost that acceptance decision is taken under decision condition, λ_αn2Foot negative sense data images with thumb down The cost of acceptance decision, λ are taken under decision condition_βp2Indicating to meet under the decision conditions of negative sense data images takes refusal to determine The cost of plan, λ_βn2The cost for taking refusal decision under the decision condition of foot negative sense data images with thumb down, λ_ξp2It indicates Meet the cost taken under the decision condition of negative sense data images and do not promise to undertake decision, λ_ξn2Foot negative sense data set figure with thumb down The cost for not promising to undertake decision is taken under the decision condition of picture.

6. the image-recognizing method according to claim 1 based on three decisions and CNN, it is characterised in that: the basis The image to be identified is divided into three regions by the class condition of three Decision Classfication devices further include: introduces a pair of of threshold value (α_i,β_i), then set U is divided for following three regions:

To forward data collection image:

Positive region: POS (α₁,β₁)={ u ∈ ∪ | v (u) >=α₁}；

Negative region: NEG (α₁,β₁)={ u ∈ ∪ | v (u)≤β₁}；

Boundary Region:

Three decision rules can be constructed by above three region: positive region is corresponding to be received, the corresponding refusal of negative region, Boundary Region Corresponding Delayed Decision, Boundary Region is also Delayed Decision region, wherein and u ∈ ∪ | v (u) >=α₁Indicate to meet v (u) >=α₁And u ∈ When ∪, the value of image u；{u∈∪|v(u)≤β₁Indicate to meet v (u)≤β₁And when u ∈ ∪, the value of image u；It indicates to meetAnd when u ∈ ∪, the value of image u；Corresponding ordering relation ≤,Indicate stringent ordering relation；That is:And if only if β₁≤v(u)≤α₁And β₁≠v(u)≠α₁；α₁ Expression takes acceptance decision, β to forward data collection image₁It indicates to take refusal decision, ξ to forward data collection image₁It indicates Decision is not promised to undertake to taking for forward data collection image；U indicates that forward data image, v (u) are indicated to forward data image u amount Value after change；

To negative sense data images:

Positive region: POS (α₂,β₂)={ u ∈ ∪ | v (u) >=α₂}；

Negative region: NEG (α₂,β₂)={ u ∈ ∪ | v (u)≤β₂}；

Boundary Region:

Three decision rules can be constructed by above three region: positive region is corresponding to be received, the corresponding refusal of negative region, and Boundary Region is corresponding Delayed Decision, Boundary Region are also Delayed Decision region, wherein and u ∈ ∪ | v (u) >=α₂Indicate to meet v (u) >=α₂And image when u ∈ ∪ The value of u；{u∈∪|v(u)≤β₂Indicate to meet v (u)≤β₂And when u ∈ ∪ image u value； It indicates to meetAnd when u ∈ ∪ image u value；Corresponding ordering relation≤,Indicate that stringent total order is closed System；That is:And if only if β₂≤v(u)≤α₂And β₂≠v(u)≠α₂；α₂It indicates to negative sense data images Take acceptance decision, β₂It indicates to take refusal decision, ξ to negative sense data images₂Negative sense data images are taken in expression Do not promise to undertake decision；U indicates that negative sense data image, v (u) are indicated to the value after negative sense data u quantization.

7. the image-recognizing method based on three decisions and CNN described in -6 according to claim 1, it is characterised in that: described According to the class condition of three Decision Classfication devices, after the image to be identified is divided into three regions, determine to the delay Plan region subseries again, comprising: using the positive region sorted out as new positive sample, the negative region sorted out as new Negative sample is training condition, is trained again to three Decision Classfication devices, until Delayed Decision region part not It can divide again, reach critical value.

8. the image-recognizing method according to claim 1 based on three decisions and CNN, it is characterised in that: described to have instructed Practice the formula that CNN classifier carries out image recognition processing to the positive region split are as follows:

Wherein, x_iIndicate input vector, i.e., R, G, B eigenmatrix of every positive region image, i ∈ (1,2,3)；W_iIndicate input Vector x_iCorresponding weight, i.e. convolution kernel；B indicates biasing；h_W,bIt (x) is output characteristic value.

9. the image-recognizing method according to claim 1 based on three decisions and CNN, it is characterised in that: described wait instruct The establishment process for practicing CNN classifier includes the three classes sample obtained according to the sample number strong point of images to be recognized, makes classifier.