CN102750702B - Monocular infrared image depth estimation method based on optimized BP (Back Propagation) neural network model - Google Patents

Monocular infrared image depth estimation method based on optimized BP (Back Propagation) neural network model Download PDF

Info

Publication number
CN102750702B
CN102750702B CN201210206701.1A CN201210206701A CN102750702B CN 102750702 B CN102750702 B CN 102750702B CN 201210206701 A CN201210206701 A CN 201210206701A CN 102750702 B CN102750702 B CN 102750702B
Authority
CN
China
Prior art keywords
depth
pixel
yardstick
infrared image
image block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210206701.1A
Other languages
Chinese (zh)
Other versions
CN102750702A (en
Inventor
孙韶媛
席林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Donghua University
Original Assignee
Donghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Donghua University filed Critical Donghua University
Priority to CN201210206701.1A priority Critical patent/CN102750702B/en
Publication of CN102750702A publication Critical patent/CN102750702A/en
Application granted granted Critical
Publication of CN102750702B publication Critical patent/CN102750702B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Image Analysis (AREA)

Abstract

The invention relates to a monocular infrared image depth estimation method based on an optimized BP (Back Propagation) neural network model. The method comprises the following steps of: acquiring a monocular infrared image and a depth map to which the monocular infrared image corresponds; setting at least three feature regions with different scales for pixel points in the monocular infrared image; calculating feature vectors of the feature regions to which the pixel points in the monocular infrared image correspond; screening all the feature vectors by successively using stepwise linear regression and independent component analysis methods to obtain feature vectors conforming to depth information of the infrared image; constructing a depth training sample set by using the obtained feature vectors and the depth map to which the infrared image corresponds, and performing nonlinear fitting on the feature vectors in the set and depth values of the depth map by using a BP neuron network, and optimizing the BP neuron network through a genetic algorithm, and then constructing a depth model; and analyzing the monocular infrared image through the depth model to obtain a depth estimated value. By using the monocular infrared image depth estimation method based on the optimized BP neural network model, the depth information of the infrared image can be relatively accurately estimated.

Description

Monocular infrared image depth estimation method based on Optimized BP Neural Network model
Technical field
The present invention relates to infrared image estimation of Depth technical field, particularly relate to a kind of monocular infrared image depth estimation method based on genetic algorithm optimization BP neural network model.
Background technology
The estimation of Depth of image is from image, to obtain depth distance information, is the problem of a depth perception in essence.The spatial positional information being built by depth perception characterizes is the surperficial relative distance detecting in from observer to scene.Recover the now existing more satisfactory algorithm of depth distance information in coloured image, but for infrared image, because of its reflection be the Temperature Distribution of scene, have the defects such as low signal-to-noise ratio, low contrast, the degree of depth algorithm that recovers this image still belongs to blank.If can recover the depth information of infrared image, the understanding effect of human eye to this image will greatly be improved so.
Picture depth method of estimation is mainly to launch for binocular Depth cue and the estimation of Depth based on image sequence at present, and these two kinds of methods all depend on the feature difference between image.And estimate for monocular depth, comparatively classical in the middle of traditional algorithm is in early days " by shape from shading (shape from shading) ", it is theoretical foundation that this algorithm be take space multistory how much, according to light source, being irradiated to that light and shade that body surface produces changes is that the shade of image recovers Object Depth, but because this algorithm needs priori (as reflection model and light source direction etc.) that the limitation of application is increased.Afterwards, some researchers find the importance of experience gradually, start to utilize the method for machine learning to go to address this problem.The team of the Andrew Ng of Stanford University, by utilizing the model of Markov field training to carry out estimation of Depth to single image, has reached good effect; The Aloysha Efros team of CMU is manual simple classification in Calibration Field scape before training, such as sky, trees, ground and perpendicular line etc., then utilize a large amount of data to learn these classifications, thereby and finally by building Bayesian model, new images is classified and recovered depth information.Although this method is comparatively applicable to the simple picture of a series of scenes, and produces a desired effect, for having in scene, the classification of study is not inaccurate often.
Summary of the invention
Technical matters to be solved by this invention is to provide a kind of monocular infrared image depth estimation method based on Optimized BP Neural Network model, and the method can be estimated the depth information of infrared image comparatively exactly.
The technical solution adopted for the present invention to solve the technical problems is: a kind of monocular infrared image depth estimation method based on Optimized BP Neural Network model is provided, comprises the following steps:
(1) obtain any monocular infrared image I (x, y) and with the corresponding depth map of described monocular infrared image I (x, y);
(2) be described monocular infrared image I (x, y) each pixel in is set the characteristic area at least three different scales, three ascending being respectively of different scale: the first yardstick, the second yardstick and the 3rd yardstick, wherein, the characteristic area of each yardstick at least comprises and is positioned at the image block at center and the image block adjacent with described image block upper and lower, left and right, and i pixel image block that is positioned at center on the first yardstick is i pixel itself; I pixel image block that is positioned at center on the second yardstick comprises all image blocks on the first yardstick; I pixel image block that is positioned at center on the 3rd yardstick comprises all image blocks on the second yardstick; By that analogy;
(3) calculate monocular infrared image I (x, y) proper vector of the corresponding characteristic area of each pixel in, the characteristic component of the proper vector of i pixel at least comprises: the gray-scale value of i pixel all image blocks on the first yardstick, the texture energy of i pixel each image block on other yardsticks except the first yardstick, the gradient energy of different directions and the average of all gradient energy and the variance of i pixel each image block on other yardsticks except the first yardstick, and the sharpness of i pixel each image block on other yardsticks except the first yardstick,
(4) to all proper vectors of obtaining, utilize successively the method for progressively linear regression analysis and independent component analysis to screen, obtain meeting the proper vector of infrared image depth information;
(5) utilize screening in step (4) to obtain proper vector and described monocular infrared image I (x, y) corresponding depth map builds the set of degree of depth training sample, the depth value of the proper vector in the set of degree of depth training sample and depth map is carried out to nonlinear fitting by BP neural network, and by genetic algorithm, the initial weight of BP neural network and threshold value are optimized, and then build depth model;
(6) the depth model analysis new monocular infrared image collecting being obtained by structure obtains estimation of Depth value.
Texture energy in described step (3) calculates by Louth mask, and concrete steps are: adopt N basic two-dimentional Louth mask, be designated as M 1..., M n, described monocular infrared image I (x, y) is done to convolution with each two-dimentional Louth mask, the value after monocular infrared image I (x, y) and k two-dimentional Louth mask convolution is: T k(x, y)=I (x, y) * M k, k=1 ..., N, i pixel m image block N on j yardstick i(m) texture energy obtaining after monocular infrared image I (x, y) and k two-dimentional Louth mask convolution is E n N i ( m ) j , E n N i ( m ) j = Σ x , y ∈ N i ( m ) | T k ( x , y ) | .
In described step (3), the gradient energy of different directions of i pixel each image block on the second yardstick and the 3rd yardstick and the average of all gradient energy and variance calculate by following steps, concrete steps are: monocular infrared image I (x, y) is tried to achieve respectively on x direction of principal axis and y direction of principal axis to x axial gradient figure I gradx(x, y) and y axial gradient figure I grady(x, y), monocular infrared image I (x, y) is at angle θ lgrad G in direction l(x, y)=I gradx(x, y) * cos (θ l)+I grady(x, y) * sin (θ l), wherein, l=0 ..., 7, calculate subsequently the gradient energy of the different directions of each image block of each pixel on other yardsticks except the first yardstick, wherein, i pixel m image block N on j yardstick i(m) at angle θ lgradient energy in direction is i pixel m image block N on j yardstick i(m) at angle θ lthe average of the gradient energy in direction i pixel m image block N on j yardstick i(m) at angle θ lgradient energy variance in direction wherein, size is image block N i(m) number of the pixel comprising,
In described step (3), the sharpness of i pixel each image block on the second yardstick and the 3rd yardstick calculates by following steps, and concrete steps are: i pixel m image block N on j yardstick i(m) sharpness wherein, I (x, y) is image block N i(m) contain the gray scale of pixel, for image block N i(m) average of the gray scale of contained pixel, size is image block N i(m) number of the pixel comprising.
In described step (4) progressively linear regression analysis comprise following sub-step:
(411) related coefficient of each characteristic component and depth value in calculated characteristics vector, obtains the sequence of each characteristic component to effect of depth degree according to the absolute value of related coefficient is descending;
(412) from the characteristic component of the absolute value maximum of related coefficient, start progressively to introduce regression equation, and do regression equation significance test, if significantly do not think, selected whole characteristic component is not all the principal element that affects depth value, if significantly again from depth value is affected to the descending regression equation of introducing one by one successively;
(413) new characteristic component of every introducing all needs each characteristic component contained in regression equation to carry out significance test, by that in new regression equation not significantly and depth value is affected to minimum characteristic component rejection, repeat this step until each characteristic component in regression equation is remarkable;
(414) introduce again the characteristic component in the characteristic component of not introducing, depth value being had the greatest impact, repeating step (413) and step (414), until cannot reject selected characteristic component, till also cannot introducing new characteristic component.
What independent component analysis was used in described step (4) is Fast ICA algorithm, by Fast ICA algorithm, to analyzing through the proper vector of progressively linear regression analysis, makes between each component independently as much as possible, comprises following sub-step:
(421) specifying the proper vector through M pixel after linear regression analysis is progressively observation data X, and observation data X is carried out to centralization, and making it average is 0;
(422), by the observation data X albefaction of centralization, be about to after observation data X projects to new subspace become albefaction vector Z, Z=W 0x, wherein, W 0for albefaction matrix, W 0-1/2u t, Λ is the eigenvalue matrix of the covariance matrix of observation data X, U is the eigenvectors matrix of the covariance matrix of observation data X;
(423) upgrade W *make W *=E{Zg (W tz) }-E{g ' (W tz}W, wherein g () is nonlinear function, and then to W *standardization: W=W */ || W *||, if do not restrain, repeat this step, wherein, select initial random weight vector that a mould is 1 as the initial value of W.
Initial weight and the threshold value of in described step (5), utilizing genetic algorithm to be optimized BP neural network also comprise following sub-step:
(511) according to training sample, build initial BP neural network, and determine the number of input layer, hidden layer and the output layer of this network;
(512) carry out the initialization of Population in Genetic Algorithms, wherein each individuality of population is to consist of the threshold value of the weights between the weights between input layer and hidden layer, hidden layer and output layer, hidden layer and the threshold value of output layer four parts, and all weights and threshold value are listed as into a real number vector;
(513) maximum iteration time is set as stopping criterion for iteration, iterative process each time comprises the operation of selection, crossover and mutation, when reaching stopping criterion for iteration, output the last reign of a dynasty population optimum individual be the approximate optimal solution of initial weight and threshold value.
Beneficial effect
Owing to having adopted above-mentioned technical scheme, the present invention compared with prior art, there is following advantage and good effect: the present invention utilizes infrared image " spatial context " and " multiple dimensioned " information extraction depth characteristic vector, by the method for progressively linear regression analysis and independent component analysis, the feature of extracting is screened successively, be conducive to find the depth characteristic that is more applicable to infrared image, and build degree of depth training set with this; Adopt back propagation learning theory to carry out nonlinear fitting to training set, and be optimized by genetic algorithm, improved the training speed of experiment and the accuracy rate of matching.
Accompanying drawing explanation
Fig. 1 is process flow diagram of the present invention.
Embodiment
Below in conjunction with specific embodiment, further set forth the present invention.Should be understood that these embodiment are only not used in and limit the scope of the invention for the present invention is described.In addition should be understood that those skilled in the art can make various changes or modifications the present invention after having read the content of the present invention's instruction, these equivalent form of values fall within the application's appended claims limited range equally.
As shown in Figure 1, the present embodiment discloses a kind of monocular infrared image depth estimation method of the BP neural network model based on genetic algorithm optimization, the steps include:
Step 1, obtain any monocular infrared image I (x, y) and with the corresponding depth map of this monocular infrared image I (x, y);
Step 2, for monocular infrared image I (x, y) each pixel in is set the characteristic area on three different scales, three ascending being respectively of different scale: the first yardstick, the second yardstick and the 3rd yardstick, wherein, the characteristic area of each yardstick comprise be positioned at center image block and with this image block on, under, left, right adjacent image block, i pixel image block that is positioned at center on the first yardstick is i pixel itself, the image block that be positioned at center of this i pixel on the second yardstick comprises all image blocks on the first yardstick, the image block that be positioned at center of this i pixel on the 3rd yardstick comprises all image blocks on the second yardstick,
Step 3, calculate monocular infrared image I (x, y) proper vector of the corresponding characteristic area of each pixel in, the characteristic component of the proper vector of i pixel at least comprises: the gray-scale value of i pixel all image blocks on the first yardstick, the texture energy of i pixel each image block on the second yardstick and the 3rd yardstick, gradient energy in 8 directions of i pixel each image block on the second yardstick and the 3rd yardstick and average and the variance of all gradient energy, and the sharpness of i pixel each image block on the second yardstick and the 3rd yardstick,
The texture energy of i pixel each image block on the second yardstick and the 3rd yardstick calculates by following steps:
Adopt 9 basic two-dimentional Louth masks, be designated as M 1..., M 9, monocular infrared image I (x, y) is done to convolution with each two-dimentional Louth mask, the value after monocular infrared image I (x, y) and k two-dimentional Louth mask convolution is: T k(x, y)=I (x, y) * M k, k=1 ..., 9, i pixel m image block N on j yardstick i(m) texture energy obtaining after monocular infrared image I (x, y) and k two-dimentional Louth mask convolution is E n N i ( m ) j , E n N i ( m ) j = Σ x , y ∈ N i ( m ) | T k ( x , y ) | , j = 2,3 .
The gradient energy of different directions of i pixel each image block on the second yardstick and the 3rd yardstick and the average of all gradient energy and variance calculate by following steps:
Monocular infrared image I (x, y) is tried to achieve respectively on x direction of principal axis and y direction of principal axis to x axial gradient figure I gradx(x, y) and y axial gradient figure I grady(x, y), monocular infrared image I (x, y) is at angle θ lgrad G in direction l(x, y)=I gradx(x, y) * cos (θ l)+I grady(x, y) * sin (θ l), wherein, l=0 ..., 7, calculate subsequently the gradient energy of the different directions of each image block of each pixel on other yardsticks except the first yardstick, wherein, i pixel m image block N on j yardstick i(m) at angle θ lgradient energy in direction is i pixel m image block N on j yardstick i(m) at angle θ lthe average of the gradient energy in direction i pixel m image block N on j yardstick i(m) at angle θ lgradient energy variance in direction wherein, size is image block N i(m) number of the pixel comprising,
The sharpness of i pixel each image block on the second yardstick and the 3rd yardstick calculates by following steps:
I pixel m image block N on j yardstick i(m) sharpness wherein, I (x, y) is image block N i(m) contain the gray scale of pixel, for image block N i(m) average of the gray scale of contained pixel,
Step 4, to all proper vectors of having obtained in step 3, utilize successively the method for progressively linear regression analysis and independent component analysis to screen, obtain comparatively meeting the proper vector of infrared image depth information.
In step 4, progressively the concrete steps of linear regression analysis are:
The related coefficient of each characteristic component and depth value in step 4.1.1, calculated characteristics vector, obtains the sequence of each characteristic component to effect of depth degree according to the absolute value of related coefficient is descending;
Step 4.1.2, from the characteristic component of the absolute value maximum of related coefficient, start progressively to introduce regression equation, and do regression equation significance test, if significantly can not think, selected whole characteristic component is not all the principal element that affects depth value, if significantly again from depth value is affected to the descending regression equation of introducing one by one successively;
Step 4.1.3, new characteristic component of every introducing all need each characteristic component contained in regression equation to carry out significance test, by that in new regression equation not significantly and depth value is affected to minimum characteristic component rejection, repeat this step until each characteristic component in regression equation is remarkable;
Step 4.1.4, introduce the characteristic component in the characteristic component of not introducing, depth value being had the greatest impact again, repeating step 4.1.3 and step 4.1.4, until cannot reject selected characteristic component, till also cannot introducing new characteristic component.
What in step 4, independent component analysis was used is Fast ICA algorithm, by Fast ICA algorithm, to analyzing through the proper vector of progressively linear regression analysis, makes between each component independent as much as possible.In the present embodiment, Fast ICA algorithm is usingd negentropy maximum as searching direction, and its concrete steps are as follows:
Step 4.2.1, to specify the proper vector through M pixel after linear regression analysis be progressively observation data X, and observation data X is carried out to centralization, and making it average is 0;
Step 4.2.2, by the observation data X albefaction of centralization, be about to after observation data X will project to new subspace become albefaction vector Z, Z=W 0x, wherein, W 0for albefaction matrix, W 0-1/2u t, Λ is the eigenvalue matrix of the covariance matrix of observation data X, U is the eigenvectors matrix of the covariance matrix of observation data X;
Step 4.2.3, renewal W *be to solve the separation matrix drawing, the separation matrix W finally drawing *and observation data X multiplies each other and can obtain between each component as far as possible independently source data and make W *=E{Zg (W tz) }-E{g ' (W tz) } W, wherein E () is expectation computing, wherein g () is nonlinear function, and then to W *standardization: W=W */ || W *||, if do not restrain, repeat this step, wherein, select initial random weight vector that a mould is 1 as the initial value of W.
The proper vector of infrared image that step 5, utilization are drawn by step 4 and the depth map of infrared image build degree of depth training sample set { f i, depth i, wherein, f ibe the proper vector of i pixel, i=1 ..., l, f i∈ χ, depth ibe i the corresponding depth value of pixel.Then utilize back propagation learning the Theory Construction depth model, and be optimized by genetic algorithm, and then new infrared image is carried out to estimation of Depth.
In step 5, the concrete steps of BP neural metwork training are as follows:
Step 5.1.1, according to proper vector f iwith depth value depth idetermine the input layer of BP neural network and the nodes of output layer.Then by reference to formula roughly definite hidden layer node is counted scope, and wherein n is input layer number, and m is output layer nodes, and a is the constant between 0 ~ 10, and l is hidden layer node number.By trial method, determine again the hidden layer node number of square error minimum;
Weights and the threshold value of step 5.1.2, initialization BP neural network, and given maximum iteration time, Study rate parameter η and neuron excitation function wherein a is constant;
The output H of step 5.1.3, calculating hidden layer:
H j = f ( Σ i = 1 n ω ij x i - a j ) , j = 1,2 , . . . , l
Wherein l is hidden layer node number, ω ijfor connecting the weights between input layer and hidden layer, x ibe the proper vector f mentioning in step 5.1.1 i, a jfor the threshold value of each hidden layer node, the neuron excitation function of f (.) for mentioning in step 5.1.2;
Step 5.1.4, calculating prediction output O:
O k = Σ j = 1 l H j ω jk - b k , k = 1,2 , . . . , m
Wherein m is output layer nodes, ω jkfor connecting the weights between hidden layer and output layer, b kthreshold value for each output layer node;
Step 5.1.5, according to prediction output and desired output error of calculation e, and according to error e, upgrade weights and the threshold value of network:
e k=Y k-O k,k=1,...,m
ω ij = ω ij + η H j ( 1 - H j ) x ( i ) Σ k = 1 m ω jk e k , i = 1 , . . . , n ; j = 1 , . . . , l
ω jkjk+ηH je k,j=1,...,l;k=1,...,m
a j = a j + η ( 1 - H j ) Σ k = 1 m ω jk e k , j = 1 , . . . , l
b k=b k+e k,k=1,...,m
Wherein n is input layer number, Y kfor desired output, i.e. depth value in training set;
Step 5.1.6, circulation step 5.1.3 are to step 5.1.5, until finish after reaching maximum iteration time;
Step 5.1.7, new infrared image is obtained to the proper vector of each pixel after by the feature extraction of step 1 and step 2 and screening, each component of proper vector is corresponded to respectively to the input layer of the BP neural network of having trained, by prediction, draw corresponding depth value, just completed the estimation to new infrared image depth value.
The concrete steps that in step 5, BP neural network initial weight and threshold value are optimized by genetic algorithm are as follows:
Step 5.2.1, carry out the initialization of Population in Genetic Algorithms, wherein each individuality of population is to consist of the threshold value of the weights between the weights between input layer and hidden layer, hidden layer and output layer, hidden layer and the threshold value of output layer four parts.All weights and threshold value are listed as into a real number vector;
Step 5.2.2, carry out the selection operation of genetic algorithm, first individuality is acted on to BP neural network, the Error Absolute Value between the prediction output drawing and desired output is as this individual fitness value F.According to each individual adaptive value, select, the selection Probability p of each individual i ifor:
f i=k/F i
P i = f i / Σ j = 1 N f i
F wherein ifor individual fitness value, the coefficient of k for arranging, is a constant; N is population at individual number, because of F iwhat reflect is the error between desired output and prediction output, and more the bright individuality of novel is more excellent for error, and selecteed probability is just larger, so by F iget inverse and become f iso that the size of probability is proportional with selecting; f jmeaning and f iidentical, wherein subscript j represents population individual amount;
Step 5.2.3, carry out the interlace operation of genetic algorithm, need the chromosome intersecting and need the position intersecting to select at random.Suppose k chromosome a kwith l chromosome a lj position, intersect, method of operating is as follows:
a kj = a kj ( 1 - b ) + a lj b a lj = a lj ( 1 - b ) + a kj b
Wherein b is the random number between [0,1];
Step 5.2.4, carry out the mutation operation of genetic algorithm, need the chromosome of variation and the position of variation to select at random.Suppose the gene a of i j individual position ijneed to make a variation, method of operating is as follows:
a ij = a ij + ( a ij - a max ) * f ( g ) , r &GreaterEqual; 0.5 a ij + ( a min - a ij ) * f ( g ) , r < 0.5
A wherein maxfor gene a ijthe upper bound, a minfor gene a ijlower bound; r and r 2be all the random number between [0,1], g is current iteration number of times, G msxit is maximum iteration time.
Step 5.2.5, circulation step 5.2.2, to step 5.2.4, arrange circulation maximum times, when circulation reaches maximum times, stop, and take out the individuality of fitness value F minimum as initial weight and the threshold value of neural metwork training.

Claims (5)

1. the monocular infrared image depth estimation method based on Optimized BP Neural Network model, is characterized in that, comprises the following steps:
(1) obtain any monocular infrared image I (x, y) and with the corresponding depth map of described monocular infrared image I (x, y);
(2) be described monocular infrared image I (x, y) each pixel in is set the characteristic area on three different scales, three ascending being respectively of different scale: the first yardstick, the second yardstick and the 3rd yardstick, wherein, the characteristic area of each yardstick at least comprises and is positioned at the image block at center and the image block adjacent with described image block upper and lower, left and right, and i pixel image block that is positioned at center on the first yardstick is i pixel itself; I pixel image block that is positioned at center on the second yardstick comprises all image blocks on the first yardstick; I pixel image block that is positioned at center on the 3rd yardstick comprises all image blocks on the second yardstick;
(3) calculate monocular infrared image I (x, y) proper vector of the corresponding characteristic area of each pixel in, the characteristic component of the proper vector of i pixel at least comprises: the gray-scale value of i pixel all image blocks on the first yardstick, the texture energy of i pixel each image block on other yardsticks except the first yardstick, the gradient energy of different directions and the average of all gradient energy and the variance of i pixel each image block on other yardsticks except the first yardstick, and the sharpness of i pixel each image block on other yardsticks except the first yardstick, wherein, the gradient energy of different directions of i pixel each image block on the second yardstick and the 3rd yardstick and the average of all gradient energy and variance calculate by following steps, concrete steps are: monocular infrared image I (x, y) is tried to achieve respectively on x direction of principal axis and y direction of principal axis to x axial gradient figure I gradx(x, y) and y axial gradient figure I grady(x, y), monocular infrared image I (x, y) is at angle θ lgrad G in direction l(x, y)=I gradx(x, y) * cos (θ l)+I grady(x, y) * sin (θ l), wherein, calculate subsequently the gradient energy of the different directions of each image block of each pixel on other yardsticks except the first yardstick, wherein, i pixel m image block N on j yardstick i(m) at angle θ lgradient energy in direction is i pixel m image block N on j yardstick i(m) at angle θ lthe average of the gradient energy in direction i pixel m image block N on j yardstick i(m) at angle θ lgradient energy variance in direction wherein, size is image block N i(m) number of the pixel comprising, the sharpness of i pixel each image block on the second yardstick and the 3rd yardstick calculates by following steps, and concrete steps are: i pixel m image block N on j yardstick i(m) sharpness wherein, I (x, y) is image block N i(m) contain the gray scale of pixel, for image block N i(m) average of the gray scale of contained pixel, size is image block N i(m) number of the pixel comprising,
(4) to all proper vectors of obtaining, utilize successively the method for progressively linear regression analysis and independent component analysis to screen, obtain meeting the proper vector of infrared image depth information;
(5) utilize screening in step (4) to obtain proper vector and described monocular infrared image I (x, y) corresponding depth map builds the set of degree of depth training sample, the depth value of the proper vector in the set of degree of depth training sample and depth map is carried out to nonlinear fitting by BP neural network, and by genetic algorithm, the initial weight of BP neural network and threshold value are optimized, and then build depth model;
(6) the depth model analysis new monocular infrared image collecting being obtained by structure obtains estimation of Depth value.
2. the monocular infrared image depth estimation method based on Optimized BP Neural Network model according to claim 1, it is characterized in that, texture energy in described step (3) calculates by Louth mask, and concrete steps are: adopt N basic two-dimentional Louth mask, be designated as M 1..., M n, described monocular infrared image I (x, y) is done to convolution with each two-dimentional Louth mask, the value after monocular infrared image I (x, y) and k two-dimentional Louth mask convolution is: T k(x, y)=I (x, y) * M k, k=1 ..., N, i pixel m image block N on j yardstick i(m) texture energy obtaining after monocular infrared image I (x, y) and k two-dimentional Louth mask convolution is
En N i ( m ) j = &Sigma; x , y &Element; N i ( m ) | T k ( x , y ) | .
3. the monocular infrared image depth estimation method based on Optimized BP Neural Network model according to claim 1, is characterized in that, in described step (4) progressively linear regression analysis comprise following sub-step:
(411) related coefficient of each characteristic component and depth value in calculated characteristics vector, obtains the sequence of each characteristic component to effect of depth degree according to the absolute value of related coefficient is descending;
(412) from the characteristic component of the absolute value maximum of related coefficient, start progressively to introduce regression equation, and do regression equation significance test, if significantly do not think, selected whole characteristic component is not all the principal element that affects depth value, if significantly again from depth value is affected to the descending regression equation of introducing one by one successively;
(413) new characteristic component of every introducing all needs each characteristic component contained in regression equation to carry out significance test, by that in new regression equation not significantly and depth value is affected to minimum characteristic component rejection, repeat this step until each characteristic component in regression equation is remarkable;
(414) introduce again the characteristic component in the characteristic component of not introducing, depth value being had the greatest impact, repeating step (413) and step (414), until cannot reject selected characteristic component, till also cannot introducing new characteristic component.
4. the monocular infrared image depth estimation method based on Optimized BP Neural Network model according to claim 1, it is characterized in that, what in described step (4), independent component analysis was used is Fast ICA algorithm, by Fast ICA algorithm to analyzing through the proper vector of progressively linear regression analysis, make between each component independently as much as possible, comprise following sub-step:
(421) specifying the proper vector through M pixel after linear regression analysis is progressively observation data X, and observation data X is carried out to centralization, and making it average is 0;
(422), by the observation data X albefaction of centralization, be about to after observation data X projects to new subspace become albefaction vector Z, Z=W 0x, wherein, W 0for albefaction matrix, W 0-1/2u t, Λ is the eigenvalue matrix of the covariance matrix of observation data X, U is the eigenvectors matrix of the covariance matrix of observation data X;
(423) upgrade W *make W *=E{Zg (W tz) }-E{g'(W tz) } W, wherein g () is nonlinear function, and then to W *standardization: W=W */ || W *||, if do not restrain, repeat this step, wherein, select initial random weight vector that a mould is 1 as the initial value of W.
5. the monocular infrared image depth estimation method based on Optimized BP Neural Network model according to claim 1, it is characterized in that, initial weight and the threshold value of in described step (5), utilizing genetic algorithm to be optimized BP neural network also comprise following sub-step:
(511) according to training sample, build initial BP neural network, and determine the number of input layer, hidden layer and the output layer of this network;
(512) carry out the initialization of Population in Genetic Algorithms, wherein each individuality of population is to consist of the threshold value of the weights between the weights between input layer and hidden layer, hidden layer and output layer, hidden layer and the threshold value of output layer four parts, and all weights and threshold value are listed as into a real number vector;
(513) maximum iteration time is set as stopping criterion for iteration, iterative process each time comprises the operation of selection, crossover and mutation, when reaching stopping criterion for iteration, output the last reign of a dynasty population optimum individual be the approximate optimal solution of initial weight and threshold value.
CN201210206701.1A 2012-06-21 2012-06-21 Monocular infrared image depth estimation method based on optimized BP (Back Propagation) neural network model Expired - Fee Related CN102750702B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210206701.1A CN102750702B (en) 2012-06-21 2012-06-21 Monocular infrared image depth estimation method based on optimized BP (Back Propagation) neural network model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210206701.1A CN102750702B (en) 2012-06-21 2012-06-21 Monocular infrared image depth estimation method based on optimized BP (Back Propagation) neural network model

Publications (2)

Publication Number Publication Date
CN102750702A CN102750702A (en) 2012-10-24
CN102750702B true CN102750702B (en) 2014-10-15

Family

ID=47030851

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210206701.1A Expired - Fee Related CN102750702B (en) 2012-06-21 2012-06-21 Monocular infrared image depth estimation method based on optimized BP (Back Propagation) neural network model

Country Status (1)

Country Link
CN (1) CN102750702B (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107690663B (en) * 2015-06-05 2022-04-12 渊慧科技有限公司 Whitening neural network layer
CN106408015A (en) * 2016-09-13 2017-02-15 电子科技大学成都研究院 Road fork identification and depth estimation method based on convolutional neural network
EP3507771B1 (en) 2016-09-15 2020-06-24 Google LLC Image depth prediction neural networks
CN107204010B (en) * 2017-04-28 2019-11-19 中国科学院计算技术研究所 A kind of monocular image depth estimation method and system
CN107578436B (en) * 2017-08-02 2020-06-12 南京邮电大学 Monocular image depth estimation method based on full convolution neural network FCN
CN107607207B (en) * 2017-08-22 2019-06-28 中国科学院深圳先进技术研究院 A kind of power equipment thermal fault detection method, system and electronic equipment
CN107993239B (en) * 2017-12-25 2022-04-12 北京邮电大学 Method and device for calculating depth order of monocular image
CN108335322B (en) 2018-02-01 2021-02-12 深圳市商汤科技有限公司 Depth estimation method and apparatus, electronic device, program, and medium
CN109035319B (en) * 2018-07-27 2021-04-30 深圳市商汤科技有限公司 Monocular image depth estimation method, monocular image depth estimation device, monocular image depth estimation apparatus, monocular image depth estimation program, and storage medium
CN109461177B (en) * 2018-09-29 2021-12-10 浙江科技学院 Monocular image depth prediction method based on neural network
CN109377524B (en) * 2018-10-29 2021-02-23 山东师范大学 Method and system for recovering depth of single image
CN109146944B (en) * 2018-10-30 2020-06-26 浙江科技学院 Visual depth estimation method based on depth separable convolutional neural network
CN109598704B (en) * 2018-11-19 2023-05-02 电子科技大学 Fecal microscopic image definition evaluation method based on BP neural network
CN109903393B (en) * 2019-02-22 2021-03-16 清华大学 New visual angle scene synthesis method and device based on deep learning
CN112329189A (en) * 2019-07-30 2021-02-05 天津大学青岛海洋技术研究院 Transient electromagnetic useful signal extraction method based on independent quantity analysis
CN112036470A (en) * 2020-08-28 2020-12-04 扬州大学 Cloud transmission-based multi-sensor fusion cucumber bemisia tabaci identification method
CN112215410B (en) * 2020-09-24 2023-01-10 国网福建省电力有限公司 Power load prediction method based on improved deep learning

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101789123A (en) * 2010-01-27 2010-07-28 中国科学院半导体研究所 Method for creating distance map based on monocular camera machine vision

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101491556B1 (en) * 2008-12-02 2015-02-09 삼성전자주식회사 Device and method for depth estimation

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101789123A (en) * 2010-01-27 2010-07-28 中国科学院半导体研究所 Method for creating distance map based on monocular camera machine vision

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
Claudia Hendrix et al..Relationship between monocular and binocular depth cues for judgements of spatial information and spatial instrument design.《Displays》.1995,第16卷(第3期),第103-113页.
Depth Estimation from a Monocular Outdoor Image;Tien-Ying Kuo et al.;《2011 IEEE International Conference on Consumer Electronics》;20110112;第161-162页 *
Relationship between monocular and binocular depth cues for judgements of spatial information and spatial instrument design;Claudia Hendrix et al.;《Displays》;19950630;第16卷(第3期);第103-113页 *
Tien-Ying Kuo et al..Depth Estimation from a Monocular Outdoor Image.《2011 IEEE International Conference on Consumer Electronics》.2011,第161-162页.
基于图像分析的自然彩色夜视成像方法研究;谷小婧;《中国博士学位论文全文数据库》;20120615;第1-166页 *
基于多尺度纹理能量测度的单幅图像深度估计;蓝建梁等;《计算机工程与设计》;20110115;第32卷(第1期);第224-227页,第231页 *
蓝建梁等.基于多尺度纹理能量测度的单幅图像深度估计.《计算机工程与设计》.2011,第32卷(第1期),第224-227页,第231页.
谷小婧.基于图像分析的自然彩色夜视成像方法研究.《中国博士学位论文全文数据库》.2012,第1-166页.

Also Published As

Publication number Publication date
CN102750702A (en) 2012-10-24

Similar Documents

Publication Publication Date Title
CN102750702B (en) Monocular infrared image depth estimation method based on optimized BP (Back Propagation) neural network model
Ayoub A review on machine learning algorithms to predict daylighting inside buildings
CN110555446B (en) Remote sensing image scene classification method based on multi-scale depth feature fusion and migration learning
CN110111335B (en) Urban traffic scene semantic segmentation method and system for adaptive countermeasure learning
CN107132516B (en) A kind of Radar range profile&#39;s target identification method based on depth confidence network
CN108038445B (en) SAR automatic target identification method based on multi-view deep learning framework
CN110427875A (en) Infrared image object detection method based on depth migration study and extreme learning machine
CN110473592B (en) Multi-view human synthetic lethal gene prediction method
CN112862090B (en) Air temperature forecasting method based on deep space-time neural network
CN106920243A (en) The ceramic material part method for sequence image segmentation of improved full convolutional neural networks
CN102708569A (en) Monocular infrared image depth estimating method on basis of SVM (Support Vector Machine) model
CN105787557A (en) Design method of deep nerve network structure for computer intelligent identification
CN110852393A (en) Remote sensing image segmentation method and system
CN105869178A (en) Method for unsupervised segmentation of complex targets from dynamic scene based on multi-scale combination feature convex optimization
CN114092697B (en) Building facade semantic segmentation method with attention fused with global and local depth features
CN114092832A (en) High-resolution remote sensing image classification method based on parallel hybrid convolutional network
CN105095913B (en) The Classifying Method in Remote Sensing Image and system represented based on neighbour&#39;s canonical joint sparse
CN112766280A (en) Remote sensing image road extraction method based on graph convolution
CN111275677A (en) Ceiling earthquake damage identification method based on convolutional neural network
CN116206185A (en) Lightweight small target detection method based on improved YOLOv7
CN104850677A (en) Deep learning cellular automaton simulation analysis method based on geographic information system
CN106991411A (en) Remote Sensing Target based on depth shape priori becomes more meticulous extracting method
CN115169543A (en) Short-term photovoltaic power prediction method and system based on transfer learning
CN114693064A (en) Building group scheme generation performance evaluation method
Lin et al. Optimal CNN-based semantic segmentation model of cutting slope images

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20141015

Termination date: 20170621

CF01 Termination of patent right due to non-payment of annual fee