CN105930853A

CN105930853A - Automatic image capturing device for content generation

Info

Publication number: CN105930853A
Application number: CN201610231049.7A
Authority: CN
Inventors: 吴本刚
Original assignee: Individual
Current assignee: Individual
Priority date: 2016-04-14
Filing date: 2016-04-14
Publication date: 2016-09-07

Abstract

The invention discloses an automatic image capturing device for content generation. The device comprises an image preprocessing module, an image extreme point detection module, an image characteristic point positioning module, a main direction determining module, a feature extraction module and a content generation module, wherein the image characteristic point positioning module determines extreme points as characteristic points by rejecting noise sensitive low-contract points and instable edge points from different extreme points; and the main direction determining module connects two random adjacent peak values, in a gradient direction histogram related to the characteristic points, into a line to form multiple sub line segments, merges the adjacent sub line segments with similar gradients into line segments, and uses the direction of an optimal line segment among the multiple line segments as the main direction of the characteristic points. The device has the advantages that content generation is high in precision and speed.

Description

A kind of automatic image capture device for generating content

Technical field

The present invention relates to art of image analysis, be specifically related to a kind of automatic image capture device for generating content.

Background technology

In correlation technique, the most only image itself is identified, and does not generate its content.It is desirable that machine can not only Target is detected, additionally it is possible to as people, image is understood.Additionally, in order to substantial amounts of view data is processed, Need to improve analyzing and processing efficiency and precision.

Summary of the invention

For the problems referred to above, the present invention provides a kind of automatic image capture device for generating content.

The purpose of the present invention realizes by the following technical solutions:

Provide a kind of automatic image capture device for generating content, including:

(1) image pre-processing module, it includes the image transform subblock for coloured image is converted into gray level image and is used for The image filtering submodule that described gray level image is filtered, the gradation of image conversion formula of described image transform subblock is:

\begin{matrix} I (x, y) = \frac{m a x (R (x, y), G (x, y), B (x, y)) + m i n (R (x, y), G (x, y), B (x, y))}{2} \\ + 2 [\max (R (x, y), G (x, y), B (x, y)) - \min (R (x, y), G (x, y), B (x, y))] \end{matrix}

Wherein, (x, y), (x, y), (x, (x, y) the intensity red green blue value at place, (x y) represents I B G R y) to represent pixel respectively Pixel (x, y) gray value at place；

(2) image extreme point detection module, it by being carried out the Gauss of the image that convolution is created as by difference of Gaussian and image Difference scale space detects the position of each extreme point, when sampled point relative to it with 8 consecutive points of yardstick and neighbouring When the value of 18 points that yardstick is corresponding is the biggest, described sampled point is maximum point, when sampled point relative to it with 8 of yardstick When the value of 18 points that consecutive points are corresponding with neighbouring yardstick is the least, described sampled point is minimum point, described difference of Gaussian chi The reduced mechanical model in degree space is:

D (x, y, σ)=(G (x, k σ)-G (x, σ)) * I'(x, y)+(G (y, k σ)-G (y, σ)) * I'(x, y)

Herein

G (x, σ) = \frac{1}{\sqrt{2 π} σ} e^{- x^{2} / 2 σ^{2}}, G (y, σ) = \frac{1}{\sqrt{2 π} σ} e^{- y^{2} / 2 σ^{2}}

Wherein, D (x, y, σ) represents Gaussian difference scale space function, I'(x, is y) by the image letter of image transformant module output Number, * represents that convolution algorithm, σ represent the Gaussian function that the metric space factor, G (x, σ), G (y, σ) they are the changeable scale defined, K is constant multiplication factor；

(3) image characteristic point locating module, it is by rejecting in described each extreme point the low contrast point of noise-sensitive and not Stable marginal point determines the extreme point as characteristic point, positions for extreme point pinpoint first including be sequentially connected with Submodule, for removing the second locator module of low contrast point and for removing the 3rd locator module of mobile rim point, Wherein:

A, described first locator module are by carrying out secondary Taylor expansion to described Gaussian difference scale space function and derivation obtains The exact position of extreme point, the metric space function of extreme point is:

D (\hat{X}) = D (x, y, σ) + \frac{\partial D {(x, y, σ)}^{T}}{\partial x} \hat{X}

Wherein,Represent the metric space function of extreme point, D (x, y, σ)^TFor the side-play amount relative to extreme point,Represent The exact position of extreme point；

B, described second locator module carry out grey level enhancement, normalized successively to the image exported soon by image conversion submodule Rear rejecting described low contrast point, enhanced gray value is:

Herein

Described low contrast point judge formula as:

D (\hat{X}) < T_{1}, T_{1} &Element; [0.01, 0.06]

Wherein, I " (x, y) represents the enhanced image function of gray value,For comprising the correction coefficient of local message, M is The maximum gradation value of pixel, described maximum gradation value M=255, m_HFor all pixels higher than 128 of the gray value in image Average, m_LIt is the gray value average that is less than all pixels of 128, ψ (x, y) is the image after being processed by image filtering submodule, T₁For the threshold value set；

C, described 3rd locator module obtain this extreme value by the Hessian matrix H that Location Scale is 2 × 2 calculating extreme point The principal curvatures of point, and by rejecting principal curvatures ratio more than threshold value T set₂Extreme point reject described mobile rim point, Wherein threshold value T₂Span be [10,15], described principal curvatures ratio is come really by the ratio between the eigenvalue of comparator matrix H Fixed；

Preferably, the described automatic image capture device for generating content, also include:

(1) principal direction determines module, including the connection sub module being sequentially connected with, merges submodule and processes submodule, described company Line is used for two peak value lines of the arbitrary neighborhood in the gradient orientation histogram about described characteristic point in module to form many height Line segment, described merging submodule is for merging formation one in the longitudinal direction by having close slope and adjacent sub-line section Line segment, described process submodule for using the direction of the optimum line segment in a plurality of line segment as the principal direction of characteristic point, described optimum Line segment judge formula as:

L_{Y} = L_{{\overset{&OverBar;}{g}}_{\max}}, {\overset{&OverBar;}{g}}_{\max} = \max ({\overset{&OverBar;}{g}}_{L_{n}}), {\overset{&OverBar;}{g}}_{L_{n}} = \frac{1}{k} Σ_{k = 1}^{k} g_{k}, L_{n} &Element; L_{&upsi;})

Wherein, L_YRepresent optimum line segment,For average gradient value it isLine segment,For nth bar in described a plurality of line segment The average gradient value of line segment, g_kFor the kth strip line segment in described nth bar line segment, L_υFor described a plurality of line segment middle conductor length Line segment aggregate more than average line segment length；

(2) characteristic extracting module, it is according to described main formula always hyperspin feature neighborhood of a point, and according to postrotational neighborhood to institute State characteristic point to be described, thus generate the descriptor of described characteristic point；

(3) content generating module, the feature of extraction, through processing, completes content and generates.

Further, the sub-line section described in close slope is that slope differences is less than predetermined threshold value T₃Sub-line section, described threshold value T₃'s Span be (0,0.1].

The invention have the benefit that

1, the image pre-processing module arranged considers visual custom and the human eye perceptibility to different color with colouring intensity Non-linear relation, it is possible to describe image the most accurately；

2, propose the reduced mechanical model of Gaussian difference scale space, decrease operand, improve arithmetic speed, Jin Erti The high speed of graphical analysis；

3, the image characteristic point locating module arranged carries out low contrast point and the removal of mobile rim point to extreme point, it is ensured that special Levy effectiveness a little, wherein the gray value of image is strengthened, it is possible to be greatly increased the stability of image, the most right Low contrast point is removed, and then improves the accuracy of graphical analysis；

4, principal direction is set and determines module, it is proposed that the judgement formula of optimum line segment, with appointing in characteristic point gradient orientation histogram The direction of the optimum line segment in the line segment that adjacent two peak value lines of anticipating are formed is as the principal direction of characteristic point, and line segment is relative to point more Add stable so that the descriptor of image characteristic of correspondence point has repeatability, improves the accuracy of feature descriptor, and then More fast and accurately image can be identified detection, there is the highest robustness.

Accompanying drawing explanation

The invention will be further described to utilize accompanying drawing, but the embodiment in accompanying drawing does not constitute any limitation of the invention, for Those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to obtains the attached of other according to the following drawings Figure.

Fig. 1 is the connection diagram of each module of the present invention.

Detailed description of the invention

The invention will be further described with the following Examples.

Embodiment 1

Seeing Fig. 1, the present embodiment is used for generating the automatic image capture device of content, including:

\begin{matrix} I (x, y) = \frac{m a x (R (x, y), G (x, y), B (x, y)) + m i n (R (x, y), G (x, y), B (x, y))}{2} \\ + 2 [\max (R (x, y), G (x, y), B (x, y)) - \min (R (x, y), G (x, y), B (x, y))] \end{matrix}

Herein

G (x, σ) = \frac{1}{\sqrt{2 π} σ} e^{- x^{2} / 2 σ^{2}}, G (y, σ) = \frac{1}{\sqrt{2 π} σ} e^{- y^{2} / 2 σ^{2}}

D (\hat{X}) = D (x, y, σ) + \frac{\partial D {(x, y, σ)}^{T}}{\partial x} \hat{X}

Herein

Described low contrast point judge formula as:

D (\hat{X}) < T_{1}, T_{1} &Element; [0.01, 0.06]

L_{Y} = L_{{\overset{&OverBar;}{g}}_{\max}}, {\overset{&OverBar;}{g}}_{\max} = \max ({\overset{&OverBar;}{g}}_{L_{n}}), {\overset{&OverBar;}{g}}_{L_{n}} = \frac{1}{k} Σ_{k = 1}^{k} g_{k}, L_{n} &Element; L_{&upsi;})

It is strong with color that the image pre-processing module that the present embodiment is arranged considers visual custom and the human eye perceptibility to different color The non-linear relation of degree, it is possible to describe image the most accurately；Propose the reduced mechanical model of Gaussian difference scale space, subtract Lack operand, improve arithmetic speed, and then improve the speed of graphical analysis；The image characteristic point locating module pair arranged Extreme point carries out low contrast point and the removal of mobile rim point, it is ensured that the effectiveness of characteristic point, the wherein gray value to image Strengthen, it is possible to be greatly increased the stability of image, the most accurate low contrast point is removed, and then improve image The accuracy analyzed；Principal direction is set and determines module, it is proposed that the judgement formula of optimum line segment, with characteristic point gradient direction Nogata The direction of the optimum line segment in the line segment that two peak value lines of the arbitrary neighborhood in figure are formed is as the principal direction of characteristic point, line segment phase More stable for point so that the descriptor of image characteristic of correspondence point has repeatability, improves the accurate of feature descriptor Property, and then can more fast and accurately image be identified detection, there is the highest robustness；The present embodiment takes threshold value T₁=0.01, T₂=10, T₃=0.1, the precision generating content improves 2%, and speed improves 1%.

Embodiment 2

\begin{matrix} I (x, y) = \frac{m a x (R (x, y), G (x, y), B (x, y)) + m i n (R (x, y), G (x, y), B (x, y))}{2} \\ + 2 [\max (R (x, y), G (x, y), B (x, y)) - \min (R (x, y), G (x, y), B (x, y))] \end{matrix}

Herein

G (x, σ) = \frac{1}{\sqrt{2 π} σ} e^{- x^{2} / 2 σ^{2}}, G (y, σ) = \frac{1}{\sqrt{2 π} σ} e^{- y^{2} / 2 σ^{2}}

D (\hat{X}) = D (x, y, σ) + \frac{\partial D {(x, y, σ)}^{T}}{\partial x} \hat{X}

Herein

Described low contrast point judge formula as:

D (\hat{X}) < T_{1}, T_{1} &Element; [0.01, 0.06]

L_{Y} = L_{{\overset{&OverBar;}{g}}_{\max}}, {\overset{&OverBar;}{g}}_{\max} = \max ({\overset{&OverBar;}{g}}_{L_{n}}), {\overset{&OverBar;}{g}}_{L_{n}} = \frac{1}{k} Σ_{k = 1}^{k} g_{k}, L_{n} &Element; L_{v})

It is strong with color that the image pre-processing module that the present embodiment is arranged considers visual custom and the human eye perceptibility to different color The non-linear relation of degree, it is possible to describe image the most accurately；Propose the reduced mechanical model of Gaussian difference scale space, subtract Lack operand, improve arithmetic speed, and then improve the speed of graphical analysis；The image characteristic point locating module pair arranged Extreme point carries out low contrast point and the removal of mobile rim point, it is ensured that the effectiveness of characteristic point, the wherein gray value to image Strengthen, it is possible to be greatly increased the stability of image, the most accurate low contrast point is removed, and then improve image The accuracy analyzed；Principal direction is set and determines module, it is proposed that the judgement formula of optimum line segment, with characteristic point gradient direction Nogata The direction of the optimum line segment in the line segment that two peak value lines of the arbitrary neighborhood in figure are formed is as the principal direction of characteristic point, line segment phase More stable for point so that the descriptor of image characteristic of correspondence point has repeatability, improves the accurate of feature descriptor Property, and then can more fast and accurately image be identified detection, there is the highest robustness；The present embodiment takes threshold value T₁=0.02, T₂=11, T₃=0.08, the precision generating content improves 1%, and speed improves 1.5%.

Embodiment 3

\begin{matrix} I (x, y) = \frac{m a x (R (x, y), G (x, y), B (x, y)) + m i n (R (x, y), G (x, y), B (x, y))}{2} \\ + 2 [\max (R (x, y), G (x, y), B (x, y)) - \min (R (x, y), G (x, y), B (x, y))] \end{matrix}

Herein

G (x, σ) = \frac{1}{\sqrt{2 π} σ} e^{- x^{2} / 2 σ^{2}}, G (y, σ) = \frac{1}{\sqrt{2 π} σ} e^{- y^{2} / 2 σ^{2}}

D (\hat{X}) = D (x, y, σ) + \frac{\partial D {(x, y, σ)}^{T}}{\partial x} \hat{X}

Herein

Described low contrast point judge formula as:

D (\hat{X}) < T_{1}, T_{1} &Element; [0.01, 0.06]

L_{Y} = L_{{\overset{&OverBar;}{g}}_{\max}}, {\overset{&OverBar;}{g}}_{\max} = \max ({\overset{&OverBar;}{g}}_{L_{n}}), {\overset{&OverBar;}{g}}_{L_{n}} = \frac{1}{k} Σ_{k = 1}^{k} g_{k}, L_{n} &Element; L_{&upsi;})

It is strong with color that the image pre-processing module that the present embodiment is arranged considers visual custom and the human eye perceptibility to different color The non-linear relation of degree, it is possible to describe image the most accurately；Propose the reduced mechanical model of Gaussian difference scale space, subtract Lack operand, improve arithmetic speed, and then improve the speed of graphical analysis；The image characteristic point locating module pair arranged Extreme point carries out low contrast point and the removal of mobile rim point, it is ensured that the effectiveness of characteristic point, the wherein gray value to image Strengthen, it is possible to be greatly increased the stability of image, the most accurate low contrast point is removed, and then improve image The accuracy analyzed；Principal direction is set and determines module, it is proposed that the judgement formula of optimum line segment, with characteristic point gradient direction Nogata The direction of the optimum line segment in the line segment that two peak value lines of the arbitrary neighborhood in figure are formed is as the principal direction of characteristic point, line segment phase More stable for point so that the descriptor of image characteristic of correspondence point has repeatability, improves the accurate of feature descriptor Property, and then can more fast and accurately image be identified detection, there is the highest robustness；The present embodiment takes threshold value T₁=0.03, T₂=12, T₃=0.06, the precision generating content improves 2.5%, and speed improves 3%.

Embodiment 4

\begin{matrix} I (x, y) = \frac{m a x (R (x, y), G (x, y), B (x, y)) + m i n (R (x, y), G (x, y), B (x, y))}{2} \\ + 2 [\max (R (x, y), G (x, y), B (x, y)) - \min (R (x, y), G (x, y), B (x, y))] \end{matrix}

Herein

G (x, σ) = \frac{1}{\sqrt{2 π} σ} e^{- x^{2} / 2 σ^{2}}, G (y, σ) = \frac{1}{\sqrt{2 π} σ} e^{- y^{2} / 2 σ^{2}}

D (\hat{X}) = D (x, y, σ) + \frac{\partial D {(x, y, σ)}^{T}}{\partial x} \hat{X}

Herein

Described low contrast point judge formula as:

D (\hat{X}) < T_{1}, T_{1} &Element; [0.01, 0.06]

L_{Y} = L_{{\overset{&OverBar;}{g}}_{\max}}, {\overset{&OverBar;}{g}}_{\max} = \max ({\overset{&OverBar;}{g}}_{L_{n}}), {\overset{&OverBar;}{g}}_{L_{n}} = \frac{1}{k} Σ_{k = 1}^{k} g_{k}, L_{n} &Element; L_{&upsi;})

It is strong with color that the image pre-processing module that the present embodiment is arranged considers visual custom and the human eye perceptibility to different color The non-linear relation of degree, it is possible to describe image the most accurately；Propose the reduced mechanical model of Gaussian difference scale space, subtract Lack operand, improve arithmetic speed, and then improve the speed of graphical analysis；The image characteristic point locating module pair arranged Extreme point carries out low contrast point and the removal of mobile rim point, it is ensured that the effectiveness of characteristic point, the wherein gray value to image Strengthen, it is possible to be greatly increased the stability of image, the most accurate low contrast point is removed, and then improve image The accuracy analyzed；Principal direction is set and determines module, it is proposed that the judgement formula of optimum line segment, with characteristic point gradient direction Nogata The direction of the optimum line segment in the line segment that two peak value lines of the arbitrary neighborhood in figure are formed is as the principal direction of characteristic point, line segment phase More stable for point so that the descriptor of image characteristic of correspondence point has repeatability, improves the accurate of feature descriptor Property, and then can more fast and accurately image be identified detection, there is the highest robustness；The present embodiment takes threshold value T₁=0.04, T₂=13, T₃=0.04, the precision generating content improves 1.5%, and speed improves 2%.

Embodiment 5

\begin{matrix} I (x, y) = \frac{m a x (R (x, y), G (x, y), B (x, y)) + m i n (R (x, y), G (x, y), B (x, y))}{2} \\ + 2 [\max (R (x, y), G (x, y), B (x, y)) - \min (R (x, y), G (x, y), B (x, y))] \end{matrix}

Herein

G (x, σ) = \frac{1}{\sqrt{2 π} σ} e^{- x^{2} / 2 σ^{2}}, G (y, σ) = \frac{1}{\sqrt{2 π} σ} e^{- y^{2} / 2 σ^{2}}

D (\hat{X}) = D (x, y, σ) + \frac{\partial D {(x, y, σ)}^{T}}{\partial x} \hat{X}

Herein

Described low contrast point judge formula as:

D (\hat{X}) < T_{1}, T_{1} &Element; [0.01, 0.06]

L_{Y} = L_{{\overset{&OverBar;}{g}}_{\max}}, {\overset{&OverBar;}{g}}_{\max} = \max ({\overset{&OverBar;}{g}}_{L_{n}}), {\overset{&OverBar;}{g}}_{L_{n}} = \frac{1}{k} Σ_{k = 1}^{k} g_{k}, L_{n} &Element; L_{&upsi;})

It is strong with color that the image pre-processing module that the present embodiment is arranged considers visual custom and the human eye perceptibility to different color The non-linear relation of degree, it is possible to describe image the most accurately；Propose the reduced mechanical model of Gaussian difference scale space, subtract Lack operand, improve arithmetic speed, and then improve the speed of graphical analysis；The image characteristic point locating module pair arranged Extreme point carries out low contrast point and the removal of mobile rim point, it is ensured that the effectiveness of characteristic point, the wherein gray value to image Strengthen, it is possible to be greatly increased the stability of image, the most accurate low contrast point is removed, and then improve image The accuracy analyzed；Principal direction is set and determines module, it is proposed that the judgement formula of optimum line segment, with characteristic point gradient direction Nogata The direction of the optimum line segment in the line segment that two peak value lines of the arbitrary neighborhood in figure are formed is as the principal direction of characteristic point, line segment phase More stable for point so that the descriptor of image characteristic of correspondence point has repeatability, improves the accurate of feature descriptor Property, and then can more fast and accurately image be identified detection, there is the highest robustness；The present embodiment takes threshold value T₁=0.05, T₂=14, T₃=0.02, the precision generating content improves 1.8%, and speed improves 1.5%.

Last it should be noted that, above example is only in order to illustrate technical scheme, rather than to scope Restriction, although having made to explain to the present invention with reference to preferred embodiment, it will be understood by those within the art that, Technical scheme can be modified or equivalent, without deviating from the spirit and scope of technical solution of the present invention.

Claims

1., for generating an automatic image capture device for content, it is characterized in that, including:

\begin{matrix} I (x, y) = \frac{m a x (R (x, y), G (x, y), B (x, y)) + m i n (R (x, y), G (x, y), B (x, y))}{2} \\ + 2 [m a x (R (x, y), G (x, y), B (x, y)) - m i n (R (x, y), G (x, y), B (x, y))] \end{matrix}

(2) image extreme point detection module, it by being carried out the Gauss of the image that convolution is created as by difference of Gaussian and image Difference scale space detects the position of each extreme point, when sampled point relative to it with 8 consecutive points of yardstick and neighbouring chi When the value of 18 points that degree is corresponding is the biggest, described sampled point is maximum point, when sampled point relative to it with 8 phases of yardstick When the value of 18 points that adjoint point is corresponding with neighbouring yardstick is the least, described sampled point is minimum point, described Gaussian difference scale The reduced mechanical model in space is:

Herein

G (x, σ) = \frac{1}{\sqrt{2 π} σ} e^{- x^{2} / 2 σ^{2}}, G (y, σ) = \frac{1}{\sqrt{2 π} σ} e^{- y^{2} / 2 σ^{2}}

(3) image characteristic point locating module, it is by rejecting in described each extreme point the low contrast point of noise-sensitive and not Stable marginal point determines the extreme point as characteristic point, including be sequentially connected with for pinpoint first locator of extreme point Module, for removing the second locator module of low contrast point and for removing the 3rd locator module of mobile rim point, its In:

D (\hat{X}) = D (x, y, σ) + \frac{\partial D {(x, y, σ)}^{T}}{\partial x} \hat{X}

B, described second locator module carry out grey level enhancement, normalization successively to the image exported soon by image conversion submodule Rejecting described low contrast point after reason, enhanced gray value is:

Herein

Described low contrast point judge formula as:

D (\hat{X}) < T_{1}, T_{1} &Element; [0.01, 0.06]

Wherein, I " (x, y) represents the enhanced image function of gray value,For comprising the correction coefficient of local message, M is The maximum gradation value of pixel, described maximum gradation value M=255, m_HFor all pixels equal higher than 128 of the gray value in image Value, m_LBeing the gray value average that is less than all pixels of 128, (x y) is the image after being processed by image filtering submodule, T to ψ₁ For the threshold value set；

C, described 3rd locator module obtain this extreme value by the Hessian matrix H that Location Scale is 2 × 2 calculating extreme point The principal curvatures of point, and by rejecting principal curvatures ratio more than threshold value T set₂Extreme point reject described mobile rim point, Wherein threshold value T₂Span be [10,15], described principal curvatures ratio is come really by the ratio between the eigenvalue of comparator matrix H Fixed.

A kind of automatic image capture device for generating content the most according to claim 1, is characterized in that, also include:

L_{Y} = L_{{\overset{&OverBar;}{g}}_{\max}}, {\overset{&OverBar;}{g}}_{\max} = m a x ({\overset{&OverBar;}{g}}_{L_{n}}), {\overset{&OverBar;}{g}}_{L_{n}} = \frac{1}{k} Σ_{k = 1}^{k} g_{k}, L_{n} &Element; L_{&upsi;})

A kind of automatic image capture device for generating content the most according to claim 1, is characterized in that, described in have close The sub-line section of slope is that slope differences is less than predetermined threshold value T₃Sub-line section, described threshold value T₃Span be (0,0.1].