CN110378355B - FAST feature point detection method based on FPGA hardware fusion image - Google Patents
- Publication number
- CN110378355B (application CN201910581890.2A)
- Authority
- CN
- China
- Prior art keywords
- image
- pixel
- pyramid
- point
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/46—Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
- G06V10/462—Salient features, e.g. scale invariant feature transforms [SIFT]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10048—Infrared image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20016—Hierarchical, coarse-to-fine, multiscale or multiresolution image processing; Pyramid transform
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20024—Filtering details
- G06T2207/20032—Median filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
Abstract
The invention provides a FAST feature point detection method based on an FPGA hardware fusion image, comprising the following steps: step S1, inputting two video streams, an infrared image and a visible light image; step S2, preprocessing the two input images separately; step S3, building a Gaussian pyramid for each input image; step S4, building a difference-of-Gaussian pyramid from each Gaussian pyramid; step S5, fusing the two Laplacian pyramids according to a functional relation; step S6, performing FAST feature point detection on each pixel of the fused Laplacian pyramid image; step S7, outputting the image through an OLED display module.
Description
Technical Field
The invention relates to video processing technology, and in particular to a FAST feature point detection method based on an FPGA hardware fusion image.
Background
The FAST feature detection algorithm is a fast corner detection algorithm. It computes faster than comparable algorithms, but when the gray-level contrast of an image is weak, or under complex illumination, it produces a high false detection rate. When an image contains many noise points its robustness is poor, and the FAST operator, lacking a multi-scale structure, is not scale invariant.
Disclosure of Invention
The invention aims to provide a FAST feature point detection method based on an FPGA hardware fusion image.
The technical scheme for realizing the purpose of the invention is as follows: a FAST feature point detection method based on FPGA hardware fusion images comprises the following steps:
step S1, inputting two video streams: an infrared image and a visible light image;
step S2, preprocessing the two input images separately;
step S3, building a Gaussian pyramid for each of the two input images;
step S4, building a difference-of-Gaussian pyramid from each of the two Gaussian pyramids;
step S5, fusing the two Laplacian pyramids according to a functional relation;
step S6, performing FAST feature point detection on each pixel of the fused Laplacian pyramid image;
step S7, outputting the image through the OLED display module.
Further, the preprocessing in step S2 specifically includes:
step S201, the two detectors each input an 8-bit signal into the FPGA chip through I/O ports;
step S202, stretching the infrared image contrast by histogram equalization;
step S203, removing visible light image noise by median filtering.
Further, step S202 specifically includes:
storing a complete image during the active video interval;
counting the occurrences of each gray value in the current frame, using a dual-port RAM (DPRAM) on the FPGA chip as the record memory: the 256 gray levels occupy addresses 0, 1, …, 255, and each address register records the occurrence count of the corresponding gray level;
reading the data out of the RAM during the image blanking interval, performing accumulation, and computing the upper limit A1 and the lower limit A2 of the image gray range;
after the valid signal of the next frame arrives, linearly stretching the infrared image using the upper and lower limit parameters A1 and A2 obtained during the blanking interval.
Further, the dividend is shifted left by n bits through a shift register for amplification, and the computed result is shifted right by n bits to restore the scale; the total number of image pixels is used as the divisor.
Further, step S203 specifically includes: for each pixel, a 3×3 template is constructed; the pixels in each row are sorted in descending order; the minimum of the column of maxima, the median of the middle column and the maximum of the column of minima are compared, and the median of these three values replaces the pixel.
Further, step S3 selects a 3-layer Gaussian pyramid; the specific process is as follows: each layer G_1, G_2, …, G_N of the Gaussian pyramid is obtained by the low-pass filtering and row/column-skipping downsampling shown in formula (1):

G_l(i, j) = Σ(m=-2..2) Σ(n=-2..2) ω(m, n) · G_{l-1}(2i+m, 2j+n),  1 ≤ l ≤ N   (1)

where N is the number of decomposition layers, ω(m, n) is the filter coefficient at coordinate (m, n) of the filter template, (i, j) is the coordinate of the current pixel, G_0 is the original image, and each layer of the pyramid has 1/4 the pixels of the previous layer.
Further, step S4 specifically includes: the Gaussian pyramid layers obtained by downsampling in step S3 are interpolated and enlarged by formula (2) to obtain G'_1 and G'_2, so that G'_l has the same resolution as G_l:

G'_l(i, j) = 4 · Σ(m=-2..2) Σ(n=-2..2) ω(m, n) · G_{l+1}((i+m)/2, (j+n)/2),  l = 1, 2   (2)

where only the terms for which (i+m)/2 and (j+n)/2 are integers are included in the sum.
Further, step S5 specifically includes: the first layer of the Laplacian pyramid is obtained as LP_1 = G_1 − G'_1, and the intermediate layer as LP_2 = G_2 − G'_2; the uppermost layer G_3 remains unchanged as the top layer of the Laplacian pyramid.
Further, step S6 includes:
step S601, constructing a 7×7 template and traversing the pixel points;
step S602, defining a threshold t and calculating the pixel differences between the center pixel p and the points p_14, p_74, p_41 and p_47 on the circle of radius 3; if the absolute pixel difference exceeds the threshold t at 3 or more of these points, the pixel is marked as a candidate point for the next operation, otherwise it is ignored;
step S603, if the pixel p is a candidate point, further calculating the pixel differences between the center pixel p and the 16 points p_13, p_14, p_15, p_22, p_26, p_31, p_37, p_41, p_47, p_51, p_57, p_62, p_66, p_73, p_74, p_75 on the circle of radius 3; if the absolute pixel difference exceeds the threshold t at 9 or more consecutive points, the pixel is marked as a feature point;
step S604, calculating the sum of the pixel differences between the 16 surrounding points and the center point of the feature point, recorded as s;
step S605, taking a 3×3 template; if two or more corner points fall within the template range, their s values are compared and the one with the maximum value is kept as the corner point; if there is only one, its corner value is retained;
step S606, performing the Laplacian pyramid reconstruction.
Compared with the prior art, the invention has the following advantages: (1) the method performs feature point detection on a fused infrared/visible dual-band image, which addresses the low detection rate and poor robustness of the existing algorithm; (2) the parallel execution capability and pipelined operation of the FPGA are strong advantages in image processing with large data volumes, and the configuration is flexible and modifiable.
The invention is further described below with reference to the accompanying drawings.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Fig. 2 is a schematic diagram of the points on the circle of radius 3 around the center pixel in step S6.
Detailed Description
With reference to fig. 1, a method for detecting FAST feature points based on FPGA hardware fusion images includes the following steps:
step S1, inputting two video streams: an infrared image and a visible light image;
step S2, preprocessing the two input images separately;
step S3, building a Gaussian pyramid for each of the two input images;
step S4, building a difference-of-Gaussian pyramid from each of the two Gaussian pyramids;
step S5, fusing the two Laplacian pyramids according to a functional relation;
step S6, performing FAST feature point detection on each pixel of the fused Laplacian pyramid image;
step S7, outputting the image through the OLED display module.
The preprocessing in step S2 specifically includes:
step S201, the two detectors each input an 8-bit signal into the FPGA chip through I/O ports;
step S202, stretching the infrared image contrast by histogram equalization;
step S203, removing visible light image noise by median filtering.
Step S202 specifically includes:
storing a complete image during the active video interval;
counting the occurrences of each gray value in the current frame, using a dual-port RAM (DPRAM) on the FPGA chip as the record memory: the 256 gray levels occupy addresses 0, 1, …, 255, and each address register records the occurrence count of the corresponding gray level;
reading the data out of the RAM during the image blanking interval, performing accumulation, and computing the upper limit A1 and the lower limit A2 of the image gray range;
after the valid signal of the next frame arrives, linearly stretching the infrared image using the upper and lower limit parameters A1 and A2 obtained during the blanking interval.
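The per-frame histogram statistics and the A1/A2 linear stretch can be modelled in software as below. This is only a behavioural sketch of the hardware flow: the cutoff fractions used to locate A1 and A2, and the function name, are illustrative assumptions, since the patent only states that the limits are found by accumulating the histogram during the blanking interval.

```python
def linear_stretch(pixels, low_frac=0.01, high_frac=0.99):
    """Software model of the histogram-based stretch of step S202.

    low_frac / high_frac are assumed cutoffs, not values from the patent.
    """
    # Histogram: one counter per gray level, as in the DPRAM record.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)

    # Accumulate (the "blanking interval" pass) to find the lower
    # limit A2 and the upper limit A1 of the gray range.
    cum, a2, a1 = 0, None, None
    for g in range(256):
        cum += hist[g]
        if a2 is None and cum >= low_frac * total:
            a2 = g
        if a1 is None and cum >= high_frac * total:
            a1 = g

    if a1 is None or a2 is None or a1 <= a2:
        return list(pixels)  # degenerate histogram: pass through

    # Linear stretch applied to the next frame's pixels.
    return [max(0, min(255, (p - a2) * 255 // (a1 - a2))) for p in pixels]
```

On a frame whose gray values span only part of the 0–255 range, the output is remapped so the occupied range covers the full 8-bit scale.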
The dividend is shifted left by n bits through a shift register for amplification, and the computed result is shifted right by n bits to restore the scale; the total number of image pixels is used as the divisor.
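The shift-register trick above can be modelled as fixed-point arithmetic: amplify by 2^n, do the work with integer operations only, then shift right by n bits. The word width n = 16 and the function name are assumptions of this sketch; the patent does not specify them.

```python
def scale_by_fraction(value, numer, denom, n=16):
    """Compute value * numer / denom without a hardware divider.

    The quotient numer/denom is amplified by shifting left n bits,
    evaluated in integer arithmetic, and the final product is shifted
    right n bits to restore the scale.
    """
    q = (numer << n) // denom   # fixed-point approximation of numer/denom
    return (value * q) >> n
```

In the histogram stretch, `denom` would be the total number of image pixels; for 8-bit data and n = 16 the result is accurate to within roughly one gray level of the true quotient.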
Step S203 specifically includes: for each pixel, a 3×3 template is constructed; the pixels in each row are sorted in descending order; the minimum of the column of maxima, the median of the middle column and the maximum of the column of minima are compared, and the median of these three values replaces the pixel.
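The three-stage comparison of step S203 computes the exact 3×3 median while sorting only three values at a time, which is why it maps well to hardware. A software sketch (function names are my own):

```python
def median3(a, b, c):
    # Median of three values using only min/max comparisons.
    return max(min(a, b), min(max(a, b), c))

def median9(window):
    """Exact 3x3 median via the row-sorting shortcut of step S203."""
    rows = [sorted(window[r]) for r in range(3)]       # sort each row
    mn = max(rows[r][0] for r in range(3))             # max of the minima
    md = median3(rows[0][1], rows[1][1], rows[2][1])   # median of medians
    mx = min(rows[r][2] for r in range(3))             # min of the maxima
    return median3(mn, md, mx)
```

Only six of the nine partially sorted values ever reach the final comparison stage, yet the result equals the median of all nine pixels.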
In step S3, a 3-layer Gaussian pyramid is selected, specifically: each layer G_1, G_2, …, G_N of the Gaussian pyramid is obtained by the low-pass filtering and row/column-skipping downsampling shown in formula (1):

G_l(i, j) = Σ(m=-2..2) Σ(n=-2..2) ω(m, n) · G_{l-1}(2i+m, 2j+n),  1 ≤ l ≤ N   (1)

where N is the number of decomposition layers, ω(m, n) is the filter coefficient at coordinate (m, n) of the filter template, (i, j) is the coordinate of the current pixel, G_0 is the original image, and each layer of the pyramid has 1/4 the pixels of the previous layer.
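The REDUCE step of formula (1) can be sketched in software as follows. The patent does not give the 5×5 weights ω(m, n), so the classic (1, 4, 6, 4, 1)/16 separable kernel is assumed here, and border pixels are handled by renormalizing over the in-bounds taps (another assumption of this sketch).

```python
# Separable 5x5 low-pass weights; an assumed stand-in for omega(m, n).
W = [1, 4, 6, 4, 1]

def reduce_level(img):
    """One application of formula (1): low-pass filter G_{l-1}, then
    keep every other row and column, so G_l has 1/4 the pixels."""
    h, w = len(img), len(img[0])
    out = []
    for i in range(0, h, 2):
        row = []
        for j in range(0, w, 2):
            acc = tot = 0
            for m in range(-2, 3):
                for n in range(-2, 3):
                    y, x = i + m, j + n
                    if 0 <= y < h and 0 <= x < w:
                        wgt = W[m + 2] * W[n + 2]
                        acc += wgt * img[y][x]
                        tot += wgt   # renormalise at the borders
            row.append(acc // tot)
        out.append(row)
    return out
```

Calling `reduce_level` twice on the preprocessed frame yields the three layers G_1, G_2, G_3 used below.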
Step S4 specifically includes: the Gaussian pyramid layers obtained by downsampling in step S3 are interpolated and enlarged by formula (2) to obtain G'_1 and G'_2, so that G'_l has the same resolution as G_l:

G'_l(i, j) = 4 · Σ(m=-2..2) Σ(n=-2..2) ω(m, n) · G_{l+1}((i+m)/2, (j+n)/2),  l = 1, 2   (2)

where only the terms for which (i+m)/2 and (j+n)/2 are integers are included in the sum.
Step S5 specifically includes: the first layer of the Laplacian pyramid is obtained as LP_1 = G_1 − G'_1, and the intermediate layer as LP_2 = G_2 − G'_2; the uppermost layer G_3 remains unchanged as the top layer of the Laplacian pyramid.
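Steps S4 and S5 can be sketched end-to-end as follows. Two things here are assumptions of the sketch, not claims of the patent: nearest-neighbour upsampling stands in for the ω-weighted interpolation of formula (2), and the layer indexing (LP_l = G_l − G'_l, with G'_l expanded from G_{l+1}) is one consistent reading of the text.

```python
def subtract(a, b):
    # Element-wise difference of two equally sized images.
    return [[pa - pb for pa, pb in zip(ra, rb)] for ra, rb in zip(a, b)]

def expand_level(img):
    # Simplified EXPAND: nearest-neighbour upsampling stands in for
    # the weighted interpolation of formula (2).
    return [[img[i // 2][j // 2] for j in range(2 * len(img[0]))]
            for i in range(2 * len(img))]

def laplacian_pyramid(g):
    """Build [LP_1, LP_2, G_3] from a 3-layer Gaussian pyramid
    g = [G_1, G_2, G_3] (one reading of steps S4-S5)."""
    lp1 = subtract(g[0], expand_level(g[1]))   # LP_1 = G_1 - G'_1
    lp2 = subtract(g[1], expand_level(g[2]))   # LP_2 = G_2 - G'_2
    return [lp1, lp2, g[2]]                    # top layer G_3 unchanged
```

The same construction is applied to the infrared and the visible pyramids before the layer-wise fusion of step S5.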
With reference to fig. 2, step S6 includes:
step S601, constructing a 7×7 template and traversing the pixel points;
step S602, defining a threshold t and calculating the pixel differences between the center pixel p and the points p_14, p_74, p_41 and p_47 on the circle of radius 3; if the absolute pixel difference exceeds the threshold t at 3 or more of these points, the pixel is marked as a candidate point for the next operation, otherwise it is ignored;
step S603, if the pixel p is a candidate point, further calculating the pixel differences between the center pixel p and the 16 points p_13, p_14, p_15, p_22, p_26, p_31, p_37, p_41, p_47, p_51, p_57, p_62, p_66, p_73, p_74, p_75 on the circle of radius 3; if the absolute pixel difference exceeds the threshold t at 9 or more consecutive points, the pixel is marked as a feature point;
step S604, calculating the sum of the pixel differences between the 16 surrounding points and the center point of the feature point, recorded as s;
step S605, taking a 3×3 template; if two or more corner points fall within the template range, their s values are compared and the one with the maximum value is kept as the corner point; if there is only one, its corner value is retained;
step S606, performing the Laplacian pyramid reconstruction.
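The two-stage segment test of steps S602 and S603 can be sketched in software as below. The clockwise ordering chosen for the 16 ring points and the helper names are assumptions of this sketch; the point set itself is the p_13…p_75 list from step S603, written as 1-based (row, col) positions inside the 7×7 template.

```python
# The 16 ring points of step S603 in clockwise order, centre at (4, 4).
RING = [(1, 4), (1, 5), (2, 6), (3, 7), (4, 7), (5, 7), (6, 6), (7, 5),
        (7, 4), (7, 3), (6, 2), (5, 1), (4, 1), (3, 1), (2, 2), (1, 3)]
CARDINAL = [(1, 4), (7, 4), (4, 1), (4, 7)]   # p_14, p_74, p_41, p_47

def is_fast_corner(img, r, c, t):
    """Two-stage segment test of steps S602-S603 (software sketch)."""
    center = img[r][c]

    def exceeds(pt):
        # Map template coordinates (1..7) to image coordinates.
        dr, dc = pt
        return abs(img[r + dr - 4][c + dc - 4] - center) > t

    # Stage 1 (S602): candidate only if 3 or more of the 4 cardinal
    # points differ from the centre by more than the threshold t.
    if sum(exceeds(p) for p in CARDINAL) < 3:
        return False

    # Stage 2 (S603): feature point if 9 or more consecutive ring
    # points differ by more than t (doubled list handles wrap-around).
    flags = [exceeds(p) for p in RING]
    run = best = 0
    for ok in flags + flags:
        run = run + 1 if ok else 0
        best = max(best, run)
    return best >= 9
```

The cheap four-point pretest rejects flat regions and straight edges before the full 16-point contiguity test runs, which is what makes the detector fast in hardware as well.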
Claims (8)
1. A FAST feature point detection method based on FPGA hardware fusion images is characterized by comprising the following steps:
step S1, inputting two video streams: an infrared image and a visible light image;
step S2, preprocessing the two input images separately;
step S3, building a Gaussian pyramid for each of the two input images;
step S4, building a difference-of-Gaussian pyramid from each of the two Gaussian pyramids;
step S5, fusing the two Laplacian pyramids according to a functional relation;
step S6, performing FAST feature point detection on each pixel of the fused Laplacian pyramid image, comprising the following steps:
step S601, constructing a 7×7 template and traversing the pixel points;
step S602, defining a threshold t and calculating the pixel differences between the center pixel p and the points p_14, p_74, p_41 and p_47 on the circle of radius 3; if the absolute pixel difference exceeds the threshold t at 3 or more of these points, marking the pixel as a candidate point for the next operation, otherwise ignoring it;
step S603, if the pixel p is a candidate point, further calculating the pixel differences between the center pixel p and the 16 points p_13, p_14, p_15, p_22, p_26, p_31, p_37, p_41, p_47, p_51, p_57, p_62, p_66, p_73, p_74, p_75 on the circle of radius 3; if the absolute pixel difference exceeds the threshold t at 9 or more consecutive points, marking the pixel as a feature point;
step S604, calculating the sum of the pixel differences between the 16 surrounding points and the center point of the feature point, recorded as s;
step S605, taking a 3×3 template; if two or more corner points fall within the template range, comparing their s values and keeping the one with the maximum value as the corner point; if there is only one, retaining its corner value;
step S606, reconstructing the Laplacian pyramid;
step S7, outputting the image through the OLED display module.
2. The method according to claim 1, wherein the preprocessing in step S2 specifically comprises:
step S201, the two detectors each inputting an 8-bit signal into the FPGA chip through I/O ports;
step S202, stretching the infrared image contrast by histogram equalization;
step S203, removing visible light image noise by median filtering.
3. The method according to claim 2, wherein step S202 specifically comprises:
storing a complete image during the active video interval;
counting the occurrences of each gray value in the current frame, using a dual-port RAM (DPRAM) on the FPGA chip as the record memory, the 256 gray levels occupying addresses 0, 1, …, 255, and each address register recording the occurrence count of the corresponding gray level;
reading the data out of the RAM during the image blanking interval, performing accumulation, and computing the upper limit A1 and the lower limit A2 of the image gray range;
after the valid signal of the next frame arrives, linearly stretching the infrared image using the upper and lower limit parameters A1 and A2 obtained during the blanking interval.
4. The method according to claim 3, wherein the dividend is shifted left by n bits through a shift register for amplification, and the computed result is shifted right by n bits to restore the scale, the total number of image pixels being used as the divisor.
5. The method according to claim 2, wherein step S203 specifically comprises: for each pixel, constructing a 3×3 template, sorting the pixels in each row in descending order, comparing the minimum of the column of maxima, the median of the middle column and the maximum of the column of minima, and replacing the pixel with the median of these three values.
6. The method according to claim 1, wherein step S3 selects a 3-layer Gaussian pyramid, specifically: each layer G_1, G_2, …, G_N of the Gaussian pyramid is obtained by the low-pass filtering and row/column-skipping downsampling shown in formula (1):

G_l(i, j) = Σ(m=-2..2) Σ(n=-2..2) ω(m, n) · G_{l-1}(2i+m, 2j+n),  1 ≤ l ≤ N   (1)

where N is the number of decomposition layers, ω(m, n) is the filter coefficient at coordinate (m, n) of the filter template, (i, j) is the coordinate of the current pixel, G_0 is the original image, and each layer of the pyramid has 1/4 the pixels of the previous layer.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910581890.2A CN110378355B (en) | 2019-06-30 | 2019-06-30 | FAST feature point detection method based on FPGA hardware fusion image |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110378355A CN110378355A (en) | 2019-10-25 |
CN110378355B true CN110378355B (en) | 2022-09-30 |
Family
ID=68251322
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910581890.2A Active CN110378355B (en) | 2019-06-30 | 2019-06-30 | FAST feature point detection method based on FPGA hardware fusion image |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110378355B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110991291B (en) * | 2019-11-26 | 2021-09-07 | 清华大学 | Image feature extraction method based on parallel computing |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102646269A (en) * | 2012-02-29 | 2012-08-22 | 中山大学 | Image processing method and device based on Laplace pyramid |
CN108364272A (en) * | 2017-12-30 | 2018-08-03 | 广东金泽润技术有限公司 | A kind of high-performance Infrared-Visible fusion detection method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111784576A (en) | Image splicing method based on improved ORB feature algorithm | |
CN108960261B (en) | Salient object detection method based on attention mechanism | |
US20230214981A1 (en) | Method for detecting appearance defects of a product and electronic device | |
CN103279952A (en) | Target tracking method and device | |
JP6046927B2 (en) | Image processing apparatus and control method thereof | |
CN108038826B (en) | Method and device for correcting perspective deformed shelf image | |
CN112037129A (en) | Image super-resolution reconstruction method, device, equipment and storage medium | |
JP6849101B2 (en) | Fast, gradual, super-voxel-based spatiotemporal video segmentation method | |
CN111860414A (en) | Method for detecting Deepfake video based on multi-feature fusion | |
CN112614167A (en) | Rock slice image alignment method combining single-polarization and orthogonal-polarization images | |
US20170301072A1 (en) | System and method for adaptive pixel filtering | |
CN110378355B (en) | FAST feature point detection method based on FPGA hardware fusion image | |
CN112734822A (en) | Stereo matching algorithm based on infrared and visible light images | |
CN111223083A (en) | Method, system, device and medium for constructing surface scratch detection neural network | |
Zhang et al. | An effective decomposition-enhancement method to restore light field images captured in the dark | |
CN115883988A (en) | Video image splicing method and system, electronic equipment and storage medium | |
CN115631210A (en) | Edge detection method and device | |
CN110120012B (en) | Video stitching method for synchronous key frame extraction based on binocular camera | |
Teng et al. | NEST: Neural event stack for event-based image enhancement | |
Tamilselvan et al. | Survey and analysis of various image fusion techniques for clinical CT and MRI images | |
CN105744184B (en) | Bad pixel correction method and the device for using this method | |
CN107993193B (en) | Tunnel lining image splicing method based on illumination equalization and surf algorithm improvement | |
CN110717910B (en) | CT image target detection method based on convolutional neural network and CT scanner | |
CN116863170A (en) | Image matching method, device and storage medium | |
CN101461228A (en) | Image processing circuit, semiconductor device, and image processing device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||