KR20170038996A - Reference Image Projection Method for Tilted Images in SIFT - Google Patents
Reference Image Projection Method for Tilted Images in SIFT Download PDFInfo
- Publication number
- KR20170038996A KR20170038996A KR1020150137733A KR20150137733A KR20170038996A KR 20170038996 A KR20170038996 A KR 20170038996A KR 1020150137733 A KR1020150137733 A KR 1020150137733A KR 20150137733 A KR20150137733 A KR 20150137733A KR 20170038996 A KR20170038996 A KR 20170038996A
- Authority
- KR
- South Korea
- Prior art keywords
- reference image
- image
- projecting
- sift
- camera
- Prior art date
Links
Images
Classifications
-
- G06K9/6203—
-
- G06K9/32—
-
- G06K9/3208—
-
- G06K2009/485—
Landscapes
- Image Analysis (AREA)
Abstract
The present invention relates to a method of projecting a reference image for improving an object recognition rate inclined in SIFT; A first step of receiving an image photographed through a camera; A second step of calculating an angle of an inclined object in the input image; And a third step of projecting the reference image using the tilted angle of the object.
According to the present invention, an inclined angle of an object in an image captured through a camera before execution of a general SIFT algorithm is calculated, a reference image is projected, and an object is searched to efficiently search for a slanted object in the SIFT algorithm . In particular, the preprocessing process of projecting the reference image proposed by the present invention can be usefully used in a method of supplementing an existing descriptor by projecting an image around a keypoint in a step of generating a descriptor of the SIFT algorithm.
Description
The present invention relates to a method of projecting a reference image for improving the recognition accuracy of an oblique object in a SIFT, and more particularly, to a method for projecting a reference image in an SIFT And then projecting the reference image to the calculated angle.
Generally, Scale Invariant Feature Transform (SIFT) is known as an algorithm for extracting feature points from an image to generate a descriptor.
This SIFT algorithm is proposed by David G. Lowe as in Ref. 1, and can be roughly divided into two stages: a process of finding a feature point and a process of generating a descriptor of the selected feature point.
First, the feature point extraction process extracts candidate feature points on the scale space and checks the stability of the candidate feature points to correct the positions of the stable feature points to the detailed positions. Next, in the descriptor generation process, a direction component is obtained through a gradient of the surrounding region around the selected feature points, and a descriptor is generated by resetting the region of interest around the obtained direction component.
1 is a diagram showing a processing flow of a general SIFT algorithm. The SIFT sequentially performs steps of Get Image (S1), Scale-space extrema detection (S2), KeyPoint localization (S3), Orientation assignment (S4), and Make KeyPoint descriptor (S5) Characteristics such as position, scale and direction of the feature are extracted in consideration of the characteristics.
Thus, it is used to compare the descriptors extracted through the SIFT algorithm to find the object of the reference image in another image. However, the SIFT algorithm is robust against changes in size and rotation, but has a problem that the recognition rate is drastically lowered when the object is inclined.
To complement this, of course, JM. Morel has proposed an ASIFT algorithm to search objects by projecting images at various angles.
However, since the ASIFT algorithm proposed in
Therefore, the ASIFT algorithm has high computational complexity and improves the inaccuracies of the descriptor, resulting in low recognition rate.
SUMMARY OF THE INVENTION Accordingly, the present invention has been made keeping in mind the above problems occurring in the prior art, and it is an object of the present invention to provide an image processing apparatus, And it is an object of the present invention to provide a projection method capable of efficiently projecting light.
In particular, it is an object of the present invention to provide a projection method capable of efficiently projecting a tilted substance in an input image captured by a stereo camera or a depth camera.
In order to solve such a technical problem,
A first step of receiving an image photographed through a camera; A second step of calculating an angle of an inclined object in the input image; And a third step of projecting the reference image using the tilted angle of the object. The present invention also provides a method of projecting a reference image for enhancing the inclination of an object recognition rate in SIFT.
In this case, the camera may be a stereo camera or a depth camera.
The third step is a step of projecting a reference image of a frontal view at a previously calculated angle.
And a fourth step of projecting the reference image to generate a descriptor after the third step.
The fourth step may include: selecting a plurality of sample points or candidate pixels by searching for a maximum or a minimum in a scale space; Selecting keypoints among the sample points or candidate pixels; And assigning a direction to the key point and generating a key point descriptor.
According to the present invention, an inclined angle of an object in an image captured through a camera before execution of a general SIFT algorithm is calculated, a reference image is projected, and an object is searched to efficiently search for an inclined object in the SIFT algorithm .
In particular, the preprocessing process of projecting the reference image proposed by the present invention can be usefully used in a method of supplementing an existing descriptor by projecting an image around a keypoint in a step of generating a descriptor of the SIFT algorithm.
1 is a flowchart of a conventional SIFT algorithm.
2 is a system configuration diagram for projecting a reference image for improving the object recognition rate inclined in SIFT according to the present invention.
FIG. 3 is a flowchart of a SIFT algorithm for projecting a reference image for improving the object recognition rate inclined in the SIFT according to the present invention.
4 is a comparison table of the number of matching keypoints according to the image projection method according to the present invention.
FIG. 5 is a diagram showing a result of key point matching after 60-degree projection of a near-point reference image according to the image projection method of the present invention.
Hereinafter, a method of projecting a reference image for improving the object recognition rate inclined in SIFT according to the present invention will be described in detail with reference to the accompanying drawings.
Prior to this, terms and words used in the present specification and claims should not be construed as limited to ordinary or dictionary terms, and the inventor should appropriately interpret the concepts of the terms appropriately It should be interpreted in accordance with the meaning and concept consistent with the technical idea of the present invention based on the principle that it can be defined.
Therefore, the embodiments described in the present specification and the configurations shown in the drawings are only the most preferred embodiments of the present invention, and not all of the technical ideas of the present invention are described. Therefore, It should be understood that various equivalents and modifications may be present.
Referring to FIGS. 2 and 3, a method of projecting a reference image for enhancing an object recognition rate inclined by SIFT according to the present invention calculates a tilted angle of an object in an input image captured by a camera, So that the inclined object can be efficiently projected in the SIFT algorithm.
At this time, various types of cameras such as a stereo camera or a depth camera can be used as the camera.
The stereoscopic camera is a stereoscopic camera, which is a special camera that can acquire two images at the same time. This is a method in which two photographing lenses are arranged at regular intervals and the same object is photographed. When you use the stereo viewer, the images appear three-dimensionally.
In other words, the stereo camera is a video image pickup apparatus that captures an object by arranging the
The left image and the right image captured by the
Hereinafter, with reference to FIG. 2 and FIG. 3, a projection process of a reference image for enhancing the inclined object recognition rate in the SIFT according to the present invention will be described in detail.
The method includes receiving an image captured through a camera (S100: Get Image), calculating an angle of an inclined object in the input image (S110: refer to the calculated angle of the object) (S130: Scale-space extreme detection) of selecting a sample point (or a candidate pixel) by searching the maximum and minimum (extrema) in a scale space (S120: Project the reference image by using angle) A step of selecting a key point (S140: KeyPoint localization), a step of assigning a direction to a key point (S150: Orientation assignment), and a step of generating a key point descriptor (S160: KeyPoint descriptor) After a descriptor is created, it is compared with the descriptor of another image to match the object.
That is, the present invention performs a pre-processing step (S110 to S120) before performing a general SIFT (Scale Invariant Feature Transform) step (S130 to S160).
Each step will be described in detail below.
The step (S100: Get Image) is a step of inputting an input image photographed through a camera to the
A calculation angle of object image (S110) is calculated through a stereo camera or a depth camera through the step S100 and an inclination angle of an object in the input image is calculated.
At this time, a tilted angle of an object in the input image may be calculated by measuring the distance of a specific object, and then calculating the tilted angle of the object in the image.
A step S120 of projecting a reference image after calculating a tilted angle of an object in the input image through a calculation angle of an object image S110, .
In this case, since the reference image of the frontal view is projected at a previously calculated angle, the calculation complexity is low and the interpolation process is not used So that it has a high recognition rate.
After the reference image is projected through the Projection reference image using angle (S120), a step (S130: Scale-space extreme detection) of selecting a sample point (or a candidate pixel) in the input image is performed .
In this case, the step (S130: Scale-space extreme detection) is a step of selecting a sample point (or a candidate pixel) for searching an input image for an object of a reference image, It is preferable to select the sample point (or the candidate pixel) by searching the maximum and minimum (extrema) in the scale space (Scale-space) generated through the function.
After selecting a sample point (or candidate pixel) of an object through the Scale-space extreme detection (S130), a key point localization (S140: KeyPoint) is performed.
At this time, a plurality of key points are selected based on the stability value through the step (S140: KeyPoint localization).
Then, the keypoints are selected through the keypoint localization (S140), and one or more directions are assigned to each keypoint (S150: Orientation assignment).
Also, after performing the above step S150 (Orientation assignment), a step (S160: Make KeyPoint descriptor) is performed to generate a key point descriptor using local image gradients.
After creating the keypoint descriptor, the object is compared with the descriptor of another image.
Hereinafter, a description will be given of a projection process of a reference image for improving the object recognition rate at a slant in the SIFT according to the present invention, with reference to FIGS. 4 and 5. FIG.
According to this embodiment, in order to verify the performance of the method proposed in the present invention, the number of keypoints that are matched in different experimental conditions such as an angle, an oblique direction and a distance of an image is the same as the comparison table of FIG.
In this case, the present invention searches for an object of a reference image size of 573 x 670 in an image of 1024 x 768 size. casel, case2 and case3 are close to the object in the image and are tilted at + 60 °, -50 °, + 45 °, respectively. In
The second column is a case of applying the proposed reference image projection technique, and the third column is a case of projecting the camera input image to the frontal view. According to the experimental results shown in the comparison table of FIG. 4, the number of keypoints matched in the projection technique of the reference image according to the present invention is much larger, which means that the recognition rate is improved.
It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the invention. The scope of protection of the present invention should be construed under the following claims, and all technical ideas within the scope of equivalents thereof should be construed as being included in the scope of the present invention.
10: Left camera 20: Right camera
30: Analytical means
Claims (5)
A second step of calculating an angle of an inclined object in the input image; And
A third step of projecting the reference image using the tilted angle of the object;
The method of claim 1, wherein the reference image is a projection image of a reference image for enhancing a slanting object recognition rate in a SIFT.
Wherein the camera is a stereo camera or a depth camera. The method of claim 1, wherein the camera is a stereo camera or a depth camera.
Wherein the third step is a step of projecting a reference image of a frontal view at an angle calculated in advance by the SIFT, and the step of projecting the reference image for enhancing the inclined object recognition rate in the SIFT.
And a fourth step of, after the third step, projecting the reference image to generate a descriptor. The method of claim 1, wherein the step of projecting the reference image comprises the steps of:
Selecting a plurality of sample points or candidate pixels by searching for a maximum or minimum in a scale space; Selecting keypoints among the sample points or candidate pixels; And assigning a direction to the key point and generating a key point descriptor. A method of projecting a reference image for enhancing a slanted object recognition rate in a SIFT.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150137733A KR101740572B1 (en) | 2015-09-30 | 2015-09-30 | Reference Image Projection Method for Tilted Images in SIFT |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150137733A KR101740572B1 (en) | 2015-09-30 | 2015-09-30 | Reference Image Projection Method for Tilted Images in SIFT |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20170038996A true KR20170038996A (en) | 2017-04-10 |
KR101740572B1 KR101740572B1 (en) | 2017-06-09 |
Family
ID=58581190
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020150137733A KR101740572B1 (en) | 2015-09-30 | 2015-09-30 | Reference Image Projection Method for Tilted Images in SIFT |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR101740572B1 (en) |
-
2015
- 2015-09-30 KR KR1020150137733A patent/KR101740572B1/en active IP Right Grant
Non-Patent Citations (2)
Title |
---|
참고문헌 1 : Lowe, David G. "Distinctive image features from scale-invariant keypoints." International journal of computer vision 60.2 (2004): 91-110. |
참고문헌 2 : Morel, Jean-Michel, and Guoshen Yu. "ASIFT : A new framework for fully affine invariant image comparision", SIAM Journal on Imaging Sciences 2.2(2009): 438-469. |
Also Published As
Publication number | Publication date |
---|---|
KR101740572B1 (en) | 2017-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bulatov et al. | Smart IDReader: Document recognition in video stream | |
US10395383B2 (en) | Method, device and apparatus to estimate an ego-motion of a video apparatus in a SLAM type algorithm | |
KR101333871B1 (en) | Method and arrangement for multi-camera calibration | |
US9418313B2 (en) | Method for searching for a similar image in an image database based on a reference image | |
Mistry et al. | Image stitching using Harris feature detection | |
US20200349187A1 (en) | Method and apparatus for data retrieval in a lightfield database | |
CN105069457B (en) | Image recognition method and device | |
US20170147609A1 (en) | Method for analyzing and searching 3d models | |
US20180307940A1 (en) | A method and a device for image matching | |
EP3035242B1 (en) | Method and electronic device for object tracking in a light-field capture | |
CN110544202A (en) | parallax image splicing method and system based on template matching and feature clustering | |
Vaikundam et al. | Anomaly region detection and localization in metal surface inspection | |
CN112102404B (en) | Object detection tracking method and device and head-mounted display equipment | |
Rodríguez et al. | Robust estimation of local affine maps and its applications to image matching | |
Zhao et al. | Locposenet: Robust location prior for unseen object pose estimation | |
KR101705330B1 (en) | Keypoints Selection method to Find the Viewing Angle of Objects in a Stereo Camera Image | |
Krishnan et al. | A survey on image matching methods | |
CN114926508B (en) | Visual field boundary determining method, device, equipment and storage medium | |
KR101740572B1 (en) | Reference Image Projection Method for Tilted Images in SIFT | |
KR101705333B1 (en) | Estimation Method of Depth Vatiation around SIFT Features using a Stereo Camera | |
Yang et al. | Design flow of motion based single camera 3D mapping | |
US10713808B2 (en) | Stereo matching method and system using rectangular window | |
Lehiani et al. | Object identification and tracking for steady registration in mobile augmented reality | |
Hwang et al. | Real-Time 2D Orthomosaic Mapping from Drone-Captured Images Using Feature-Based Sequential Image Registration | |
KR101733013B1 (en) | Projection Method based on the Angle of Viewpoint in SIFT Matching |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
AMND | Amendment | ||
E601 | Decision to refuse application | ||
AMND | Amendment | ||
X701 | Decision to grant (after re-examination) | ||
GRNT | Written decision to grant |