CN110728280A

CN110728280A - Travel license information extraction method based on color difference

Info

Publication number: CN110728280A
Application number: CN201910949386.3A
Authority: CN
Inventors: 慕乾勇; 韩永昌
Original assignee: Qingdao Contel Network Technology Co Ltd
Current assignee: Qingdao Contel Network Technology Co Ltd
Priority date: 2019-10-08
Filing date: 2019-10-08
Publication date: 2020-01-24

Abstract

The invention belongs to the technical field of image processing, and discloses a driving license information extraction method based on color difference, which comprises the following steps: s1, obtaining a difference image by solving the difference value of the G channel and the B channel of the input image in the RGB format; s2, performing morphological expansion operation on the difference image to fill the holes and noise points of the image; s3, determining a threshold value by utilizing the Otsu method, and then carrying out binarization operation on the image obtained in the step S2 to obtain a binary image of the character area; s4, searching for connected regions in the character region, counting the area of each connected region, and then removing a part of regions with smaller areas; and S5, extracting characters to obtain the driving license information. The invention has high running speed and high positioning precision, and can be widely applied to the traffic field.

Description

Travel license information extraction method based on color difference

Technical Field

The invention belongs to the technical field of image processing, and particularly relates to a driving license information extraction method based on color difference.

Background

In recent years, with the development of information automation, a technology for automatically extracting and identifying information has attracted more and more attention, and the automatic identification of characters in a driving license is an example of practical application. The picture is used as an important information carrier, and the text information in the image is important content for understanding the whole image. Image content can be divided into two aspects: perceptual content and semantic content. The perception content comprises a plurality of attributes such as color, intensity, shape, texture and space-time variation of the color, the intensity, the shape and the texture; semantic content refers to objects, things, and their relationships. The research and application of low-level perception content in a series of images are reported, and semantic content (such as texts, human faces, vehicles, gestures and the like) in the images also attracts wide attention. Where text information attracts particular interest because: 1) the text is very useful for the image content of the bidding document; 2) the text is easier to extract relative to other semantic contents; 3) the text extraction has important application in aspects of image indexing, automatic recording, safety monitoring and the like based on key words; 4) the optical character recognition software is more mature.

The current image text recognition method can be summarized into the following four methods:

1) a text detection method based on edges. For the image which only has one line of characters and the direction of the character arrangement is horizontal or vertical, the characters are divided and distinguished by projecting in the horizontal or vertical direction and then utilizing the gaps among the characters and the projection of the characters. This method is simple and quick, but only applicable to simple situations.

2) A text detection method based on texture. And judging whether a pixel point or a pixel area belongs to characters or not by utilizing the judging texture characteristics. The method can better detect the characters in the complex background, but the algorithm has the defects of inaccurate positioning and higher algorithm complexity.

3) A text detection method based on regions. This method tries to detect the character as a single color region that satisfies a particular heuristic. The method has high processing speed and accurate positioning, but is only suitable for binary images and has limitation.

4) A learning-based text detection method. The method introduces a learning mechanism, firstly learns the network for detection through the selected samples, and then detects the characters by using the trained network prediction. On one hand, the method needs a large number of samples to train the network; on the other hand, the similarity between the training sample and the test sample affects the final recognition effect.

In summary, the current algorithm only utilizes partial information of the image, and fails to fully utilize the information of the image. Our driving license is written with black text on a green background. By using the difference of the background and the characters, the text content of the driving license can be effectively extracted.

Disclosure of Invention

In order to meet the actual requirements in the traffic field, the invention overcomes the defects in the prior art, and the technical problem to be solved is to provide a driving license information extraction method based on color difference so as to realize automatic extraction of text information on a driving license image and positioning of each character.

In order to solve the technical problems, the invention adopts the technical scheme that: a driving license information extraction method based on color difference comprises the following steps:

s1, obtaining a difference value image by solving a difference value of a G channel and a B channel of an input image in an RGB format, wherein the G channel represents a green channel, the B channel represents a blue channel, the size of the input image is H multiplied by W, H is the height of the image, W represents the width of the image, and mask (i, j) represents the difference value image;

s2, performing morphological expansion operation on the difference image to fill the holes and noise points of the image;

s3, determining a threshold value by utilizing the Otsu method, and then carrying out binarization operation on the image obtained in the step S2 to obtain a binary image of the character area;

s4, searching for connected regions in the character region, counting the area of each connected region, and then removing a part of regions with smaller areas, thereby eliminating the influence of noise regions;

and S5, extracting characters from the region obtained in the step S5 by a vertical projection method to obtain the driving license information.

In step S1, the calculation formula for obtaining the difference image by calculating the difference value between the G and B channels of the input image in RGB format is:

mask(i,j)＝|G(i,j)-B(i,j)|,1≤i≤H,1≤j≤W；

wherein, (i, j) represents the pixel point coordinate of the image, G (i, j) represents the green channel value of the pixel point with the coordinate (i, j) on the image, and B (i, j) represents the blue channel value of the pixel point with the coordinate (i, j) on the image.

In step S2, the calculation formula for performing morphological dilation operation on the difference image is:

mask′(i,j)＝Dilation(mask(i,j)),1≤i≤H,1≤j≤W；

wherein mask (i, j) represents a difference image; the scale represents the Dilation function, and the mask' (i, j) represents the image after the morphological Dilation operation.

In step S3, the calculation formula of the binarization operation is:

where mask "(i, j) represents a binary image, K represents a threshold, and mask' (i, j) represents an image after a morphological dilation operation.

In step S4, the specific method of removing the region with a small partial area is as follows: and finding the region with the largest area in each connected region, setting a threshold value according to the region with the largest area, and then removing the connected regions with the areas smaller than the threshold value.

In step S4, the threshold value is 10% of the area of the region having the largest area.

Compared with the prior art, the invention has the following beneficial effects: the invention obtains the difference value of the channels G and B of the image in RGB format, then fills the holes and the noise points by using morphological expansion, then obtains a binary image by using a threshold value, thereby obtaining the area of each character, and finally obtains a small area so as to remove the noise area. The method is used for extracting the driving license information, so that the running speed of the invention is high, and the character segmentation can be carried out in real time; the character area positioning precision of the invention is high, and the error rate of the character area is low.

Drawings

Fig. 1 is a schematic step diagram of a method for extracting license information based on color difference according to an embodiment of the present invention.

Detailed Description

In order to make the technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to specific embodiments and accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention; all other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The embodiment of the invention provides a driving license information extraction method based on color difference, which comprises the following steps:

s1, obtaining a difference value image by solving the difference value of the G channel and the B channel of the input image in the RGB format, wherein the G channel represents a green channel, the B channel represents a blue channel, the size of the input image is H multiplied by W, H is the height of the image, W represents the width of the image, and mask (i, j) represents the difference value image.

mask(i,j)＝|G(i,j)-B(i,j)|,1≤i≤H,1≤j≤W； (1)

And S2, performing morphological expansion operation on the difference image to fill in holes and noise points of the image.

mask′(i,j)＝Dilation(mask(i,j)),1≤i≤H,1≤j≤W； (2)

And S3, determining a threshold value by utilizing the Otsu method, and then performing binarization operation on the image obtained in the step S2 to obtain a binary image of the character area.

In step S3, the tsu method is a conventional method in the field of image processing, and therefore is not described herein, and the calculation formula of the binarization operation is:

S4, searching the connected regions in the character region, counting the area of each connected region, then finding the region with the largest area, and removing the region with the area less than 10% of the largest region, thereby eliminating the influence of the noise region. Wherein, the calculation formula of the area with the area less than 10% of the maximum area is:

mask″′＝Remove(mask″)； (4)

wherein Remove represents the removal function, and mask' ″ represents the image after the removal of the small region.

The searching method of the connected region of the image can be carried out by adopting the conventional technology in the field, and the errors caused by the noise region in the image to the subsequent character extraction can be removed by removing the smaller area of the region, so that the error rate is reduced.

According to the method, the difference value of the channels G and B of the image in the RGB format is firstly solved, then the morphological expansion is utilized to fill the cavity and the noise point, then the threshold value is utilized to solve the binary image, so that the region where each character is located is obtained, and finally the small-area region is removed so that the noise region is removed, so that the method is high in operation speed, capable of performing character segmentation in real time and high in character region positioning accuracy; the error rate of misinterpreting character regions is low.

The embodiments of the present invention have been described in detail with reference to the accompanying drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention.

Claims

1. A driving license information extraction method based on color difference is characterized by comprising the following steps:

s1, obtaining a difference value image by solving a difference value of a G channel and a B channel of an input image in an RGB format, wherein the G channel represents a green channel, the B channel represents a blue channel, the size of the input image is H multiplied by W, H is the height of the image, and W represents the width of the image;

2. The method for extracting driver license information based on color difference as claimed in claim 1, wherein in step S1, the calculation formula of the difference value image obtained by the difference value of the G and B channels of the input image in RGB format is:

mask(i,j)＝|G(i,j)-B(i,j)|,1≤i≤H,1≤j≤W；

wherein, (i, j) represents the pixel coordinates of the image, G (i, j) represents the green channel value of the pixel with the coordinate (i, j) on the image, B (i, j) represents the blue channel value of the pixel with the coordinate (i, j) on the image, and mask (i, j) represents the difference image.

3. The method for extracting license information based on color difference as claimed in claim 1, wherein in step S2, the calculation formula for morphological dilation operation on difference image is:

mask′(i,j)＝Dilation(mask(i,j)),1≤i≤H,1≤j≤W；

4. The method for extracting license information based on color difference as claimed in claim 1, wherein in step S3, the calculation formula of the binarization operation is:

5. The method for extracting license information based on color difference as claimed in claim 1, wherein the specific method of removing the region with smaller partial area in step S4 is as follows:

and finding the region with the largest area in each connected region, setting a threshold value according to the region with the largest area, and then removing the connected regions with the areas smaller than the threshold value.

6. The method for extracting license information based on color difference according to claim 1, wherein in step S4, the threshold value is 10% of the area of the region with the largest area.