CN111274858A

CN111274858A - Business license identification method in network transaction supervision

Info

Publication number: CN111274858A
Application number: CN201811514261.XA
Authority: CN
Inventors: 叶炳坤; 王志永; 郭建辉; 林文东
Original assignee: Xiamen Meiya Shangding Information Technology Co ltd
Current assignee: Xiamen Meiya Shangding Information Technology Co ltd
Priority date: 2018-12-04
Filing date: 2018-12-04
Publication date: 2020-06-12

Abstract

The invention discloses a business license identification method in network transaction supervision. The license identification includes: s1, removing watermarks by a generating type countermeasure network; s2, fixing the width to detect small character blocks; s3, removing the overlapped small character blocks with fixed width; s4, connecting the small character blocks to obtain character blocks and angles thereof, and removing overlapped character blocks; s5, oblique cutting is carried out on the character blocks, and the sizes are normalized; s6, character block feature extraction; s7, identifying a recurrent neural network; and S8, post-processing the identification result. The invention obtains the watermark information to automatically add the watermark, and the countermeasure type neural network trains the watermark removing model, thereby obtaining better effect on the problems that the character block can not be detected due to the watermark interference, the character recognition rate is low, and the like, and improving the accuracy rate of character recognition of pictures of business licenses.

Description

Business license identification method in network transaction supervision

The invention discloses a business license identification method in network transaction supervision. The license identification includes: s1, removing watermarks by a generating type countermeasure network; s2, fixing the width to detect small character blocks; s3, removing the overlapped small character blocks; s4, connecting the small character blocks to obtain character blocks and angles thereof, and removing overlapped character blocks; s5, oblique cutting is carried out on the character blocks, and the sizes are normalized; s6, character block feature extraction; s7, identifying a recurrent neural network; and S8, post-processing the identification result. The invention obtains the watermark information to automatically add the watermark, and the countermeasure type neural network trains the watermark removing model, thereby obtaining better effect on the problems that the character block can not be detected due to the watermark interference, the character recognition rate is low, and the like, and improving the accuracy rate of character recognition of pictures of business licenses.

Technical Field

The invention relates to the technical field of internet, in particular to a picture watermark removing method and a business license identification method based on the picture watermark removing method.

Background

In OCR, for the image added with the watermark, the watermark is also a character, the character watermark in the image brings great interference to the detection and identification of subsequent character blocks, and the traditional business license identification mainly detects, cuts and identifies a single character. The method cannot eliminate the interference of the watermark, only can identify the text with clear background and more regular and simple characters, cannot adapt to the picture with complex background, and has poor adaptability to the picture of a business license containing the watermark. A method for removing the watermark based on priori knowledge and the like cannot obtain effective effects aiming at the position change, inclination and the like of the watermark.

Disclosure of Invention

The invention aims to provide a business license identification method in network transaction supervision.

In order to achieve the purpose, the invention adopts the following technical method:

the method for identifying the business license in the network transaction supervision is characterized by comprising the following steps:

s1, removing watermarks by a generating type countermeasure network;

s2, fixing the width to detect small character blocks;

s3, removing the overlapped small character blocks;

s4, connecting the small character blocks to obtain character blocks and angles thereof, and removing overlapped character blocks;

s5, oblique cutting is carried out on the character blocks, and the sizes are normalized;

s6, extracting character block characteristics;

s7, cyclic neural network character recognition;

and S8, post-processing the identification result.

Further, the dehydration model described in step S1 splits the generator into two sub-networks G ═ G1, G2 }: global generator network G1 and local enhancement network G2, where G1 consists of a convolution front-end, a series of residual blocks and a transposed convolution back-end, and G2 also consists of a convolution front-end, a series of residual blocks and a transposed convolution (sometimes called deconvolution) back-end. The input and output of the network are three-channel color pictures. Wherein training the dehydrated impression type comprises:

s11, obtaining watermark information, such as inclination, length-width ratio, color, space of watermarks in the picture and the like, and making a watermark picture template;

s12, collecting business license pictures without watermarks; establishing a confrontation type neural network watermark removing model, and modifying corresponding training parameters;

s13, adding a watermark picture template, setting information such as transparency, inclination and the like, and pasting the manufactured watermark template into a business license picture;

and S14, training a watermark removing model, executing a network to obtain an output picture, calculating a loss function value, and updating the network until the network is converged.

Further, in step S2, first, a convolutional neural network is used to extract picture features from the input picture, and a fixed number of window features are taken from each position of the feature map; adopting a bidirectional cyclic neural network as the characteristic identification of the sequence, inputting the characteristics corresponding to all windows of each row into the cyclic neural network, and obtaining an output category calibration characteristic vector; and inputting the category calibration feature vector into a classification and regression layer to obtain category information and position information. And mapping the recognition result on the feature map back to the original image to obtain the recognition result of the small blocks in the original image with the fixed width.

Further, in step S3, a non-maximum suppression method is used to filter out small text blocks with fixed width, whose overlap ratio is greater than a threshold value and whose confidence coefficient is smaller.

Further, step S4 obtains a set of text blocks with a fixed width by a simple text line construction algorithm, and obtains an angle of a text block and positions of four points of a quadrangle by fitting a straight line with a midpoint of each small block according to the midpoint set, the upper left point set, and the lower left point set.

Further, step S5 rotates the picture according to the four vertices and the angles of the text block, and captures the quadrilateral picture, and normalizes the quadrilateral picture into a picture with a fixed height and a width scaled by the height.

Further, step S6 obtains local information by convolution, and integrates the local information at a higher layer to obtain global information; and (3) carrying out dimensionality reduction operation by utilizing downsampling, processing the height of the character block into only 1 pixel, increasing the number of characteristic channels corresponding to the width, and expressing the characteristic channels as characteristic vectors.

Further, step S7 identifies the feature vector using the bidirectional recurrent neural network, obtains the probability distribution of each row of features, obtains the maximum value in the probability, and obtains the most likely identification result corresponding to the feature vector, which is also the identification result of the position.

Further, step S8 transcribes the probabilities into corresponding recognition characters, removing repeated consecutive identical characters, and obtaining the final result.

Compared with the background technology, the invention has the following advantages:

the invention automatically generates the watermark picture through analyzing the watermark, trains the watermark removing model, then detects and identifies the character of the watermark removing picture, eliminates the watermark interference and improves the accuracy of character detection and identification. The method has strong adaptability, can identify the watermark pictures of the business licenses of most of online platforms, and optimizes the efficiency of manual input.

Drawings

FIG. 1 is a flow chart of license identification;

FIG. 2 is a diagram of a photo watermarking network architecture;

FIG. 3 is a flow chart of text detection and recognition;

FIG. 4 is a drawing of a license and a watermark removal effect of the license;

FIG. 5 is a block detection and recognition effect picture;

Detailed Description

In order to make the objects, technical means and advantages of the present invention more apparent, the present invention will be further described in detail with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

Examples

Referring to fig. 2 and 4, the method for removing watermark from a picture, which synthesizes a watermark picture, trains a watermark removal model to improve the detection and identification of license text blocks, includes the following steps:

and R1, acquiring watermark information, such as inclination, length-width ratio, color, space of watermarks in the picture and the like, and making a watermark picture template.

The watermark picture information is obtained through a business license watermark picture downloaded from a network platform, specific parameters are obtained through visual observation or by means of an image processing tool, random parameters are obtained through Gaussian distribution during actual use, and each pair of parameter values corresponds to one type of watermark.

And R2, collecting the business license pictures without the watermarks.

The business license pictures are collected through manual screening and actual shooting by using other browsers such as Baidu browsers, Google browsers and the like and using a web crawler to obtain a considerable part of pictures.

And R3, establishing a confrontation type neural network watermark removing model and modifying corresponding training parameters.

Image synthesis and processing at high resolution using a antagonistic neural network, splitting the generator into two sub-networks G ═ G1, G2 }: global generator network G1 and local enhancement network G2, where G1 consists of a convolution front-end, a series of residual blocks and a transposed convolution back-end, and G2 also consists of a convolution front-end, a series of residual blocks and a transposed convolution (sometimes called deconvolution) back-end. The input and output of the network are three channels of RGB color pictures.

And R4, adding the watermark picture template, setting information such as transparency, inclination and the like, and pasting the manufactured watermark template into a business license picture.

Adding a business license watermark picture synthesis module, adding a watermark template addition module in front of a network, setting information such as a watermark inclination angle, a width-height ratio, a horizontal distance, transparency and the like, sticking the watermark to the business license picture, and synthesizing the watermark picture.

R5. training the watermark removing model, executing the network to obtain the output picture, calculating the loss function value, and updating the network until the network is converged.

And adopting multi-scale discriminators which have the same network structure, introducing characteristic matching loss, and updating the network by calculating a loss function until convergence.

Referring to fig. 1, fig. 3 and fig. 5, the detection and identification of the text block of the watermark removed picture includes the following steps:

A1. and detecting the text block.

The text block detection specifically comprises:

A11. adopting CNN to extract features, and taking a fixed number of window features at each position on the feature map;

A12. adopting a bidirectional cyclic neural network as the characteristic identification of the sequence, inputting the characteristics corresponding to all windows of each row into the cyclic neural network, and obtaining an output category calibration characteristic vector;

A13. the category-specific feature vectors are input into the classification and regression layer. Acquiring category information and position information;

A14. and combining the fixed small rectangular boxes of the characters obtained by classification into a text line by using a text line construction algorithm.

A2. Character block feature extraction: extracting image features by using a standard CNN and expressing the image features into feature vectors;

adopting 5Conv +3 MaxPholing +2bn, obtaining local information by convolution, and integrating the local information at a higher layer to obtain global information; and (5) performing dimensionality reduction operation by using Pooling.

A3. Character block recognition: the bidirectional recurrent neural network obtains the probability of belonging to each category;

and (4) identifying the feature vectors by using the bidirectional LSTM to obtain the probability distribution of each row of feature identification results.

A4. And (4) performing recognition result post-processing to remove non-character and repeated recognition results.

And analyzing the identification result as hh e ll o, wherein the non-character is'll'. Removing repeated characters, and knowing that adjacent repeated results, such as hh, ll, cannot occur in normal recognition results, even if there are adjacent repeated results in the image, such as l in hello, there will be non-character separation between two l in the recognition results; removing non-characters, the final recognition result is hello.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention should be subject to the protection scope of the claims.

Claims

1. The method for identifying the business license in the network transaction supervision is characterized by comprising the following steps:

s1, removing watermarks by a generating type countermeasure network;

s2, fixing the width to detect small character blocks;

s3, removing the overlapped small character blocks with fixed width;

s5, obliquely cutting character blocks and normalizing the size of the picture;

s6, extracting character block characteristics;

s7, cyclic neural network character recognition;

and S8, post-processing the identification result.

2. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S1, the watermarking model splits the generator into two sub-networks G ═ G1, G2: global generator network G1 and local enhancement network G2, where G1 consists of a convolution front-end, a series of residual blocks and a transposed convolution back-end, and G2 also consists of a convolution front-end, a series of residual blocks and a transposed convolution back-end. The input and output of the network are three-channel color pictures.

3. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S2, first, a convolutional neural network is used to extract picture features from an input picture, and a fixed number of window features are taken at each position of a feature map; adopting a bidirectional cyclic neural network as the characteristic identification of the sequence, inputting the characteristics corresponding to all windows of each row into the cyclic neural network, and obtaining an output category calibration characteristic vector; and inputting the category calibration feature vector into a classification and regression layer to obtain category information and position information. And mapping the recognition result on the feature map back to the original image to obtain the recognition result of the small blocks in the original image with the fixed width.

4. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S3, the non-maximum suppression method is used to filter out small text blocks with fixed width, whose overlap ratio is greater than the threshold and whose confidence is smaller.

5. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S4, a set of text blocks with a fixed width is obtained by a simple text line construction algorithm, and a straight line is fitted according to the midpoint set, the upper left point set, and the lower left point set by using the midpoint of each small block to obtain the angle of the text block and the positions of four points in the quadrangle.

6. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S5, the image is rotated according to the four vertices and angles of the text block, and the quadrilateral image is clipped and normalized into an image with a fixed height and a width scaled according to the height scaling ratio.

7. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S6, local information is obtained by convolution, and local information is integrated at a higher layer to obtain global information; and (3) carrying out dimensionality reduction operation by utilizing downsampling, processing the height of the character block into only 1 pixel, increasing the number of characteristic channels corresponding to the width, and representing the characteristic channels as characteristic vectors.

8. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S7, the feature vector is identified using the bidirectional recurrent neural network, the probability distribution of each row of features is obtained, the maximum value in the probability is obtained, and the maximum possible identification result corresponding to the feature vector is obtained, which is also the identification result of the position.

9. A method of license identification in network transaction supervision as claimed in claim 1, wherein: in step S8, the probability is transcribed into a corresponding recognition character, and repeated consecutive identical characters are removed.