CN102419817A - Automatic document scanning, analyzing and processing system based on intelligent image identification - Google Patents

Automatic document scanning, analyzing and processing system based on intelligent image identification

Info

Publication number
CN102419817A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102932905A
Other languages
Chinese (zh)
Inventor
赵黔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guizhou Province Power Information Technology Co Ltd Of Speeding
Original Assignee
Guizhou Province Power Information Technology Co Ltd Of Speeding
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guizhou Province Power Information Technology Co Ltd Of Speeding
Priority to CN2010102932905A
Publication of CN102419817A
Current legal status: Pending

Abstract

The invention relates to image processing and pattern recognition, in particular to an analysis method based on orthogonal generalized projection that is applied to document scanning, analysis and processing. The method comprises the following steps: (1) determination of the effective region in a document image: locating the effective region of interest (ROI) in the document and removing the four margin regions of the document image to obtain the minimum rectangular region containing the ROI; (2) 90-degree deflection detection: determining whether the image is deflected by 90 degrees by calculating the variances of the generalized projection curves of the document image, and correcting the document image accordingly; and (3) 180-degree inversion detection: determining whether the image is inverted by 180 degrees by calculating the piecewise mean function of the vertical gray integral projection of the document image processed in step (1), and correcting the document image accordingly.

Description

An automatic document scanning, analysis and processing system based on intelligent image identification
Technical field
The present invention relates to image processing and pattern recognition, and in particular to an analysis method based on orthogonal generalized projection that is applied to document scanning, analysis and processing.
Background art
The document scanning analysis system is proposed in order to detect and correct the 90° deflection and 180° inversion of document images that occur when a user places a document in the wrong orientation during scanning, and thereby to make the scan-input process more intelligent and automated. As daily life and work become increasingly digitized, people need to scan large numbers of paper documents and store them on computers, and this is where the value of such a system becomes apparent.
A scanner, as a computer peripheral, converts paper documents into electronic data stored on a computer, so that users can process documents on the computer and can preserve, transmit and manage them more conveniently and in a more standardized way. If a document is placed in the wrong orientation before scanning, the scanned document image is often deflected by 90° or inverted by 180°, as shown in Fig. 1. When many documents are scanned, checking for wrongly oriented images and correcting them becomes tedious work for the user.
At present, deflection or inversion of scanned documents is mainly corrected with OCR (Optical Character Recognition), which is mainly used to analyze scanned document images and obtain their text and layout information. Such a method converts character shapes into computer internal codes by character recognition, compares them against the characters in a reference database, and takes the highest matching ratio as the recognition rate, thereby recognizing the text in the image. At the same time, it infers information about the document image, such as whether the image is deflected or inverted, from related parameters such as the recognition rate, and then rotates the image to correct it. The recognition rate therefore determines the performance of the method. However, the OCR recognition rate is affected by many factors, such as scanning resolution, image quality, image format, font and language. Too low a scanning resolution reduces the recognition rate, while too high a resolution does not improve it and also reduces processing efficiency. Whether the document itself is clear, and which font it uses, can also reduce the recognition rate. The recognition rate in turn affects the final decision on whether the document image is wrongly oriented.
The projection method is an effective way to extract image features. In general, a two-dimensional image can be analyzed through two orthogonal one-dimensional projection functions. Reducing the dimension makes the image features easier to analyze and reduces the amount of computation, so projection has become an important image analysis method. Characters in a document follow an inherent arrangement, always from top to bottom and from left to right, so the character density in a document image is higher in the upper-left part than in the lower-right part. When the projection method is applied to the pixel information of a document image, the projection curves fully reflect how the characters are distributed. Unlike OCR, the projection method is not affected by scanning resolution, image format, font, or even the language of the text. Moreover, OCR computes the recognition rate of each character by repeatedly searching a reference database, whereas a projection function operates only on one-dimensional information of the image, so the latter is far more computationally efficient. The present invention proposes an analysis method based on watershed segmentation and orthogonal generalized projection that can accurately determine and correct the orientation of a scanned document.
Summary of the invention
The object of the invention is to determine whether a scanned document is deflected by 90° or inverted by 180°, and to correct it according to the result.
To achieve this object, the technical solution of the invention provides an analysis method based on orthogonal generalized projection, comprising the following steps:
(1) Determination of the effective region in the document image: locate the effective character region (Region of Interest, ROI) in the document and remove the four margin regions of the document image to obtain the minimum rectangular region containing the character region.
(2) 90° deflection detection: determine whether the image is deflected by 90° by calculating the variances of the generalized projection curves of the document image, and correct the document image accordingly.
(3) 180° inversion detection: determine whether the image is inverted by 180° by calculating the piecewise mean function of the vertical gray integral projection of the document image processed in step (1), and correct the document image accordingly.
In step (1) of the above document scanning, analysis and processing system based on orthogonal generalized projection, the pixel widths L1 and L2 of the top and left margin regions of the document image are obtained. Regions of pixel width L1 are then removed from the top and bottom of the image, and regions of pixel width L2 from its left and right, giving the minimum effective rectangular region (ROI) that contains the character region of the document.
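The margin removal of step (1) can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes the image is a list of pixel rows in which character pixels are brighter than the background, and the function names `margin_widths` and `crop_roi` and the `threshold` parameter are hypothetical.

```python
def margin_widths(image, threshold=0):
    """Return (L1, L2): the pixel widths of the blank top and left margins.

    A pixel counts as a character pixel when its value exceeds `threshold`
    (an assumed rule; the source does not say how blank margins are found).
    A completely blank page yields (0, 0).
    """
    rows, cols = len(image), len(image[0])
    l1 = next((x for x in range(rows)
               if any(v > threshold for v in image[x])), 0)
    l2 = next((y for y in range(cols)
               if any(image[x][y] > threshold for x in range(rows))), 0)
    return l1, l2


def crop_roi(image, threshold=0):
    """Remove L1 pixels from the top and bottom and L2 from the left and right."""
    l1, l2 = margin_widths(image, threshold)
    rows, cols = len(image), len(image[0])
    return [row[l2:cols - l2] for row in image[l1:rows - l1]]


page = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 9, 9, 0, 0],
    [0, 0, 9, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
roi = crop_roi(page)  # [[9, 9], [9, 0]]
```

As in the text, the top margin width is reused for the bottom and the left margin width for the right, so the sketch only yields a tight rectangle when opposite margins are equal.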
In step (2) of the above system, the orthogonal generalized projection function (OGPF) curves of the document image are calculated. The OGPF comprises the horizontal generalized projection function $\mathrm{OGPF}_h(x)$ and the vertical generalized projection function $\mathrm{OGPF}_v(y)$, which are integral projections of per-pixel information in the image, such as gray level, gradient or variance. The variances of the $\mathrm{OGPF}_h(x)$ and $\mathrm{OGPF}_v(y)$ curves are then calculated to determine whether the image is deflected by 90°. The steps are as follows:
1) Calculate the horizontal generalized projection function $\mathrm{OGPF}_h(x)$ and the vertical generalized projection function $\mathrm{OGPF}_v(y)$. $\mathrm{OGPF}_h(x)$ is the sum of the pixel information values in each row of the image, and $\mathrm{OGPF}_v(y)$ is the sum of the pixel information values in each column. Let the original image be of size M × N and let $I(x, y)$ be the pixel information value at each point, where, as in the formulas below, $x \in [1, M]$ indexes rows and $y \in [1, N]$ indexes columns. The formulas are as follows:
$$\mathrm{OGPF}_h(x) = \sum_{y=1}^{N} I(x, y)$$
$$\mathrm{OGPF}_v(y) = \sum_{x=1}^{M} I(x, y)$$
2) Calculate the curve variance $\mathrm{GPFV}_h$ of the horizontal generalized projection function $\mathrm{OGPF}_h(x)$ and the curve variance $\mathrm{GPFV}_v$ of the vertical generalized projection function $\mathrm{OGPF}_v(y)$, where $\bar{u}_h$ and $\bar{u}_v$ denote the means of the horizontal and vertical generalized projection function curves, respectively. They are calculated as follows:
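$$\bar{u}_h = \frac{1}{M} \sum_{x=1}^{M} \sum_{y=1}^{N} I(x, y)$$
$$\bar{u}_v = \frac{1}{N} \sum_{y=1}^{N} \sum_{x=1}^{M} I(x, y)$$
$$\mathrm{GPFV}_h = \sum_{x=1}^{M} \left( \mathrm{OGPF}_h(x) - \bar{u}_h \right)^2$$
$$\mathrm{GPFV}_v = \sum_{y=1}^{N} \left( \mathrm{OGPF}_v(y) - \bar{u}_v \right)^2$$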
3) Determine whether the document image is deflected. When $\mathrm{GPFV}_h \ge \mathrm{GPFV}_v$, the document image is not deflected by 90°; when $\mathrm{GPFV}_h < \mathrm{GPFV}_v$, it is deflected by 90°.
4) Correct the image. If the result shows that the document image is deflected by 90°, the original document image is rotated by 90° and used as the first correction result. If it is not deflected by 90°, the original document image itself is used as the first correction result.
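Steps 1) to 4) can be sketched as below. This is an illustrative sketch under stated assumptions rather than the patent's code: the image is a list of pixel rows indexed as `image[x][y]` with x the row and y the column (0-based), the pixel information is the gray value itself, and all function names are hypothetical. The clockwise rotation direction is taken from the embodiment described later in the text.

```python
def ogpf_h(image):
    """Horizontal projection: one sum per row x."""
    return [sum(row) for row in image]


def ogpf_v(image):
    """Vertical projection: one sum per column y."""
    return [sum(col) for col in zip(*image)]


def curve_variance(curve):
    """Sum of squared deviations from the curve mean (GPFV in the text)."""
    mean = sum(curve) / len(curve)
    return sum((v - mean) ** 2 for v in curve)


def is_deflected_90(image):
    """True when GPFV_h < GPFV_v, the condition given for a 90° deflection."""
    return curve_variance(ogpf_h(image)) < curve_variance(ogpf_v(image))


def rotate_90_clockwise(image):
    """The left column, read bottom to top, becomes the new top row."""
    return [list(col) for col in zip(*image[::-1])]


def first_correction(image):
    """Return the first correction result described in step 4)."""
    return rotate_90_clockwise(image) if is_deflected_90(image) else image


# Two "text lines" separated by blank rows: row sums swing, column sums do not.
upright = [
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
]
sideways = rotate_90_clockwise(upright)
```

Note that a single clockwise rotation can leave the page upside down; that case is left to the 180° inversion check of step (3).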
The basis for judging from the variances of the orthogonal generalized projection function curves whether the image is deflected by 90° lies in the following characteristics of documents. First, the four margins of a document contain blank regions of a certain area. Second, there is regular blank spacing between the lines of characters. Third, because the number and width of the characters differ from line to line, a document does not exhibit aligned character columns or regular column spacing. For a document image that is not deflected by 90°, the horizontal generalized projection function curve shows regular peaks and troughs at the coordinates corresponding to the character lines and the line spacing, and this waveform changes sharply; the vertical generalized projection function curve shows no such regularity and changes more gently, as shown in Fig. 4. For a document image that is deflected by 90°, the horizontal and vertical curves behave in exactly the opposite way. In discrete space, sharp changes in a curve appear as sharp changes among the elements of the corresponding one-dimensional vector, and when those values vary sharply the variance of the vector increases.
In step (3) of the above system, the piecewise mean function of the vertical gray integral projection of the document image is calculated. The image analyzed in this step is the first correction result produced at the end of step (2). The procedure is as follows:
$\overline{uPF_v}(y)$ is the piecewise mean function of the vertical gray integral projection of the image. For $x \in [1, M]$ and $y \in [1, N]$, $\bar{u}_{11}$ denotes the mean of the vertical gray integral projection over the upper-left region of the document image, and $\bar{u}_{22}$ denotes the mean of the vertical gray integral projection over the lower-right region. They are calculated as follows:
$$\bar{u}_{11} = \frac{1}{n} \sum_{y=1}^{n} \sum_{x=1}^{m} I(x, y)$$
$$\bar{u}_{22} = \frac{1}{N-n} \sum_{y=n+1}^{N} \sum_{x=m+1}^{M} I(x, y)$$
where the image size is M × N, with M = 2m or M = 2m + 1 and N = 2n or N = 2n + 1.
$$\overline{uPF_v}(y) = \begin{cases} \bar{u}_{11}, & x \in [1, m],\ y \in [1, n] \\ \bar{u}_{22}, & x \in [m+1, M],\ y \in [n+1, N] \end{cases}$$
When $\bar{u}_{11} > \bar{u}_{22}$, that is, when the mean of the vertical gray integral projection over the upper-left part of the document image is greater than that over the lower-right part, the document image is judged not to be inverted by 180°; otherwise, the image is inverted by 180°.
If the result shows that the document image is inverted by 180°, the first correction result is rotated by 180° and used as the final result. If it is not inverted by 180°, the first correction result itself is used as the final result.
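The inversion test can be sketched as follows. This is a minimal sketch, assuming the image is a list of pixel rows indexed as `image[x][y]` (x the row, y the column, 0-based), that character pixels are brighter than the background, and that the image has at least two rows and two columns; the function names are hypothetical.

```python
def quadrant_means(image):
    """Return (u11, u22): mean column sums of the upper-left and lower-right regions."""
    big_m, big_n = len(image), len(image[0])  # M rows, N columns
    m, n = big_m // 2, big_n // 2             # M = 2m or 2m + 1, N = 2n or 2n + 1
    u11 = sum(image[x][y] for x in range(m) for y in range(n)) / n
    u22 = sum(image[x][y]
              for x in range(m, big_m) for y in range(n, big_n)) / (big_n - n)
    return u11, u22


def is_inverted_180(image):
    """Not inverted only when u11 > u22; a tie counts as inverted, as in the text."""
    u11, u22 = quadrant_means(image)
    return not u11 > u22


def rotate_180(image):
    """Reverse the row order and each row."""
    return [row[::-1] for row in image[::-1]]


def final_result(first_correction):
    """Rotate the first correction result by 180° only when it is inverted."""
    if is_inverted_180(first_correction):
        return rotate_180(first_correction)
    return first_correction


# Character pixels (1) are concentrated in the upper-left part.
upright = [
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
]
flipped = rotate_180(upright)
```

Because the decision rests on only two region means, a page whose content is concentrated in the lower right (for example, a page carrying little more than a signature block) would presumably be misjudged by this rule.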
Before the piecewise mean function of the vertical gray integral projection is calculated, the first correction result should be converted to a grayscale or binary image, and it must be ensured that the background pixel values of the image are lower than the character pixel values.
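One way to meet this requirement is sketched below. It assumes an 8-bit scan with dark text on light paper, so the gray values are inverted or thresholded to make characters the high-valued pixels. The function names, the fixed threshold of 128 and the use of the common 0.299/0.587/0.114 luma weights are assumptions for illustration; the source does not specify a conversion.

```python
def to_gray(rgb_image):
    """Convert rows of (r, g, b) tuples to 8-bit gray values."""
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row]
            for row in rgb_image]


def invert_gray(gray_image, max_value=255):
    """Make dark text bright so that background values are lower than text values."""
    return [[max_value - v for v in row] for row in gray_image]


def binarize_text_high(gray_image, threshold=128):
    """Binary image with 1 for dark (character) pixels and 0 for background."""
    return [[1 if v < threshold else 0 for v in row] for row in gray_image]


scan = [[(255, 255, 255), (0, 0, 0)]]   # one white pixel, one black pixel
gray = to_gray(scan)                     # [[255, 0]]
text_high = invert_gray(gray)            # [[0, 255]]
```

A fixed threshold is the simplest choice; an adaptive threshold may be more suitable for unevenly lit scans.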
The basis for judging from the piecewise mean function of the vertical gray integral projection whether the image is inverted by 180° is as follows. People write documents from top to bottom and from left to right, so the character density in the upper-left part of a document is higher than in the lower-right part. Consequently, when character pixel values are higher than background pixel values, the mean of the column gray integrals over the upper-left part of a correctly oriented document image is greater than that over the lower-right part. This rule can be used to judge whether the document image is inverted by 180°.
As the above description of the document scanning, analysis and processing procedure shows, the method makes full use of the characteristics of documents themselves. A correctly oriented document has the following characteristics: 1. its lines of characters are horizontally aligned; 2. its lines of characters are separated by regular spacing; 3. because people write documents from top to bottom and from left to right, the character density in the upper-left part is higher than in the lower-right part. Whether the document image is deflected by 90° is judged by comparing the variances of its horizontal and vertical generalized projection function curves; whether it is inverted by 180° is then judged by analyzing the piecewise mean function of its vertical gray integral projection.
Beneficial effects of the invention:
The scanned-document analysis and processing method of the invention performs well and adapts well to variations in font, language, gray level and the like, so it can meet the needs of most scanning systems. Users no longer have to deal with the 90° deflection and 180° inversion of document images caused by placing documents in the wrong orientation during scanning, which makes the scan-input process more intelligent and automated.
Description of drawings
Fig. 1 shows scanned document images, comprising:
(a) a document image
(b) a document image deflected by 90°
(c) a document image inverted by 180°
Fig. 2 shows the minimum rectangular region containing the characters
Fig. 3 is a flow chart of the document scanning, analysis and processing system
Fig. 4 shows the orthogonal gray integral projections of a document image, comprising:
(a) the original document image
(b) the horizontal gray integral projection
(c) the vertical gray integral projection
Embodiment
The method of the invention makes full use of the way characters are distributed in a document and uses an analysis method based on orthogonal generalized projection to judge and correct the document image. The overall procedure is shown in Fig. 3. The specific implementation is as follows:
Step 1: obtain the scanned document image and remove the interference of the four blank margin regions of the document image.
Step 2: calculate the variances of the orthogonal generalized projection function curves of the document image. Fig. 4 shows the orthogonal gray integral projection function curves of a document image. Calculate and compare the variance of the horizontal integral projection function curve and the variance of the vertical integral projection function curve. According to the comparison, if there is a 90° deflection, rotate the original scanned document image 90° clockwise and use it as the first correction result; otherwise, use the original document image as the first correction result.
Step 3: for the first correction result, calculate and compare the means of the vertical gray integral projection over the upper-left region and the lower-right region of the image. If there is a 180° inversion, rotate the first correction result by 180° and use it as the result of the algorithm; otherwise, use the first correction result as the result of the algorithm.
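The three steps can be chained as in the following sketch. It is illustrative only and rests on several assumptions: the image is a list of pixel rows with character pixels brighter than the background, opposite margins are equal, the cropped region has at least two rows and two columns, and the cropped region (rather than the full page) is what gets rotated and returned. All function names are hypothetical.

```python
def crop_roi(img, threshold=0):
    """Step 1: strip the top/left margin widths from all four sides."""
    rows, cols = len(img), len(img[0])
    l1 = next((x for x in range(rows) if any(v > threshold for v in img[x])), 0)
    l2 = next((y for y in range(cols) if any(r[y] > threshold for r in img)), 0)
    return [r[l2:cols - l2] for r in img[l1:rows - l1]]


def variance(curve):
    """Sum of squared deviations from the mean of a projection curve."""
    mean = sum(curve) / len(curve)
    return sum((v - mean) ** 2 for v in curve)


def rot90cw(img):
    """Rotate 90° clockwise."""
    return [list(c) for c in zip(*img[::-1])]


def rot180(img):
    """Rotate 180°."""
    return [r[::-1] for r in img[::-1]]


def correct_orientation(page):
    roi = crop_roi(page)
    # Step 2: compare the variances of the row-sum and column-sum curves.
    gpfv_h = variance([sum(r) for r in roi])
    gpfv_v = variance([sum(c) for c in zip(*roi)])
    first = rot90cw(roi) if gpfv_h < gpfv_v else roi
    # Step 3: compare upper-left and lower-right mean column sums.
    big_m, big_n = len(first), len(first[0])
    m, n = big_m // 2, big_n // 2
    u11 = sum(first[x][y] for x in range(m) for y in range(n)) / n
    u22 = sum(first[x][y]
              for x in range(m, big_m) for y in range(n, big_n)) / (big_n - n)
    return first if u11 > u22 else rot180(first)
```

Calling `correct_orientation` on a synthetic page in any of its four orientations should return the same upright content, provided the page follows the layout assumptions on which the method relies.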

Claims (5)

1. An automatic document scanning, analysis and processing system based on intelligent image identification, characterized in that it is an analysis method based on orthogonal generalized projection, comprising the following steps: (1) determination of the effective region in the document image: locating the effective character region (Region of Interest, ROI) in the document and removing the four margin regions of the document image to obtain the minimum rectangular region containing the character region;
(2) 90° deflection detection: determining whether the image is deflected by 90° by calculating the variances of the generalized projection curves of the document image, and correcting the document image accordingly;
(3) 180° inversion detection: determining whether the image is inverted by 180° by calculating the piecewise mean function of the vertical gray integral projection of the document image processed in step (1), and correcting the document image accordingly.
2. The automatic document scanning, analysis and processing system based on intelligent image identification according to claim 1, characterized in that: in step (1), the four margin regions of the document image are removed to obtain the minimum rectangular region containing the character region.
3. The automatic document scanning, analysis and processing system based on intelligent image identification according to claim 1, characterized in that: in step (2), the orthogonal generalized projection function (Orthogonal Generalized Projection Function, OGPF) curves and their variances are calculated, and whether the image is deflected by 90° is judged by comparing the horizontal generalized projection function curve variance GPFV_h (Generalized Projection Function Variance, horizontal) with the vertical generalized projection function curve variance GPFV_v (Generalized Projection Function Variance, vertical).
4. The automatic document scanning, analysis and processing system based on intelligent image identification according to claim 1, characterized in that: in step (3), whether a 180° inversion has occurred is judged by calculating the piecewise mean function of the vertical gray integral projection.
5. The automatic document scanning, analysis and processing system based on intelligent image identification according to claim 1, characterized in that: the analysis method based on orthogonal generalized projection determines the orientation of the scanned document image on the basis that a correctly oriented document has the following characteristics: 1. its lines of characters are horizontally aligned; 2. its lines of characters are separated by regular spacing; 3. because people write documents from top to bottom and from left to right, the character density in the upper-left part is higher than in the lower-right part.
CN2010102932905A 2010-09-27 2010-09-27 Automatic document scanning, analyzing and processing system based on intelligent image identification Pending CN102419817A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102932905A CN102419817A (en) 2010-09-27 2010-09-27 Automatic document scanning, analyzing and processing system based on intelligent image identification

Publications (1)

Publication Number Publication Date
CN102419817A true CN102419817A (en) 2012-04-18

Family

ID=45944220

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102932905A Pending CN102419817A (en) 2010-09-27 2010-09-27 Automatic document scanning, analyzing and processing system based on intelligent image identification

Country Status (1)

Country Link
CN (1) CN102419817A (en)

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101315668A (en) * 2008-07-01 2008-12-03 上海大学 Automatic detection method for test paper form

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张吉玲等: "《数学形态学和投影方差在文档图像倾斜校正中的应用》", 《福建电脑》, no. 3, 30 March 2008 (2008-03-30), pages 100 - 104 *
张顺利等: "《基于投影的文档图像倾斜校正方法》", 《计算机工程与应用》, vol. 46, no. 3, 21 January 2010 (2010-01-21), pages 166 - 168 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103593643B (en) * 2012-08-16 2019-02-12 百度在线网络技术(北京)有限公司 A kind of method and system of image recognition
CN103593643A (en) * 2012-08-16 2014-02-19 百度在线网络技术(北京)有限公司 Image recognizing method and system
CN102945377B (en) * 2012-11-28 2016-06-08 上海合合信息科技发展有限公司 Obtain method and the device of content in papery notebook
CN102945377A (en) * 2012-11-28 2013-02-27 上海合合信息科技发展有限公司 Method and device for obtaining content in paper notebook
CN103941574A (en) * 2014-04-18 2014-07-23 邓伟廷 Intelligent spectacles
CN106372639B (en) * 2016-08-19 2019-03-08 西安电子科技大学 Block letter Uighur document cutting method based on morphology and integral projection
CN106372639A (en) * 2016-08-19 2017-02-01 西安电子科技大学 Morphology and integral projection-based printed Uygur document segmentation method
CN108345827A (en) * 2017-01-24 2018-07-31 富士通株式会社 Identify method, system and the neural network in document direction
CN108345827B (en) * 2017-01-24 2021-11-30 富士通株式会社 Method, system and neural network for identifying document direction
CN107292307A (en) * 2017-07-21 2017-10-24 华中科技大学 One kind is inverted Chinese character identifying code automatic identifying method and system
CN107292307B (en) * 2017-07-21 2019-12-17 华中科技大学 Automatic identification method and system for inverted Chinese character verification code
CN108256475A (en) * 2018-01-17 2018-07-06 北方民族大学 A kind of bill image inversion detection method
CN108256475B (en) * 2018-01-17 2021-05-11 北方民族大学 Bill image inversion detection method
CN110647882A (en) * 2019-09-20 2020-01-03 上海眼控科技股份有限公司 Image correction method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
DD01 Delivery of document by public notice

Addressee: The Guizhou Province power information Technology Co., Ltd. of speeding

Document name: Notification of Publication and of Entering the Substantive Examination Stage of the Application for Invention

DD01 Delivery of document by public notice

Addressee: The Guizhou Province power information Technology Co., Ltd. of speeding

Document name: the First Notification of an Office Action

DD01 Delivery of document by public notice

Addressee: The Guizhou Province power information Technology Co., Ltd. of speeding

Document name: Notification that Application Deemed to be Withdrawn

C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120418