US11087448B2 - Apparatus, method, and non-transitory recording medium for a document fold determination based on the change point block detection - Google Patents
Apparatus, method, and non-transitory recording medium for a document fold determination based on the change point block detection Download PDFInfo
- Publication number
- US11087448B2 US11087448B2 US16/426,226 US201916426226A US11087448B2 US 11087448 B2 US11087448 B2 US 11087448B2 US 201916426226 A US201916426226 A US 201916426226A US 11087448 B2 US11087448 B2 US 11087448B2
- Authority
- US
- United States
- Prior art keywords
- block
- change point
- inclination
- characters
- interval
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related, expires
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/32—Digital ink
-
- G06K9/00463—
-
- G06K9/348—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/0002—Inspection of images, e.g. flaw detection
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/146—Aligning or centring of the image pick-up or image-field
- G06V30/1475—Inclination or skew detection or correction of characters or of image to be recognised
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/158—Segmentation of character regions using character size, text spacings or pitch estimation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/18086—Extraction of features or characteristics of the image by performing operations within image blocks or by using histograms
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/413—Classification of content, e.g. text, photographs or tables
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10004—Still image; Photographic image
- G06T2207/10008—Still image; Photographic image from scanner, fax or copier
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30168—Image quality inspection
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30176—Document
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the present disclosure relates to an image processing apparatus that processes a read image, which is generated by optically reading a document by an image scanner.
- An image scanner optically reads a document and generates a read image.
- an image processing apparatus including:
- controller circuit configured to operate as
- an image obtaining unit configured to obtain a read image, the read image being generated by optically reading a document including multiple characters by an image scanner,
- a block processing unit configured to detect a change point block, the change point block being a block including characters having an inclination included in a first inclination interval, a number of the characters being equal to or larger than a first threshold, and including characters having an inclination included in a second inclination interval, a number of the characters being equal to or larger than the first threshold, the second inclination interval being different from the first inclination interval, and
- a fold determining unit configured to determine that the document is folded if the change point block is detected.
- a non-transitory computer readable recording medium that records an image processing program causing a controller circuit of an image processing apparatus to operate as:
- an image obtaining unit configured to obtain a read image, the read image being generated by optically reading a document including multiple characters by an image scanner;
- a block processing unit configured to detect a change point block, the change point block being a block including characters having an inclination included in a first inclination interval, a number of the characters being equal to or larger than a first threshold, and including characters having an inclination included in a second inclination interval, a number of the characters being equal to or larger than the first threshold, the second inclination interval being different from the first inclination interval; and
- a fold determining unit configured to determine that the document is folded if the change point block is detected.
- an image processing method including:
- an image obtaining unit obtaining a read image, the read image being generated by optically reading a document including multiple characters by an image scanner;
- a block processing unit detecting a change point block, the change point block being a block including characters having an inclination included in a first inclination interval, a number of the characters being equal to or larger than a first threshold, and including characters having an inclination included in a second inclination interval, a number of the characters being equal to or larger than the first threshold, the second inclination interval being different from the first inclination interval; and
- a fold determining unit determining that the document is folded if the change point block is detected.
- FIG. 1 shows a hardware configuration of an image forming apparatus according to an embodiment of the present disclosure
- FIG. 2 shows a functional configuration of the image forming apparatus
- FIG. 3 shows an operational flow of the image forming apparatus
- FIG. 4 shows an example of a read image
- FIG. 5 shows an example of the read image, from which handwritten characters are removed
- FIG. 6 shows an example of a read image divided into multiple blocks
- FIG. 7 schematically shows multiple characters included in one block in a read image
- FIG. 8 shows a part of a character table of an example
- FIG. 9 shows an example of a histogram
- FIG. 10 illustrates an example of blocks included in a read image
- FIG. 11 schematically shows an example of the blocks of FIG. 10 ;
- FIG. 12 schematically shows another example of blocks in a read image.
- FIG. 1 shows a hardware configuration of an image forming apparatus according to an embodiment of the present disclosure.
- the image forming apparatus 10 (for example, MFP, Multifunction Peripheral) including the image scanner 12 will be described as an example of an image processing apparatus.
- an image processing apparatus may be an information processing apparatus (personal computer, tablet computer, smartphone, etc.) (not shown) configured to receive a read image, which is generated by optically reading a document by an external image forming apparatus (not shown).
- the information processing apparatus may receive the read image by directly communicating with the external image forming apparatus.
- the information processing apparatus may download the read image, which is uploaded onto a server apparatus or the like (not shown) by the external image forming apparatus.
- the image forming apparatus 10 includes the controller circuit 100 .
- the controller circuit 100 includes the CPU (Central Processing Unit) 11 a (processor), the RAM (Random Access Memory) 11 b , the ROM (Read Only Memory) 11 c (memory), dedicated hardware circuits, and the like and performs overall operational control of the image forming apparatus 10 .
- the CPU 11 a loads information processing programs (including image processing program) stored in the ROM 11 c in the RAM 11 b and executes the information processing programs.
- the nonvolatile ROM 11 c stores information processing programs executed by the CPU 11 a and data.
- the ROM 11 c is an example of a non-transitory computer readable recording medium.
- the controller circuit 100 is connected to the image scanner 12 , the image processor 14 (including GPU (Graphics Processing Unit)), the image memory 15 , the image forming device 16 (printer), the operation device 17 including the display device 17 a (touch panel), the large-volume nonvolatile storage device 18 such as an HDD (Hard Disk Drive) or an SSD (Solid State Drive), the facsimile communication device 19 , the network communication interface 13 , and the like.
- the controller circuit 100 performs operational control of the respective devices connected thereto and sends/receives signals and data to/from those devices.
- the operation device 17 including the display device 17 a (touch panel) is an embodiment of an input device.
- a sound input device including a microphone may be provided as an input device.
- the image scanner 12 optically reads a document, and generates an image (hereinafter referred to as read image).
- the document optically read by the image scanner 12 includes multiple characters.
- the “characters” are text data generated by using word processor software and printed on a medium (typically, paper).
- the “characters” do not include handwritten characters.
- the document includes, as the multiple characters, multiple characters arrayed in one direction (for example, lateral direction), the multiple characters being arrayed in series in a direction example, longitudinal direction) that crosses (typically, perpendicularly) the one direction.
- the “document” simply means a physical medium (typically, paper) in the present embodiment.
- the image scanner 12 optically reads a document fed by an automatic feeder or a document put on a platen (not shown) by a user.
- FIG. 2 shows a functional configuration of the image forming apparatus.
- the CPU 11 a loads an image processing program stored in the ROM 11 c in the RAM 11 b and executes the image processing program to thereby operate as the image obtaining unit 101 , the handwriting removing unit 102 , the character determining unit 103 , the block processing unit 104 , the fold determining unit 105 , and the fold information output unit 106 .
- FIG. 3 shows an operational flow of the image forming apparatus.
- the image obtaining unit 101 obtains the read image I 1 generated by optically reading a document by the image scanner 12 (Step S 101 ).
- FIG. 4 shows an example of a read image.
- the read image I 1 includes the handwritten characters H 1 and H 2 .
- the handwriting removing unit 102 detects the handwritten characters H 1 and H 2 from the read image I 1 by using a known art (edge detection, etc.). If the handwriting removing unit 102 detects the handwritten characters H 1 and H 2 , the handwriting removing unit 102 removes the handwritten characters H 1 and H 2 from the read image I 1 (Step S 102 ). Note that, if the read image includes no handwritten character (no handwritten character is detected), the handwriting removing unit 102 removes no handwritten character from the read image.
- FIG. 5 shows an example of the read image, from which handwritten characters are removed.
- the read image I 2 is generated by removing the handwritten characters H 1 and H 2 from the read image I 1 ( FIG. 4 ).
- FIG. 6 shows an example of a read image divided into multiple blocks.
- the read image I 3 is a read image different from the read image I 2 ( FIG. 5 ).
- the character determining unit 103 detects multiple characters included in each block (Step S 106 ).
- FIG. 7 schematically shows multiple characters included in one block in a read image.
- the read image I 4 is different from the read image I 3 ( FIG. 6 ) and the read image I 2 ( FIG. 5 ).
- the read image I 4 includes one block M 2 .
- the character determining unit 103 detects multiple characters T 1 to T 5 . . . T 50 to T 52 . . . included in the block M 2 by using a known art (edge detection, etc.).
- the character determining unit 103 detects the inclination of each of the multiple characters T 1 to T 5 . . . T 50 to T 52 . . . included in the block M 2 .
- the “inclination of a character” means the inclination of a character with respect to an arbitrary reference (for example, coordinate system of read image).
- the character determining unit 103 records the inclination and the length of each of the multiple characters as the character table 200 in, for example, the RAM 11 b (Step S 107 ).
- FIG. 8 shows a part of a character table of an example.
- the character table 200 records, for each character, the character number 201 and the inclination 202 .
- the characters T 1 to T 5 and T 50 to T 52 recorded in the character table 200 correspond to the characters T 1 to T 5 and T 50 to T 52 ( FIG. 7 ) detected from the block M 2 in the read image I 4 .
- the character number 201 is a serial number that uniquely identifies each character and, in addition, identifies the serial order of each character.
- the inclination 202 is an angle of ⁇ 180° to 180°, where the lateral direction of the coordinate system of a read image is 0° (reference), for example.
- the block processing unit 104 creates a histogram of the inclinations 202 of all the characters of each of the blocks (Step S 108 ). Specifically, the block processing unit 104 determines an interval, in which the inclination (°) of each character is included.
- FIG. 9 shows an example of a histogram.
- the inclination interval may be a value other than 1° and may be 0.5°, for example.
- the respective characters included in the histogram correspond to the characters recorded in the character table 200 ( FIG. 8 ) and the multiple characters T 1 to T 5 . . . T 50 to T 52 . . . included in the block M 2 in the read image I 4 .
- a block including multiple inclination intervals will be referred to as a “change point block”.
- the character determining unit 103 and the block processing unit 104 executes the processing on and after Step S 106 for all the blocks (Steps S 104 , S 105 , and S 112 ). As a result, all the blocks are sorted into the change point blocks (including multiple inclination intervals), the normal blocks (including one inclination interval), and the invalid blocks (including no valid inclination interval).
- the fold determining unit 105 determines whether or not all the blocks except for the invalid blocks are normal blocks (Step S 113 ). In other words, the fold determining unit 105 determines whether or not a change point block is detected. The case that “all the blocks except for the invalid blocks are normal blocks” (Step S 113 , Yes) means that the inclinations of almost all the characters included in the read image are almost the same. Therefore the fold determining unit 105 determines that the document is not folded (alternatively, probability that at least part including characters is not folded is high) (Step S 119 ).
- Step S 113 the case that “a change point block is detected” (Step S 113 , No) means that the inclinations of the characters included in the read image have variations, i.e., there is a probability that the document is folded.
- the fold determining unit 105 determines change point block types of the change point blocks in the read image as follows.
- FIG. 10 illustrates an example of blocks included in a read image.
- FIG. 11 schematically shows an example of the blocks of FIG. 10 .
- the read image I 5 is a read image different from the read image I 4 ( FIG. 7 ), the read image I 3 ( FIG. 6 ), and the read image I 2 ( FIG. 5 ).
- the read image I 2 includes the change point blocks M 3 , M 4 , M 5 , and M 6 .
- FIG. 11 schematically shows the read image I 5 of FIG. 10 without the characters in order to improve visualization. The arrangement of the blocks of FIG. 10 is the same as the arrangement of the blocks of FIG. 11 .
- the change point block M 3 will be described as an example.
- the representative values (Step S 110 ) of the change point block M 3 are the first inclination interval (“5.5°” indicates the first inclination interval for convenience, which means the inclination interval equal to or larger than 5° and smaller than 6°) and the second inclination interval (“0.5°” indicates the second inclination interval for convenience, which means the inclination interval equal to or larger than 0° and smaller than 1°).
- the fold determining unit 105 virtually locates the first inclination interval (5.5°) of the change point block M 3 in the change point block M 3 such that the first inclination interval (5.5°) is adjacent to one normal block M 31 , the one normal block M 31 including characters having the inclination included in the first inclination interval (5.5°) and being adjacent to the change point block M 3 .
- the fold determining unit 105 virtually locates the second inclination interval (0.5°) of the change point block M 3 in the change point block M 3 such that the second inclination interval (0.5°) is adjacent to one other normal block M 32 , the one other normal block M 32 including characters having the inclination included in the second inclination interval (0.5°) and being adjacent to the change point block M 3 .
- the fold determining unit 105 virtually locates the first inclination interval (5.5°) in the change point block M 3 such that the first inclination interval (5.5°) is adjacent to the normal block M 31 having the common first inclination interval (5.5°).
- the fold determining unit 105 virtually locates the second inclination interval (0.5°) in the change point block M 3 such that the second inclination interval (0.5°) is adjacent to the normal block M 32 having the common second inclination interval (0.5°).
- the fold determining unit 105 virtually locates the first inclination interval (5.5°) in the change point block M 4 such that the first inclination interval (5.5°) is adjacent to one normal block M 41 having the common first inclination interval (5.5°).
- the fold determining unit 105 virtually locates the second inclination interval (0.5°) in the change point block M 4 such that the second inclination interval (0.5°) is adjacent to one other normal block M 42 having the common second inclination interval (0.5°).
- the fold determining unit 105 virtually locates the first inclination interval (5.5°) in the change point block M 5 such that the first inclination interval (5.5°) is adjacent to one normal block M 51 having the common first inclination interval (5.5°).
- the fold determining unit 105 virtually locates the second inclination interval (0.5°) in the change point block M 5 such that the second inclination interval (0.5°) is adjacent to one other normal block M 52 having the common second inclination interval (0.5°).
- the fold determining unit 105 virtually locates the first inclination interval (5.5°) in the change point block M 6 such that the first inclination interval (5.5°) is adjacent to one normal block M 61 having the common first inclination interval (5.5°).
- the fold determining unit 105 virtually locates the second inclination interval (0.5°) in the change point block M 6 such that the second inclination interval (0.5°) is adjacent to one other normal block M 62 having the common second inclination interval (0.5°).
- a change point block type is defined by values and a positional relation of the first inclination interval and the second inclination interval.
- the change point block type of all the change point blocks M 3 , M 4 , M 5 , and M 6 is common.
- the number of the change point block type of the change point blocks M 3 , M 4 , M 5 , and M 6 is one.
- the fold determining unit 105 determines whether or not the document is folded (alternatively, at least the probability that the document is folded is high) on the basis of the number of change point block types included in the document image.
- Step S 114 determines that the document image includes even one change point block type (Step S 116 , Yes)
- the fold determining unit 105 determines that the document is folded (alternatively, at least the probability that the document is folded is high) (Step S 120 ).
- Step S 116 determines that the document image includes no change point block
- Step S 119 determines that the document is not folded (alternatively, probability that at least part including characters is not folded is high
- Step S 114 determines that the number of change point block types of the change point blocks included in the document image is equal to or larger than 1 and equal to or smaller than 3 (equal to or smaller than second threshold) (Step S 117 , Yes)
- the fold determining unit 105 determines that the document is folded (alternatively, at least the probability that the document is folded is high) (Step S 120 ).
- the fold determining unit 105 determines that the number of change point block types of the change point blocks included in the document image is larger than 3 (larger than second threshold) (Step S 117 , No), the fold determining unit 105 determines that the document is not folded (alternatively, probability that at least part including characters is not folded is high) (Step S 119 ).
- Step S 114 determines that the number of change point block types of the change point blocks included in the document image is equal to or larger than 1 and equal to or smaller than 5 (equal to or smaller than second threshold) (Step S 118 , Yes)
- the fold determining unit 105 determines that the document is folded (alternatively, at least the probability that the document is folded is high) (Step S 120 ).
- the fold determining unit 105 determines that the number of change point block types of the change point blocks included in the document image is larger than 5 (larger than second threshold) (Step S 118 , No), the fold determining unit 105 determines that the document is not folded (alternatively, probability that at least part including characters is not folded is high) (Step S 119 ).
- the fold determining unit 105 determines that the number of the change point block types is larger than the second threshold (Step S 117 , No, and S 118 , No). The reason of that will be described.
- FIG. 12 schematically shows another example of blocks in a read image.
- FIG. 12 schematically shows a read image (not shown) without characters in order to improve visualization.
- the read image I 6 includes the change point blocks M 71 , M 72 , M 73 , M 74 , M 81 , M 82 , M 83 , and M 84 .
- the values (5.5/0.5) and the positional relation (low/high in coordinate system) of the first inclination interval and the second inclination interval of the change point blocks M 71 , M 72 , M 73 , and M 74 are common.
- the values (5.5/0.5) and the positional relation (high/low in coordinate system) of the first inclination interval and the second inclination interval of the change point blocks M 81 , M 82 , M 83 , and M 84 are common.
- the values (5.5/0.5) of the first inclination interval and the second inclination interval of the change point blocks M 71 , M 72 , M 73 , M 74 , M 81 , M 82 , M 83 , and M 84 are common.
- the positional relation (low/high or high/low in coordinate system) of the values (5.5/0.5) is not common. Therefore, in this case, the number of change point block types defined by the values and the positional relation of the first inclination interval and the second inclination interval is “2”.
- the fold determining unit 105 determines that the document is folded (alternatively, at least the probability that the document is folded is high) (Step S 120 ).
- the fold determining unit 105 determines that the number of the change point block types is larger than the second threshold (Step S 117 , No, and S 118 , No).
- the fold determining unit 105 determines that the document is not folded (Step S 119 ). Note that, if the number of the change point block types is relatively large, there is a probability that a user should have folded the document on purpose (in order to mask part of the document, etc.). However, if the user have folded the document on purpose, the fold determining unit 105 may determine that the document is not (unintentionally) folded (Step S 119 ).
- the fold information output unit 106 outputs (for example, displays on the display device 17 a ) information indicating that the document is folded (Step S 121 ).
- the “information indicating that the document is folded” includes, for example, a message for urging a user to re-scan the document.
- the “information indicating that the document is folded” includes the result of optical character recognition (OCR) of the read image I 2 and a message for urging a user to check the result.
- OCR optical character recognition
- the fold information output unit 106 may output the information indicating that the document is folded before starting the operational flow (on and after Step S 101 ) for the next page.
- the change point block being a block including characters having an inclination included in a first inclination interval, a number of the characters being equal to or larger than a first threshold, and including characters having an inclination included in a second inclination interval, a number of the characters being equal to or larger than the first threshold, the second inclination interval being different from the first inclination interval (Step S 116 , Yes)
- the fold determining unit 105 determines that the document is folded (Step S 120 ).
- the fold determining unit 105 determines that a document is folded if the fold determining unit 105 detects characters having different inclination intervals in one block, the number of the characters included in each of the different inclination intervals being equal to or larger than the first threshold. Therefore the fold determining unit 105 may determine that a document is folded with a high degree of precision.
- the fold determining unit 105 determines that a document is folded (Step S 120 ) on the basis of the number of the change point block types (Step S 117 , Yes, and S 118 , Yes). Therefore the fold determining unit 105 may determine that a document is folded with a much higher degree of precision than a case where a change point block type is not determined.
- the fold determining unit 105 determines that a document is not folded (Step S 119 ). So, if the number of the change point block types is larger than the second threshold because, for example, there is a probability that the characters are originally inclined randomly (for example, in multiple directions from one point), the fold determining unit 105 does not determine that the document is folded. Therefore the fold determining unit 105 may determine that a document is folded with a much higher degree of precision.
- the block processing unit 104 excludes, from the change point block, an invalid block (Step S 109 , No), the invalid block being a block failing to include an inclination interval including characters having an inclination, a number of the characters being equal to or larger than the first threshold (Step S 111 ).
- an invalid block (Step S 109 , No)
- the invalid block being a block failing to include an inclination interval including characters having an inclination, a number of the characters being equal to or larger than the first threshold (Step S 111 ).
- a block only including a very small number (for example, less than first threshold) of characters is excluded from a block including characters whose inclinations are to be determined.
- the probability that it is improperly determined that a document is folded even if the document is not folded actually may be reduced.
- the fold determining unit 105 determines that the document is not folded if the change point block is not detected (Step S 113 , Yes). As a result, if the inclinations of the many characters included in the read image are approximately the same, the fold determining unit 104 determines that the document is not folded (alternatively, the probability that at least part including the characters is not folded is high). Therefore the fold determining unit 104 may determine that a document is not folded with a high degree of precision.
- the character determining unit 103 detects the multiple characters (Step S 103 ) from the read image I 2 from which the handwritten characters H 1 and H 2 are removed (Step S 102 ).
- the inclination of handwritten characters may be different from the inclination of printed characters. Because of that, if characters (including the handwritten characters H 1 and H 2 ) are detected from the read image I 1 including the handwritten characters H 1 and H 2 , since the inclinations of handwritten characters H 1 and H 2 are different from the inclination of prints characters, it may be improperly determined that a document is folded even if the document is not folded actually. To the contrary, according to the present embodiment, the handwritten characters H 1 and H 2 are removed. As a result, the probability that it is improperly determined that a document is folded even if the document is not folded actually may be reduced.
- fold information output unit 107 outputs information indicating that the document is folded (Step S 121 ).
- a user may re-scan the document.
- a user may check, with the eyes, the result of optical character recognition (OCR) of the read image I 2 .
- OCR optical character recognition
- the fold information output unit 105 may output the information indicating that the document is folded before starting the operational flow (on and after Step S 101 ) for the next page.
- a user may know which page of document is folded. In other words, it is not necessary for the user to search all the pages of read images for a folded document page afterwards. So there is no loss of time.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computer Graphics (AREA)
- Geometry (AREA)
- Quality & Reliability (AREA)
- Character Input (AREA)
Abstract
Description
-
- divide the read image into multiple blocks, each of the multiple blocks including multiple characters, and
- determine an inclination of each of the multiple characters included in each of the multiple blocks,
-
- divide the read image into multiple blocks, each of the multiple blocks including multiple characters, and
- determine an inclination of each of the multiple characters included in each of the multiple blocks;
-
- dividing the read image into multiple blocks, each of the multiple blocks including multiple characters, and
- determining an inclination of each of the multiple characters included in each of the multiple blocks;
Claims (19)
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/426,226 US11087448B2 (en) | 2019-05-30 | 2019-05-30 | Apparatus, method, and non-transitory recording medium for a document fold determination based on the change point block detection |
| JP2020068435A JP2020198086A (en) | 2019-05-30 | 2020-04-06 | Image processing apparatus, image processing program, and image processing method |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/426,226 US11087448B2 (en) | 2019-05-30 | 2019-05-30 | Apparatus, method, and non-transitory recording medium for a document fold determination based on the change point block detection |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20200380657A1 US20200380657A1 (en) | 2020-12-03 |
| US11087448B2 true US11087448B2 (en) | 2021-08-10 |
Family
ID=73549541
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/426,226 Expired - Fee Related US11087448B2 (en) | 2019-05-30 | 2019-05-30 | Apparatus, method, and non-transitory recording medium for a document fold determination based on the change point block detection |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US11087448B2 (en) |
| JP (1) | JP2020198086A (en) |
Citations (34)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4637057A (en) * | 1984-07-30 | 1987-01-13 | Xerox Corporation | Rotation of digital images |
| US5172422A (en) * | 1991-05-13 | 1992-12-15 | Eastman Kodak Company | Fast character segmentation of skewed text lines for optical character recognition |
| US5359706A (en) * | 1991-11-07 | 1994-10-25 | Thomas Sterling | Image rotation using block transfers |
| US5692069A (en) * | 1995-03-17 | 1997-11-25 | Eastman Kodak Company | Apparatus for performing character segmentation using slant histograms |
| US5986672A (en) * | 1997-10-14 | 1999-11-16 | Minnesota, Mining And Manufacturing 3M Center | Method and system for forming a rotated image on an imaging element using limited system resources |
| US6173088B1 (en) * | 1996-10-01 | 2001-01-09 | Canon Kabushiki Kaisha | Image forming method and apparatus |
| JP2002051218A (en) | 2000-05-31 | 2002-02-15 | Internatl Business Mach Corp <Ibm> | Method and device for correcting defect of digital image |
| US6566340B2 (en) * | 1998-10-02 | 2003-05-20 | Aventis Pharma Deutschland Gmbh | Aryl-substituted propanolamine derivatives, their preparation, pharmaceuticals comprising them, and their use |
| US6785428B1 (en) * | 1999-10-05 | 2004-08-31 | Adobe Systems Incorporated | Rotated transform of an image using block transfers |
| EP1455299A2 (en) * | 2003-01-30 | 2004-09-08 | Samsung Electronics Co., Ltd. | Device and method for binarizing image |
| US6956587B1 (en) * | 2003-10-30 | 2005-10-18 | Microsoft Corporation | Method of automatically cropping and adjusting scanned images |
| US7003160B2 (en) * | 2000-08-30 | 2006-02-21 | Minolta Co., Ltd. | Image processing apparatus, image processing method, and computer readable recording medium recording image processing program for processing image obtained by picking up or reading original |
| US7106904B2 (en) * | 2001-04-25 | 2006-09-12 | Hitachi, Ltd. | Form identification method |
| KR20070078509A (en) * | 2006-01-27 | 2007-08-01 | 노틸러스효성 주식회사 | Character recognition method of giro ticket holder |
| US20080267502A1 (en) * | 2007-04-30 | 2008-10-30 | Kevin Youngers | Variable skew correction system and method |
| US20100149603A1 (en) * | 2008-12-16 | 2010-06-17 | Brother Kogyo Kabushiki Kaisha | Image reading apparatus |
| US20100238459A1 (en) * | 2009-03-23 | 2010-09-23 | Yoshirou Yamazaki | Dot position measurement method and apparatus, and computer readable medium |
| US20100239165A1 (en) * | 2006-03-02 | 2010-09-23 | Compulink Management Center ,Inc. a corporation | Model-Based Dewarping Method And Apparatus |
| JP2014068243A (en) | 2012-09-26 | 2014-04-17 | Canon Electronics Inc | Image processing system, control method thereof, and sheet for reading |
| US20140126811A1 (en) * | 2012-11-02 | 2014-05-08 | Fuji Xerox Co., Ltd. | Image processing apparatus, image processing method, and storage medium |
| US8855418B2 (en) * | 2009-05-18 | 2014-10-07 | Citrix Systems, Inc. | Systems and methods for block recomposition for compound image compression |
| US20150281513A1 (en) * | 2014-03-31 | 2015-10-01 | Brother Kogyo Kabushiki Kaisha | Technique for image processing |
| JP2015198306A (en) | 2014-03-31 | 2015-11-09 | ブラザー工業株式会社 | Image processing apparatus and computer program |
| US20160180164A1 (en) * | 2013-08-12 | 2016-06-23 | Beijing Branch Office Of Foxit Corporation | Method for converting paper file into electronic file |
| JP2016157113A (en) | 2015-02-24 | 2016-09-01 | 日東電工株式会社 | Sound absorption material |
| JP2016158113A (en) * | 2015-02-25 | 2016-09-01 | 京セラドキュメントソリューションズ株式会社 | Image reading device and image forming apparatus |
| JP2017028447A (en) | 2015-07-21 | 2017-02-02 | キヤノン電子株式会社 | Image processing device and image processing method |
| US20170220886A1 (en) * | 2009-11-10 | 2017-08-03 | Icar Vision Systems, S.L. | Method and system for reading and validating identity documents |
| US20170366705A1 (en) * | 2016-06-17 | 2017-12-21 | Pfu Limited | Image-processing apparatus, method, and computer program product for correcting skew in scanned images |
| US20190205638A1 (en) * | 2017-12-28 | 2019-07-04 | Baidu Online Network Technology (Beijing) Co., Ltd . | Method and apparatus for training a character detector based on weak supervision, system and medium |
| US20190303702A1 (en) * | 2018-03-28 | 2019-10-03 | I.R.I.S. | Image processing system and an image processing method |
| US20190354791A1 (en) * | 2018-05-17 | 2019-11-21 | Idemia Identity & Security France | Character recognition method |
| US20190354818A1 (en) * | 2018-05-18 | 2019-11-21 | Sap Se | Two-dimensional document processing |
| US10606933B2 (en) * | 2002-03-01 | 2020-03-31 | Xerox Corporation | Method and system for document image layout deconstruction and redisplay |
-
2019
- 2019-05-30 US US16/426,226 patent/US11087448B2/en not_active Expired - Fee Related
-
2020
- 2020-04-06 JP JP2020068435A patent/JP2020198086A/en active Pending
Patent Citations (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4637057A (en) * | 1984-07-30 | 1987-01-13 | Xerox Corporation | Rotation of digital images |
| US5172422A (en) * | 1991-05-13 | 1992-12-15 | Eastman Kodak Company | Fast character segmentation of skewed text lines for optical character recognition |
| US5359706A (en) * | 1991-11-07 | 1994-10-25 | Thomas Sterling | Image rotation using block transfers |
| US5692069A (en) * | 1995-03-17 | 1997-11-25 | Eastman Kodak Company | Apparatus for performing character segmentation using slant histograms |
| US6173088B1 (en) * | 1996-10-01 | 2001-01-09 | Canon Kabushiki Kaisha | Image forming method and apparatus |
| US5986672A (en) * | 1997-10-14 | 1999-11-16 | Minnesota, Mining And Manufacturing 3M Center | Method and system for forming a rotated image on an imaging element using limited system resources |
| US6566340B2 (en) * | 1998-10-02 | 2003-05-20 | Aventis Pharma Deutschland Gmbh | Aryl-substituted propanolamine derivatives, their preparation, pharmaceuticals comprising them, and their use |
| US6785428B1 (en) * | 1999-10-05 | 2004-08-31 | Adobe Systems Incorporated | Rotated transform of an image using block transfers |
| US6731795B1 (en) * | 2000-05-31 | 2004-05-04 | International Business Machines Corporation | Method and apparatus for removing defects from digital images |
| JP2002051218A (en) | 2000-05-31 | 2002-02-15 | Internatl Business Mach Corp <Ibm> | Method and device for correcting defect of digital image |
| US7003160B2 (en) * | 2000-08-30 | 2006-02-21 | Minolta Co., Ltd. | Image processing apparatus, image processing method, and computer readable recording medium recording image processing program for processing image obtained by picking up or reading original |
| US7106904B2 (en) * | 2001-04-25 | 2006-09-12 | Hitachi, Ltd. | Form identification method |
| US10606933B2 (en) * | 2002-03-01 | 2020-03-31 | Xerox Corporation | Method and system for document image layout deconstruction and redisplay |
| EP1455299A2 (en) * | 2003-01-30 | 2004-09-08 | Samsung Electronics Co., Ltd. | Device and method for binarizing image |
| EP1455299A3 (en) * | 2003-01-30 | 2006-05-03 | Samsung Electronics Co., Ltd. | Device and method for binarizing image |
| US6956587B1 (en) * | 2003-10-30 | 2005-10-18 | Microsoft Corporation | Method of automatically cropping and adjusting scanned images |
| KR20070078509A (en) * | 2006-01-27 | 2007-08-01 | 노틸러스효성 주식회사 | Character recognition method of giro ticket holder |
| US20100239165A1 (en) * | 2006-03-02 | 2010-09-23 | Compulink Management Center ,Inc. a corporation | Model-Based Dewarping Method And Apparatus |
| US20080267502A1 (en) * | 2007-04-30 | 2008-10-30 | Kevin Youngers | Variable skew correction system and method |
| US20100149603A1 (en) * | 2008-12-16 | 2010-06-17 | Brother Kogyo Kabushiki Kaisha | Image reading apparatus |
| US20100238459A1 (en) * | 2009-03-23 | 2010-09-23 | Yoshirou Yamazaki | Dot position measurement method and apparatus, and computer readable medium |
| US8855418B2 (en) * | 2009-05-18 | 2014-10-07 | Citrix Systems, Inc. | Systems and methods for block recomposition for compound image compression |
| US20170220886A1 (en) * | 2009-11-10 | 2017-08-03 | Icar Vision Systems, S.L. | Method and system for reading and validating identity documents |
| JP2014068243A (en) | 2012-09-26 | 2014-04-17 | Canon Electronics Inc | Image processing system, control method thereof, and sheet for reading |
| US20140126811A1 (en) * | 2012-11-02 | 2014-05-08 | Fuji Xerox Co., Ltd. | Image processing apparatus, image processing method, and storage medium |
| US20160180164A1 (en) * | 2013-08-12 | 2016-06-23 | Beijing Branch Office Of Foxit Corporation | Method for converting paper file into electronic file |
| US20150281513A1 (en) * | 2014-03-31 | 2015-10-01 | Brother Kogyo Kabushiki Kaisha | Technique for image processing |
| JP2015198306A (en) | 2014-03-31 | 2015-11-09 | ブラザー工業株式会社 | Image processing apparatus and computer program |
| US9374500B2 (en) * | 2014-03-31 | 2016-06-21 | Brother Kogyo Kabushiki Kaisha | Image processing apparatus configured to execute correction on scanned image |
| JP2016157113A (en) | 2015-02-24 | 2016-09-01 | 日東電工株式会社 | Sound absorption material |
| JP2016158113A (en) * | 2015-02-25 | 2016-09-01 | 京セラドキュメントソリューションズ株式会社 | Image reading device and image forming apparatus |
| JP2017028447A (en) | 2015-07-21 | 2017-02-02 | キヤノン電子株式会社 | Image processing device and image processing method |
| US20170366705A1 (en) * | 2016-06-17 | 2017-12-21 | Pfu Limited | Image-processing apparatus, method, and computer program product for correcting skew in scanned images |
| US20190205638A1 (en) * | 2017-12-28 | 2019-07-04 | Baidu Online Network Technology (Beijing) Co., Ltd . | Method and apparatus for training a character detector based on weak supervision, system and medium |
| US20190303702A1 (en) * | 2018-03-28 | 2019-10-03 | I.R.I.S. | Image processing system and an image processing method |
| US20190354791A1 (en) * | 2018-05-17 | 2019-11-21 | Idemia Identity & Security France | Character recognition method |
| US20190354818A1 (en) * | 2018-05-18 | 2019-11-21 | Sap Se | Two-dimensional document processing |
Non-Patent Citations (1)
| Title |
|---|
| A Survey of Methods—Segmentation, Richard G. Casey et al., IEEE, 0162-8828, Jul. 1996, pp. 1-17 (Year: 1996). * |
Also Published As
| Publication number | Publication date |
|---|---|
| US20200380657A1 (en) | 2020-12-03 |
| JP2020198086A (en) | 2020-12-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9189694B2 (en) | Image processing device and image processing method | |
| US10740899B2 (en) | Image processing apparatus for identifying region within image, information processing method, and storage medium | |
| US10395131B2 (en) | Apparatus, method and non-transitory storage medium for changing position coordinates of a character area stored in association with a character recognition result | |
| US9626738B2 (en) | Image processing apparatus, image processing method, and storage medium | |
| KR20100000190A (en) | Method for recognizing character and apparatus therefor | |
| US8538154B2 (en) | Image processing method and image processing apparatus for extracting heading region from image of document | |
| US10359727B2 (en) | Image processing apparatus, image processing method, and storage medium, that determine a type of edge pixel | |
| US10706581B2 (en) | Image processing apparatus for clipping and sorting images from read image according to cards and control method therefor | |
| US9818028B2 (en) | Information processing apparatus for obtaining a degree of similarity between elements | |
| US10638001B2 (en) | Information processing apparatus for performing optical character recognition (OCR) processing on image data and converting image data to document data | |
| US11470211B2 (en) | Image processing apparatus for generating an electronic file of a document image from an optically captured image, and non-transitory computer readable recording medium that records image processing program for generating an electronic file of a document image from an optically captured image | |
| US10623603B1 (en) | Image processing apparatus, non-transitory computer readable recording medium that records an image processing program, and image processing method | |
| US11087448B2 (en) | Apparatus, method, and non-transitory recording medium for a document fold determination based on the change point block detection | |
| US11354890B2 (en) | Information processing apparatus calculating feedback information for partial region of image and non-transitory computer readable medium storing program | |
| US10049269B2 (en) | Information processing apparatus, information processing method, and non-transitory computer readable medium | |
| US12183101B2 (en) | Image processing system, image processing method, and storage medium | |
| US20140300790A1 (en) | Image processing apparatus, and non-transitory computer readable medium storing image processing program | |
| US10356276B2 (en) | Image processing apparatus, image forming apparatus, and computer readable medium | |
| US11972208B2 (en) | Information processing device and information processing method | |
| US20200410276A1 (en) | Apparatus, storage medium, and control method | |
| US10623598B2 (en) | Image processing apparatus and non-transitory computer readable medium for extracting and connecting inherent regions of multiple pages of document data | |
| US12424012B2 (en) | Information processing apparatus | |
| US11316995B2 (en) | Bending detection device and image processing apparatus | |
| US10129430B2 (en) | Information processing apparatus and data arrangement method for creating an electronic watermark | |
| JP2011070327A (en) | Device, method and program for determining image attribute |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: KYOCERA DOCUMENT SOLUTIONS INC., JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NAKAMURA, MASAYOSHI;REEL/FRAME:049317/0900 Effective date: 20190528 |
|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO EX PARTE QUAYLE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
| FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20250810 |