INCORPORATION BY REFERENCE
The disclosure of Japanese Patent Application No. 2013-271375 filed on Dec. 27, 2013 including the specification, drawings and abstract is incorporated herein by reference in its entirety.
BACKGROUND
This disclosure relates to an image processing apparatus.
Some systems use a business management server that extracts specific information from images of application forms using suitable clipping patterns for various types of business document formats.
SUMMARY
An image processing apparatus according to an aspect of the present disclosure includes an additional-object registration unit and a read-image processing unit. A setting form contains: (a) an additional-object specification field used by a user to present an additional object that is placed onto a document in order to specify an extract area to be extracted from an image read from the document; and (b) a processing specification field used by the user to select processing to be performed on information obtained from the extract area. The additional-object registration unit identifies an image of the additional object presented in the additional-object specification field and the processing selected in the processing specification field on the read image of the setting form. The additional-object registration unit establishes an association between the image of the additional object and the processing, and registers the image and the processing. The read-image processing unit searches the read image of the document for the image of the additional object. The read-image processing unit performs the processing associated with the image of the additional object on the information obtained from the extract area specified by the image of the additional object.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram showing the configuration of an image processing apparatus according to an embodiment of the disclosure.
FIG. 2 illustrates an example of a setting form according to the first embodiment.
FIG. 3 illustrates an example of the setting form with additional objects placed and check boxes marked by a user to specify processing to be performed.
FIG. 4 illustrates an example of a document.
FIG. 5 illustrates an example of the document with registered additional objects placed thereon.
FIG. 6 illustrates an example of a registration information sheet.
FIG. 7 illustrates an example of specification of an extract area by transparent sticky notes.
FIG. 8 illustrates an example of the setting form according to the fourth embodiment.
FIG. 9 illustrates an example of a condition sheet.
DETAILED DESCRIPTION
With reference to the accompanying drawings, embodiments of the present disclosure will be described below.
First Embodiment
FIG. 1 is a block diagram showing the configuration of an image processing apparatus according to the first embodiment of the disclosure. The image processing apparatus 1 shown in FIG. 1 is a multifunctional peripheral including a communication device 11, a printing device 12, an image reading device 13, a processing device 14, and a storage device 15.
The communication device 11 is connectable to a server 2 via a network and performs data communications using a predetermined communications protocol. The server 2 is used to perform character recognition processing to read handwritten characters.
The printing device 12 is an internal device to print an image of an original document onto a printer sheet in, for example, an electrophotographic method. The printing device 12 subjects original image data to predetermined image processing, such as rasterization, color conversion, and screen processing, to produce output image data that is in turn printed out. The printing device 12 is used to print various kinds of forms and sheets which will be described later.
The image reading device 13 is an internal device that optically reads an image from an original document (various forms and sheets described below) to produce image data of the original document image.
The processing device 14 is a computer equipped with a central processing unit (CPU), a read only memory (ROM), a random access memory (RAM) and other components and functionally operates as various processing units by loading a program stored in the storage device 15, such as the ROM, into the RAM and executing the program with the CPU. The storage device 15 is a nonvolatile storage device that stores data, programs, etc.
The processing device 14 functions as an additional-object registration unit 21 and a read-image processing unit 22.
A setting form includes: (a) an additional-object specification field used by a user to present an additional object that is placed onto a document to specify an extract area to be extracted from an image read from the document; and (b) a processing specification field used by the user to select processing to be performed on information obtained from the extract area. The additional-object registration unit 21 identifies an image of the additional object presented in the additional-object specification field and the processing selected in the processing specification field on the read image of the setting form, establishes an association between the image of the additional object and the processing, and registers the image and processing associated therewith.
The read-image processing unit searches the read image of the document for the image of the additional object and performs the processing associated with the image of the additional object on information obtained from the extract area specified by the image of the additional object.
In the first embodiment, a single setting form includes a plurality of additional-object specification fields and a plurality of processing specification fields associated with the additional-object specification fields, respectively. The additional-object registration unit 21 identifies a plurality of images of additional objects presented in the additional-object specification fields and a plurality of types of processing selected in the processing specification fields on the read image of the setting form, establishes associations between each of the images of the additional objects and the processing, and registers the images and the processing associated therewith. Then, the read-image processing unit 22 searches the read image of the document for the registered images of the additional objects and performs processing associated with the detected images of the additional objects on information obtained from extract areas specified by the images of the detected additional objects.
In the first embodiment, the additional objects are sticky notes (repositionable notes) having a predetermined shape, a color and a pattern. The read-image processing unit 22 searches the read image of the document for the images of the additional objects by pattern-matching.
In addition, the additional objects in the first embodiment may be sticky notes having the same shape, but different patterns (e.g., sequential numbers, 1, 2, 3 . . . or alphabetical letters, a, b, c . . . ).
Furthermore, the aforementioned “processing” in the first embodiment includes: (a) creation of a file name of a file for the read image of the document based on the information extracted from the extract areas; (b) character recognition processing performed on the information extracted from the extract areas; and (c) creation of metadata of the file based on the information extracted from the extract areas. The metadata includes various types of attribute data contained in the files.
FIG. 2 illustrates an example of the setting form according to the first embodiment.
The setting form shown in FIG. 2 includes a plurality of additional-object specification fields 41 and a plurality of check box arrays 42 that are processing specification fields associated with the additional-object specification fields 41.
Each of the check box arrays 42 includes three check boxes ((1), (2), (3) of FILE NAME) to create a file name, two check boxes ((1), (2) of FOLDER NAME) to create a folder name, a check box to select typewritten character recognition processing (OCR TYPE), a check box to select handwritten character recognition processing (OCR HAND), a check box to cut out an image in the extract areas (CUT IMAGE), and five check boxes (HEADING, NUMBER, DATE, ADDRESS, and NAME of METADATA) to create metadata (HEADING; NUMBER, DATE, ADDRESS, and NAME). If a checkmark is placed in the check box of “OCR HAND”, the read-image processing unit 22 transmits an image in an extract area to the server 2 through the communication device 11, causes the server 2 to perform the handwritten character recognition processing on the image in the extract area, and receives the processing results from the server 2.
The check box (i) (i=1, 2, 3) of FILE NAME is used to designate text obtained from the corresponding extract area as the i-th word of a file name, while the check box (i) (i=1, 2) of FOLDER NAME is used to designate text obtained from the corresponding extract area as the i-th word of a folder name. The i-th word and the (i-th+1) word are linked with a punctuation character that is selected by a user who places a checkmark in a check box for selecting a punctuation character in the setting form.
FIG. 3 illustrates an example of the setting form with additional objects placed and check boxes marked by a user to specify processing to be performed. Specifically, the user places desired additional objects (i.e., sticky notes in this embodiment) in the additional-object specification fields to register the additional objects and places checkmarks in check boxes to perform desired processing on information (text, image, etc.) obtained from extract areas extracted by the additional objects.
For example, FIG. 3 shows a setting form with an additional object 51 placed in an additional-object specification field 41 and checkmarks in check box (1) of FILE NAME, in check box (1) of FOLDER NAME, and in the check box of OCR TYPE. Thus, an image obtained from an extract area specified by the additional object 51 is subjected to typewritten character recognition processing, and the text resulted from the processing is used as the first word of the file name and the first word of the folder name of the document image.
The setting form shown in FIG. 3 also has additional objects 52 to 55 placed in other additional-object specification fields 41 and checkmarks placed in check boxes of check box arrays 42 associated with the additional objects.
FIG. 4 illustrates an example of the document. FIG. 5 illustrates an example of the document with registered additional objects placed thereon.
As shown in FIGS. 4 and 5, the additional objects 51 specify an extract area so as to include a character string “INVOICE”, which is the title of the document, the additional objects 52 specify an extract area so as to include a character string of a name of an issuer of the document, the additional objects 53 specify an extract area so as to include a character string of date, the additional objects 54 specify an extract area so as to include a handwritten signature of a person in charge, and the additional objects 55 specify an extract area so as to include a character string of a typewritten name of the person in charge.
Next, the operation of the image processing apparatus will be described.
(1) Registration of Additional Object Used to Specify Extract Area in Document
In response to predetermined user operation, the additional-object registration unit 21 causes the printing device 12 to print out a setting form. Since an image data of the setting form is stored in the storage device 15 in advance, the setting form is printed out from the image data. Then, a user uses the printed setting form as shown in FIG. 2 to place desired additional objects in additional-object specification fields 41 and checkmarks in check boxes of check box arrays 42 as shown in FIG. 3, for example. Then, the user operates the image processing apparatus 1 to read the setting form with the additional objects placed in the additional-object specification fields 41 and the checkmarks placed in the check boxes of the check box arrays 42 through the image reading device 13.
In the image processing apparatus 1, the image reading device 13 produces image data of the read image of the setting form with the additional objects placed in the additional-object specification fields 41 and the checkmarks placed in the check boxes of the check box arrays 42. The additional-object registration unit 21 refers to the image data to extract images of the additional objects in the additional-object specification fields 41, while identifying check boxes with the checkmarks in the check box arrays 42 associated with the additional objects to identify processing specified by the user based on the identified check boxes. The additional-object registration unit 21 establishes associations between the images of the additional objects and the processing and stores the images and processing in the storage device 15.
Through the procedure, the user's desired additional objects and the associated processing are registered.
(2) Document Processing
After registration of the additional objects and processing associated therewith, for example, the user uses a document to be read as shown in FIG. 4 to place additional objects on the document to specify extract areas as shown in FIG. 5. Then, the user operates the image processing apparatus 1 to read the document with the additional objects placed thereon through the image reading device 13.
In the image processing apparatus 1, the image reading device 13 produces image data of the read image of the document with the additional objects placed thereon. The read-image processing unit 22 refers to the image data to search the read image of the document for the registered images of the additional objects by pattern-matching.
Upon detecting two additional object images of one kind, the read-image processing unit 22 identifies an extract area enclosed by the two additional object images. For example, the extract area identified is a rectangle with a diagonal line connecting the two additional object images at the shortest distance. Alternatively, if the images of the additional objects are in a predetermined shape, like a rectangle, the extract area may be configured to be a rectangle with a diagonal line connecting predetermined vertices of the two additional object images at the shortest distance.
Then, the read-image processing unit 22 extracts an image in the identified extract area and performs specified processing on the image in the extract area (e.g., character recognition, creation of file name and metadata).
For example, if the additional objects and processing on the setting form as shown in FIG. 3 are registered and the extract areas are specified by the additional objects 51 to 55 as shown in FIG. 5, the file name for the read image of the document is determined as “INVOICE_ABC Co. Ltd.—20131010.pdf”, and the folder name is determined as “INVOICE/ABC Co. Ltd./”. In this embodiment, if date “Oct. 10, 2013” is extracted through character recognition processing, the date is automatically translated into a string of numbers. Then, the read image of the document is saved under the file name of “INVOICE_ABC Co. Ltd.—20131010.pdf” in a folder named “INVOICE/ABC Co. Ltd./” in the storage device 15.
According to the above-described first embodiment, the read image of the setting form includes: (a) the additional-object specification fields used by a user to present the additional objects that are placed onto a document to specify extract areas to be extracted from the image read from the document; and (b) the processing specification fields used by the user to select processing to be performed on information obtained from the extract areas. The additional-object registration unit 21 identifies the images of the additional objects presented in the additional-object specification fields and the processing selected in the processing specification fields, establishes associations between the images of the additional objects and the processing, and registers the images of the identified additional objects and the processing associated therewith. The read-image processing unit 22 searches the read image of the document for the images of the additional objects and performs the processing associated with the image of the additional objects on the information obtained from the extract areas specified by the images of the additional objects.
Thus, the user can set a position to extract particular information from various types of documents in a suitable way for the respective document formats.
Second Embodiment
An image processing apparatus 1 according to the second embodiment includes a function of outputting a registration information sheet from a printing device 12 or other output units in response to a user's predetermined operation after the additional-object registration unit 21 registers images of additional objects and processing associated with the additional objects, in addition to functions the same as those of the image processing apparatus 1 of the first embodiment.
FIG. 6 illustrates an example of the registration information sheet. The registration information sheet includes images of additional objects and the positional information of the aforementioned extract areas associated with the images of the additional objects. In addition, the registration information sheet in the second embodiment includes a two-dimensional code 61 (QR code (trademark) in this embodiment) containing the positional information of the extract areas and information about processing associated with the extract areas. The registration information sheet still includes images (extracted images) in the extract areas obtained from the read image of a document.
In the case where the image processing apparatus 1 of the second embodiment is equipped with a key (e.g., a shortcut key) that performs a predetermined function in response to a user's single operation, the key may be assigned to an output operation of the registration information sheet as the function. This allows the user to print out the registration information sheet from a simple apparatus and to see the images (extracted images) in the extract areas obtained from the read image of the document.
The other configurations of the image processing apparatus 1 of the second embodiment are the same as those of the first embodiment, and therefore the explanation will not be reiterated.
Third Embodiment
An image processing apparatus 1 according to the third embodiment enables use of a plurality of transparent sticky notes in different colors as additional objects. In the third embodiment, the read-image processing unit 22 searches for the images of the additional objects by pattern-matching in consideration of the color mixture of the transparent sticky notes.
FIG. 7 illustrates an example of specification of extract areas by transparent sticky notes. In FIG. 7, the transparent sticky notes 71, 72 have different colors from each other. For example, as shown in FIG. 7, if the transparent sticky notes 71 and 72 overlap one another, the color of the overlapped part is a mixed color of the colors of the transparent sticky note 71 and transparent sticky note 72. Thus, the read-image processing unit 22 performs pattern-matching to search the read image for additional objects (transparent sticky notes 71, 72) including the mixed color part.
The other configurations of the image processing apparatus 1 of the third embodiment are the same as those of the first and second embodiments, and therefore the explanation will not be reiterated.
Fourth Embodiment
FIG. 8 illustrates an example of a setting form according to the fourth embodiment.
In the fourth embodiment, the setting form includes a translation specification field used by a user to select whether to use a translation rule to change a plurality of similar character strings in an extract area into a single character string. The translation specification field in the fourth embodiment is check boxes 81 in FIG. 8.
In the image processing apparatus 1 of the fourth embodiment, when the additional-object registration unit 21 detects that the translation rule is specified to be used in a translation specification field, the additional-object registration unit 21 acquires the translation rule, establishes an association between the translation rule and an image of an additional object associated with an extract area, and registers the image of the additional object and the translation rule associated therewith.
If at least one check box 81 is marked in the fourth embodiment, a user operates the image processing apparatus 1 to cause the image reading device 13 to read a condition sheet on which translation rules including the aforementioned translation rule are written and causes the additional-object registration unit 21 to identify the translation rule from the read image of the condition sheet or text information obtained from the read image through character recognition processing.
FIG. 9 illustrates an example of the condition sheet. The condition sheet shown in FIG. 9 includes a statement of conditions (translation rules) associated with the first word and the second word of the file name, respectively. For example, “Condition {ABC Co. Ltd., ABC} ABC” denotes that if the character string obtained by character recognition processing from the image in the extract area is “ABC Co. Ltd.” or “ABC”, “ABC” is used as the second word of the file name.
Then, the read-image processing unit 22 translates the character string obtained from the extract area under the translation rule associated with the extract area and performs the aforementioned processing (e.g., creation of a file name) on the translated character string.
Accordingly, even if there are different character strings of the same meaning, the character strings are replaced with a single character string that is in turn subjected to subsequent processing.
The other configurations of the image processing apparatus 1 of the fourth embodiment are the same as those of the first to third embodiments, and therefore the explanation will not be reiterated.
Although the foregoing embodiments are preferred examples of the present disclosure, it is to be noted that the present disclosure is not limited by the embodiments, and that various modifications and changes can be made without departing from the spirit of the present disclosure.
For example, the additional objects are sticky notes through the first to fourth embodiments; however, the additional objects can be characters or symbols handwritten with ink or graphite of a pen, a pencil or other writing implements.
In addition, the pattern-matching performed in the first to fourth embodiments can detect inclined additional objects, and therefore users are allowed to place the additional objects at an angle.
The present disclosure is applicable to, for example, multifunctional peripherals.