JP4817669B2 - Data processing apparatus, data processing method, and program - Google Patents

Data processing apparatus, data processing method, and program Download PDF

Info

Publication number
JP4817669B2
JP4817669B2 JP2005025613A JP2005025613A JP4817669B2 JP 4817669 B2 JP4817669 B2 JP 4817669B2 JP 2005025613 A JP2005025613 A JP 2005025613A JP 2005025613 A JP2005025613 A JP 2005025613A JP 4817669 B2 JP4817669 B2 JP 4817669B2
Authority
JP
Japan
Prior art keywords
form
additional information
form type
image
step
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2005025613A
Other languages
Japanese (ja)
Other versions
JP2006155551A (en
JP2006155551A5 (en
Inventor
浩太郎 松田
Original Assignee
キヤノン株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to JP2004311080 priority Critical
Priority to JP2004311080 priority
Application filed by キヤノン株式会社 filed Critical キヤノン株式会社
Priority to JP2005025613A priority patent/JP4817669B2/en
Priority claimed from US11/343,997 external-priority patent/US7787158B2/en
Publication of JP2006155551A5 publication Critical patent/JP2006155551A5/ja
Publication of JP2006155551A publication Critical patent/JP2006155551A/en
Application granted granted Critical
Publication of JP4817669B2 publication Critical patent/JP4817669B2/en
Application status is Expired - Fee Related legal-status Critical
Anticipated expiration legal-status Critical

Links

Images

Description

The present invention relates to a data processing apparatus and a data processing method applicable when an image obtained by scanning a paper form by a scanning device or a document received by a facsimile (hereinafter, FAX) device is digitized and processed as an electronic form. And the program.

  Conventionally, as a method of scanning a paper form, digitizing it, and processing it as an electronic form, a method of scanning the form using a scanning device and transmitting the obtained image data to a computer is generally used. .

  As a method of determining which form type the image data obtained from the form is, a method of determining the form type by comparing with a pre-registered form type using image pattern matching or the like Alternatively, there is a method for discriminating the form type by performing barcode recognition from the obtained image data by embedding a barcode in a part of the form in advance. Since the business process performed by the form type differs in the processes after the form type determination, it is important to correctly classify the image data according to the form type.

On the other hand, as a conventional example, a method has been proposed in which feature amounts of form image data are extracted and form similarity is calculated in form recognition (see, for example, Patent Document 1 and Patent Document 2).
JP 2000-285187 A JP 2000-293596 A

  However, in the conventional form type determination method, depending on the scanning quality of the image data, the form type may be misrecognized as another form type, or may be recognized as an unknown form in some cases. There is a problem that the recognition rate is not high.

  In addition, the conventional method that uses barcodes in the form classification method requires time and labor to prepare a new paper form embedded with barcodes on the premise that barcodes will be used. There is a problem that it is not always possible to meet the user's desire to digitize the paper form.

  Furthermore, in the conventional form type discrimination method, the computer receives image data obtained by scanning the form from the scanning device and performs the form type discrimination process, which causes a large processing load on the computer side. There is a point.

An object of the present invention is to reduce the load of the form type determination process in the information management apparatus, enable data to be correctly classified in advance for each form type, and improve the accuracy of specifying the form type A processing device, a data processing method , and a program are provided.

In order to achieve at least one of the above objects, the data processing apparatus of the present invention has determined that there is additional information indicating the form type in the image, and a determination unit that determines whether there is additional information by the determination unit. The form type ID is read from the additional information, and a determination unit that determines whether or not to double check the form type, and the determination unit determines that the form type is not to be double checked. A form recognition processing means for performing a form recognition process based on an image when no recognition processing is performed and it is determined by the determination means that there is no additional information, or when it is determined by the determination means to double check the form type; If the recognition result processed by the form recognition processing unit is different from the additional information, the recognition result processed by the additional information and the form recognition processing unit A character for performing character recognition processing based on the processing means for displaying the mismatch and reselecting the form type, and the additional information, the reselected form type, or the form type recognized by the form recognition means The image processing apparatus includes a recognition processing unit, and a storage unit that stores the image, the additional information or the form type, and a character recognition result by the character recognition processing unit.

According to the present invention, in the data processing apparatus of the transfer destination, the form type determination process is unnecessary, and the load can be reduced. Furthermore, it is basically possible to perform form recognition based on additional information given to the user, perform double check when it is desired to be strict, and perform character recognition by confirming whether the additional information given to the user is correct. .

  Hereinafter, embodiments of the present invention will be described with reference to the drawings.

  FIG. 1 is a schematic diagram showing a configuration example of a network system according to an embodiment of the present invention.

  1, a network system includes a multifunction device (hereinafter abbreviated as MFP) 100, a logical network 101, a server computer 102, a storage device 103, a client personal computer (hereinafter client PC) 104, a telephone / FAX line. 105 and a gateway device 106. The logical network 101 can be connected to an intranet / Internet through a gateway device 106. Although FIG. 1 shows a case where one MFP is installed, a plurality of MFPs can be installed.

  The MFP 100 is a multifunction device having a plurality of functions (image reading function, image forming function, communication function, etc.). Server computer 102 is a computer on which a program for managing communication with MFP 100 or a program used in business processing of a form operates. The storage device 103 is configured as a database system or a file storage system used by the server computer 102. The client PC 104 is a computer connected to the server computer 102 and processed by the user.

  FIG. 2 is a diagram illustrating an appearance of the MFP 100 as the data processing apparatus.

  In FIG. 2, an MFP 100 includes an MFP main body 200, a document scanning unit 201, an automatic document feeder (hereinafter referred to as a feeder) 202, an output finishing unit (hereinafter referred to as a finisher) 203, an output tray unit 204, and a display / touch panel. The unit 205 is provided.

  The MFP main body 200 includes a paper feed cassette that stores paper, a transport unit that transports paper, a printer unit that forms images on paper, and the like. The document scanning unit 201 scans the document fed to the reading position and reads an image. The feeder 202 feeds the document to the reading position. The finisher 203 performs predetermined post-processing on the paper on which the image formation discharged from the MFP main body 200 has been completed. The output tray unit 204 includes a plurality of trays for sorting and stacking sheets that have been post-processed by the finisher 203. The display / touch panel unit 205 includes a display for displaying various screens and a touch panel for performing various settings.

  FIG. 3 is a block diagram showing a logical configuration of the information processing function of MFP 100.

  3, the MFP 100 includes a display / touch panel unit 205, a network interface unit 301, a CPU 302, a ROM 303, a RAM 304, an HDD (Hard Disk Drive) 305, and an input / output interface 306.

  The above units are connected via an input / output interface 306. The display / touch panel unit 205 provides a user interface to the user. The network interface unit 301 communicates with an external device via the network 101. The CPU 302 controls each part of the MFP, and executes the processing on the MFP side shown in FIG. 8 based on the control program stored in the ROM 303. The ROM 303 stores a control program and data. The RAM 304 is used as a temporary storage area or a work area. The HDD 305 has a large capacity storage area and stores various data.

  FIG. 4 is a block diagram showing a logical configuration of an information processing function of a computer (a server computer 102 as an information management apparatus and a client PC 104).

  4, the computer includes an input control unit 400, a display unit 401, a network interface unit 402, a CPU 403, a ROM 404, a RAM 405, an HDD 406, and an input / output interface 407.

  The above units are connected via an input / output interface 407. The input control unit 400 controls a keyboard / mouse that receives input from the user. The display unit 401 provides an output screen to the user. The network interface unit 402 communicates with an external device via the network 101. The CPU 403 controls each part of the computer. When the computer is the server computer 102, the CPU 403 executes processing on the server computer side shown in FIG. 8 based on a control program stored in the ROM 404. The ROM 404 stores a control program and data. The RAM 405 is used as a temporary storage area or a work area. The HDD 406 has a large-capacity storage area and stores various data.

  FIG. 5 is a diagram illustrating a configuration of software capable of executing an application program (hereinafter, application) that can be customized in the MFP 100.

  In FIG. 5, reference numeral 501 denotes an example of an execution environment for controlling the MFP 100. In general, each module of the real-time OS that can control various functions of the MFP 100 in real time, or the CPU 302 of the MFP 100 is instructed critically. This is a library group that can control each function including optional devices and expansion cards. The execution environment (real-time OS) 501 is realized by a module group that provides an interface command to an application that operates on the higher level.

  Reference numeral 502 denotes a controller control unit that operates on the execution environment 501, and includes a module that controls the document scanning unit 201, the printer unit, the modem communication unit, the PDL (Page Description Language) development unit, and the like of the MFP 100. Reference numeral 503 denotes an application programming interface (hereinafter referred to as API), a function for accessing the controller control unit 502 by the API 503 in response to a command sequence when a command is input from the application, and a network interface unit. 301 has a function of controlling.

  Reference numeral 504 denotes an optimum execution environment for executing a specific application. For example, the execution environment 504 is realized by a Java (registered trademark) virtual machine, and is executed without depending on the OS or the device environment. Is possible. A resource management unit 505 manages resources used by the execution environment 504 and operates on the real-time OS 501. The resource management unit 505 restricts the virtual machine that implements the execution environment 504 itself, the API 503, or all applications on the real-time OS 501 from using a resource resource such as a memory so that it cannot use more than a predetermined resource. . Reference numerals 506, 507, and 508 are examples of application groups that can operate in the MFP 100.

  FIG. 6 is a diagram illustrating an example of an application UI (User Interface) 600.

  In FIG. 6, the UI 600 can be customized and is displayed on the display / touch panel unit 205 of the MFP 100. This application is one or more applications that operate in the execution environment 504, such as the application 506 in FIG. When the execution environment 504 is, for example, a Java (registered trademark) virtual machine, this application is provided by a Java (registered trademark) program.

  This application can access various resources of the display / touch panel unit 205 to the HDD 305 of the MFP 100, and gives a control command to the controller control unit 502 via the API 503, whereby the document scanning unit 201, the printer unit, and the modem communication of the MFP 100. Control unit, PDL development unit, and the like. In addition, this application can communicate with the server computer 102 via the network interface unit 301.

  The UI 600 shows an example in which this application is customized in accordance with the business content of the financial industry, for example, “◯ △ × Financial Company”. 601 is a name / icon display tab of the application, 602 is a logo image of the user company, 603 is a title of the operation screen (example: business menu), and 604 is a message to be displayed to the user (example: “I want to process” Please select a business menu according to the form type. ”), 605 is a business process selection button (button control) that allows the user to select a desired business process (eg, credit, personal loan, savings, various reports). ).

  FIG. 7 is a diagram illustrating an example of a UI configuration template 700 of an application.

  In FIG. 7, 701 is an application name / icon display area, 702 is a user-defined logo image area, 703 is a title area, 704 is a message display area, and 705 is a menu selection area. The application installed in the MFP 100 appropriately downloads components (configuration file and configuration content) constituting the UI from the server computer 102 according to the business content of the user's form processing, and follows the settings of the UI configuration template. By arranging these parts, the UI configuration can be changed.

  FIG. 8 is a diagram illustrating communication processing when updating the UI configuration of the application between the MFP 100 and the server computer 102.

  In FIG. 8, after the application is installed in MFP 100, server computer 102 automatically detects MFP 100 on the network only for the first time, or the operator manually detects MFP 100 on the server computer 102. CPU 403 of server computer 102 registers the host address of server computer 102 in the application of MFP 100 (step S801). This operation may be performed via the display unit 401 of the client PC 104 or the display / touch panel unit 205 of the MFP 100.

  The application of MFP 100 designates the registered host address and transmits a configuration file download request to server computer 102 (step S802). The CPU 403 of the server computer 102 sends the configuration file to the MFP 100 based on the download request (step S803).

  FIG. 9 is a diagram illustrating an example in which a part of the configuration file is described in XML (Extensible Markup Language).

  9, the configuration file includes area information (position / size), which is configuration information of the UI configuration template 700 of the application of the MFP 100, area attribute information (character string, image, control type), and each area of FIG. A list of character strings, images, control information (button control arrangement, menu hierarchy expression) to be displayed in 701 to 705, form type information (form type ID, form name, number of form pages) handled in this application; Stores scan setting information (single side / double side designation, resolution, color / monochrome type, paper size, etc.) corresponding to the form type.

  In the description example shown in FIG. 9, in the MenuItem element, the menu name “personal loan” as the Name element, the display message “Individual loan application form (hereinafter abbreviated)” as the Form element, and the form as the FormId element The type ID “loan_xxx_yyy_001” is stored. These pieces of information are downloaded from the server computer 102 to the MFP 100, analyzed by the application of the MFP 100, and used for a menu configuration of a “personal loan” part of the UI configuration shown in FIG. 6.

  Returning to FIG. 8, for an image file or the like that is not stored in the configuration file, the application of the MFP 100 transmits a download request for the configuration content corresponding to the content ID information described in the configuration file to the server computer 102 in advance. (Step S804), it is possible to individually download the configuration content from the server computer 102 to the MFP 100 (Step S805). The application of the MFP 100 ends the download of the configuration file and the configuration content from the server computer 102, confirms that the application is in a processing standby (idle) state, and then updates the UI (step S806).

  Here, the configuration file and the configuration content are downloaded from the server computer 102 to the MFP 100 by a communication method using a Web service. In the present embodiment, the configuration file is described on the assumption that it is XML. However, in practice, it can be implemented in any format such as a text file, a binary file, and a data stream. Keep it.

  Next, processing when changing the UI configuration of the application of MFP 100 at an arbitrary time will be described.

  In FIG. 8, the user performs a UI configuration change operation for an application of a certain MFP on the server computer 102, and transmits a configuration file update notification from the server computer 102 to the MFP (step S807). The MFP application downloads the configuration file and the configuration content from the server computer 102 in the same manner as in steps S802 to S805.

  Here, in the above configuration file, configuration change time designation is described. The MFP application caches new configuration information, and when the designated time is reached by a method such as timer monitoring, confirms that the application is in a processing waiting (idle) state, and then updates the UI. (Step S809). In the configuration change time designation, an immediate update can be designated based on an instruction from the server computer 102, and the MFP side configuration change can be performed at an arbitrary timing.

  Next, a form scanning process in which a paper form is scanned, digitized, and processed as an electronic form will be described.

  In FIG. 8, the user sets a form document in the feeder unit 202 of the MFP 100, operates the UI of the application of the MFP 100 via the display / touch panel unit 205, and selects, for example, “Personal” from the business process selection button (button control) 605. “Loan” is selected, and the scan of the document is started by operating the button control on the UI or the hardware button of the MFP 100 (step S810). The form document is read by the document scanning unit 201 of the MFP 100 and is generated as an image file on the HDD 305. The application of MFP 100 attaches the additional information file to the image file and uploads it to server computer 102 (step S811).

  Here, the image file to which the additional information file is attached is transmitted from MFP 100 to server computer 102 by a communication method using a Web service. In addition, depending on the contents of business processing, customer information may be included in the form image. Therefore, in this embodiment, encrypted communication represented by SSL (Secure Sockets Layer) can be performed from the viewpoint of security. is there.

  FIG. 15 is a diagram illustrating an example in which the additional information file 1500 to be uploaded is expressed in XML.

  In FIG. 15, the name of the user who performed the scanning operation in the UserName element, the date and time of scanning in the ScannedDateTime element, the number of image files generated as a result of scanning in the NumberOfFiles element, and the form name selected in the FormName element are in the FormId element. Each selected form type ID is stored. Each element below ScanSettings stores the setting items for scanning. The above processing may be performed by a scanner driver or an image processing program installed in the MFP 100.

  The menu “personal loan” selected by the user in the UI shown in FIG. 6 and the form type ID “loan_xxx_yyy_001” associated with the menu “personal loan” as described in the example shown in FIG. Stored as shown. Further, in this additional information file, it is possible to easily expand a necessary data field between the application of the MFP 100 and the server computer 102, and to upload arbitrary extension data. deep. In this embodiment, the description is made assuming that the format of the additional information file is XML. However, in practice, it can be implemented in an arbitrary format such as a text file, a binary file, or a data stream. Let me supplement.

  18 and 19 are flowcharts showing a flow of processing of an image file and an additional information file received from the MFP 100 by a program operating on the server computer 102.

  18 and 19, when the program operating on the server computer 102 accepts the upload of the image file of the form scanned by the MFP 100, the form type ID is obtained by analyzing the additional information file attached to the image file. It will be described that it is possible to classify image files in advance for each form type and select an appropriate process.

  The CPU 403 of the server computer 102 monitors the generation of the image file and the additional information file on the file system by a program operating on the server computer 102 (step S1801). The program has a form recognition processing function for image data and an information analysis function for an additional information file. When the CPU 403 described in detail below detects the generation of the image file and the additional information file, it checks whether there is an additional information file in the folder of the file system (step S1802). If there is no additional information file, the CPU 403 acquires only the image file, and proceeds to the form recognition process in step S1808 to detect the form type based on the data of the image file.

  If there is an additional information file, the CPU 403 identifies the file format of the additional information file (step S1803). If it is assumed that the additional information file has a file format as shown in FIG. 15 (file format A), the CPU 403 reads the form type ID from the additional information file (step S1804).

  The CPU 403 determines whether or not to double check the form type by detecting the form type based on the form recognition on the server computer 102 in addition to the form type extracted in step S1804 (step S1807). When performing the double check of the form type, the CPU 403 causes the form recognition application to execute a predetermined form recognition logic, and detects the form type by the form recognition process (step S1808). The CPU 403 determines whether or not the form type detected in step S1804 matches the form type detected by the form recognition process in step S1808 (step S1809). If both form types do not match, the CPU 403 notifies the operator that the detection results do not match via the display unit 401. Accordingly, the operator selects a form type (step S1810).

  Since the form type is determined by the processing so far, the CPU 403 distributes the process according to the form type (step S1901). Since the area setting for character recognition differs depending on the form type, the CPU 403 performs the character recognition process for each form type (step S1902). When the character recognition process is completed for each form type, the CPU 403 stores the data handled in a series of processes, such as original image data, additional information, form type ID, and character recognition result, in the storage device 103 for each form type. (Step S1903).

  After saving the data, the CPU 403 determines whether or not the original file detected in step S1801 may be deleted (step S1904). When deleting the original file, the original file and directory are deleted from the file system ( Step S1905). Thereafter, the computerized form by the above process is taken over to the subsequent process according to the business content of the form process.

  FIG. 10 is a diagram illustrating an example in which the content of data held in the storage device (database system or file storage system) 103 used by the server computer 102 is expressed in a table format.

  In FIG. 10, the server computer 102 manages the storage table 1000 in the storage device 103 by storing the correspondence relationship between the form type ID 1001, form name 1002, and form file 1003. By using the storage table 1000, it is possible to search for the form name indicated by the form type ID and the original file of the form.

  When the image file of the form classified by the registered form type ID shown in FIG. 10 is uploaded from the MFP 100, the server computer 102, as described with reference to FIGS. Subsequent form processing operations can be started. The appropriate form processing operations described here are the start of a workflow (a mechanism for making a business flow a rule and performing business processing and business support on a computer), sending an e-mail attached with a form, and a form to a core business system. This is a process that can be appropriately defined according to the business content of the user's form processing, such as data transmission.

  Next, a method for enabling the above-described UI configuration to be edited in the server computer 102 by an operation from the client PC 104 will be described.

  11 to 13 are diagrams showing screens for editing the application UI configuration.

  In FIG. 11, reference numeral 1100 denotes an example of a UI configuration editing screen displayed on the display unit 401 of the client PC 104. When the user wants to create a new UI configuration on the UI configuration editing screen 1100, the user can name the configuration alias 1101. In the UI configuration editing screen 1100, the button name 1102 is designated as “personal loan”, and the form type 1103 processed when the corresponding button is operated on the UI is designated as “personal loan application”. An example is shown. In the figure, 1105 is a save button, and 1106 is a cancel button.

  By referring to the storage table 1000 of the storage device 103 of the server computer 102, the “personal loan application form” is associated with the form type ID “loan_xxx_yyy_001”, and the FormName element below the ScanForm element of the XML data in FIG. The data value of “loan_xxx_yyy_001” is set in the “personal loan application form” and FormId element. Similarly, the display message 1104 edited on the UI configuration editing screen 1100 is set in the Message element of the XML data.

  Similarly, on the UI configuration editing screen 1200 in FIG. 12 and the UI configuration editing screen 1300 in FIG. 13, it is possible to specify scan setting information or an image file according to the form type. Thereafter, the configuration file and the configuration content are transmitted from the client PC 104 to the server computer 102 and stored in the server computer 102.

  As described above, the configuration file and the configuration content stored in the server computer 102 are stored in the server when there is a download request from the MFP 100 as shown in steps S801 to S805 of FIG. 8, and similarly in steps S807 to S808. It is transmitted from the computer 102 to the MFP 100.

  In the UI configuration editing screen 1200 of FIG. 12, a print form file can be specified. In the figure, 1201 is a configuration alias, 1202 is a form type, 1203 is a scan setting (double-sided / single-sided designation, resolution, color / monochrome type), 1204 is the number of pages, 1205 is information on a print form file, 1206 is a save button, 1207 Is a cancel button.

  This screen is an example of a configuration for making it possible to immediately print out an arbitrary form corresponding to the selected business content via a customized UI using the characteristics of the MFP 100. For example, a user who has selected the “personal loan” business process on the UI wants to print out an unfilled form on the personal loan application form or print out a handbook on how to write the personal loan application form. In this case, it is possible to immediately print out via the UI.

  In the UI configuration editing screen 1300 in FIG. 13, a logo image file corresponding to a logo image displayed in the user-defined logo image area of the UI and an icon image file corresponding to the icon image displayed in the icon display area of the UI can be designated. Is possible. In the figure, 1301 is a configuration alias, 1302 is logo image file information, 1303 is icon image file information, 1304 is a save button, and 1305 is a cancel button.

  Next, a description will be given of a method in which each MFP has a different UI configuration when a plurality of MFPs are connected to the network in the network system of FIG.

  FIG. 14 is a diagram illustrating an example in which the content of data held in the storage device 103 used by the server computer 102 is expressed in a table format.

  In FIG. 14, the server computer 102 manages the storage table 1400 in the storage device 103 by storing the correspondence relationship between the machine ID 1401, MFP machine alias 1402, and configuration alias ID 1403 assigned to each MFP. Yes. By managing the storage table 1400 on the server computer 102 side, since the machine ID is transmitted from the MFP to the server computer 102 in step S802 of FIG. 8 and also in step S808, the server computer 102 searches the storage table 1400. The configuration file and the configuration content corresponding to the corresponding configuration alias are individually transmitted to each MFP.

  Thus, each MFP can configure different UIs by using different configuration aliases or by using the same configuration alias.

  As described above, according to the present embodiment, the UI configuration of MFP 100 can be arbitrarily changed by downloading UI components from server computer 102 in accordance with the business contents of the user's form processing. . Further, when the user scans a form with the MFP, the form type can be selected via the UI of the MFP, and the MFP uploads an additional information file including the form type ID to the server computer 102 together with the form image file. As a result, the server computer 102 that accepts the upload of the form image file does not require the form type determination process on the server computer side, and the load can be reduced. When a plurality of MFPs are connected to the network, each MFP can configure a different UI.

  If the additional information file is attached to the image file received from the MFP 100, the server computer 102 reads out the corresponding form type ID according to the format of the additional information file, and determines whether to double check the form type. Determine whether. When performing a double check of the form type, the form type is detected by the form recognition process, whether or not the detected form type and the form type corresponding to the additional information file match is determined, and according to the specified form type Perform processing distribution. Thereby, it is possible to improve the accuracy of specifying the form type by double checking the form type. Furthermore, it becomes possible to correctly classify image data in advance for each form type based on the additional information, and it is possible to quickly start subsequent business processing.

[Other embodiments]
In the above embodiment, the case where the network system has the configuration shown in FIG. 1 has been described as an example. However, the present invention is not limited to this, and the number of apparatuses (MFPs, client PCs) connected to the network is not limited thereto. Can be any number. In addition to the MFP, a scanner, a printer, or the like may be connected to the network. Further, the network can be installed in any form (LAN, WAN, Internet, etc.).

  In the above embodiment, the case where the UI of the MFP 100 is customized in accordance with the business contents of the financial industry has been described as an example. However, the present invention is not limited to this and is applied to various industrial fields other than the financial industry. Is possible.

  In the above embodiment, the UI of MFP 100 can be customized, and the scanned form is processed as an electronic form. However, the present invention is not limited to this, and MFP 100 receives a facsimile from an external device. The present invention can also be applied when processing a document (hereinafter referred to as a FAX document) as an electronic form.

  Hereinafter, as another embodiment, an example will be described in which the MFP 100 processes a FAX document received from an external device as an electronic form.

  In FIG. 1, when receiving an ordinary FAX document from an external device, the MFP 100 receives the FAX document via the telephone / FAX line 105. Further, when the MFP 100 receives a FAX document (Internet FAX document) from an external device via the Internet, the MFP 100 receives the FAX document via the gateway device 106 and the network 101.

  FIG. 16 is a diagram showing communication processing when uploading a file between the MFP 100 and the server computer 102.

  In FIG. 16, when the CPU 302 of the MFP 100 receives a FAX document by the network interface unit 301 (step S1601), the received FAX document is converted into an image file, and information such as a destination FAX number and a FAX reception time is stored in the image file. The file is attached as an additional information file and uploaded to the server computer 102 (step S1602).

  FIG. 20 is a diagram illustrating an example in which the additional information file 2000 is expressed in XML.

  In FIG. 20, information such as the transmission source FAX number is stored in the From element, the destination FAX number is stored in the To element, the date and time when the FAX document is received in the ReceivedDateTime element, and the number of image files generated in the NumberOfFiles element.

  In FIG. 18 and FIG. 19, the CPU 403 of the server computer 102 proceeds with the processing from step S1801 to step S1803 in the same manner as described above according to the program running on the server computer 102. In the file format B), the destination FAX number is read out from the additional information file (step S1805).

  In the case of a FAX document, for example, “03-5555-6666” is assigned to the application reception FAX number of the “credit card application form”, and “03-5555-6667” is assigned to the “loan application form” form. As shown in FIG. 4, a correspondence table 1700 of form type IDs of FAX documents that reach the destination FAX number is prepared. The CPU 403 can detect the form type of the received image file by searching the correspondence table 1700 (step S1806). Thereafter, the processing proceeds from step S1807 to step S1810 and from step S1901 to step S1905 in the same manner as described above.

  The above is the description of another embodiment using the FAX document reception process as an example. As a result, by expanding the format of the additional information file, it is possible to acquire information for specifying the form type from various devices.

  In each of the above embodiments, the form recognition function of the program stored in the server device, which is an example of the data processing apparatus, causes the CPU to execute a predetermined form recognition logic, thereby recognizing the form type, and the form document A form recognition process is performed on the image data file generated by reading. At this time, the form recognition process processed by the form recognition function of the program is controlled using an additional information file which is an example of the control information acquired by the program.

  Further, the additional information file acquired by the program may be defined corresponding to the function of the image forming apparatus, such as a facsimile that reads an image data file from a paper form or a multifunction peripheral having a form recognition function.

  The additional information file may be information that can specify whether or not a predetermined form recognition process has been performed before the program acquires the image data.

  If the additional information file indicates that a predetermined form recognition process has been performed on, for example, a multifunction machine before reception, the form recognition process is skipped and the form recognition process for image data is skipped. You may control to omit.

  Furthermore, even if the additional information file indicates that a predetermined form recognition process has been performed in advance, the form recognition logic is activated and displayed so as to control the re-recognition process. When there is an input to enable the re-recognition processing function by the input to the input control unit 400 generated by the user operating the mouse when viewing the input screen displayed on the display unit 401 of the apparatus, The form recognition process may be performed on the data that has been subjected to the form recognition process.

  When the identifier used for communication of the additional information file is, for example, a fax number or a telephone number, the program may omit the form recognition using the form recognition logic. The form corresponding to the image data file may be recognized as a form corresponding to the fax number or telephone number, and the form distribution process may be automatically performed.

  The above is the feature of the server device side.

  As the data sending side, an image processing program that reads a form document and generates image data, a network interface unit that transfers the generated image data, and an additional information file that controls form recognition processing at the transfer destination of the image data are stored The multi-function device 101 has been described.

  In addition, when a user reads a form with the reading means of the data processing apparatus, the form type can be selected via the user interface of the data processing apparatus, and the form type information is displayed in the form image file from the data processing apparatus to the information management apparatus. Add and send. As a result, in the information management apparatus that accepts the transmission of the form image file, the form type determination process is unnecessary, and the load can be reduced. Further, when a plurality of data processing devices are connected to a network, each data processing device can configure a different user interface.

  In addition, according to each of the above embodiments, when performing the collation process for collating the form type acquired based on the additional information attached to the image file and the form type by the form recognition process, whether the form type matches And the process is distributed according to the specified form type. Thereby, it is possible to improve the accuracy of specifying the form type by double checking the form type. Furthermore, it becomes possible to correctly classify image data in advance for each form type based on the additional information, and it is possible to quickly start subsequent business processing.

  In addition, an object of the present invention is to supply a storage medium storing software program codes for realizing the functions of the embodiments to a system or apparatus, and a computer (or CPU, MPU, etc.) of the system or apparatus as a storage medium. This can also be achieved by reading and executing the stored program code.

  In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiments, and the program code and the storage medium storing the program code constitute the present invention.

  Examples of the storage medium for supplying the program code include a floppy (registered trademark) disk, a hard disk, a magneto-optical disk, a CD-ROM, a CD-R, a CD-RW, a DVD-ROM, a DVD-RAM, and a DVD. -RW, DVD + RW, magnetic tape, nonvolatile memory card, ROM, etc. can be used. Alternatively, the program code may be downloaded via a network.

  Further, by executing the program code read by the computer, not only the functions of the above-described embodiments are realized, but also an OS (operating system) running on the computer based on the instruction of the program code. A case where part or all of the actual processing is performed and the functions of the above-described embodiments are realized by the processing is also included.

  Further, after the program code read from the storage medium is written in a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, the function expansion is performed based on the instruction of the program code. This includes the case where the CPU or the like provided in the board or function expansion unit performs part or all of the actual processing, and the functions of the above-described embodiments are realized by the processing.

  Further, by executing the program code read out by the computer, not only the functions of the above-described embodiments are realized, but also the OS running on the computer based on the instruction of the program code is actually Needless to say, the present invention also includes a case in which the functions of the above-described embodiments are realized by performing part or all of the processing and the processing.

  In this case, the program is supplied by downloading directly from a storage medium storing the program or from another computer or database (not shown) connected to the Internet, a commercial network, a local area network, or the like.

  The form of the program may be in the form of object code, program code executed by an interpreter, script data supplied to an OS (operating system), and the like.

It is the schematic which shows the structural example of the network system which concerns on embodiment of this invention. 1 is a diagram illustrating an appearance of an MFP as a data processing apparatus. 2 is a block diagram illustrating a logical configuration of an information processing function of the MFP. FIG. It is a block diagram which shows the logical structure of the information processing function of a computer (a server computer, client PC). FIG. 2 is a diagram illustrating a configuration of software capable of executing an application that can be customized in an MFP. It is a figure which shows an example of application UI. It is a figure which shows an example of UI structure template of an application. FIG. 4 is a diagram illustrating communication processing when updating the UI configuration of an application between an MFP and a server computer. It is a figure which shows the example which described a part of structure file by XML. It is a figure which shows the example which expressed the content of the data hold | maintained at the memory | storage device which a server computer utilizes in the table format. It is a figure which shows the screen which edits application UI structure. It is a figure which shows the screen which edits application UI structure. It is a figure which shows the screen which edits application UI structure. It is a figure which shows the example which expressed the content of the data hold | maintained at the memory | storage device which a server computer utilizes in the table format. It is a figure which shows the example which described a part of additional information file by XML. It is a figure which shows the communication process at the time of uploading the file between MFP and a server computer. It is a figure which shows the example which expressed the content of the data hold | maintained at the memory | storage device which a server computer utilizes in the table format. 6 is a flowchart showing a flow of processing an image file and an additional information file received from an MFP by a program operating on a server computer. It is a continuation of the flowchart of FIG. It is a figure which shows the example which expressed the additional information file by XML.

Explanation of symbols

100 MFP
102 Server Computer 201 Document Scanning Unit 205 Display / Touch Panel Unit 301 Network Interface Unit 302 CPU
403 CPU

Claims (9)

  1. A discriminating means for discriminating whether or not the image has additional information indicating the form type;
    A determination unit that reads out the form type ID from the additional information when the determination unit determines that there is additional information, and determines whether to double check the form type;
    If it is determined that the form type is not double-checked by the determination unit, the form recognition process is not performed based on the image, and if the determination unit determines that there is no additional information, or the form type is determined by the determination unit. A form recognition processing means for performing a form recognition process based on the image,
    When the additional information and the recognition result processed by the form recognition processing means are different, a process for displaying that the additional information and the recognition result processed by the form recognition processing means do not match, and reselecting the form type Means,
    Character recognition processing means for performing character recognition processing based on the additional information, the reselected form type, or the form type recognized by the form recognition means;
    A data processing apparatus comprising: a storage unit that stores the image, the additional information or the form type, and a character recognition result by the character recognition processing unit.
  2. The image includes an image read by an image reading device,
    The data processing apparatus according to claim 1, wherein the additional information includes information of a form type designated by the user in the image reading apparatus.
  3.   2. The data processing apparatus according to claim 1, further comprising a reading unit that reads a destination FAX number and reads a form type assigned to the read FAX number when the additional information is additional information of a FAX document. .
  4. A determination step of determining whether or not the image has additional information indicating the form type;
    If it is determined that there is additional information in the determination step, a determination step of reading out the form type ID from the additional information and determining whether to double check the form type;
    If the determination step determines that the form type is not double-checked, the form recognition process is not performed based on the image, and the determination step determines that there is no additional information, or the determination step determines that the form type is two. If it is determined to double check, a form recognition processing step for performing a form recognition process based on the image;
    When the recognition result processed by the form recognition processing step is different from the additional information, a process for displaying that the recognition result processed by the additional information and the form recognition processing step is inconsistent and reselecting the form type Steps,
    A character recognition processing step for performing character recognition processing based on the additional information, the reselected form type, or the form type recognized by the form recognition step;
    A data processing method comprising: storing the image, the additional information or the form type, and a character recognition result obtained by the character recognition processing step.
  5. The image includes an image read by an image reading device,
    5. The data processing method according to claim 4, wherein the additional information includes information on a form type designated by a user in the image reading apparatus.
  6.   5. The data processing method according to claim 4, further comprising a reading step of reading out a destination FAX number and reading out a form type assigned to the read FAX number when the additional information is additional information of a FAX document. .
  7. A determination step of determining whether or not the image has additional information indicating the form type;
    If it is determined that there is additional information in the determination step, a determination step of reading out the form type ID from the additional information and determining whether to double check the form type;
    If the determination step determines that the form type is not double-checked, the form recognition process is not performed based on the image, and the determination step determines that there is no additional information, or the determination step determines that the form type is two. If it is determined to double check, a form recognition processing step for performing a form recognition process based on the image;
    When the recognition result processed by the form recognition processing step is different from the additional information, a process for displaying that the recognition result processed by the additional information and the form recognition processing step is inconsistent and reselecting the form type Steps,
    A character recognition processing step for performing character recognition processing based on the additional information, the reselected form type, or the form type recognized by the form recognition step;
    A program for causing a computer to execute the image, the additional information or the form type, and a storing step for storing a character recognition result obtained by the character recognition processing step.
  8. The image includes an image read by an image reading device,
    8. The program according to claim 7, wherein the additional information includes information on a form type designated by a user in the image reading apparatus.
  9.   8. The computer according to claim 7, wherein when the additional information is additional information of a FAX document, the computer further executes a reading step of reading a destination FAX number and reading a form type assigned to the read FAX number. program.
JP2005025613A 2004-10-26 2005-02-01 Data processing apparatus, data processing method, and program Expired - Fee Related JP4817669B2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2004311080 2004-10-26
JP2004311080 2004-10-26
JP2005025613A JP4817669B2 (en) 2004-10-26 2005-02-01 Data processing apparatus, data processing method, and program

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2005025613A JP4817669B2 (en) 2004-10-26 2005-02-01 Data processing apparatus, data processing method, and program
US11/343,997 US7787158B2 (en) 2005-02-01 2006-01-31 Data processing apparatus, image processing apparatus, data processing method, image processing method, and programs for implementing the methods
EP06250555A EP1686784A1 (en) 2005-02-01 2006-02-01 Data processing apparatus, image processing apparatus, data processing method, image processing method and programs for implementing the methods
CN 200610002812 CN100421121C (en) 2005-02-01 2006-02-05 Data processing apparatus, image processing apparatus, data processing method, image processing method

Publications (3)

Publication Number Publication Date
JP2006155551A5 JP2006155551A5 (en) 2006-06-15
JP2006155551A JP2006155551A (en) 2006-06-15
JP4817669B2 true JP4817669B2 (en) 2011-11-16

Family

ID=36633725

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005025613A Expired - Fee Related JP4817669B2 (en) 2004-10-26 2005-02-01 Data processing apparatus, data processing method, and program

Country Status (1)

Country Link
JP (1) JP4817669B2 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4916910B2 (en) * 2007-02-20 2012-04-18 リコーItソリューションズ株式会社 Workflow processing system and workflow processing method
JP2008293221A (en) 2007-05-23 2008-12-04 Ricoh Co Ltd Document processor and processing method
JP4755166B2 (en) * 2007-12-03 2011-08-24 キヤノンマーケティングジャパン株式会社 Printing system, image forming apparatus, server apparatus, printing method, and program

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6154575A (en) * 1984-08-24 1986-03-18 Fujitsu Ltd Character reader
JPH10105654A (en) * 1996-09-30 1998-04-24 Aibitsukusu Kk Character recognition device for form
JPH118746A (en) * 1997-06-17 1999-01-12 Oki Electric Ind Co Ltd Format identification method and character recognition device
JP2000251012A (en) * 1999-03-01 2000-09-14 Hitachi Ltd Method and system for document processing
JP4416890B2 (en) * 1999-12-20 2010-02-17 富士通フロンテック株式会社 Form identification device
JP2003331216A (en) * 2002-05-16 2003-11-21 Oki Electric Ind Co Ltd Business form reading method

Also Published As

Publication number Publication date
JP2006155551A (en) 2006-06-15

Similar Documents

Publication Publication Date Title
JP3771923B2 (en) Digital computing device and an external device
JP4033857B2 (en) Print system, print management server, print method, print management method, and program
US7382479B2 (en) Linking information making apparatus and recording medium
US6480304B1 (en) Scanning system and method
KR100420424B1 (en) Fully modular multifunction device
US7496233B2 (en) Service processing apparatus and service processing method
US8326090B2 (en) Search apparatus and search method
US6965958B1 (en) Searching for printers over a network using intermediate print data
US20060158675A1 (en) Method of and apparatus for image formation, and computer product
US20070081186A1 (en) Image forming apparatus and method for controlling image forming apparatus
US6208436B1 (en) Use of a header page to provide scan control information for a scan
US8179542B2 (en) Information processing device, image forming apparatus, and process control system
US8339645B2 (en) Managing apparatus, image processing apparatus, and processing method for the same, wherein a first user stores a temporary object having attribute information specified but not partial-area data, at a later time an object is received from a second user that includes both partial-area data and attribute information, the storage unit is searched for the temporary object that matches attribute information of the received object, and the first user is notified in response to a match
JP5163035B2 (en) Document management system, apparatus, and method
US7475336B2 (en) Document information processing apparatus and document information processing program
US8201072B2 (en) Image forming apparatus, electronic mail delivery server, and information processing apparatus
JP5301627B2 (en) Print control system, print control server, image forming apparatus, processing method thereof, and program
JP4115474B2 (en) Image forming apparatus, image forming apparatus control method, image forming system, program, and storage medium
JP4581404B2 (en) Information processing apparatus and information processing program
KR20010098567A (en) Fully modular multifunction device
US20060136488A1 (en) Image forming system, image forming apparatus, information processing apparatus, and recording medium having recorded control program
KR100832955B1 (en) Apparatus for displaying user interface in accordance with user's intention, method therefor and storage medium
CN103946853A (en) Image forming apparatus, server apparatus, and information processing apparatus
US8341633B2 (en) Identifying executable process contents of flow executor(s) in flow generation system
JP4265249B2 (en) Service processing apparatus, service processing method, and program

Legal Events

Date Code Title Description
RD03 Notification of appointment of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7423

Effective date: 20060420

RD05 Notification of revocation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7425

Effective date: 20070626

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20080129

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20080129

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20100610

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20100629

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100827

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20110412

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20110608

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20110822

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20110830

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140909

Year of fee payment: 3

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140909

Year of fee payment: 3

LAPS Cancellation because of no payment of annual fees