JP6470071B2

JP6470071B2 - Image processing device

Info

Publication number: JP6470071B2
Application number: JP2015044761A
Authority: JP
Inventors: 大樹八島; 大輔向山
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2015-03-06
Filing date: 2015-03-06
Publication date: 2019-02-13
Anticipated expiration: 2035-03-06
Also published as: JP2016165059A

Description

本発明は、画像処理装置に関し、特に、原稿の画像を読取る画像読取り機能を持つ画像処理装置に関する。 The present invention relates to an image processing apparatus, and more particularly to an image processing apparatus having an image reading function for reading an image of a document.

画像処理装置の１種として、多くの事業所（会社、事務所等）に画像形成装置（代表的にはコピー機）が導入されている。このような画像形成装置の１つである複合機（ＭＦＰ：ＭｕｌｔｉｆｕｎｃｔｉｏｎＰｅｒｉｐｈｅｒａｌ）のように、コピーモード、ネットワーク対応のプリントモード、スキャナモード、及びファクシミリモードのような複数の動作モードを有するものも多くなってきている。 As one type of image processing apparatus, an image forming apparatus (typically a copier) is introduced in many offices (company, office, etc.). Many of these image forming apparatuses, such as multifunction peripherals (MFPs), have a plurality of operation modes such as a copy mode, a network-compatible print mode, a scanner mode, and a facsimile mode. It has become to.

画像処理装置は、原稿の画像を原稿画像データとして読取る画像読取り機能を持つ。こうした画像処理装置のなかには、読取った原稿画像データをファイルとして記憶したり、記憶したファイルを外部機器に送信したりする機能を持つものがある。原稿画像データの記憶時には当該原稿画像データにファイル名が付される。ファイル名はユーザによって手動で入力されることが多い。 The image processing apparatus has an image reading function for reading an image of a document as document image data. Some of such image processing apparatuses have a function of storing the read document image data as a file and transmitting the stored file to an external device. When document image data is stored, a file name is assigned to the document image data. File names are often entered manually by the user.

また従来、画像読取り時の日時と画像処理装置の機種名等（「日時＋マシン名等のマシン固有情報」）とを組合わせたファイル名を自動で生成し、当該ファイル名で原稿画像データを記憶する画像処理装置も知られている。しかし、日時と機種名等との組合せからなるファイル名では、ファイル名からファイルの内容を把握するのが困難である。ファイルの内容を把握するためにファイルを開く必要があり、こうした作業に手間がかかる。さらに、こうしたファイル名が付されたファイルを外部機器に送信した場合、そのファイルを受信したユーザは、ファイルを開くまでファイルの内容を把握できない。 Conventionally, a file name is automatically generated by combining the date and time when reading an image and the model name of the image processing apparatus (“date and time + machine specific information such as machine name”), and the original image data is generated with the file name. An image processing apparatus for storing is also known. However, it is difficult to grasp the contents of a file from the file name with a file name composed of a combination of date and time and model name. It is necessary to open the file in order to grasp the contents of the file, and this work is troublesome. Furthermore, when a file with such a file name is transmitted to an external device, the user who has received the file cannot grasp the contents of the file until the file is opened.

こうした問題に対して、後掲の特許文献１は、読取った原稿画像データに対して当該原稿画像データの内容を考慮したファイル名を自動で設定する画像形成装置を開示する。この画像形成装置は、読取った原稿画像データに対してＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅｃｏｇｎｉｔｉｏｎ）処理を行ない、当該原稿画像データの内容を象徴する、比較的大きな文字列、赤色の文字列、特別なフォントの文字列、タイトル領域の文字列、複数回出現する文字列を抽出する。画像形成装置はさらに、抽出した文字列を組合わせた文字列を原稿画像データのファイル名に設定して記憶部に記憶する。 In order to solve such a problem, Japanese Patent Application Laid-Open No. 2004-228561 discloses an image forming apparatus that automatically sets a file name in consideration of the content of the original image data for the read original image data. This image forming apparatus performs OCR (Optical Character Recognition) processing on the read original image data, and relatively large character strings, red character strings, special font characters that symbolize the contents of the original image data. Extract strings, character strings in the title area, and character strings that appear multiple times. Further, the image forming apparatus sets a character string obtained by combining the extracted character strings as a file name of the document image data and stores it in the storage unit.

特開２０１１−１５５５４８号公報JP 2011-155548 A

特許文献１の画像形成装置は、読取った原稿画像データに対してＯＣＲ処理を行なうことで、当該原稿画像データの内容を象徴する文字列を抽出する。しかし、こうした文字列を抽出するためのＯＣＲ処理には時間がかかるため、原稿画像データにファイル名を設定するまでの時間が長くなる。特許文献１に記載の技術では、読取った原稿画像データを記憶部に記憶するまでの時間が長くなるため、ユーザの使い勝手が低下するという問題が生じる。 The image forming apparatus disclosed in Patent Document 1 performs OCR processing on the read document image data, thereby extracting a character string symbolizing the content of the document image data. However, since the OCR process for extracting such a character string takes time, it takes a long time to set a file name in the document image data. In the technique described in Patent Document 1, the time until the read document image data is stored in the storage unit becomes long, so that there is a problem that the usability of the user is lowered.

本発明は、上記のような課題を解決するためになされたものであり、本発明の１つの目的は、使い勝手の低下を抑制しつつ、読取った原稿画像データにファイル名を付して記憶することが可能であり、且つユーザの手間を省くことが可能な画像処理装置を提供することである。 The present invention has been made to solve the above-described problems, and one object of the present invention is to store a read original image data with a file name while suppressing a decrease in usability. It is also possible to provide an image processing apparatus that can save the trouble of the user.

本発明の一の局面に係る画像処理装置は、原稿の画像を原稿画像データとして読取るための原稿読取手段と、原稿読取手段により読取られた原稿画像データのうち、当該原稿画像の一部の領域に対応する画像データに対して文字認識処理を行なうための文字認識手段と、文字認識手段により認識された文字列に基づいて原稿画像データのファイル名を生成するための生成手段と、原稿読取手段により読取られた原稿画像データを生成手段により生成されたファイル名を用いて記憶するための記憶手段とを含む。 An image processing apparatus according to an aspect of the present invention includes a document reading unit for reading an image of a document as document image data, and a partial area of the document image among the document image data read by the document reading unit. A character recognition unit for performing character recognition processing on image data corresponding to the image data, a generation unit for generating a file name of document image data based on a character string recognized by the character recognition unit, and a document reading unit Storage means for storing the original image data read by the generation means using the file name generated by the generation means.

原稿読取手段が原稿の画像を読取ると、文字認識手段が読取られた原稿画像データのうち、当該原稿画像の一部の領域に対応する画像データに対して文字認識処理を行なう。文字認識処理によって文字列が認識されると、生成手段が認識された文字列に基づいて原稿画像データのファイル名を生成する。原稿読取手段によって読取られた原稿画像データは記憶手段によって記憶される。その際、生成手段によって生成されたファイル名が当該原稿画像データのファイル名に用いられる。すなわち、読取られた原稿画像データは、生成手段によって生成されたファイル名が付されて記憶される。 When the document reading unit reads an image of the document, character recognition processing is performed on image data corresponding to a partial area of the document image in the document image data read by the character recognition unit. When the character string is recognized by the character recognition process, the file name of the document image data is generated based on the recognized character string. Document image data read by the document reading unit is stored in the storage unit. At this time, the file name generated by the generation unit is used as the file name of the document image data. That is, the read document image data is stored with the file name generated by the generation unit.

このように、原稿画像の一部の領域に対応する画像データに対して文字認識処理を行なうことにより、一度に原稿画像の全領域に対して文字認識処理を行なう場合に比べて、文字認識処理にかかる時間を短縮できる。そのため、生成したファイル名を用いて原稿画像データを記憶する場合であっても、当該原稿画像データが記憶されるまでの時間を短縮できる。これにより、ユーザの使い勝手が低下するのを抑制できる。読取った原稿画像データに付されるファイル名は生成手段によって自動で生成されるため、ファイル名を手動で入力する手間を省くことができる。さらに、ファイル名は原稿画像の文字列に基づいて生成されるため、原稿画像データに付されるファイル名から当該原稿画像データ（ファイル）の内容が把握し易くなる。ファイルの内容を把握するためにファイルを開く、といった操作を行なう必要性が低減されるため、これによっても、ユーザの手間を省くことができる。 In this way, by performing character recognition processing on image data corresponding to a partial area of a document image, character recognition processing is performed as compared with a case where character recognition processing is performed on all areas of a document image at once. Can reduce the time it takes. Therefore, even when document image data is stored using the generated file name, the time until the document image data is stored can be shortened. Thereby, it can suppress that a user's usability falls. Since the file name attached to the read document image data is automatically generated by the generation unit, it is possible to save the trouble of manually inputting the file name. Furthermore, since the file name is generated based on the character string of the document image, it is easy to grasp the contents of the document image data (file) from the file name attached to the document image data. Since the necessity of performing an operation such as opening a file in order to grasp the contents of the file is reduced, this also saves the user's trouble.

好ましくは、画像処理装置はさらに、ファイル名の候補となる文字列として、所定の空白領域によって挟まれた文字列が文字認識処理によって認識されたか否かを判定するための判定手段と、判定手段の判定結果が否定であることに応答して、原稿画像の他の一部の領域に対応する画像データに対して文字認識処理を行なうよう、文字認識手段を制御するための認識処理制御手段とを含み、生成手段は、判定手段の判定結果が肯定であることに応答して、認識された文字列に基づいて原稿画像データのファイル名を生成するためのファイル名生成手段を含む。 Preferably, the image processing apparatus further includes a determination unit for determining whether or not a character string sandwiched between predetermined blank areas has been recognized by the character recognition process as a character string to be a file name candidate, and a determination unit A recognition processing control means for controlling the character recognition means to perform character recognition processing on image data corresponding to another part of the original image in response to the negative determination result And the generation means includes a file name generation means for generating a file name of the document image data based on the recognized character string in response to the determination result of the determination means being affirmative.

より好ましくは、認識処理制御手段は、原稿画像の先頭から後尾に向かって所定の大きさの領域単位で順に、各領域に対応する画像データに対して文字認識処理を行なうよう、文字認識手段を制御する。 More preferably, the recognition processing control means sets the character recognition means so as to perform character recognition processing on image data corresponding to each area in order of area of a predetermined size from the beginning to the tail of the document image. Control.

さらに好ましくは、判定手段は、予め設定された文字列、及び予め定められた文字数以上の文字列をファイル名の候補から除外したうえで、ファイル名の候補となる文字列が文字認識処理によって認識されたか否かを判定するための文字列判定手段を含む。 More preferably, the determination means excludes a predetermined character string and a character string having a predetermined number of characters or more from the file name candidates, and recognizes the character string as the file name candidate by the character recognition process. It includes character string determination means for determining whether or not it has been done.

さらに好ましくは、画像処理装置はさらに、文字列判定手段の判定結果が否定であることに応答して、除外した文字列をファイル名の候補に設定するための設定手段を含み、生成手段はさらに、設定手段によって設定された文字列に基づいて原稿画像データの代替のファイル名を生成するための代替ファイル名生成手段を含む。 More preferably, the image processing apparatus further includes a setting unit for setting the excluded character string as a file name candidate in response to the determination result of the character string determining unit being negative. , Including alternative file name generation means for generating an alternative file name of the document image data based on the character string set by the setting means.

さらに好ましくは、画像処理装置はさらに、判定手段の判定結果が肯定であることに応答して、ファイル名の候補となる文字列が複数認識されたか否かを判定するための候補数判定手段と、候補数判定手段の判定結果が肯定であることに応答して、ファイル名の候補となる複数の文字列の１つをユーザに選択させるための選択手段とを含み、生成手段は、候補数判定手段の判定結果に応じて、認識された文字列、又は認識された複数の文字列のうち、選択手段を介してユーザに選択された文字列に基づいて原稿画像データのファイル名を生成する。 More preferably, the image processing apparatus further includes a candidate number determination unit for determining whether or not a plurality of character strings as file name candidates are recognized in response to a positive determination result of the determination unit. And a selection means for causing the user to select one of a plurality of character strings as file name candidates in response to the determination result of the candidate number determination means being affirmative. A file name of the document image data is generated based on the recognized character string or a character string selected by the user via the selection unit among the recognized character strings in accordance with the determination result of the determination unit. .

さらに好ましくは、画像処理装置はさらに、文字列判定手段の判定結果が否定であることに応答して、複数の文字列がファイル名の候補に設定されたか否かを判定するための候補数判定手段と、候補数判定手段の判定結果が肯定であることに応答して、ファイル名の候補となる複数の文字列の１つをユーザに選択させるための選択手段とを含み、生成手段は、候補数判定手段の判定結果に応じて、設定された文字列、又は設定された複数の文字列のうち、選択手段を介してユーザに選択された文字列に基づいて原稿画像データのファイル名を生成する。 More preferably, the image processing apparatus further determines the number of candidates for determining whether or not a plurality of character strings are set as file name candidates in response to the determination result of the character string determining means being negative. And a selection means for causing the user to select one of a plurality of character strings that are file name candidates in response to the determination result of the candidate number determination means being affirmative. Based on the determination result of the candidate number determination means, the file name of the document image data is set based on the set character string or the character string selected by the user through the selection means among the set character strings. Generate.

さらに好ましくは、画像処理装置はさらに、複数枚の原稿の画像が原稿読取手段によって読取られたことに応答して、１枚目の原稿の原稿画像データに対する文字認識処理が完了したか否かを判定し、判定結果に応じて、文字認識処理を停止するよう文字認識手段を制御するための処理停止手段を含む。 More preferably, the image processing apparatus further determines whether or not the character recognition processing for the original image data of the first original is completed in response to the reading of the images of the plurality of originals by the original reading unit. Processing stop means for determining and controlling the character recognition means to stop the character recognition processing according to the determination result is included.

以上より、本発明によれば、使い勝手の低下を抑制しつつ、読取った原稿画像データにファイル名を付して記憶することが可能であり、且つユーザの手間を省くことが可能な画像処理装置を得ることができる。 As described above, according to the present invention, an image processing apparatus that can store a read document image data with a file name while suppressing a decrease in usability and saves a user's trouble. Can be obtained.

本発明の第１の実施の形態に係る画像処理装置のハードウェア構成を示す制御ブロック図である。It is a control block diagram which shows the hardware constitutions of the image processing apparatus which concerns on the 1st Embodiment of this invention. 図１に示す画像処理装置で表示されるシステム設定画面の例を示す図である。It is a figure which shows the example of the system setting screen displayed with the image processing apparatus shown in FIG. ファイル名の自動生成処理を説明するための図である。It is a figure for demonstrating the automatic generation process of a file name. ファイル名の自動生成処理を説明するための図である。It is a figure for demonstrating the automatic generation process of a file name. 図１に示す画像処理装置で実行されるプログラムの制御構造を示すフローチャートである。It is a flowchart which shows the control structure of the program performed with the image processing apparatus shown in FIG. 図１に示す画像処理装置で実行されるプログラムの制御構造を示すフローチャートである。It is a flowchart which shows the control structure of the program performed with the image processing apparatus shown in FIG. 図１に示す画像処理装置の動作を説明するための図である。It is a figure for demonstrating operation | movement of the image processing apparatus shown in FIG. 図１に示す画像処理装置の動作を説明するための図である。It is a figure for demonstrating operation | movement of the image processing apparatus shown in FIG. 図１に示す画像処理装置で表示される選択画面の例を示す図である。It is a figure which shows the example of the selection screen displayed with the image processing apparatus shown in FIG. 本発明の第２の実施の形態に係る画像処理装置で実行されるプログラムの制御構造を示すフローチャートである。It is a flowchart which shows the control structure of the program performed with the image processing apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施の形態に係る画像処理装置で実行されるプログラムの制御構造を示すフローチャートである。It is a flowchart which shows the control structure of the program performed with the image processing apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第３の実施の形態に係る画像処理装置で表示される原稿選択画面の例を示す図である。It is a figure which shows the example of the original document selection screen displayed with the image processing apparatus which concerns on the 3rd Embodiment of this invention. 本発明の第３の実施の形態に係る画像処理装置の動作を説明するための図である。It is a figure for demonstrating operation | movement of the image processing apparatus which concerns on the 3rd Embodiment of this invention. 本発明の第３の実施の形態に係る画像処理装置の動作を説明するための図である。It is a figure for demonstrating operation | movement of the image processing apparatus which concerns on the 3rd Embodiment of this invention. 本発明の第３の実施の形態に係る画像処理装置で表示される通知画面の例を示す図である。It is a figure which shows the example of the notification screen displayed with the image processing apparatus which concerns on the 3rd Embodiment of this invention.

以下の実施の形態では、同一の部品には同一の参照番号を付してある。それらの機能及び名称も同一である。したがって、それらについての詳細な説明は繰返さない。 In the following embodiments, the same parts are denoted by the same reference numerals. Their functions and names are also the same. Therefore, detailed description thereof will not be repeated.

（第１の実施の形態）
図１を参照して、本実施の形態に係る画像処理装置１００は、例えば、コピーモード、スキャナモード、及びスキャン送信モード等の複数の動作モードを備える複合機（ＭＦＰ）である。この画像処理装置１００は、レーザー光を露光に利用する、所謂レーザー方式（電子写真方式）の印刷機能を備える。しかし、他の形式の印刷機能を備えたものであってもよい。 (First embodiment)
Referring to FIG. 1, an image processing apparatus 100 according to the present embodiment is a multi-function peripheral (MFP) having a plurality of operation modes such as a copy mode, a scanner mode, and a scan transmission mode. The image processing apparatus 100 has a so-called laser (electrophotographic) printing function that uses laser light for exposure. However, other types of printing functions may be provided.

画像処理装置１００は、原稿の画像を原稿画像データとして読取る画像読取機能を持つ。コピーモードでは、画像処理装置１００は、読取った原稿画像データに基づいて記録用紙に多色又は単色の画像を形成する。スキャナモードでは、画像処理装置１００は、読取った原稿画像データをファイルとして内部の記憶装置に記憶させる。スキャン送信モードでは、画像処理装置１００は、読取った原稿画像データをファイルとして情報処理装置等の外部機器（図示せず。）に送信する。ファイルとして記憶される原稿画像データ、又はファイルとして送信される原稿画像データには、ファイル名が付される。原稿画像データに付される（設定される）ファイル名は、画像処理装置１００によって自動で生成される。 The image processing apparatus 100 has an image reading function for reading an image of a document as document image data. In the copy mode, the image processing apparatus 100 forms a multicolor or single color image on a recording sheet based on the read document image data. In the scanner mode, the image processing apparatus 100 stores the read document image data as a file in an internal storage device. In the scan transmission mode, the image processing apparatus 100 transmits the read document image data as a file to an external device (not shown) such as an information processing apparatus. A document name is assigned to document image data stored as a file or document image data transmitted as a file. The file name assigned (set) to the document image data is automatically generated by the image processing apparatus 100.

画像処理装置１００はさらに、読取った原稿画像データに対してＯＣＲによる文字認識処理（以下「ＯＣＲ処理」と呼ぶ場合がある。）を実行するＯＣＲ機能を持つ。画像処理装置１００は、ＯＣＲ処理により得られた情報（文字群）からファイル名の候補となる文字列を認識し、認識した文字列に基づいてファイル名を自動で生成する。 The image processing apparatus 100 further has an OCR function for executing character recognition processing by OCR (hereinafter sometimes referred to as “OCR processing”) on the read document image data. The image processing apparatus 100 recognizes a character string that is a candidate for a file name from information (character group) obtained by the OCR process, and automatically generates a file name based on the recognized character string.

本実施の形態では、画像処理装置１００は、上記ＯＣＲ機能により、原稿画像データのうち、原稿画像の一部の領域に対応する画像データに対してＯＣＲ処理を実行する。ＯＣＲ処理により得られた情報からファイル名の候補となる文字列が認識されると、画像処理装置１００は、認識された文字列に基づいてファイル名を生成する。ファイル名の候補となる文字列が認識されなかった場合、画像処理装置１００は、当該原稿画像の他の一部の領域に対応する画像データに対してＯＣＲ処理を実行する。 In the present embodiment, the image processing apparatus 100 performs OCR processing on image data corresponding to a partial area of the document image in the document image data by the OCR function. When a character string that is a candidate for the file name is recognized from the information obtained by the OCR process, the image processing apparatus 100 generates a file name based on the recognized character string. When a character string that is a candidate for a file name is not recognized, the image processing apparatus 100 performs OCR processing on image data corresponding to another partial area of the document image.

画像処理装置１００は、１枚（１枚目）の原稿からファイル名の候補となる文字列が認識（抽出）されるまで、上記ＯＣＲ処理を繰返す。複数枚の原稿の画像が読取られた場合、１枚目の原稿の画像データに対してＯＣＲ処理が行なわれる。ただし、１枚目の原稿が白紙の原稿の場合、ＯＣＲ処理は行なわれない。この場合、画像読取り時の日時と画像処理装置１００の固有情報（例えば機種名等）とを組合わせたファイル名が自動で生成される。本実施の形態では、１枚の原稿からファイル名の候補となる文字列が認識されなかった場合も、１枚目の原稿が白紙の原稿の場合と同様、画像読取り時の日時と画像処理装置１００の固有情報（例えば機種名等）とを組合わせたファイル名が自動で生成される。なお、ファイル名の生成処理の詳細については後述する。 The image processing apparatus 100 repeats the OCR process until a character string that is a candidate for a file name is recognized (extracted) from one (first) document. When images of a plurality of documents are read, OCR processing is performed on the image data of the first document. However, if the first document is a blank document, the OCR process is not performed. In this case, a file name is automatically generated by combining the date and time at the time of image reading and unique information (for example, model name) of the image processing apparatus 100. In this embodiment, even when a character string that is a candidate for a file name is not recognized from one original, the date and time at the time of image reading and the image processing apparatus are the same as when the first original is a blank original. A file name combining 100 unique information (for example, model name) is automatically generated. Details of the file name generation process will be described later.

［ハードウェア構成］
画像処理装置１００は、制御部１１０、操作ユニット１２０、原稿読取部１３０、画像処理部１４０、画像形成部１５０、給紙部１６０、及びＮＩＣ（ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣａｒｄ）１７０を含む。 [Hardware configuration]
The image processing apparatus 100 includes a control unit 110, an operation unit 120, a document reading unit 130, an image processing unit 140, an image forming unit 150, a paper feeding unit 160, and a NIC (Network Interface Card) 170.

制御部１１０は、実質的にコンピュータであって、画像処理装置１００全体を制御するＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）１１２、プログラム等を記憶するためのＲＯＭ（Ｒｅａｄ−ＯｎｌｙＭｅｍｏｒｙ）１１４、揮発性の記憶装置であるＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）１１６、及び記憶装置１１８を含む。記憶装置１１８は、通電が遮断された場合にもデータを保持する不揮発性記憶装置であり、例えばハードディスクドライブ（ＨＤＤ）又はフラッシュメモリ等である。ＣＰＵ１１２には、ＢＵＳライン１８０が接続されており、このＢＵＳライン１８０には、ＲＯＭ１１４、ＲＡＭ１１６及び記憶装置１１８が電気的に接続される。 The control unit 110 is substantially a computer, and includes a CPU (Central Processing Unit) 112 that controls the entire image processing apparatus 100, a ROM (Read-Only Memory) 114 for storing programs, and a volatile storage device. A RAM (Random Access Memory) 116 and a storage device 118. The storage device 118 is a non-volatile storage device that retains data even when power is cut off, and is, for example, a hard disk drive (HDD) or a flash memory. A BUS line 180 is connected to the CPU 112, and a ROM 114, a RAM 116, and a storage device 118 are electrically connected to the BUS line 180.

ＣＰＵ１１２は、操作ユニット１２０等からの指示に応じて各種コンピュータプログラムを実行することによって、画像処理装置１００の各部の動作及びパーソナルコンピュータ（ＰＣ）等の外部機器との通信等の所望の処理を実行する。上記の各種コンピュータプログラムは、予めＲＯＭ１１４又は記憶装置１１８に記憶されており、所望の処理の実行時において、当該ＲＯＭ１１４又は記憶装置１１８から読出されてＲＡＭ１１６に転送される。ＣＰＵ１１２は、ＣＰＵ１１２内の図示しないプログラムカウンタと呼ばれるレジスタに格納された値によって指定される、ＲＡＭ１１６内のアドレスからプログラムの命令を読出し、解釈する。ＣＰＵ１１２はまた、読出された命令によって指定されるアドレスから演算に必要なデータを読出し、そのデータに対し命令に対応する演算を実行する。実行の結果も、ＲＡＭ１１６、記憶装置１１８及びＣＰＵ１１２内のレジスタ等の、命令によって指定されるアドレスに格納される。 The CPU 112 executes various computer programs in accordance with instructions from the operation unit 120 and the like, thereby executing desired processing such as operations of each unit of the image processing apparatus 100 and communication with an external device such as a personal computer (PC). To do. The various computer programs are stored in advance in the ROM 114 or the storage device 118, and are read from the ROM 114 or the storage device 118 and transferred to the RAM 116 when desired processing is executed. The CPU 112 reads and interprets a program instruction from an address in the RAM 116 specified by a value stored in a register called a program counter (not shown) in the CPU 112. The CPU 112 also reads data necessary for the operation from the address specified by the read instruction, and executes an operation corresponding to the instruction on the data. The execution result is also stored in an address specified by an instruction, such as a register in the RAM 116, the storage device 118, and the CPU 112.

ＲＯＭ１１４又は記憶装置１１８には、画像処理装置１００の一般的な動作等を実現するためのコンピュータプログラムとともに、読取った原稿画像データに対してＯＣＲ処理を行ない、当該原稿画像データのファイル名を生成する処理を実現するためのコンピュータプログラムが記憶される。このコンピュータプログラムは、画像処理装置１００の製造時にＲＯＭ１１４又は記憶装置１１８に書込まれる。なお、このコンピュータプログラムは、ＮＩＣ１７０を介して、外部機器等から提供されてもよい。さらにこのコンピュータプログラムは、そのコンピュータプログラムが記録された、例えばＤＶＤ等の記憶媒体によって提供されてもよい。すなわち、例えばコンピュータプログラムの記録媒体としてのＤＶＤが、画像処理装置１００内に内蔵されるＤＶＤドライブ（図示せず。）に装着され、そのＤＶＤからコンピュータプログラムが読出されて記憶装置１１８にインストールされてもよい。記憶装置１１８は、他に、原稿読取部１３０で読取った原稿画像データ等の各種データを記憶する。 The ROM 114 or the storage device 118 performs OCR processing on the read original image data together with a computer program for realizing general operations of the image processing apparatus 100, and generates a file name of the original image data. A computer program for realizing the processing is stored. This computer program is written in the ROM 114 or the storage device 118 when the image processing apparatus 100 is manufactured. The computer program may be provided from an external device or the like via the NIC 170. Further, the computer program may be provided by a storage medium such as a DVD on which the computer program is recorded. That is, for example, a DVD as a computer program recording medium is loaded into a DVD drive (not shown) built in the image processing apparatus 100, and the computer program is read from the DVD and installed in the storage device 118. Also good. In addition, the storage device 118 stores various data such as document image data read by the document reading unit 130.

ＢＵＳライン１８０には、さらに、操作ユニット１２０、原稿読取部１３０、画像処理部１４０、画像形成部１５０、給紙部１６０、及びＮＩＣ１７０が電気的に接続される。 Further, the operation unit 120, the document reading unit 130, the image processing unit 140, the image forming unit 150, the paper feeding unit 160, and the NIC 170 are electrically connected to the BUS line 180.

操作ユニット１２０はユーザによる操作を受付ける。操作ユニット１２０は、入出力インターフェイス（図示せず。）を介して、ＣＰＵ１１２と通信を行なう。この操作ユニット１２０は、操作パネル１２２を含む。操作パネル１２２は、液晶パネル等で構成された表示パネルと、表示パネルの上に配置され、タッチされた位置を検出するタッチパネルとを含む。表示パネルは、画像処理装置１００の状態及び各種処理の状態に関する情報等の各種情報をユーザに提供する。この操作パネル１２２は、ユーザに対して対話的な操作インターフェイス（ＵＩ）を提供する。この対話的な操作インターフェイスは、タッチパネルから画像処理装置１００全体の動作に対するユーザの指示を受付け、その指示の内容を表示パネルに表示するとともに、その指示に応じた制御信号を制御部１１０等に対して出力する。 The operation unit 120 receives an operation by a user. The operation unit 120 communicates with the CPU 112 via an input / output interface (not shown). The operation unit 120 includes an operation panel 122. The operation panel 122 includes a display panel constituted by a liquid crystal panel and the like, and a touch panel that is disposed on the display panel and detects a touched position. The display panel provides the user with various information such as information relating to the state of the image processing apparatus 100 and the state of various processes. The operation panel 122 provides an interactive operation interface (UI) to the user. This interactive operation interface receives a user instruction for the operation of the entire image processing apparatus 100 from the touch panel, displays the contents of the instruction on the display panel, and sends a control signal corresponding to the instruction to the control unit 110 and the like. Output.

原稿読取部１３０は、光源を含む原稿走査ユニット、反射ミラー、光学レンズ及びＣＣＤ（Ｃｈａｒｇｅ−ＣｏｕｐｌｅｄＤｅｖｉｃｅ）ラインセンサ（以上いずれも図示せず。）を含む。原稿走査ユニットは、原稿載置台（図示せず。）上に載置された原稿の画像表面に対し光源から光を照射することによって反射光像を得る。反射ミラー及び光学レンズは、得られる反射光像をＣＣＤラインセンサ上に結像させる。ＣＣＤラインセンサは、結像した反射光像を順次光電変換して画像データとして画像処理部１４０に対して出力する。すなわち、原稿読取部１３０は、原稿のコピー時又はスキャン時に、原稿載置台に載置される原稿から画像情報を読取り、読取った画像情報を電気信号に変換して原稿画像データとして画像処理部１４０に対して出力する。 The document reading unit 130 includes a document scanning unit including a light source, a reflection mirror, an optical lens, and a CCD (Charge-Coupled Device) line sensor (all of which are not shown). The document scanning unit obtains a reflected light image by irradiating light from a light source onto the image surface of a document placed on a document placement table (not shown). The reflection mirror and the optical lens form the resulting reflected light image on the CCD line sensor. The CCD line sensor sequentially photoelectrically converts the formed reflected light image and outputs it to the image processing unit 140 as image data. That is, the document reading unit 130 reads image information from a document placed on a document placing table when copying or scanning a document, converts the read image information into an electrical signal, and converts the read image information into document image data. Output for.

画像処理部１４０は、ＭＰＵ（ＭｉｃｒｏＰｒｏｃｅｓｓｉｎｇＵｎｉｔ、図示せず。）を含む。画像処理部１４０は、原稿読取部１３０、又は、外部機器等から受信した画像データに対して、例えば、ラスタライズ処理等の各種処理を施して所定の階調の印刷データを作成する。印刷処理時には、画像処理部１４０は、作成した印刷データを画像形成部１５０に対して出力する。 The image processing unit 140 includes an MPU (Micro Processing Unit, not shown). The image processing unit 140 performs various processes such as rasterization processing on the image data received from the document reading unit 130 or an external device, and creates print data of a predetermined gradation. During the printing process, the image processing unit 140 outputs the created print data to the image forming unit 150.

画像形成部１５０は、画像データによって示される画像をカラー又は単色で記録用紙に印刷するものであって、例えば、感光体ドラム、帯電装置、レーザースキャンユニット（ＬＳＵ）、現像装置、転写装置、クリーニング装置、定着装置、及び除電装置等を備えている。画像形成部１５０には、例えば、搬送路が設けられており、給紙部１６０から給紙されてきた記録用紙が搬送路に沿って搬送される。 The image forming unit 150 prints an image indicated by image data on a recording sheet in color or single color, and includes, for example, a photosensitive drum, a charging device, a laser scanning unit (LSU), a developing device, a transfer device, and a cleaning device. A device, a fixing device, a static eliminator, and the like. For example, the image forming unit 150 is provided with a conveyance path, and the recording paper fed from the paper feeding unit 160 is conveyed along the conveyance path.

給紙部１６０は、記録用紙を収納する給紙トレイ（図示せず。）を含む。給紙部１６０は、給紙トレイに収納された記録用紙を１枚ずつ引出して記録用紙を画像形成部１５０の搬送路へと送り出す。画像形成部１５０の搬送路に沿って記録用紙が搬送されている途中で、記録用紙が感光体ドラムと転写装置との間を通過し、さらに定着装置を通過して、記録用紙に対する印刷が行なわれる。 The paper feed unit 160 includes a paper feed tray (not shown) that stores recording paper. The paper feeding unit 160 pulls out the recording paper stored in the paper feeding tray one by one and sends the recording paper to the conveyance path of the image forming unit 150. While the recording paper is being transported along the transport path of the image forming unit 150, the recording paper passes between the photosensitive drum and the transfer device, and further passes through the fixing device to perform printing on the recording paper. It is.

感光体ドラムは、一方向に回転し、その表面は、クリーニング装置と除電装置によりクリーニングされた後、帯電装置により均一に帯電される。レーザースキャンユニットは、印刷対象の画像データに基づいてレーザー光を変調し、このレーザー光によって感光体ドラムの表面を主走査方向に繰返し走査して、静電潜像を感光体ドラムの表面に形成する。現像装置は、トナーを感光体ドラムの表面に供給して静電潜像を現像し、トナー像を感光体ドラムの表面に形成する。転写装置は、転写装置と感光体ドラムとの間を通過していく記録用紙に感光体ドラムの表面のトナー像を転写する。 The photosensitive drum rotates in one direction, and its surface is cleaned by a cleaning device and a static eliminator and then uniformly charged by a charging device. The laser scanning unit modulates the laser beam based on the image data to be printed, and repeatedly scans the surface of the photosensitive drum in the main scanning direction with this laser beam to form an electrostatic latent image on the surface of the photosensitive drum. To do. The developing device supplies toner to the surface of the photosensitive drum to develop the electrostatic latent image, and forms a toner image on the surface of the photosensitive drum. The transfer device transfers the toner image on the surface of the photosensitive drum to a recording sheet passing between the transfer device and the photosensitive drum.

定着装置は、記録用紙を加熱するための加熱ローラと、記録用紙を加圧するための加圧ローラとを含む。記録用紙が加熱ローラによって加熱され、且つ、加圧ローラによって加圧されることによって、記録用紙上に転写されたトナー像が記録用紙に定着される。定着装置から排出された（印刷された）記録用紙は排紙トレイに排出される。 The fixing device includes a heating roller for heating the recording paper and a pressure roller for pressing the recording paper. The recording sheet is heated by the heating roller and pressed by the pressure roller, whereby the toner image transferred onto the recording sheet is fixed on the recording sheet. The recording paper discharged (printed) from the fixing device is discharged to a paper discharge tray.

ＮＩＣ１７０は、ネットワーク５０とのインターフェイスをとる。画像処理装置１００は、このＮＩＣ１７０を介して、ネットワーク５０上の外部機器等と、所定の通信プロトコルにしたがったデータ通信を行なうことができる。画像処理装置１００は、ＮＩＣ１７０を介して、ＰＣ等から印刷ジョブ等の各種処理の実行を命令する命令信号を受信できる。 The NIC 170 has an interface with the network 50. The image processing apparatus 100 can perform data communication according to a predetermined communication protocol with an external device or the like on the network 50 via the NIC 170. The image processing apparatus 100 can receive an instruction signal for instructing execution of various processes such as a print job from a PC or the like via the NIC 170.

［ファイル名の自動生成処理］
操作パネル１２２に表示されるシステム設定画面２００（図２参照）において、ファイル名を自動で生成するよう設定されると、スキャナモード、又はスキャン送信モードにおける原稿画像の読取り時（スキャン時）にＯＣＲ処理が行なわれる。 [Automatic file name generation]
When the system setting screen 200 (see FIG. 2) displayed on the operation panel 122 is set to automatically generate a file name, OCR is performed when reading a document image (scanning) in the scanner mode or the scan transmission mode. Processing is performed.

図２を参照して、システム設定画面２００は、スキャン時にファイル名候補を自動で生成するよう設定する場合にチェックが入れられるチェックボックス２１０、ファイル名候補が複数ある場合の動作を選択するためのラジオボタン２２０、及びファイル名候補の精度を高める場合にチェックが入れられるチェックボックス２３０を含む。チェックボックス２１０にチェックが入れられていると、原稿画像の読取り時（スキャン時）にＯＣＲ処理が実行される。ＯＣＲ処理は、読取った原稿画像データのうち、原稿画像の一部の領域に対応する画像データを制御部１１０が読込むことによって、読込んだ画像データに対して行なわれる。ＯＣＲ処理が行なわれると、当該ＯＣＲ処理により得られた情報（文字群）からファイル名の候補となる文字列が認識されて抽出される。 Referring to FIG. 2, a system setting screen 200 is used to select a check box 210 to be checked when setting to automatically generate file name candidates at the time of scanning, and an operation when there are a plurality of file name candidates. It includes a radio button 220 and a check box 230 that is checked to increase the accuracy of the file name candidate. When the check box 210 is checked, the OCR process is executed when the document image is read (scanning). The OCR processing is performed on the read image data when the control unit 110 reads image data corresponding to a partial area of the document image in the read document image data. When the OCR process is performed, a character string that is a candidate for a file name is recognized and extracted from information (character group) obtained by the OCR process.

具体的には、ＯＣＲ処理により得られた文字群から所定の空白領域によって挟まれた文字群が一つの文字列として認識される。文字列が横書きの場合は、左右（前後）に所定の空白領域のある文字群が一つの文字列として認識され、文字列が縦書きの場合は、上下（前後）に所定の空白領域のある文字群が一つの文字列として認識される。横書きか縦書きかの判定には、例えば特開２００９−９８７７７号公報に記載の技術（文字認識の記述方向の判定技術）を用いることができる。所定の空白領域は、ＯＣＲ処理により得られた文字群のなかから一つの文字列を認識することが可能な大きさであればよく、例えば１文字以上の大きさの領域であるのが好ましく、３文字以上の大きさの領域であればより好ましい。こうした文字列が認識されると、認識された文字列から一般的な文字列及び予め定められた文字数（以下「一定の文字数」と呼ぶ。）以上の文字列が候補の対象から除外される。一般的な文字列は、日付、部署名、又は「社外秘」等の定型的な文字列を含む。こうした文字列は、予め画像処理装置１００に登録されている。一定の文字数以上の文字列とは、例えばファイル名に適用可能な文字数（例えば３０文字）を超える文字列である。一般的な文字列及び一定の文字数以上の文字列が認識された文字列から除外されることにより、例えば、原稿の内容を示す件名、表題、又は見出し等の文字列がファイル名の候補として抽出される。 Specifically, a character group sandwiched by a predetermined blank area from a character group obtained by OCR processing is recognized as one character string. If the character string is written horizontally, a group of characters with a predetermined blank area on the left and right (front and back) is recognized as one character string. If the character string is written vertically, there is a predetermined blank area on the top and bottom (front and back). A character group is recognized as one character string. For the determination of horizontal writing or vertical writing, for example, a technique described in Japanese Patent Application Laid-Open No. 2009-98777 (determination technique for character recognition description direction) can be used. The predetermined blank area only needs to have a size capable of recognizing one character string from the character group obtained by the OCR process, and is preferably an area having a size of one character or more, for example. It is more preferable if the area has a size of 3 characters or more. When such a character string is recognized, a general character string and a character string greater than a predetermined number of characters (hereinafter referred to as “a certain number of characters”) are excluded from candidate characters. The general character string includes a fixed character string such as date, department name, or “confidential”. Such character strings are registered in the image processing apparatus 100 in advance. A character string having a certain number of characters or more is a character string exceeding the number of characters (for example, 30 characters) applicable to a file name, for example. By excluding general character strings and character strings exceeding a certain number of characters from recognized character strings, for example, character strings such as subject, title, or headline indicating the contents of the manuscript are extracted as file name candidates. Is done.

ファイル名の候補は複数抽出されることがあり得る。こうした場合に、自動でファイル名を選択するのか、手動でファイル名を選択するのかがラジオボタン２２０によって予め設定される。自動でファイル名を選択するよう設定されていると、ファイルの最も先頭側に位置する文字列がファイル名として選択される。手動でファイル名を選択するよう設定されていると、ファイル名とする文字列をユーザに選択させるための後述する選択画面が操作パネル１２２に表示される。 A plurality of file name candidates may be extracted. In such a case, whether to select a file name automatically or manually select a file name is preset by the radio button 220. If the file name is set to be automatically selected, the character string located at the top of the file is selected as the file name. If the file name is set to be manually selected, a selection screen (to be described later) for allowing the user to select a character string as the file name is displayed on the operation panel 122.

チェックボックス２３０にチェックが入れられていると、認識された文字列から一般的な文字列及び一定の文字数以上の文字列が除外されることによってファイル名の候補となる文字列が抽出されない場合、ファイル名候補の精度を高めるために、当該原稿画像の他の一部の領域に対応する画像データに対してＯＣＲ処理が行なわれる。チェックボックス２３０にチェックが入れられていないと、認識された文字列から一般的な文字列及び一定の文字数以上の文字列が除外されることによってファイル名の候補となる文字列が抽出されない場合、除外された文字列がファイル名の候補として抽出（設定）される。 When the check box 230 is checked, when a character string that is a candidate for a file name is not extracted by excluding a general character string and a character string of a certain number of characters or more from the recognized character string, In order to improve the accuracy of the file name candidate, OCR processing is performed on image data corresponding to another part of the original image. If the check box 230 is not checked, a character string that is a candidate for a file name is not extracted by excluding a general character string and a character string of a certain number of characters or more from the recognized character string. The excluded character string is extracted (set) as a file name candidate.

図３を参照して、読取った原稿画像データに対するＯＣＲ処理は、まず、原稿画像の先頭の領域２５０に対応する画像データに対して行なわれる。先頭の領域２５０は、原稿画像の先頭から例えば１０％の領域（原稿画像の大きさ（Ｌ１）に対する領域２５０の大きさ（Ｌ２）の割合が１０％の領域）である。領域２５０の大きさは、原稿画像に対して１０％以上５０％以下であるのが好ましい。先頭の領域２５０においてファイル名の候補となる文字列が抽出されると、それ以上、ＯＣＲ処理は行なわれない。なお、領域２５０の境界線上の文字列は、ＯＣＲ処理によって認識されない。 Referring to FIG. 3, the OCR process for the read document image data is first performed on the image data corresponding to the top region 250 of the document image. The leading area 250 is, for example, an area of 10% from the beginning of the document image (an area where the ratio of the size (L2) of the region 250 to the size (L1) of the document image is 10%). The size of the region 250 is preferably 10% or more and 50% or less with respect to the document image. When a character string that is a candidate for a file name is extracted in the first area 250, no further OCR processing is performed. Note that the character string on the boundary line of the region 250 is not recognized by the OCR process.

図４を参照して、先頭の領域２５２において文字列が認識されなかった場合（図４（Ａ）参照）、原稿画像の他の一部の領域２５４に対応する画像データに対してＯＣＲ処理が実行される（図４（Ｂ）参照）。領域２５４は、先頭の領域２５２の次の領域である。この領域２５４の大きさは、先頭の領域２５２の大きさと同じである。次の領域２５４においてファイル名の候補となる文字列が抽出されると、それ以上、ＯＣＲ処理は行なわれない。一方、次の領域２５４においても文字列が抽出されない場合は、さらにその次の領域に対応する画像データに対してＯＣＲ処理が実行される。このように、原稿画像の先頭から後尾（矢印Ｙ方向）に向かって所定の大きさの領域単位で順に、各領域に対応する画像データに対してＯＣＲ処理が実行される。この処理は、上記のように、１枚の原稿からファイル名の候補となる文字列が抽出されるまで繰返される。ただし、上記したＯＣＲ処理は、白紙の原稿に対しては実行されない。読取った原稿が白紙の原稿か否かは、原稿画像データのデータサイズに基づいて判定される。 Referring to FIG. 4, when a character string is not recognized in the leading area 252 (see FIG. 4A), OCR processing is performed on image data corresponding to another partial area 254 of the document image. It is executed (see FIG. 4B). The area 254 is an area next to the head area 252. The size of this area 254 is the same as the size of the top area 252. When a character string that is a candidate for a file name is extracted in the next area 254, no further OCR processing is performed. On the other hand, if no character string is extracted in the next area 254, OCR processing is further performed on the image data corresponding to the next area. As described above, the OCR process is executed on the image data corresponding to each region in order of a region having a predetermined size from the head of the document image to the tail (arrow Y direction). As described above, this process is repeated until a character string that is a candidate for a file name is extracted from one original. However, the OCR process described above is not executed for a blank document. Whether or not the read original is a blank original is determined based on the data size of the original image data.

なお、ＯＣＲ処理を行なう領域の境界線上に文字列が存在する場合、当該ＯＣＲ処理によって境界線上に文字列が存在することが認識される。この場合、さらにＯＣＲ処理が実行されると、次の領域は、前の領域の境界線上の文字列を含むように設定される。例えば、先頭の領域２５２の境界線上に文字列が存在すると、次の領域２５４は、先頭の領域２５２の境界線上の文字列が認識されるように先頭の領域２５２と一部重複するように設定される。 When a character string exists on the boundary line of the region where the OCR process is performed, it is recognized that the character string exists on the boundary line by the OCR process. In this case, when the OCR process is further executed, the next area is set to include the character string on the boundary line of the previous area. For example, if a character string exists on the boundary line of the leading area 252, the next area 254 is set to partially overlap the leading area 252 so that the character string on the boundary line of the leading area 252 is recognized. Is done.

［ソフトウェア構成］
図５及び図６を参照して、読取った原稿画像データに設定するファイル名を自動で生成する処理を行なうために、画像処理装置１００で実行されるコンピュータプログラムの制御構造について説明する。このプログラムは、図２に示されるシステム設定画面２００のチェックボックス２１０にチェックが入れられた状態で、原稿読取部１３０による原稿画像データの読取処理が開始（スキャン開始）されたことに応じて開始する。 Software configuration
With reference to FIGS. 5 and 6, a control structure of a computer program executed by the image processing apparatus 100 in order to automatically generate a file name to be set for the read document image data will be described. This program starts when the document image data reading process by the document reading unit 130 is started (scanning is started) with the check box 210 of the system setting screen 200 shown in FIG. 2 checked. To do.

図５を参照して、このプログラムは、読取った原稿画像データのデータサイズに基づいて、１枚目の原稿が白紙か否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ１０００と、ステップＳ１０００において、１枚目の原稿が白紙ではないと判定された場合に実行され、１枚目の原稿の先頭領域に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を実行するステップＳ１０１０と、ステップＳ１０１０の後に実行され、ＯＣＲ処理によって文字列が認識されたか否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ１０２０と、ステップＳ１０２０において、文字列が認識されたと判定された場合に実行され、認識された文字列から、一般的な文字列及び一定の文字数以上の文字列をファイル名の候補から除外するステップＳ１０３０と、ステップＳ１０３０の後に実行され、除外した後にファイル名の候補となる文字列があるか否かを判定し、判定結果に応じて制御の流れ分岐させるステップＳ１０４０とを含む。 Referring to FIG. 5, the program determines whether or not the first document is blank based on the data size of the read document image data, and branches the control flow according to the determination result (step S1000). In step S1000, the process is executed when it is determined that the first document is not blank, and the image data corresponding to the first area of the first document is read, and the OCR process is executed on the image data. Step S1010 is executed, and after Step S1010, it is determined whether or not the character string is recognized by the OCR process, and the flow of control is branched according to the determination result. In Step S1020, the character string is recognized. If it is determined that the character string has been deleted, a general character string and a character string with a certain number of characters or more are extracted from the recognized character string. Step S1030 for excluding from name candidates, Step S1040 executed after step S1030, determining whether there is a character string that is a candidate for a file name after being excluded, and branching the control flow according to the determination result including.

このプログラムはさらに、ステップＳ１０４０において、ファイル名の候補があると判定された場合に実行され、ファイル名の候補が複数存在するか否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ１０５０と、ステップＳ１０５０において、ファイル名の候補が複数存在すると判定された場合に実行され、システム設定画面２００において、「手動でファイル名を選択する」設定がＯＮにされているか（選択されているか）否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ１０６０と、ステップＳ１０６０において、「手動でファイル名を選択する」設定がＯＮにされていると判定された場合に実行され、操作パネル１２２に選択画面を表示して、ユーザによるファイル名の選択操作を受付けるステップＳ１０７０と、ステップＳ１０５０において、ファイル名の候補が複数存在しないと判定された場合、ステップＳ１０６０において、「手動でファイル名を選択する」設定がＯＮにされていない、すなわち「自動でファイル名を選択する」設定がＯＮにされていると判定された場合、又はステップＳ１０７０の後に実行され、認識された文字列、又はユーザによって選択された文字列をファイル名に決定するステップＳ１０８０とを含む。 This program is further executed when it is determined in step S1040 that there are file name candidates, and it is determined whether there are a plurality of file name candidates, and the control flow is branched according to the determination result. This is executed when it is determined in step S1050 and step S1050 that there are a plurality of file name candidates. In the system setting screen 200, the “manually select file name” setting is set to ON (selected) Executed when it is determined in step S1060 that branches the control flow according to the determination result and in step S1060 that the “manually select file name” setting is ON. A selection screen is displayed on the operation panel 122, and a step for accepting a file name selection operation by the user is received. In step S1070 and step S1050, if it is determined that there are not a plurality of file name candidates, in step S1060, the “manually select file name” setting is not turned on, that is, “automatically select a file name”. When it is determined that the “select” setting is ON, or after step S1070, a recognized character string or a character string selected by the user is determined as a file name.

図６を参照して、このプログラムはさらに、ステップＳ１０４０（図５参照）において、ファイル名の候補がないと判定された場合に実行され、システム設定画面２００において、「ファイル名候補の精度を高める」設定がＯＮにされているか否か、すなわち、チェックボックス２３０にチェックが入れられているか否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ１０９０と、ステップＳ１０９０において、「ファイル名候補の精度を高める」設定がＯＮにされていないと判定された場合に実行され、除外した一般的な文字列及び一定の文字数以上の文字列をファイル名の候補に設定するステップＳ１１００と、ステップＳ１０９０において、「ファイル名候補の精度を高める」設定がＯＮにされていると判定された場合に実行され、除外した一般的な文字列及び一定の文字数以上の文字列をファイル名のサブ候補に追加するステップＳ１１１０と、ステップＳ１１１０の後、又はステップＳ１０２０（図５参照）において、ＯＣＲ処理によって文字列が認識されなかったと判定された場合に実行され、１枚目の原稿画像データの読込みが完了したか否か、すなわち１枚目の原稿画像の全領域に対してＯＣＲ処理が完了したか否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ１１２０とを含む。 Referring to FIG. 6, this program is further executed when it is determined in step S1040 (see FIG. 5) that there is no file name candidate. In step S1090 and step S1090, it is determined whether or not the setting is turned on, that is, whether or not the check box 230 is checked, and the control flow branches according to the determination result. Step S1100 is executed when it is determined that the setting of “improving the accuracy of name candidates” is not turned on, and a general character string excluded and a character string of a certain number of characters or more are set as file name candidates; If it is determined in step S1090 that the “increase accuracy of file name candidate” setting is ON In step S1110 to add the excluded general character string and the character string of a certain number of characters or more to the file name sub-candidate, and after step S1110 or in step S1020 (see FIG. 5), the OCR process This is executed when it is determined that the character string has not been recognized. Whether or not reading of the first document image data has been completed, that is, whether or not the OCR processing has been completed for all areas of the first document image. Step S1120 for determining whether or not and branching the flow of control according to the determination result.

このプログラムはさらに、ステップＳ１１２０において、１枚目の原稿画像データの読込みが完了していないと判定された場合に実行され、１枚目の原稿画像の次の領域に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を実行するステップＳ１１３０と、ステップＳ１１２０において、１枚目の原稿画像データの読込みが完了したと判定された場合に実行され、ファイル名のサブ候補として追加された文字列があるか否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ１１４０と、ステップＳ１１４０において、サブ候補があると判定された場合に実行され、そのサブ候補をファイル名の候補に設定するステップＳ１１５０と、ステップＳ１１４０において、サブ候補がないと判定された場合、又は、ステップＳ１０００（図５参照）において、１枚目の原稿が白紙であると判定された場合に実行され、画像読取り時の日時と画像処理装置１００の固有情報とを組合わせた文字列（「日時＋マシン固有情報」）をファイル名として作成するステップＳ１１６０とを含む。ステップＳ１１３０の処理が終了すると、制御は図５に示すステップＳ１０２０に戻る。ステップＳ１１００又はステップＳ１１５０の処理が終了すると、制御は図５に示すステップＳ１０５０に戻る。 This program is further executed when it is determined in step S1120 that reading of the first document image data has not been completed, and image data corresponding to the next area of the first document image is read. Characters added as sub-candidates of file names that are executed when it is determined in step S1130 that executes OCR processing on the image data and in step S1120 that reading of the first document image data is completed. Step S1140 for determining whether or not there is a column and branching the flow of control according to the determination result, and when it is determined that there is a sub-candidate in step S1140, the sub-candidate is a candidate for a file name. If it is determined in step S1150 and step S1140 that there is no sub-candidate, In step S1000 (see FIG. 5), the process is executed when it is determined that the first document is blank, and a character string (“ Step S1160 of creating “date and time + machine specific information”) as a file name. When the process of step S1130 ends, control returns to step S1020 shown in FIG. When the process of step S1100 or step S1150 ends, control returns to step S1050 shown in FIG.

再び図５を参照して、ステップＳ１０８０又は図６に示すステップＳ１１６０の処理が終了すると、このプログラムは終了する。 Referring to FIG. 5 again, when the process of step S1080 or step S1160 shown in FIG. 6 ends, the program ends.

［動作］
本実施の形態に係る画像処理装置１００は以下のように動作する。以下の説明では、画像処理装置１００の動作の内、本発明に関連する部分のみを説明する。他の動作は従来の画像処理装置の動作と同じである。 [Operation]
The image processing apparatus 100 according to the present embodiment operates as follows. In the following description, only the part related to the present invention in the operation of the image processing apparatus 100 will be described. Other operations are the same as those of the conventional image processing apparatus.

画像処理装置１００のユーザは、図７に示される原稿の画像を原稿読取部１３０で読取り、読取った原稿画像データを記憶装置１１８に保存するものとする。ユーザは、システム設定画面２００のチェックボックス２１０及びチェックボックス２３０にチェックを入れ、スキャナモードにて原稿の画像を読取る。複数枚の原稿の画像を読取る場合、画像処理装置１００は、画像を読取る処理と並行して、ファイル名の生成処理を実行する。 Assume that the user of the image processing apparatus 100 reads the image of the document shown in FIG. 7 with the document reading unit 130 and stores the read document image data in the storage device 118. The user checks the check box 210 and the check box 230 on the system setting screen 200, and reads the image of the document in the scanner mode. When reading an image of a plurality of documents, the image processing apparatus 100 executes a file name generation process in parallel with the image reading process.

画像処理装置１００の制御部１１０は、読取った原稿画像データのデータサイズに基づいて、１枚目の原稿が白紙か否かを判定する。図７に示される原稿の１枚目の原稿２６０は、白紙の原稿ではないため、制御部１１０は、１枚目の原稿２６０は白紙ではないと判定する（図５に示すステップＳ１０００においてＮＯ）。制御部１１０は、１枚目の原稿２６０の先頭の領域２７０に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を行なう（ステップＳ１０１０）。制御部１１０は、ＯＣＲ処理によって文字列が認識されたか否かを判定する。先頭の領域２７０には、所定の空白領域によって挟まれた文字列２６２及び文字列２６４が存在するため、制御部１１０は文字列が認識されたと判定する（ステップＳ１０２０においてＹＥＳ）。なお、文字列２６６は、先頭の領域２７０の境界線上に位置するため、ＯＣＲ処理によって認識されない。制御部１１０は、認識された文字列から一般的な文字列及び一定の文字数以上の文字列をファイル名の候補から除外する（ステップＳ１０３０）。文字列２６２は日付であり、文字列２６４は部署名である。これらは一般的な文字列として予め登録されているものとする。文字列２６２及び文字列２６４は、ファイル名の候補から除外される。 The control unit 110 of the image processing apparatus 100 determines whether or not the first document is blank based on the data size of the read document image data. Since the first document 260 shown in FIG. 7 is not a blank document, control unit 110 determines that first document 260 is not a blank document (NO in step S1000 shown in FIG. 5). . The control unit 110 reads image data corresponding to the first area 270 of the first document 260 and performs OCR processing on the image data (step S1010). The control unit 110 determines whether or not a character string is recognized by the OCR process. Since the character string 262 and the character string 264 sandwiched between predetermined blank areas exist in the first area 270, the control unit 110 determines that the character string has been recognized (YES in step S1020). Note that the character string 266 is not recognized by the OCR process because it is located on the boundary line of the head region 270. The control unit 110 excludes a general character string and a character string of a certain number of characters or more from the recognized character strings from file name candidates (step S1030). The character string 262 is a date, and the character string 264 is a department name. These are registered in advance as general character strings. The character string 262 and the character string 264 are excluded from file name candidates.

制御部１１０は、文字列２６２及び文字列２６４を除外した後にファイル名の候補となる文字列があるか否かを判定する。先頭の領域２７０には、除外した後にファイル名の候補となる文字列がないため、制御部１１０は、ファイル名の候補となる文字列が存在しないと判定する（ステップＳ１０４０においてＮＯ）。システム設定画面２００のチェックボックス２３０にチェックが入れられているため、制御部１１０は、「ファイル名候補の精度を高める」設定がＯＮにされていると判定する（図６に示すステップＳ１０９０においてＹＥＳ）。制御部１１０は、除外した文字列２６２及び文字列２６４をファイル名のサブ候補に追加し（ステップＳ１１１０）、１枚目の原稿画像データの読込みが完了したか否かを判定する。１枚目の原稿画像データの読込みは完了していないため（ステップＳ１１２０においてＮＯ）。制御部１１０は、１枚目の原稿２６０の次の領域２７２に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を行なう（ステップＳ１１３０）。なお、先頭の領域２７０の境界線上に文字列２６６が存在するため、次の領域２７２は、この文字列２６６が認識されるように先頭の領域２７０と一部重複するように設定される。制御部１１０は、ＯＣＲ処理によって文字列が認識されたか否かを判定する。次の領域２７２には、所定の空白領域によって挟まれた文字列２６６及び文字列２６８が存在するため、制御部１１０は文字列が認識されたと判定する（図５に示すステップＳ１０２０においてＹＥＳ）。 The control unit 110 determines whether there is a character string that is a candidate for the file name after the character string 262 and the character string 264 are excluded. Since there is no character string that becomes a candidate for the file name in the first area 270 after exclusion, the control unit 110 determines that there is no character string that becomes a candidate for the file name (NO in step S1040). Since the check box 230 of the system setting screen 200 is checked, the control unit 110 determines that the “increase the accuracy of the file name candidate” setting is ON (YES in step S1090 shown in FIG. 6). ). The control unit 110 adds the excluded character string 262 and character string 264 to the file name sub-candidate (step S1110), and determines whether reading of the first document image data is completed. This is because reading of the first document image data has not been completed (NO in step S1120). The control unit 110 reads image data corresponding to the next area 272 of the first original 260 and performs OCR processing on the image data (step S1130). Since the character string 266 exists on the boundary line of the leading area 270, the next area 272 is set so as to partially overlap the leading area 270 so that the character string 266 is recognized. The control unit 110 determines whether or not a character string is recognized by the OCR process. Since the character string 266 and the character string 268 sandwiched between predetermined blank areas exist in the next area 272, the control unit 110 determines that the character string has been recognized (YES in step S1020 shown in FIG. 5).

制御部１１０は、認識された文字列から一般的な文字列及び一定の文字数以上の文字列をファイル名の候補から除外する（ステップＳ１０３０）。文字列２６６は、定型的なものであるため、一般的な文字列として予め登録されているものとする。文字列２６６は、ファイル名の候補から除外される。制御部１１０は、文字列２６６を除外した後にファイル名の候補となる文字列があるか否かを判定する。次の領域２７２には、除外した後にファイル名の候補となる文字列２６８があるため、制御部１１０は、ファイル名の候補となる文字列が存在すると判定する（ステップＳ１０４０においてＹＥＳ）。ファイル名の候補となる文字列は文字列２６８の１つだけであるため（ステップＳ１０５０においてＮＯ）、この文字列２６８がファイル名として決定される（ステップＳ１０８０）。 The control unit 110 excludes a general character string and a character string of a certain number of characters or more from the recognized character strings from file name candidates (step S1030). Since the character string 266 is fixed, it is assumed that it is registered in advance as a general character string. The character string 266 is excluded from file name candidates. The control unit 110 determines whether there is a character string that is a candidate for the file name after the character string 266 is excluded. Since there is a character string 268 that becomes a file name candidate after exclusion in the next area 272, the control unit 110 determines that there is a character string that becomes a file name candidate (YES in step S1040). Since there is only one character string 268 as a file name candidate (NO in step S1050), this character string 268 is determined as the file name (step S1080).

制御部１１０は、このようにして生成したファイル名を、読取った原稿画像データのファイル名に設定して、当該原稿画像データを記憶装置１１８に記憶させる。文字列２６８（「△□○に関して」）は、１枚目の原稿の件名であり、読取った原稿画像データの内容を示すものである。文字列２６８に基づいて生成されたファイル名を、原稿画像データのファイル名とすることにより、ファイル名から原稿画像データの内容を把握することが容易になる。 The control unit 110 sets the file name generated in this way as the file name of the read document image data, and stores the document image data in the storage device 118. A character string 268 (“with respect to Δ □ ○”) is the subject of the first original and indicates the content of the read original image data. By using the file name generated based on the character string 268 as the file name of the document image data, it becomes easy to grasp the contents of the document image data from the file name.

一方、システム設定画面２００のチェックボックス２３０にチェックが入れられていない場合、制御部１１０は、「ファイル名候補の精度を高める」設定がＯＮにされていないと判定する（図６に示すステップＳ１０９０においてＮＯ）。制御部１１０は、先頭の領域２７０において除外した文字列２６２及び文字列２６４をファイル名の候補に設定する（ステップＳ１１００）。制御部１１０は、ファイル名の候補が複数存在すると判定し（ステップＳ１０５０においてＹＥＳ）、「手動でファイル名を選択する」設定がＯＮにされているか否かを判定する。「手動でファイル名を選択する」設定がＯＮにされている場合、制御部１１０は、操作パネル１２２に選択画面を表示して、複数のファイル名の候補のなかからファイル名に用いる文字列をユーザに選択させる（ステップＳ１０７０）。すなわち、文字列２６２及び文字列２６４のどちらをファイル名にするかをユーザに選択させる。制御部１１０は、ユーザによって選択された文字列を原稿画像データのファイル名に決定し（ステップＳ１０８０）、そのファイル名で原稿画像データを記憶装置１１８に保存する。 On the other hand, when the check box 230 on the system setting screen 200 is not checked, the control unit 110 determines that the “increase the accuracy of file name candidates” setting is not turned on (step S1090 shown in FIG. 6). NO). The control unit 110 sets the character string 262 and the character string 264 excluded in the head area 270 as file name candidates (step S1100). Control unit 110 determines that there are a plurality of file name candidates (YES in step S1050), and determines whether the “manually select file name” setting is ON. When the “manually select file name” setting is ON, the control unit 110 displays a selection screen on the operation panel 122 and selects a character string to be used for the file name from among a plurality of file name candidates. The user is selected (step S1070). That is, the user is allowed to select which one of the character string 262 and the character string 264 is the file name. The control unit 110 determines the character string selected by the user as the file name of the document image data (step S1080), and stores the document image data in the storage device 118 with the file name.

「手動でファイル名を選択する」設定がＯＮにされていない場合（ステップＳ１０６０においてＮＯ）、すなわち、「自動でファイル名を選択する」設定がＯＮにされている場合、制御部１１０は、文字列２６２及び文字列２６４のうち、ファイルの最も先頭側（原稿画像の先頭側）に位置する文字列（この場合、文字列２６２）を原稿画像データのファイル名に決定する（ステップＳ１０８０）。除外された文字列が一定の文字数以上の文字列の場合、当該文字列がファイル名の候補に設定される。この文字列は、ファイル名に適用可能な文字数を超えているため、この文字列がファイル名とされる場合、文字列の先頭文字からファイル名に適用可能な文字数分がファイル名とされる。 When the “manually select file name” setting is not turned on (NO in step S1060), that is, when the “automatically select file name” setting is turned on, the control unit 110 displays the character Of the column 262 and the character string 264, the character string (in this case, the character string 262) located at the foremost side of the file (in this case, the character image 262) is determined as the file name of the document image data (step S1080). If the excluded character string is a character string having a certain number of characters or more, the character string is set as a file name candidate. Since this character string exceeds the number of characters applicable to the file name, when this character string is used as the file name, the number of characters applicable to the file name from the first character of the character string is set as the file name.

このように、システム設定画面２００のチェックボックス２３０にチェックが入れられているか否かに応じて、生成されるファイル名が異なる場合がある。チェックボックス２３０にチェックを入れることによって、原稿画像データの内容をより把握し易いファイル名が生成される。チェックボックス２３０にチェックが入れられていない場合、チェックボックス２３０にチェックが入れられている場合に比べて、ＯＣＲ処理にかかる時間がより短縮される。 Thus, the generated file name may differ depending on whether or not the check box 230 on the system setting screen 200 is checked. By checking the check box 230, a file name that makes it easier to grasp the contents of the document image data is generated. When the check box 230 is not checked, the time required for the OCR process is further shortened compared to when the check box 230 is checked.

なお、１枚目の原稿が白紙の場合（ステップＳ１０００においてＹＥＳ）、制御部１１０は、ＯＣＲ処理を実行せずに、「日時＋マシン固有情報」からなるファイル名を生成する（ステップＳ１１６０）。読取った原稿画像データは、生成されたファイル名で記憶装置１１８に保存される。 If the first document is blank (YES in step S1000), control unit 110 generates a file name composed of “date and time + machine specific information” without executing the OCR process (step S1160). The read document image data is stored in the storage device 118 with the generated file name.

さらに、１枚目の原稿の全領域に対するＯＣＲ処理が完了するまでに、ファイル名の候補となる文字列が抽出されなかった場合（ステップＳ１１２０においてＹＥＳ）、制御部１１０は、ファイル名のサブ候補となる文字列が追加されているか否かを判定する。サブ候補がない場合（ステップＳ１１４０においてＮＯ）、制御部１１０は、「日時＋マシン固有情報」からなるファイル名を生成して、生成したファイル名で読取った原稿画像データを記憶装置１１８に保存する。一方、サブ候補がある場合（ステップＳ１１４０においてＹＥＳ）、制御部１１０は、そのサブ候補をファイル名の候補に設定する（ステップＳ１１５０）。ファイル名の候補となる文字列が１つの場合（図５に示すステップＳ１０５０においてＮＯ）、その文字列が原稿画像データのファイル名に決定される（ステップＳ１０８０）。ファイル名の候補となる文字列が複数ある場合（ステップＳ１０５０においてＹＥＳ）、自動又は手動で選択された文字列が原稿画像データのファイル名に決定される（ステップＳ１０８０）。 Further, if a character string that is a candidate for the file name is not extracted by the time the OCR process is completed for the entire area of the first document (YES in step S1120), the control unit 110 causes the file name sub-candidate. It is determined whether or not a character string is added. If there is no sub-candidate (NO in step S1140), control unit 110 generates a file name composed of “date and time + machine specific information”, and stores document image data read with the generated file name in storage device 118. . On the other hand, if there is a sub candidate (YES in step S1140), control unit 110 sets the sub candidate as a file name candidate (step S1150). If there is one character string as a file name candidate (NO in step S1050 shown in FIG. 5), that character string is determined as the file name of the document image data (step S1080). If there are a plurality of character strings as file name candidates (YES in step S1050), the character string selected automatically or manually is determined as the file name of the document image data (step S1080).

原稿読取部１３０で読取られる原稿が図８に示される原稿であった場合、１枚目の原稿２８０における次の領域２８２には、一般的な文字列である文字列２６６を除外した後にファイル名の候補となる文字列（文字列２８４及び文字列２８６）が複数存在する。制御部１１０は、ファイル名の候補となる文字列が複数存在すると判定する（ステップＳ１０４０においてＹＥＳ、且つ、ステップＳ１０５０においてＹＥＳ）。「手動でファイル名を選択する」設定がＯＮにされている場合、制御部１１０は、操作パネル１２２に選択画面３００（図９参照）を表示して、複数のファイル名の候補のなかからファイル名に用いる文字列をユーザに選択させる（ステップＳ１０７０）。 When the original read by the original reading unit 130 is the original shown in FIG. 8, the file name is stored in the next area 282 of the first original 280 after the character string 266 that is a general character string is excluded. There are a plurality of candidate character strings (character string 284 and character string 286). Control unit 110 determines that there are a plurality of character strings as file name candidates (YES in step S1040 and YES in step S1050). When the “manually select file name” setting is set to ON, the control unit 110 displays a selection screen 300 (see FIG. 9) on the operation panel 122 and selects a file from a plurality of file name candidates. The user is allowed to select a character string used for the name (step S1070).

図９を参照して、選択画面３００は、文字列２８４を選択するための選択キー３１０、文字列２８６を選択するための選択キー３２０、及びファイル名を手動で入力する場合に操作されるキー３３０を含む。キー３３０が操作されると、ファイル名の入力画面に画面表示が遷移する。ユーザによって選択キー３１０又は選択キー３２０が操作されると、キー操作に応じて文字列が選択され、選択された文字列が原稿画像データのファイル名に決定される（ステップＳ１０８０）。 Referring to FIG. 9, selection screen 300 includes a selection key 310 for selecting character string 284, a selection key 320 for selecting character string 286, and keys operated when a file name is manually input. 330 is included. When the key 330 is operated, the screen display changes to a file name input screen. When the user operates the selection key 310 or the selection key 320, a character string is selected according to the key operation, and the selected character string is determined as the file name of the document image data (step S1080).

スキャン送信モードでは、上記のようにして生成されたファイル名が読取った原稿画像データのファイル名として設定されて記憶装置１１８に一旦記憶される。記憶されたファイル（原稿画像データ）は、ＮＩＣ１７０及びネットワーク５０を介して、外部機器に送信される。 In the scan transmission mode, the file name generated as described above is set as the file name of the read document image data and temporarily stored in the storage device 118. The stored file (original image data) is transmitted to an external device via the NIC 170 and the network 50.

［本実施の形態の効果］
以上の説明から明らかなように、本実施の形態に係る画像処理装置１００を利用することにより、以下に述べる効果を奏する。 [Effects of the present embodiment]
As is clear from the above description, the following effects can be obtained by using the image processing apparatus 100 according to the present embodiment.

画像処理装置１００は、原稿画像の一部の領域に対応する画像データに対してＯＣＲ処理を行なうことにより、一度に原稿画像の全領域に対してＯＣＲ処理を行なう場合に比べて、ＯＣＲ処理にかかる時間を短縮できる。そのため、生成したファイル名を用いて原稿画像データを記憶する場合であっても、当該原稿画像データが記憶されるまでの時間を短縮できる。これにより、ユーザの使い勝手が低下するのを抑制できる。読取った原稿画像データに付されるファイル名は自動で生成されるため、ファイル名を手動で入力する手間を省くことができる。さらに、ファイル名は原稿画像内の文字列に基づいて生成されるため、原稿画像データに付されるファイル名から当該原稿画像データ（ファイル）の内容が把握し易くなる。ファイルの内容を把握するためにファイルを開く、といった操作を行なう必要性が低減されるため、これによっても、ユーザの手間を省くことができる。 The image processing apparatus 100 performs OCR processing by performing OCR processing on image data corresponding to a partial area of the document image, compared to performing OCR processing on all areas of the document image at once. This time can be shortened. Therefore, even when document image data is stored using the generated file name, the time until the document image data is stored can be shortened. Thereby, it can suppress that a user's usability falls. Since the file name attached to the read document image data is automatically generated, it is possible to save the trouble of manually inputting the file name. Further, since the file name is generated based on the character string in the document image, it is easy to grasp the contents of the document image data (file) from the file name attached to the document image data. Since the necessity of performing an operation such as opening a file in order to grasp the contents of the file is reduced, this also saves the user's trouble.

画像処理装置１００はさらに、ファイル名の候補となる文字列として、所定の空白領域によって挟まれた文字群からなる文字列を認識する。こうした文字列が認識されない場合、画像処理装置１００は、原稿画像の他の一部の領域に対応する画像データに対して文字認識処理を行なう。こうした文字列が認識された場合、画像処理装置１００は、認識された文字列に基づいて原稿画像データのファイル名を生成する。さらに、ＯＣＲ処理は、原稿画像の先頭から後尾に向かって所定の大きさの領域単位で順に、各領域に対応する画像データに対して行なわれる。これにより、原稿画像データの内容を考慮したファイル名を生成し易くなる。したがって、原稿画像データに付されるファイル名から当該原稿画像データ（ファイル）の内容を一層把握し易くなる。 The image processing apparatus 100 further recognizes a character string made up of a character group sandwiched between predetermined blank areas as a character string that becomes a file name candidate. When such a character string is not recognized, the image processing apparatus 100 performs a character recognition process on image data corresponding to another part of the original image. When such a character string is recognized, the image processing apparatus 100 generates a file name of the document image data based on the recognized character string. Further, the OCR process is performed on the image data corresponding to each area in order of a predetermined size from the beginning to the tail of the document image. This makes it easy to generate a file name that takes into account the contents of the document image data. Therefore, it becomes easier to grasp the contents of the document image data (file) from the file name attached to the document image data.

さらに、予め設定された一般的な文字列、及び予め定められた文字数以上の文字列をファイル名の候補から除外することにより、原稿の内容を示す件名、表題、又は見出し等の文字列をファイル名の候補として抽出し易くなる。これにより、原稿画像データの内容を考慮したファイル名をより一層生成し易くなる。原稿の内容を示す件名、表題、又は見出し等の文字列は、原稿の比較的上側（先頭側）に記載されるのが一般的である。そのため、こうした文字列は、原稿画像の全領域に対してＯＣＲ処理を行なうことなく抽出することができるので、ＯＣＲ処理にかかる時間を効果的に短縮できる。 Further, by excluding general character strings set in advance and character strings exceeding the predetermined number of characters from the file name candidates, a character string such as a subject, title, or heading indicating the contents of the document is filed. It becomes easy to extract as a name candidate. This makes it easier to generate a file name considering the content of the document image data. A character string such as a subject, title, or headline indicating the content of a document is generally written on the relatively upper side (first side) of the document. Therefore, such a character string can be extracted without performing the OCR process on the entire area of the document image, so that the time required for the OCR process can be effectively shortened.

（第２の実施の形態）
本実施の形態に係る画像処理装置は、複数枚の原稿の画像を連続して読取る場合に、ファイル名の候補となる文字列が抽出されるまで、２枚目以降の原稿の画像データに対してもＯＣＲ処理を行なう点において、第１の実施の形態に係る画像処理装置１００とは異なる。その他の点では、各画像処理装置は同一の構成である。 (Second Embodiment)
When the image processing apparatus according to the present embodiment continuously reads an image of a plurality of originals, the image processing apparatus performs processing on image data of the second and subsequent originals until a character string as a file name candidate is extracted. However, it differs from the image processing apparatus 100 according to the first embodiment in that OCR processing is performed. In other respects, each image processing apparatus has the same configuration.

［ソフトウェア構成］
本実施の形態に係る画像処理装置では、図５及び図６に示されるプログラムに代えて、図１０及び図１１に示されるプログラムが実行される。図１０及び図１１のプログラムは、図５及び図６のステップＳ１０００、ステップＳ１０１０、ステップＳ１１２０及びステップＳ１１３０に代えて、ステップＳ２０００〜ステップＳ２０６０を含む。図１０及び図１１のステップＳ１０２０〜ステップＳ１１１０、及びステップＳ１１４０〜ステップＳ１１６０における処理は、図５及び図６に示される各ステップにおける処理と同じである。以下、異なる部分について説明する。 Software configuration
In the image processing apparatus according to the present embodiment, the program shown in FIGS. 10 and 11 is executed instead of the program shown in FIGS. The programs in FIGS. 10 and 11 include steps S2000 to S2060 instead of steps S1000, S1010, S1120, and S1130 in FIGS. The processes in steps S1020 to S1110 and steps S1140 to S1160 in FIGS. 10 and 11 are the same as the processes in each step shown in FIGS. Hereinafter, different parts will be described.

図１０を参照して、このプログラムは、読取った原稿が何枚目の原稿であるかを示す変数ｎに「１」を代入するステップＳ２０００と、ステップＳ２０００の後に実行され、読取った原稿画像データのデータサイズに基づいて、ｎ枚目の原稿が白紙か否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ２０１０と、ステップＳ２０１０において、ｎ枚目の原稿が白紙ではないと判定された場合に実行され、ｎ枚目の原稿の先頭領域に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を実行するステップＳ２０２０とを含む。ステップＳ２０２０の処理が終了すると、制御はステップＳ１０２０に進む。 Referring to FIG. 10, this program is executed after step S2000 for substituting “1” for variable n indicating the number of originals read, and read original image data. Based on the data size of step S2010, it is determined whether or not the nth document is a blank sheet, and the flow of control is branched according to the determination result. In step S2010, the nth document is not a blank sheet. Step S2020, which is executed when it is determined, reads image data corresponding to the leading area of the nth document and executes OCR processing on the image data. When the process of step S2020 ends, control proceeds to step S1020.

図１１を参照して、このプログラムはさらに、ステップＳ１１１０の後に実行され、ｎ枚目の原稿画像データの読込みが完了したか否か、すなわち、ｎ枚目の原稿画像の全領域に対してＯＣＲ処理が完了したか否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ２０３０と、ステップＳ２０３０において、ｎ枚目の原稿画像データの読込みが完了していないと判定された場合に実行され、ｎ枚目の原稿画像の次の領域に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を実行するステップＳ２０４０とを含む。ステップＳ２０３０において、ｎ枚目の原稿画像データの読込みが完了したと判定された場合は、制御はステップＳ１１４０に進む。ステップＳ２０４０の処理が終了すると、制御は、図１０に示すステップＳ１０２０に戻る。 Referring to FIG. 11, this program is further executed after step S1110, and whether or not reading of the nth original image data is completed, that is, OCR is performed on the entire area of the nth original image. It is determined whether or not the processing is completed, and the flow of control is branched according to the determination result, and when it is determined in step S2030 that the reading of the nth original image data is not completed. Step S2040 that is executed, reads image data corresponding to the next area of the nth original image, and executes OCR processing on the image data. If it is determined in step S2030 that reading of the nth document image data has been completed, control proceeds to step S1140. When the process of step S2040 ends, control returns to step S1020 shown in FIG.

このプログラムはさらに、ステップＳ１１４０において、サブ候補がないと判定された場合、又は、ステップＳ２０１０（図１０参照）において、ｎ枚目の原稿が白紙であると判定された場合に実行され、ｎ枚目の原稿は最終ページの原稿か否かを判定し、判定結果に応じて制御の流れを分岐させるステップＳ２０５０と、ステップＳ２０５０において、ｎ枚目の原稿は最終ページの原稿ではないと判定された場合に実行され、変数ｎの値を１だけ増加させるステップＳ２０６０とを含む。ステップＳ２０５０において、ｎ枚目の原稿は最終ページの原稿であると判定された場合は、制御はステップＳ１１６０に進む。ステップＳ２０６０の処理が終了すると、制御は、図１０に示すステップＳ２０１０に戻る。 This program is further executed when it is determined in step S1140 that there is no sub-candidate, or when it is determined in step S2010 (see FIG. 10) that the nth document is blank, and n sheets. It is determined whether or not the eye document is the document of the last page, and the flow of control is branched according to the determination result. In steps S2050 and S2050, it is determined that the nth document is not the document of the last page. Step S2060, which is executed if the variable n is incremented by one. If it is determined in step S2050 that the nth document is the last page, control proceeds to step S1160. When the process of step S2060 ends, control returns to step S2010 shown in FIG.

［動作］
本実施の形態に係る画像処理装置は以下のように動作する。なお、２枚目以降の原稿の画像データに対してもＯＣＲ処理を行なう動作を除いた動作は、上記第１の実施の形態と同様である。したがって、同様の動作についての詳細な説明は繰返さない。 [Operation]
The image processing apparatus according to the present embodiment operates as follows. Note that operations other than the operation of performing OCR processing on the image data of the second and subsequent originals are the same as those in the first embodiment. Therefore, detailed description of similar operations will not be repeated.

画像処理装置は、１枚又は複数枚の原稿の画像を読取り、読取った原稿画像データのファイル名を自動で生成する。複数枚の原稿の画像を読取る場合、画像処理装置は、画像を読取る処理と並行して、ファイル名の生成処理を実行する。 The image processing apparatus reads an image of one or a plurality of originals and automatically generates a file name of the read original image data. When reading an image of a plurality of documents, the image processing apparatus executes a file name generation process in parallel with the image reading process.

画像処理装置は、読取った原稿画像データのデータサイズに基づいて、１枚目の原稿が白紙か否かを判定する。１枚目の原稿が白紙の場合（ステップＳ２０１０においてＹＥＳ）、１枚目の原稿が最終ページの原稿か否かが判定される。１枚目の原稿が最終ページの原稿ではない場合（図１１に示すステップＳ２０５０においてＮＯ）、画像処理装置は、２枚目の原稿が白紙か否かを判定する（ステップＳ２０６０、及び図１０に示すステップＳ２０１０）。このように、読取った原稿に白紙の原稿が含まれる場合、画像処理装置は、白紙の原稿をスキップする。ｎ枚目の原稿が白紙ではないと判定されると（ステップＳ２０１０においてＮＯ）、画像処理装置は、ｎ枚目の原稿の先頭の領域に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を行なう（ステップＳ２０２０）。ＯＣＲ処理によって文字列が認識されると（ステップＳ１０２０においてＹＥＳ）、画像処理装置は、認識された文字列から一般的な文字列及び一定の文字数以上の文字列をファイル名の候補から除外する（ステップＳ１０３０）。一般的な文字列及び一定の文字数以上の文字列を除外した後にファイル名の候補となる文字列がある場合（ステップＳ１０４０においてＹＥＳ）、画像処理装置は、ファイル名の候補となる文字列に基づいてファイル名を決定（生成）する（ステップＳ１０８０）。 The image processing apparatus determines whether or not the first document is blank based on the data size of the read document image data. If the first document is blank (YES in step S2010), it is determined whether or not the first document is the last page. If the first document is not the last page document (NO in step S2050 shown in FIG. 11), the image processing apparatus determines whether the second document is blank (step S2060 and FIG. 10). Step S2010). As described above, when the read document includes a blank document, the image processing apparatus skips the blank document. If it is determined that the nth document is not blank (NO in step S2010), the image processing apparatus reads image data corresponding to the top area of the nth document and performs OCR on the image data. Processing is performed (step S2020). When the character string is recognized by the OCR process (YES in step S1020), the image processing apparatus excludes a general character string and a character string having a certain number of characters or more from the recognized character string from file name candidates ( Step S1030). When there is a character string that becomes a candidate for a file name after excluding a general character string and a character string that exceeds a certain number of characters (YES in step S1040), the image processing apparatus is based on the character string that is a candidate for a file name. The file name is determined (generated) (step S1080).

一方、一般的な文字列及び一定の文字数以上の文字列を除外した後にファイル名の候補となる文字列がない場合（ステップＳ１０４０においてＮＯ）、画像処理装置は、「ファイル名候補の精度を高める」設定がＯＮにされているか否かを判定する。ＯＮにされている場合、画像処理装置は、除外した文字列をファイル名のサブ候補に追加して、ｎ枚目の原稿画像データの読込みが完了したか否かを判定する。なお、ＯＣＲ処理によって文字列が認識されなかった場合（ステップＳ１０２０においてＮＯ）も、画像処理装置は、ｎ枚目の原稿画像データの読込みが完了したか否かを判定する。ｎ枚目の原稿画像データの読込みが完了していない場合（ステップＳ２０３０においてＮＯ）、画像処理装置は、ｎ枚目の原稿の次の領域に対応する画像データを読込み、当該画像データに対してＯＣＲ処理を行なう（ステップＳ２０４０）。 On the other hand, if there is no character string that is a candidate for a file name after excluding a general character string and a character string that exceeds a certain number of characters (NO in step S1040), the image processing apparatus may It is determined whether or not the setting is ON. When it is ON, the image processing apparatus adds the excluded character string to the file name sub-candidate and determines whether or not reading of the nth original image data is completed. Even when the character string is not recognized by the OCR processing (NO in step S1020), the image processing apparatus determines whether reading of the nth original image data is completed. If the reading of the nth original image data has not been completed (NO in step S2030), the image processing apparatus reads image data corresponding to the next area of the nth original and reads the image data. OCR processing is performed (step S2040).

ｎ枚目の原稿画像データの読込みが完了している場合（ステップＳ２０３０においてＹＥＳ）、すなわち、ｎ枚目の原稿の全領域に対するＯＣＲ処理が完了するまでに、ファイル名の候補となる文字列が抽出されなかった場合、画像処理装置は、ファイル名のサブ候補となる文字列が追加されているか否かを判定する。サブ候補がある場合（ステップＳ１１４０においてＹＥＳ）、画像処理装置は、そのサブ候補をファイル名の候補に設定する（ステップＳ１１５０）。サブ候補がない場合（ステップＳ１１４０においてＮＯ）、画像処理装置は、ｎ枚目の原稿が最終ページの原稿であるか否かを判定する。ｎ枚目の原稿が最終ページの原稿ではない場合（ステップＳ２０５０においてＮＯ）、次の原稿（ｎ＋１枚目の原稿）に対して上記した処理を繰返す。ｎ枚目の原稿が最終ページの原稿である場合（ステップＳ２０５０においてＹＥＳ）、画像処理装置は、「日時＋マシン固有情報」からなるファイル名を生成して（ステップＳ１１６０）、生成したファイル名で読取った原稿画像データを記憶装置１１８に保存する。 If the reading of the nth original image data has been completed (YES in step S2030), that is, until the OCR processing for the entire area of the nth original is completed, a character string that is a file name candidate is displayed. If not extracted, the image processing apparatus determines whether or not a character string as a sub candidate of the file name has been added. If there is a sub candidate (YES in step S1140), the image processing apparatus sets the sub candidate as a file name candidate (step S1150). If there is no sub candidate (NO in step S1140), the image processing apparatus determines whether or not the nth document is the last page. If the nth document is not the last page document (NO in step S2050), the above process is repeated for the next document (n + 1th document). If the nth document is the document of the last page (YES in step S2050), the image processing apparatus generates a file name consisting of “date and time + machine specific information” (step S1160), and uses the generated file name. The read document image data is stored in the storage device 118.

このように、本実施の形態に係る画像処理装置は、白紙の原稿を除き、ファイル名の候補となる文字列が抽出されるまで、原稿画像の先頭領域から所定の領域単位で順に各領域に対応する画像データに対してＯＣＲ処理を行なう。そのため、例えば１枚目の原稿からファイル名の候補となる文字列が抽出されなかった場合でも、２枚目以降の原稿からファイル名の候補となる文字列を抽出することが可能となる。これにより、読取った原稿画像データの内容を考慮したファイル名を効果的に生成できる。 As described above, the image processing apparatus according to the present embodiment sequentially removes a blank document from the first region of the document image in units of a predetermined region until each character string as a file name candidate is extracted. OCR processing is performed on the corresponding image data. Therefore, for example, even if a character string that is a candidate for a file name is not extracted from the first document, it is possible to extract a character string that is a candidate for a file name from the second and subsequent documents. This makes it possible to effectively generate a file name that takes into account the content of the read document image data.

（第３の実施の形態）
本実施の形態に係る画像処理装置は、原稿のセット方向を変更するよう促す通知画面を操作パネルに表示する点において、第１の実施の形態に係る画像処理装置１００とは異なる。その他の点では、各画像処理装置は同一の構成である。 (Third embodiment)
The image processing apparatus according to the present embodiment is different from the image processing apparatus 100 according to the first embodiment in that a notification screen that prompts the user to change the document setting direction is displayed on the operation panel. In other respects, each image processing apparatus has the same configuration.

本実施の形態では、画像処理装置は、操作パネルに原稿選択画面４００（図１２参照）を表示して、原稿読取部で読取る原稿がどういった原稿であるかを予めユーザに選択させる。ユーザは、原稿選択画面４００を介して、スキャンする原稿が横書きの原稿か縦書きの原稿かを入力する。図１２を参照して、原稿選択画面４００は、スキャンする原稿の種類を選択するための複数の選択キーを含む。複数の選択キーは、横書きの原稿を選択するための「横書（１）」キー４１０、２ページを１枚に集約した横書きの原稿を選択するための「横書（２）」キー４１２、縦書きの原稿を選択するための「縦書（１）」キー４１４、及び２ページを１枚に集約した縦書きの原稿を選択するための「縦書（２）」キー４１６を含む。 In the present embodiment, the image processing apparatus displays a document selection screen 400 (see FIG. 12) on the operation panel to allow the user to select in advance what kind of document is read by the document reading unit. The user inputs via the document selection screen 400 whether the document to be scanned is a horizontally written document or a vertically written document. Referring to FIG. 12, document selection screen 400 includes a plurality of selection keys for selecting the type of document to be scanned. A plurality of selection keys include a “horizontal writing (1)” key 410 for selecting a horizontally written document, a “horizontal writing (2)” key 412 for selecting a horizontally written document in which two pages are combined into one sheet, A “vertical writing (1)” key 414 for selecting a vertically written document and a “vertical writing (2)” key 416 for selecting a vertically written document in which two pages are combined into one sheet are included.

画像処理装置は、読取る原稿がセットされたときに、原稿のセット方向を検出するセンサ（図示せず。）を含む。画像処理装置は、検出したセット方向に基づいて原稿のスキャン方向を決定する。画像処理装置はさらに、原稿選択画面４００を介して選択された原稿の種類、及びセットされた原稿のスキャン方向に基づいて、ＯＣＲ処理を行なうタイミングを決定する。 The image processing apparatus includes a sensor (not shown) that detects a setting direction of a document when a document to be read is set. The image processing apparatus determines the scanning direction of the document based on the detected setting direction. The image processing apparatus further determines the timing for performing the OCR processing based on the type of document selected via the document selection screen 400 and the scan direction of the set document.

図１３を参照して、原稿選択画面４００（図１２参照）において「横書（１）」キー４１０が操作され、長手方向にスキャンするよう原稿がセットされている場合（ａ１の場合）、又は「横書（２）」キー４１２が操作され、短手方向にスキャンするよう原稿がセットされている場合（ｄ１の場合）、１枚分の原稿画像データの読取りが完了する前に領域４２０又は領域４２６に対してＯＣＲ処理を行なうことが可能である。したがって、これらの場合は、１枚分の原稿画像データの読取りが完了する前にＯＣＲ処理を行なうよう設定される。 Referring to FIG. 13, when “horizontal writing (1)” key 410 is operated on original selection screen 400 (see FIG. 12) and an original is set to be scanned in the longitudinal direction (in the case of a1), or When the “horizontal writing (2)” key 412 is operated and an original is set to be scanned in the short direction (in the case of d1), the reading of the original image data for one sheet is completed before reading the area 420 or OCR processing can be performed on the region 426. Therefore, in these cases, the OCR processing is set to be performed before the reading of the original image data for one sheet is completed.

一方、原稿選択画面４００（図１２参照）において「横書（１）」キー４１０が操作され、短手方向にスキャンするよう原稿がセットされている場合（ｂ１の場合）、又は「横書（２）」キー４１２が操作され、長手方向にスキャンするよう原稿がセットされている場合（ｃ１の場合）、１枚分の原稿画像データの読取りが完了する前に領域４２２又は領域４２４に対してＯＣＲ処理を行なうと文字列の認識が困難になる。したがって、これらの場合は、１枚分の原稿画像データの読取りが完了した後にＯＣＲ処理を行なうよう設定される。 On the other hand, when the “horizontal writing (1)” key 410 is operated on the original selection screen 400 (see FIG. 12) and the original is set to scan in the short direction (in the case of b1), 2) ”key 412 is operated and the original is set to be scanned in the longitudinal direction (in the case of c1), the area 422 or the area 424 is read before the reading of the original image data for one sheet is completed. When the OCR process is performed, it is difficult to recognize the character string. Therefore, in these cases, the OCR process is set to be performed after the reading of the document image data for one sheet is completed.

原稿選択画面４００において「縦書（１）」キー４１４、又は「縦書（２）」キー４１６が操作された場合も、上記と同様である。図１４を参照して、原稿選択画面４００において「縦書（１）」キー４１４が操作され、長手方向にスキャンするよう原稿がセットされている場合（ａ２の場合）、又は「縦書（２）」キー４１６が操作され、短手方向にスキャンするよう原稿がセットされている場合（ｄ２の場合）、１枚分の原稿画像データの読取りが完了する前に領域４３０又は領域４３２に対してＯＣＲ処理を行なうことが可能である。したがって、これらの場合は、１枚分の原稿画像データの読取りが完了する前にＯＣＲ処理を行なうよう設定される。 The same applies to the case where the “vertical writing (1)” key 414 or the “vertical writing (2)” key 416 is operated on the document selection screen 400. Referring to FIG. 14, when “vertical writing (1)” key 414 is operated on original selection screen 400 and the original is set to be scanned in the longitudinal direction (case a2), or “vertical writing (2) ) ”Key 416 is operated and the document is set to scan in the short direction (in the case of d2), the reading of the image data of one sheet is completed with respect to the area 430 or the area 432 OCR processing can be performed. Therefore, in these cases, the OCR processing is set to be performed before the reading of the original image data for one sheet is completed.

原稿選択画面４００において「縦書（１）」キー４１４が操作され、短手方向にスキャンするよう原稿がセットされている場合（ｂ２の場合）、又は「縦書（２）」キー４１６が操作され、長手方向にスキャンするよう原稿がセットされている場合（ｃ２の場合）、１枚分の原稿画像データの読取りが完了する前に領域４３２又は領域４３４に対してＯＣＲ処理を行なうと文字列の認識が困難になる。したがって、これらの場合は、１枚分の原稿画像データの読取りが完了した後にＯＣＲ処理を行なうよう設定される。 When the “vertical writing (1)” key 414 is operated on the original selection screen 400 and the original is set to scan in the short direction (in the case of b2), or the “vertical writing (2)” key 416 is operated. When the original is set to be scanned in the longitudinal direction (in the case of c2), if the OCR process is performed on the area 432 or the area 434 before the reading of the original image data for one sheet is completed, the character string It becomes difficult to recognize. Therefore, in these cases, the OCR process is set to be performed after the reading of the document image data for one sheet is completed.

１枚分の原稿画像データの読取りが完了する前にＯＣＲ処理を行なうことにより、ファイル名の生成処理をより迅速に行なうことが可能となる。画像処理装置は、１枚分の原稿画像データの読取りが完了した後にＯＣＲ処理を行なうよう設定した場合、操作パネルに通知画面５００（図１５参照）を表示して、ユーザに対して、原稿のセット方向を変更するよう促す。図１５を参照して、通知画面５００は、例えば「原稿のセット方向を９０°回転させることにより、ファイル名の自動生成処理が速くなります。」とのメッセージを表示する表示領域５１０、このまま処理を続ける場合に操作される「ＹＥＳ」キー５２０、及び原稿のセット方向を変更する場合に操作される「ＮＯ」キー５３０を含む。 By performing the OCR process before the reading of one document image data is completed, the file name generation process can be performed more quickly. When the image processing apparatus is set to perform the OCR process after the reading of one sheet of document image data is completed, the image processing apparatus displays a notification screen 500 (see FIG. 15) on the operation panel to inform the user of the document. Prompt to change the set direction. Referring to FIG. 15, the notification screen 500 displays, for example, a display area 510 that displays a message “The file name automatic generation processing is accelerated by rotating the document setting direction by 90 °”, and the processing is continued. “YES” key 520 that is operated when the document is continued, and “NO” key 530 that is operated when the document setting direction is changed.

「ＹＥＳ」キー５２０が操作されると、画像処理装置は、１枚分の原稿画像データの読取りが完了した後にＯＣＲ処理を行なう設定を維持する。一方、「ＮＯ」キー５３０が操作されると、画像処理装置は原稿のセット方向が変更されたか否かを判定する。原稿のセット方向が変更されると、画像処理装置は、１枚分の原稿画像データの読取りが完了する前にＯＣＲ処理を行なう設定に設定を変更する。 When “YES” key 520 is operated, the image processing apparatus maintains a setting for performing OCR processing after reading of one sheet of document image data is completed. On the other hand, when the “NO” key 530 is operated, the image processing apparatus determines whether or not the document setting direction has been changed. When the document setting direction is changed, the image processing apparatus changes the setting to a setting for performing OCR processing before reading of one sheet of document image data is completed.

ユーザによって原稿画像の読取開始の指示が行なわれると、画像処理装置は、原稿読取部を制御して、セットされている原稿から原稿画像データを読取る。１枚分の原稿画像データの読取りが完了する前にＯＣＲ処理を行なうよう設定されている場合、画像処理装置は、原稿画像の先頭の領域の画像データを読取ると、その画像データに対してＯＣＲ処理を行なう。１枚分の原稿画像データの読取りが完了した後にＯＣＲ処理を行なうよう設定されている場合、画像処理装置は、読取った原稿画像データから原稿画像の先頭の領域に対応する画像データを抽出し、その画像データに対してＯＣＲ処理を行なう。 When the user gives an instruction to start reading a document image, the image processing apparatus controls the document reading unit to read the document image data from the set document. When the OCR processing is set to be performed before the reading of one document image data is completed, when the image processing apparatus reads the image data of the first area of the document image, the image data is subjected to OCR. Perform processing. When the OCR processing is set to be performed after the reading of one sheet of document image data is completed, the image processing apparatus extracts image data corresponding to the first area of the document image from the read document image data, OCR processing is performed on the image data.

このように、本実施の形態では、ユーザに対して、原稿のセット方向を変更するように促すことによって、ファイル名の生成処理をより迅速に行なうことが可能となる。 As described above, according to the present embodiment, by prompting the user to change the document setting direction, the file name generation process can be performed more quickly.

（変形例）
上記実施の形態では、画像処理装置の１種である複合機に本発明を適用した例について示したが、本発明はそのような実施の形態には限定されない。原稿の画像を原稿画像データとして読取る画像読取機能を有していれば、画像処理装置は複合機以外の例えばスキャナ装置等であってもよい。 (Modification)
In the above-described embodiment, an example in which the present invention is applied to a multifunction peripheral that is one type of image processing apparatus has been described, but the present invention is not limited to such an embodiment. As long as it has an image reading function for reading an image of a document as document image data, the image processing device may be, for example, a scanner device other than a multifunction peripheral.

上記実施の形態では、画像データに対するＯＣＲ処理を制御部で行なう例について示したが、本発明はそのような実施の形態には限定されない。例えば画像処理部にＯＣＲ処理部を設け、ＯＣＲ処理部でＯＣＲ処理を行なうようにしてもよい。 In the above embodiment, an example in which the OCR process for image data is performed by the control unit has been described. However, the present invention is not limited to such an embodiment. For example, an OCR processing unit may be provided in the image processing unit, and the OCR processing unit may perform OCR processing.

上記実施の形態では、読取った原稿画像データを内部の記憶装置に記憶させる例について示したが、本発明はそのような実施の形態には限定されない。例えば、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリのような着脱可能な外部記憶媒体を画像処理装置に装着し、装着された外部記憶媒体に読取った原稿画像データを記憶させるようにしてもよい。 In the above-described embodiment, an example in which the read document image data is stored in the internal storage device has been described. However, the present invention is not limited to such an embodiment. For example, a removable external storage medium such as a USB (Universal Serial Bus) memory may be mounted on the image processing apparatus, and the read document image data may be stored in the mounted external storage medium.

上記実施の形態では、ファイル名の候補から除外される一定文字数の文字列として、ファイル名に適用可能な文字数を超える文字列の例について示したが、本発明はそのような実施の形態には限定されない。一定文字数の文字列は、ファイル名に適用可能な文字数の文字列であってもよい。当該文字列の文字数は、任意に設定可能とすることもできる。 In the above embodiment, an example of a character string that exceeds the number of characters applicable to the file name as a character string of a certain number of characters excluded from the file name candidates has been shown, but the present invention is not limited to such an embodiment. It is not limited. The character string having a certain number of characters may be a character string having the number of characters applicable to the file name. The number of characters in the character string can be arbitrarily set.

上記第１の実施の形態では、１枚目の原稿が白紙の場合に、「日時＋マシン固有情報」からなるファイル名を生成する例について示したが、本発明はそのような実施の形態には限定されない。複数枚の原稿を連続して読取った場合に、１枚目の原稿が白紙であっても、２枚目以降の白紙でない原稿に対してＯＣＲ処理を行なうことでファイル名を生成するようにしてもよい。すなわち、白紙の原稿を除く１枚の原稿に対してＯＣＲ処理によるファイル名の生成処理を行なうようにしてもよい。 In the first embodiment, an example of generating a file name composed of “date and time + machine specific information” when the first document is blank has been described. However, the present invention includes such an embodiment. Is not limited. When a plurality of originals are continuously read, even if the first original is a blank sheet, a file name is generated by performing OCR processing on the second and subsequent non-blank originals. Also good. That is, file name generation processing by OCR processing may be performed on a single document excluding a blank document.

上記で開示された技術を適宜組合せて得られる実施の形態についても、本発明の技術的範囲に含まれる。 Embodiments obtained by appropriately combining the techniques disclosed above are also included in the technical scope of the present invention.

今回開示された実施の形態は単に例示であって、本発明が上記した実施の形態のみに限定されるわけではない。本発明の範囲は、発明の詳細な説明の記載を参酌した上で、特許請求の範囲の各請求項によって示され、そこに記載された文言と均等の意味及び範囲内での全ての変更を含む。 The embodiment disclosed herein is merely an example, and the present invention is not limited to the embodiment described above. The scope of the present invention is indicated by each claim of the claims after taking into account the description of the detailed description of the invention, and all modifications within the meaning and scope equivalent to the wording described therein are included. Including.

１００画像処理装置
１１０制御部
１１２ＣＰＵ
１１４ＲＯＭ
１１６ＲＡＭ
１１８記憶装置
１２０操作ユニット
１３０原稿読取部
１４０画像処理部
１５０画像形成部
１６０給紙部
２００システム設定画面
３００選択画面
４００原稿選択画面
５００通知画面

DESCRIPTION OF SYMBOLS 100 Image processing apparatus 110 Control part 112 CPU
114 ROM
116 RAM
118 Storage Device 120 Operation Unit 130 Document Reading Unit 140 Image Processing Unit 150 Image Forming Unit 160 Paper Feeding Unit 200 System Setting Screen 300 Selection Screen 400 Document Selection Screen 500 Notification Screen

Claims

A document reading means for reading a document image as document image data;
Character recognition means for performing character recognition processing on image data corresponding to a partial area of the original image among original image data read by the original reading means;
Depending on the setting direction of the document and the arrangement direction of the characters on the document, the character recognition process by the character recognition unit is performed after the reading of the document image data by the document reading unit is completed, or before the completion. Setting means for setting whether to perform;
A notification means for notifying the user in response to the setting means being set to perform the character recognition processing by the character recognition means after the reading of the document image data by the document reading means is completed;
Generating means for generating a file name of the document image data based on a character string recognized by the character recognition means;
An image processing apparatus comprising: storage means for storing original image data read by the original reading means using a file name generated by the generating means.

In response to the setting means having been set to perform character recognition processing by the character recognition means after the reading of the original image data by the original reading means has been completed by the setting means, The image processing apparatus according to claim 1, wherein notification is made to change the set direction of the image.

The image processing apparatus further includes:
A determination means for determining whether or not a character string sandwiched between predetermined blank areas has been recognized by the character recognition process as a character string as a file name candidate;
Recognition for controlling the character recognition means to perform character recognition processing on image data corresponding to another part of the original image in response to a negative determination result of the determination means. Processing control means,
The generation unit includes a file name generation unit for generating a file name of the document image data based on a recognized character string in response to a positive determination result of the determination unit. The image processing apparatus according to claim 1 or 2 .

The recognition processing control unit controls the character recognition unit to perform character recognition processing on image data corresponding to each area in order of a predetermined size from the beginning to the tail of the document image. The image processing apparatus according to claim 3 .

The determination means excludes a character string that is a preset character string and a predetermined number of characters from the file name candidates, and the character string that is the file name candidate is recognized by the character recognition process. or it contains the string determination means for determining the image processing apparatus according to claim 3 or claim 4.

The image processing apparatus further includes a setting unit configured to set the excluded character string as a file name candidate in response to the determination result of the character string determination unit being negative.
The generating means further comprises a hand stage for generating a file name of the original image data based on the set character string by said setting means, the image processing apparatus according to claim 5.

The image processing apparatus further includes:
In response to the determination result of the determination means being affirmative, a candidate number determination means for determining whether or not a plurality of character strings as file name candidates have been recognized;
In response to the determination result of the candidate number determination means being affirmative, a selection means for causing the user to select one of a plurality of character strings that are file name candidates,
The generating means is based on a recognized character string or a character string selected by the user via the selecting means among a plurality of recognized character strings according to the determination result of the candidate number determining means. generates a file name of the original image data, the image processing apparatus according to any one of claims 3 to 6.

The image processing apparatus further includes:
In response to the determination result of the character string determination means being negative, candidate number determination means for determining whether or not a plurality of character strings are set as file name candidates;
In response to the determination result of the candidate number determination means being affirmative, a selection means for causing the user to select one of a plurality of character strings that are file name candidates,
The generation unit is configured based on a character string selected by the user via the selection unit among a set character string or a plurality of set character strings according to the determination result of the candidate number determination unit. The image processing apparatus according to claim 6 , wherein a file name of the document image data is generated.

The image processing apparatus further determines whether or not the character recognition processing for the document image data of the first document is completed in response to the images of a plurality of documents read by the document reading unit. in accordance with the determination result, including processing stop means for controlling said character recognition means so as to stop the character recognition processing, the image processing apparatus according to any one of claims 1 to 8.