JP6545573B2

JP6545573B2 - Image processing apparatus, image forming apparatus, and chapter division processing method

Info

Publication number: JP6545573B2
Application number: JP2015166217A
Authority: JP
Inventors: 松本　学; 学松本
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2015-08-25
Filing date: 2015-08-25
Publication date: 2019-07-17
Anticipated expiration: 2035-08-25
Also published as: JP2017045203A

Description

本発明は、複数ページの原稿に係る原稿画像データに対して、章毎に分別する処理を行う画像処理装置、画像形成装置及び章分け処理方法に関する。 The present invention relates to an image processing apparatus, an image forming apparatus, and a chapter division processing method which perform processing of sorting document image data relating to documents of a plurality of pages into chapters.

従来、複数ページからなり、複数の章からなる原稿を読み取り、読み取った原稿画像データから印刷物、電子文書を作成できる画像形成装置が開示されている。 2. Description of the Related Art Conventionally, an image forming apparatus capable of reading a document consisting of a plurality of pages and having a plurality of chapters and creating printed matter and an electronic document from the read document image data is disclosed.

例えば、特許文献１においては、章分け箇所の候補を、以下の１つ以上の条件の組み合わせで抽出する画像形成装置が開示されている。
（１）原稿画像内の行の文字サイズがユーザの指定した文字サイズより大きいときに、該行を章分け箇所の候補とする。
（２）原稿画像内の空白行数がユーザの指定した空白行数より大きいときに、空白行の次の行を章分け箇所の候補とする。
（３）ユーザの指定したページ番号に合致した原稿画像内の先頭行を章分け箇所の候補とする。
（４）原稿画像内の行の文字列にユーザの指定した文字列含むときに、当該行を章分け箇所の候補とする。
（５）原稿画像内のユーザの指定した行を章分け箇所の候補とする。 For example, Patent Document 1 discloses an image forming apparatus that extracts candidates for chapter division locations under a combination of one or more of the following conditions.
(1) When the character size of the line in the document image is larger than the character size designated by the user, the line is made a chapter division candidate.
(2) When the number of blank lines in the document image is larger than the number of blank lines specified by the user, the line following the blank line is set as a candidate for division into chapters.
(3) The first line in the document image matching the page number designated by the user is set as a chapter division candidate.
(4) When the character string of the line in the document image includes the character string designated by the user, the line is set as a chapter division candidate.
(5) A line designated by the user in the document image is used as a chapter division candidate.

特開２０１０−１０９４２０号公報JP, 2010-109420, A

しかしながら、特許文献１の画像形成装置は、何れの条件においてもユーザが条件を設定する必要があり、ユーザ使用性、利便性として煩雑さを伴うが故、ユーザが簡易に使用することが難しいという問題がある。 However, in the image forming apparatus of Patent Document 1, the user needs to set the conditions under any conditions, and although it is complicated as user usability and convenience, it is difficult for the user to use easily. There's a problem.

また、特許文献１の画像形成装置は、章分けを行う処理を開示するのみであって、章分けした結果の利用については、言及されていない。 Further, the image forming apparatus of Patent Document 1 only discloses the process of dividing a chapter, and does not mention the use of the result of dividing the chapter.

本発明は、斯かる事情に鑑みてなされたものであり、その目的とするところは、複数ページの原稿に係る原稿画像データに対して、章毎に分別する処理を行う場合において、該原稿に係る原稿画像データに対して、簡単、かつ、適確に、章毎に分別する章分けの処理を行うことが出来る画像処理装置、画像形成装置及び章分け処理方法を提供することにある。 The present invention has been made in view of such circumstances, and the object of the present invention is to separate original image data relating to a plurality of pages of an original image data into chapters in the case of performing sorting for each chapter. It is an object of the present invention to provide an image processing apparatus, an image forming apparatus, and a chapter division processing method capable of performing processing of chapter division for sorting into chapters easily and appropriately with respect to such document image data.

本発明に係る画像処理装置は、複数ページの原稿に係る原稿画像データに対して、章毎に分別する処理を行う画像処理装置において、前記原稿画像データに対して文字認識処理を施し、最大文字サイズを検出する文字サイズ検出部と、前記最大文字サイズを有する文字列を抽出する文字列抽出部と、章の始まりのページにて章の区分を表す章番号のパターンを記憶している記憶部と、前記文字列抽出部によって抽出された抽出文字列から、前記パターンに基づいて数字を抽出し、該抽出文字列に係るページ番号を前記原稿画像データから取得する章情報取得部とを備え、前記記憶部は、抽出された数字に対応付けて、前記抽出文字列及びページ番号を記憶することを特徴とする。 The image processing apparatus according to the present invention performs character recognition processing on the document image data in an image processing apparatus that performs processing to separate document image data related to documents of a plurality of pages for each chapter, and the maximum character size A character size detection unit that detects a size, a character string extraction unit that extracts a character string having the maximum character size, and a storage unit that stores a chapter number pattern representing a chapter division on a chapter start page And a chapter information acquisition unit which extracts a number based on the pattern from the extracted character string extracted by the character string extraction unit, and acquires a page number related to the extracted character string from the document image data; The storage unit is characterized by storing the extracted character string and the page number in association with the extracted number.

本発明に係る画像処理装置は、抽出された数字が複数である場合、前記章情報取得部によって取得された数字及びページ番号に基づいて、昇降順における抜け数字の数を求め、抜け数字を補完する抜け補完部を備えることを特徴とする。 In the image processing apparatus according to the present invention, when there are a plurality of extracted numbers, the number of missing numbers in the ascending and descending order is obtained based on the numbers and page numbers acquired by the chapter information acquiring unit, and the missing numbers are complemented. It has a missing part complementing part.

本発明に係る画像処理装置は、前記抜け補完部は、抽出された数字が１つである場合、前記ページ番号及び前記原稿の最終ページ番号によって定められる範囲に対して、前記抜け数字の補完を行うことを特徴とする。 In the image processing apparatus according to the present invention, when the number of extracted numerals is one, the missing complement unit complements the missing numerals with respect to a range defined by the page number and the final page number of the document. It is characterized by doing.

本発明に係る画像処理装置は、前記文字サイズ検出部は、各ページの一行目の文字列に対してのみ前記検出を行うことを特徴とする。 The image processing apparatus according to the present invention is characterized in that the character size detection unit performs the detection only for the character string on the first line of each page.

本発明に係る画像処理装置は、前記文字列抽出部は、各ページの一行目の文字列に対してのみ前記抽出を行うことを特徴とする。 The image processing apparatus according to the present invention is characterized in that the character string extraction unit performs the extraction only on the character string on the first line of each page.

本発明に係る画像処理装置は、前記章情報取得部は、前記抽出文字列のうち、最初の一つ又は複数の文字が前記パターンと一致する抽出文字列を検索し、検索された抽出文字列から、対応するパターンに含まれる章番号と一致する数字を抽出することを特徴とする。 In the image processing apparatus according to the present invention, the chapter information acquisition unit searches for an extracted character string in which the first one or more characters of the extracted character strings match the pattern, and the extracted character string is searched , And a digit corresponding to the chapter number included in the corresponding pattern is extracted.

本発明に係る画像形成装置は、請求項１から６の何れか一つに記載の画像処理装置と、シート状の記録媒体に画像形成を行う画像形成部と、特定紙が収容されたトレイと、前記画像形成を行う際、前記処理の結果に基づいて、章の切り替わりに、特定紙を挿入する挿入部とを備えることを特徴とする。 An image forming apparatus according to the present invention comprises an image processing apparatus according to any one of claims 1 to 6, an image forming section for forming an image on a sheet-like recording medium, and a tray containing specific paper. The method is characterized in that, when performing the image formation, an insertion section for inserting a specific sheet is provided to switch the chapter based on the result of the processing.

本発明に係る画像形成装置は、前記画像形成部は、前記章情報取得部によって取得された抽出文字列に係る数字、ページ番号を該抽出文字列に対応付けて、前記原稿に係る目次の画像形成を行うことを特徴とする。 In the image forming apparatus according to the present invention, the image forming unit associates the number and page number related to the extracted character string acquired by the chapter information acquiring unit with the extracted character string, and the image of the table of contents related to the document It is characterized by performing formation.

本発明に係る章分け処理方法は、章の始まりのページにて章の区分を表す章番号のパターンを記憶している記憶部を備えており、複数ページの原稿に係る原稿画像データに対する画像処理を行う画像処理装置にて、章毎に分別する処理を行う章分け処理方法において、前記原稿画像データに対して文字認識処理を施し、最大文字サイズを検出し、前記最大文字サイズを有する文字列を抽出し、前記記憶部に記憶されているパターンに基づいて、抽出された抽出文字列から数字を抽出し、該抽出文字列に係るページ番号を前記原稿画像データから取得し、抽出された数字に対応付けて、前記抽出文字列及びページ番号を記憶することを特徴とする。 A chapter division processing method according to the present invention includes a storage unit storing a chapter number pattern representing division of a chapter on a page at the beginning of a chapter, and performs image processing on document image data related to a plurality of pages of documents In the chapter division processing method of performing classification processing for each chapter in an image processing apparatus for performing character recognition, character recognition processing is performed on the document image data to detect a maximum character size, and a character string having the maximum character size Are extracted, a number is extracted from the extracted character string based on the pattern stored in the storage unit, a page number related to the extracted character string is obtained from the document image data, and the extracted number is extracted. And storing the extracted character string and the page number.

本発明によれば、原稿画像データに対して、簡単、かつ、適確に、章分けの処理を行うことが出来る。 According to the present invention, it is possible to perform chapter division processing easily and properly on original image data.

本実施の形態に係るデジタルカラー複写機の構成を示す縦断面図である。FIG. 1 is a longitudinal sectional view showing a configuration of a digital color copying machine according to an embodiment of the present invention. 本実施の形態に係る複写機の装置全体の各部を制御する制御系を説明する機能ブロック図である。FIG. 2 is a functional block diagram for explaining a control system that controls each part of the entire copying machine according to the present embodiment. 本実施の形態に係る複写機における、原稿画像データの読み取り処理及び章分けの処理を説明するフローチャートである。FIG. 6 is a flowchart for describing reading processing of document image data and chapter division processing in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、ハードディスクに記憶された章表示文字列のパターン、章番号文字数テーブル、及び最終 Letter Indexテーブルを概念的に表す概念図である。FIG. 8 is a conceptual diagram conceptually showing the chapter display character string pattern, chapter number character count table, and final Letter Index table stored in the hard disk in the copying machine according to the present embodiment. 本実施の形態に係る複写機において、文字サイズ検出部によって行われる最大文字サイズ検出の処理を説明するフローチャートである。FIG. 7 is a flowchart illustrating processing of maximum character size detection performed by the character size detection unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、文字列抽出部によって行われる文字列抽出の処理を説明するフローチャートである。In the copying machine concerning this embodiment, it is a flow chart explaining processing of character string extraction performed by a character string extraction part. 本実施の形態に係る複写機において、章情報取得部によって行われる章情報取得の処理を説明するフローチャートである。FIG. 7 is a flowchart illustrating a chapter information acquisition process performed by the chapter information acquisition unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、章情報取得部によって行われる章文字パターンの検索の処理を説明するフローチャートである。FIG. 7 is a flowchart illustrating a chapter character pattern search process performed by the chapter information acquisition unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、章情報取得部によって行われる章番号文字合致照合の処理を説明するフローチャートである。FIG. 7 is a flowchart illustrating a chapter number character matching process performed by the chapter information acquisition unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、抜け補完部によって行われる抜け補完の処理を説明するフローチャートである。FIG. 7 is a flow chart for explaining the process of missing complementation performed by the missing supplement unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、抜け補完部によって行われる抜け補完の処理を説明するフローチャートである。FIG. 7 is a flow chart for explaining the process of missing complementation performed by the missing supplement unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、抜け補完部によって行われる第１補完の処理を説明するフローチャートである。FIG. 7 is a flowchart illustrating processing of a first complementation performed by a missing part complementation unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、抜け補完部によって行われる第１補完の処理を説明するフローチャートである。FIG. 7 is a flowchart illustrating processing of a first complementation performed by a missing part complementation unit in the copying machine according to the present embodiment. FIG. 本実施の形態に係る複写機において、抜け補完部によって行われる第２補完の処理を説明するフローチャートである。FIG. 7 is a flowchart illustrating processing of second complementation performed by the missing part complementing unit in the copying machine according to the present embodiment. FIG.

以下に、本発明の実施の形態に係る画像処理装置及び画像形成装置を、いわゆる複写機に適用した場合を例として、図面に基づいて詳述する。 Hereinafter, an image processing apparatus and an image forming apparatus according to an embodiment of the present invention will be described in detail based on the drawings, taking as an example a case where it is applied to a so-called copying machine.

（実施の形態１）
図１は本実施の形態に係るデジタルカラー複写機の構成を示す縦断面図である。複写機１の上面には、原稿台１１１及び後述する操作パネルが設けられ、複写機１の内部に画像読取部１１０及び画像形成部２１０が設けられている。 Embodiment 1
FIG. 1 is a longitudinal sectional view showing the configuration of a digital color copying machine according to the present embodiment. A document table 111 and an operation panel to be described later are provided on the upper surface of the copying machine 1, and an image reading unit 110 and an image forming unit 210 are provided inside the copying machine 1.

原稿台１１１の上面には該原稿台１１１に対して開閉可能な状態で支持され、両面自動原稿送り装置（RADF；Reversing Automatic Document Feeder）１１２が装着されている。 A double-sided automatic document feeder (RADF) 112 is mounted on the upper surface of the document table 111 so as to be openable / closable relative to the document table 111.

さらに、両面自動原稿送り装置１１２は、まず、原稿の一方の面が原稿台１１１の所定位置において画像読取部１１０に対向するよう原稿を搬送し、この際、斯かる面の画像の読み取りが行われる。この一方の面についての画像読み取りが終了した後、両面自動原稿送り装置１１２は他方の面が原稿台１１１の所定位置において画像読取部１１０に対向するよう原稿を反転し、原稿台１１１の所定位置に向かって搬送し、斯かる面に対する画像形成が行われる。そして、両面自動原稿送り装置１１２は、１枚の原稿について両面の画像読み取りが終わった後、この原稿を排出し、次の原稿についても同様に両面搬送動作を実行する。以上の両面自動原稿送り装置１１２の動作は、複写機全体の動作に関連して制御されるものである。 Furthermore, the double-sided automatic document feeder 112 first conveys the document so that one side of the document faces the image reading unit 110 at a predetermined position of the document table 111. At this time, reading of the image on the side is performed. It will be. After the image reading on the one side is completed, the double-sided automatic document feeder 112 reverses the document so that the other side faces the image reading unit 110 at a predetermined position of the document table 111 and the predetermined position of the document table 111 And image formation on such a surface is performed. Then, after the double-sided automatic document feeding device 112 completes the double-sided image reading of one sheet of document, it discharges this document, and similarly executes the double-sided conveyance operation for the next document. The operation of the above-described duplex automatic document feeder 112 is controlled in relation to the operation of the entire copying machine.

画像読取部１１０は、両面自動原稿送り装置１１２により原稿台１１１上の所定位置に搬送される原稿の画像を読み取るために、原稿台１１１の下方に配置されている。また、画像読取部１１０は該原稿台１１１の下面に沿って平行に往復移動する（原稿台１１１上に置かれた原稿を読み取る場合）原稿走査体１１３、１１４と、光学レンズ１１５と、光電変換素子であるＣＣＤラインセンサ１１６とを有している。 The image reading unit 110 is disposed below the document table 111 in order to read an image of a document conveyed to a predetermined position on the document table 111 by the duplex automatic document feeder 112. Further, the image reading unit 110 reciprocates in parallel along the lower surface of the document table 111 (when reading a document placed on the document table 111) document scanning bodies 113 and 114, an optical lens 115, and photoelectric conversion And a CCD line sensor 116 which is an element.

原稿走査体１１３、１１４は、第１の走査ユニット１１３と第２の走査ユニット１１４とから構成されている。第１の走査ユニット１１３は原稿の表面を露光する露光ランプと、原稿からの反射光像を所定の方向に向かって偏向する第１ミラーとを有し、原稿台１１１上に原稿が置かれた場合には、原稿台１１１の下面に対して一定の距離を保ちながら所定の走査速度で平行に往復移動するものである。また、両面自動原稿送り装置１１２にて原稿が搬送され、原稿が読み取られる場合には、所定位置で停止している。 The document scanning bodies 113 and 114 are composed of a first scanning unit 113 and a second scanning unit 114. The first scanning unit 113 has an exposure lamp for exposing the surface of the document, and a first mirror for deflecting a reflected light image from the document toward a predetermined direction, and the document is placed on the document table 111 In this case, it reciprocates in parallel at a predetermined scanning speed while maintaining a fixed distance from the lower surface of the document table 111. Further, when the original is conveyed by the duplex automatic document feeder 112 and the original is read, the document is stopped at a predetermined position.

第２の走査ユニット１１４は、第１の走査ユニット１１３の前記第１ミラーにより偏向された原稿からの反射光像をさらに所定の方向に向かって偏向する第２及び第３ミラーとを有し、原稿台１１１上に原稿が置かれた場合には、第１の走査ユニット１１３と一定の速度関係を保って平行に往復移動する。 The second scanning unit 114 has second and third mirrors for further deflecting the reflected light image from the document deflected by the first mirror of the first scanning unit 113 in a predetermined direction, When an original is placed on the original table 111, the original is reciprocated in parallel with the first scanning unit 113 while maintaining a constant speed relationship.

光学レンズ１１５は、第２の走査ユニット１１４の前記第３ミラーにより偏向された原稿からの反射光像を縮小し、縮小された光像をＣＣＤラインセンサ１１６上の所定位置に結像させる。 The optical lens 115 reduces the reflected light image from the document deflected by the third mirror of the second scanning unit 114, and forms the reduced light image at a predetermined position on the CCD line sensor 116.

ＣＣＤラインセンサ１１６は、結像された光像を順次光電変換して電気信号として出力する。ＣＣＤラインセンサ１１６は、白黒画像又はカラー画像を読み取り、Ｒ(赤)、Ｇ(緑)、Ｂ(青)の各色成分に色分解したラインデータを出力することのできる３ラインのカラーＣＣＤである。 The CCD line sensor 116 photoelectrically converts the formed light image sequentially and outputs it as an electric signal. The CCD line sensor 116 is a 3-line color CCD that can read black and white images or color images and output line data separated into color components of R (red), G (green) and B (blue). .

次に、画像形成部２１０の構成、及び画像形成部２１０に係わる各部の構成について説明する。
画像形成部２１０の下方には、用紙トレイ内に積載収容されている記録用紙Ｐを１枚ずつ分離して画像形成部２１０に向かって供給する給紙機構２１１ａ〜２１１ｃが設けられている。そして１枚ずつ分離供給された記録用紙Ｐは、画像形成部２１０の手前に配置された一対のレジストローラ２１２によりタイミングが制御されて画像形成部２１０に搬送される。さらに、片面に画像が形成された記録用紙Ｐは、画像形成部２１０の画像形成にタイミングを合わせて画像形成部２１０に再供給搬送される。 Next, the configuration of the image forming unit 210 and the configuration of each unit related to the image forming unit 210 will be described.
Below the image forming unit 210, sheet feeding mechanisms 211a to 211c are provided which separate the recording sheets P stacked and accommodated in the sheet tray one by one and supply the recording sheets P toward the image forming unit 210. The recording paper P separated and supplied one by one is conveyed to the image forming unit 210 with its timing controlled by a pair of registration rollers 212 arranged in front of the image forming unit 210. Further, the recording sheet P on which the image is formed on one side is re-supplied and conveyed to the image forming unit 210 in timing with the image formation of the image forming unit 210.

また、画像形成部２１０の下方には、転写搬送ベルト機構２１３が配置されている。転写搬送ベルト機構２１３は、駆動ローラ２１４と従動ローラ２１５との間に略平行に伸びるように張架された転写搬送ベルト２１６に記録用紙Ｐを静電吸着させて搬送する。そして、転写搬送ベルト２１６の下側に近接して、パターン画像検出ユニットが設けられている。 Further, below the image forming unit 210, a transfer conveyance belt mechanism 213 is disposed. The transfer conveyance belt mechanism 213 electrostatically attracts the recording sheet P to the transfer conveyance belt 216 stretched so as to extend substantially in parallel between the driving roller 214 and the driven roller 215 and conveys the recording sheet P. A pattern image detection unit is provided in proximity to the lower side of the transfer conveyance belt 216.

さらに、用紙搬送路における転写搬送ベルト機構２１３の下流側には、記録用紙Ｐ上に転写形成されたトナー像を記録用紙Ｐ上に定着させるための定着装置２１７が配置されている。この定着装置２１７の一対の定着ローラ間を通過した記録用紙Ｐは、搬送方向切り換えゲート２１８を経て、排出ローラ２１９により複写機１の外側に取り付けられている排紙トレイ２２０上に排出される。 Further, on the downstream side of the transfer conveyance belt mechanism 213 in the sheet conveyance path, a fixing device 217 for fixing the toner image transferred and formed on the recording sheet P on the recording sheet P is disposed. The recording sheet P having passed between the pair of fixing rollers of the fixing device 217 passes through the conveyance direction switching gate 218 and is discharged onto the sheet discharge tray 220 attached to the outside of the copying machine 1 by the discharge roller 219.

切り換えゲート２１８は、定着後の記録用紙Ｐの搬送経路を、排紙トレイ２２０へ記録用紙Ｐを排出する経路と、画像形成部２１０に向かって記録用紙Ｐを再供給する経路との間で選択的に切り換えるものである。切り換えゲート２１８により再び画像形成部２１０に向かって搬送方向が切り換えられた記録用紙Ｐは、スイッチバック搬送経路２２１を介して表裏反転された後、画像形成部２１０へと再度供給される。 The switching gate 218 selects the transport path of the recording sheet P after fixing, between the path for discharging the recording sheet P to the sheet discharge tray 220 and the path for resupplying the recording sheet P toward the image forming unit 210. Switching. The recording sheet P whose transport direction has been switched back to the image forming unit 210 by the switching gate 218 is reversed over the front and back through the switchback transport path 221, and then supplied again to the image forming unit 210.

また、画像形成部２１０における転写搬送ベルト２１６の上方には、転写搬送ベルト２１６に近接して、第１の画像形成ステーションＰａ、第２の画像形成ステーションＰｂ、第３の画像形成ステーションＰｃ、及び第４の画像形成ステーションＰｄが、用紙搬送経路の上流側から順に並設されている。 The first image forming station Pa, the second image forming station Pb, the third image forming station Pc, and the image forming station 210 are located above the transfer conveying belt 216 in the image forming unit 210 and close to the transfer conveying belt 216. The fourth image forming station Pd is juxtaposed in order from the upstream side of the sheet conveyance path.

転写搬送ベルト２１６は駆動ローラ２１４によって、図１において矢印Ｚで示す方向に摩擦駆動され、上述したように給紙機構２１１ａ〜２１１ｃを通じて給送される記録用紙Ｐを担持し、記録用紙Ｐを画像形成ステーションＰａ〜Ｐｄへと順次搬送する。 The transfer conveyance belt 216 is frictionally driven by the drive roller 214 in the direction indicated by the arrow Z in FIG. 1, and carries the recording sheet P fed through the sheet feeding mechanisms 211a to 211c as described above. The sheet is sequentially transported to the forming stations Pa to Pd.

各画像ステーションＰａ〜Ｐｄは、実質的に同一の構成を有している。各画像ステーションＰａ、Ｐｂ、Ｐｃ、Ｐｄは、図１に示す矢印Ｆ方向に回転駆動される感光体ドラム２２２ａ、２２２ｂ、２２２ｃ、及び２２２ｄを夫々含んでいる。 Each of the image stations Pa to Pd has substantially the same configuration. Each of the image stations Pa, Pb, Pc, and Pd includes photosensitive drums 222a, 222b, 222c, and 222d which are rotationally driven in the direction of arrow F shown in FIG.

各感光体ドラム２２２ａ〜２２２ｄの周辺には、感光体ドラム２２２ａ〜２２２ｄを夫々一様に帯電する帯電器２２３ａ、２２３ｂ、２２３ｃ、２２３ｄと、感光体ドラム２２２ａ〜２２２ｄ上に形成された静電潜像を夫々現像する現像装置２２４ａ、２２４ｂ、２２４ｃ、２２４ｄと、現像された感光体ドラム２２２ａ〜２２２ｄ上のトナー像を記録用紙Ｐへ転写する転写用放電器２２５ａ、２２５ｂ、２２５ｃ、２２５ｄと、感光体ドラム２２２ａ〜２２２ｄ上に残留するトナーを除去するクリーニング装置２２６ａ、２２６ｂ、２２６ｃ、２２６ｄとが感光体ドラム２２２ａ〜２２２ｄの回転方向に沿って順次配置されている。 Around the photosensitive drums 222a to 222d, charging devices 223a, 223b, 223c, and 223d for uniformly charging the photosensitive drums 222a to 222d, respectively, and electrostatic latent formed on the photosensitive drums 222a to 222d are provided. Developing devices 224a, 224b, 224c and 224d for developing the image, and dischargers for transfer 225a, 225b, 225c and 225d for transferring the toner image on the photosensitive drums 222a to 222d to which the image was developed, and photosensitive Cleaning devices 226a, 226b, 226c and 226d for removing the toner remaining on the body drums 222a to 222d are sequentially arranged along the rotational direction of the photosensitive drums 222a to 222d.

また、各感光体ドラム２２２ａ〜２２２ｄの上方には、レーザビームスキャナユニット２２７ａ、２２７ｂ、２２７ｃ、２２７ｄが夫々設けられている。レーザビームスキャナユニット２２７ａ〜２２７ｄは、画像データに応じて変調されたドット光を発する半導体レーザ素子(図示せず)、半導体レーザ素子からのレーザビームを主走査方向に偏向させるためのポリゴンミラー２４０ａ〜２４０ｄと、ポリゴンミラー２４０ａ〜２４０ｄにより偏向されたレーザビームを感光体ドラム２２２ａ〜２２２ｄ表面に結像させるためのｆθレンズ２４１ａ〜２４１ｄ、ミラー２４２ａ〜２４２ｄ、２４３ａ〜２４３ｄなどから構成されている。 Laser beam scanner units 227a, 227b, 227c, and 227d are provided above the photosensitive drums 222a to 222d, respectively. The laser beam scanner units 227a to 227d are semiconductor laser elements (not shown) that emit dot light modulated according to image data, and polygon mirrors 240a to 240a for deflecting laser beams from the semiconductor laser elements in the main scanning direction. 240 d, fθ lenses 241 a to 241 d for forming laser beams deflected by the polygon mirrors 240 a to 240 d on the surfaces of the photosensitive drums 222 a to 222 d, mirrors 242 a to 242 d, 243 a to 243 d, and the like.

レーザビームスキャナ２２７ａにはカラー原稿画像の黒色成分像に対応する画素信号が、レーザビームスキャナ２２７ｂにはカラー原稿画像のシアン色成分像に対応する画素信号が、レーザビームスキャナ２２７ｃにはカラー原稿画像のマゼンタ色成分像に対応する画素信号が、そして、レーザビームスキャナ２２７ｄにはカラー原稿画像のイエロー色成分像に対応する画素信号が夫々入力される。 The laser beam scanner 227a has a pixel signal corresponding to the black component image of the color original image, the laser beam scanner 227b has a pixel signal corresponding to the cyan color component image of the color original image, and the laser beam scanner 227c has a color original image And the pixel signal corresponding to the yellow color component image of the color original image are input to the laser beam scanner 227d.

これにより色変換された原稿画像情報に対応する静電潜像が各感光体ドラム２２２ａ〜２２２ｄ上に形成される。そして、現像装置２２４ａには黒色のトナーが、現像装置２２４ｂにはシアン色のトナーが、現像装置２２４ｃにはマゼンタ色のトナーが、現像装置２２４ｄにはイエロー色のトナーが夫々収容されており、感光体ドラム２２２ａ〜２２２ｄ上の静電潜像は、これら各色のトナーにより現像される。これにより、画像形成部２１０にて色変換された原稿画像情報が各色のトナー像として再現される。 Thus, electrostatic latent images corresponding to the color-converted original image information are formed on the respective photosensitive drums 222a to 222d. The black toner is stored in the developing device 224a, the cyan toner is stored in the developing device 224b, the magenta toner is stored in the developing device 224c, and the yellow toner is stored in the developing device 224d. The electrostatic latent images on the photosensitive drums 222a to 222d are developed with the toners of the respective colors. As a result, the original image information color-converted by the image forming unit 210 is reproduced as a toner image of each color.

また、第１の画像形成ステーションＰａと給紙機構２１１aとの間には用紙吸着用帯電器２２８が設けられており、この吸着用帯電器２２８は転写搬送ベルト２１６の表面を帯電させ、給紙機構２１１ａから供給された記録用紙Ｐは、転写搬送ベルト２１６上に確実に吸着させた状態で第１の画像形成ステーションＰａから第４の画像形成ステーションＰｄの間をずれることなく搬送させる。 In addition, a sheet adsorbing charger 228 is provided between the first image forming station Pa and the sheet feeding mechanism 211a, and the adsorbing charger 228 charges the surface of the transfer conveyance belt 216 to feed the sheet. The recording paper P supplied from the mechanism 211 a is transported without deviation between the first image forming station Pa and the fourth image forming station Pd in a state where the recording paper P is securely attracted onto the transfer conveying belt 216.

一方、第４の画像ステーションＰｄと定着装置２１７との間で駆動ローラ２１４のほぼ真上部には除電器２２９が設けられている。除電器２２９には搬送ベルト２１６に静電吸着されている記録用紙Ｐを転写搬送ベルト２１６から分離するための交流電流が印加されている。 On the other hand, a static eliminator 229 is provided almost immediately above the driving roller 214 between the fourth image station Pd and the fixing device 217. An alternating current for separating the recording sheet P electrostatically attracted to the conveyance belt 216 from the transfer conveyance belt 216 is applied to the static eliminator 229.

上記構成の複写機１においては、記録用紙Ｐとしてカットシート状の紙が使用される。この記録用紙Ｐは、給紙トレイから送り出されて給紙機構２１１ａ〜２１１ｃの給紙搬送経路のガイド内に供給されると、その記録用紙Ｐの先端部分がセンサ（図示せず）にて検知され、このセンサから出力される検知信号に基づいて一対のレジストローラ２１２により一旦停止される。 In the copying machine 1 configured as described above, cut sheet-like paper is used as the recording paper P. When the recording sheet P is fed out of the sheet feeding tray and fed into the guide of the sheet feeding conveyance path of the sheet feeding mechanisms 211a to 211c, a leading end portion of the recording sheet P is detected by a sensor (not shown). And is temporarily stopped by the pair of registration rollers 212 based on the detection signal output from this sensor.

そして、記録用紙Ｐは各画像ステーションＰａ〜Ｐｄとタイミングをとって図１の矢印Ｚ方向に回転している転写搬送ベルト２１６上に送られる。このとき転写搬送ベルト２１６には前述したように吸着用帯電器２２８により所定の帯電が施されているので、記録用紙Ｐは、各画像ステーションＰａ〜Ｐｄを通過する間、安定して搬送供給が行われるようになる。 Then, the recording sheet P is sent onto the transfer conveyance belt 216 rotating in the direction of the arrow Z in FIG. 1 in timing with each of the image stations Pa to Pd. At this time, since the transfer conveyance belt 216 is charged by the suction charger 228 as described above, the recording paper P is stably conveyed and supplied while passing through the image stations Pa to Pd. It will be done.

各画像ステーションＰａ〜Ｐｄにおいては、各色のトナー像が、夫々形成され、転写搬送ベルト２１６により静電吸着されて搬送される記録用紙Ｐの支持面上で重ね合わされる。第４の画像ステーションＰｄによる画像の転写が完了すると、記録用紙Ｐは、その先端部分から順次、除電用放電器により転写搬送ベルト２１６上から剥離され、定着装置２１７へと導かれる。最後に、トナー画像が定着された記録用紙Ｐは、用紙排出口(図示せず)から排紙トレイ２２０上へと排出される。 In each of the image stations Pa to Pd, toner images of the respective colors are respectively formed, and are superimposed on the supporting surface of the recording paper P which is electrostatically attracted and conveyed by the transfer conveyance belt 216. When the transfer of the image by the fourth image station Pd is completed, the recording paper P is peeled off from the transfer conveyance belt 216 sequentially by the discharging device from the leading end portion, and is guided to the fixing device 217. Finally, the recording sheet P on which the toner image is fixed is discharged onto the discharge tray 220 from a sheet discharge port (not shown).

なお、上述の説明ではレーザビームスキャナユニット２２７ａ〜２２７ｄによって、レーザビームを走査して露光することにより、感光体への光書き込みを行なう。しかし、レーザビームスキャナユニットの代わりに、発光ダイオードアレイと結像レンズアレイからなる書き込み光学系（ＬＥＤヘッド）を用いても良い。ＬＥＤヘッドはレーザビームスキャナユニットに比べ、サイズも小さく、また可動部分がなく無音である。よって、複数個の光書き込みユニットを必要とするタンデム方式のデジタルカラー複写機などの画像形成装置では、好適に用いることができる。 In the above description, the laser beam is scanned and exposed by the laser beam scanner units 227a to 227d to write light on the photosensitive member. However, instead of the laser beam scanner unit, a writing optical system (LED head) comprising a light emitting diode array and an imaging lens array may be used. The LED head is smaller in size than the laser beam scanner unit, and has no moving parts and is silent. Therefore, it can be suitably used in an image forming apparatus such as a tandem digital color copying machine which requires a plurality of light writing units.

図２は本実施の形態に係る複写機１の装置全体の各部を制御する制御系を説明する機能ブロック図である。複写機１は制御部４を備えており、制御部４は、ＣＰＵ４０（挿入部）と、ＲＡＤＦ制御部４１と、スキャナ制御部４２と、画像準備制御部４３と、画像形成制御部４４と、給紙トレイ制御部４５と、後処理制御部４６、文字サイズ検出部４７、文字列抽出部４８、章情報取得部４９、及び、抜け補完部５０とを有している。 FIG. 2 is a functional block diagram for explaining a control system that controls each part of the entire apparatus of the copying machine 1 according to the present embodiment. The copying machine 1 includes a control unit 4. The control unit 4 includes a CPU 40 (insertion unit), a RADF control unit 41, a scanner control unit 42, an image preparation control unit 43, and an image formation control unit 44. A paper feed tray control unit 45, a post-processing control unit 46, a character size detection unit 47, a character string extraction unit 48, a chapter information acquisition unit 49, and a omission complement unit 50 are provided.

画像準備制御部４３は、原稿読み取り部１１０の制御を行うスキャナ制御部４２、一時的に印刷すべき画像を記憶する画像メモリ５３、レーザビームスキャナユニット２２７ａ〜２２７ｄの制御を行う露光制御部５１、画像データ蓄積用のハードディスク５５など、複写機１を構成する印刷画像準備用の各ユニットをシーケンス制御により管理すると共に、フィルタ処理、変倍処理、マスキング処理、ガンマ処理などの画像処理を行う。また、画像形成制御部４４と通信を行い、連携することで印刷ジョブの実行を行う。 The image preparation control unit 43 controls the document reading unit 110. The image control unit 53 stores an image to be printed temporarily. The exposure control unit 51 controls the laser beam scanner units 227a to 227d. The units for print image preparation constituting the copying machine 1, such as the hard disk 55 for image data storage, are managed by sequence control, and image processing such as filter processing, scaling processing, masking processing, and gamma processing is performed. Further, communication with the image formation control unit 44 is performed to cooperate with the image formation control unit 44 to execute a print job.

さらに画像形成制御部４４には、給紙機構２１１ａ、２１１ｂ、２１１ｃの制御を行う給紙トレイ制御部４５、後処理の制御を行う後処理制御部４６と相互通信可能な状態で接続されており、エンジン負荷部５６のセンサを入力し、モーター等を出力制御し、各所を統轄的にシーケンス制御することで、印刷画像を形成するように動作している。 Further, the image formation control unit 44 is connected in a mutually communicable state with the paper feed tray control unit 45 that controls the paper feed mechanisms 211 a, 211 b, and 211 c and the post processing control unit 46 that controls the post processing. The sensor of the engine load unit 56 is input to control the output of the motor and the like, and the sequence control of various places is performed to form a print image.

スキャナ制御部４２は、ＣＣＤラインセンサ１１６から原稿画像信号を受け取り、画像準備制御部４３に送るともに、両面自動原稿送り装置（ＲＡＤＦ）と通信し、原稿送り制御を行わせる。また、スキャナ制御部４２は、原稿読み取り部１１０のモーター、ソレノイド等からなるスキャナ負荷部５２に対するシーケンス制御を行うための制御信号を出力するとともに、表示部、該表示部を覆うタッチパネル、テンキー等からなる操作パネルを構成する操作基板ユニット５４と通信を行って、操作状況のモニタ、各種の表示制御を行う。 The scanner control unit 42 receives a document image signal from the CCD line sensor 116, sends it to the image preparation control unit 43, and communicates with a double-sided automatic document feeder (RADF) to perform document feed control. The scanner control unit 42 outputs a control signal for performing sequence control to the scanner load unit 52 including a motor, a solenoid, etc. of the document reading unit 110, and a display unit, a touch panel covering the display unit, ten keys, etc. Communication is performed with the operation board unit 54 constituting the operation panel to monitor the operation situation and perform various display controls.

複写機１全体の処理の流れを両面自動原稿送り装置１１２を使用したコピーの場合を例にとり説明する。 The flow of processing of the entire copying machine 1 will be described by taking the case of copying using the duplex automatic document feeder 112 as an example.

ユーザは、複写機１の両面自動原稿送り装置１１２に原稿を載置し、前記操作パネルを適宜操作することにより、コピーに係る設定を受け付ける設定画面を前記表示部に表示させ、コピー枚数等を設定した後、スタートキーを操作することによりコピーの開始を指示する。 The user places an original on the duplex automatic document feeder 112 of the copying machine 1 and operates the operation panel appropriately to display a setting screen for receiving settings relating to copying on the display unit, and to display the number of copies, etc. After setting, the start key is instructed by operating the start key.

スキャナ制御部４２は、画像読取部１１０の第１の走査ユニット１１３を両面自動原稿送り装置１１２から搬送される原稿を読み取るための所定位置へ移動させ、両面自動原稿送り装置１１２により搬送される原稿の読み取りを開始する。 The scanner control unit 42 moves the first scanning unit 113 of the image reading unit 110 to a predetermined position for reading the document conveyed from the double-sided automatic document feeder 112, and the document conveyed by the double-sided automatic document feeder 112 Start reading.

この際、ＣＣＤラインセンサ１１６では、それを色分解してＲＧＢのアナログ画像信号を得る。更に該アナログ画像信号に対して、スキャナ制御部４２でデジタル信号への変換を行う。スキャナ制御部４２に送られ、デジタル信号に変換された画像データ（原稿画像データ）は、画像準備制御部４３へ送られ、画像メモリ５３へ格納され、画像メモリ５３へ格納された画像データは順次ハードディスク５５へ一旦格納される。また、これと同時に、画像準備制御部４３は、印刷のための画像データ（印刷画像データ）が格納されたことを画像形成制御部４４へ順次通知する。 At this time, the CCD line sensor 116 performs color separation to obtain an RGB analog image signal. Further, the scanner control unit 42 converts the analog image signal into a digital signal. Image data (original image data) sent to the scanner control unit 42 and converted into digital signals is sent to the image preparation control unit 43, stored in the image memory 53, and the image data stored in the image memory 53 are sequentially It is temporarily stored in the hard disk 55. At the same time, the image preparation control unit 43 sequentially notifies the image formation control unit 44 that the image data (print image data) for printing is stored.

画像形成制御部４４は、印刷を行うための印刷画像データの要求を画像準備制御部４３に対し順次行い、画像準備制御部４３は、要求された印刷画像データをハードディスク５５から順次読み出し、画像メモリ５３へ格納後、フィルタ処理、変倍処理、マスキング処理、ガンマ処理、多値化処理という順に画像処理を行う。この後、画像準備制御部４３は露光制御部５１へ印刷画像データを転送し、画像形成制御部４４へ印刷を行うための印刷画像データが転送完了したことを通知する。 The image formation control unit 44 sequentially requests the image preparation control unit 43 for the print image data for printing, and the image preparation control unit 43 sequentially reads the requested print image data from the hard disk 55, and the image memory After storing in 53, image processing is performed in the order of filter processing, scaling processing, masking processing, gamma processing, and multilevel processing. Thereafter, the image preparation control unit 43 transfers the print image data to the exposure control unit 51, and notifies the image formation control unit 44 that the transfer of the print image data for printing is completed.

画像形成制御部４４は、画像準備制御部４３からの印刷画像データの転送完了の通知を受け、印字を開始する。画像形成部２１０の印字処理スピードが、画像読取部１１０の原稿読み取り速度より遅くなる場合においては、ハードディスク５５に読み取られた画像データが格納されて行くだけで、印刷処理に伴って、順次画像データがハードディスク５５から読み出され、画像処理された後に印刷が行われることになる。 The image formation control unit 44 receives the notification of transfer completion of the print image data from the image preparation control unit 43, and starts printing. When the print processing speed of the image forming unit 210 is slower than the document reading speed of the image reading unit 110, the image data read sequentially in the hard disk 55 is sequentially stored along with the print processing. Are read out from the hard disk 55, and printed after image processing.

文字サイズ検出部４７は前記原稿画像データに対してページ毎に文字認識処理を施し、該原稿画像データの最大文字サイズを検出する。また、文字サイズ検出部４７は該原稿画像データに対してページ毎に、公知のＯＣＲ処理を施し、パターン・マッチング法によって、原稿画像データの各ページの文字認識を行う。この際、文字サイズ検出部４７は、前記原稿画像データ（以下、原稿とも言う。）の各ページの一行目の文字列に対してのみ、最大文字サイズの検出を行う。 The character size detection unit 47 performs character recognition processing for each page of the document image data, and detects the maximum character size of the document image data. Further, the character size detection unit 47 performs known OCR processing on the document image data for each page, and performs character recognition of each page of the document image data by the pattern matching method. At this time, the character size detection unit 47 detects the maximum character size only for the character string on the first line of each page of the document image data (hereinafter, also referred to as a document).

すなわち、一般に、章初めのページには、斯かる章の区分を表す章番号及びタイトルが含まれた見出しが、当該章を表す章表示文字列として、最上側に、大きな文字にて記載されている。斯かる章表示文字列としては、例えば、「第Ｘ章○○○○」、「ChapterＸ○○○○」、「＃Ｘ○○○○」、「Ｘ；○○○○」、「Ｘ．○○○○」等が例に挙げられる。ここで、「Ｘ」は、整数を表す文字であり、例えば、アラビア数字、漢数字、ローマ数字等を含む。なお、「○○○○」は当該章のタイトルである。 That is, generally, on the beginning of a chapter, a heading including a chapter number representing the division of the chapter and a title is described in large letters at the top of the chapter display character string representing the chapter. There is. As such chapter display character strings, for example, “Chapter X XX”, “Chapter X XX”, “# X XX”, “X; XXX”, “X. "○○○○" etc. may be mentioned as an example. Here, “X” is a letter representing an integer, and includes, for example, Arabic numerals, Chinese numerals, Roman numerals and the like. Note that "○○○○" is the title of the chapter.

従って、このように、各ページの一行目の文字列に対してのみ、斯かる検出を行うことによって、一層効率的に、後述する章表示文字列の抽出、章分けの処理等を行うことが出来る。 Therefore, as described above, by performing such detection only for the character string on the first line of each page, it is possible to more efficiently extract the chapter display character string to be described later, perform chapter division processing, etc. It can.

文字列抽出部４８は、前記原稿画像データから、文字サイズ検出部４７によって検出された最大文字サイズを有する文字列を、前記章表示文字列の候補として、抽出する。より詳しくは、文字列抽出部４８は、前記原稿画像データの各ページの一行目の文字列に対してのみ、最大文字サイズを有する文字列の抽出を行い、抽出された章表示文字列の候補（以下、候補文字列と言う。）をハードディスク５５に記憶する。従って、各ページの全ての文字列に対して、斯かる抽出を行う場合に比べ、上述したように、効率的に章表示文字列の抽出、章分けの処理等が可能となる。 The character string extraction unit 48 extracts, from the document image data, a character string having the maximum character size detected by the character size detection unit 47 as a candidate for the chapter display character string. More specifically, the character string extraction unit 48 extracts the character string having the maximum character size only for the character string on the first line of each page of the document image data, and the extracted chapter display character string candidate The hard disk 55 stores (hereinafter referred to as a candidate character string). Therefore, as described above, chapter display character string extraction, chapter division processing, and the like can be performed more efficiently than when such extraction is performed on all character strings of each page.

章情報取得部４９は前記候補文字列に含まれる数字を割り出し（抽出）、該候補文字列が記載されているページ番号（以下、章ページ番号）を前記原稿画像データから取得する。このような処理は、章表示文字列に含まれる章番号のパターンに基づいて行われ、該章番号のパターンはハードディスク５５に記憶されている。 The chapter information acquisition unit 49 determines (extracts) a number included in the candidate character string, and acquires a page number (hereinafter, chapter page number) on which the candidate character string is described from the document image data. Such processing is performed based on the chapter number pattern included in the chapter display character string, and the chapter number pattern is stored in the hard disk 55.

より詳しくは、章情報取得部４９は、先ず、文字列抽出部４８によって抽出された候補文字列のうち、冒頭の一つ又は複数の文字が前記章番号のパターンと一致する候補文字列を検出する。次に、章情報取得部４９は、検出された候補文字列から、該候補文字列に対応する前記章番号のパターンに係る章番号と一致する数字を章番号として割り出す。章情報取得部４９は、このように割り出した章番号をIndexとし、該章番号に関連付けて斯かる文字列及び対応する章ページ番号をハードディスク５５に記憶する。以下においては、前記章番号、該章番号に対応する文字列（以下、章文字列とも言う）、及び章ページ番号を章情報ともいう。また、前記章番号のパターンについては、後で詳しく説明する。 More specifically, the chapter information acquisition unit 49 first detects a candidate character string in which one or more characters at the beginning of the candidate character strings extracted by the character string extraction unit 48 match the pattern of the chapter number. Do. Next, the chapter information acquisition unit 49 determines, from the detected candidate character string, a number matching the chapter number relating to the pattern of the chapter number corresponding to the candidate character string as a chapter number. The chapter information acquisition unit 49 sets the chapter number thus identified as an index, and stores the character string and the corresponding chapter page number in the hard disk 55 in association with the chapter number. In the following, the chapter number, a character string corresponding to the chapter number (hereinafter, also referred to as a chapter string), and a chapter page number are also referred to as chapter information. Also, the pattern of the chapter number will be described in detail later.

また、抜け補完部５０は、各ページの一行目以外の箇所に、章表示文字列（章番号）が存在するかを確認することにより、検出が出来なかった章表示文字列（章番号）があれば補完を行う。より詳しくは、抜け補完部５０は、先ず、章情報取得部４９によって割り出された数字（章番号）が、１つであるか、複数であるかの判断を行う。次に、抜け補完部５０は、割り出された数字が複数である場合、昇順又は降順における抜け数字、すなわち、抜けた章番号（以下、抜け章番号と言う。）の数を求め、抜け章番号を補完（抜け数字の補完）する処理を行う。以下、斯かる処理を抜け補完の処理とも言う。また、抜け補完部５０は、割り出された数字（章番号）が１つである場合、前記ページ番号及び前記原稿の最終ページ番号によって定められる範囲に対して、抜け章番号の補完を行う。 In addition, the missing-completion unit 50 checks the presence of a chapter display character string (chapter number) in a place other than the first line of each page, and the chapter display character string (chapter number) that could not be detected is displayed. If there is a complement. More specifically, the missing part complementing unit 50 first determines whether the number (chapter number) calculated by the chapter information acquiring unit 49 is one or more than one. Next, when there are a plurality of numbers determined, the missing part complement unit 50 determines the number of missing numbers in ascending or descending order, that is, the number of missing chapter numbers (hereinafter referred to as missing chapter numbers), and missing chapters. Perform the process of complementing the numbers (completing of the missing numbers). Hereinafter, such processing is also referred to as processing for missing and complementing. Further, when the number (chapter number) determined is one, the missing part complementing the missing chapter number with respect to the range defined by the page number and the final page number of the document.

ＣＰＵ４０は、ＲＯＭ（図示せず）に予め格納されている制御プログラムをＲＡＭ（図示せず）上にロードして実行することによって、上述した各種ハードウェアの制御を行ない、装置全体を本発明の複写機１として動作させる。 The CPU 40 controls the various hardware described above by loading a control program stored in advance in the ROM (not shown) onto the RAM (not shown) to execute the control of the various hardware described above. The copying machine 1 is operated.

以上のような構成を有する複写機１は、例えば、一つ又は複数の章を含む複数ページからなる原稿の原稿画像データを読み取り、章毎に分別する章分け処理を行うことが出来る。以下、詳しく説明する。 The copying machine 1 having the above-described configuration can read, for example, document image data of a document including a plurality of pages including one or a plurality of chapters, and can perform a chapter division process of sorting into chapters. Details will be described below.

図３は本実施の形態に係る複写機１における、原稿画像データの読み取り処理及び章分けの処理を説明するフローチャートである。以下においては、ハードディスク５５には前記章番号のパターン（以下、章番号パターンとも言う）、章番号文字数テーブル、及び最終 Letter Indexテーブルが記憶されているものとする。 FIG. 3 is a flow chart for explaining reading processing of document image data and chapter division processing in the copying machine 1 according to the present embodiment. In the following, it is assumed that the hard disk 55 stores the chapter number pattern (hereinafter also referred to as a chapter number pattern), a chapter number character number table, and a final Letter Index table.

図４は本実施の形態に係る複写機１において、ハードディスク５５に記憶された章番号のパターン、章番号文字数テーブル、及び最終 Letter Indexテーブルを概念的に表す概念図である。図４Ａ、図４Ｂ及び図４Ｃは夫々章番号のパターン（章番号パターン）、章番号文字数テーブル及び最終 Letter Indexテーブルを示す。 FIG. 4 is a conceptual diagram conceptually showing the chapter number pattern, chapter number character number table, and final Letter Index table stored in the hard disk 55 in the copying machine 1 according to the present embodiment. FIG. 4A, FIG. 4B and FIG. 4C respectively show a chapter number pattern (chapter number pattern), a chapter number character table and a final Letter Index table.

ここで、章番号文字数テーブルは章表示文字列に含まれている章番号の構成文字数を前記章番号パターンに関連付けて列挙したものである。また、前記章番号パターンにおいては、章番号に該当する数文字が、例えば、昇順に複数パターン列挙されている。前記章番号パターンは「Chapter Pattern Index」によって確定され、該数文字は、例えば、「１」、「Ｉ」、「i」、「一」等のパターンを有する。 Here, the chapter number character table is a list of the number of characters of the chapter number contained in the chapter display character string in association with the chapter number pattern. In the chapter number pattern, several characters corresponding to the chapter number are listed, for example, in ascending order. The chapter number pattern is determined by "Chapter Pattern Index", and the several characters have, for example, patterns of "1", "I", "i", "one" and the like.

以下においては、昇順に列挙された各数文字が、「Letter Index」によって特定される。また、前記最終 Letter Indexテーブルには、各章番号パターンにおける最終の「Letter Index」が記載されている。なお、「Letter Index」は「０」から始まるものとする。 In the following, each few characters listed in ascending order are identified by "Letter Index". Further, the final "Letter Index" in each chapter number pattern is described in the final Letter Index table. Note that “Letter Index” starts from “0”.

また、以下の説明においては、説明の便宜上、全１００ページであって、１０章にて構成されている原稿の読み込みを行う場合であって、該原稿に章表示文字列として「第Ｘ章○○○○」が含まれているものとする。 Further, in the following description, for the sake of convenience of explanation, it is a case of reading an original consisting of 100 pages and consisting of 10 chapters. "○○○" shall be included.

まず、ユーザは、複写機１の両面自動原稿送り装置１１２に前記原稿を載置し、前記操作パネルを適宜操作することにより、斯かる原稿のコピーを指示する。この際、ＣＰＵ４０は前記操作パネルを介してユーザからコピーの指示を受け付ける。 First, the user places the original on the double-sided automatic original feeding device 112 of the copying machine 1 and operates the operation panel to instruct copying of the original. At this time, the CPU 40 receives a copy instruction from the user via the operation panel.

スキャナ制御部４２はＣＰＵ４０からの指示に応じて画像読取部１１０を制御して、前記原稿を一枚ずつ読み込み、該原稿に対する原稿画像データが得られる。このように得られた原稿画像データに対して、ＯＣＲ処理が施される。 The scanner control unit 42 controls the image reading unit 110 according to an instruction from the CPU 40, reads the document one by one, and obtains document image data for the document. An OCR process is performed on the document image data obtained in this manner.

次いで、文字サイズ検出部４７は、前記原稿画像データに対して最大文字サイズの検出を行う（ステップＳ１０１）。以下、最大文字サイズの検出の処理について詳しく説明する。 Next, the character size detection unit 47 detects the maximum character size for the document image data (step S101). The process of detecting the maximum character size will be described in detail below.

図５は本実施の形態に係る複写機１において、文字サイズ検出部４７によって行われる最大文字サイズ検出の処理を説明するフローチャートである。以下、最大文字サイズ検出の処理について詳しく説明する。 FIG. 5 is a flow chart for explaining the process of detecting the maximum character size performed by the character size detection unit 47 in the copying machine 1 according to the present embodiment. The process of maximum character size detection will be described in detail below.

先ず、文字サイズ検出部４７は変数の初期化を行う（ステップＳ２０１）。より詳しくは、原稿のページを表す変数「Page」を「１」に、最大文字サイズを表す変数「Letter Size」を「０」に初期化する。 First, the character size detection unit 47 initializes a variable (step S201). More specifically, the variable “Page” representing the page of the document is initialized to “1”, and the variable “Letter Size” representing the maximum character size is initialized to “0”.

次いで、文字サイズ検出部４７は前記原稿画像データに基づいて、前記原稿の全ページ数を取得する（ステップＳ２０２）。 Next, the character size detection unit 47 acquires the total number of pages of the document based on the document image data (step S202).

また、文字サイズ検出部４７は、前記原稿画像データから、現在の「Page」に該当するページの画像データを取得し（ステップＳ２０３）、取得された所定ページの画像データに対して、一行目の文字列の最初文字の文字サイズを検出する（ステップＳ２０４）。 Further, the character size detection unit 47 acquires the image data of the page corresponding to the current "Page" from the document image data (step S203), and the acquired image data of the predetermined page is displayed on the first line. The character size of the first character of the character string is detected (step S204).

続いて、文字サイズ検出部４７は検出された文字サイズが「Letter Size」に対応する文字サイズより大きいか判定を行う（ステップＳ２０５）。 Subsequently, the character size detection unit 47 determines whether the detected character size is larger than the character size corresponding to "Letter Size" (step S205).

文字サイズ検出部４７は検出された文字サイズが「Letter Size」に対応する文字サイズより大きいと判定した場合（ステップＳ２０５：ＹＥＳ）、「Letter Size」に対応する文字サイズを検出された文字サイズに置き換える（ステップＳ２０６）。 If the character size detection unit 47 determines that the detected character size is larger than the character size corresponding to “Letter Size” (step S 205: YES), the character size corresponding to “Letter Size” is detected as the detected character size Replace (step S206).

ステップＳ２０６の処理後、又は、検出された文字サイズが「Letter Size」に対応する文字サイズより大きくないと判定した場合（ステップＳ２０５：ＮＯ）、文字サイズ検出部４７は現在の「Page」に該当する数字が前記原稿の全ページ数と等しいか判定する（ステップＳ２０７）。 After the process of step S206, or when it is determined that the detected character size is not larger than the character size corresponding to "Letter Size" (step S205: NO), the character size detection unit 47 corresponds to the current "Page". It is determined whether the number to be processed is equal to the total number of pages of the original (step S207).

現在の「Page」に該当する数字が前記原稿の全ページ数と等しくないと判定した場合（ステップＳ２０７：ＮＯ）、文字サイズ検出部４７は、現在の「Page」に「１」を加算した数字を新たに「Page」とし（ステップＳ２０８）、処理を再びステップＳ２０３に戻す。 If it is determined that the number corresponding to the current "Page" is not equal to the total page number of the document (step S207: NO), the character size detection unit 47 adds "1" to the current "Page" Is newly set to "Page" (step S208), and the process returns to step S203.

一方、文字サイズ検出部４７によって現在の「Page」に該当する数字が前記原稿の全ページ数と等しいと判定された場合（ステップＳ２０７：ＹＥＳ）、最大文字サイズ検出の処理は終了する。 On the other hand, when it is determined by the character size detection unit 47 that the number corresponding to the current "Page" is equal to the total number of pages of the document (step S207: YES), the maximum character size detection process ends.

以上の最大文字サイズ検出の処理によって、前記原稿画像データにおける、最大文字サイズ、即ち「Letter Size」が検出される。 By the above-described processing of the maximum character size detection, the maximum character size, that is, “Letter Size” in the document image data is detected.

再び、図３に基づく説明に戻る。 Returning to the explanation based on FIG. 3 again.

このようにして、ステップＳ１０１にて最大文字サイズが検出されると、続いて、文字列抽出部４８は、文字サイズ検出部４７によって検出された最大文字サイズを有する文字列を、章表示文字列の候補として、抽出する（ステップＳ１０２）。 Thus, when the maximum character size is detected in step S101, subsequently, the character string extraction unit 48 displays the character string having the maximum character size detected by the character size detection unit 47 as a chapter display character string Are extracted as candidates (step S102).

図６は本実施の形態に係る複写機１において、文字列抽出部４８によって行われる文字列抽出の処理を説明するフローチャートである。以下、文字列抽出の処理について詳しく説明する。 FIG. 6 is a flow chart for explaining the character string extraction process performed by the character string extraction unit 48 in the copying machine 1 according to the present embodiment. Hereinafter, the process of character string extraction will be described in detail.

文字列抽出部４８は変数の初期化を行う（ステップＳ３０１）。より詳しくは、原稿のページを表す変数「Page」を「１」にし、変数「Index」を「０」に初期化する。 The character string extraction unit 48 initializes a variable (step S301). More specifically, the variable “Page” representing the page of the document is set to “1”, and the variable “Index” is initialized to “0”.

次いで、文字列抽出部４８は前記原稿画像データから、現在の「Page」に該当するページの画像データを取得し（ステップＳ３０２）、取得された所定ページの画像データに対して、一行目の文字列の最初文字の文字サイズを検出する（ステップＳ３０３）。 Next, the character string extraction unit 48 acquires the image data of the page corresponding to the current "Page" from the document image data (step S302), and the character of the first line is acquired for the acquired image data of the predetermined page. The character size of the first character of the column is detected (step S303).

続いて、文字列抽出部４８は、検出された文字サイズが既に定められた最大文字サイズ「Letter Size」に対応する文字サイズと等しいか否かの判定を行う（ステップＳ３０４）。 Subsequently, the character string extraction unit 48 determines whether the detected character size is equal to the character size corresponding to the predetermined maximum character size “Letter Size” (step S304).

文字列抽出部４８によって、検出された文字サイズが最大文字サイズ「Letter Size」に対応する文字サイズと等しくないと判定された場合（ステップＳ３０４：ＮＯ）、処理はステップＳ３０９に進む。 If the character string extraction unit 48 determines that the detected character size is not equal to the character size corresponding to the maximum character size “Letter Size” (step S304: NO), the process proceeds to step S309.

一方、文字列抽出部４８は、検出された文字サイズが既に定められた最大文字サイズ「Letter Size」に対応する文字サイズと等しいと判定した場合（ステップＳ３０４：ＹＥＳ）、斯かる画像データから、一行目の文字列を抽出する（ステップＳ３０５）。 On the other hand, when the character string extraction unit 48 determines that the detected character size is equal to the character size corresponding to the already determined maximum character size “Letter Size” (step S 304: YES), from such image data, The character string of the first line is extracted (step S305).

次いで、文字列抽出部４８は、抽出された文字列（以下、抽出文字列とも言う）を前記変数「Index」に関連付けて、例えば、ハードディスク５５に記憶し（ステップＳ３０６）、現在のページ番号、すなわち現在の「Page」に対応する数字を、該「Index」に関連付けて、例えば、ハードディスク５５に記憶する（ステップＳ３０７）。続けて、文字列抽出部４８は現在の「Index」に「１」を加算し、これを新たな「Index」とする（ステップＳ３０８）。換言すれば、ハードディスク５５には各「Index」に対応付けて抽出文字列及び当該ページ番号が記憶されている。 Next, the character string extraction unit 48 associates the extracted character string (hereinafter, also referred to as an extracted character string) with the variable “Index”, and stores it in, for example, the hard disk 55 (step S306). That is, a number corresponding to the current "Page" is associated with the "Index" and stored, for example, in the hard disk 55 (step S307). Subsequently, the character string extraction unit 48 adds “1” to the current “Index” and sets it as a new “Index” (step S308). In other words, the extracted character string and the page number are stored in the hard disk 55 in association with each “Index”.

ステップＳ３０８の後、文字列抽出部４８は、現在の「Page」に該当する数字が前記原稿の全ページ数と等しいか判定する（ステップＳ３０９）。 After step S308, the character string extraction unit 48 determines whether the number corresponding to the current "Page" is equal to the total number of pages of the document (step S309).

現在の「Page」に該当する数字が前記原稿の全ページ数と等しくないと判定した場合（ステップＳ３０９：ＮＯ）、文字列抽出部４８は、現在の「Page」に「１」を加算した数字を新たに「Page」とし（ステップＳ３１０）、処理を再びステップＳ３０２に戻す。 When it is determined that the number corresponding to the current "Page" is not equal to the total number of pages of the document (step S309: NO), the character string extraction unit 48 adds "1" to the current "Page". Is newly set to "Page" (step S310), and the process returns to step S302 again.

一方、文字サイズ検出部４７によって現在の「Page」に該当する数字が前記原稿の全ページ数と等しいと判定された場合（ステップＳ３０９：ＹＥＳ）、文字列抽出の処理は終了する。 On the other hand, when it is determined by the character size detection unit 47 that the number corresponding to the current "Page" is equal to the total number of pages of the document (step S309: YES), the character string extraction process ends.

以上の最大文字サイズ検出の処理により、前記原稿画像データにおいて、最大文字サイズ「Letter Size」を有する文字列が抽出される。 By the above-described processing of maximum character size detection, a character string having the maximum character size “Letter Size” is extracted from the document image data.

以上のようにして、最大文字サイズが検出され、検出された最大文字サイズを有する文字列が抽出された後、章情報取得部４９は前記章情報を取得する処理を行う（ステップＳ１０３）。 As described above, after the maximum character size is detected and the character string having the detected maximum character size is extracted, the chapter information acquisition unit 49 performs processing for acquiring the chapter information (step S103).

図７は本実施の形態に係る複写機１において、章情報取得部４９によって行われる章情報取得の処理を説明するフローチャートである。以下、章情報取得の処理について詳しく説明する。 FIG. 7 is a flowchart for explaining chapter information acquisition processing performed by the chapter information acquisition unit 49 in the copying machine 1 according to the present embodiment. The following describes the chapter information acquisition process in detail.

章情報取得部４９は変数の初期化を行う（ステップＳ４０１）。より詳しくは、文字列抽出の処理に係る「Index」（ステップＳ３０８参照）から「１」を引いた数値を「最終Index」とする。また、変数「Chapter Pattern」を「０」に初期化し、変数「Index」を「０」に初期化し、Error Flagをリセットする。 The chapter information acquisition unit 49 initializes the variable (step S401). More specifically, a value obtained by subtracting "1" from "Index" (see step S308) related to the character string extraction process is taken as "final index". Also, the variable “Chapter Pattern” is initialized to “0”, the variable “Index” is initialized to “0”, and the Error Flag is reset.

次いで、章情報取得部４９は、現在の「Index」に対応する抽出文字列をハードディスク５５から読み出し（ステップＳ４０２）、章文字パターンの検索の処理を行う（ステップＳ４０３）。 Next, the chapter information acquisition unit 49 reads the extracted character string corresponding to the current "Index" from the hard disk 55 (step S402), and performs a chapter character pattern search process (step S403).

図８は本実施の形態に係る複写機１において、章情報取得部４９によって行われる章文字パターンの検索の処理を説明するフローチャートである。以下、章文字パターンの検索の処理について詳しく説明する。 FIG. 8 is a flow chart for explaining a chapter character pattern search process performed by the chapter information acquisition unit 49 in the copying machine 1 according to the present embodiment. The processing of the chapter character pattern search will be described in detail below.

先ず、章情報取得部４９は、変数の初期化を行う（ステップＳ６０１）。より詳しくは、章情報取得部４９は「Chapter Pattern」及び「Chapter Pattern Index」（図４Ａ参照）を夫々「１」及び「０」に初期化する。 First, the chapter information acquisition unit 49 initializes a variable (step S601). More specifically, the chapter information acquisition unit 49 initializes “Chapter Pattern” and “Chapter Pattern Index” (see FIG. 4A) to “1” and “0”, respectively.

次いで、章情報取得部４９は、ステップＳ４０２にて読み出された抽出文字列の最初文字を抽出する（ステップＳ６０２）。また、章情報取得部４９はハードディスク５５に記憶された前記章番号パターンから、現在の「Chapter Pattern」に対応する「章番号に係る数文字」（図４Ａ参照）を読み出す（ステップＳ６０３）。 Next, the chapter information acquisition unit 49 extracts the first character of the extracted character string read in step S402 (step S602). Also, the chapter information acquisition unit 49 reads “several characters related to chapter number” (see FIG. 4A) corresponding to the current “Chapter Pattern” from the chapter number pattern stored in the hard disk 55 (step S603).

章情報取得部４９は、抽出した最初文字が、読み出された「章番号に係る数文字」と等しいか否かを判定する（ステップＳ６０４）。すなわち、前記章表示文字列として「Ｘ．○○○○」のような記載が存在する場合、最初文字「Ｘ」と、前記「章番号に係る数文字」とを比較する。 The chapter information acquisition unit 49 determines whether the extracted first character is equal to the read “several characters related to chapter number” (step S604). That is, when a description such as “X. ○ ○ ○” is present as the chapter display character string, the first character “X” is compared with the “several characters relating to the chapter number”.

章情報取得部４９は、抽出した最初文字が、読み出された「章番号に係る数文字」と等しいと判定した場合（ステップＳ６０４：ＹＥＳ）、斯かる「Index」及び「Chapter Pattern」を関連付けて記憶して、章文字パターンの検索の処理を終了する。 When the chapter information acquisition unit 49 determines that the extracted first character is equal to the read "several characters related to the chapter number" (step S604: YES), the "Index" and "Chapter Pattern" are associated with each other. , And the processing of the chapter character pattern search ends.

一方、章情報取得部４９は、抽出した最初文字が、読み出された「章番号に係る数文字」と等しくないと判定した場合（ステップＳ６０４：ＮＯ）、現在の「Chapter Pattern」に「１」を加算した数字を新たに「Chapter Pattern」とする（ステップＳ６０５）。 On the other hand, when the chapter information acquisition unit 49 determines that the extracted first character is not equal to the read "several characters related to the chapter number" (step S604: NO), the current "Chapter Pattern" is "1. The number obtained by adding "" is newly set as "Chapter Pattern" (step S605).

次いで、章情報取得部４９は、現在の「Chapter Pattern Index」が「５」であるか否かの判定を行う（ステップＳ６０６）。章情報取得部４９は、現在の「Chapter Pattern Index」が「５」でないと判定した場合（ステップＳ６０６：ＮＯ）、現在の「Chapter Pattern Index」に「１」を加算した数字を新たに「Chapter Pattern Index」とする（ステップＳ６０７）。以降、処理はステップＳ６０３に戻る。 Next, the chapter information acquisition unit 49 determines whether the current "Chapter Pattern Index" is "5" (step S606). If the chapter information acquisition unit 49 determines that the current "Chapter Pattern Index" is not "5" (step S606: NO), the chapter information acquisition unit 49 adds a "1" to the current "Chapter Pattern Index" to newly add "Chapter". It is set as "Pattern Index" (step S607). Thereafter, the process returns to step S603.

一方、章情報取得部４９によって、現在の「Chapter Pattern Index」が「５」であると判定した場合（ステップＳ６０６：ＹＥＳ）、換言すれば、最初文字に対応する「章番号に係る数文字」が見つからなかった場合は、前記章表示文字列として「第Ｘ章○○○○」のような記載が存在する場合を想定した処理が行われる。すなわち、第２番目の文字に対して、章文字パターンの検索の処理を行う。 On the other hand, when it is determined by the chapter information acquisition unit 49 that the current "Chapter Pattern Index" is "5" (step S606: YES), in other words, "several characters related to chapter number" corresponding to the first character Is not found, processing is performed on the assumption that there is a description such as "Chapter X XX" as the chapter display character string. That is, the processing of the chapter character pattern search is performed on the second character.

章情報取得部４９は、ステップＳ６０６にて「ＹＥＳ」と判定した場合、再び「Chapter Pattern Index」を「０」に初期化する（ステップＳ６０８）。 When the chapter information acquisition unit 49 determines “YES” in step S606, the chapter information acquisition unit 49 initializes “Chapter Pattern Index” to “0” again (step S608).

次いで、章情報取得部４９は、ステップＳ４０２にて読み出された抽出文字列の第２番目文字を抽出する（ステップＳ６０９）。また、章情報取得部４９は現在の「Chapter Pattern」に対応する「章番号に係る数文字」（図４Ａ参照）を読み出す（ステップＳ６１０）。 Next, the chapter information acquisition unit 49 extracts the second character of the extracted character string read in step S402 (step S609). Also, the chapter information acquisition unit 49 reads out “several characters related to the chapter number” (see FIG. 4A) corresponding to the current “Chapter Pattern” (step S610).

章情報取得部４９は、抽出した２番目文字が、読み出された「章番号に係る数文字」と等しいか否かを判定する（ステップＳ６１１）。章情報取得部４９は、抽出した２番目文字が、読み出された「章番号に係る数文字」と等しいと判定した場合（ステップＳ６１１：ＹＥＳ）、斯かる「Index」及び「Chapter Pattern」を関連付けて記憶して、章文字パターンの検索の処理を終了する。 The chapter information acquisition unit 49 determines whether the extracted second character is equal to the read “several characters related to chapter number” (step S611). If the chapter information acquisition unit 49 determines that the extracted second character is equal to the read “several characters related to the chapter number” (step S 611: YES), such “Index” and “Chapter Pattern” It associates and stores, and ends the processing of the chapter character pattern search.

一方、章情報取得部４９は、抽出した２番目文字が、読み出された「章番号に係る数文字」と等しくないと判定した場合（ステップＳ６１１：ＮＯ）、現在の「Chapter Pattern」に「１」を加算した数字を新たに「Chapter Pattern」とする（ステップＳ６１２）。 On the other hand, when the chapter information acquiring unit 49 determines that the extracted second character is not equal to the read "several characters related to the chapter number" (step S611: NO), the current "Chapter Pattern" is " A number obtained by adding "1" is newly set as "Chapter Pattern" (step S612).

次いで、章情報取得部４９は、現在の「Chapter Pattern Index」が「５」であるか否かの判定を行う（ステップＳ６１３）。章情報取得部４９は、現在の「Chapter Pattern Index」が「５」でないと判定した場合（ステップＳ６１３：ＮＯ）、現在の「Chapter Pattern Index」に「１」を加算した数字を新たに「Chapter Pattern Index」とする（ステップＳ６１４）。以降、処理はステップＳ６１０に戻る。 Next, the chapter information acquisition unit 49 determines whether the current "Chapter Pattern Index" is "5" (step S613). When the chapter information acquisition unit 49 determines that the current "Chapter Pattern Index" is not "5" (step S613: NO), the chapter information acquisition unit 49 adds a "1" to the current "Chapter Pattern Index" to "Chapter". It is set as "Pattern Index" (step S614). Thereafter, the process returns to step S610.

一方、章情報取得部４９は、現在の「Chapter Pattern Index」が「５」であると判定した場合（ステップＳ６１３：ＹＥＳ）、換言すれば、第２番目文字に対応する「章番号に係る数文字」も見つからなかった場合は、その旨ハードディスク５５に記憶する（ステップＳ６１５）。詳しくは、章情報取得部４９は「Chapter Pattern」が「０」であると記憶することにより、ステップＳ４０２で読み出された抽出文字列に対応する「章番号に係る数文字」が存在しない旨記憶する。 On the other hand, when the chapter information acquisition unit 49 determines that the current "Chapter Pattern Index" is "5" (step S613: YES), in other words, "the number related to the chapter number corresponding to the second character If no character is found, the fact is stored in the hard disk 55 (step S615). Specifically, the chapter information acquisition unit 49 stores that “Chapter Pattern” is “0”, thereby indicating that “several characters relating to the chapter number” corresponding to the extracted character string read in step S402 does not exist. Remember.

再び、図７に基づく説明に戻る。 It returns to the explanation based on FIG. 7 again.

このようにして、読み出された抽出文字列に対する、章文字パターンの検索の処理後、章情報取得部４９は、「Chapter Pattern」が「０」であるか否かの判定を行う（ステップＳ４０４）。 In this manner, after the chapter character pattern search process is performed on the extracted extracted character string, the chapter information acquisition unit 49 determines whether “Chapter Pattern” is “0” (step S404). ).

章情報取得部４９は、「Chapter Pattern」が「０」であると判定した場合（ステップＳ４０４：ＹＥＳ）、すなわち、合致する「章番号に係る数文字」がない場合、現在の「Index」が前記「最終Index」と等しいか否かの判定を行う（ステップＳ４１５）。 If the chapter information acquisition unit 49 determines that "Chapter Pattern" is "0" (step S404: YES), that is, if there is no matching "several characters related to the chapter number", the current "Index" is It is determined whether it is equal to the "final Index" (step S415).

章情報取得部４９は現在の「Index」が前記「最終Index」と等しいと判定した場合（ステップＳ４１５：ＹＥＳ）、Error Flagをセットする（ステップＳ４１６）。すなわち、全ての抽出文字列が、前記「章番号に係る数文字」の何れとも合致しない場合、Error Flagをセットすることにより、その旨記憶する。 If the chapter information acquisition unit 49 determines that the current "Index" is equal to the "final Index" (step S415: YES), it sets an error flag (step S416). That is, when all the extracted character strings do not match any of the "several characters related to chapter number", the fact is stored by setting the Error Flag.

一方、章情報取得部４９は現在の「Index」が前記「最終Index」と等しくないと判定した場合（ステップＳ４１５：ＮＯ）、現在の「Index」に「１」を加算した数字を新たに「Index」とし（ステップＳ４１７）、処理をステップＳ４０２に戻し、次の「Index」に対しても上述した処理を施す。 On the other hand, when the chapter information acquisition unit 49 determines that the current "Index" is not equal to the "final Index" (step S415: NO), the number obtained by adding "1" to the current "Index" is newly added. In step S417, the process returns to step S402, and the above-described process is performed on the next "index".

しかし、章情報取得部４９は、「Chapter Pattern」が「０」でないと判定した場合（ステップＳ４０４：ＮＯ）、変数「Chapter Number Next Index」を「０」に設定し（ステップＳ４０５）、「Chapter data」を初期化する（ステップＳ４０６）。ここで「Chapter data」はいわゆる２次元データである。
However, when the chapter information acquisition unit 49 determines that “Chapter Pattern” is not “0” (step S404: NO), the variable “Chapter Number Next Index” is set to “0” (step S405), and “Chapter "data" is initialized (step S406). Here, “Chapter data” is so-called two-dimensional data.

次いで、章情報取得部４９は、現在の「Index」に対応する抽出文字列をハードディスク５５から再び読み出し（ステップＳ４０７）、章番号文字合致照合の処理を行う（ステップＳ４０８）。 Next, the chapter information acquisition unit 49 again reads out the extracted character string corresponding to the current "Index" from the hard disk 55 (step S407), and performs chapter number character matching verification processing (step S408).

章情報取得部４９は、斯かる章番号文字合致照合の処理において、前記章番号パターンに基づいて、章表示文字列に含まれている章番号を割り出し、該章番号をIndexとして対応する章文字列及び章ページ番号を関連付けて記憶する。 The chapter information acquisition unit 49 determines the chapter number included in the chapter display character string based on the chapter number pattern in the chapter number character matching process, and the chapter number corresponding to the chapter number as an index. Associate and store column and chapter page numbers.

図９は本実施の形態に係る複写機１において、章情報取得部４９によって行われる章番号文字合致照合の処理を説明するフローチャートである。以下、章番号文字合致照合の処理について詳しく説明する。 FIG. 9 is a flow chart for explaining the chapter number character matching process performed by the chapter information acquisition unit 49 in the copying machine 1 according to the present embodiment. The process of chapter number character matching will be described in detail below.

先ず、章情報取得部４９は変数の設定を行う（ステップＳ７０１）。より詳しくは、章情報取得部４９は変数「Chapter Number」を「０」に設定し、変数「Letter Index」に「Chapter Number Next Index」を代入する。 First, the chapter information acquisition unit 49 sets a variable (step S701). More specifically, the chapter information acquisition unit 49 sets the variable “Chapter Number” to “0”, and substitutes “Chapter Number Next Index” for the variable “Letter Index”.

ここで「Letter Index」は、図４Ａに示した章番号パターンの「章番号に係る数文字」における、数文字の列挙順を示すものであり、該列挙順は昇順である。また、「Chapter Number Next Index」は「０」から始まる。 Here, “Letter Index” indicates the order of enumeration of several characters in “several characters related to chapter number” of the chapter number pattern shown in FIG. 4A, and the enumeration order is in ascending order. Also, "Chapter Number Next Index" starts from "0".

章情報取得部４９は、図７のステップＳ４０３にて行われた章文字パターンの検索の処理結果に基づき、当該抽出文字列に対して、「Chapter Pattern」が「６」以下であるか否かの判定を行う（ステップＳ７０２）。すなわち、最初文字が章番号に該当するか、第２番目文字が章番号に該当するかの判定を行う。 The chapter information acquisition unit 49 determines whether or not “Chapter Pattern” is “6 or less” for the extracted character string based on the processing result of the chapter character pattern search performed in step S 403 of FIG. 7. The determination is made (step S702). That is, it is determined whether the first character corresponds to the chapter number and the second character corresponds to the chapter number.

章情報取得部４９は、当該抽出文字列に対して、「Chapter Pattern」が「６」以下であると判定した場合（ステップＳ７０２：ＹＥＳ）、すなわち、最初文字が章番号に該当する場合、当該「Chapter Pattern」から「１」を引いた数を「Chapter Pattern Index」に代入し（ステップＳ７０３）、該「Chapter Pattern Index」及び「Letter Index」に対応する、図４Ｂに示す「章番号の構成文字数」をハードディスク５５から読み出す（ステップＳ７０４）。 If the chapter information acquisition unit 49 determines that “Chapter Pattern” is “6” or less for the extracted character string (step S 702: YES), that is, if the first character corresponds to the chapter number, The number obtained by subtracting "1" from "Chapter Pattern" is substituted for "Chapter Pattern Index" (step S703), and the "Chapter Number Configuration" shown in FIG. 4B corresponding to the "Chapter Pattern Index" and "Letter Index". The number of characters is read out from the hard disk 55 (step S704).

章情報取得部４９は、当該抽出文字列に対して、前記「章番号の構成文字数」に基づいて、最初の文字から１つ又は２つの文字を抜き出す（ステップＳ７０５）。 The chapter information acquisition unit 49 extracts one or two characters from the first character of the extracted character string based on the "number of characters constituting the chapter number" (step S705).

一方、章情報取得部４９は、当該抽出文字列に対して、「Chapter Pattern」が「６」以下でないと判定した場合（ステップＳ７０２：ＮＯ）、すなわち、第２番目文字が章番号に該当する場合、当該「Chapter Pattern」から「７」を引いた数を「Chapter Pattern Index」に代入し（ステップＳ７０９）、該「Chapter Pattern Index」及び「Letter Index」に対応する、図４Ｂに示す「章番号の構成文字数」をハードディスク５５から読み出す（ステップＳ７１０）。 On the other hand, when the chapter information acquisition unit 49 determines that “Chapter Pattern” is not less than “6” for the extracted character string (step S702: NO), that is, the second character corresponds to the chapter number. In this case, the number obtained by subtracting "7" from the "Chapter Pattern" is substituted for "Chapter Pattern Index" (step S709), and the "chapter" shown in FIG. 4B corresponding to the "Chapter Pattern Index" and "Letter Index". The number of characters constituting the number is read out from the hard disk 55 (step S710).

章情報取得部４９は、当該抽出文字列に対して、前記「章番号の構成文字数」に基づいて、第２番目の文字から１つ又は２つの文字を抜き出す（ステップＳ７１１）。以下においては、ステップＳ７０５又はステップＳ７１１にて抜き出された１つ又は２つの文字を抜き出し文字と言う。 The chapter information acquisition unit 49 extracts one or two characters from the second character based on the “number of characters constituting the chapter number” for the extracted character string (step S711). Hereinafter, one or two characters extracted in step S705 or step S711 are referred to as extracted characters.

ステップＳ７０５又はステップＳ７１１の処理後、章情報取得部４９は、当該「Chapter Pattern Index」及び前記「Letter Index」に対応する、前記章番号パターンの「章番号に係る数文字」を読み出す（ステップＳ７０６）。また、章情報取得部４９は、読み出された「章番号に係る数文字」と前記抜き出し文字とが等しいか否かを判定する（ステップＳ７０７）。 After the process of step S705 or step S711, the chapter information acquisition unit 49 reads out "several characters related to chapter number" of the chapter number pattern corresponding to the "Chapter Pattern Index" and the "Letter Index" (step S706). ). Further, the chapter information acquisition unit 49 determines whether or not the read “several characters related to chapter number” is equal to the extracted character (step S 707).

章情報取得部４９は、読み出された「章番号に係る数文字」と前記抜き出し文字とが等しいと判定した場合（ステップＳ７０７：ＹＥＳ）、現在の「Letter Index」に「１」を加算した数を「Chapter Number」として代入する（ステップＳ７０８）。これによって、章番号が割り出すことが出来る。 When the chapter information acquisition unit 49 determines that the “several characters related to the chapter number” and the extracted characters are equal (step S 707: YES), “1” is added to the current “Letter Index”. A number is substituted as "Chapter Number" (step S708). The chapter number can be determined by this.

一方、章情報取得部４９によって、読み出された「章番号に係る数文字」と前記抜き出し文字とが等しくないと判定された場合（ステップＳ７０７：ＮＯ）、次の章番号と一致するか確認を行う。 On the other hand, when it is determined by the chapter information acquisition unit 49 that the read “several characters related to chapter number” is not equal to the extracted character (step S 707: NO), confirmation is made as to whether it matches the next chapter number. I do.

すなわち、章情報取得部４９は、図４Ｃの最終 Letter Indexテーブルをハードディスク５５から読み出し（ステップＳ７１２）、該最終 Letter Indexテーブルに基づいて、現在の「Letter Index」が最終Letter Indexと等しいか否かの判定を行う（ステップＳ７１３）。 That is, the chapter information acquisition unit 49 reads the final Letter Index table of FIG. 4C from the hard disk 55 (step S 712), and based on the final Letter Index table, whether the current “Letter Index” is equal to the final Letter Index The determination is made (step S713).

章情報取得部４９は、現在の「Letter Index」が最終Letter Indexと等しくないと判定した場合（ステップＳ７１３：ＮＯ）、現在の「Letter Index」に「１」を加算した数を新たな「Letter Index」として代入し（ステップＳ７１４）、処理をステップＳ７０２に戻す。 If the chapter information acquisition unit 49 determines that the current "Letter Index" is not equal to the final Letter Index (step S713: NO), the number "Letter Index" plus "1" is added to the new "Letter". It substitutes as "Index" (step S714), and returns a process to step S702.

一方、章情報取得部４９は、現在の「Letter Index」が最終Letter Indexと等しいと判定した場合（ステップＳ７１３：ＹＥＳ）、すなわち、章番号の割り出しが出来なかった場合、「Chapter Number」を「０」のままにして斯かる章番号文字合致照合の処理を終了する。 On the other hand, if the chapter information acquisition unit 49 determines that the current "Letter Index" is equal to the final Letter Index (step S713: YES), that is, if the chapter number can not be determined, "Chapter Number" End the process of such chapter number character matching by leaving 0 ".

このようにして、読み出された抽出文字列に対する、章番号文字合致照合の処理後、章情報取得部４９は、「Chapter Number」が「０」であるか否かの判定を行う（ステップＳ４０９）。章情報取得部４９は、「Chapter Number」が「０」であると判定した場合（ステップＳ４０９：ＹＥＳ）、処理をステップＳ４１３に進める。 Thus, after the chapter number character matching collation process on the extracted extracted character string, the chapter information acquiring unit 49 determines whether "Chapter Number" is "0" (step S409). ). When the chapter information acquisition unit 49 determines that “Chapter Number” is “0” (step S409: YES), the process proceeds to step S413.

一方、章情報取得部４９は、「Chapter Number」が「０」でないと判定した場合（ステップＳ４０９：ＮＯ）、前記「Chapter Number Next Index」に「Chapter Number」を代入し、変数「Chapter Index」には「Chapter Number」から「１」を引いた数値を代入する（ステップＳ４１０）。 On the other hand, when the chapter information acquisition unit 49 determines that “Chapter Number” is not “0” (Step S 409: NO), “Chapter Number” is substituted for “Chapter Number Next Index”, and the variable “Chapter Index” A numerical value obtained by subtracting "1" from "Chapter Number" is substituted for (Step S410).

次いで、章情報取得部４９は、「Chapter Index」に対応付けて、ステップＳ４０７にて読み出された抽出文字列を章文字列として、例えば、ハードディスク５５に記憶し（ステップＳ４１１）、また、「Chapter Index」に対応付けて、前記抽出文字列に係るページ番号（章ページ番号）をハードディスク５５に記憶する（ステップＳ４１２）。 Next, the chapter information acquisition unit 49 stores the extracted character string read in step S407 as a chapter character string in, for example, the hard disk 55 in association with "Chapter Index" (step S411), The page number (chapter page number) relating to the extracted character string is stored in the hard disk 55 in association with "Chapter Index" (step S412).

また、章情報取得部４９は現在の「Index」が「最終Index」と等しいか否かの判定を行う（ステップＳ４１３）。 Also, the chapter information acquisition unit 49 determines whether the current "Index" is equal to the "final Index" (step S413).

章情報取得部４９は現在の「Index」が前記「最終Index」と等しくないと判定した場合（ステップＳ４１３：ＮＯ）、現在の「Index」に「１」を加算した数字を新たに「Index」とし（ステップＳ４１４）、処理をステップＳ４０７に戻す。 When the chapter information acquisition unit 49 determines that the current "Index" is not equal to the "final Index" (step S413: NO), the number obtained by adding "1" to the current "Index" is newly added to the "Index". Then (step S414), the process returns to step S407.

一方、章情報取得部４９は現在の「Index」が前記「最終Index」と等しいと判定した場合（ステップＳ４１３：ＹＥＳ）、処理を終了する。
以上の処理によって、章番号、該章番号に対応する章文字列及び章ページ番号を含む章情報が取得される。 On the other hand, when the chapter information acquisition unit 49 determines that the current "Index" is equal to the "final Index" (step S413: YES), the process is ended.
By the above processing, chapter information including a chapter number, a chapter character string corresponding to the chapter number, and a chapter page number is acquired.

以上のようにして、章情報取得部４９により、前記章情報を取得する処理がされた後、ＣＰＵ４０は、エラーが発生したか否かを判定する（ステップＳ１０４）。 As described above, after the chapter information acquisition unit 49 performs the process of acquiring the chapter information, the CPU 40 determines whether an error has occurred (step S104).

前記ステップＳ１０３にて、Error Flagがセットされていれば、ＣＰＵ４０はエラーが発生したと判定し（ステップＳ１０４：ＹＥＳ）、章情報がない旨を前記表示部に表示する（ステップＳ１０８）。以降、処理は終了する。 If the error flag is set in step S103, the CPU 40 determines that an error has occurred (step S104: YES), and displays on the display unit that there is no chapter information (step S108). Thereafter, the process ends.

前記ステップＳ１０３にて、Error Flagがセットされていなければ、ＣＰＵ４０はエラーが発生していないと判定し（ステップＳ１０４：ＮＯ）、抜け補完部５０が前記抜け補完の処理を行う（ステップＳ１０５）。 If the Error Flag is not set in step S103, the CPU 40 determines that an error has not occurred (step S104: NO), and the missing complement unit 50 performs the missing complement processing (step S105).

図１０及び図１１は本実施の形態に係る複写機１において、抜け補完部５０によって行われる抜け補完の処理を説明するフローチャートである。以下、抜け補完の処理について詳しく説明する。 FIG. 10 and FIG. 11 are flowcharts for explaining the process of the dropout complementation performed by the dropout complement unit 50 in the copying machine 1 according to the present embodiment. Hereinafter, the process of the missing complement will be described in detail.

抜け補完部５０は変数の初期化を行う（ステップＳ５０１）。より詳しくは、最後の章を示す「Last Chapter Index」に「Chapter Index」を代入し、「Chapter Index」に「１」を設定する。また、「Start Chapter Number Index」に「１」を代入する。 The missing complement unit 50 initializes the variable (step S501). More specifically, "Chapter Index" is substituted for "Last Chapter Index" indicating the last chapter, and "1" is set for "Chapter Index". Also, substitute “1” for “Start Chapter Number Index”.

次いで、抜け補完部５０は、前記「Last Chapter Index」が「０」であるか否かを判定する（ステップＳ５０２）。すなわち、ステップＳ４１０にて「Chapter Index」は「Chapter Number」から「１」を引いた値であることから、斯かる判定は、斯かる原稿が章を１つ含むか又は複数含むかが判定される。 Next, the missing complement unit 50 determines whether the "Last Chapter Index" is "0" (step S502). That is, since “Chapter Index” is a value obtained by subtracting “1” from “Chapter Number” in step S410, such determination determines whether such a document includes one or more chapters. Ru.

抜け補完部５０は、前記「Last Chapter Index」が「０」であると判定した場合（ステップＳ５０２：ＹＥＳ）、すなわち、章が１つである場合、処理をステップＳ５１５に進める。 If it is determined that the “Last Chapter Index” is “0” (step S 502: YES), that is, if there is one chapter, the dropout complementing unit 50 advances the process to step S 515.

抜け補完部５０は、前記「Last Chapter Index」が「０」でないと判定した場合（ステップＳ５０２：ＮＯ）、すなわち、章が複数である場合、抜け補完部５０は現在の「Chapter Index」に対応する章ページ番号をハードディスク５５から読み出す（ステップＳ５０３）。 If it is determined that the “Last Chapter Index” is not “0” (step S 502: NO), that is, if there are a plurality of chapters, the missing complement unit 50 corresponds to the current “Chapter Index”. The chapter page number to be read is read out from the hard disk 55 (step S503).

次いで、抜け補完部５０は読み出した章ページ番号が「０」に等しいか否かの判定を行う（ステップＳ５０４）。換言すれば、抜け補完部５０は、現在の「Chapter Index」に係る章番号に対応する章ページ番号が存在するか否かを判定する。 Next, the missing part complement unit 50 determines whether the chapter page number read out is equal to "0" (step S504). In other words, the missing part complement unit 50 determines whether or not there is a chapter page number corresponding to the chapter number according to the current "Chapter Index".

抜け補完部５０は読み出した章ページ番号が「０」に等しくないと判定した場合（ステップＳ５０４：ＮＯ）、現在の「Chapter Index」が前記「Last Chapter Index」と等しいか否かの判定を行う（ステップＳ５０５）。 If it is determined that the chapter page number read out is not equal to "0" (step S504: NO), it is determined whether or not the current "Chapter Index" is equal to the "Last Chapter Index". (Step S505).

抜け補完部５０は、現在の「Chapter Index」が前記「Last Chapter Index」と等しいと判定した場合（ステップＳ５０５：ＹＥＳ）、処理をステップＳ５１５に進める。 When it is determined that the current “Chapter Index” is equal to the “Last Chapter Index” (step S505: YES), the missing part complement unit 50 proceeds the process to step S515.

一方、抜け補完部５０は、現在の「Chapter Index」が前記「Last Chapter Index」と等しくないと判定した場合（ステップＳ５０５：ＮＯ）、変数「Chapter Page Start」に当該章ページ番号を代入し（ステップＳ５０６）、現在の「Chapter Index」に「１」を加算した数字を新たに「Chapter Index」とし、かつ、「Start Chapter Number Index」に「Chapter Index」を代入する（ステップＳ５０７）。以降、処理はステップＳ５０３に戻る。 On the other hand, when it is determined that the current "Chapter Index" is not equal to the "Last Chapter Index" (step S505: NO), the missing completion unit 50 substitutes the chapter page number for the variable "Chapter Page Start" Step S506) A number obtained by adding “1” to the current “Chapter Index” is newly set as “Chapter Index”, and “Chapter Index” is substituted for “Start Chapter Number Index” (Step S507). Thereafter, the process returns to step S503.

しかし、ステップＳ５０４にて、抜け補完部５０は、読み出した章ページ番号が「０」に等しいと判定した場合（ステップＳ５０４：ＹＥＳ）、換言すれば、章番号の抜けがある場合、抜けている章番号の数を表す変数「Adjust Chapter Number」に「０」を設定する（ステップＳ５０８）。 However, if it is determined in step S504 that the missing chapter complementing section 50 determines that the read chapter page number is equal to "0" (step S504: YES), in other words, if there is a missing chapter number, it is missing. The variable "Adjust Chapter Number" representing the number of chapter numbers is set to "0" (step S508).

次いで、抜け補完部５０は、現在の「Adjust Chapter Number」に「１」を加算した数字を新たに「Adjust Chapter Number」とし（ステップＳ５０９）、また、現在の「Chapter Index」に「１」を加算した数字を新たに「Chapter Index」とする（ステップＳ５１０）。 Next, the missing completion unit 50 newly sets a number obtained by adding "1" to the current "Adjust Chapter Number" as "Adjust Chapter Number" (step S509), and "1" in the current "Chapter Index". The added number is newly set as "Chapter Index" (step S510).

また、抜け補完部５０は現在の「Chapter Index」に対応する章ページ番号をハードディスク５５から読み出す（ステップＳ５１１）。 Further, the missing part complement unit 50 reads out the chapter page number corresponding to the current "Chapter Index" from the hard disk 55 (step S511).

次いで、抜け補完部５０は読み出した章ページ番号が「０」に等しいか否かの判定を行う（ステップＳ５１２）。換言すれば、抜け補完部５０は、現在の「Chapter Index」に係る章番号に対応する章ページ番号が存在するか否かを判定する。 Next, the missing part complement unit 50 determines whether the read chapter page number is equal to "0" (step S512). In other words, the missing part complement unit 50 determines whether or not there is a chapter page number corresponding to the chapter number according to the current "Chapter Index".

抜け補完部５０は、読み出した章ページ番号が「０」に等しいと判定した場合（ステップＳ５１２：ＹＥＳ）、処理をステップＳ５０９に戻し、昇順において抜けている章番号の算出を続ける。 If it is determined that the chapter page number read out is equal to “0” (step S 512: YES), the process is returned to step S 509, and calculation of the chapter number missing in ascending order is continued.

一方、抜け補完部５０は、読み出した章ページ番号が「０」に等しくないと判定した場合（ステップＳ５１２：ＮＯ）、前記抜け補完に係る第１補完処理を行う（ステップＳ５１３）。 On the other hand, when it is determined that the chapter page number read out is not equal to “0” (step S 512: NO), the missing part complementing unit 50 performs the first complementation processing relating to the missing complement (step S 513).

図１２及び図１３は本実施の形態に係る複写機１において、抜け補完部５０によって行われる第１補完の処理を説明するフローチャートである。以下、該第１補完の処理について詳しく説明する。 12 and 13 are flowcharts for explaining the process of the first complementation performed by the missing part complementing unit 50 in the copying machine 1 according to the present embodiment. Hereinafter, the process of the first complement will be described in detail.

抜け補完部５０は変数の初期化を行う（ステップＳ８０１）。より詳しくは、抜け補完部５０は変数「Page Index」に章の初めのページ番号である「Chapter Page Start」を代入し、最後のページを表す「Page End Index」には、次の章に係る章ページ番号から「１」を引く「Chapter Page‐１」を代入する。また、抜け補完部５０は、次の章を指す「Chapter Number Next Index」に「Start Chapter Number Index」を代入する。 The missing complement unit 50 initializes a variable (step S801). More specifically, the missing completion unit 50 substitutes the variable "Page Index" for the first page number "Chapter Page Start" in the chapter, and the last page "Page End Index" for the next chapter. Substitute "Chapter Page-1" by subtracting "1" from the chapter page number. In addition, the missing completion unit 50 substitutes “Start Chapter Number Index” into “Chapter Number Next Index” that indicates the next chapter.

抜け補完部５０は、前記原稿画像データから現在の「Page Index」に対応するページの画像データを読み出して、該画像データにおける行数の検出を行う（ステップＳ８０２）。 The missing part complement unit 50 reads out the image data of the page corresponding to the current "Page Index" from the document image data, and detects the number of lines in the image data (step S802).

次いで、抜け補完部５０は、最終行を示す「Line End Index」に、検出された行数から「１」を引いた数値を代入し（ステップＳ８０３）、また、「Line Index」に「１」を代入する（ステップＳ８０４）。すなわち、章文字列を除いて２行目から斯かる処理が行われる。 Next, the missing part complement unit 50 substitutes a numerical value obtained by subtracting “1” from the detected number of lines into “Line End Index” indicating the final line (step S 803), and “1” in “Line Index”. Is substituted (step S804). That is, such processing is performed from the second line except for the chapter character string.

また、抜け補完部５０は現在の「Page Index」に対応するページの画像データを読み出して現在の「Line Index」に対応する行に係る文字列データを抽出する（ステップＳ８０５）。 Also, the missing part complement unit 50 reads out the image data of the page corresponding to the current "Page Index", and extracts the character string data related to the line corresponding to the current "Line Index" (step S805).

抜け補完部５０は、抽出された文字列データの文字サイズが既に定められた最大文字サイズ「Letter Size」に対応する文字サイズと等しいか判定を行う（ステップＳ８０６）。 The missing character complementing unit 50 determines whether the character size of the extracted character string data is equal to the character size corresponding to the predetermined maximum character size “Letter Size” (step S806).

抜け補完部５０によって、抽出された文字列データの文字サイズが前記最大文字サイズ「Letter Size」に対応する文字サイズと等しくないと判定された場合（ステップＳ８０６：ＮＯ）、処理はステップＳ８１４に進む。 If it is determined that the character size of the extracted character string data is not equal to the character size corresponding to the maximum character size “Letter Size” (step S806: NO), the process proceeds to step S814 .

一方、抜け補完部５０は、抽出された文字列データの文字サイズが前記最大文字サイズ「Letter Size」に対応する文字サイズと等しいと判定した場合（ステップＳ８０６：ＹＥＳ）、抽出された文字列データに対して章情報取得部４９が前記章番号文字合致照合の処理を行う（ステップＳ８０７）。章情報取得部４９による章番号文字合致照合の処理については、図９にて既に説明しており、詳しい説明を省略する。 On the other hand, when it is determined that the character size of the extracted character string data is equal to the character size corresponding to the maximum character size “Letter Size” (step S806: YES), the missing character complementing unit 50 extracts the extracted character string data The chapter information acquisition unit 49 performs the chapter number character matching check process (step S807). The processing of the chapter number character matching by the chapter information acquisition unit 49 has already been described in FIG. 9, and the detailed description will be omitted.

このように、章番号文字合致照合の処理後、抜け補完部５０は、「Chapter Number」が「０」であるか否かの判定を行う（ステップＳ８０８）。抜け補完部５０は、「Chapter Number」が「０」であると判定した場合（ステップＳ８０８：ＹＥＳ）、処理をステップＳ８１４に進める。 Thus, after the chapter number character matching process, the missing character complementing unit 50 determines whether the "Chapter Number" is "0" (step S808). When it is determined that the "Chapter Number" is "0" (step S808: YES), the missing part complement unit 50 proceeds with the process to step S814.

一方、抜け補完部５０は、「Chapter Number」が「０」でないと判定した場合（ステップＳ８０８：ＮＯ）、前記「Chapter Number Next Index」に「Chapter Number」を代入し、変数「Chapter Index」には「Chapter Number」から「１」を引いた数値を代入する（ステップＳ８０９）。 On the other hand, when it is determined that the "Chapter Number" is not "0" (Step S808: NO), the missing completion unit 50 substitutes "Chapter Number" for the "Chapter Number Next Index", and sets the variable "Chapter Index". The value of "Chapter Number" minus "1" is substituted (Step S809).

次いで、抜け補完部５０は、「Chapter Index」に対応付けて、ステップＳ８０５にて読み出された文字列データを章文字列として、例えば、ハードディスク５５に記憶し（ステップＳ８１０）、また、「Chapter Index」に対応付けて、現在の「Page Index」に「１」を加算した数値を、章ページ番号として、ハードディスク５５に記憶する（ステップＳ８１１）。 Next, the missing complement unit 50 stores the character string data read in step S805 as a chapter character string, for example, in the hard disk 55, in association with "Chapter Index" (step S810). A numerical value obtained by adding “1” to the current “Page Index” in association with “Index” is stored in the hard disk 55 as a chapter page number (step S811).

以上の処理によって、抜けた章（抜け章番号）が１箇所検出されたので、前記「Adjust Chapter Number」から「１」を引いた数値を新たな「Adjust Chapter Number」に代入する（ステップＳ８１２）。 Since one missing chapter (missing chapter number) is detected by the above processing, the numerical value obtained by subtracting "1" from the "Adjust Chapter Number" is substituted for a new "Adjust Chapter Number" (step S812). .

次いで、抜け補完部５０は現在の「Adjust Chapter Number」が「０」か否かの判定を行う（ステップＳ８１３）。抜け補完部５０によって現在の「Adjust Chapter Number」が「０」であると判定された場合（ステップＳ８１３：ＹＥＳ）、抜け章番号はないので、第１補完の処理は終了する。 Next, the missing part complement unit 50 determines whether the current "Adjust Chapter Number" is "0" (step S813). If it is determined by the missing complement unit 50 that the current "Adjust Chapter Number" is "0" (step S813: YES), since there is no missing chapter number, the processing of the first complement is ended.

抜け補完部５０は、現在の「Adjust Chapter Number」が「０」でないと判定された場合（ステップＳ８１３：ＮＯ）、現在の「Line Index」が前記「Line End Index」と等しいか否かの判定を行う（ステップＳ８１４）。 If it is determined that the current "Adjust Chapter Number" is not "0" (step S813: NO), the missing complement unit 50 determines whether the current "Line Index" is equal to the "Line End Index". (Step S814).

抜け補完部５０は、現在の「Line Index」が前記「Line End Index」と等しくないと判定をした場合（ステップＳ８１４：ＮＯ）、現在の「Line Index」に「１」を加算した数字を新たに「Line Index」とし（ステップＳ８１５）、再び、処理をステップＳ８０５に戻す。 When it is determined that the current "Line Index" is not equal to the "Line End Index" (step S814: NO), the missing part complement unit 50 newly adds a number obtained by adding "1" to the current "Line Index". To “Line Index” (step S815), and the process returns to step S805 again.

一方、抜け補完部５０は、現在の「Line Index」が前記「Line End Index」と等しいと判定した場合（ステップＳ８１４：ＹＥＳ）、すなわち、現在の「Page Index」に係るページ画像データに対する処理が終わった場合、再び、現在の「Page Index」が前記「Page End Index」と等しいか否かの判定を行う（ステップＳ８１６）。 On the other hand, when it is determined that the current "Line Index" is equal to the "Line End Index" (step S814: YES), the missing complement unit 50 processes the page image data related to the current "Page Index". If it has ended, it is judged again whether or not the current "Page Index" is equal to the "Page End Index" (step S816).

抜け補完部５０は、現在の「Page Index」が前記「 Page End Index」と等しくないと判定をした場合（ステップＳ８１６：ＮＯ）、現在の「Page Index」に「１」を加算した数字を新たに「Page Index」とし（ステップＳ８１７）、処理をステップＳ８０２に戻す。すなわち、次のページに対して同様の処理を施す。 When it is determined that the current “Page Index” is not equal to the “Page End Index” (step S 816: NO), the missing complement unit 50 newly adds a number obtained by adding “1” to the current “Page Index”. To “Page Index” (step S 817), and the process returns to step S 802. That is, the same process is performed on the next page.

一方、抜け補完部５０は、現在の「Page Index」が前記「Page End Index」と等しいと判定した場合（ステップＳ８１６：ＹＥＳ）、Error Flagをセットし（ステップＳ８１８）、第１補完の処理を終了する。すなわち、補完できてない抜け章番号が存在する旨記憶する。 On the other hand, when it is determined that the current "Page Index" is equal to the "Page End Index" (Step S816: YES), the missing complement unit 50 sets an Error Flag (Step S818), and performs the first complement processing. finish. That is, it stores that there is a missing chapter number that can not be complemented.

再び、図１０及び図１１の説明に戻る。 It returns to the explanation of FIG. 10 and FIG. 11 again.

このように、第１補完の処理が終了した後、抜け補完部５０は、現在の「Chapter Index」が前記「Last Chapter Index」と等しいか否かの判定を行う（ステップＳ５１４）。 As described above, after the process of the first complement is completed, the missing complement unit 50 determines whether the current "Chapter Index" is equal to the "Last Chapter Index" (step S514).

抜け補完部５０は、現在の「Chapter Index」が前記「Last Chapter Index」と等しくないと判定した場合（ステップＳ５１４：ＮＯ）、再び、処理をステップＳ５０６に戻す。 When it is determined that the current "Chapter Index" is not equal to the "Last Chapter Index" (step S514: NO), the missing complement unit 50 returns the process to step S506 again.

また、抜け補完部５０は、現在の「Chapter Index」が前記「Last Chapter Index」と等しいと判定した場合（ステップＳ５１４：ＹＥＳ）、処理をステップＳ５１５に進める。 Further, when it is determined that the current “Chapter Index” is equal to the “Last Chapter Index” (step S514: YES), the missing part complement unit 50 advances the process to step S515.

すなわち、ステップＳ５１４で、現在の「Chapter Index」が前記「Last Chapter Index」と等しいと判定された場合、又は、ステップＳ５０２で、前記「Last Chapter Index」が「０」であると判定した場合、最終の章（章が１つのみの場合を含む。）内において、抜け章番号の補完の処理を行う。 That is, if it is determined in step S514 that the current "Chapter Index" is equal to the "Last Chapter Index", or if it is determined in step S502 that the "Last Chapter Index" is "0". In the final chapter (including the case where there is only one chapter), handle the completion of the missing chapter number.

抜け補完部５０は、前記原稿画像データに基づいて、最終ページ番号を取得する（ステップＳ５１５）。 The missing part complement unit 50 acquires the final page number based on the document image data (step S515).

次いで、抜け補完部５０は、「Chapter Page」に「Chapter Page」に「１」を加算した値を設定し、前記「Adjust Chapter Number」を「０」に設定する（ステップＳ５１６）。また、抜け補完部５０は「Last Chapter Index」に対応する章ページ番号をハードディスク５５から読み出す（ステップＳ５１７）。 Next, the missing part complement unit 50 sets a value obtained by adding “1” to “Chapter Page” in “Chapter Page”, and sets “Adjust Chapter Number” to “0” (step S516). Also, the missing part complement unit 50 reads out the chapter page number corresponding to “Last Chapter Index” from the hard disk 55 (step S517).

以降、抜け補完部５０は、前記抜け補完に係る第２補完処理を行う（ステップＳ５１８）。 Thereafter, the missing part complementing unit 50 performs the second complementing process related to the missing part complementation (step S518).

図１４は本実施の形態に係る複写機１において、抜け補完部５０によって行われる第２補完の処理を説明するフローチャートである。以下、該第２補完の処理について詳しく説明する。 FIG. 14 is a flowchart for explaining the process of the second complementation performed by the missing part complementing unit 50 in the copying machine 1 according to the present embodiment. Hereinafter, the process of the second complement will be described in detail.

抜け補完部５０は変数の初期化を行う（ステップＳ９０１）。この処理は図１２のステップＳ８０１の処理と同様であり、詳しい説明を省略する。また、抜け補完部５０は、前記原稿画像データから現在の「Page Index」に対応するページの画像データを読み出し、該画像データにおける行数を検出する（ステップＳ９０２）。 The missing complement unit 50 initializes a variable (step S901). This process is the same as the process of step S801 in FIG. 12, and thus the detailed description is omitted. Further, the missing part complementing unit 50 reads the image data of the page corresponding to the current "Page Index" from the document image data, and detects the number of lines in the image data (step S902).

次いで、抜け補完部５０、最終行を示す「Line End Index」に、検出された行数から「１」を引いた数値を代入し（ステップＳ９０３）、「Line Index」に「１」を代入する（ステップＳ９０４）。 Next, the missing complement unit 50 substitutes a numerical value obtained by subtracting "1" from the detected number of lines into "Line End Index" indicating the final line (step S903), and substitutes "1" into "Line Index". (Step S904).

また、抜け補完部５０は現在の「Page Index」に対応するページの画像データを読み出して現在の「Line Index」に対応する行に係る文字列データを抽出する（ステップＳ９０５）。抜け補完部５０は、抽出された文字列データの文字サイズが既に定められた最大文字サイズ「Letter Size」に対応する文字サイズと等しいか判定を行う（ステップＳ９０６）。 Also, the missing part complement unit 50 reads the image data of the page corresponding to the current "Page Index", and extracts the character string data related to the line corresponding to the current "Line Index" (step S905). The missing character complementing unit 50 determines whether the character size of the extracted character string data is equal to the character size corresponding to the predetermined maximum character size “Letter Size” (step S 906).

抜け補完部５０によって、抽出された文字列データの文字サイズが前記最大文字サイズ「Letter Size」に対応する文字サイズと等しくないと判定された場合（ステップＳ９０６：ＮＯ）、処理はステップＳ９１２に進む。 If it is determined that the character size of the extracted character string data is not equal to the character size corresponding to the maximum character size “Letter Size” (step S 906: NO), the process proceeds to step S 912. .

一方、抜け補完部５０は、抽出された文字列データの文字サイズが前記最大文字サイズ「Letter Size」に対応する文字サイズと等しいと判定した場合（ステップＳ９０６：ＹＥＳ）、抽出された文字列データに対して章情報取得部４９が前記章番号文字合致照合の処理を行う（ステップＳ９０７）。 On the other hand, when it is determined that the character size of the extracted character string data is equal to the character size corresponding to the maximum character size "Letter Size" (step S906: YES), the missing character complementing unit 50 extracts the extracted character string data The chapter information acquisition unit 49 executes the chapter number character matching collation process (step S 907).

このように、章番号文字合致照合の処理後、抜け補完部５０は、「Chapter Number」が「０」であるか否かの判定を行う（ステップＳ９０８）。抜け補完部５０は、「Chapter Number」が「０」であると判定した場合（ステップＳ９０８：ＹＥＳ）、処理をステップＳ９１２に進める。 As described above, after the chapter number character matching process, the missing character complementing unit 50 determines whether "Chapter Number" is "0" (step S908). When it is determined that the "Chapter Number" is "0" (step S908: YES), the missing part complement unit 50 proceeds the process to step S912.

一方、抜け補完部５０は、「Chapter Number」が「０」でないと判定した場合（ステップＳ９０８：ＮＯ）、前記「Chapter Number Next Index」に「Chapter Number」を代入し、変数「Chapter Index」には「Chapter Number」から「１」を引いた数値を代入する（ステップＳ９０９）。 On the other hand, when it is determined that “Chapter Number” is not “0” (Step S 908: NO), the missing completion unit 50 substitutes “Chapter Number” for the “Chapter Number Next Index”, and sets “Chapter Index” for the variable. The value of “Chapter Number” minus “1” is substituted (Step S 909).

次いで、抜け補完部５０は、「Chapter Index」に対応付けて、ステップＳ９０５にて読み出された文字列データを章文字列として記憶し（ステップＳ９１０）、また、「Chapter Index」に対応付けて、現在の「Page Index」に「１」を加算した数値を、章ページ番号として記憶する（ステップＳ９１１）。 Next, the missing completion unit 50 stores the character string data read in step S 905 as a chapter character string in association with “Chapter Index” (step S 910), and in association with “Chapter Index”. A numerical value obtained by adding "1" to the current "Page Index" is stored as a chapter page number (step S911).

抜け補完部５０は、現在の「Line Index」が前記「Line End Index」と等しいか否かの判定を行う（ステップＳ９１２）。抜け補完部５０は、現在の「Line Index」が前記「Line End Index」と等しくないと判定をした場合（ステップＳ９１２：ＮＯ）、現在の「Line Index」に「１」を加算した数字を新たに「Line Index」とし（ステップＳ９１３）、処理をステップＳ９０５に戻す。 The missing part complement unit 50 determines whether the current "Line Index" is equal to the "Line End Index" (step S912). When it is determined that the current "Line Index" is not equal to the "Line End Index" (step S912: NO), the missing part complement unit 50 newly adds a number obtained by adding "1" to the current "Line Index". To "Line Index" (step S913), and the process returns to step S905.

一方、抜け補完部５０は、現在の「Line Index」が前記「Line End Index」と等しいと判定した場合（ステップＳ９１２：ＹＥＳ）、再び、現在の「Page Index」が前記「 Page End Index」と等しいか否かの判定を行う（ステップＳ９１４）。 On the other hand, when it is determined that the current “Line Index” is equal to the “Line End Index” (step S 912: YES), the missing complement unit 50 again sets the current “Page Index” as the “Page End Index”. It is determined whether they are equal (step S914).

抜け補完部５０は、現在の「Page Index」が前記「 Page End Index」と等しくないと判定をした場合（ステップＳ９１４：ＮＯ）、現在の「Page Index」に「１」を加算した数字を新たに「Page Index」とし（ステップＳ９１５）、処理をステップＳ９０２に戻す。 When it is determined that the current “Page Index” is not equal to the “Page End Index” (step S 914: NO), the missing complement unit 50 newly adds a number obtained by adding “1” to the current “Page Index”. To "Page Index" (step S915), and the process returns to step S902.

一方、抜け補完部５０は、現在の「Page Index」が前記「 Page End Index」と等しいと判定した場合（ステップＳ９１４：ＹＥＳ）、斯かる第２補完の処理を終了する。 On the other hand, when it is determined that the current “Page Index” is equal to the “Page End Index” (step S 914: YES), the missing complement unit 50 ends the processing of the second complement.

以上の処理を行うことにより、図１０及び図１１に示した、抜け補完の処理が終了する。 By performing the above process, the process of the missing complementation shown in FIGS. 10 and 11 is completed.

以上のようにして、抜け補完部５０により、前記抜け補完の処理がされた後、ＣＰＵ４０は、エラーが発生したか否かを判定する（ステップＳ１０６）。 As described above, after the process of the missing complementation is performed by the missing complement unit 50, the CPU 40 determines whether an error has occurred (step S106).

例えば、前記ステップＳ１０５にて、Error Flagがセットされていれれば、ＣＰＵ４０はエラーが発生したと判定し（ステップＳ１０６：ＹＥＳ）、章抜けがある旨を前記表示部に表示する（ステップＳ１０７）。以降、処理は終了する。 For example, if the error flag is set in step S105, the CPU 40 determines that an error has occurred (step S106: YES), and displays on the display unit that there is a missing chapter (step S107). Thereafter, the process ends.

前記ステップＳ１０５にて、Error Flagがセットされていなければ、ＣＰＵ４０はエラーが発生していないと判定し（ステップＳ１０６：ＮＯ）、本実施の形態に係る章分けの処理は終了する。 If the error flag is not set in step S105, the CPU 40 determines that an error has not occurred (step S106: NO), and the chapter division processing according to the present embodiment ends.

以上に記載した処理によって、本実施の形態においては、斯かる原稿画像データに対して、簡単、かつ、適確に、章毎に章情報（章番号、章文字列、章ページ番号等を含む。）を分けて格納することにより、章分けの処理を行うことが出来る。 According to the processing described above, in the present embodiment, the chapter image information (chapter number, chapter character string, chapter page number, etc. is included for each chapter easily and appropriately for such manuscript image data). Can be divided into chapters and stored.

更に、本発明においては、このように、章毎に分けられた章情報を用い、斯かる原稿の原稿画像データに基づく印刷（画像形成）を行う際、章と章との切り替わりに、いわゆる合い紙（特定紙）を挿入して、ユーザによる章の区別を容易にすることもできる。この際、ＣＰＵ４０がいわゆる挿入部としての役割をなすように構成すれば良い。また、読み取られた原稿画像データを章毎に分けて記憶し、以降における、章毎の印刷指示に対応することができる。 Furthermore, in the present invention, when printing (image formation) based on the document image data of such a document using chapter information divided into chapters in this manner, so-called matching between chapters and chapters is made. Paper (specific paper) can be inserted to facilitate user distinction of the chapters. At this time, the CPU 40 may be configured to play a role as a so-called insertion unit. Further, the read document image data can be divided into chapters and stored, and can correspond to the printing instruction for each chapter in the following.

なお、章毎に分けられた章情報を用い、章毎の題名が記載された目次を作成することも可能である。 In addition, it is also possible to create a table of contents in which titles of each chapter are described using chapter information divided into chapters.

（実施の形態２）
実施の形態１においては、最大文字サイズの検出が行われ（ステップＳ１０１）、検出された最大文字サイズを有する文字列が章表示文字列の候補として抽出され（ステップＳ１０２）、抽出された文字列から前記章情報が取得される処理を行ってから（ステップＳ１０３）、前記抜け補完の処理が施される（ステップＳ１０５）ことについて記載されている。 Second Embodiment
In the first embodiment, detection of the maximum character size is performed (step S101), and a character string having the detected maximum character size is extracted as a chapter display character string candidate (step S102), and the extracted character string The chapter information is acquired (step S103) and then the missing complement process is performed (step S105).

しかし、本発明はこれに限るものでなく、前記抜け補完の処理を省いても良い。前記抜け補完の処理を省いても、斯かる章分けの処理の妨げにならず、むしろ処理が短くなり、装置側の負担を減らすことが出来る。 However, the present invention is not limited to this, and the process of the missing complement may be omitted. Even if the omission complementing process is omitted, the chapter division process is not impeded, and the process is shortened and the burden on the apparatus can be reduced.

（実施の形態３）
また、実施の形態２においては、前記抜け補完の処理を省くことについて説明したが、本発明は以上の記載に限るものでない。 Third Embodiment
In the second embodiment, the omission of the process of the missing complement is described, but the present invention is not limited to the above description.

例えば、全Ｎの章からなる原稿の場合、前記抜け補完の処理は、最初の章からＮ−１番目章までの各ページに対する抜け補完の処理（ステップＳ５０３〜ステップＳ５１４）と、Ｎ番目（最終）章の各ページに対する抜け補完の処理（ステップＳ５１５〜ステップＳ５１８）とを含む。 For example, in the case of an original consisting of all N chapters, the missing complement process is the missing complement process (steps S503 to S514) for each page from the first chapter to the N-1th chapter, and the Nth (final And the process of missing complementation (steps S515 to S518) for each page of the chapter.

しかし、これに限るものでなく、「ステップＳ５０３〜ステップＳ５１４」の処理と、「ステップＳ５１５〜ステップＳ５１８」の処理との何れか一方、例えば、「ステップＳ５０３〜ステップＳ５１４」の処理のみを施すように構成しても良い。 However, the present invention is not limited to this, and any one of the processing of "step S503 to step S514" and the processing of "step S515 to step S518", for example, only the processing of "step S503 to step S514" You may configure it.

これによって、実施の形態３においては、処理の短縮による装置側の負担軽減と共に、章抜けの対策を図ることが出来る。 As a result, in the third embodiment, it is possible to reduce the burden on the apparatus side by shortening the processing and to take measures against the chapter omission.

本発明の実施態様１においては、複数ページの原稿に係る原稿画像データに対して、章毎に分別する処理を行う画像処理装置１において、前記原稿画像データに対して文字認識処理を施し、最大文字サイズを検出する文字サイズ検出部４７と、前記最大文字サイズを有する文字列を抽出する文字列抽出部４８と、章の始まりのページにて章の区分を表す章番号のパターンを記憶している記憶部５５と、前記文字列抽出部４８によって抽出された抽出文字列から、前記パターンに基づいて数字を抽出し、該抽出文字列に係るページ番号を前記原稿画像データから取得する章情報取得部４９とを備え、前記記憶部５５は、抽出された数字に対応付けて、前記抽出文字列及びページ番号を記憶することを特徴とする。 In the first embodiment of the present invention, in the image processing apparatus 1 which performs processing of sorting document image data for documents of a plurality of pages into chapters, character recognition processing is performed on the document image data, and A character size detection unit 47 for detecting character size, a character string extraction unit 48 for extracting a character string having the maximum character size, and a chapter number pattern representing chapter division on a chapter start page A chapter information acquisition for extracting a number based on the pattern from the storage unit 55 and the extracted character string extracted by the character string extraction unit 48 and acquiring a page number related to the extracted character string from the document image data The storage unit 55 is characterized by storing the extracted character string and the page number in association with the extracted number.

本発明によれば、前記文字サイズ検出部が前記原稿画像データに対して文字認識処理を施し、最大文字サイズを検出し、前記文字列抽出部が前記最大文字サイズを有する文字列を抽出し、前記章情報取得部が前記文字列抽出部によって抽出された抽出文字列から、前記パターンに基づいて数字を抽出し、該抽出文字列に係るページ番号を前記原稿画像データから取得し、抽出された数字に対応付けて、前記抽出文字列及びページ番号が記憶される。 According to the present invention, the character size detection unit performs character recognition processing on the document image data to detect a maximum character size, and the character string extraction unit extracts a character string having the maximum character size. The chapter information acquisition unit extracts a number based on the pattern from the extracted character string extracted by the character string extraction unit, and acquires a page number related to the extracted character string from the document image data and is extracted The extracted character string and the page number are stored in association with numbers.

本発明の実施態様２においては、抽出された数字が複数である場合、前記章情報取得部４９によって取得された数字及びページ番号に基づいて、昇降順における抜け数字の数を求め、抜け数字を補完する抜け補完部５０を備えることを特徴とする。 In the second embodiment of the present invention, when there are a plurality of extracted numbers, the number of missing numbers in the ascending and descending order is determined based on the numbers and page numbers acquired by the chapter information acquiring unit 49, and the remaining numbers are calculated. A feature of the present invention is to provide a missing complement unit 50 that complements.

本発明によれば、抽出された数字が複数である場合、抜け補完部は、前記章情報取得部によって取得された数字及びページ番号に基づいて、昇降順における抜け数字の数を求めて抜け数字を補完する。 According to the present invention, when there are a plurality of extracted numbers, the missing complement unit determines the number of the missing digits in the ascending / descending order based on the numbers and the page numbers acquired by the chapter information acquiring unit and determines the missing digits. To complement.

本発明の実施態様３においては、前記抜け補完部５０は、抽出された数字が１つである場合、前記ページ番号及び前記原稿の最終ページ番号によって定められる範囲に対して、前記抜け数字の補完を行うことを特徴とする。 In the third embodiment of the present invention, when the number extracted is one, the missing part complementing the missing number with respect to the range defined by the page number and the final page number of the document. It is characterized by doing.

本発明によれば、抽出された数字が１つである場合、前記抜け補完部は、前記ページ番号及び前記原稿の最終ページ番号によって定められる範囲に対して、前記抜け数字の補完を行う。 According to the present invention, when the number extracted is one, the missing portion complements the missing number in a range defined by the page number and the final page number of the document.

本発明の実施態様４においては、前記文字サイズ検出部４７は、各ページの一行目の文字列に対してのみ前記検出を行うことを特徴とする。 In the fourth embodiment of the present invention, the character size detection unit 47 performs the detection only on the character string on the first line of each page.

本発明によれば、前記文字サイズ検出部は、各ページの一行目の文字列に対してのみ最大文字サイズを検出する処理を行う。 According to the present invention, the character size detection unit performs processing of detecting the maximum character size only for the character string on the first line of each page.

本発明の実施態様５においては、前記文字列抽出部４８は、各ページの一行目の文字列に対してのみ前記抽出を行うことを特徴とする。 In the fifth embodiment of the present invention, the character string extraction unit 48 performs the extraction only on the character string on the first line of each page.

本発明によれば、前記文字列抽出部は、各ページの一行目の文字列に対してのみ最大文字サイズを有する文字列を抽出する処理を行う。 According to the present invention, the character string extraction unit performs processing of extracting a character string having the maximum character size only for the character string on the first line of each page.

本発明の実施態様６においては、前記章情報取得部４９は、前記抽出文字列のうち、最初の一つ又は複数の文字が前記パターンと一致する抽出文字列を検索し、検索された抽出文字列から、対応するパターンに含まれる章番号と一致する数字を抽出することを特徴とする。 In the sixth embodiment of the present invention, the chapter information acquiring unit 49 searches for an extracted character string in which the first one or more characters of the extracted character string match the pattern, and the extracted character searched It is characterized in that a digit is extracted from the column that matches the chapter number included in the corresponding pattern.

本発明によれば、前記章情報取得部は、前記抽出文字列のうち、最初の一つ又は複数の文字が前記章番号のパターンと一致する抽出文字列を検索し、検索された抽出文字列から、対応するパターンに含まれる章番号と一致する数字を章番号として抽出する。 According to the present invention, the chapter information acquisition unit searches for an extracted character string in which the first one or more characters of the extracted character strings match the pattern of the chapter number, and the extracted character string searched for From this, the numbers matching the chapter numbers included in the corresponding pattern are extracted as chapter numbers.

本発明の実施態様７においては、前記実施態様の何れか一つに記載の画像処理装置と、シート状の記録媒体に画像形成を行う画像形成部と、特定紙が収容されたトレイと、前記画像形成を行う際、前記処理の結果に基づいて、章の切り替わりに、特定紙を挿入する挿入部とを備えることを特徴とする。 In an embodiment 7 of the present invention, the image processing apparatus according to any one of the above-mentioned embodiments, an image forming unit for forming an image on a sheet-like recording medium, a tray containing specific paper, and At the time of performing image formation, it is characterized by including an inserting section for inserting a specific sheet at chapter switching based on the result of the process.

本発明によれば、前記画像形成を行う際、前記挿入部は前記画像処理装置による章分別の処理の結果に基づいて、章の切り替わりに、前記トレイに収容された特定紙を挿入する According to the present invention, when performing the image formation, the insertion unit inserts the specific sheet stored in the tray at the switching of the chapter based on the result of the chapter classification process by the image processing apparatus.

本発明の実施態様８においては、前記画像形成部は、前記章情報取得部４９によって取得された抽出文字列に係る数字、ページ番号を該抽出文字列に対応付けて、前記原稿に係る目次の画像形成を行うことを特徴とする。 In the eighth embodiment of the present invention, the image forming unit associates the numeral and page number of the extracted character string acquired by the chapter information acquiring unit 49 with the extracted character string, and It is characterized in that image formation is performed.

本発明によれば、前記画像形成部は、前記原稿に係る目次の画像形成を行う。すなわち、前記章情報取得部によって取得された抽出文字列に係る数字、ページ番号が該抽出文字列に対応付けられ、目次として画像形成される。 According to the present invention, the image forming unit forms an image of a table of contents relating to the document. That is, the numbers and page numbers related to the extracted character string acquired by the chapter information acquiring unit are associated with the extracted character string, and an image is formed as a table of contents.

本発明の実施態様９においては、章の始まりのページにて章の区分を表す章番号のパターンを記憶している記憶部５５を備えており、複数ページの原稿に係る原稿画像データに対する画像処理を行う画像処理装置１にて、章毎に分別する処理を行う章分け処理方法において、前記原稿画像データに対して文字認識処理を施し、最大文字サイズを検出し、前記最大文字サイズを有する文字列を抽出し、前記記憶部５５に記憶されているパターンに基づいて、抽出された抽出文字列から数字を抽出し、該抽出示文字列に係るページ番号を前記原稿画像データから取得し、抽出された数字に対応付けて、前記抽出文字列及びページ番号を記憶することを特徴とする。 In the ninth embodiment of the present invention, a storage unit 55 storing a chapter number pattern representing division of chapters at the beginning of a chapter page is provided, and image processing is performed on document image data related to a plurality of pages of documents. In the chapter division processing method of performing classification processing for each chapter in the image processing apparatus 1 for performing character recognition, character recognition processing is performed on the document image data to detect the maximum character size, and the character having the maximum character size A row is extracted, a digit is extracted from the extracted extracted character string based on the pattern stored in the storage unit 55, a page number related to the extracted indication character string is acquired from the document image data, and extracted The extracted character string and the page number are stored in association with the designated number.

本発明によれば、画像処理装置において、前記原稿画像データに対して文字認識処理が施されて最大文字サイズが検出され、前記最大文字サイズを有する文字列が抽出され、前記記憶部に記憶されているパターンに基づいて、抽出された抽出文字列から数字が抽出され、該抽出示文字列に係るページ番号が前記原稿画像データから取得され、抽出された数字に対応付けて、前記抽出文字列及びページ番号が記憶される。 According to the present invention, in the image processing apparatus, character recognition processing is performed on the document image data to detect a maximum character size, and a character string having the maximum character size is extracted and stored in the storage unit. Based on the current pattern, a number is extracted from the extracted extracted character string, a page number related to the extracted indication character string is acquired from the document image data, and the extracted character string is associated with the extracted digit. And the page number is stored.

１複写機
４０ＣＰＵ
４４画像形成制御部
４７文字サイズ検出部
４８文字列抽出部
４９章情報取得部
５０抜け補完部
５５ハードディスク
２１０画像形成部
４３画像準備制御部 1 copier 40 CPU
44 image formation control unit 47 character size detection unit 48 character string extraction unit 49 chapter information acquisition unit 50 missing complement unit 55 hard disk 210 image formation unit 43 image preparation control unit

Claims

In an image processing apparatus that performs processing of sorting document image data related to documents of a plurality of pages for each chapter,
A character size detection unit that performs character recognition processing on the document image data and detects a maximum character size;
A character string extraction unit that extracts a character string having the maximum character size;
A storage section that stores a chapter number pattern representing a chapter division on the chapter start page,
And a chapter information acquisition unit that extracts a number based on the pattern from the extracted character string extracted by the character string extraction unit, and acquires a page number related to the extracted character string from the document image data;
The image processing apparatus, wherein the storage unit stores the extracted character string and the page number in association with the extracted number.

When the number extracted is plural, the number of missing numbers in the ascending and descending order is obtained based on the numbers and page numbers acquired by the chapter information acquiring portion, and a missing value complementing portion that complements the missing numbers is provided. The image processing apparatus according to claim 1, wherein

The missing part complementing part performs complementing of the missing number with respect to a range defined by the page number and the last page number of the document when the extracted number is one. The image processing apparatus according to claim 1.

The image processing apparatus according to any one of claims 1 to 3, wherein the character size detection unit performs the detection only on the character string on the first line of each page.

The image processing apparatus according to any one of claims 1 to 4, wherein the character string extraction unit performs the extraction only on the character string on the first line of each page.

The chapter information acquisition unit
Searching for an extracted character string in which the first one or more characters of the extracted character string match the pattern;
The image processing apparatus according to any one of claims 1 to 5, wherein a number matching the chapter number included in the corresponding pattern is extracted from the extracted extracted character string.

An image processing apparatus according to any one of claims 1 to 6.
An image forming unit for forming an image on a sheet-like recording medium;
A tray containing specific paper,
An image forming apparatus comprising: an inserting section for inserting a specific sheet at chapter switching based on a result of the processing when forming the image.

The image forming unit performs image formation of a table of contents related to the document by associating numbers and page numbers related to the extracted character string acquired by the chapter information acquiring unit with the extracted character string. Item 8. An image forming apparatus according to item 7.

The image processing apparatus is provided with a storage unit that stores a chapter number pattern indicating chapter divisions at the beginning of a chapter page, and performs image processing on original image data related to a plurality of pages of original In the chapter division processing method that performs processing to separate
Character recognition processing is performed on the document image data to detect a maximum character size,
Extract a string having the maximum character size,
Based on the pattern stored in the storage unit, a number is extracted from the extracted extracted character string, and a page number related to the extracted character string is acquired from the document image data,
And storing the extracted character string and the page number in association with the extracted number.