JP2020178168A - Image forming apparatus

Info

Publication number: JP2020178168A
Application number: JP2019077116A
Authority: JP
Inventors: 隆一奥村; Ryuichi Okumura; 光利中尾; Mitsutoshi Nakao; 忍吉奥; Shinobu Yoshioku; 大介市川; Daisuke Ichikawa
Original assignee: Kyocera Document Solutions Inc
Current assignee: Kyocera Document Solutions Inc
Priority date: 2019-04-15
Filing date: 2019-04-15
Publication date: 2020-10-29

Abstract

To provide an image forming apparatus that, when dividing an integrated image, can reduce the time and effort needed to rearrange the divided images. SOLUTION: An image forming apparatus 100 comprises: a learning unit 218; an imaging unit 2; a division unit 211; and a first extraction unit 212. The learning unit 218 learns document data including documents to estimate the relation between characters. The imaging unit 2 images a sheet formed through integration of a plurality of documents, to create imaging data. The division unit 211 divides the imaging data for each of the documents to create divided data. The divided data includes first divided data and second divided data different from the first divided data. The first extraction unit 212 extracts a first character from the first divided data and extracts a second character from the second divided data. Upon receiving input of the first character and the second character, the learning unit 218 outputs an estimation result indicating a degree of the relation between the first character and the second character. SELECTED DRAWING: Figure 2

Description

本発明は、画像形成装置に関する。 The present invention relates to an image forming apparatus.

特許文献１に記載の画像処理装置は、判定手段と、第１サムネイル生成手段と、表示手段とを備える。判定手段は、入力されたドキュメントが、その１ページにＮ（Ｎ≧２）ページ分の原稿内容がまとめられているＮ−ｕｐドキュメントであるか否かを判定する。第１サムネイル生成手段は、判定手段によってＮ−ｕｐドキュメントであると判定された場合に、Ｎページの各々のサムネイルである第１サムネイルを生成する。表示手段は、第１サムネイルをプレビュー表示する。特許文献１に記載の画像処理装置の表示手段は、１ｕｐドキュメントをプレビュー表示できる。 The image processing apparatus described in Patent Document 1 includes a determination means, a first thumbnail generation means, and a display means. The determination means determines whether or not the input document is an N-up document in which the contents of the manuscript for N (N ≧ 2) pages are summarized on one page thereof. The first thumbnail generation means generates a first thumbnail which is a thumbnail of each of N pages when the determination means determines that the document is an N-up document. The display means previews the first thumbnail. The display means of the image processing apparatus described in Patent Document 1 can preview and display a 1-up document.

特開２０１０−２８２０５号公報 Japanese Unexamined Patent Publication No. 2010-28205

しかしながら、Ｎ−ｕｐドキュメントのように複数の画像が集約された画像を、画像ごとに分割する場合、画像の順序が連続するように並ばないことがある。したがって、特許文献１に記載の画像形成装置では、ユーザーには、画像の順序を並び替える手間が発生する。 However, when an image in which a plurality of images are aggregated, such as an N-up document, is divided for each image, the images may not be arranged in a continuous order. Therefore, in the image forming apparatus described in Patent Document 1, the user has to take the trouble of rearranging the order of the images.

本発明は上記課題に鑑みてなされたものであり、集約された画像を分割する際に、分割された画像を並び替える手間を抑制できる画像形成装置を提供することを目的とする。 The present invention has been made in view of the above problems, and an object of the present invention is to provide an image forming apparatus capable of suppressing the trouble of rearranging the divided images when the aggregated images are divided.

本発明に係る画像形成装置は、文書を示す文書データに基づいて、シートに文書を形成する。画像形成装置は、学習部と、撮像部と、分割部と、第１抽出部とを備える。前記学習部は、文字と文字との繋がりを推定するために、前記文書を含む文書データを学習する。前記撮像部は、複数の文書が集約されて形成されたシートを撮像して、撮像データを生成する。前記分割部は、前記撮像データを１ページ分の前記文書ごとに分割して、複数の分割データを生成する。前記第１抽出部は、前記分割データの各々から文字を抽出する。前記分割データは、第１分割データと前記第１分割データと異なる第２分割データとを含む。前記第１抽出部は、前記第１分割データが含む第１文書から第１文字を抽出し、前記第２分割データが含む第２文書から第２文字を抽出する。前記第１文書は、前記１ページ分の分割データによって表される文書を示す。前記第２文書は、前記第１文書と異なる前記１ページ分の分割データによって表される文書を示す。前記第１文字は前記第１文書の記載が始まる位置を示す文頭又は前記第１文書の記載が終わる位置を示す文末のうち、いずれか一方の位置に含まれる文字である。前記第２文字は前記第２文書の記載が始まる位置を示す文頭又は前記第２文書の記載が終わる位置を示す文末のうち、前記第１文字が含まれる位置と異なる位置の文字である。前記学習部は、前記第１文字と前記第２文字とが入力されることで、前記第１文字と前記第２文字との繋がりの程度を示す推定結果を出力する。 The image forming apparatus according to the present invention forms a document on a sheet based on the document data indicating the document. The image forming apparatus includes a learning unit, an imaging unit, a dividing unit, and a first extraction unit. The learning unit learns document data including the document in order to estimate the connection between characters. The imaging unit captures a sheet formed by aggregating a plurality of documents to generate imaging data. The division unit divides the imaging data for each page of the document to generate a plurality of division data. The first extraction unit extracts characters from each of the divided data. The divided data includes a first divided data and a second divided data different from the first divided data. The first extraction unit extracts the first character from the first document included in the first divided data, and extracts the second character from the second document included in the second divided data. The first document indicates a document represented by the divided data for one page. The second document indicates a document represented by the divided data for one page different from the first document. 
The first character is a character located at either the beginning of the first document, that is, the position where its text starts, or the end of the first document, that is, the position where its text ends. The second character is a character located at the beginning or the end of the second document, whichever position differs from the position at which the first character is located. When the first character and the second character are input, the learning unit outputs an estimation result indicating the degree of connection between the first character and the second character.

本発明の画像形成装置によれば、集約された画像を分割する際に、分割された画像を並び替える手間を抑制できる。 According to the image forming apparatus of the present invention, when the aggregated images are divided, it is possible to reduce the trouble of rearranging the divided images.

本発明の実施形態１に係る画像形成装置の構成を示す図である。 A diagram showing the configuration of the image forming apparatus according to Embodiment 1 of the present invention.
本実施形態１に係る制御部の構成を示す図である。 A diagram showing the configuration of the control unit according to Embodiment 1.
本実施形態１におけるタッチパネル部に表示された選択画面を示す図である。 A diagram showing the selection screen displayed on the touch panel in Embodiment 1.
本実施形態１における分割データを示す分割画像を表示した表示画面を示す図である。 A diagram showing a display screen that displays divided images representing the divided data in Embodiment 1.
本実施形態１における分割データを示す分割画像を表示した表示画面を示す別の図である。 Another diagram showing a display screen that displays divided images representing the divided data in Embodiment 1.
本実施形態１における制御部が実行する処理を示すフローチャートである。 A flowchart showing the process executed by the control unit in Embodiment 1.
本実施形態１における第１決定処理を示すフローチャートである。 A flowchart showing the first determination process in Embodiment 1.
本発明の実施形態２に係る制御部の構成を示す図である。 A diagram showing the configuration of the control unit according to Embodiment 2 of the present invention.
本実施形態２における分割データを示す分割画像を表示した表示画面を示す図である。 A diagram showing a display screen that displays divided images representing the divided data in Embodiment 2.
本実施形態２における分割データを示す分割画像を表示した表示画面を示す別の図である。 Another diagram showing a display screen that displays divided images representing the divided data in Embodiment 2.
本実施形態２における制御部が実行する処理を示すフローチャートである。 A flowchart showing the process executed by the control unit in Embodiment 2.
本実施形態２における制御部が実行する第２決定処理を示すフローチャートである。 A flowchart showing the second determination process executed by the control unit in Embodiment 2.
本実施形態２における制御部が実行する選択処理を示すフローチャートである。 A flowchart showing the selection process executed by the control unit in Embodiment 2.

以下、本発明の実施形態について、図面を参照しながら説明する。なお、図中、同一又は相当部分については同一の参照符号を付して説明を繰り返さない。また、本発明の実施形態において、Ｘ軸、Ｙ軸、及びＺ軸は互いに直交し、Ｘ軸及びＹ軸は水平方向に平行であり、Ｚ軸は鉛直方向に平行である。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. In the drawings, the same or corresponding parts are designated by the same reference numerals and the description is not repeated. Further, in the embodiment of the present invention, the X-axis, the Y-axis, and the Z-axis are orthogonal to each other, the X-axis and the Y-axis are parallel in the horizontal direction, and the Z-axis is parallel in the vertical direction.

［実施形態１］ [Embodiment 1]
まず、図１を参照して、本発明の実施形態１に係る画像形成装置１００の構成について説明する。図１は、画像形成装置１００の構成を示す図である。画像形成装置１００は、カラー複合機である。また、画像形成装置１００は、パーソナルコンピューターと通信可能に接続されていてもよい。 First, the configuration of the image forming apparatus 100 according to Embodiment 1 of the present invention will be described with reference to FIG. 1. FIG. 1 is a diagram showing the configuration of the image forming apparatus 100. The image forming apparatus 100 is a color multifunction peripheral. The image forming apparatus 100 may also be communicably connected to a personal computer.

図１に示すように、画像形成装置１００は、画像形成ユニット１、画像読取ユニット２、原稿搬送ユニット３、及び操作表示部４を備える。画像形成ユニット１は、シートＰに画像を形成する。 As shown in FIG. 1, the image forming apparatus 100 includes an image forming unit 1, an image reading unit 2, a document conveying unit 3, and an operation display unit 4. The image forming unit 1 forms an image on the sheet P.

画像読取ユニット２は、シートＲに形成された画像を読み取り、画像を表す撮像データを生成する。具体的には、画像読取ユニット２は、シートＲを撮像してシートＲを表す撮像データを生成する。シートＲは、原稿である。シートＲは、画像が形成されたシートＰを示す。画像は文書を含む。また、画像は複数の文書が集約された画像を含む。画像読取ユニット２は、「撮像部」の一例に相当する。画像読取ユニット２は、コンタクトガラス、ＬＥＤ（ＬｉｇｈｔＥｍｉｔｔｉｎｇＤｉｏｄｅ）、ミラー、キャリッジ、結像レンズ、及び、ＣＣＤ（ＣｈａｒｇｅＣｏｕｐｌｅｄＤｅｖｉｃｅ）を備えている。ＬＥＤ及びミラーは、キャリッジによって支持されている。 The image reading unit 2 reads the image formed on the sheet R and generates imaging data representing the image. Specifically, the image reading unit 2 images the sheet R and generates imaging data representing the sheet R. The sheet R is an original document. The sheet R is a sheet P on which an image has been formed. The image includes a document. The image may also be an image in which a plurality of documents are aggregated. The image reading unit 2 corresponds to an example of an "imaging unit". The image reading unit 2 includes a contact glass, an LED (Light Emitting Diode), a mirror, a carriage, an imaging lens, and a CCD (Charge Coupled Device). The LED and the mirror are supported by the carriage.

画像読取ユニット２によるシートＲの画像読取方法には、フラットベッド読取モード、及び、ＡＤＦ（ＡｕｔｏＤｏｃｕｍｅｎｔＦｅｅｄｅｒ）読取モードの２種類の方法が存在する。フラットベッド読取モードでは、原稿載置用ガラス上に載置されたシートＲの画像を読み取る。ＡＤＦ読取モードでは、原稿搬送ユニット３によってシートＲを搬送させ、シートＲが読取位置を通過する際に、シートＲの画像を読み取る。原稿搬送ユニット３がシートＲを移動させながら、ＣＣＤがシートＲの画像の読み取りを行い、画像を表す撮像データを生成する。シートＲは、例えば、普通紙、コピー紙、再生紙、薄紙、厚紙、又は光沢紙、又はＯＨＰ（ＯｖｅｒｈｅａｄＰｒｏｊｅｃｔｏｒ）シートである。 The image reading unit 2 can read the image of the sheet R in two ways: a flatbed reading mode and an ADF (Auto Document Feeder) reading mode. In the flatbed reading mode, the image of the sheet R placed on the document placement glass is read. In the ADF reading mode, the document transport unit 3 transports the sheet R, and the image of the sheet R is read as the sheet R passes through the reading position. While the document transport unit 3 moves the sheet R, the CCD reads the image on the sheet R and generates imaging data representing the image. The sheet R is, for example, plain paper, copy paper, recycled paper, thin paper, thick paper, glossy paper, or an OHP (Overhead Projector) sheet.

原稿搬送ユニット３は、シートＲを画像読取ユニット２に搬送する。 The document transport unit 3 transports the sheet R to the image reading unit 2.

操作表示部４は、タッチパネル４１と、操作ボタン４２とを有する。タッチパネル４１は、表示装置４３とタッチセンサー４５とを有する。表示装置４３は種々の画像を表示する。表示装置４３は、例えば液晶表示装置（ＬｉｑｕｉｄＣｒｙｓｔａａｌＤｉｓｐｌａｙ：ＬＣＤ）である。タッチセンサー４５はユーザーからの操作を受け付ける。また、操作ボタン４２はユーザーからの操作を受け付ける。 The operation display unit 4 has a touch panel 41 and operation buttons 42. The touch panel 41 has a display device 43 and a touch sensor 45. The display device 43 displays various images. The display device 43 is, for example, a liquid crystal display device (Liquid Crystal Display: LCD). The touch sensor 45 receives an operation from the user. Further, the operation button 42 accepts an operation from the user.

タッチパネル４１は、ユーザーからの操作を受け付ける度に、ユーザーからの操作の内容を示す操作情報を生成する。具体的には、タッチパネル４１には、複数のアイコンが表示される。そして、ユーザーは、アイコンをタップする。タッチパネル４１は、操作情報として、アイコンがタップされたことを示す情報を生成する。 Each time the touch panel 41 receives an operation from the user, the touch panel 41 generates operation information indicating the content of the operation from the user. Specifically, a plurality of icons are displayed on the touch panel 41. Then, the user taps the icon. The touch panel 41 generates information indicating that the icon has been tapped as operation information.

操作表示部４は、ユーザーからの操作に基づいて、タッチパネル４１に表示する画像を変更する。具体的には、操作表示部４は、タッチパネル４１に表示された複数のアイコンのうち、ユーザーによって操作されたアイコンに対応する画像をタッチパネル４１に表示する。また、操作表示部４は、タッチパネル４１に表示する画像に対応する情報を、記憶部２２から読み出す。 The operation display unit 4 changes the image displayed on the touch panel 41 based on the operation from the user. Specifically, the operation display unit 4 displays on the touch panel 41 an image corresponding to the icon operated by the user among the plurality of icons displayed on the touch panel 41. Further, the operation display unit 4 reads out the information corresponding to the image displayed on the touch panel 41 from the storage unit 22.

画像形成ユニット１は、搬送機構１１、給送部１２、トナー供給部１３、画像形成部１４、定着部１５、排出部１６、制御部２１及び記憶部２２を含む。画像形成ユニット１は、搬送路Ｌを有する。 The image forming unit 1 includes a transport mechanism 11, a feeding unit 12, a toner supply unit 13, an image forming unit 14, a fixing unit 15, a discharging unit 16, a control unit 21, and a storage unit 22. The image forming unit 1 has a transport path L.

搬送路Ｌは、給送部１２から排出部１６までシートＰを案内する。搬送路Ｌは、給送部１２から排出部１６まで延びる。 The transport path L guides the sheet P from the feeding unit 12 to the discharging unit 16. The transport path L extends from the feeding unit 12 to the discharging unit 16.

搬送機構１１は、シートＰを搬送する。具体的には、搬送機構１１は、シートＰを画像形成部１４及び定着部１５を経由して排出部１６まで搬送する。また、搬送機構１１は、定着部１５で画像が定着されたシートＰを反転して画像形成部１４へ搬送できる。 The transport mechanism 11 transports the sheet P. Specifically, the transport mechanism 11 transports the sheet P to the discharging unit 16 via the image forming unit 14 and the fixing unit 15. The transport mechanism 11 can also invert the sheet P on which the image has been fixed by the fixing unit 15 and convey it back to the image forming unit 14.

給送部１２は、シートＰを搬送路Ｌへ供給する。シートＰは、例えば、普通紙、コピー紙、再生紙、薄紙、厚紙、又は光沢紙、又はＯＨＰ（ＯｖｅｒｈｅａｄＰｒｏｊｅｃｔｏｒ）シートである。 The feeding unit 12 supplies the sheet P to the transport path L. The sheet P is, for example, plain paper, copy paper, recycled paper, thin paper, thick paper, glossy paper, or an OHP (Overhead Projector) sheet.

トナー供給部１３には、複数のトナーコンテナが装着される。複数のトナーコンテナのうちの１つは、シアン色のトナーが収納される。複数のトナーコンテナのうちの１つは、マゼンタ色のトナーが収納される。複数のトナーコンテナのうちの１つは、イエロー色のトナーが収納される。複数のトナーコンテナのうちの１つは、黒色のトナーが収納される。 A plurality of toner containers are mounted on the toner supply unit 13. One of the plurality of toner containers stores cyan-colored toner. One of the plurality of toner containers stores magenta toner. One of the plurality of toner containers stores yellow toner. One of the plurality of toner containers stores black toner.

画像形成部１４は、画像をシートＰに形成する。具体的には、画像形成部１４は、複数のシートＰに複数の画像を形成する。画像形成部１４は、転写部を含む。転写部は、画像をシートＰに転写する。その結果、シートＰに画像が形成される。 The image forming unit 14 forms an image on the sheet P. Specifically, the image forming unit 14 forms a plurality of images on the plurality of sheets P. The image forming unit 14 includes a transfer unit. The transfer unit transfers the image to the sheet P. As a result, an image is formed on the sheet P.

画像形成部１４は、複数の画像形成部を含む。複数の画像形成部のうちの１つは、シアン色のトナー像を形成する。複数の画像形成部のうちの１つは、マゼンタ色のトナー像を形成する。複数の画像形成部のうちの１つは、イエロー色のトナー像を形成する。複数の画像形成部のうちの１つは、ブラック色のトナー像を形成する。 The image forming unit 14 includes a plurality of image forming units. One of the plurality of image forming units forms a cyan toner image. One of the plurality of image forming units forms a magenta toner image. One of the plurality of image forming units forms a yellow toner image. One of the plurality of image forming units forms a black toner image.

画像形成部１４は、転写部と、像担持体と、帯電部と、露光部と、現像部とを含む。 The image forming unit 14 includes a transfer unit, an image carrier, a charging unit, an exposure unit, and a developing unit.

転写部は、トナー画像をシートＰに転写する。転写部は、中間転写ベルトを含む。中間転写ベルトは、無端状のベルトである。中間転写ベルトには、複数色のトナー像が形成される。具体的には、中間転写ベルトには、複数の画像形成部１４が中間転写ベルトにトナー像を形成する。この結果、複数色のトナー像が中間転写ベルト上で重畳され、中間転写ベルト上に画像が形成される。そして、中間転写ベルトに形成された画像は、シートＰに転写される。その結果、シートＰに画像が形成される。 The transfer unit transfers the toner image to the sheet P. The transfer unit includes an intermediate transfer belt. The intermediate transfer belt is an endless belt. Toner images of a plurality of colors are formed on the intermediate transfer belt. Specifically, the plurality of image forming units 14 each form a toner image on the intermediate transfer belt. As a result, the toner images of the plurality of colors are superimposed on the intermediate transfer belt, and an image is formed on the intermediate transfer belt. The image formed on the intermediate transfer belt is then transferred to the sheet P. As a result, an image is formed on the sheet P.

像担持体は、ドラム形状であり、回転軸を有する。像担持体は、回転軸を中心に時計回りに回転する。像担持体は、外周面側に感光層を有する。 The image carrier is drum-shaped and has a rotation axis. The image carrier rotates clockwise about the axis of rotation. The image carrier has a photosensitive layer on the outer peripheral surface side.

帯電部は像担持体の感光層を所定の電位に帯電する。露光部は、像担持体の感光層にレーザー光を照射して露光する。露光部は画像データに基づいて像担持体を露光する。この結果、像担持体に静電潜像が形成される。 The charging unit charges the photosensitive layer of the image carrier to a predetermined potential. The exposure unit exposes the photosensitive layer of the image carrier by irradiating it with laser light. The exposure unit exposes the image carrier based on the image data. As a result, an electrostatic latent image is formed on the image carrier.

現像部は像担持体上の静電潜像を現像する。現像部は現像ローラーを有する。現像ローラーは、像担持体にトナーを供給し、像担持体上の静電潜像を現像してトナー画像を形成する。この結果、像担持体の外周面にトナー画像が形成される。 The developing unit develops an electrostatic latent image on the image carrier. The developing unit has a developing roller. The developing roller supplies toner to the image carrier and develops an electrostatic latent image on the image carrier to form a toner image. As a result, a toner image is formed on the outer peripheral surface of the image carrier.

転写部は、像担持体の外周面に形成されたトナー画像をシートＰに転写する。その結果、シートＰにトナー画像が転写される。 The transfer unit transfers the toner image formed on the outer peripheral surface of the image carrier to the sheet P. As a result, the toner image is transferred to the sheet P.

定着部１５は、シートＰを加熱及び加圧し、シートＰに形成された画像をシートＰに定着する。具体的には、定着部１５は、シートＰを加熱及び加圧し、シートＰに形成されたトナー画像をシートＰに定着する。 The fixing unit 15 heats and pressurizes the sheet P to fix the image formed on the sheet P to the sheet P. Specifically, the fixing unit 15 heats and pressurizes the sheet P to fix the toner image formed on the sheet P to the sheet P.

排出部１６は、シートＰを画像形成装置１００の外部へ排出する。定着部１５がトナー画像をシートＰに定着させた後、搬送機構１１はシートＰを定着部１５から排出部１６まで搬送する。そして、排出部１６はトナー画像の定着したシートＰを画像形成装置１００の外部に排出する。 The discharging unit 16 discharges the sheet P to the outside of the image forming apparatus 100. After the fixing unit 15 fixes the toner image on the sheet P, the transport mechanism 11 transports the sheet P from the fixing unit 15 to the discharging unit 16. The discharging unit 16 then discharges the sheet P on which the toner image has been fixed to the outside of the image forming apparatus 100.

制御部２１は、画像形成装置１００の動作を制御する。制御部２１は、プロセッサーと記憶装置とを含む。プロセッサーは、例えばＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）を含む。プロセッサーは、記憶装置に記憶された制御プログラムを実行して、操作表示部４、搬送機構１１、給送部１２、画像形成部１４、定着部１５、及び排出部１６を制御する。 The control unit 21 controls the operation of the image forming apparatus 100. The control unit 21 includes a processor and a storage device. The processor includes, for example, a CPU (Central Processing Unit). The processor executes a control program stored in the storage device to control the operation display unit 4, the transport mechanism 11, the feeding unit 12, the image forming unit 14, the fixing unit 15, and the discharging unit 16.

記憶部２２は、記憶装置を含む。具体的には、記憶部２２は、半導体メモリーのようなメモリーを備え、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）を備えてもよい。記憶部２２は、制御プログラムを記憶している。 The storage unit 22 includes a storage device. Specifically, the storage unit 22 includes a memory such as a semiconductor memory, and may also include an HDD (Hard Disk Drive). The storage unit 22 stores the control program.

次に図２を参照して、制御部２１の構成を詳しく説明する。図２は、本実施形態に係る制御部２１の構成を示す図である。制御部２１は、学習部２１８、分割部２１１、及び第１抽出部２１２を含む。制御部２１は、制御プログラムを実行することで、学習部２１８、分割部２１１、及び第１抽出部２１２として機能する。 Next, the configuration of the control unit 21 will be described in detail with reference to FIG. 2. FIG. 2 is a diagram showing the configuration of the control unit 21 according to the present embodiment. The control unit 21 includes a learning unit 218, a division unit 211, and a first extraction unit 212. By executing the control program, the control unit 21 functions as the learning unit 218, the division unit 211, and the first extraction unit 212.

分割部２１１は、画像読取ユニット２が生成した撮像データを１ページ分の文書ごとに分割して、複数の分割データを生成する。分割データは、第１分割データと第２分割データとを含む。第２分割データは、第１分割データと異なる分割データを示す。 The division unit 211 divides the imaging data generated by the image reading unit 2 for each page of the document to generate a plurality of division data. The divided data includes the first divided data and the second divided data. The second divided data indicates divided data different from the first divided data.
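The division step above can be illustrated with a minimal sketch. It assumes the scanned sheet is a simple grid of equally sized page regions and that the grid shape is already known; the patent text here does not say how the division unit 211 locates page boundaries, so the function name `split_n_up` and the list-of-rows image representation are hypothetical.

```python
from typing import List

Image = List[List[int]]  # rows of pixel values; a stand-in for real scan data


def split_n_up(scan: Image, rows: int, cols: int) -> List[Image]:
    """Cut a scanned sheet into rows x cols equally sized page tiles.

    Assumption: pages sit on a regular grid. A real device would also have
    to handle margins, skew, and scan orientation.
    """
    height, width = len(scan), len(scan[0])
    if height % rows or width % cols:
        raise ValueError("sheet size must be divisible by the grid")
    tile_h, tile_w = height // rows, width // cols
    tiles = []
    for r in range(rows):      # top to bottom
        for c in range(cols):  # left to right
            tile = [row[c * tile_w:(c + 1) * tile_w]
                    for row in scan[r * tile_h:(r + 1) * tile_h]]
            tiles.append(tile)
    return tiles


# A 2-in-1 sheet: the left half holds one page, the right half another.
sheet = [[1, 1, 2, 2],
         [1, 1, 2, 2]]
first_divided, second_divided = split_n_up(sheet, rows=1, cols=2)
```

The tiles come out in left-to-right, top-to-bottom order, which is not necessarily the reading order of the documents; that gap is what the later ordering step addresses.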

第１抽出部２１２は、分割データの各々から文字を抽出する。具体的には、第１抽出部２１２は、第１分割データが含む第１文書から第１文字を抽出する。第１文書は、１ページ分の分割データによって表される文書を示す。第１文字は、第１文書の文頭又は文末のうち、いずれか一方の位置に含まれる文字である。具体的には、第１文字は第１文書の記載が始まる位置を示す文頭又は第１文書の記載が終わる位置を示す文末のうち、いずれか一方の位置に含まれる文字である。 The first extraction unit 212 extracts characters from each of the divided data. Specifically, the first extraction unit 212 extracts the first character from the first document included in the first divided data. The first document is the document represented by one page of divided data. The first character is a character located at either the beginning or the end of the first document. Specifically, the first character is located either at the beginning, that is, the position where the text of the first document starts, or at the end, that is, the position where the text of the first document ends.

また、第１抽出部２１２は、第２分割データが含む第２文書から第２文字を抽出する。第２文書は、第１文書と異なる１ページ分の分割データによって表される文書を示す。第２文字は第２文書の記載が始まる位置を示す文頭又は第２文書の記載が終わる位置を示す文末のうち、第１文字が含まれる位置と異なる位置の文字である。 The first extraction unit 212 also extracts the second character from the second document included in the second divided data. The second document is the document represented by one page of divided data different from that of the first document. The second character is a character located at the beginning or the end of the second document, whichever position differs from the position at which the first character is located. For example, if the first character is taken from the end of the first document, the second character is taken from the beginning of the second document.

また、第１文字と第２文字との各々は、単一の文字、単語、及び形態素を含む。形態素は、意味を持つ最小の単位の表現要素を示す。したがって、複数の言語に対応できる。この結果、複数の言語の文字と文字との繋がりを推定できる。 Each of the first character and the second character may be a single character, a word, or a morpheme. A morpheme is the smallest unit of expression that carries meaning. The apparatus can therefore support a plurality of languages. As a result, the connection between characters can be estimated for a plurality of languages.
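As a rough illustration of the extraction described above, the sketch below takes the last word of one page as the first character and the first word of another page as the second character. It assumes the page text has already been recognized and that whitespace splitting is an adequate tokenizer, which holds for English but not for Japanese; the helper `boundary_tokens` is hypothetical.

```python
from typing import Tuple


def boundary_tokens(page_text: str) -> Tuple[str, str]:
    """Return the (beginning, end) tokens of one page of recognized text.

    Assumption: tokens are whitespace-separated words. For a language
    without spaces, a morphological analyzer would be needed instead.
    """
    words = page_text.split()
    if not words:
        raise ValueError("page contains no text")
    return words[0], words[-1]


first_page = "the meeting will start at"
second_page = "noon in the main hall"

# First character: end of the first document.
_, first_character = boundary_tokens(first_page)
# Second character: beginning of the second document (the opposite position).
second_character, _ = boundary_tokens(second_page)
```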

学習部２１８は、文字と文字との繋がりを推定するために、文書データを学習する。この結果、文書データに基づいて、文字と文字との繋がりを容易に推定できる。 The learning unit 218 learns the document data in order to estimate the connection between the characters. As a result, the connection between characters can be easily estimated based on the document data.

学習は、機械学習を含む。機械学習は、例えば教師あり学習、教師なし学習、及び強化学習を含む。機械学習は、例えば、ニューラルネットワーク（ＮｅｕｒａｌＮｅｔｗｏｒｋ）又はサポートベクターマシン（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）によって実行される。ニューラルネットワークは、入力層、隠れ層（中間層）、及び出力層を有する。ニューラルネットワークは、誤差逆伝播法（バックプロパゲーション）により、出力層での出力値と最適解との誤差を少なくする。 Learning includes machine learning. Machine learning includes, for example, supervised learning, unsupervised learning, and reinforcement learning. Machine learning is performed, for example, by a neural network (Neural Network) or a support vector machine (Support Vector Machine). The neural network has an input layer, a hidden layer (intermediate layer), and an output layer. The neural network uses an error backpropagation method to reduce the error between the output value and the optimum solution in the output layer.

また、機械学習は、深層学習（ディープラーニング）であってもよい。深層学習は、入力層、２層以上の隠れ層、及び出力層を有するニューラルネットワークによって構成される。具体的には、深層学習は、例えば、畳み込みニューラルネットワーク（ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ）、再帰型ニューラルネットワーク（ＲｅｃｕｒｒｅｎｔＮｅｕｒａｌＮｅｔｗｏｒｋ）、ボルツマンマシン（Ｂｏｌｔｚｍａｎｍａｃｈｉｎｅ）によって構成される。 The machine learning may also be deep learning. Deep learning uses a neural network having an input layer, two or more hidden layers, and an output layer. Specifically, deep learning is implemented by, for example, a convolutional neural network, a recurrent neural network, or a Boltzmann machine.

また、本実施形態の学習部２１８は、第１文字と第２文字とが入力されることで、第１文字と第２文字との繋がりの程度を示す推定結果を出力する。したがって、文書に記載されている文字に基づいて、ユーザーが文書を並び変える必要がない。この結果、集約された画像を分割する際の、文書を並び変える手間を抑制できる。 Further, the learning unit 218 of the present embodiment outputs an estimation result indicating the degree of connection between the first character and the second character by inputting the first character and the second character. Therefore, the user does not have to reorder the document based on the characters in the document. As a result, it is possible to reduce the trouble of rearranging the documents when dividing the aggregated images.
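To make the input/output contract of the learning unit concrete, here is a deliberately simple stand-in: a bigram frequency model that learns which word follows which and reports a score between 0 and 1 for a pair. This is an assumption made for illustration only; the source describes neural networks and support vector machines as possible learners and does not specify a bigram model. The class name `BigramConnectionModel` and its methods are hypothetical.

```python
from collections import Counter
from typing import Iterable


class BigramConnectionModel:
    """Toy stand-in for a learning unit: scores how often `second`
    directly followed `first` in the learned document data."""

    def __init__(self) -> None:
        self.pair_counts: Counter = Counter()
        self.left_counts: Counter = Counter()

    def learn(self, documents: Iterable[str]) -> None:
        # Count every adjacent word pair in every document.
        for text in documents:
            words = text.split()
            for left, right in zip(words, words[1:]):
                self.pair_counts[(left, right)] += 1
                self.left_counts[left] += 1

    def connection(self, first: str, second: str) -> float:
        """Estimated degree of connection in [0, 1]; 0.0 for unseen words."""
        seen = self.left_counts[first]
        if seen == 0:
            return 0.0
        return self.pair_counts[(first, second)] / seen


model = BigramConnectionModel()
model.learn(["the meeting will start at noon in the main hall",
             "we meet at noon"])
strong = model.connection("at", "noon")
weak = model.connection("at", "hall")
```

Because the model is trained on the same text that is later scanned, boundary pairs from adjacent pages will have been seen during learning, which mirrors the accuracy argument the description makes about learning the printed document data.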

例えば、一般的に、会議で使用した「２ｉｎ１」のような集約された文書を保管する際に、ユーザーは画像形成装置の分割機能を利用する。複数の文書が集約された資料を分割する場合、画像読取ユニット２がシートＲを読み取った方向などから、文書が順序通りに並ばないことがある。このような場合、ユーザーは文書を目視で確認し、文脈に基づいて、自ら文書の順序を並び替えて文書を保管する。しかしながら、学習部２１８が文字と文字との繋がりの程度を推定するため、ユーザーは、推定結果に基づいて、文書を並び替えることができる。この結果、文書を並び替える手間を抑制できる。 For example, in general, when storing an aggregated document such as "2in1" used in a conference, the user uses the division function of the image forming apparatus. When a document in which a plurality of documents are aggregated is divided, the documents may not be arranged in order due to the direction in which the image reading unit 2 reads the sheet R or the like. In such cases, the user visually checks the documents, rearranges the order of the documents based on the context, and stores the documents. However, since the learning unit 218 estimates the degree of connection between characters, the user can sort the documents based on the estimation result. As a result, it is possible to reduce the trouble of rearranging the documents.

また、本実施形態の学習部２１８が学習する文書データは、画像形成装置１００がシートＰに形成する文書を示すデータである。画像形成装置１００が複数の文書を集約して１枚のシートＰに形成する文書の文書データを学習部２１８が学習する。したがって、学習部２１８に入力される第１文字と第２文字とは、文書データに含まれるため、学習部２１８の学習精度は向上する。この結果、推定結果の精度が向上する。 The document data learned by the learning unit 218 of the present embodiment is data indicating a document that the image forming apparatus 100 forms on the sheet P. That is, the learning unit 218 learns the document data of the documents that the image forming apparatus 100 aggregates and forms on a single sheet P. Because the first character and the second character input to the learning unit 218 are therefore contained in the document data, the learning accuracy of the learning unit 218 improves. As a result, the accuracy of the estimation result improves.

例えば、画像形成装置１００は、会議で使用する文書をシートＰに形成する。具体的には、画像形成装置１００に入力される文書データに基づいて、画像形成装置１００は、会議で使用する文書をシートＰに形成する。シートＰには、複数の文書が集約される。そして学習部２１８は、文書データを学習する。その後、画像読取ユニット２は、シートＲを読み取って、撮像データを生成する。シートＲは、複数の文書が集約されたシートＰである。更に、撮像データの文書をもとに分割データは生成される。また、学習部２１８には、分割データから取得された第１文字と第２文字とが入力される。つまり、学習部２１８に入力される第１文字と第２文字とは、既に学習部２１８が学習した学習データに含まれる。したがって、学習部２１８は、精度の良い推定結果を出力できる。なお、学習データは、記憶部２２に記憶されている。 For example, the image forming apparatus 100 forms a document to be used in a conference on a sheet P. Specifically, the image forming apparatus 100 forms a document to be used in the conference on the sheet P based on the document data input to the image forming apparatus 100. A plurality of documents are aggregated on the sheet P. Then, the learning unit 218 learns the document data. After that, the image reading unit 2 reads the sheet R and generates imaging data. Sheet R is a sheet P in which a plurality of documents are aggregated. Further, the divided data is generated based on the document of the imaging data. Further, the first character and the second character acquired from the divided data are input to the learning unit 218. That is, the first character and the second character input to the learning unit 218 are included in the learning data already learned by the learning unit 218. Therefore, the learning unit 218 can output an accurate estimation result. The learning data is stored in the storage unit 22.

なお、学習部２１８が学習する文書データは、画像形成装置１００に入力された文書データを含んでもよい。したがって、実際にシートＰに形成されなかった文書の画像データも、学習部２１８は学習できる。 The document data learned by the learning unit 218 may include the document data input to the image forming apparatus 100. Therefore, the learning unit 218 can learn the image data of the document that is not actually formed on the sheet P.

また、本実施形態の学習部２１８は、文書データと、文書データに対応するページ番号とを更に学習する。この結果、学習精度が向上し、文字と文字との繋がりを推定する精度が向上する。 In addition, the learning unit 218 of the present embodiment further learns the document data and the page number corresponding to the document data. As a result, the learning accuracy is improved, and the accuracy of estimating the connection between characters is improved.

また、学習部２１８は、文書データに対して、自然言語処理を実行する。自然言語処理は、自然言語をコンピューターに処理させる一連の技術である。自然言語は、人間と人間とが意思疎通のために使用する言語である。自然言語処理は、形態素解析、構文解析、意味解析及び文脈解析を含む。 In addition, the learning unit 218 executes natural language processing on the document data. Natural language processing is a series of technologies that allow a computer to process natural language. Natural language is the language used by humans for communication. Natural language processing includes morphological analysis, parsing, semantic analysis and context analysis.

学習部２１８は、文書データに対して、形態素解析を実行する。形態素解析は、意味を持つ最小の単位の形態素に区分する処理である。学習部２１８は、文書データに含まれるテキストを記憶部２２に記憶された辞書データに基づいて、文書を形態素に区分する。そして、学習部２１８は、形態素解析の結果に基づいて、構文解析を実行する。構文解析は、形態素と形態素との関連性を解析する処理である。関連性は、例えば、形態素と形態素との修飾関係である。更に、学習部２１８は、構文解析の結果に基づいて、意味解析を実行する。意味解析は、構文解析の結果に基づいて、構文木を決定する処理である。構文木は、構文解析の経過及び結果を木構造で示すものである。更に、学習部２１８は、意味解析の結果に基づいて、文脈解析を実行する。文脈解析は、文と文との関連性を解析する処理である。文は、主語と述語を含み、完結した１つの陳述を示す。学習部２１８は、自然言語処理の結果に基づいて、学習する。自然言語処理の結果は、学習データとして記憶部２２に記憶される。 The learning unit 218 executes morphological analysis on the document data. Morphological analysis is a process of segmenting text into morphemes, the smallest units that carry meaning. The learning unit 218 segments the text included in the document data into morphemes based on the dictionary data stored in the storage unit 22. The learning unit 218 then executes syntactic analysis based on the result of the morphological analysis. Syntactic analysis is a process of analyzing the relationships between morphemes, for example, the modification relationship between one morpheme and another. Next, the learning unit 218 executes semantic analysis based on the result of the syntactic analysis. Semantic analysis is a process of determining a syntax tree based on the result of the syntactic analysis. A syntax tree shows the progress and result of the syntactic analysis as a tree structure. The learning unit 218 then executes context analysis based on the result of the semantic analysis. Context analysis is a process of analyzing the relationships between sentences. A sentence contains a subject and a predicate and expresses one complete statement. The learning unit 218 learns based on the result of the natural language processing. The result of the natural language processing is stored in the storage unit 22 as learning data.
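The dictionary-based segmentation step can be sketched with a greedy longest-match rule. This is a simplification chosen for illustration: the source says only that segmentation is based on dictionary data, and practical morphological analyzers typically use more elaborate methods. The function `segment` and the sample dictionary are hypothetical.

```python
from typing import List, Set


def segment(text: str, dictionary: Set[str]) -> List[str]:
    """Greedy longest-match segmentation of `text` into dictionary entries.

    Characters that match no entry are emitted as single-character pieces,
    so the function always terminates and never drops input.
    """
    max_len = max((len(entry) for entry in dictionary), default=1)
    pieces: List[str] = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if length == 1 or candidate in dictionary:
                pieces.append(candidate)
                i += length
                break
    return pieces


english = segment("unbreakable", {"un", "break", "able"})
japanese = segment("会議資料を印刷する", {"会議", "資料", "を", "印刷", "する"})
```

The same routine handles both samples because it works on characters rather than on whitespace, which is one way to read the description's remark that morpheme-level units allow multiple languages to be supported.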

Next, the configuration of the control unit 21 will be described in further detail with reference to FIG. 2. The control unit 21 further includes a first determination unit 213. The control unit 21 functions as the first determination unit 213 by executing the control program.

The first determination unit 213 determines the order of the divided data. Specifically, the first determination unit 213 determines the order of the divided data based on the estimation result of the learning unit 218. More specifically, the first determination unit 213 inputs the first character and the second character to the learning unit 218, and determines the order of the first divided data and the second divided data based on the estimation result, output by the learning unit 218, that indicates the degree of connection between the first character and the second character.

したがって、文書に記載されている文字に基づいて、ユーザーが文書を並び変える必要がない。この結果、集約された画像を分割する際の、文書を並び変える手間を抑制できる。 Therefore, the user does not have to reorder the document based on the characters in the document. As a result, it is possible to reduce the trouble of rearranging the documents when dividing the aggregated images.

Further, the first extraction unit 212 of the first embodiment extracts the first character located at the end of the text of the first document, and extracts the second character located at the beginning of the text of the second document. Then, the first determination unit 213 inputs the first character and the second character to the learning unit 218, and determines the order of the first divided data and the second divided data based on the estimation result output from the learning unit 218. Therefore, the first document and the second document can be read in succession. As a result, documents that were scattered by aggregation can be restored to one connected document.

Further, the first extraction unit 212 of the first embodiment extracts the first character and the second character. The first character is located at the end of the text of the first document. The second character is located at the beginning of the text of the second document. Then, the first determination unit 213 inputs the first character and the second character to the learning unit 218, and determines the order of the first divided data and the second divided data based on the estimation result output from the learning unit 218. Therefore, the first document and the second document can be read in succession. As a result, documents that were scattered by aggregation can be restored to one connected document.

次に、図２と図３とを参照して、操作表示部４に表示された選択画面５０を説明する。図３は、タッチパネル部４１に表示された選択画面５０を示す図である。選択画面５０は、画像形成装置１００のコピー機能を選択する操作ボタン４２を操作することで、タッチパネル部４１に表示される。選択画面５０には、用紙選択アイコン５１、縮小／拡大アイコン５２、濃度設定アイコン５３、両面／分割設定アイコン５４、ページ集約設定アイコン５５、ソート／仕分け設定アイコン５６、機能一覧アイコン５７、及び、お気に入りアイコン５８が表示されている。 Next, the selection screen 50 displayed on the operation display unit 4 will be described with reference to FIGS. 2 and 3. FIG. 3 is a diagram showing a selection screen 50 displayed on the touch panel unit 41. The selection screen 50 is displayed on the touch panel unit 41 by operating the operation button 42 for selecting the copy function of the image forming apparatus 100. On the selection screen 50, the paper selection icon 51, the reduction / enlargement icon 52, the density setting icon 53, the double-sided / split setting icon 54, the page aggregation setting icon 55, the sort / sorting setting icon 56, the function list icon 57, and favorites The icon 58 is displayed.

The paper selection icon 51 is operated by the user to select the size of the sheet P. The reduction / enlargement icon 52 is operated by the user to enlarge or reduce the image formed on the sheet P. The density setting icon 53 is operated by the user to set the copy density. The double-sided / split setting icon 54 is operated by the user to select double-sided or single-sided printing, and to divide an image in which a plurality of images are aggregated, such as a "2in1" image, into the individual images. The page aggregation setting icon 55 is operated by the user to set page aggregation such as "2in1". The sort / sorting setting icon 56 is operated by the user to set sorting conditions such as whether or not to sort. The function list icon 57 is operated by the user to display, on the touch panel 41, a function list screen that explains the various functions. The favorite icon 58 is operated by the user to display, on the touch panel 41, the icons that the user uses frequently.

また、図３に示すように、ユーザーＨ１の手の指（例えば、人差し指）によって、両面／分割設定アイコン５４がタップされる。この操作に応じて、タッチパネル部４１は、両面／分割設定アイコン５４に対応する画面を表示する。なお、手は、タッチパネル４１に表示されない。 Further, as shown in FIG. 3, the double-sided / split setting icon 54 is tapped by the finger (for example, the index finger) of the user H1's hand. In response to this operation, the touch panel unit 41 displays the screen corresponding to the double-sided / split setting icon 54. The hand is not displayed on the touch panel 41.

次に、図２〜図４を参照して、両面／分割設定アイコン５４に対応する画面を説明する。図４は、分割データを示す分割画像Ｄを表示した表示画面１１０を示す図である。図４に示すように、表示画面１１０は、第１表示領域１１１と第２表示領域１１２とを含む。 Next, the screen corresponding to the double-sided / split setting icon 54 will be described with reference to FIGS. 2 to 4. FIG. 4 is a diagram showing a display screen 110 displaying a divided image D showing divided data. As shown in FIG. 4, the display screen 110 includes a first display area 111 and a second display area 112.

In the first display area 111, a preview image 113 showing the captured image RG1, which represents the imaging data, and a back button 114 are displayed. The captured image RG1 shown in FIG. 4 is a "2in1" image in which two images are aggregated on one sheet R. When the imaging data includes a plurality of documents, a plurality of captured images RG1 are displayed. The back button 114 is a button for returning to the selection screen 50 shown in FIG. 3.

第２表示領域１１２には、分割データを示す複数の分割画像Ｄが表示される。複数の分割画像Ｄの各々は、１ページ分の分割データによって表される画像を示す。図４に示す複数の分割画像Ｄは、第１分割画像Ｄ１と第２分割画像Ｄ２とを含む。 In the second display area 112, a plurality of divided images D showing the divided data are displayed. Each of the plurality of divided images D indicates an image represented by one page of divided data. The plurality of divided images D shown in FIG. 4 include a first divided image D1 and a second divided image D2.

第１分割画像Ｄ１は、例えば、複数のページのうちの２ページ目の文書を示す画像である。第１分割画像Ｄ１は、文書を含む。第１分割画像Ｄ１は、第１文頭領域ＢＳ１と第１文末領域ＥＳ１とを含む。 The first divided image D1 is, for example, an image showing a document on the second page of a plurality of pages. The first divided image D1 includes a document. The first divided image D1 includes a first sentence beginning region BS1 and a first sentence ending region ES1.

The first sentence beginning region BS1 indicates the region of the document included in the first divided image D1 in which the beginning of the text is located. The beginning of the text includes the position at which the text of the document starts, within the document included in the image represented by one page of divided data. In the first sentence beginning region BS1 shown in FIG. 4, the character string "multifunction devices ...." is located.

第１文末領域ＥＳ１は、第１分割画像Ｄ１に含まれる文書のうち、文末部分が位置する領域を示す。文末部分は、１ページ分の分割データによって表される画像に含まれる文書のうち、文書の記載が終わる位置を含む。図４に示す第１文末領域ＥＳ１には、「・・・・ｏｐｔｉｍａｌｌｙ」という文字列が位置する。 The first sentence end area ES1 indicates an area in which the sentence end portion is located in the document included in the first divided image D1. The end of the sentence includes the position where the description of the document ends in the document included in the image represented by the divided data for one page. The character string "... optimally" is located in the first sentence end region ES1 shown in FIG.

第２分割画像Ｄ２は、例えば、複数のページのうちの１ページ目の文書を示す画像である。第２分割画像Ｄ２は、文書を含む。第２分割画像Ｄ２は、第２文頭領域ＢＳ２と第２文末領域ＥＳ２とを含む。 The second divided image D2 is, for example, an image showing a document on the first page of a plurality of pages. The second divided image D2 includes a document. The second divided image D2 includes the second sentence beginning region BS2 and the second sentence ending region ES2.

The second sentence beginning region BS2 indicates the region of the document included in the second divided image D2 in which the beginning of the text is located. In the second sentence beginning region BS2 shown in FIG. 4, the character string "Tokkyo Co., Ltd. ...." is located.

第２文末領域ＥＳ２は、第２分割画像Ｄ２に含まれる文書のうちの文末部分が位置する領域を示す。図４に示す第２分割画像Ｄ２の第２文末領域ＥＳ２には、「・・・・ｐｒｉｎｔｅｒｓａｎｄ」という文字列が位置する。 The second sentence end area ES2 indicates an area in which the sentence end portion of the document included in the second divided image D2 is located. The character string "... printers and" is located in the second sentence end region ES2 of the second divided image D2 shown in FIG.

また、図４に示す第１分割画像Ｄ１は、第１表示領域１１１から第２表示領域１１２へ向かう方向の上流側に位置する。図４に示す第２分割画像Ｄ２は、第１表示領域１１１から第２表示領域１１２へ向かう方向の下流側に位置する。したがって、図４では２ページ目の第１分割画像Ｄ１が上流側に位置し、１ページ目の第２分割画像Ｄ２が下流側に位置する。 Further, the first divided image D1 shown in FIG. 4 is located on the upstream side in the direction from the first display area 111 to the second display area 112. The second divided image D2 shown in FIG. 4 is located on the downstream side in the direction from the first display area 111 to the second display area 112. Therefore, in FIG. 4, the first divided image D1 on the second page is located on the upstream side, and the second divided image D2 on the first page is located on the downstream side.

When the captured image RG1 is divided into the first divided image D1 and the second divided image D2 shown in FIG. 4, the control unit 21 controls the operation display unit 4 so that the operation display unit 4 displays the display screen 110 for making the division setting. To display the display screen 110 shown in FIG. 4 on the touch panel 41 of the operation display unit 4, the double-sided / split setting icon 54 shown in FIG. 3 is touched twice. When the icon is touched only once, the double-sided setting screen is displayed on the operation display unit 4. When the double-sided / split setting icon 54 is touched twice, the image reading unit 2 images the sheet R and acquires imaging data representing the sheet R. The sheet R is a sheet on which an image is formed. The image formed on the sheet R is an image in which a plurality of images are aggregated. The imaging data captured by the image reading unit 2 is transmitted to the control unit 21.

制御部２１は、撮像データを受信する。そして、制御部２１は、撮像データの所定領域の輝度を取得する。所定領域は、複数の画像が集約された場合に、互いに隣り合う画像と画像との間に形成される領域を示す。また、集約する画像の数に応じて、所定領域のパターンが変更される。集約された画像が２つの場合、所定領域のパターンは、例えば、撮像画像を２つに分断する１本の直線の形状となる。集約された画像が４つの場合、所定領域のパターンは、例えば、撮像画像を４つに分断する十字の形状となる。 The control unit 21 receives the imaging data. Then, the control unit 21 acquires the brightness of a predetermined region of the imaging data. The predetermined region indicates a region formed between images adjacent to each other when a plurality of images are aggregated. In addition, the pattern of a predetermined area is changed according to the number of images to be aggregated. When there are two aggregated images, the pattern of the predetermined region is, for example, the shape of a straight line that divides the captured image into two. When there are four aggregated images, the pattern of the predetermined region is, for example, a cross shape that divides the captured image into four.

また、制御部２１は、所定領域の輝度が所定の階調か否かを判定する。所定の階調は、例えば、白色を示す。そして、分割部２１１は、白色の階調を示す所定領域に基づいて、撮像データを分割する。例えば、分割部２１１は、撮像データを第１分割データと第２分割データとに分割する。 Further, the control unit 21 determines whether or not the brightness of the predetermined region has a predetermined gradation. The predetermined gradation indicates, for example, white. Then, the dividing unit 211 divides the imaging data based on a predetermined region showing the gradation of white. For example, the division unit 211 divides the imaging data into the first division data and the second division data.
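The gutter check described above can be sketched for the "2in1" case as follows. This is a minimal sketch under stated assumptions: the image is an 8-bit grayscale list of rows, the predetermined region is a vertical band at the horizontal center, and `WHITE`, `gutter_width`, and `tolerance` are hypothetical parameters that do not appear in the description.

```python
WHITE = 255  # assumed 8-bit grayscale value treated as "white"


def split_two_in_one(image, gutter_width=2, tolerance=5):
    """Split a grayscale image (list of rows) at a white vertical gutter.

    Returns (left, right), or None when the central band is not white,
    i.e. when the predetermined region does not show the expected gradation.
    """
    width = len(image[0])
    start = (width - gutter_width) // 2
    end = start + gutter_width
    gutter_is_white = all(
        pixel >= WHITE - tolerance
        for row in image
        for pixel in row[start:end]
    )
    if not gutter_is_white:
        return None
    left = [row[:start] for row in image]
    right = [row[end:] for row in image]
    return left, right


# Two dark "pages" separated by a two-pixel white band.
sheet = [
    [0, 0, 255, 255, 0, 0],
    [0, 0, 255, 255, 0, 0],
]
print(split_two_in_one(sheet))  # ([[0, 0], [0, 0]], [[0, 0], [0, 0]])
```

For a "4in1" sheet the same test would be applied to a cross-shaped region, checking a horizontal band as well as the vertical one.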

更に、制御部２１は、分割データに対して文字認識処理を実行する。文字認識処理は、典型的には、光学的文字認識（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅｃｏｇｎｉｔｉｏｎ：ＯＣＲ）処理である。制御部２１は、分割データに対して文字認識処理を実行して、文字画像を検索する。具体的には、制御部２１は、複数の方向から分割データに対して文字認識処理を実行する。したがって、文字画像を検索する精度が向上する。この結果、制御部２１は、検索した文字画像に基づいて、精度のよいテキスト情報を取得できる。 Further, the control unit 21 executes character recognition processing on the divided data. The character recognition process is typically an optical character recognition (OCR) process. The control unit 21 executes a character recognition process on the divided data to search for a character image. Specifically, the control unit 21 executes character recognition processing on the divided data from a plurality of directions. Therefore, the accuracy of searching the character image is improved. As a result, the control unit 21 can acquire accurate text information based on the searched character image.

Further, when a plurality of images are aggregated, the orientation of the image data is changed before aggregation. The orientation of the divided data produced by the dividing unit 211 is the same as the orientation in which the imaging data was captured by the image reading unit 2. Therefore, the orientation of the document included in the divided data differs from the orientation in which characters are normally read. The control unit 21 performs character recognition processing on the divided data from a plurality of directions, and corrects the orientation of the divided data to the orientation in which the largest number of character images were found. As a result, the user can easily read the divided data when checking it.
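The orientation correction can be sketched as trying each 90-degree rotation and keeping the one in which a recognizer finds the most characters. In this minimal sketch the recognizer is passed in as `count_characters`, a hypothetical callable; `toy_count` below is an invented stand-in and not a real OCR engine.

```python
def rotate90(grid):
    """Rotate a list-of-rows grid 90 degrees clockwise."""
    return [list(row) for row in zip(*grid[::-1])]


def best_orientation(grid, count_characters):
    """Try 0/90/180/270 degree rotations; keep the one with the most hits."""
    best_grid, best_count = grid, count_characters(grid)
    candidate = grid
    for _ in range(3):
        candidate = rotate90(candidate)
        count = count_characters(candidate)
        if count > best_count:
            best_grid, best_count = candidate, count
    return best_grid


def toy_count(grid):
    """Invented recognizer: a row counts only if it spells a known word."""
    known = {"AB", "CD"}
    return sum(len(row) for row in grid if "".join(row) in known)


upright = [list("AB"), list("CD")]
scanned = rotate90(upright)  # simulate a page captured sideways
print(best_orientation(scanned, toy_count) == upright)  # True
```

With a real recognizer, the count would be the number of character images found by the character recognition processing in each direction.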

Then, the control unit 21 controls the operation display unit 4 so that the first divided image D1 showing the first divided data is displayed in the second display area 112 shown in FIG. 4. The control unit 21 also controls the operation display unit 4 so that the second divided image D2 showing the second divided data is displayed in the second display area 112 shown in FIG. 4.

次に、図２〜図５を参照して、第１決定部２１３が第１分割データと第２分割データとの順序を決定するまでの処理を説明する。図５は、分割データを示す分割画像Ｄを表示した表示画面１１０を示す別の図である。図５に示すように、表示画面１１０は、第１表示領域１１１と第２表示領域１１２とを含む。 Next, with reference to FIGS. 2 to 5, the process until the first determination unit 213 determines the order of the first division data and the second division data will be described. FIG. 5 is another view showing the display screen 110 displaying the divided image D showing the divided data. As shown in FIG. 5, the display screen 110 includes a first display area 111 and a second display area 112.

第１表示領域１１１には、撮像データを示す撮像画像ＲＧ１を表示する。撮像画像ＲＧ１は、プレビュー画像１１３と戻るボタン１１４とを含む。第２表示領域１１２には、分割データを示す複数の分割画像Ｄが表示される。図５に示す複数の分割画像Ｄは、第１分割画像Ｄ１と第２分割画像Ｄ２とを含む。 In the first display area 111, the captured image RG1 showing the captured data is displayed. The captured image RG1 includes a preview image 113 and a back button 114. In the second display area 112, a plurality of divided images D showing the divided data are displayed. The plurality of divided images D shown in FIG. 5 include a first divided image D1 and a second divided image D2.

図５に示す第１分割画像Ｄ１は、第１表示領域１１１から第２表示領域１１２へ向かう方向の下流側に位置する。図５に示す第２分割画像Ｄ２は、第１表示領域１１１から第２表示領域１１２へ向かう方向の上流側に位置する。したがって、図５では１ページ目の第２分割画像Ｄ２が上流側に位置し、２ページ目の第１分割画像Ｄ１が下流側に位置する。つまり、ページ番号が上流側から昇順に並んでいる。 The first divided image D1 shown in FIG. 5 is located on the downstream side in the direction from the first display area 111 to the second display area 112. The second divided image D2 shown in FIG. 5 is located on the upstream side in the direction from the first display area 111 to the second display area 112. Therefore, in FIG. 5, the second divided image D2 on the first page is located on the upstream side, and the first divided image D1 on the second page is located on the downstream side. That is, the page numbers are arranged in ascending order from the upstream side.

Further, the characters located in the second sentence end region ES2 of the second divided image D2 shown in FIG. 5 and the characters located in the first sentence beginning region BS1 of the first divided image D1 form a meaningful character string when joined. Specifically, as shown in FIG. 5, the word "and" located in the second sentence end region ES2 and the word "multifunction" located in the first sentence beginning region BS1 of the first divided image D1 form the word sequence "and multifunction".

To join characters into a meaningful character string as shown in FIG. 5, the first extraction unit 212 extracts characters from each piece of divided data. For example, the first extraction unit 212 extracts a character or a character string based on the text data that the control unit 21 generated for each piece of divided data. Specifically, the first extraction unit 212 extracts the word "multifunction" from the first sentence beginning region BS1 of the document included in the first divided image D1 shown in FIG. 4. The document included in the first divided image D1 corresponds to an example of the "first document". The word extracted from the first sentence beginning region BS1 corresponds to an example of the "first character". Then, the first extraction unit 212 extracts the word "and" from the second sentence end region ES2 of the document included in the second divided image D2 shown in FIG. 4. The document included in the second divided image D2 corresponds to an example of the "second document". The word extracted from the second sentence end region ES2 corresponds to an example of the "second character".
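The extraction step can be illustrated with a minimal sketch that takes the first and last whitespace-separated words of each page's recognized text. The function name `boundary_words` and the whitespace-splitting rule are assumptions; the literal "..." in the sample strings stands for text that the figures elide.

```python
def boundary_words(page_text):
    """Return (first_word, last_word) of one page's recognized text.

    Hypothetical helper: splitting on whitespace is a simplification that
    only suits space-delimited languages.
    """
    words = page_text.split()
    if not words:
        return None, None
    return words[0], words[-1]


# Strings as they appear in FIG. 4; "..." marks elided text.
d1_text = "multifunction devices ... optimally"
d2_text = "Tokkyo Co., Ltd. ... printers and"

print(boundary_words(d1_text))  # ('multifunction', 'optimally')
print(boundary_words(d2_text))  # ('Tokkyo', 'and')
```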

そして、第１決定部２１３は、学習部２１８に第１文字と第２文字とを入力する。更に学習部２１８は、第１文字と第２文字との繋がりの程度を示す推定結果を出力する。例えば、学習部２１８には、第２文末領域ＥＳ２の「ａｎｄ」と第１文頭領域ＢＳ１の「ｍｕｌｔｉｆｕｎｃｔｉｏｎ」とが入力される。そして、学習部２１８は、「ａｎｄ」と「ｍｕｌｔｉｆｕｎｃｔｉｏｎ」との繋がりの程度を示す推定結果を出力する。 Then, the first determination unit 213 inputs the first character and the second character to the learning unit 218. Further, the learning unit 218 outputs an estimation result indicating the degree of connection between the first character and the second character. For example, “and” of the second sentence end region ES2 and “multifunction” of the first sentence beginning region BS1 are input to the learning unit 218. Then, the learning unit 218 outputs an estimation result indicating the degree of connection between the “and” and the “multifunction”.

Further, the first extraction unit 212 may extract characters from the first sentence end region ES1 of the first divided image D1 and the second sentence beginning region BS2 of the second divided image D2 shown in FIG. 4. Specifically, the first extraction unit 212 extracts the word "optimally" from the first sentence end region ES1 of the first divided image D1 shown in FIG. 4. The first extraction unit 212 extracts the word "Tokkyo" from the second sentence beginning region BS2 of the second divided image D2, which shows the second divided data, shown in FIG. 4.

Then, the first determination unit 213 inputs the first character and the second character to the learning unit 218. The learning unit 218 then outputs an estimation result indicating the degree of connection between the first character and the second character. For example, "optimally" from the first sentence end region ES1 and "Tokkyo" from the second sentence beginning region BS2 are input to the learning unit 218. Then, the learning unit 218 outputs an estimation result indicating the degree of connection between "optimally" and "Tokkyo".

Then, the first determination unit 213 determines the order of the first divided data and the second divided data based on the estimation results, output by the learning unit 218, that indicate the degree of connection between the first character and the second character. Specifically, the first determination unit 213 compares the estimation result indicating the degree of connection between "and" and "multifunction" with the estimation result indicating the degree of connection between "optimally" and "Tokkyo", and determines the order of the first divided data and the second divided data based on the estimation result with the larger degree of connection.
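The comparison of the two estimation results can be sketched as follows. In this minimal sketch, `score` is a hypothetical stand-in for the learning unit 218, and the values in `TOY_SCORES` are made up for illustration; the description does not give numeric estimation results.

```python
def decide_order(first, second, score):
    """Order two pages by comparing both possible joins.

    `first` and `second` are (head_word, tail_word) tuples for the first and
    second divided data; `score(tail, head)` is an assumed callable returning
    the degree of connection between a page's last word and the next page's
    first word.
    """
    first_then_second = score(first[1], second[0])
    second_then_first = score(second[1], first[0])
    if first_then_second >= second_then_first:
        return ("D1", "D2")
    return ("D2", "D1")


# Made-up connection degrees, for illustration only.
TOY_SCORES = {
    ("and", "multifunction"): 0.9,
    ("optimally", "Tokkyo"): 0.1,
}


def toy_score(tail, head):
    return TOY_SCORES.get((tail, head), 0.0)


d1 = ("multifunction", "optimally")  # first divided data (page 2 in FIG. 4)
d2 = ("Tokkyo", "and")               # second divided data (page 1 in FIG. 4)
print(decide_order(d1, d2, toy_score))  # ('D2', 'D1')
```

The result places the second divided image D2 before the first divided image D1, which matches the arrangement shown in FIG. 5.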

更に、制御部２１は、第１決定部２１３の決定に基づいて、操作表示部４が第１分割画像Ｄ１と第２分割画像Ｄ２とを表示するように、操作表示部４を制御する。したがって、図５に示すように、第１分割画像Ｄ１と第２分割画像Ｄ２とは、ページの順に並ぶ。この結果、第１分割画像Ｄ１と第２分割画像Ｄ２とを続けて読むことができる。 Further, the control unit 21 controls the operation display unit 4 so that the operation display unit 4 displays the first divided image D1 and the second divided image D2 based on the determination of the first determination unit 213. Therefore, as shown in FIG. 5, the first divided image D1 and the second divided image D2 are arranged in the order of pages. As a result, the first divided image D1 and the second divided image D2 can be read continuously.

また、学習部２１８の推定結果と第１決定部２１３の決定結果とは、学習部２１８に学習される。したがって、学習部２１８は、文書データとページ番号と推定結果と決定結果とで再学習する。この結果、精度の良い推定結果を出力できる。 Further, the estimation result of the learning unit 218 and the determination result of the first determination unit 213 are learned by the learning unit 218. Therefore, the learning unit 218 relearns with the document data, the page number, the estimation result, and the determination result. As a result, an accurate estimation result can be output.

次に、図６を参照して、実施形態１の制御部２１が実行する処理を説明する。図６は、制御部２１が実行する処理のフローチャートを示す。制御部２１が実行する処理は、ステップＳ１０１〜ステップＳ１０８を含む。 Next, the process executed by the control unit 21 of the first embodiment will be described with reference to FIG. FIG. 6 shows a flowchart of processing executed by the control unit 21. The process executed by the control unit 21 includes steps S101 to S108.

ステップＳ１０１において、制御部２１は、操作表示部４が選択画面５０を表示するように、操作表示部４を制御する。処理は、ステップＳ１０２に進む。 In step S101, the control unit 21 controls the operation display unit 4 so that the operation display unit 4 displays the selection screen 50. The process proceeds to step S102.

ステップＳ１０２において、制御部２１は、操作表示部４から取得した信号が画像データを分割する指示を含むか否かを判定する。画像データを分割する指示を含まない場合（ステップＳ１０２において、Ｎｏ）、処理は終了する。画像データを分割する指示を含む場合（ステップＳ１０２において、Ｙｅｓ）、処理はステップＳ１０３に進む。 In step S102, the control unit 21 determines whether or not the signal acquired from the operation display unit 4 includes an instruction to divide the image data. If the instruction to divide the image data is not included (No in step S102), the process ends. When the instruction to divide the image data is included (Yes in step S102), the process proceeds to step S103.

ステップＳ１０２でＹｅｓの場合、ステップＳ１０３において、制御部２１は、画像読取ユニット２が生成した撮像データを取得する。処理は、ステップＳ１０４に進む。 In the case of Yes in step S102, in step S103, the control unit 21 acquires the image pickup data generated by the image reading unit 2. The process proceeds to step S104.

ステップＳ１０４において、分割部２１１は、撮像データを文書ごとに分割して、分割データを生成する。処理は、ステップＳ１０５に進む。 In step S104, the division unit 211 divides the imaging data for each document to generate the divided data. The process proceeds to step S105.

ステップＳ１０５において、制御部２１は、分割データに対して文字画像の検索を実行し、文書に対応するテキスト情報を取得する。処理は、ステップＳ１０６に進む。 In step S105, the control unit 21 executes a character image search for the divided data and acquires text information corresponding to the document. The process proceeds to step S106.

ステップＳ１０６において、制御部２１は、文字画像の取得率に基づいて、分割データの向きを修正する。具体的には、制御部２１は、文字画像を最も多く検索できた向きに分割データの向きを修正する。処理は、ステップＳ１０７に進む。 In step S106, the control unit 21 corrects the orientation of the divided data based on the acquisition rate of the character image. Specifically, the control unit 21 corrects the orientation of the divided data in the orientation in which the most character images can be searched. The process proceeds to step S107.

ステップＳ１０７において、制御部２１は、第１決定処理を実行する。第１決定処理については、図７を参照して後述する。処理は、ステップＳ１０８に進む。 In step S107, the control unit 21 executes the first determination process. The first determination process will be described later with reference to FIG. 7. The process proceeds to step S108.

ステップＳ１０８において、学習部２１８は、文書データとページ番号と推定結果と決定結果とを学習する。処理は、終了する。 In step S108, the learning unit 218 learns the document data, the page number, the estimation result, and the determination result. The process ends.

次に、図７を参照して、制御部２１が実行する第１決定処理を説明する。図７は、第１決定処理のフローチャートを示す図である。第１決定処理は、ステップＳ２０１〜ステップＳ２１０を含む。図７に示す第１決定処理は、図６に示すステップＳ１０７に対応する。 Next, the first determination process executed by the control unit 21 will be described with reference to FIG. 7. FIG. 7 is a diagram showing a flowchart of the first determination process. The first determination process includes steps S201 to S210. The first determination process shown in FIG. 7 corresponds to step S107 shown in FIG.

ステップＳ２０１において、第１抽出部２１２は、第１分割データが含む第１文書の第１文末領域ＥＳ１から第１文字を抽出する。処理は、ステップＳ２０２に進む。 In step S201, the first extraction unit 212 extracts the first character from the first sentence end region ES1 of the first document included in the first partition data. The process proceeds to step S202.

ステップＳ２０２において、第１抽出部２１２は、第２分割データが含む第２文書の第２文頭領域ＢＳ２から第２文字を抽出する。処理は、ステップＳ２０３に進む。 In step S202, the first extraction unit 212 extracts the second character from the second sentence beginning region BS2 of the second document included in the second divided data. The process proceeds to step S203.

ステップＳ２０３において、第１決定部２１３は、第１文書の文末に位置する第１文字と第２文書の文頭に位置する第２文字とを学習部２１８に入力する。処理は、ステップＳ２０４に進む。 In step S203, the first determination unit 213 inputs the first character located at the end of the sentence of the first document and the second character located at the beginning of the sentence of the second document to the learning unit 218. The process proceeds to step S204.

ステップＳ２０４において、学習部２１８は、第１文字と第２文字との繋がりの程度を示す推定結果を出力する。処理は、ステップＳ２０５に進む。 In step S204, the learning unit 218 outputs an estimation result indicating the degree of connection between the first character and the second character. The process proceeds to step S205.

ステップＳ２０５において、第１抽出部２１２は、第１分割データが含む第１文書の第１文頭領域ＢＳ１から第１文字を抽出する。処理は、ステップＳ２０６に進む。 In step S205, the first extraction unit 212 extracts the first character from the first sentence head region BS1 of the first document included in the first partition data. The process proceeds to step S206.

ステップＳ２０６において、第１抽出部２１２は、第２分割データが含む第２文書の第２文末領域ＥＳ２から第２文字を抽出する。処理は、ステップＳ２０７に進む。 In step S206, the first extraction unit 212 extracts the second character from the second sentence end region ES2 of the second document included in the second divided data. The process proceeds to step S207.

ステップＳ２０７において、第１決定部２１３は、第１文書の文頭に位置する第１文字と第２文書の文末に位置する第２文字とを学習部２１８に入力する。処理は、ステップＳ２０８に進む。 In step S207, the first determination unit 213 inputs the first character located at the beginning of the sentence of the first document and the second character located at the end of the sentence of the second document to the learning unit 218. The process proceeds to step S208.

ステップＳ２０８において、学習部２１８は、第１文字と第２文字との繋がりの程度を示す推定結果を出力する。処理は、ステップＳ２０９に進む。 In step S208, the learning unit 218 outputs an estimation result indicating the degree of connection between the first character and the second character. The process proceeds to step S209.

ステップＳ２０９において、制御部２１は、他に分割データがあるか否かを判定する。他に分割データがある場合（ステップＳ２０９において、Ｙｅｓ）、処理はステップＳ２０１に戻る。他に分割データがない場合（ステップＳ２０９において、Ｎｏ）、処理はステップＳ２１０に進む。 In step S209, the control unit 21 determines whether or not there is other divided data. If there is other divided data (Yes in step S209), the process returns to step S201. If there is no other divided data (No in step S209), the process proceeds to step S210.

ステップＳ２０９でＮｏの場合、ステップＳ２１０において、第１決定部２１３は、第１分割データと第２分割データとの順序を決定する。処理は図６に示すステップＳ１０８に戻る。 If No in step S209, in step S210, the first determination unit 213 determines the order of the first division data and the second division data. The process returns to step S108 shown in FIG.
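When more than two pieces of divided data exist, the loop through step S209 yields a connection estimate for every ordered pair of pages. One way to turn those estimates into a full ordering is sketched below. This is an assumption, not the method stated in the description: it picks the permutation with the largest total connection degree by brute force, and the names `order_pages`, `toy_score`, and the words on page "C" are invented for the example.

```python
from itertools import permutations


def order_pages(pages, score):
    """Return the page order with the largest total connection degree.

    `pages` maps a page name to (head_word, tail_word); `score(tail, head)`
    is an assumed stand-in for the learning unit's estimate. Brute force is
    only practical for the handful of pages found on an N-in-1 sheet.
    """
    def total(order):
        return sum(
            score(pages[a][1], pages[b][0])
            for a, b in zip(order, order[1:])
        )
    return max(permutations(pages), key=total)


# Made-up connection degrees, for illustration only.
TOY_SCORES = {
    ("and", "multifunction"): 0.9,
    ("optimally", "for"): 0.8,
}


def toy_score(tail, head):
    return TOY_SCORES.get((tail, head), 0.0)


pages = {
    "C": ("for", "use"),                  # invented third page
    "B": ("multifunction", "optimally"),
    "A": ("Tokkyo", "and"),
}
print(order_pages(pages, toy_score))  # ('A', 'B', 'C')
```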

[Embodiment 2]
Next, the image forming apparatus 100 of the second embodiment will be described with reference to FIG. 8. The image forming apparatus 100 of the second embodiment differs from the image forming apparatus 100 of the first embodiment in that it has a second extraction unit 214, a second determination unit 215, a determination unit 216, and a selection unit 217. Hereinafter, the points in which the second embodiment differs from the first embodiment will be described, and the description of the parts that overlap with the first embodiment will be omitted.

図８は、実施形態２の制御部２１の構成を示す図である。制御部２１は、分割部２１１、第１抽出部２１２、第１決定部２１３、学習部２１８、第２抽出部２１４、及び第２決定部２１５を含む。制御部２１は、制御プログラムを実行することで、分割部２１１、第１抽出部２１２、第１決定部２１３、学習部２１８、第２抽出部２１４、及び第２決定部２１５として機能する。分割部２１１、第１抽出部２１２、第１決定部２１３、及び学習部２１８については、実施形態１と同様のため、説明を省略する。 FIG. 8 is a diagram showing the configuration of the control unit 21 of the second embodiment. The control unit 21 includes a division unit 211, a first extraction unit 212, a first determination unit 213, a learning unit 218, a second extraction unit 214, and a second determination unit 215. By executing the control program, the control unit 21 functions as a division unit 211, a first extraction unit 212, a first determination unit 213, a learning unit 218, a second extraction unit 214, and a second determination unit 215. The division unit 211, the first extraction unit 212, the first determination unit 213, and the learning unit 218 are the same as those in the first embodiment, and thus the description thereof will be omitted.

The second extraction unit 214 extracts a symbol located in a predetermined area of the document. Symbols include letters and numbers. The predetermined area of the document includes a header area or a footer area of the document. The second extraction unit 214 can therefore acquire the page number attached to the document.
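The extraction of a page-number symbol from the header or footer area can be illustrated with a minimal sketch. It is not the disclosed implementation: the function name `extract_page_symbol`, the `region_lines` parameter, and the regular expression are assumptions made for illustration, and the input is assumed to be the text data already generated for one piece of divided data.

```python
import re
from typing import Optional


def extract_page_symbol(page_text: str, region_lines: int = 1) -> Optional[str]:
    """Return a digit string found in the footer or header lines, else None.

    Hypothetical helper: the "predetermined area" is approximated by the
    first and last `region_lines` non-empty lines of the page text.
    """
    lines = [ln.strip() for ln in page_text.splitlines() if ln.strip()]
    if not lines:
        return None
    # Footer lines are checked first, then header lines (an arbitrary choice).
    candidates = lines[-region_lines:][::-1] + lines[:region_lines]
    for line in candidates:
        # Accept a short number with at most three non-digit characters
        # on either side, e.g. "3" or "- 4 -" (assumed page-number shapes).
        match = re.fullmatch(r"\D{0,3}(\d{1,4})\D{0,3}", line)
        if match:
            return match.group(1)
    return None


print(extract_page_symbol("arranges ...\n... which provides\n3"))
```

A page without a recognizable number yields `None`, in which case an order based on page numbers could not be determined for that page.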

The second determination unit 215 determines the order of the first divided data and the second divided data based on the extraction result of the second extraction unit 214. The result of the second determination unit 215 can thus be obtained in addition to the result of the first determination unit 213. As a result, the order of the first divided data and the second divided data can be determined accurately.

Next, the process by which the second determination unit 215 determines the order of the first divided data and the second divided data will be described with reference to FIGS. 8 to 10. FIG. 9 is a diagram showing a display screen 110 that displays divided images D representing the divided data. As shown in FIG. 9, the display screen 110 includes a first display area 111 and a second display area 112.

In the first display area 111, a preview image 113 showing the captured image RG2, which represents the imaging data, and a return button 114 are displayed. The captured image RG2 shown in FIG. 9 is a "4in1" image in which four images are aggregated on one sheet R.

In the second display area 112, a plurality of divided images D representing the divided data are displayed. The plurality of divided images D shown in FIG. 9 include a first divided image D1, a second divided image D2, a third divided image D3, and a fourth divided image D4.

The first divided image D1 is, for example, an image showing the document on the third page of a plurality of pages. The first divided image D1 includes a document. The first divided image D1 includes a first sentence beginning region BS1, a first sentence end region ES1, and a first extraction region CT1. The character string "arranges ..." is located in the first sentence beginning region BS1 of the first divided image D1 shown in FIG. 9. The character string "... which provides" is located in the first sentence end region ES1 of the first divided image D1 shown in FIG. 9. The symbol "3" is located in the first extraction region CT1 of the first divided image D1 shown in FIG. 9.

The second divided image D2 is, for example, an image showing the document on the fourth page of the plurality of pages. The second divided image D2 includes a document. The second divided image D2 includes a second sentence beginning region BS2, a second sentence end region ES2, and a second extraction region CT2. The character string "comprehensive services ..." is located in the second sentence beginning region BS2 of the second divided image D2 shown in FIG. 9. The character string "... in the UK." is located in the second sentence end region ES2 of the second divided image D2 shown in FIG. 9. The symbol "4" is located in the second extraction region CT2 of the second divided image D2 shown in FIG. 9.

The third divided image D3 is, for example, an image showing the document on the first page of the plurality of pages. The third divided image D3 includes a document. The third divided image D3 includes a third sentence beginning region BS3, a third sentence end region ES3, and a third extraction region CT3. The character string "Tokkyo Co., Ltd. ..." is located in the third sentence beginning region BS3 of the third divided image D3 shown in FIG. 9. The character string "... printers and" is located in the third sentence end region ES3 of the third divided image D3 shown in FIG. 9. The symbol "1" is located in the third extraction region CT3 of the third divided image D3 shown in FIG. 9.

The fourth divided image D4 is, for example, an image showing the document on the second page of the plurality of pages. The fourth divided image D4 includes a document. The fourth divided image D4 includes a fourth sentence beginning region BS4, a fourth sentence end region ES4, and a fourth extraction region CT4. The character string "multifunction devices ..." is located in the fourth sentence beginning region BS4 of the fourth divided image D4 shown in FIG. 9. The character string "... optimally" is located in the fourth sentence end region ES4 of the fourth divided image D4 shown in FIG. 9. The symbol "2" is located in the fourth extraction region CT4 of the fourth divided image D4 shown in FIG. 9.

The first divided image D1 and the third divided image D3 shown in FIG. 9 are located on the upstream side in the direction from the first display area 111 toward the second display area 112. The second divided image D2 and the fourth divided image D4 are located on the downstream side in that direction. Therefore, the first divided image D1 showing the third page is located upstream of the fourth divided image D4 showing the second page.

The first divided image D1 and the second divided image D2 shown in FIG. 9 are located on the upstream side in the direction from the preview image 113 toward the return button 114. The third divided image D3 and the fourth divided image D4 are located on the downstream side in that direction. Therefore, the first divided image D1 showing the third page and the second divided image D2 showing the fourth page are located upstream of the third divided image D3 showing the first page and the fourth divided image D4 showing the second page. In particular, the first divided image D1 showing the third page is located upstream of the third divided image D3 showing the first page. That is, the first divided image D1 to the fourth divided image D4 shown in FIG. 9 are not arranged in page order.

FIG. 10 is another diagram showing the display screen 110 that displays the divided images D representing the divided data. As shown in FIG. 10, the display screen 110 includes the first display area 111 and the second display area 112.

In the first display area 111, the preview image 113 showing the captured image RG2, which represents the imaging data, and the return button 114 are displayed. In the second display area 112, a plurality of divided images D representing the divided data are displayed. The plurality of divided images D shown in FIG. 10 include the first divided image D1, the second divided image D2, the third divided image D3, and the fourth divided image D4.

The third divided image D3 and the first divided image D1 shown in FIG. 10 are located on the upstream side in the direction from the first display area 111 toward the second display area 112. The fourth divided image D4 and the second divided image D2 are located on the downstream side in that direction. Therefore, the third divided image D3 showing the first page is located upstream of the fourth divided image D4 showing the second page. Likewise, the first divided image D1 showing the third page is located upstream of the second divided image D2 showing the fourth page.

The third divided image D3 and the fourth divided image D4 shown in FIG. 10 are located on the upstream side in the direction from the preview image 113 toward the return button 114. The first divided image D1 and the second divided image D2 shown in FIG. 10 are located on the downstream side in that direction. Therefore, the fourth divided image D4 showing the second page is located upstream of the first divided image D1 and the second divided image D2. That is, the first divided image D1 to the fourth divided image D4 shown in FIG. 10 are arranged in ascending page order.

When the first divided image D1 to the fourth divided image D4 are to be arranged in ascending order as shown in FIG. 10, the second extraction unit 214 extracts a symbol from each piece of the divided data. For example, the second extraction unit 214 extracts characters based on the text data that the control unit 21 generates for each piece of divided data. Specifically, the second extraction unit 214 extracts the number "3" from the first extraction region CT1 of the first divided image D1 shown in FIG. 9. The second extraction unit 214 extracts the number "4" from the second extraction region CT2 of the second divided image D2. The second extraction unit 214 extracts the number "1" from the third extraction region CT3 of the third divided image D3. The second extraction unit 214 extracts the number "2" from the fourth extraction region CT4 of the fourth divided image D4.

Then, the second determination unit 215 determines the order of the first divided data, the second divided data, the third divided data, and the fourth divided data based on the extraction result of the second extraction unit 214. The first divided data, the second divided data, the third divided data, and the fourth divided data are therefore arranged in page order. As a result, the user is spared the trouble of rearranging the divided data.
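As an illustrative sketch only, the ordering by extracted symbols can be expressed as a sort on the page numbers from FIG. 9. The function name `order_by_page_symbol` and the labels "D1" to "D4" used as dictionary keys are assumptions for this example, not names from the embodiment.

```python
from typing import Dict, List


def order_by_page_symbol(symbols: Dict[str, str]) -> List[str]:
    """Sort divided-data labels by the numeric page symbol extracted from each."""
    return sorted(symbols, key=lambda name: int(symbols[name]))


# Symbols extracted from the extraction regions CT1 to CT4 in FIG. 9.
symbols = {"D1": "3", "D2": "4", "D3": "1", "D4": "2"}
order = order_by_page_symbol(symbols)
print(order)  # ['D3', 'D4', 'D1', 'D2']
```

The resulting sequence D3, D4, D1, D2 corresponds to the arrangement shown in FIG. 10.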

Next, with reference to FIGS. 9 and 10, the processing of the first extraction unit 212 and the first determination unit 213 when there are two or more pieces of divided data will be described. The control unit 21 of the second embodiment can acquire both the determination result of the first determination unit 213 and the determination result of the second determination unit 215.

The first extraction unit 212 extracts characters based on the text data that the control unit 21 generates for each piece of divided data. Specifically, the first extraction unit 212 extracts the word "arranges" from the first sentence beginning region BS1 of the document included in the first divided image D1 shown in FIG. 9. The document included in the first divided image D1 corresponds to an example of the "first document". The word extracted from the first sentence beginning region BS1 corresponds to an example of the "first character".

Then, the first extraction unit 212 extracts the word "UK." from the second sentence end region ES2 of the document included in the second divided image D2 shown in FIG. 9. The document included in the second divided image D2 corresponds to an example of the "second document". The word extracted from the second sentence end region ES2 corresponds to an example of the "second character".

Then, the first extraction unit 212 extracts the word "and" from the third sentence end region ES3 of the document included in the third divided image D3 shown in FIG. 9. The document included in the third divided image D3 corresponds to an example of the "second document". The word extracted from the third sentence end region ES3 corresponds to an example of the "second character".

Then, the first extraction unit 212 extracts the word "optimally" from the fourth sentence end region ES4 of the document included in the fourth divided image D4 shown in FIG. 9. The document included in the fourth divided image D4 corresponds to an example of the "second document". The word extracted from the fourth sentence end region ES4 corresponds to an example of the "second character".

Then, the first determination unit 213 inputs the first character and the second character to the learning unit 218. The learning unit 218 in turn outputs an estimation result indicating the degree of connection between the first character and the second character.

For example, "UK." from the second sentence end region ES2 and "arranges" from the first sentence beginning region BS1 are input to the learning unit 218. The learning unit 218 then outputs a first estimation result indicating the degree of connection between "UK." and "arranges".

Similarly, "and" from the third sentence end region ES3 and "arranges" from the first sentence beginning region BS1 are input to the learning unit 218. The learning unit 218 then outputs a second estimation result indicating the degree of connection between "and" and "arranges".

Similarly, "optimally" from the fourth sentence end region ES4 and "arranges" from the first sentence beginning region BS1 are input to the learning unit 218. The learning unit 218 then outputs a third estimation result indicating the degree of connection between "optimally" and "arranges".

Then, the first determination unit 213 determines the order of the first divided data and the second divided data based on the estimation result, output by the learning unit 218, that indicates the degree of connection between the first character and the second character. Specifically, the first determination unit 213 compares the first estimation result, the second estimation result, and the third estimation result, and determines the order of the first divided data and the second divided data based on the estimation result showing the greatest degree of connection.

Further, the first extraction unit 212 repeats the same processing for each divided image D. Each time the first extraction unit 212 executes the processing, the first determination unit 213 determines the order of the first divided data and the second divided data. Then, based on the determination result of the first determination unit 213, the control unit 21 displays the first divided image D1 to the fourth divided image D4 in page order, as shown in FIG. 10. As a result, the first divided image D1 to the fourth divided image D4 can be read consecutively.
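The comparison of estimation results and the repetition over the divided images can be sketched as follows. This is a hedged illustration, not the embodiment's algorithm: the greedy chaining strategy, the function name `order_by_connection`, and the score table `toy_scores` are all assumptions, and the numeric scores are invented stand-ins for the output of the learning unit 218.

```python
from typing import Callable, Dict, List, Tuple

# (word in the sentence beginning region, word in the sentence end region)
Page = Tuple[str, str]


def order_by_connection(pages: Dict[str, Page],
                        score: Callable[[str, str], float]) -> List[str]:
    """Chain pages so each page end connects best to the next page beginning."""
    names = list(pages)
    if len(names) < 2:
        return names

    def best_incoming(name: str) -> float:
        # Highest connection score from any other page's end to this beginning.
        return max(score(pages[other][1], pages[name][0])
                   for other in names if other != name)

    # Assumed heuristic: the first page is the one no other page leads into well.
    current = min(names, key=best_incoming)
    ordered = [current]
    remaining = [n for n in names if n != current]
    while remaining:
        nxt = max(remaining, key=lambda n: score(pages[current][1], pages[n][0]))
        ordered.append(nxt)
        remaining.remove(nxt)
        current = nxt
    return ordered


# Words taken from FIG. 9; the scores below are made up for illustration.
pages = {
    "D1": ("arranges", "provides"),
    "D2": ("comprehensive", "UK."),
    "D3": ("Tokkyo", "and"),
    "D4": ("multifunction", "optimally"),
}
toy_scores = {
    ("and", "multifunction"): 0.9,
    ("optimally", "arranges"): 0.8,
    ("provides", "comprehensive"): 0.85,
}
result = order_by_connection(pages, lambda end, begin: toy_scores.get((end, begin), 0.1))
print(result)  # ['D3', 'D4', 'D1', 'D2']
```

With these invented scores, "optimally" connects to "arranges" more strongly than "UK." or "and" does, so D4 is placed immediately before D1, matching the arrangement in FIG. 10.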

Next, the control unit 21 of the second embodiment will be described in more detail with reference to FIGS. 8 to 10. The control unit 21 further includes a determination unit 216 and a selection unit 217. By executing the control program, the control unit 21 functions as the determination unit 216 and the selection unit 217.

The determination unit 216 determines whether or not the determination result of the first determination unit 213 matches the determination result of the second determination unit 215. When the two determination results match, it can be judged that the accuracy of both the first determination unit 213 and the second determination unit 215 is high. When the two determination results do not match, it can be judged that the accuracy of either the first determination unit 213 or the second determination unit 215 is low. As a result, the determination result of the determination unit 216 can serve as a trigger for judging which of the first determination unit 213 and the second determination unit 215 is superior.

The selection unit 217 selects either the determination result of the first determination unit 213 or the determination result of the second determination unit 215. Specifically, when the determination unit 216 determines that the determination results match, the selection unit 217 selects the determination result of the first determination unit 213. When the determination unit 216 determines that the determination results do not match, the selection unit 217 selects one of the determination result of the first determination unit 213 and the determination result of the second determination unit 215. That is, when the first result and the second result differ, one of them is given priority. Therefore, the result of whichever determination unit can determine the order of the first document and the second document more accurately can be adopted. As a result, the user is spared the trouble of determining the order of the first divided data and the second divided data.

For example, the user stores in the storage unit 22, in advance, a setting specifying that the determination result of the second determination unit 215 is to be selected when the determination results do not match. In that case, when the determination unit 216 determines that the determination results do not match, the selection unit 217 selects the determination result of the second determination unit 215. As a result, even when the learning of the learning unit 218 has not progressed, the order of the divided data can be determined accurately by selecting the determination result of the second determination unit 215.

Conversely, once the learning of the learning unit 218 has progressed, the user stores in the storage unit 22, in advance, a setting specifying that the determination result of the first determination unit 213 is to be selected. In that case, when the determination unit 216 determines that the determination results do not match, the selection unit 217 selects the determination result of the first determination unit 213. As a result, the determination result of the first determination unit 213, which is based on an estimation result whose accuracy has improved through learning, can be selected, so the order of the divided data can be determined accurately.
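The behavior of the determination unit 216 and the selection unit 217 described above can be summarized in a short sketch. The function name `select_order` and the parameter `preferred_on_mismatch` (standing in for the setting stored in the storage unit 22) are assumptions for illustration.

```python
from typing import List


def select_order(first_result: List[str],
                 second_result: List[str],
                 preferred_on_mismatch: str = "second") -> List[str]:
    """Pick one ordering from the two determination results.

    first_result:  order from the first determination unit (learned connection).
    second_result: order from the second determination unit (page symbols).
    preferred_on_mismatch: "first" or "second"; hypothetical stored setting.
    """
    if first_result == second_result:
        # Results match: the first determination result is selected.
        return first_result
    # Results differ: fall back to the predetermined preference.
    return first_result if preferred_on_mismatch == "first" else second_result


by_connection = ["D3", "D1", "D4", "D2"]   # e.g. learning not yet advanced
by_page_symbol = ["D3", "D4", "D1", "D2"]
print(select_order(by_connection, by_page_symbol))
```

Switching `preferred_on_mismatch` to `"first"` corresponds to the case in which the learning has progressed and the user prefers the first determination result.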

Next, the process executed by the control unit 21 of the second embodiment will be described with reference to FIG. 11. FIG. 11 shows a flowchart of the process executed by the control unit 21. The process executed by the control unit 21 includes steps S301 to S310. Steps S301 to S307 shown in FIG. 11 correspond to steps S101 to S107 shown in FIG. 6, and the same processing is executed in them.

After step S307, in step S308, the control unit 21 executes the second determination process. The second determination process will be described later with reference to FIG. 12. The process proceeds to step S309.

In step S309, the control unit 21 executes the selection process. The selection process will be described later with reference to FIG. 13. The process proceeds to step S310.

In step S310, the learning unit 218 learns the document data, the page numbers, the estimation result, the determination result of the first determination unit 213, and the determination result of the second determination unit 215. The process then ends.

Next, the second determination process will be described with reference to FIG. 12. FIG. 12 shows a flowchart of the second determination process executed by the control unit 21. The second determination process executed by the control unit 21 includes steps S401 to S403. The second determination process corresponds to step S308 shown in FIG. 11.

In step S401, the second extraction unit 214 extracts the symbol located in the extraction region CT of the document included in the divided data. The process proceeds to step S402.

In step S402, the control unit 21 determines whether or not there is other divided data. If there is other divided data (Yes in step S402), the process returns to step S401. If there is no other divided data (No in step S402), the process proceeds to step S403.

In step S403, the second determination unit 215 determines the order of the first divided data and the second divided data. The process returns to step S309 shown in FIG. 11.

Next, the selection process will be described with reference to FIG. 13. FIG. 13 shows a flowchart of the selection process executed by the control unit 21. The selection process executed by the control unit 21 includes steps S501 to S503. The selection process corresponds to step S309 shown in FIG. 11.

In step S501, the determination unit 216 determines whether or not the determination result of the first determination unit 213 matches the determination result of the second determination unit 215. If the determination results match (Yes in step S501), the process proceeds to step S502. If the determination results do not match (No in step S501), the process proceeds to step S503.

If the result in step S501 is No, the selection unit 217 selects the predetermined determination result in step S503. The process returns to step S310 shown in FIG. 11.

If the result in step S501 is Yes, the selection unit 217 selects the determination result of the first determination unit 213 in step S502. The process returns to step S310 shown in FIG. 11.

The embodiments of the present invention have been described above with reference to the drawings. However, the present invention is not limited to the above embodiments and can be implemented in various forms without departing from its gist. In addition, various inventions can be formed by appropriately combining the plurality of components disclosed in the above embodiments. For example, some components may be removed from all the components shown in the embodiments. Further, components from different embodiments may be combined as appropriate. To make the drawings easier to understand, each component is shown schematically, and the thickness, length, number, spacing, and the like of each illustrated component may differ from the actual ones for convenience in preparing the drawings. Further, the speed, material, shape, dimensions, and the like of each component shown in the above embodiments are merely examples and are not particularly limiting; various changes can be made without substantially departing from the configuration of the present invention.

(1) In the second embodiment, when the determination unit 216 determines that the determination results do not match, the selection unit 217 selects the determination result of a predetermined determination unit. However, when the determination results do not match, the control unit 21 may instead let the user determine the order of the divided data. The learning unit 218 then learns the order determined by the user. The learning unit 218 can therefore learn accurately. As a result, the learning unit 218 can output an accurate estimation result.
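A minimal sketch of this modification follows. The callback `ask_user` and the list `training_log` are hypothetical stand-ins for the user interface and for the data later learned by the learning unit 218; neither name appears in the embodiment.

```python
from typing import Callable, List, Tuple

Order = List[str]


def resolve_order(first_result: Order,
                  second_result: Order,
                  ask_user: Callable[[Order, Order], Order],
                  training_log: List[Tuple[Order, Order, Order]]) -> Order:
    """Use the user's decision when the two determination results disagree."""
    if first_result == second_result:
        return first_result
    chosen = ask_user(first_result, second_result)
    # Record the user's decision so that it can be learned afterwards.
    training_log.append((first_result, second_result, chosen))
    return chosen


log: List[Tuple[Order, Order, Order]] = []
# Here the "user" is simulated by a function that picks the second candidate.
final = resolve_order(["D3", "D1"], ["D1", "D3"], lambda a, b: b, log)
print(final, len(log))
```

In this sketch, only disagreements are logged, on the assumption that those are the cases in which the user's decision adds information for learning.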

The present invention is applicable to the field of image forming apparatuses.

2 Image reading unit (imaging unit)
21 Control unit
100 Image forming apparatus
211 Division unit
212 First extraction unit
213 First determination unit
214 Second extraction unit
215 Second determination unit
216 Determination unit
217 Selection unit
218 Learning unit
CT Extraction region (predetermined area)
P Sheet
R Sheet

Claims

An image forming apparatus that forms a document on a sheet based on document data indicating the document, the image forming apparatus comprising:
a learning unit that learns document data including the document in order to estimate the connection between characters;
an imaging unit that generates imaging data by imaging a sheet on which a plurality of documents are aggregated;
a division unit that divides the imaging data into individual pages of the document to generate a plurality of pieces of divided data; and
a first extraction unit that extracts characters from each piece of the divided data,
wherein
the divided data includes first divided data and second divided data different from the first divided data,
the first extraction unit
extracts a first character from a first document included in the first divided data, and
extracts a second character from a second document included in the second divided data,
the first document is a document represented by the divided data for one page,
the second document is a document represented by the divided data for one page different from the first document,
the first character is a character included in either the beginning of a sentence indicating the position where the description of the first document starts or the end of a sentence indicating the position where the description of the first document ends,
the second character is a character included in whichever of the beginning of a sentence indicating the position where the description of the second document starts and the end of a sentence indicating the position where the description of the second document ends differs from the position including the first character, and
the learning unit, upon receiving the first character and the second character as input, outputs an estimation result indicating the degree of connection between the first character and the second character.

The image forming apparatus according to claim 1, wherein the document data learned by the learning unit is data including the document formed by the image forming apparatus on a sheet.

The image forming apparatus according to claim 1 or 2, wherein the learning unit further learns the document data and the page number corresponding to the document data.

The image forming apparatus according to claim 1 or 2, wherein each of the first character and the second character contains a single character, a word, and a morpheme.

The image forming apparatus according to any one of claims 1 to 4, wherein the first extraction unit extracts the first character located at the end of the text of the first document and extracts the second character located at the beginning of the text of the second document.

The image forming apparatus according to any one of claims 1 to 5, wherein the first extraction unit extracts the first character located at the beginning of the text of the first document and extracts the second character located at the end of the text of the second document.

The image forming apparatus according to any one of claims 1 to 6, further comprising a first determination unit that determines the order of the first divided data and the second divided data based on the estimation result of the learning unit.
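
One possible reading of this order determination, given as a minimal sketch: the degree of connection is estimated in both directions (end of the first page to beginning of the second, and the reverse) and the direction with the higher value is taken as the page order. The function names and the `connection` callable are assumptions for illustration; the claim only requires that the order be based on the estimation result.

```python
from typing import Callable


def edge_chars(page_text: str) -> tuple[str, str]:
    """Return the first and last non-whitespace character of a page's text."""
    stripped = "".join(page_text.split())
    return stripped[:1], stripped[-1:]


def determine_order(
    first_text: str,
    second_text: str,
    connection: Callable[[str, str], float],
) -> tuple[int, int]:
    """Return (0, 1) if the first divided data should precede the second, else (1, 0)."""
    head1, tail1 = edge_chars(first_text)
    head2, tail2 = edge_chars(second_text)
    forward = connection(tail1, head2)   # first page followed by second page
    backward = connection(tail2, head1)  # second page followed by first page
    # Ties keep the original order (an arbitrary choice made for this sketch).
    return (0, 1) if forward >= backward else (1, 0)
```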

The image forming apparatus according to claim 7, further comprising:
A second extraction unit that extracts a symbol located in a predetermined area of the document, and
A second determination unit that determines the order of the first divided data and the second divided data based on the extraction result of the second extraction unit.
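
A minimal sketch of this symbol-based ordering follows, under the assumption that the symbol is a printed page number and that the predetermined area is the last text line of each page. Neither assumption is fixed by the claim, and both function names are hypothetical.

```python
import re
from typing import Optional


def extract_page_symbol(page_lines: list[str]) -> Optional[int]:
    """Look for a number in the last line of a page (the assumed predetermined area)."""
    if not page_lines:
        return None
    match = re.search(r"\d+", page_lines[-1])
    return int(match.group()) if match else None


def order_by_symbol(
    first_lines: list[str], second_lines: list[str]
) -> Optional[tuple[int, int]]:
    """Return (0, 1) or (1, 0) from the extracted symbols, or None if undecidable."""
    a = extract_page_symbol(first_lines)
    b = extract_page_symbol(second_lines)
    if a is None or b is None or a == b:
        return None
    return (0, 1) if a < b else (1, 0)
```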

The image forming apparatus according to claim 8, further comprising:
A determination unit that determines whether or not the determination result of the first determination unit and the determination result of the second determination unit match, and
A selection unit that selects the determination result of the first determination unit or the determination result of the second determination unit,
wherein, when the determination unit determines that the determination result of the first determination unit and the determination result of the second determination unit do not match, the selection unit selects either one of the determination result of the first determination unit and the determination result of the second determination unit.
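
The match check and selection can be sketched as below. The claim does not state which result is chosen on a mismatch, so the `prefer` parameter is a hypothetical policy switch introduced only for this example.

```python
from typing import Literal

Order = tuple[int, int]


def select_order(
    first_result: Order,
    second_result: Order,
    prefer: Literal["first", "second"] = "second",
) -> Order:
    """Return the agreed order, or the preferred unit's result when the two disagree."""
    if first_result == second_result:
        # Both determination units agree, so no selection is needed.
        return first_result
    # Mismatch: pick one of the two results according to the assumed policy.
    return first_result if prefer == "first" else second_result
```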

The image forming apparatus according to claim 8 or 9, wherein the learning unit learns the document data, the page number corresponding to the document data, the estimation result, the determination result of the first determination unit, and the determination result of the second determination unit.