JP2020134964A

JP2020134964A - Image processing apparatus and image processing method

Info

Publication number: JP2020134964A
Application number: JP2019022831A
Authority: JP
Inventors: リチェル・ロペズ; Ropez Richell
Original assignee: Kyocera Document Solutions Inc
Current assignee: Kyocera Document Solutions Inc
Priority date: 2019-02-12
Filing date: 2019-02-12
Publication date: 2020-08-31

Abstract

To read an image of a document and classify the image of the document based on the characteristics of the entire image of the document.SOLUTION: In an image processing apparatus 10, when a training mode is set by an operation from an operation unit (operation unit 22 or touch panel 23), a control unit 31 extracts the characteristics of an image of a document read by an image reading unit 11, and learns learning data from which the type of the document input by the operation from the operation unit is derived based on the characteristics of the document. When a classification mode is set by an operation from the operation unit, the control unit 31 extracts the characteristics of the image of the document read by the image reading unit 11, determines the type of the document based on the characteristics of the image of the document and the learning data, and adds information indicating the determined type to image data of the document.SELECTED DRAWING: Figure 2

Description

本発明は、原稿の画像データを取得する画像処理装置及び画像処理方法に関し、特に原稿の画像データを分類するための技術に関する。 The present invention relates to an image processing apparatus and an image processing method for acquiring image data of a manuscript, and more particularly to a technique for classifying image data of a manuscript.

画像処理装置としては、複合機（ＭＦＰ）、コピー機、ファクシミリ装置などがある。このような画像処理装置においては、読取った原稿の画像データを自動的に分類できることが望ましい。 Image processing devices include multifunction devices (MFPs), copiers, facsimile machines, and the like. In such an image processing apparatus, it is desirable that the image data of the scanned document can be automatically classified.

例えば、特許文献１に記載の文書管理システムでは、読取られた文書画像データを文章領域、表領域、罫線、図領域などに分けて、各領域のレイアウト情報を抽出し、文書の画像データにおける特定の位置にある領域に対して文字認識あるいはバーコード認識を実行し、文字認識やバーコード認識によって読み取られた情報を属性として文書画像データに付加して保存し、その属性を用いることで文書画像データを検索できるようにしている。 For example, in the document management system described in Patent Document 1, the read document image data is divided into a text area, a table area, a ruled line, a figure area, and the like, layout information of each area is extracted, and the document is specified in the image data of the document. Character recognition or bar code recognition is executed for the area at the position of, and the information read by character recognition or bar code recognition is added to the document image data as an attribute and saved, and the document image is used by using that attribute. The data can be searched.

特開２００２−３４２３４３号公報JP-A-2002-342343

しかしながら、特許文献１に記載の文書管理システムにおいては、文書画像における特定の位置の領域に対する文字認識やバーコード認識により読取られた情報を文書画像の属性としているので、この属性は、特定の位置の領域の特徴に基づくものであって、文書画像全体の特徴に基づくものではなく、文書画像の分類が適確に行われているとは言い難い。 However, in the document management system described in Patent Document 1, since the information read by character recognition or bar code recognition for a region at a specific position in the document image is used as an attribute of the document image, this attribute is a specific position. It is hard to say that the document images are properly classified because they are based on the characteristics of the area of the document image and not based on the characteristics of the entire document image.

本発明は、上記の事情に鑑みなされたものであり、原稿の画像全体の特徴に基づき該原稿の画像を分類することを目的とする。 The present invention has been made in view of the above circumstances, and an object of the present invention is to classify the images of the manuscript based on the characteristics of the entire image of the manuscript.

本発明の一局面にかかる画像処理装置は、原稿の画像データを取得する画像取得部と、ユーザーによる操作で設定及び指示が入力される操作部と、前記操作部に入力される指示に従ってトレーニングモード又は分類モードを設定する制御部とを備え、前記制御部は、前記操作部への操作により前記トレーニングモードの設定が入力された場合に、前記画像取得部により画像データが取得された原稿の画像の特徴を抽出して、当該原稿の画像の特徴を示すデータを、該原稿の種類を判別するための学習データとして収集し、前記操作部への操作により前記分類モードが設定された場合には、前記画像取得部により画像データが取得された原稿の画像の特徴を抽出して、当該原稿の画像の特徴及び前記学習データに基づき当該原稿の種類を判別し、該判別した種類を示す情報を当該原稿の画像データに付加するものである。 The image processing device according to one aspect of the present invention includes an image acquisition unit for acquiring image data of a document, an operation unit for inputting settings and instructions by user operation, and a training mode according to instructions input to the operation unit. Alternatively, the control unit includes a control unit for setting the classification mode, and the control unit is an image of a manuscript whose image data has been acquired by the image acquisition unit when the training mode setting is input by operating the operation unit. When the characteristics of the original are extracted, data indicating the characteristics of the image of the original is collected as learning data for determining the type of the original, and the classification mode is set by operating the operation unit. , The feature of the image of the manuscript whose image data has been acquired by the image acquisition unit is extracted, the type of the manuscript is discriminated based on the feature of the image of the manuscript and the learning data, and the information indicating the discriminated type is obtained. It is added to the image data of the manuscript.

また、本発明の一局面にかかる画像処理方法は、原稿の画像データを取得する画像取得ステップと、ユーザーによる操作部の操作を受け付ける操作ステップと、前記操作ステップでトレーニングモードが設定された場合に、前記画像取得ステップで画像データを取得された原稿の画像の特徴を抽出して、当該原稿の画像の特徴を示すデータを、該原稿の種類を判別するための学習データとして収集し、前記操作ステップで分類モードが設定された場合には、前記画像取得ステップで画像データを取得された原稿の画像の特徴を抽出して、当該原稿の画像の特徴及び前記学習データに基づき当該原稿の種類を判別し、該判別した種類を示す情報を当該原稿の画像データに付加する制御ステップと、を含むものである。 Further, the image processing method according to one aspect of the present invention includes an image acquisition step for acquiring image data of a document, an operation step for accepting an operation of an operation unit by a user, and a training mode set in the operation step. , The feature of the image of the manuscript whose image data was acquired in the image acquisition step is extracted, and the data showing the feature of the image of the manuscript is collected as learning data for discriminating the type of the manuscript, and the operation is performed. When the classification mode is set in the step, the characteristics of the image of the manuscript whose image data was acquired in the image acquisition step are extracted, and the type of the manuscript is determined based on the characteristics of the image of the manuscript and the learning data. It includes a control step of discriminating and adding information indicating the discriminated type to the image data of the manuscript.

本発明によれば、原稿の画像を読み取って、原稿の画像全体の特徴に基づき該原稿の画像を分類することができる。 According to the present invention, an image of a manuscript can be read and the image of the manuscript can be classified based on the characteristics of the entire image of the manuscript.

本発明の一実施形態にかかる画像処理装置を示す断面図である。It is sectional drawing which shows the image processing apparatus which concerns on one Embodiment of this invention. 本実施形態の画像処理装置の主要内部構成を示すブロック図である。It is a block diagram which shows the main internal structure of the image processing apparatus of this embodiment. ディープラーニングの学習データを生成するためのトレーニングモードの処理手順を示すフローチャートである。It is a flowchart which shows the processing procedure of the training mode for generating the learning data of deep learning. 表示部に表示された分類機能入力画面を示す図である。It is a figure which shows the classification function input screen displayed on the display part. 表示部に表示されたトレーニングモード設定画面を示す図である。It is a figure which shows the training mode setting screen displayed on the display part. ディープラーニングの概念を示す図である。It is a figure which shows the concept of deep learning. 原稿の画像の特徴である画像の色相、コントラスト、色数、濃度と、タグ名である招待状、領収書との対応関係を例示する図表である。It is a chart exemplifying the correspondence between the hue, contrast, the number of colors, and the density of the image, which are the features of the image of the manuscript, and the invitation and the receipt, which are the tag names. 分類モードでの処理手順を示すフローチャートである。It is a flowchart which shows the processing procedure in a classification mode. 表示部に表示された分類モード設定画面を示す図である。It is a figure which shows the classification mode setting screen displayed on the display part. 表示部に表示されたポップアップ画面を示す図である。It is a figure which shows the pop-up screen displayed on the display part. 表示部に表示された他のポップアップ画面を示す図である。It is a figure which shows the other pop-up screen displayed on the display part. 表示部に表示された原稿の画像のプロパティを示す図である。It is a figure which shows the property of the image of the original which was displayed on the display part. 表示部に表示された原稿の画像の他のプロパティを示す図である。It is a figure which shows the other property of the image of a manuscript displayed on the display part. 表示部に表示されたファイルの一覧を示す図である。It is a figure which shows the list of the file displayed on the display part. 表示部に表示された送信設定画面の一例を示す図である。It is a figure which shows an example of the transmission setting screen displayed on the display part.

以下、本発明の一実施形態にかかる画像処理装置について図面を参照して説明する。 Hereinafter, the image processing apparatus according to the embodiment of the present invention will be described with reference to the drawings.

図１は、本発明の一実施形態にかかる画像処理装置を示す断面図である。図１に示すように本実施形態の画像処理装置１０は、例えばコピー機能、プリンター機能、ファクシミリ機能のような複数の機能を兼ね備えたＭＦＰ（画像形成装置）である。この画像処理装置１０は、画像読取部１１と、画像形成部１２とを備えている。 FIG. 1 is a cross-sectional view showing an image processing apparatus according to an embodiment of the present invention. As shown in FIG. 1, the image processing apparatus 10 of the present embodiment is an MFP (image forming apparatus) having a plurality of functions such as a copy function, a printer function, and a facsimile function. The image processing device 10 includes an image reading unit 11 and an image forming unit 12.

画像読取部１１は、原稿の画像を光学的に読み取る撮像素子を有しており、この撮像素子のアナログ出力がデジタル信号に変換されて、原稿の画像を示す画像データが生成される。画像読取部１１は、特許請求の範囲における画像取得部の一例である。 The image reading unit 11 has an image sensor that optically reads the image of the document, and the analog output of the image sensor is converted into a digital signal to generate image data indicating the image of the document. The image reading unit 11 is an example of an image acquisition unit within the scope of claims.

画像形成部１２は、上記画像データによって示される画像を記録紙に形成するものであり、マゼンタ用の画像形成ユニット３Ｍ、シアン用の画像形成ユニット３Ｃ、イエロー用の画像形成ユニット３Ｙ、及びブラック用の画像形成ユニット３Ｂｋを備えている。各画像形成ユニット３Ｍ、３Ｃ、３Ｙ、及び３Ｂｋのいずれにおいても、感光体ドラム４の表面を均一帯電させ、感光体ドラム４の表面を露光して、感光体ドラム４の表面に静電潜像を形成し、感光体ドラム４の表面の静電潜像をトナー像に現像して、感光体ドラム４の表面のトナー像を中間転写ベルト５に１次転写する。これにより、カラーのトナー像が中間転写ベルト５上に形成される。このカラーのトナー像は、中間転写ベルト５と２次転写ローラー６の間のニップ域Ｎにおいて給紙部１４から搬送路８を通じて搬送されてきた記録紙Ｐに２次転写される。 The image forming unit 12 forms the image indicated by the image data on the recording paper, and is used for the image forming unit 3M for magenta, the image forming unit 3C for cyan, the image forming unit 3Y for yellow, and the black. The image forming unit 3Bk of the above is provided. In each of the image forming units 3M, 3C, 3Y, and 3Bk, the surface of the photoconductor drum 4 is uniformly charged, the surface of the photoconductor drum 4 is exposed, and an electrostatic latent image is formed on the surface of the photoconductor drum 4. Is formed, the electrostatic latent image on the surface of the photoconductor drum 4 is developed into a toner image, and the toner image on the surface of the photoconductor drum 4 is primarily transferred to the intermediate transfer belt 5. As a result, a color toner image is formed on the intermediate transfer belt 5. The toner image of this color is secondarily transferred to the recording paper P conveyed from the paper feeding unit 14 through the transfer path 8 in the nip area N between the intermediate transfer belt 5 and the secondary transfer roller 6.

この後、定着装置１５で記録紙Ｐが加熱及び加圧されて、記録紙Ｐ上のトナー像が熱圧着により定着され、更に記録紙Ｐが排出ローラー１６を通じて排出トレイ１７に排出される。 After that, the recording paper P is heated and pressurized by the fixing device 15, the toner image on the recording paper P is fixed by thermocompression bonding, and the recording paper P is discharged to the discharge tray 17 through the discharge roller 16.

次に、図２は、本実施形態の画像処理装置１０の主要内部構成を示すブロック図である。図２に示すように本実施形態の画像処理装置１０は、画像読取部１１と、画像形成部１２と、表示部２１と、操作部２２と、タッチパネル２３と、ネットワークインターフェイス部９１と、画像メモリー２６と、画像処理部２７と、記憶部２８と、制御ユニット２９とを備えている。これらの構成要素は、互いにバスを通じてデータ又は信号の送受信が可能とされている。 Next, FIG. 2 is a block diagram showing a main internal configuration of the image processing device 10 of the present embodiment. As shown in FIG. 2, the image processing device 10 of the present embodiment includes an image reading unit 11, an image forming unit 12, a display unit 21, an operation unit 22, a touch panel 23, a network interface unit 91, and an image memory. 26, an image processing unit 27, a storage unit 28, and a control unit 29 are provided. These components are capable of transmitting and receiving data or signals through the bus to each other.

表示部２１は、液晶ディスプレイ（ＬＣＤ：Liquid Crystal Display）や有機ＥＬ（ＯＬＥＤ：Organic Light-Emitting Diode）ディスプレイなどから構成される。操作部２２は、テンキー、決定キー、スタートキーなどの物理キーを備えている。 The display unit 21 is composed of a liquid crystal display (LCD), an organic EL (OLED: Organic Light-Emitting Diode) display, and the like. The operation unit 22 includes physical keys such as a numeric keypad, an enter key, and a start key.

表示部２１の画面には、タッチパネル２３が配置されている。タッチパネル２３は、所謂抵抗膜方式や静電容量方式などのタッチパネルであって、タッチパネル２３に対するユーザーの指などの接触（タッチ）をその接触位置とともに検知して、その接触位置の座標を示す検知信号を制御ユニット２９の後述する制御部３１などに出力する。 A touch panel 23 is arranged on the screen of the display unit 21. The touch panel 23 is a touch panel of a so-called resistance film type or capacitance type, and detects contact (touch) of a user's finger or the like with the touch panel 23 together with the contact position, and a detection signal indicating the coordinates of the contact position. Is output to the control unit 31 or the like described later of the control unit 29.

ネットワークインターフェイス部９１は、ローカルエリア内、又はインターネット上の端末装置やサーバー等の外部装置と種々のデータの送受信を行うものである。ネットワークインターフェイス部９１は、特許請求の範囲における画像取得部の一例である。本発明の一実施形態に係る画像処理装置は、画像読取部１１又はネットワークインターフェイス部９１の少なくとも一方を備える。 The network interface unit 91 transmits and receives various data to and from an external device such as a terminal device or a server in the local area or on the Internet. The network interface unit 91 is an example of an image acquisition unit within the scope of claims. The image processing apparatus according to the embodiment of the present invention includes at least one of an image reading unit 11 and a network interface unit 91.

画像メモリー２６には、画像読取部１１により読取られた原稿の画像を示す画像データが記憶される。また、制御部３１は、画像処理部２７として機能し、画像処理部２７として、画像メモリー２６内の画像データに対して、シェーディング補正などの様々な画像処理を施す。 The image memory 26 stores image data indicating an image of the original document read by the image reading unit 11. Further, the control unit 31 functions as an image processing unit 27, and as the image processing unit 27, performs various image processing such as shading correction on the image data in the image memory 26.

なお、画像処理部２７は、制御部３１及び制御ユニット２９とは別個に設けられていてもよい。この場合、制御部３１及び画像処理部２７が特許請求の範囲における制御部の一例となる。 The image processing unit 27 may be provided separately from the control unit 31 and the control unit 29. In this case, the control unit 31 and the image processing unit 27 are examples of the control unit within the scope of the claims.

記憶部２８は、ＳＳＤ（Solid State Drive）、ＨＤＤ（Hard Disk Drive）などの大容量の記憶装置であって、各種のアプリケーションプログラムや種々のデータを記憶している。 The storage unit 28 is a large-capacity storage device such as an SSD (Solid State Drive) or an HDD (Hard Disk Drive), and stores various application programs and various data.

制御ユニット２９は、プロセッサー、ＲＡＭ(Random Access Memory）、及びＲＯＭ（Read Only Memory）などから構成される。プロセッサーは、例えばＣＰＵ（Central Processing Unit）、ＡＳＩＣ（Application Specific Integrated Circuit）、又はＭＰＵ（Micro Processing Unit）等である。制御ユニット２９は、上記のＲＯＭ又は記憶部２８に記憶された制御プログラムが上記のプロセッサーで実行されることにより、制御部３１として機能する。 The control unit 29 is composed of a processor, a RAM (Random Access Memory), a ROM (Read Only Memory), and the like. The processor is, for example, a CPU (Central Processing Unit), an ASIC (Application Specific Integrated Circuit), an MPU (Micro Processing Unit), or the like. The control unit 29 functions as the control unit 31 when the control program stored in the ROM or the storage unit 28 is executed by the processor.

制御部３１は、画像処理装置１０を統括的に制御する。制御ユニット２９は、画像読取部１１、画像形成部１２、表示部２１、操作部２２、タッチパネル２３、ネットワークインターフェイス部９１、画像メモリー２６、画像処理部２７、及び記憶部２８などと接続されている。制御部３１は、これらの構成要素の動作制御や、該各構成要素との間での信号またはデータの送受信を行う。 The control unit 31 comprehensively controls the image processing device 10. The control unit 29 is connected to an image reading unit 11, an image forming unit 12, a display unit 21, an operation unit 22, a touch panel 23, a network interface unit 91, an image memory 26, an image processing unit 27, a storage unit 28, and the like. .. The control unit 31 controls the operation of these components and transmits / receives signals or data to / from each component.

制御部３１は、画像処理装置１０による画像形成に必要な各種の処理などを実行する処理部としての役割を果たす。また、制御部３１は、タッチパネル２３から出力される検知信号あるいは操作部２２の物理キーの操作に基づき、ユーザーにより入力された操作指示を受け付ける。更に、制御部３１は、表示部２１の表示動作を制御する機能、及びネットワークインターフェイス部９１の通信動作を制御する機能を有する。 The control unit 31 serves as a processing unit that executes various processes necessary for image formation by the image processing device 10. Further, the control unit 31 receives an operation instruction input by the user based on the detection signal output from the touch panel 23 or the operation of the physical key of the operation unit 22. Further, the control unit 31 has a function of controlling the display operation of the display unit 21 and a function of controlling the communication operation of the network interface unit 91.

このような構成の画像処理装置１０において、例えば、ユーザーが、原稿を画像読取部１１にセットし、操作部２２又はタッチパネル２３を操作して、コピーの実行を指示すると、制御部３１は、画像読取部１１により原稿の画像を読取らせ、該原稿の画像を示す画像データを画像メモリー２６に記憶させる。画像処理部２７は、画像データに対して各種の処理を施す。画像形成部１２は、画像データを画像メモリー２６から取得して、該画像データによって示される原稿の画像を記録紙に形成する。 In the image processing device 10 having such a configuration, for example, when the user sets the document in the image reading unit 11 and operates the operation unit 22 or the touch panel 23 to instruct the execution of copying, the control unit 31 displays the image. The image of the original is read by the reading unit 11, and the image data indicating the image of the original is stored in the image memory 26. The image processing unit 27 performs various processes on the image data. The image forming unit 12 acquires image data from the image memory 26 and forms an image of the original document indicated by the image data on the recording paper.

また、ユーザーによる操作で操作部２２又はタッチパネル２３に、原稿の種類を示すタグ名と、トレーニングモードの設定と、画像読取部１１による原稿の画像の読取りの実行指示が入力され、制御部３１が、トレーニングモードの設定、当該タグ名、及び読取実行指示を受け付けた場合、制御部３１は、画像処理装置１０をトレーニングモードに設定し、画像読取部１１による原稿の画像の読取り、画像メモリー２６への画像データの記憶、及び画像処理部２７による画像データの処理を行わせる。更に、制御部３１は、画像メモリー２６に記憶された当該画像データに基づき、読み取られた原稿の画像の特徴を抽出する処理を実行する。そして、制御部３１は、このように抽出した、該原稿の画像の特徴を示すデータを、上記タグ名により示される原稿の種類を判別するための学習データとして収集する。 Further, the user operates the operation unit 22 or the touch panel 23 to input a tag name indicating the type of the original, a training mode setting, and an instruction to execute the image reading of the original by the image reading unit 11, and the control unit 31 presses. When the training mode setting, the tag name, and the reading execution instruction are received, the control unit 31 sets the image processing device 10 to the training mode, and the image reading unit 11 reads the image of the original to the image memory 26. The image data is stored and the image data is processed by the image processing unit 27. Further, the control unit 31 executes a process of extracting the features of the image of the scanned document based on the image data stored in the image memory 26. Then, the control unit 31 collects the data showing the characteristics of the image of the manuscript extracted in this way as learning data for discriminating the type of the manuscript indicated by the tag name.

この後、ユーザーが、更なる原稿を画像読取部１１にセットし、当該ユーザーによる操作で、操作部２２又はタッチパネル２３に、分類モードの設定と、画像読取部１１による原稿の画像の読取りの実行指示とが受け付けられると、制御部３１は、分類モードを設定し、画像読取部１１による当該原稿の画像の読取り、画像メモリー２６への画像データの記憶、及び画像処理部２７による画像データの処理を行わせる。この後、制御部３１は、更に、画像メモリー２６内の当該画像データに基づき当該原稿の画像の特徴を抽出する処理を実行し、当該原稿の画像の特徴、及び上記のようにして収集されている学習データに基づき、当該原稿の種類を示すタグ名を判別し、当該判定したタグ名を示す情報を、当該原稿の画像データに付加する。すなわち、制御部３１は、このようにタグ名を示す情報を付加することにより該他の原稿を分類する。 After that, the user sets a further document in the image reading unit 11, and the user operates the operation unit 22 or the touch panel 23 to set the classification mode and execute the image reading unit 11 to read the image of the document. When the instruction is received, the control unit 31 sets the classification mode, the image reading unit 11 reads the image of the original, the image data 26 stores the image data, and the image processing unit 27 processes the image data. To do. After that, the control unit 31 further executes a process of extracting the features of the image of the manuscript based on the image data in the image memory 26, and collects the features of the image of the manuscript and as described above. Based on the learning data, the tag name indicating the type of the manuscript is determined, and the information indicating the determined tag name is added to the image data of the manuscript. That is, the control unit 31 classifies the other manuscript by adding the information indicating the tag name in this way.

従って、制御部３１は、トレーニングモードに設定している状態では、画像読取部１１により読取られた原稿の画像の特徴を示すデータを、操作部２２又はタッチパネル２３に入力されたタグ名が示す原稿の種類として判別するための学習データとして収集する。また、制御部３１は、分類モードに設定している状態では、画像読取部１１により読み取られた原稿の画像から抽出した当該原稿の画像の特徴と、この時点で収集されている学習データとを比較することで、当該原稿の画像の特徴と一致又は近似する特徴を示す学習データに対応付けられているタグ名を判別する。そして、制御部３１は、当該タグ名を示す情報を、当該読み取られた原稿の画像に付加することにより、当該読み取られた原稿の画像を分類する。なお、制御部３１は、分類モードに設定している状態でも、画像読取部１１により読取られた原稿の画像の特徴を示すデータを、タグ名を判別するための学習データとして収集する。 Therefore, in the state where the control unit 31 is set to the training mode, the data indicating the characteristics of the image of the document read by the image reading unit 11 is the document indicated by the tag name input to the operation unit 22 or the touch panel 23. It is collected as training data for determining the type of. Further, in the state where the control unit 31 is set to the classification mode, the characteristics of the image of the original document extracted from the image of the original document read by the image reading unit 11 and the learning data collected at this time are obtained. By comparing, the tag name associated with the learning data indicating the feature that matches or approximates the feature of the image of the manuscript is determined. Then, the control unit 31 classifies the image of the scanned document by adding the information indicating the tag name to the image of the scanned document. Even when the classification mode is set, the control unit 31 collects data indicating the characteristics of the image of the original document read by the image reading unit 11 as learning data for determining the tag name.

次に、上記トレーニングモードでの処理手順を、図３に示すフローチャートなどを参照して詳細に説明する。 Next, the processing procedure in the training mode will be described in detail with reference to the flowchart shown in FIG.

まず、管理者（特定のユーザー）は、操作部２２又はタッチパネル２３を操作して、ドキュメント分類機能の実行の指示を入力する。操作部２２又はタッチパネル２３が当該指示の入力を受け付けると、制御部３１は、当該指示に応じて図４に示すような分類機能入力画面Ｇ１を表示部２１に表示させる（Ｓ１０１）。この図４に示す分類機能入力画面Ｇ１には、トレーニングモードの実行指示を入力するためのキーＫ１１、分類モードの実行指示を入力するためのキーＫ１２、及び識別情報ＩＤを入力するためのテキストボックスＴＢ１が表示されている。 First, the administrator (specific user) operates the operation unit 22 or the touch panel 23 to input an instruction to execute the document classification function. When the operation unit 22 or the touch panel 23 receives the input of the instruction, the control unit 31 causes the display unit 21 to display the classification function input screen G1 as shown in FIG. 4 in response to the instruction (S101). In the classification function input screen G1 shown in FIG. 4, a key K11 for inputting a training mode execution instruction, a key K12 for inputting a classification mode execution instruction, and a text box for inputting an identification information ID TB1 is displayed.

管理者による操作部２２又はタッチパネル２３の操作により、当該管理者の識別情報ＩＤがテキストボックスＴＢ１に入力され、トレーニングモードのキーＫ１１が押下されると、制御部３１は、テキストボックスＴＢ１に入力された識別情報ＩＤが当該管理者の識別情報ＩＤに一致するか否かを認証する（Ｓ１０２）。制御部３１が、テキストボックスＴＢ１に入力された識別情報ＩＤが当該管理者の識別情報ＩＤに一致すると判定した場合（Ｓ１０２でＹＥＳ）、画像処理装置１０をトレーニングモードに設定する（Ｓ１０３）。尚、制御部３１が、テキストボックスＴＢ１に入力された識別情報ＩＤが当該管理者の識別情報ＩＤに一致しないと判定した場合（Ｓ１０２でＮＯ）、制御部３１は、トレーニングモードを設定せず、処理は終了する。 By the operation of the operation unit 22 or the touch panel 23 by the administrator, the identification information ID of the administrator is input to the text box TB1, and when the training mode key K11 is pressed, the control unit 31 is input to the text box TB1. Authenticate whether or not the identification information ID matches the identification information ID of the administrator (S102). When the control unit 31 determines that the identification information ID entered in the text box TB1 matches the identification information ID of the administrator (YES in S102), the image processing device 10 is set to the training mode (S103). If the control unit 31 determines that the identification information ID entered in the text box TB1 does not match the identification information ID of the administrator (NO in S102), the control unit 31 does not set the training mode. The process ends.

このようにトレーニングモードが設定されている状態で、管理者が、原稿を画像読取部１１にセットし、操作部２２又はタッチパネル２３を操作して、スキャンの実行指示を入力すると、制御部３１は、当該入力されたスキャンの実行指示に従って、画像読取部１１により原稿の画像を読取らせ、画像読取部１１から出力された原稿の画像を示す画像データを画像メモリー２６に記憶させる。画像処理部２７は、画像データに対して各種の処理を施す（Ｓ１０４）。このトレーニングモードでの画像処理部２７による各種の処理は、後で述べるＳ１０９において原稿の画像の特徴を正確に抽出させるために行われる予備処理であって、画像の濃度、コントラスト、エッジ、及びコーナーについてこれらを強調する処理等である。 When the administrator sets the document in the image reading unit 11 and operates the operation unit 22 or the touch panel 23 in the state where the training mode is set in this way, the control unit 31 inputs a scan execution instruction. In accordance with the input scan execution instruction, the image reading unit 11 reads the image of the original, and the image data 26 indicating the image of the original output from the image reading unit 11 is stored in the image memory 26. The image processing unit 27 performs various processes on the image data (S104). The various processes performed by the image processing unit 27 in this training mode are preliminary processes performed in order to accurately extract the features of the image of the original in S109 described later, and are image density, contrast, edges, and corners. It is a process to emphasize these.

この後、制御部３１は、図５に示すようなトレーニングモード設定画面Ｇ２を表示部２１に表示させる（Ｓ１０５）。この図５に示すトレーニングモード設定画面Ｇ２には、画像読取部１１により読取られた原稿の画像Ｄ１、既に設定されている各タグ名に対応付けられたそれぞれのキーＫ２１、Ｋ２２、Ｋ２３、Ｋ２４、Ｋ２５…、新たな原稿の種類を示すタグ名を記入するためのテキストボックスＴＢ２、及びＯＫキー２０が表示されている。 After that, the control unit 31 causes the display unit 21 to display the training mode setting screen G2 as shown in FIG. 5 (S105). On the training mode setting screen G2 shown in FIG. 5, the image D1 of the document read by the image reading unit 11, and the keys K21, K22, K23, K24 associated with each tag name already set, are displayed. K25 ..., a text box TB2 for entering a tag name indicating a new type of manuscript, and an OK key 20 are displayed.

管理者は、表示部２１に表示されている原稿の画像Ｄ１を確認し、各キーＫ２１、Ｋ２２、…が示すいずれかのタグ名が、該原稿の種類を示すと判断した場合は、各キーＫ２１、Ｋ２２、…のうち、当該原稿の種類を示すタグ名を示すキーにタッチ操作し、また各キーＫ２１、Ｋ２２、…が示すいずれのタグ名も、該原稿の種類を示すタグ名ではないと判断した場合は、操作部２２又はタッチパネル２３を操作して、新たな原稿の種類を示すタグ名をテキストボックスＴＢ２に入力して、ＯＫキー２０にタッチ操作する。 When the administrator confirms the image D1 of the manuscript displayed on the display unit 21 and determines that any of the tag names indicated by the keys K21, K22, ... Indicates the type of the manuscript, each key Of K21, K22, ..., Touch operation is performed on the key indicating the tag name indicating the type of the manuscript, and none of the tag names indicated by the keys K21, K22, ... Is the tag name indicating the type of the manuscript. If it is determined, the operation unit 22 or the touch panel 23 is operated, a tag name indicating a new document type is input into the text box TB2, and the OK key 20 is touch-operated.

制御部３１は、タッチパネル２３を通じて各キーＫ２１、Ｋ２２、…のいずれかのタッチ操作を検出すると（Ｓ１０６「キー」）、該指示されたキーに対応付けられているタグ名を取得する（Ｓ１０７）。また、制御部３１は、タッチパネル２３を通じてＯＫキー２０のタッチ操作を検出すると（Ｓ１０６「タグ名」）、テキストボックスＴＢ２に入力されたタグ名を取得する（Ｓ１０８）。 When the control unit 31 detects a touch operation of any of the keys K21, K22, ... Through the touch panel 23 (S106 "key"), the control unit 31 acquires the tag name associated with the instructed key (S107). .. Further, when the control unit 31 detects the touch operation of the OK key 20 through the touch panel 23 (S106 “tag name”), the control unit 31 acquires the tag name input in the text box TB2 (S108).

そして、制御部３１は、画像メモリー２６内の原稿の画像Ｄ１を示す画像データを解析して、原稿の画像Ｄ１の特徴を抽出する（Ｓ１０９）。 Then, the control unit 31 analyzes the image data indicating the image D1 of the original in the image memory 26 and extracts the features of the image D1 of the original (S109).

更に、制御部３１は、Ｓ１０９で抽出した当該原稿の画像Ｄ１の特徴を示すデータを、Ｓ１０７又はＳ１０８で取得したタグ名が示す、当該原稿の種類を判別するための例えばディープラーニング用の学習データとして収集する（Ｓ１１０）。制御部３１は、このディープラーニングの学習データを記憶部２８に記憶させる（Ｓ１１０）。 Further, the control unit 31 uses the data indicating the characteristics of the image D1 of the manuscript extracted in S109, which is indicated by the tag name acquired in S107 or S108, for example, learning data for deep learning for determining the type of the manuscript. Collect as (S110). The control unit 31 stores the deep learning learning data in the storage unit 28 (S110).

上記では、制御部３１による画像データの解析及び画像の特徴抽出の対象とされるのは、画像読取部１１により読み取られた原稿画像とされているが、これに代えて、ネットワークインターフェイス部９１により上記端末装置又はサーバー等から取得された画像を、画像データの解析及び画像の特徴抽出の対象としてもよい。 In the above, the target of image data analysis and image feature extraction by the control unit 31 is the original image read by the image reading unit 11, but instead, the network interface unit 91 An image acquired from the terminal device, server, or the like may be a target for image data analysis and image feature extraction.

例えば、原稿の種類を示すタグ名は、契約書、領収書、証明書、招待状、又はポスターなどである。また、制御部３１は、画像処理部２７により、原稿の画像の特徴として、画像の色相、コントラスト、色数、又は濃度などの要素を抽出する。学習データは、制御部３１が行う上記ディープラーニングによる学習に用いられる。 For example, the tag name indicating the type of manuscript may be a contract, receipt, certificate, invitation, poster, or the like. Further, the control unit 31 extracts elements such as hue, contrast, number of colors, and density of the image as features of the image of the original by the image processing unit 27. The learning data is used for learning by the deep learning performed by the control unit 31.

例えば、管理者が、招待状を画像読取部１１にセットし、操作部２２又はタッチパネル２３を操作して、トレーニングモードの設定を入力し、「招待状」をタグ名として入力すると共に、スキャンの実行を指示したものとする。制御部３１は、タグ名として「招待状」を取得し、画像読取部１１により招待状の画像を読取らせて、招待状の画像を示す画像データを画像メモリー２６に記憶させ、画像処理部２７により画像データに対して各種の処理を施させる。そして、制御部３１は、招待状の画像の特徴を抽出し、該招待状の画像の特徴を示すデータを、当該原稿の種類を判別するための学習データとして収集する。 For example, the administrator sets the invitation card in the image reading unit 11, operates the operation unit 22 or the touch panel 23, inputs the training mode setting, inputs the "invitation card" as the tag name, and scans. It is assumed that the execution is instructed. The control unit 31 acquires an "invitation card" as a tag name, causes the image reading unit 11 to read the image of the invitation card, stores the image data indicating the image of the invitation card in the image memory 26, and stores the image data indicating the image of the invitation card in the image processing unit. 27 causes various processing on the image data. Then, the control unit 31 extracts the features of the image of the invitation and collects the data showing the features of the image of the invitation as learning data for determining the type of the manuscript.

図７は、原稿の画像の特徴である画像の色相、コントラスト、色数、濃度と、タグ名である招待状、領収書との対応関係を例示する図表Ａである。トレーニングモードでは、図７の図表Ａに示すように、管理者により入力された各タグ名に対応する画像の特徴として、画像の色相、コントラスト、色数、濃度の組み合わせが用いられる例を示している。この例では、タグ名として、招待状及び領収書が用いられる場合を示している。 FIG. 7 is Chart A illustrating the correspondence between the hue, contrast, number of colors, and density of the image, which are the characteristics of the image of the manuscript, and the invitation and the receipt, which are the tag names. In the training mode, as shown in Chart A of FIG. 7, an example is shown in which a combination of hue, contrast, number of colors, and density of an image is used as an image feature corresponding to each tag name input by the administrator. There is. In this example, an invitation and a receipt are used as the tag name.

トレーニングモードが設定された状態で、管理者は、原稿を画像読取部１１にセットし、操作部２２又はタッチパネル２３を操作して、原稿の種類を示すタグ名を入力すると共に、スキャンの実行を指示する。制御部３１は、画像読取部１１により読取られた原稿の画像の特徴を抽出し、当該原稿の画像の特徴を示すデータを、該原稿の種類を判別するための学習データとして収集する。 With the training mode set, the administrator sets the document in the image reading unit 11, operates the operation unit 22 or the touch panel 23, inputs a tag name indicating the type of the document, and executes scanning. Instruct. The control unit 31 extracts the characteristics of the image of the original document read by the image reading unit 11, and collects data indicating the characteristics of the image of the original document as learning data for determining the type of the original document.

本実施形態では、原稿の種類を判別するための処理に、ディープラーニングが用いられる。図６は、制御部３１が行うディープラーニングの概念を示す図である。この図６に示すディープラーニングでは、入力層ＩＬから画像の特徴を入力し、中間層（複数のノードＭＬ）を経由して出力層ＯＬからタグ名を出力する。各ノードＭＬをどのように接続するか、あるいは各ノードＭＬが「どのような入力に対して、どのような出力を行うか」などを調整していくことで、つまり学習データを学習して生成するかあるいは更新することで、入力された画像の特徴に基づき取得されるタグ名を決定して出力するものとなっている。 In the present embodiment, deep learning is used for the process for determining the type of the original. FIG. 6 is a diagram showing a concept of deep learning performed by the control unit 31. In the deep learning shown in FIG. 6, the features of the image are input from the input layer IL, and the tag name is output from the output layer OL via the intermediate layer (plurality of nodes ML). By adjusting how each node ML is connected, or "what kind of input and what kind of output is performed" by each node ML, that is, learning data is learned and generated. By doing or updating, the tag name to be acquired is determined based on the characteristics of the input image and output.

このディープラーニングにおいて、トレーニングモードでは、管理者により入力されたタグ名に対応する画像の特徴として入力層ＩＬから入力した画像の特徴を示す各要素を、ノードＭＬとして記憶する。当該ノードＭＬの集合が学習データの一例である。 In this deep learning, in the training mode, each element indicating the characteristics of the image input from the input layer IL as the characteristics of the image corresponding to the tag name input by the administrator is stored as the node ML. The set of the node ML is an example of learning data.

また、分類モードでは、出力層ＯＬでは、入力層ＩＬから入力した画像の特徴に対応するタグ名を暫定的に決定して出力する。この暫定的なタグ名が、管理者により正式なタグ名を示しているとして承認された場合は、各ノードＭＬを、当該承認されたタグ名と対応付けて記憶する。また、この暫定的なタグ名が、管理者により変更された場合には、各ノードＭＬが更新され、暫定的に決定されるタグ名が、管理者により変更されたタグ名に一致するまでその更新が行われる（すなわち、学習データが更新される）。各ノードＭＬの階層が深くなり、各ノードＭＬが多くなる程、複雑な分類や判断ができるようになり、入力された画像の特徴に対応するタグ名を正確に判定可能となる。従って、ディープラーニングの学習データは、トレーニングモード及び分類モードで処理された原稿の画像が多くなり、これにより各ノードＭＬの階層が深くなって、各ノードＭＬが多くなることでより有効なものとなる。好ましくは、正確なタグ名の取得率（暫定的に決定されたタグ名が、管理者により正式なタグ名を示しているとして承認される率）が例えば９５％以上となるように、トレーニングモードで処理された原稿の画像を増やすようにする。 Further, in the classification mode, the output layer OL tentatively determines the tag name corresponding to the feature of the image input from the input layer IL and outputs the tag name. When this provisional tag name is approved by the administrator as indicating a formal tag name, each node ML is stored in association with the approved tag name. If this provisional tag name is changed by the administrator, each node ML is updated until the provisionally determined tag name matches the tag name changed by the administrator. Updates are made (ie, the training data is updated). The deeper the hierarchy of each node ML and the larger the number of each node ML, the more complicated classification and judgment can be performed, and the tag name corresponding to the feature of the input image can be accurately determined. Therefore, the deep learning learning data is more effective because the number of images of the manuscript processed in the training mode and the classification mode increases, which deepens the hierarchy of each node ML and increases each node ML. Become. Preferably, the training mode is such that the correct tag name acquisition rate (the rate at which the tentatively determined tag name is approved by the administrator as indicating the official tag name) is, for example, 95% or more. Try to increase the number of images of the original processed in.

なお、制御部３１は、原稿の種類を判別するための処理に、ディープラーニングを用いずに、タグ名に対応付けて収集した原稿の画像の特徴を用い、タグ名と、収集した原稿の画像の特徴との対応関係に基づいて、原稿の種類を判別するものとしてもよい。 In addition, the control unit 31 uses the characteristics of the image of the original document collected in association with the tag name without using deep learning in the process for determining the type of the original, and the tag name and the image of the collected original document. The type of the original may be determined based on the correspondence with the characteristics of.

次に、上記分類モードでの処理手順を、図８に示すフローチャートなどを参照して詳細に説明する。 Next, the processing procedure in the above classification mode will be described in detail with reference to the flowchart shown in FIG.

ユーザーは、操作部２２又はタッチパネル２３を操作して、ドキュメント分類機能の実行を指示する。制御部３１は、ドキュメント分類機能の実行の指示に応じて図４に示す分類機能入力画面Ｇ１を表示部２１に表示させる（Ｓ２０１）。 The user operates the operation unit 22 or the touch panel 23 to instruct the execution of the document classification function. The control unit 31 causes the display unit 21 to display the classification function input screen G1 shown in FIG. 4 in response to an instruction to execute the document classification function (S201).

ユーザーは、操作部２２又はタッチパネル２３を操作して、自己の識別情報ＩＤをテキストボックスＴＢ１に記入して、分類モードのキーＫ１２を操作する。制御部３１は、この操作に基づいて分類モードの設定指示が操作部２２又はタッチパネル２３に入力されたとき（Ｓ２０２でＹＥＳ）、分類モードを設定する（Ｓ２０３）。なお、この設定指示の入力がないときは（Ｓ２０２でＮＯ）、処理は終了する。 The user operates the operation unit 22 or the touch panel 23 to enter his / her identification information ID in the text box TB1 and operates the key K12 in the classification mode. The control unit 31 sets the classification mode when the classification mode setting instruction is input to the operation unit 22 or the touch panel 23 based on this operation (YES in S202) (S203). If there is no input of this setting instruction (NO in S202), the process ends.

このように分類モードが設定された状態で、ユーザーが、原稿を画像読取部１１にセットし、操作部２２又はタッチパネル２３を操作して、スキャンの実行を指示すると、制御部３１は、画像読取部１１により原稿の画像を読取らせ、原稿の画像を示す画像データを画像メモリー２６に記憶させ、画像処理部２７により画像データに対して各種の処理を施させる（Ｓ２０４）。この分類モードでの画像処理部２７による各種の処理も、トレーニングモードと同様に、原稿の画像の特徴を正確に抽出させるために行われる画像の濃度、コントラスト、エッジ、コーナーを強調する処理等である。 With the classification mode set in this way, when the user sets the document in the image reading unit 11 and operates the operation unit 22 or the touch panel 23 to instruct the execution of scanning, the control unit 31 reads the image. The image of the original is read by the unit 11, the image data indicating the image of the original is stored in the image memory 26, and the image processing unit 27 performs various processing on the image data (S204). Similar to the training mode, the various processes performed by the image processing unit 27 in this classification mode are also the processes for emphasizing the density, contrast, edges, and corners of the image, which are performed to accurately extract the features of the image of the original. is there.

また、制御部３１は、図９に示すような分類モード設定画面Ｇ３を表示部２１に表示させる（Ｓ２０５）。この図９に示す分類モード設定画面Ｇ３には、画像読取部１１により読取られた原稿の画像データＤ２、及び開始キーＫ３１が表示されている。 Further, the control unit 31 causes the display unit 21 to display the classification mode setting screen G3 as shown in FIG. 9 (S205). On the classification mode setting screen G3 shown in FIG. 9, the image data D2 of the document read by the image reading unit 11 and the start key K31 are displayed.

例えば、ユーザーが、表示部２１に表示されている開始キーＫ３１にタッチ操作すると、制御部３１は、タッチパネル２３を通じて開始キーＫ３１のタッチ操作に基づいて分類処理の実行指示を受け付け、この指示に従って、画像処理部２７により画像メモリー２６内の原稿の画像データＤ２を示す画像データを解析して、原稿の画像データＤ２の特徴を抽出すると共に、原稿の画像データＤ２の特徴、及び記憶部２８に記憶されている上記ディープラーニングで得た学習データに基づいて、原稿の画像データＤ２のタグ名を判別する（Ｓ２０６）。そして、制御部３１は、図１０に示すようなポップアップ画面Ｐ１を分類モード設定画面Ｇ３内に表示させる。この図１０に示すポップアップ画面Ｐ１には、Ｓ２０６で判別された原稿の画像データＤ２のタグ名と共に、ＯＫキーＫ３２、及びＮＧキーＫ３３が表示される。 For example, when the user touches the start key K31 displayed on the display unit 21, the control unit 31 receives an instruction to execute the classification process based on the touch operation of the start key K31 through the touch panel 23, and follows this instruction. The image processing unit 27 analyzes the image data indicating the image data D2 of the original in the image memory 26, extracts the characteristics of the image data D2 of the original, and stores the characteristics of the image data D2 of the original and the storage unit 28. The tag name of the image data D2 of the manuscript is determined based on the learning data obtained by the above-mentioned deep learning (S206). Then, the control unit 31 displays the pop-up screen P1 as shown in FIG. 10 in the classification mode setting screen G3. On the pop-up screen P1 shown in FIG. 10, the OK key K32 and the NG key K33 are displayed together with the tag name of the image data D2 of the original document determined in S206.

ユーザーは、ポップアップ画面Ｐ１の原稿の画像データＤ２のタグ名が正しいと判断すると、ＯＫキーＫ３２にタッチ操作する。制御部３１は、タッチパネル２３を通じてＯＫキーＫ３２のタッチ操作を検出すると（Ｓ２０７「ＯＫ」）、Ｓ２０６で判別したタグ名を示す情報を原稿の画像データＤ２に付加する（Ｓ２０８）。すなわち、制御部３１は、原稿の画像データＤ２を分類する。そして、制御部３１はＳ２０６で抽出した当該原稿の画像データＤ２の特徴を用いて、当該原稿の種類を示すタグ名を判別するためのディープラーニングの学習データを更新し、この学習データを記憶部２８に記憶させる（Ｓ２０９）。 When the user determines that the tag name of the image data D2 of the document on the pop-up screen P1 is correct, he / she touches the OK key K32. When the control unit 31 detects the touch operation of the OK key K32 through the touch panel 23 (S207 “OK”), the control unit 31 adds information indicating the tag name determined in S206 to the image data D2 of the document (S208). That is, the control unit 31 classifies the image data D2 of the original. Then, the control unit 31 updates the deep learning learning data for determining the tag name indicating the type of the manuscript by using the feature of the image data D2 of the manuscript extracted in S206, and stores this learning data. It is stored in 28 (S209).

また、ユーザーは、ポップアップ画面Ｐ１の原稿の画像データＤ２のタグ名が誤っていると判断すると、ＮＧキーＫ３３にタッチ操作する。制御部３１は、タッチパネル２３を通じてＮＧキーＫ３３のタッチ操作を検出すると（Ｓ２０７「ＮＧ」）、図１１に示すようなポップアップ画面Ｐ２を分類モード設定画面Ｇ３内に表示させる。この図１１に示すポップアップ画面Ｐ２には、新たな原稿の種類を示すタグ名を入力するためのテキストボックスＴＢ３、既に設定されている各タグ名に対応付けられたそれぞれのキーＫ２１、Ｋ２２、Ｋ２３、Ｋ２４、Ｋ２５…、及びＯＫキーＫ４１が表示されている。 Further, when the user determines that the tag name of the image data D2 of the document on the pop-up screen P1 is incorrect, the user touches the NG key K33. When the control unit 31 detects the touch operation of the NG key K33 through the touch panel 23 (S207 “NG”), the control unit 31 displays the pop-up screen P2 as shown in FIG. 11 in the classification mode setting screen G3. In the pop-up screen P2 shown in FIG. 11, a text box TB3 for inputting a tag name indicating a new document type, and keys K21, K22, and K23 associated with each tag name already set are displayed. , K24, K25 ..., And OK key K41 are displayed.

ユーザーは、表示部２１に表示されている原稿の画像データＤ２を確認し、各キーＫ２１、Ｋ２２、…のいずれかが該原稿の種類を示すタグ名に対応付けられていれば、該当するキーを操作し、制御部３１は、タッチパネル２３を通じて各キーＫ２１、Ｋ２２、…のいずれかのタッチ操作を検出すると、該操作されたキーに対応付けられている原稿の種類を示すタグ名を、当該原稿の画像データに付加すべき正式なタグ名として取得する（Ｓ２１０）。一方、各キーＫ２１、Ｋ２２、…のいずれも該原稿の種類を示すタグ名に対応付けられていなければ、ユーザーは操作部２２又はタッチパネル２３を操作して、新たな原稿の種類を示すタグ名をテキストボックスＴＢ３に記入し、ＯＫキーＫ４１にタッチ操作する。制御部３１は、ＯＫキーＫ４１のタッチ操作を検出した場合は、テキストボックスＴＢ３に記入された新たな原稿の種類を示すタグ名を、当該原稿の画像に付加すべき正式なタグ名として取得する（Ｓ２１０）。制御部３１は、Ｓ２１０で取得したタグ名を示す情報を原稿の画像データＤ２に付加する。 The user confirms the image data D2 of the manuscript displayed on the display unit 21, and if any of the keys K21, K22, ... Is associated with the tag name indicating the type of the manuscript, the corresponding key. When the control unit 31 detects a touch operation of any of the keys K21, K22, ... Through the touch panel 23, the control unit 31 assigns a tag name indicating the type of the document associated with the operated key. Obtained as a formal tag name to be added to the image data of the manuscript (S210). On the other hand, if none of the keys K21, K22, ... Is associated with the tag name indicating the type of the original, the user operates the operation unit 22 or the touch panel 23 to indicate the new type of the original. Is entered in the text box TB3, and the OK key K41 is touch-operated. When the control unit 31 detects the touch operation of the OK key K41, the control unit 31 acquires the tag name indicating the new document type entered in the text box TB3 as the official tag name to be added to the image of the document. (S210). The control unit 31 adds information indicating the tag name acquired in S210 to the image data D2 of the original.

そして、制御部３１は、ここで用いた原稿の画像データＤ２の特徴を、Ｓ２１０で取得したタグ名を判別するための学習データとして収集し、この原稿の画像データＤ２の特徴を用いて、上述したようにしてディープラーニングにより学習データを更新し、更新した学習データを記憶部２８に記憶させる（Ｓ２０９）。 Then, the control unit 31 collects the characteristics of the image data D2 of the manuscript used here as learning data for discriminating the tag name acquired in S210, and uses the characteristics of the image data D2 of the manuscript to describe the above. In this way, the learning data is updated by deep learning, and the updated learning data is stored in the storage unit 28 (S209).

従って、ユーザーは、分類モードを設定し、画像読取部１１により原稿の画像を読取らせるだけで、該原稿の画像の分類を自動的に行わせることができ、該原稿の画像が正しく分類されなければ、新たな原稿の種類を示す正しいタグ名を入力して、新たな原稿の画像の特徴に基づきその入力した正しいタグ名が導出されるようにディープラーニングの学習データを更新させることができる。 Therefore, the user can automatically classify the image of the original by simply setting the classification mode and having the image reading unit 11 read the image of the original, and the image of the original is correctly classified. If not, it is possible to input the correct tag name indicating the new manuscript type and update the deep learning learning data so that the input correct tag name is derived based on the characteristics of the image of the new manuscript. ..

図１２乃至図１５は、原稿の画像データに対するタグ名を示す情報の付加の例を示している。 12 to 15 show an example of adding information indicating a tag name to the image data of the manuscript.

図１２は、表示部２１に表示された原稿の画像のプロパティＰＲ１を示している。制御部３１は、原稿の画像データに、当該原稿の画像に対するタグ名を示す情報を付加して当該原稿の画像データとともに記憶しておき、当該タグ名を、当該原稿の画像について表示するプロパティ画像としてのプロパティＰＲ１において、その表示項目の一つとして項目Ｑ１（ドキュメントの種類）に表示させる。 FIG. 12 shows the property PR1 of the image of the original displayed on the display unit 21. The control unit 31 adds information indicating a tag name for the image of the manuscript to the image data of the manuscript, stores it together with the image data of the manuscript, and displays the tag name for the image of the manuscript. In the property PR1 as, it is displayed in the item Q1 (document type) as one of the display items.

図１３は、表示部２１に表示された原稿の画像のプロパティＰＲ２を示している。制御部３１は、原稿の画像データに、当該原稿の画像に対するタグ名を示す情報を付加して当該原稿の画像データとともに記憶しておき、当該タグ名を、当該原稿の画像について表示するプロパティ画像としてのプロパティＰＲ２において、その表示項目の一つとして項目Ｑ２（タグ）に表示させる。 FIG. 13 shows the property PR2 of the image of the original displayed on the display unit 21. The control unit 31 adds information indicating a tag name for the image of the manuscript to the image data of the manuscript, stores it together with the image data of the manuscript, and displays the tag name for the image of the manuscript. In the property PR2 as, it is displayed in the item Q2 (tag) as one of the display items.

図１４は、制御部３１により表示部２１に表示されるファイルの一覧ＦＲを示している。制御部３１は、原稿の画像データに、当該原稿の画像に対するタグ名を示す情報を付加して当該原稿の画像データとともに記憶しておき、当該タグ名を、ファイルの一覧を示す画像である一覧画像ＦＲにおけるタグ名の項目Ｑ３に表示させる。 FIG. 14 shows a list FR of files displayed on the display unit 21 by the control unit 31. The control unit 31 adds information indicating a tag name for the image of the manuscript to the image data of the manuscript and stores it together with the image data of the manuscript, and stores the tag name as an image indicating a list of files. It is displayed in the item Q3 of the tag name in the image FR.

図１５は、制御部３１により表示部２１に表示される送信設定画面の一例を示す。制御部３１は、原稿の画像データに当該原稿の画像に対するタグ名を示す情報を付加して当該原稿の画像データとともに記憶しておき、原稿の画像データを画像処理装置１０に接続された例えばコンピューター内の記憶部にあるフォルダに向けて送信するときの送信設定を行う設定画面Ｇ５を表示部２１に表示させるとき、当該タグ名を、設定画面Ｇ５におけるプレビュー画像表示領域ＦＲ２におけるタグ名の項目Ｑ５に表示させる。すなわち、当該タグ名はプレビュー表示時に原稿に関する情報として表示される。 FIG. 15 shows an example of a transmission setting screen displayed on the display unit 21 by the control unit 31. The control unit 31 adds information indicating a tag name for the image of the manuscript to the image data of the manuscript and stores it together with the image data of the manuscript, and stores the image data of the manuscript together with the image data of the manuscript, for example, a computer connected to the image processing device 10. When the setting screen G5 for setting the transmission when transmitting to the folder in the storage unit is displayed on the display unit 21, the tag name is displayed in the item Q5 of the tag name in the preview image display area FR2 on the setting screen G5. To display. That is, the tag name is displayed as information about the original when the preview is displayed.

このようなプロパティやファイルの一覧に対するタグ名の表示により、実行された原稿の画像の分類が容易に確認可能となる。 By displaying the tag name for such a property or a list of files, it is possible to easily confirm the classification of the image of the executed manuscript.

尚、上記実施形態では、本発明の一実施形態を、画像処理装置（複合機）を用いて説明しているが、これは一例に過ぎず、コピー機、プリンター、ファクシミリ装置等の他の画像処理装置でもよい。 In the above embodiment, one embodiment of the present invention is described using an image processing device (multifunction device), but this is only an example, and other images such as a copier, a printer, and a facsimile machine. It may be a processing device.

また、図１乃至図１５を用いて説明した上記実施形態の構成及び処理は、本発明の一例に過ぎず、本発明を当該構成及び処理に限定する趣旨ではない。 Further, the configuration and processing of the above-described embodiment described with reference to FIGS. 1 to 15 are merely examples of the present invention, and the present invention is not intended to be limited to the configuration and processing.

１０画像処理装置
１１画像読取部
１２画像形成部
２１表示部
２２操作部
２３タッチパネル
２８記憶部
２９制御ユニット
３１制御部
９１ネットワークインターフェイス部 10 Image processing device 11 Image reading unit 12 Image forming unit 21 Display unit 22 Operation unit 23 Touch panel 28 Storage unit 29 Control unit 31 Control unit 91 Network interface unit

Claims

The image acquisition unit that acquires the image data of the manuscript and
The operation unit where settings and instructions are input by user operation,
A control unit for setting a training mode or a classification mode according to an instruction input to the operation unit is provided.
The control unit
When the training mode setting is input by operating the operation unit, the characteristics of the image of the manuscript whose image data has been acquired by the image acquisition unit are extracted, and the data indicating the characteristics of the image of the manuscript is obtained. , Collected as learning data to determine the type of the manuscript,
When the classification mode is set by the operation to the operation unit, the feature of the image of the manuscript whose image data has been acquired by the image acquisition unit is extracted, and the feature of the image of the manuscript and the learning data are used. An image processing device that discriminates the type of the manuscript based on the above and adds information indicating the discriminated type to the image data of the manuscript.

When a different document type is input by operating the operation unit after determining the type of the document in the classification mode, the control unit indicates the input type instead of the determined type. The image processing apparatus according to claim 1, wherein the information is added to the image data of the manuscript, and the learning data used for the determination is stored in association with the input different types of manuscripts.

The image processing device according to claim 1 or 2, wherein the control unit extracts the hue, contrast, number of colors, or density of the image data as a feature of the image of the original document.

With a further display
The image according to any one of claims 1 to 3, wherein the control unit associates the type of the original document determined in the classification mode with the image data of the acquired original document and displays it on the display unit. Processing equipment.

The image acquisition step to acquire the image data of the manuscript and
Operation steps that accept user operations on the operation unit,
When the training mode is set in the operation step, the characteristics of the image of the manuscript whose image data has been acquired in the image acquisition step are extracted, and the data indicating the characteristics of the image of the manuscript is used to obtain the type of the manuscript. Collected as learning data for discrimination, when the classification mode is set in the operation step, the characteristics of the image of the document for which the image data was acquired in the image acquisition step are extracted, and the image of the document is An image processing method including a control step of discriminating the type of the manuscript based on the features and the learning data and adding information indicating the discriminated type to the image data of the manuscript.