JP4783317B2

JP4783317B2 - Image forming apparatus, image processing method, program for causing computer to execute the method, and recording medium

Info

Publication number: JP4783317B2
Application number: JP2007065894A
Authority: JP
Inventors: 浩久稲本; 幸二小林
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2007-03-14
Filing date: 2007-03-14
Publication date: 2011-09-28
Anticipated expiration: 2027-03-14
Also published as: JP2008228114A

Description

この発明は、定着による破損の危険が無く、又ユーザの文書作成の自由度を損なわないページ番号情報付加手段及びページ並べ替え手段を備えた画像処理装置、画像処理方法、その方法をコンピュータに実行させるプログラム、および記録媒体に関するものである。 The present invention provides an image processing apparatus, an image processing method, and a computer-implemented image processing apparatus having page number information adding means and page rearranging means that are free from damage due to fixing and do not impair the degree of freedom of user document creation. And a recording medium.

近年のオフィスに於いては、紙文書がコピーされることはもちろんのこと、ＰＣの普及、ネットワークインフラの充実により、紙として出力された原稿をスキャニングすることにより電子化し、ネットワークを介して容易に取り回したり、他者とリソースを共有することで、知識創造に役立てたりすると言ったことが頻繁に行われつつある。
例えば、参加したプレゼンの資料をスキャンし、社外から閲覧したり、社内で共有化したりといった具合である。スキャニングは、一般的には一枚一枚の紙原稿をフラットベットスキャナにセットし、スキャン開始ボタンを押すことによって行われるが、ＡＤＦ（Auto Document Feader）を用いると、重ねた紙が自動的に読み取られるため非常に便利である。
読み取った複数枚の原稿を、ページ順に、例えばマルチページＴｉｆの様な一つのファイルにまとめておくと、一つの文書を閲覧するために複数のファイルに対して操作する必要が無くなり、更に便利である。 In recent offices, not only paper documents are copied, but also the spread of PCs and the enhancement of network infrastructure, making it easy to digitize by scanning the manuscript output as paper, and easily via the network It has been frequently said that it is useful for knowledge creation by managing and sharing resources with others.
For example, the materials of the presentations that you participated in are scanned and viewed from outside, or shared internally. Scanning is generally performed by setting each paper document on a flatbed scanner and pressing the scan start button. However, if ADF (Auto Document Feader) is used, the stacked paper is automatically It is very convenient because it is read.
Collecting a plurality of scanned originals into a single file, such as a multi-page Tif, in page order, eliminates the need to operate on multiple files in order to view a single document. is there.

一方、ＡＤＦを用いる際、稀にミスフィードが発生する事がある。その原因は様々であるが、ミスフィードが発生すると、当然の事ながらページが抜けてしまい、時には肝心の情報を記した領域が抜け落ちてしまう可能性がある。また、仮にミスフィードを察知し、後からミスフィードしたページをスキャニングしたとしても、一つのファイルにまとめるために並べてあったページの順序が狂うことになる。こうなると、スキャニングにより得られた電子文書の閲覧性は非常に低くなる。
更に言うと、紙原稿をページ順に並べてＡＤＦにセットすることは非常に面倒な作業である。これはスキャニングに限った話ではなく、勿論、コピーして出力した際にも同様の問題がある。 On the other hand, when using ADF, misfeeds may occur in rare cases. There are various reasons for this, but when a misfeed occurs, the page is naturally omitted, and sometimes the area where the important information is recorded may be lost. Even if a misfeed is detected and a page that has been misfeeded is scanned later, the order of the pages arranged in one file will be out of order. In this case, the viewability of the electronic document obtained by scanning becomes very low.
Furthermore, it is very troublesome to place paper originals in page order and set them in the ADF. This is not limited to scanning, but of course there is a similar problem when copying and outputting.

このような問題を回避するために、従来では、出力する紙にページ情報の記載されたＲＦＩＤ（Radio Frequency Identification）を付加し、ページ順に異常が見られた際には警告を発したり、どのような順番に並んでいようと、自動的に並べ替えたりする考案が成されている（特許文献１、２）。しかしながら、上記従来技術を電子写真に適用することを考えると、定着の際に大きな圧力と熱がかかり、ＲＦＩＤが破損する事が考えられる。尚、電子写真の出力方式については実施例にて後述する。
また、他の従来技術では、ページ情報を付したバーコードを原稿の特定領域に付加し、上記発明と同様に警告を出したり、ページ順を整えたりといった発明が考案されている（特許文献３）。しかしながら、この手法によると、特定領域にユーザが書き込みを行うことができなくなり、ユーザの文書作成の自由度を損なうと言った問題があった。 In order to avoid such a problem, conventionally, RFID (Radio Frequency Identification) with page information is added to the paper to be output, and if an abnormality is found in the order of pages, a warning is issued. In order to arrange them in the correct order, an idea of automatically rearranging them has been made (Patent Documents 1 and 2). However, considering the application of the above-described conventional technology to electrophotography, it is conceivable that a large pressure and heat are applied during fixing and the RFID is damaged. An electrophotographic output method will be described later in Examples.
In another conventional technique, an invention has been devised in which a bar code with page information is added to a specific area of a document to issue a warning or arrange the page order as in the above invention (Patent Document 3). ). However, according to this method, there is a problem that the user cannot write in a specific area and the user's degree of freedom in document creation is impaired.

更に、ユーザの文書作成の自由度を損なわない情報の付加手段としては例えば、電子透かしと呼ばれる技術が存在する（特許文献４）。電子透かしとは、人間の目に付かないように画像に情報を付加する手段である。例えば、人間が比較的認知しにくい高周波領域に特有のパターンを埋めこむことにより、実現されるものである。しかしながら、付加する情報量が増加すると、画質の劣化が顕著になり、画質と付加情報量とがトレードオフの関係に有ると言える。
更に言うと、一般に電子透かしはセキュリティ向上のために利用されることが多く、例えばユーザ名やコピー回数などが付加される場合が多い。結果、電子透かしが認知できない程度の情報を付加した場合、ページ番号として利用できる領域は非常に限られたものとなる場合が多い。例えば、８ｂｙｔｅの情報が付加されたとして、アルファベット６文字をユーザ名として利用したとすると、アルファベット１文字に要する情報量は１ｂｙｔｅであるため２ｂｙｔｅのみがページ番号として利用できることになる。異なる文書データに付加されるページ番号情報は、少なくとも１／６５５３５（２ｂｙｔｅは６５５３５迄の数字を表現できる）の確率で連なることになる。 Furthermore, as information adding means that does not impair the degree of freedom of user document creation, for example, there is a technique called digital watermark (Patent Document 4). The digital watermark is a means for adding information to an image so as not to be seen by human eyes. For example, it can be realized by embedding a specific pattern in a high-frequency region that is relatively difficult for humans to recognize. However, when the amount of information to be added increases, the image quality deteriorates significantly, and it can be said that the image quality and the amount of additional information are in a trade-off relationship.
Furthermore, in general, a digital watermark is often used for improving security, and for example, a user name, the number of copies, and the like are often added. As a result, when information that is not recognized by the digital watermark is added, the area that can be used as the page number is often very limited. For example, assuming that 8 bytes of information are added and 6 alphabets are used as a user name, the amount of information required for 1 alphabet is 1 byte, so only 2 bytes can be used as a page number. The page number information added to different document data is continuous with a probability of at least 1/65535 (2 bytes can represent numbers up to 65535).

更に言えば、異なるプリンタで同様のルールに従ってページ番号を付加したとすると、更にその確率は高くなる。例えばそれぞれのプリンタで１から順番にページ番号情報を付加する事を考えると、そのことは明らかである。
なお、特許文献５としては、１頁目の原稿面を読み取り、１頁目の画像データにおける白色データの割合に基づいてＣＰＵが白紙であるか否かを判断し、１頁目が白紙の場合には、表裏反対にセットされたとして、主記憶部の画像データを逆順に並べ替えて、記憶部に保存するかあるいはプリンタ部に印字出力をさせ、１頁目が白紙でない場合には、原稿が片面のみ画像を有するか、両面に画像を有することを判断し、片面原稿であれば正しくセットされたとして保存または印刷させ、両面原稿であれば、１頁目を表示部５に表示させて使用者に確認させ、正しい場合には保存または印刷させ、違う場合には並べ替えて保存または印刷させる技術が開示されている。
特開２００６−２５４０５８公報特開２００６−１３０７８４公報特開２００５−１５３３２６公報特許第３５７０２１３号特開２００１−２５１４８３公報黄瀬浩一,柳田修,“白領域の細線化を用いた文書画像の領域分割”,電子情報通信学会論文誌,No.6, pp.1608-1616, 1997 C. Cortes and V.N. Vapnik, “Support vector Networks,” Machine Learning, vol.20, pp.273-297, 1995 More specifically, if page numbers are added according to similar rules in different printers, the probability is even higher. For example, when page number information is added in order from 1 in each printer, this is clear.
In Patent Document 5, when the original surface of the first page is read, the CPU determines whether or not the page is blank based on the ratio of white data in the image data of the first page, and the first page is blank. If the first page is not reversed, the image data in the main storage unit is rearranged in the reverse order and stored in the storage unit or printed out to the printer unit. Is determined to have an image only on one side or on both sides, and if it is a single-sided original, it is stored or printed as correctly set, and if it is a double-sided original, the first page is displayed on the display unit 5. A technique is disclosed in which a user is allowed to confirm and save or print if correct, or rearrange and save or print if different.
JP 2006-254058 A JP 2006-130784 A JP 2005-153326 A Japanese Patent No. 3570213 JP 2001-251383 A Koichi Kise and Osamu Yanagida, “Area Image Segmentation Using Thinning of White Areas”, IEICE Transactions, No. 6, pp. 1608-1616, 1997 C. Cortes and VN Vapnik, “Support vector Networks,” Machine Learning, vol.20, pp.273-297, 1995

すなわち、上記従来技術には、以下のような問題点があった。
まず、ミスフィードが発生すると、当然の事ながらページが抜けてしまい、時には肝心の情報を記した領域が抜け落ちてしまう問題があった。
また、ページ情報の記載されたＲＦＩＤを付加し、ページ順に異常が見られた際には警告を発したり、どのような順番に並んでいようと、自動的に並べ替えたりする従来技術を電子写真に適用することを考えると、定着の際に大きな圧力と熱がかかり、ＲＦＩＤが破損する可能性があった。
また、ページ情報を付したバーコードを原稿の特定領域に付加し、警告を出したり、ページ順を整えたりする手法によると、特定領域にユーザが書き込みを行うことができなくなり、ユーザの文書作成の自由度を損なうと言った問題があった。
また、人間が比較的認知しにくい高周波領域に特有のパターンである電子透かしを埋めこむ方法では、付加する情報量が増加すると、画質の劣化が顕著になり、画質と付加情報量とがトレードオフの関係に有る問題があった。 That is, the prior art has the following problems.
First of all, when a misfeed occurs, there is a problem that a page is omitted as a matter of course, and sometimes an area in which important information is recorded is lost.
In addition, the conventional technology that adds RFID with page information and issues a warning when an abnormality is seen in the page order, or automatically rearranges whatever order they are in order. Considering application to photography, there is a possibility that RFID is damaged due to large pressure and heat applied during fixing.
In addition, by adding a bar code with page information to a specific area of the document and issuing a warning or arranging the page order, the user cannot write to the specific area, creating a user's document. There was a problem that said it would impair the degree of freedom.
In addition, in the method of embedding a digital watermark, which is a pattern peculiar to the high-frequency region that is relatively difficult for humans to recognize, as the amount of information to be added increases, the image quality deteriorates significantly, and the image quality and the amount of additional information trade off. There was a problem with the relationship.

本発明は、上記従来の問題点を鑑みてなされたものであり、その目的は、定着による破損の危険が無く、又ユーザの文書作成の自由度を損なわないページ番号情報付加手段及びページ並べ替え手段を備えた画像形成装置、画像処理方法、その方法をコンピュータに実行させるプログラム、および記録媒体を提供することである。
本発明の他の目的は、画質の劣化を最小限に抑えつつ、異なるプリンタから得られた出力画像であっても正確にページを並べ替えることの可能なページ並べ替え手段を備えた画像形成装置、画像処理方法、その方法をコンピュータに実行させるプログラム、および記録媒体を提供することである。 The present invention has been made in view of the above-described conventional problems, and an object of the present invention is to provide page number information adding means and page rearrangement that do not cause damage due to fixing and do not impair the degree of freedom of user document creation. An image forming apparatus provided with a means, an image processing method, a program for causing a computer to execute the method, and a recording medium are provided.
Another object of the present invention is to provide an image forming apparatus provided with page rearrangement means capable of rearranging pages accurately even with output images obtained from different printers while minimizing degradation of image quality. An image processing method, a program for causing a computer to execute the method, and a recording medium are provided.

上述の目的を達成するために、請求項１記載の発明は、複数ページの原稿を読み取って画像を形成する画像形成装置であって、前記原稿を読み取って画像データを生成する画像読取手段と、前記画像読取手段で読み取った原稿に電子透かし方式で少なくとも前記原稿のページ番号を有する情報を埋め込む情報埋め込み手段と、前記原稿に埋めこまれた情報を抽出する情報抽出手段と、前記情報抽出手段が読み取った情報から前記原稿のページ番号を取得するページ番号取得手段と、前記画像読取手段で読み取った複数ページの画像データを一つのグループとして記憶する記憶手段と、前記ページ番号取得手段が取得したページ番号と既に前記記憶手段に記憶されている各ページ番号とが近い数字であるかを判定するページ番号近接判定手段と、該ページ番号近接判定手段により近いページが存在すると判定された場合に前記複数ページの画像データの画像の類似度を判定する画像類似度判定手段と、該画像類似度判定手段によって判定された画像の類似度によって前記原稿のページを並び替えると共に、前記原稿のページをグループ統合するページ並び替え手段と、を有するページ順序判定手段と、を具備することを特徴とする。 In order to achieve the above-described object, the invention according to claim 1 is an image forming apparatus that reads a plurality of pages of a document to form an image, and that reads the document and generates image data; Information embedding means for embedding information having at least the page number of the original document in a digital watermark format in the original read by the image reading means; information extracting means for extracting information embedded in the original; and the information extracting means Page number acquisition means for acquiring the page number of the document from the read information, storage means for storing image data of a plurality of pages read by the image reading means as one group, and the page acquired by the page number acquisition means Page number proximity determining means for determining whether the number and each page number already stored in the storage means are close numbers; When the page number proximity determining means determines that there is a close page, the image similarity determining means for determining the image similarity of the image data of the plurality of pages, and the image determined by the image similarity determining means The apparatus further comprises a page order determining unit that includes a page rearranging unit that rearranges the pages of the original according to the similarity and group-integrates the pages of the original .

また、請求項２記載の発明は、前記ページ順序判定手段が、前記原稿のページの並べ替え、あるいは前記原稿のページのグループの統合を行う際に、ユーザ確認をさせる確認手段を具備することを特徴とする。
また、請求項３記載の発明は、前記画像類似度判定手段は、少なくとも前記原稿のページの文書タイプを判定する文書タイプ識別手段を有することを特徴とする。
また、請求項４記載の発明は、前記ページ番号取得手段が取得したページ番号が０であった場合に、前記画像読取手段で読み取った複数ページからなる画像データの先頭のページ番号をランダムに決定する埋込情報生成手段を有することを特徴とする。 Also, an invention according to claim 2, wherein the page order determination unit, rearrange the pages of the document, or when performing integration of a group of pages of the document, be provided with a confirmation means for user confirmation It is characterized by.
According to a third aspect of the present invention, the image similarity determination unit includes a document type identification unit that determines at least a document type of a page of the document.
According to a fourth aspect of the present invention, when the page number acquired by the page number acquisition unit is 0, the top page number of image data composed of a plurality of pages read by the image reading unit is randomly determined. And embedded information generating means .

また、請求項５記載の発明は、複数ページの原稿を読み取って画像を形成する画像形成装置における画像処理方法であって、画像読取手段により前記原稿を読み取って画像データを生成するステップと、情報埋め込み手段により前記画像読取手段で読み取った原稿に電子透かし方式で少なくとも前記原稿のページ番号を有する情報を埋め込むステップと、情報抽出手段により前記原稿に埋めこまれた情報を抽出するステップと、ページ番号取得手段により前記情報抽出手段が読み取った情報から前記原稿のページ番号を取得するステップと、記憶手段により前記画像読取手段で読み取った複数ページの画像データを一つのグループとして記憶するステップと、ページ番号近接判定手段により前記ページ番号取得手段が取得したページ番号と既に前記記憶手段に記憶されている各ページ番号とが近い数字であるかを判定するステップと、前記ページ番号近接判定手段により近いページが存在すると判定された場合に、画像類似度判定手段により前記複数ページの画像データの画像の類似度を判定するステップと、ページ並び替え手段により、前記画像類似度判定手段によって判定された画像の類似度によって前記原稿のページを並び替えると共に、前記原稿のページをグループ統合するステップと、を具備することを特徴とする。 Also, fifth aspect of the present invention, the steps of generating an image processing method in an image forming apparatus for forming an image by reading an original of a plurality of pages, the image data by reading the original document by the image reading means, A step of embedding information having at least the page number of the manuscript in an electronic watermark method in the manuscript read by the image reading unit by the information embedding unit; a step of extracting information embedded in the manuscript by the information extracting unit; A step of acquiring a page number of the original from information read by the information extraction unit by a number acquisition unit, a step of storing a plurality of pages of image data read by the image reading unit by a storage unit as one group, and a page The page number acquired by the page number acquisition means by the number proximity determination means and the existing number A step of determining whether each page number stored in the storage means is a close number; and when the page number proximity determination means determines that there is a close page, the image similarity determination means determines the plurality of the page numbers. The step of determining the image similarity of the image data of the page, and the page rearrangement unit rearrange the pages of the original according to the image similarity determined by the image similarity determination unit, and the page of the original And a step of group integration .

また、請求項６記載の発明は、請求項５に記載の画像処理方法をコンピュータに実行させるプログラムを特徴とする。
また、請求項７記載の発明は、請求項５に記載の画像処理方法を達成するコンピュータプログラムが記載された情報記録媒体を特徴とする。 According to a sixth aspect of the present invention, there is provided a program for causing a computer to execute the image processing method according to the fifth aspect.
According to a seventh aspect of the invention, there is provided an information recording medium on which a computer program for achieving the image processing method according to the fifth aspect is described .

本発明によれば、ページ番号情報を画像中に埋込み、読み取り時に抽出し、原稿のページ順序を判定し、その判定されたページ順序に基づいて原稿のページを並び替えることを可能としたため、ミスフィードなどが発生した場合に、ユーザに適切な処理を提示できユーザの利便性が向上する。
また、本発明によれば、ページ番号情報を画像中に埋込み、読み取り時に抽出、抽出した情報に応じてスキャンした画像をページ順に一つのファイルにまとめて記憶されることにより、ユーザのファイル取り扱いの利便性が向上する。
また、本発明によれば、画像類似度判定手段を具備することにより、入力された画像と記憶領域に存在する画像データとにページ情報において連続性があったとしても、異なるファイルを一つのファイルにまとめる事が無くなりユーザの利便性が向上する。 According to the present invention, it is possible to embed page number information in an image, extract it at the time of reading, determine the page order of the document, and rearrange the pages of the document based on the determined page order. When a feed or the like occurs, an appropriate process can be presented to the user, and the convenience for the user is improved.
Further, according to the present invention, the page number information is embedded in the image, and the images scanned and extracted according to the extracted information at the time of reading are collectively stored in one file in the order of the pages, so that the user can handle the file. Convenience is improved.
In addition, according to the present invention, by providing the image similarity determination means, even if there is continuity in the page information between the input image and the image data existing in the storage area, different files are stored in one file. The convenience for the user is improved.

また、本発明によれば、ページの並べ替えを自動的に行うのではなく、ユーザの確認を伴ってのみ行うことで、ユーザにとってはより細やかな操作が可能となり、利便性が向上する。
また、本発明によれば、本来異なっているはずである、入力された画像と記憶領域に含まれる画像との類似度判定を文書タイプによって行うことで、明確に入力された画像が記憶領域に含まれる一連の画像ファイルの一部であるかを判定する際の精度が向上する。
また、本発明によれば、ページ情報の開始ページがランダムに決められることにより、異なるプリンタによってページ情報が埋めこまれたとしても、それらが重複する可能性が低くなりユーザの利便性が向上する。
また、本発明によれば、ページ番号近接判定手段により近いページが存在しない場合に類似度判定を行わないことにより、処理速度が向上しユーザの利便性が向上する。 Further, according to the present invention, the page is not rearranged automatically but only with the confirmation of the user, so that a finer operation is possible for the user and the convenience is improved.
In addition, according to the present invention, by performing similarity determination between an input image and an image included in the storage area, which should be different from each other, according to the document type, the clearly input image is stored in the storage area. The accuracy in determining whether it is part of a series of included image files is improved.
In addition, according to the present invention, since the start page of the page information is randomly determined, even if the page information is embedded by different printers, the possibility that they are duplicated is reduced, and the convenience for the user is improved. .
In addition, according to the present invention, the processing speed is improved and the convenience for the user is improved by not performing the similarity determination when there is no page closer to the page number proximity determination means.

以下に添付の図を参照してこの発明の実施形態を詳細に説明する。
［実施例］
本発明の好適な実施例として電子写真方式のＭＦＰ（Multi Function Printer：画像形成装置）でのコピーを例に挙げて説明する。
図１は、本発明による画像形成装置の一実施形態の概略構成ブロック図である。
図１に示すように、この画像形成装置は、原稿を読み取るスキャナ部１と、スキャナ部１及びネットワーク３を介して取得した画像データを保持する記憶装置５と、スキャナ部１から取り込まれた画像やネットワーク３を介して取得した画像データを処理する画像処理部７と、スキャナ部１から取り込まれた画像やネットワーク３を介して取得した画像データを直接、若しくは記憶装置５に保持された画像データを出力するプリンタ部９とを有している。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.
[Example]
As a preferred embodiment of the present invention, a description will be given by taking copying in an electrophotographic MFP (Multi Function Printer: image forming apparatus) as an example.
FIG. 1 is a schematic block diagram of an image forming apparatus according to an embodiment of the present invention.
As shown in FIG. 1, the image forming apparatus includes a scanner unit 1 that reads a document, a storage device 5 that holds image data acquired via the scanner unit 1 and the network 3, and an image captured from the scanner unit 1. And image processing unit 7 for processing image data acquired via network 3 and image data acquired from scanner unit 1 and image data acquired via network 3 either directly or stored in storage device 5 And a printer unit 9 for outputting.

この画像形成装置においては、スキャナ部１により得られた画像データを全て一度記憶装置５に保持してからプリンタ部９によって出力するものとする。
まず、プリンタ部９の概要について述べる。図２は、図１に示したプリンタ部９の概略構成図である。
ここでは、ＰＣのアプリケーションにより作成した画像は、ドライバによってビットマップデータ（以下画像データ）に変換される。更に画像データはプリンタが出力できるように中間調処理を施される。中間調処理後の画像データはさらに、以下に記載の所定の処理を施される。 In this image forming apparatus, all the image data obtained by the scanner unit 1 is once held in the storage device 5 and then output by the printer unit 9.
First, an outline of the printer unit 9 will be described. FIG. 2 is a schematic configuration diagram of the printer unit 9 shown in FIG.
Here, an image created by a PC application is converted into bitmap data (hereinafter referred to as image data) by a driver. Further, the image data is subjected to halftone processing so that the printer can output it. The image data after halftone processing is further subjected to the predetermined processing described below.

図２において、光源１１は、シアンの画像データに応じて光を発し、感光体１３では光電効果により静電気が生じ、静電気を帯びた感光体１３に対し、リボルバー１５を回してシアンの現像ユニット１７から電荷を帯びたトナーを供給し像を作る。同様にＭＹＫの像を作ることにより、感光体１３上にはＣＭＹＫの像が作られる。
最後に感光体１３に搬送ベルト１９によって搬送されてくる静電気を帯びた紙を密着させることにより紙の上に像が作成される。紙の上に作成された像はあくまでも紙の上に乗っているだけに過ぎないため、高温、高圧の負荷をかけることにより定着され、出力画像が得られる。ここで、かかる負荷の大きさはかなりのものなので、ページ情報の記載されたＲＦＩＤ（Radio Frequency Identification）を付加した場合、紙に負荷されたＲＦＩＤが破壊されてしまう可能性がある。 In FIG. 2, the light source 11 emits light according to cyan image data, and static electricity is generated by the photoelectric effect on the photosensitive member 13, and the cyan developing unit 17 is rotated by rotating the revolver 15 against the photosensitive member 13 charged with static electricity. The toner is charged from the toner to form an image. Similarly, by forming an MYK image, a CMYK image is formed on the photoreceptor 13.
Finally, an electrostatically charged paper transported by the transport belt 19 is brought into close contact with the photosensitive member 13 to form an image on the paper. Since the image created on the paper is merely on the paper, it is fixed by applying a high temperature and high pressure load, and an output image is obtained. Here, since the magnitude of such a load is considerable, when RFID (Radio Frequency Identification) in which page information is described is added, there is a possibility that the RFID loaded on the paper is destroyed.

次に、図３のフローチャートおよび図４を参照して、図１に示した画像形成装置の動作について説明する。図３は、図１に示した画像形成装置の動作フローチャートである。また、図４は、図１に示した画像処理部７の構成ブロック図である。
まず、図３のステップ１０１において、ＭＦＰ操作に際して、ＩＤカードとパスワードによりユーザが特定され認証処理が行なわれる。ここで、ユーザはアスキー文字６文字の固有なユーザ名を持つものとする。そして、ユーザがＡＤＦに原稿をセットし、コピー開始のボタンを押すと、ＡＤＦによって供給された原稿がスキャナ部１によって光学的に読み取られ、画像処理部７においてＡＤ変換によりデジタル一画素あたり１ｂｙｔｅの信号へと変換される（ステップ１０３）。
次に、画像処理部７の情報抽出部２１によって、画像に埋めこまれた情報が存在するかが判定され、且つ存在していた場合、６４ｂｉｔの情報が取得される（ステップ１０５）この情報抽出部２１については後に詳述する。 Next, the operation of the image forming apparatus shown in FIG. 1 will be described with reference to the flowchart of FIG. 3 and FIG. FIG. 3 is an operation flowchart of the image forming apparatus shown in FIG. FIG. 4 is a block diagram showing the configuration of the image processing unit 7 shown in FIG.
First, in step 101 of FIG. 3, when the MFP is operated, a user is specified by an ID card and a password, and an authentication process is performed. Here, it is assumed that the user has a unique user name of 6 ASCII characters. When the user sets an original on the ADF and presses the copy start button, the original supplied by the ADF is optically read by the scanner unit 1, and the image processing unit 7 performs AD conversion to 1 byte per digital pixel. It is converted into a signal (step 103).
Next, the information extraction unit 21 of the image processing unit 7 determines whether or not the information embedded in the image exists, and if it exists, 64-bit information is acquired (step 105). The part 21 will be described in detail later.

次に、図５に示すように、画像処理部７の埋込情報取得部２３のユーザ名取得部２３ａにより、前記６４ｂｉｔの画像のユーザ名にあたる上位４８ｂｉｔの情報が読み出され、アスキーコード表に従って４８ｂｉｔの情報がアスキー文字６字に変換される（ステップ１０７）。もし、６４ｂｉｔの情報が存在しない場合、ｎｕｌｌ文字である００を６文字分出力する。図５は、図４に示した埋込情報取得部２３の構成ブロック図である。
一方、図５に示すように、埋込情報取得部２３のページ番号取得部２３ｂは、前記６４ｂｉｔのうちページ番号にあたる下位１６ｂｉｔを読み出し、１０進数に変換し、出力する。もし６４ｂｉｔの情報が存在しない場合、０を出力する（ステップ１０７）。 Next, as shown in FIG. 5, the user name acquisition unit 23a of the embedding information acquisition unit 23 of the image processing unit 7 reads the upper 48 bits of information corresponding to the user name of the 64-bit image, and follows the ASCII code table. The 48-bit information is converted into 6 ASCII characters (step 107). If there is no 64-bit information, the null character 00 is output for 6 characters. FIG. 5 is a block diagram showing the configuration of the embedded information acquisition unit 23 shown in FIG.
On the other hand, as shown in FIG. 5, the page number acquisition unit 23b of the embedded information acquisition unit 23 reads the lower 16 bits corresponding to the page number out of the 64 bits, converts it into a decimal number, and outputs it. If there is no 64-bit information, 0 is output (step 107).

次に、画像処理部７の埋込情報生成部２５では読み取った画像に対し、埋めこむべき情報を生成する（ステップ１０９）。ここで埋めこまれる情報は、埋込情報取得部２３により取得されるのと同様のユーザ名とページ番号である。なお、埋込情報生成部２５に関しては、後述する。
埋込情報は、スキャナ部１により得られた画像データと対にして記憶装置５にて記憶される（ステップ１１３）。ここで、画像データは、ページ順に１つにまとめて記憶される。同時に埋込情報生成部２５により得られたページ番号はページ順序判定部２７へと送られ、ページ順序判定部２７では埋込情報生成部２５により得られたページ番号に基づいてページの出力順序を判定し、このページ出力順序判定に基づいて、必要とされる場合、並び替え、もしくはグループ統合して記憶させる（ステップ１１１）。ページ順序判定部２７については後述する。
ページ順序判定部２７により得られたページ順序に従って、記憶装置５より画像データ及びそれに対応する埋込情報が読み出され、埋込部２９で、この読み出された画像データに埋込情報がユーザの目に認識できない様に埋めこまれ、これにより得られた画像データがプリンタ部９へ送られ、画像情報と共にプリンタ部９より出力される（ステップ１１３、１１５、１１７）。なお、埋込２９に関しては後述する。
なお、図４に示すように、この画像処理部７は、情報抽出部２１と、埋込情報取得部２３と、埋込情報生成部（ページ番号取得手段）２５と、ページ順序判定部２７と、埋込部２９とを有している。
以上の様に、本発明のポイントは、まず、埋込にＲＦＩＤやバーコードを用いずに電子透かしを用いることによって、上述の様な並べ替えを達成していることにある。 Next, the embedding information generation unit 25 of the image processing unit 7 generates information to be embedded in the read image (step 109). The information embedded here is the same user name and page number as those acquired by the embedded information acquisition unit 23. The embedded information generation unit 25 will be described later.
The embedded information is stored in the storage device 5 in a pair with the image data obtained by the scanner unit 1 (step 113). Here, the image data is stored together in one page order. At the same time, the page number obtained by the embedding information generation unit 25 is sent to the page order determination unit 27. The page order determination unit 27 determines the page output order based on the page number obtained by the embedding information generation unit 25. Based on this page output order determination, if necessary, rearrangement or group integration is performed (step 111). The page order determination unit 27 will be described later.
In accordance with the page order obtained by the page order determination unit 27, the image data and the embedding information corresponding thereto are read from the storage device 5, and the embedding unit 29 adds the embedding information to the read image data. The image data thus obtained is sent to the printer unit 9 and output from the printer unit 9 together with the image information (steps 113, 115 and 117). The embedding 29 will be described later.
As shown in FIG. 4, the image processing unit 7 includes an information extraction unit 21, an embedded information acquisition unit 23, an embedded information generation unit (page number acquisition unit) 25, and a page order determination unit 27. And an embedded portion 29.
As described above, the point of the present invention is that the rearrangement as described above is achieved by using digital watermarks without embedding RFIDs or barcodes.

従来技術で述べたとおり、ＲＦＩＤは電子写真方式の定着により破壊される可能性があり、また、大量に出力されることを想定すると、高コストであるというデメリットがある。またバーコードも、ユーザの文書編集の自由度を損なわせると言ったデメリットがある。一方、電子透かし方式では、原稿を加工するだけであるのでどのような印刷方式であっても利用可能であり、出力時に追加のコストもかからず、ユーザの文書編集の自由度を損なう心配もない。
更に、本発明のもう一つのポイントは、電子透かしにより埋めこまれた情報だけでなく、画像の類似度判定をページ並び替えの参考情報として用いている点にある。電子透かし方式は原稿を加工してしまうため、多くの情報量を埋めこもうとすると画質が大きく劣化してしまうことになる。画質を良好に保つため埋めこむ情報量を削減すると、当然の事ながらページ番号に割り振られる情報量も少なくなる。つまり、ページ番号として付加できる数字の桁数が小さくなる。すると、数字の大きさが限られるため、本来は異なる文書であるのに、同じ文書で有ると判定される可能性が増加する。このようなケースに於いても画像の類似度を判定し、異なる文書であることを検出できれば異なる文書を統合してしまうことを防止することができる。
以上のように、本発明を適用することにより、ユーザが複数枚の原稿のページ順序を整えずにＡＤＦにセットしたとしても、出力されるのは本来のページ順序となり、ユーザの手間を大幅に削減することが可能である。 As described in the prior art, the RFID may be destroyed by the fixing of the electrophotographic method, and there is a demerit that the cost is high when a large amount of output is assumed. The barcode also has a demerit that it impairs the user's freedom of document editing. On the other hand, since the digital watermark method only processes the document, it can be used with any printing method, and there is no additional cost at the time of output, and there is a concern that the user's freedom of document editing may be impaired. Absent.
Furthermore, another point of the present invention is that not only the information embedded by the digital watermark but also the image similarity determination is used as reference information for page rearrangement. Since the digital watermarking method processes a document, if a large amount of information is embedded, the image quality is greatly deteriorated. If the amount of information to be embedded is reduced in order to maintain good image quality, the amount of information allocated to the page number is naturally reduced. That is, the number of digits that can be added as a page number is reduced. Then, since the size of the number is limited, there is an increased possibility that the document is originally different but is determined to be the same document. Even in such a case, it is possible to prevent the integration of different documents if it is possible to determine the similarity of images and detect that the documents are different.
As described above, by applying the present invention, even when the user sets the page order of a plurality of originals in the ADF, the original page order is output, which greatly reduces the user's trouble. It is possible to reduce.

次に、画像処理部７内の各構成要素について説明する。
［埋込情報生成部２５］
次に、埋込情報生成部２５について詳述する。埋込情報生成部２５は、図６に示す様に、ユーザ名生成部２５ａと、ページ番号生成部２５ｂと、乱数生成部２５ｃとを有している。図６は、図４に示した埋込情報生成部２５の構成ブロック図である。
図６に示す様に、ユーザ名生成部２５ａは、埋込情報取得部２３により出力されたユーザ名がｎｕｌｌで無かった場合、埋込情報取得部２３より得られたユーザ名をそのまま出力する。
一方、ユーザ名生成部２５ａは、ユーザ名がｎｕｌｌであった場合、認証処理により得られたユーザ名をユーザ名として出力する。ページ番号生成部２５ｂは、埋込情報取得部２３より得られたページ番号が０でなかった場合、埋込情報取得部２３より得られたページ番号をそのまま出力する一方、得られたページ番号が０であった場合、複数枚の原稿のスキャンが始まってから終わるまでに入力された画像の入力された順番に沿ってページ番号をインクリメントさせながらページ番号を生成する。但し、初めのページ番号に関しては乱数生成部２５ｃによりの番号を用いる。従って、ページ情報の開始ページがランダムに決められることになり、異なるプリンタによってページ情報が埋めこまれたとしても、それらが重複する可能性が低くなりユーザの利便性が向上する。 Next, each component in the image processing unit 7 will be described.
[Embedding information generation unit 25]
Next, the embedded information generation unit 25 will be described in detail. As shown in FIG. 6, the embedded information generating unit 25 includes a user name generating unit 25a, a page number generating unit 25b, and a random number generating unit 25c. FIG. 6 is a block diagram showing the configuration of the embedded information generating unit 25 shown in FIG.
As shown in FIG. 6, when the user name output by the embedded information acquisition unit 23 is not null, the user name generation unit 25a outputs the user name obtained from the embedded information acquisition unit 23 as it is.
On the other hand, when the user name is null, the user name generation unit 25a outputs the user name obtained by the authentication process as the user name. When the page number obtained from the embedding information acquisition unit 23 is not 0, the page number generation unit 25b outputs the page number obtained from the embedding information acquisition unit 23 as it is, while the obtained page number is If it is 0, the page number is generated while incrementing the page number in the input order of the input images from the start to the end of scanning of a plurality of documents. However, the number from the random number generation unit 25c is used for the first page number. Therefore, the start page of the page information is determined at random, and even if the page information is embedded by different printers, the possibility that they will be duplicated is reduced and the convenience for the user is improved.

［埋込部２９］
次に、埋込部２９について詳述する。埋込部２９は、図７に示すように、ＤＣＴ部２９ａと、付加部２９ｂと、逆ＤＣＴ部２９ｃとを有している。図７は、図４に示した埋込部２９の構成ブロック図である。
図７に示すように、ＤＣＴ部２９ａは、入力画像の８×８画素毎を一ブロックとしてＤＣＴを施しＤＣＴ係数を生成する。付加部２９ｂでは、まず、ＤＣＴ係数の高周波を示す領域２×２画素分の値を０にし、更に４ブロック×４ブロックを一単位としてそれぞれに、埋込情報に従って１ビットの情報を付加する。こうすることで（２画素×２画素）×（４ブロック×４ブロック）で６４ｂｉｔの情報を付加できる。このとき４ブロック×４ブロックの繰り返し毎に、同じ情報を付加するものとする。こうすることで、出力後スキャンによって画質が劣化したり、改変されたりしても同じ情報が様々な領域で埋めこまれているので、正確に埋めこまれた情報を抽出できる可能性が高まる。最後に、逆ＤＣＴ部２９ｃは、情報の付加された画像データを逆ＤＣＴすることで、出力画像を得る。 [Embedding part 29]
Next, the embedding part 29 will be described in detail. As shown in FIG. 7, the embedding unit 29 has a DCT unit 29a, an adding unit 29b, and an inverse DCT unit 29c. FIG. 7 is a configuration block diagram of the embedding unit 29 shown in FIG.
As shown in FIG. 7, the DCT unit 29a performs DCT on every 8 × 8 pixels of the input image as one block to generate a DCT coefficient. In the adding unit 29b, first, the value of the region 2 × 2 pixels indicating the high frequency of the DCT coefficient is set to 0, and 4 bits × 4 blocks are set as one unit, and 1-bit information is added according to the embedding information. By doing so, 64-bit information can be added by (2 pixels × 2 pixels) × (4 blocks × 4 blocks). At this time, the same information is added every time 4 blocks × 4 blocks are repeated. By doing this, even if the image quality deteriorates or is altered by post-output scanning, the same information is embedded in various areas, so that the possibility that the embedded information can be extracted accurately increases. Finally, the inverse DCT unit 29c obtains an output image by performing inverse DCT on the image data to which information is added.

［埋込情報取得部２３］
次に、埋込情報取得部２３について詳述する。埋込情報取得部２３は、図８に示すように、ＤＣＴ部２３ａと、読み出し部２３ｂとを有している。図８は、図４に示した埋込情報取得部２３の構成ブロック図である。
図８に示すように、ＤＣＴ部２３ａにより入力画像データにＤＣＴを施す。読み出し部２３ｂでは、ＤＣＴ後の画像データの８×８画素を１ブロックとし、それらの高周波領域２×２画素からそれぞれ１ｂｉｔの情報を抽出する。更に、４×４ブロックから読み出された情報を統合して６４ｂｉｔの埋込情報を出力する。 [Embedding information acquisition unit 23]
Next, the embedded information acquisition unit 23 will be described in detail. As shown in FIG. 8, the embedded information acquisition unit 23 includes a DCT unit 23a and a reading unit 23b. FIG. 8 is a configuration block diagram of the embedded information acquisition unit 23 shown in FIG.
As shown in FIG. 8, DCT is performed on the input image data by the DCT unit 23a. In the reading unit 23b, 8 × 8 pixels of the image data after DCT are regarded as one block, and 1-bit information is extracted from each of the 2 × 2 pixels in the high-frequency region. Further, the information read from the 4 × 4 block is integrated to output 64-bit embedded information.

［ページ順序判定部２７］
次に、ページ順序判定部２７について詳述する。ページ順序判定部２７は、図９に示すように、記憶部２７ａと、ページ番号近接判定部２７ｂと、画像類似度判定部２７ｃと、確認部２７ｄと、ページ並べ替え部２７ｅとを有している。図９は、図４に示したページ順序判定部２７の構成ブロック図である。
図９に示すように、入力は埋込情報生成部２５で生成されたページ番号であり、入力されたページ番号は記憶部２７ａに保持される。ここで、記憶部２７ａに保持されるデータの構造は図１３に示した表のようになる。ここで、カテゴリとは本画像処理装置において判定された文書種を示す。本実施例で想定している入力画像は複数枚から成る文書が複数種混在したものであり、カテゴリはそれぞれの複数枚からなる文書を示すことになる。図１３は、記憶部２７ａに保持されるデータの構造を示す表である。なお、この記憶部２７ａは、記憶装置５に含まれるものとする。 [Page Order Determination Unit 27]
Next, the page order determination unit 27 will be described in detail. As shown in FIG. 9, the page order determination unit 27 includes a storage unit 27a, a page number proximity determination unit 27b, an image similarity determination unit 27c, a confirmation unit 27d, and a page rearrangement unit 27e. Yes. FIG. 9 is a configuration block diagram of the page order determination unit 27 shown in FIG.
As shown in FIG. 9, the input is a page number generated by the embedding information generation unit 25, and the input page number is held in the storage unit 27a. Here, the structure of data held in the storage unit 27a is as shown in the table of FIG. Here, the category indicates the document type determined by the image processing apparatus. The input image assumed in this embodiment is a mixture of a plurality of types of documents, and the category indicates each of the plurality of documents. FIG. 13 is a table showing the structure of data held in the storage unit 27a. The storage unit 27a is assumed to be included in the storage device 5.

次に、入力されたページ番号は同様にページ番号近接判定部２７ｂへと送られる。ページ番号近接判定部２７ｂは、記憶部２７ａに既に記憶されている各ページ番号と入力された番号とが近い数字であるかを判定する。どれだけの差をもって近いとするかは本実施形態では２０としたが、勿論他の数字でも構わないし、ユーザが設定できる様にしておいても構わない。ページ番号近接判定部２７ｂでページ番号が近いものが存在していると判定された場合、画像類似度判定部２７ｃでは、当該ページの画像を記憶装置５より読み出し、画像の類似度を判定する。画像類似度判定部２７ｂについては後述する。なお、ページ番号近接判定部２７ｂでページ番号が近いものが存在していないと判定された場合、画像類似度判定部２７ｃでは、画像の類似度判定は行なわれない。
そして、画像類似度判定部２７ｂにて２つのページが十分に近いと判定された場合、確認部２７ｄは、ユーザにページ順番入れ替えの旨を伝え、ユーザ確認を促して、ユーザの指示を仰ぐ。本実施形態では電子透かし及び画像の類似度判定を組み合わせることで、高精度にページ順序の重複を防止しているが、勿論、ページ順序の重複が発生する可能性も存在する。最終的な判断をユーザにおこなってもらうことで、ページ重複の発生を完全に防止することができる。一方、ユーザの操作が煩雑になってしまう可能性があるため、確認部２７ｄの有効無効はユーザが任意に設定できることが望ましい。確認部２７ｄによりユーザがページ順序の入れ替えを指示した場合、ページ並べ替え部２７ｅにて記憶部２７ａに存在するページの並べ替えを行う。 Next, the input page number is similarly sent to the page number proximity determination unit 27b. The page number proximity determination unit 27b determines whether each page number already stored in the storage unit 27a is close to the input number. In the present embodiment, the difference is set to 20 in the present embodiment, but other numbers may of course be used and the user may be able to set them. When the page number proximity determining unit 27b determines that there is a similar page number, the image similarity determining unit 27c reads the image of the page from the storage device 5 and determines the image similarity. The image similarity determination unit 27b will be described later. Note that when the page number proximity determining unit 27b determines that there is no similar page number, the image similarity determining unit 27c does not perform image similarity determination.
When the image similarity determination unit 27b determines that the two pages are sufficiently close, the confirmation unit 27d informs the user that the page order is to be changed, prompts the user confirmation, and asks for the user's instruction. In the present embodiment, overlapping of the page order is prevented with high accuracy by combining the digital watermark and the similarity determination of the image. Of course, there is also a possibility that the page order is duplicated. By having the user make a final decision, it is possible to completely prevent page duplication. On the other hand, since the user's operation may become complicated, it is desirable that the user can arbitrarily set the validity of the confirmation unit 27d. When the confirmation unit 27d instructs the user to change the page order, the page rearrangement unit 27e rearranges the pages existing in the storage unit 27a.

ページ並べ替え部２７ｅでは、前段のページ番号近接判定部２７ｂ若しくは、画像類似度判定部２７ｃ若しくは確認部２７ｄによって、既に存在しているどのカテゴリにも属さないと判定された場合、記憶領域に新しいカテゴリを作成し、そこへページ番号を登録する。また、いずれかのカテゴリに含まれると判定された場合には、ページ番号を該カテゴリに挿入した後、小さい順に並べ替える。尚、並べ替える方法としてはごく一般的な方法としてバブルソートを用いた。最後に、スキャンが終わった時点で、記憶部２７ａに存在するページ番号をカテゴリ１のページ番号が若い順、続いてカテゴリ２のページ番号が若い順といった具合に順に並べて出力とする。 In the page rearrangement unit 27e, when the previous page number proximity determination unit 27b, the image similarity determination unit 27c, or the confirmation unit 27d determines that it does not belong to any existing category, a new storage area is created. Create a category and register the page number there. If it is determined that the category is included in any of the categories, the page numbers are rearranged in ascending order after being inserted into the category. It should be noted that bubble sorting was used as a very general method. Finally, when the scan is completed, the page numbers existing in the storage unit 27a are output in order, such as the page number of category 1 in ascending order and the order of page number in category 2 in ascending order.

［画像類似度判定部２７ｃ］
次に、画像類似度判定部２７ｃについて詳述する。画像類似度判定部２７ｃは、図１０に示すように、領域分割部３０ａと、特徴量算出部３０ｂと、オブジェクト種判定部３０ｃと、帳票判定部３０ｄと、スライド判定部３０ｅと、文書判定部３０ｆと、総合判定部３０ｇと、記憶部３０ｈと、類似度判定部３０ｉとを有している。図１０は、図９に示した画像類似度判定部２７ｃの構成ブロック図である。
図１０に示すように、比較される２枚の画像、以降第一の画像及び第二の画像が順に画像類似度判定部２７ｃに入力されると、まず、第一の画像が領域分割部３０ａに入力され、領域分割３０ａは、入力された画像を文字領域、絵柄領域といったオブジェクト毎に分割する。尚具体的な領域の分割方法については、非特許文献１記載の方法を用いる。
その概要はまず二値化により白領域を背景として抽出する。ここで二値化は濃度１２７以上を白とする単純な二値化を用いた。次に、白領域を細線化することでチェーンが連なった様な画像を得る。更に数式１を満たす不要なチェーンを除去することで、領域の境界を得ることができる。尚、数式１においてＤとは最小距離、Ｗとは平均線幅差、ｔＤ、ｔＷはそれぞれ事前に設定しておくそれぞれの閾値となっている。これら用語の説明は非特許文献１に譲るが、本実施例ではｔＤ＝５、ｔＷ＝３０を用いた。更に、領域分割部３０ａでは上記境界を包含する矩形を算出する。
（数式１）

そして、特徴量算出部３０ｂでは及び領域分割部３０ａで得られた領域毎に、平均色、エッジ総量、中心位置からのモーメントからなる特徴量を算出し、オブジェクト種判定部３０ｃに送る。オブジェクト種判定部３０ｃは、領域分割部３０ａによって得られた矩形領域がどのようなオブジェクトであるか判定する。ここでいうオブジェクトとは、文字列、背景、その他、である。ここでは後述するサポートベクタマシンを用いて事前に学習しておき、入力画像の特徴からオブジェクト種が推定する。 [Image similarity determination unit 27c]
Next, the image similarity determination unit 27c will be described in detail. As shown in FIG. 10, the image similarity determination unit 27c includes an area division unit 30a, a feature amount calculation unit 30b, an object type determination unit 30c, a form determination unit 30d, a slide determination unit 30e, and a document determination unit. 30f, a comprehensive determination unit 30g, a storage unit 30h, and a similarity determination unit 30i. FIG. 10 is a block diagram showing the configuration of the image similarity determination unit 27c shown in FIG.
As shown in FIG. 10, when two images to be compared, and subsequently the first image and the second image are sequentially input to the image similarity determination unit 27c, first, the first image is the region dividing unit 30a. The area division 30a divides the input image into objects such as a character area and a picture area. As a specific region dividing method, the method described in Non-Patent Document 1 is used.
In outline, first, a white region is extracted as a background by binarization. Here, binarization used simple binarization in which the density is 127 or higher. Next, an image like a chain is obtained by thinning the white area. Further, by removing unnecessary chains that satisfy Equation 1, the boundary of the region can be obtained. In Equation 1, D is a minimum distance, W is an average line width difference, and tD and tW are thresholds set in advance, respectively. The explanation of these terms is left to Non-Patent Document 1, but in this example, tD = 5 and tW = 30 were used. Further, the area dividing unit 30a calculates a rectangle including the boundary.
(Formula 1)

Then, the feature amount calculation unit 30b calculates a feature amount including the average color, the total amount of edges, and the moment from the center position for each region obtained by the region division unit 30a, and sends the feature amount to the object type determination unit 30c. The object type determination unit 30c determines what kind of object the rectangular area obtained by the area division unit 30a is. An object here is a character string, a background, and others. Here, learning is performed in advance using a support vector machine described later, and the object type is estimated from the characteristics of the input image.

次に、帳票判定部３０ｄ、スライド判定部３０ｅ、文書判定部３０ｆは、それぞれ、オブジェクト種判定部３０ｃにより得られたオブジェクト種の構成割合（３次元）を特徴量として、帳票らしさ、スライドらしさ、文書らしさ、を事前にサポートベクタマシンにより学習した識別器によって出力する。サポートベクタマシンは後述する様に、一般的に二値的な判定を行う識別器であるが、入力特徴量が識別面からどれだけ離れて居るかを算出することが可能であり、これを以て、帳票らしさ、スライドらしさ、文書らしさとする。
以上の帳票らしさ、スライドらしさ、文書らしさの大小、すなわち文書タイプを総合判定部３０ｇにより判定し、最も尤もらしいものをもって文書のタイプとする。第一の画像が入力された場合は、前記文書タイプは記憶部３０ｈにより、記憶され、一方第二の画像が入力された場合は、記憶部３０ｈより読み出された第一の文書タイプと第二の画像の文書タイプが一致するかどうかを類似度判定部３０ｉにより判定する。なお、この記憶部３０ｈは、記憶装置５に含まれるものとする。 Next, each of the form determination unit 30d, the slide determination unit 30e, and the document determination unit 30f uses the composition ratio (three-dimensional) of the object type obtained by the object type determination unit 30c as a feature amount, The document quality is output by a classifier previously learned by a support vector machine. As will be described later, the support vector machine is generally a discriminator that performs a binary determination, but it is possible to calculate how far the input feature amount is from the discriminant plane, and with this, It is a form, a slide, and a document.
The above-described form-likeness, slide-likeness, and document-likeness, that is, the document type, is determined by the overall determination unit 30g, and the most likely one is used as the document type. When the first image is input, the document type is stored by the storage unit 30h, and when the second image is input, the first document type and the first document type read from the storage unit 30h are stored. The similarity determination unit 30i determines whether the document types of the two images match. Note that the storage unit 30 h is included in the storage device 5.

次に、サポートベクタマシンの概要について説明する。詳細は非特許文献２に記載されているが、その概要は以下の様になる。図１１は、サポートベクタマシン（ＳＶＭ）の動作の概要を示した説明図である。
まず、前提条件として、図１１の左に示すように○×で表される二つの種類のベクトル群が存在するものとする。ＳＶＭはこれら二つのクラスを最適に分離するための超平面（図の右参照）を決定するためのアルゴリズムといえる。ＳＶＭにおいて、二つのベクトル群を最適に分割するとは、未知のベクトルが入力されたときの対応能力、つまり汎化能力を最大とするということと等価である。これを実現するために、二つのベクトル群の境界位置に存在するベクトル（Support Vector）を見つけて、Support Vectorと超平面との距離が最大となるように超平面を設定する。ここで、実動作上、間違いの教師データも存在するため、間違いの許容量（ソフトマージン）を設定するパラメータを決める必要がある。以上は、線形ＳＶＭについての説明であるが、実際の教師データは線形判別できるベクトル群とは限らない。しかし、特徴量ベクトルを、より高次の空間に射影し（カーネルトリック）、その空間上で超平面を求めることで、非線形な問題にも対応できる。 Next, an outline of the support vector machine will be described. The details are described in Non-Patent Document 2, but the outline is as follows. FIG. 11 is an explanatory diagram showing an outline of the operation of the support vector machine (SVM).
First, as a precondition, it is assumed that there are two types of vector groups represented by ○ × as shown on the left side of FIG. SVM can be said to be an algorithm for determining a hyperplane (see the right in the figure) for optimally separating these two classes. In SVM, optimally dividing two vector groups is equivalent to maximizing the ability to respond when an unknown vector is input, that is, the generalization ability. In order to realize this, a vector (Support Vector) existing at the boundary position between two vector groups is found, and the hyperplane is set so that the distance between the Support Vector and the hyperplane is maximized. Here, since there is erroneous teacher data in actual operation, it is necessary to determine a parameter for setting an allowable error amount (soft margin). The above is a description of the linear SVM, but actual teacher data is not necessarily a vector group that can be linearly discriminated. However, by projecting the feature vector onto a higher-order space (kernel trick) and obtaining a hyperplane in that space, it is possible to cope with nonlinear problems.

以上を実現するには、結果的に、教師データ（ｘｉ，ｙｉ）を用いて数式２の条件下で数式３を最大化するラグランジェ乗数ベクトルαｉを求めることになる。その後ラグランジェ乗数ベクトルの要素のうち０でない要素に対応する教師データ群Ｓ（これがサポートベクタとなる）と、そのうちの任意の一つの教師データ（ｘ０，ｙ０）を用いて超平面のパラメータω、ｈを求める（数式４、５）。
（数式２）

（数式３）

（数式４）

（数式５）

In order to realize the above, as a result, a Lagrange multiplier vector αi that maximizes Expression 3 under the condition of Expression 2 is obtained using the teacher data (xi, yi). Then, using the teacher data group S corresponding to the non-zero element of the Lagrange multiplier vector elements (this is the support vector) and any one of the teacher data (x0, y0), the hyperplane parameter ω, h is obtained (Equations 4 and 5).
(Formula 2)

(Formula 3)

(Formula 4)

(Formula 5)

尚、上記数式に於いてＫ（ｘ，ｙ）はカーネルトリックを実現するためのカーネル関数を示す。カーネル関数には様々なものが考案されているが、今回の実施例ではRadial Basis Function（ＲＢＦ）を用いた。ＲＢＦは数式６で示される関数でＣは任意の数字である。
（数式６）

以上の様に、埋めこまれたページ番号情報のみでなく、類似画像判定を付加することで、異なる文書のページが重複していた場合でも、正しくページの並べ替えができる可能性が高くなる。更に本実施例で述べた様に、文書タイプを識別し、これにより類似画像判定を行う構成としたが、これは以下の理由による。例えばパタンマッチングなどを用いて画像の類似度判定を行う場合を考えると、得られる類似度は近さを示す量になる。つまり、これが実際に類似しているか否かを判定するためには一種の閾値を用いねば成らない。更に、本実施例では一致する二つの文書の類似度を判定するのではなく、ページの異なる二つの文書の類似度を判定しようとするため、閾値の設定は難しくなる。結果実用的にはユーザが閾値を設定し、試行錯誤の上で決定しなければ成らない。一方、文書タイプ識別は、二値的な判断であり、似ているか似ていないかは文書のタイプが一致するか一致しないかという二値的な判断に成るため、上記の様な問題は発生しない。 In the above formula, K (x, y) represents a kernel function for realizing a kernel trick. Various kernel functions have been devised. In this embodiment, a radial basis function (RBF) is used. RBF is a function expressed by Equation 6, and C is an arbitrary number.
(Formula 6)

As described above, by adding not only the embedded page number information but also similar image determination, there is a high possibility that the pages can be correctly rearranged even when pages of different documents are duplicated. Further, as described in the present embodiment, the document type is identified and the similar image determination is performed based on the document type. This is for the following reason. For example, considering the case where image similarity determination is performed using pattern matching or the like, the obtained similarity is an amount indicating closeness. In other words, in order to determine whether or not they are actually similar, a kind of threshold value must be used. Furthermore, in this embodiment, since the similarity between two matching documents is not determined, but the similarity between two documents with different pages is determined, it is difficult to set a threshold value. As a result, in practice, the user must set a threshold value and determine it through trial and error. On the other hand, the document type identification is a binary decision, and the above-mentioned problems occur because it is a binary decision whether the document type matches or does not match. do not do.

次に、図１に示した画像形成装置の一実施形態を用いてスキャンを行う場合について説明する。図１２は、図１に示した画像形成装置の一実施形態を用いたスキャン動作のフローチャートである。
なお、ここに記憶される、スキャン後の画像データは、必ずしもページ毎に管理される訳でなく、マルチページＴｉｆｆであったりＰＤＦといった複数のページを一つのファイルにまとめて管理されているものとする。
図１２のステップ２０１において、原稿がスキャナ部１により光学的に読み取られた後、画像処理部７においてＡＤ変換されて１画素一色あたり８ｂｉｔのＲＧＢ画像データへ変換され、画像データに埋めこまれた６４ｂｉｔの埋込情報が抽出される（ステップ２０３）。次に、抽出された情報からユーザ名を表す情報と、ページ番号を表す情報が抽出され（ステップ２０５）、抽出されたページ番号に基づいて、入力された原稿が既に記憶装置５に記憶された画像データの一部であるかを判定する（ステップ２０７）。
そして、ページ順序判定で入力された原稿が既に記憶装置５に記憶された画像データの一部で有ると判定された場合、既に記憶装置５に存在する該画像データに適切に入力された原稿画像を統合して記憶装置５に記憶する（ステップ２０９、２１１）。 Next, a case where scanning is performed using the embodiment of the image forming apparatus illustrated in FIG. 1 will be described. FIG. 12 is a flowchart of a scanning operation using the embodiment of the image forming apparatus shown in FIG.
It should be noted that the scanned image data stored here is not necessarily managed for each page, but a plurality of pages such as multi-page Tiff or PDF are managed in one file. To do.
In step 201 of FIG. 12, after the original is optically read by the scanner unit 1, it is AD-converted by the image processing unit 7 and converted into RGB image data of 8 bits per pixel and embedded in the image data. 64-bit embedded information is extracted (step 203). Next, information representing the user name and information representing the page number are extracted from the extracted information (step 205), and the input document has already been stored in the storage device 5 based on the extracted page number. It is determined whether it is a part of the image data (step 207).
When it is determined that the document input in the page order determination is a part of the image data already stored in the storage device 5, the document image appropriately input to the image data already existing in the storage device 5 Are integrated and stored in the storage device 5 (steps 209 and 211).

以上の様な構成とすることで、ユーザはスキャン時の様々な作業から解放される。例えば、前述した様に、混在した複数の文書を整理して並べ替えなくとも、記憶手段に保持されるファイルは文書毎に、しかも正しいページ順で並ぶことになる。更に、ＡＤＦにセットした文書のページが一枚だけ抜けていた場合でも、後からその一枚をスキャンすることで自動的に正しい文書に正しいページ順で挿入してくれる。 With the above configuration, the user is freed from various operations during scanning. For example, as described above, even if a plurality of mixed documents are not arranged and rearranged, the files held in the storage means are arranged for each document and in the correct page order. Furthermore, even if only one page of the document set in the ADF is missing, the page is automatically inserted into the correct document in the correct page order by scanning that page later.

本発明による画像形成装置の一実施形態の概略構成ブロック図である。1 is a schematic block diagram of an image forming apparatus according to an embodiment of the present invention. 図１に示したプリンタ部９の概略構成図である。It is a schematic block diagram of the printer part 9 shown in FIG. 図１に示した画像形成装置の動作フローチャートである。3 is an operation flowchart of the image forming apparatus illustrated in FIG. 1. 図１に示した画像処理部７の構成ブロック図である。FIG. 2 is a block diagram illustrating a configuration of an image processing unit 7 illustrated in FIG. 1. 図４に示した埋込情報取得部２３の構成ブロック図である。FIG. 5 is a configuration block diagram of an embedded information acquisition unit 23 illustrated in FIG. 4. 図４に示した埋込情報生成部２５の構成ブロック図である。FIG. 5 is a configuration block diagram of an embedded information generation unit 25 shown in FIG. 4. 図４に示した埋込部２９の構成ブロック図である。FIG. 5 is a configuration block diagram of an embedding unit 29 shown in FIG. 4. 図４に示した埋込情報取得部２３の構成ブロック図である。FIG. 5 is a configuration block diagram of an embedded information acquisition unit 23 illustrated in FIG. 4. 図４に示したページ順序判定部２７の構成ブロック図である。FIG. 5 is a configuration block diagram of a page order determination unit 27 shown in FIG. 4. 図９に示した画像類似度判定部２７ｃの構成ブロック図である。FIG. 10 is a configuration block diagram of an image similarity determination unit 27c shown in FIG. 9. サポートベクタマシン（ＳＶＭ）の動作の概要を示した説明図である。It is explanatory drawing which showed the outline | summary of operation | movement of a support vector machine (SVM). 図１に示した画像形成装置の一実施形態を用いたスキャン動作のフローチャートである。3 is a flowchart of a scanning operation using the embodiment of the image forming apparatus illustrated in FIG. 1. 記憶部２７ａに保持されるデータの構造を表で示す図である。It is a figure which shows the structure of the data hold | maintained at the memory | storage part 27a with a table | surface.

Explanation of symbols

１…スキャナ部、３…ネットワーク、５…記憶装置、７…画像処理部、９…プリンタ部、１１…光源、１３…感光体、１５…リボルバー、１７…現像ユニット、１９…搬送ベルト、２１…情報抽出部、２３…埋込情報取得部、２３ａ…ユーザ名取得部、２３ｂ…ページ番号取得部、２５…埋込情報生成部、２５ａ…ユーザ名生成部、２５ｂ…ページ番号生成部、２５ｃ…乱数生成部、２７…ページ順序判定部、２７ａ…記憶部、２７ｂ…ページ番号近接判定部、２７ｃ…画像類似度判定部、２７ｄ…確認部、２９…埋込部、２９ａ…ＤＣＴ部、２９ｂ…付加部、２９ｃ…逆ＤＣＴ部、３０ａ…領域分割部、３０ｂ…特徴量算出部、３０ｃ…オブジェクト種判定部、３０ｄ…帳票判定部、３０ｅ…スライド判定部、３０ｆ…文書判定部、３０ｇ…総合判定部、３０ｈ…記憶部、３０ｉ…類似度判定部 DESCRIPTION OF SYMBOLS 1 ... Scanner part, 3 ... Network, 5 ... Memory | storage device, 7 ... Image processing part, 9 ... Printer part, 11 ... Light source, 13 ... Photoconductor, 15 ... Revolver, 17 ... Developing unit, 19 ... Conveyor belt, 21 ... Information extraction unit, 23 ... Embedded information acquisition unit, 23a ... User name acquisition unit, 23b ... Page number acquisition unit, 25 ... Embedded information generation unit, 25a ... User name generation unit, 25b ... Page number generation unit, 25c ... Random number generation unit, 27... Page order determination unit, 27a... Storage unit, 27b... Page number proximity determination unit, 27c... Image similarity determination unit, 27d ... confirmation unit, 29 ... embedding unit, 29a ... DCT unit, 29b. Addition unit, 29c ... inverse DCT unit, 30a ... area division unit, 30b ... feature amount calculation unit, 30c ... object type determination unit, 30d ... form determination unit, 30e ... slide determination unit, 30f ... document determination unit, 30g Comprehensive determination unit, 30h ... storage unit, 30i ... similarity determination unit

Claims

An image forming apparatus that reads a plurality of pages of an original to form an image,
Image reading means for reading the original and generating image data;
An information embedding means for embedding information having at least the page number of the original in a digital watermark method in the original read by the image reading means;
Information extracting means for extracting information embedded in the manuscript;
Page number acquisition means for acquiring the page number of the document from the information read by the information extraction means;
Storage means for storing a plurality of pages of image data read by the image reading means as one group;
A page number proximity determining unit that determines whether the page number acquired by the page number acquiring unit and each page number already stored in the storage unit are close to each other, and a page closer to the page number proximity determining unit An image similarity determination unit that determines the image similarity of the image data of the plurality of pages when it is determined to exist, and the pages of the document are rearranged according to the image similarity determined by the image similarity determination unit. An image forming apparatus comprising: a page ordering unit having a page rearranging unit for grouping the pages of the document .

The image forming apparatus according to claim 1 , wherein the page order determination unit includes a confirmation unit that confirms a user when rearranging the pages of the document or integrating the group of document pages. apparatus.

The image similarity determining means image forming apparatus according to claim 1 or 2, characterized in that it has a document type identifying means for determining a document type of page of at least the document.

An embedding information generating means for randomly determining the first page number of image data composed of a plurality of pages read by the image reading means when the page number acquired by the page number acquiring means is 0 ; The image forming apparatus according to claim 1 , wherein the image forming apparatus is an image forming apparatus.

An image processing method in an image forming apparatus for reading a plurality of pages of an original to form an image,
Reading the document by an image reading unit to generate image data;
Embedding information having at least the page number of the original in an electronic watermark method in the original read by the image reading means by the information embedding means;
Extracting information embedded in the original by information extraction means;
Acquiring a page number of the document from information read by the information extraction unit by a page number acquisition unit;
Storing a plurality of pages of image data read by the image reading means by a storage means as one group;
Determining whether the page number acquired by the page number acquisition means by the page number proximity determination means and each page number already stored in the storage means are close numbers;
A step of determining image similarity of the image data of the plurality of pages by an image similarity determination unit when the page number proximity determination unit determines that there is a close page;
And a step of rearranging the pages of the manuscript according to the degree of similarity of the image determined by the image similarity determining unit, and integrating the pages of the manuscript into a group. Processing method.

A program for causing a computer to execute the image processing method according to claim 5 .

An information recording medium on which a computer program for achieving the image processing method according to claim 5 is recorded.