JP4192906B2

JP4192906B2 - Watermark information detection apparatus and watermark information detection method

Info

Publication number: JP4192906B2
Application number: JP2005065778A
Authority: JP
Inventors: 蔵人前野
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2005-03-09
Filing date: 2005-03-09
Publication date: 2008-12-10
Anticipated expiration: 2025-03-09
Also published as: JP2006253918A

Description

本発明は，印刷された機密情報入り文書又は機密情報入り文書画像から機密情報を検出する技術に関するものである。 The present invention relates to a technique for detecting confidential information from a printed document including confidential information or a document image including confidential information.

画像や文書データなどにコピー・偽造防止のための情報や機密情報（又は，秘密情報）を人の目には見えない形で埋め込む「電子透かし」は，保存やデータの受け渡しがすべて電子媒体上で行われることを前提としており，透かしによって埋め込まれている情報（透かし情報）の劣化や消失がないため確実に情報検出を行うことができる。 “Digital watermark”, which embeds information to prevent copying / counterfeiting or confidential information (or confidential information) in an image or document data in a form that is invisible to the human eye, is all stored and passed on electronic media. Therefore, the information embedded in the watermark (watermark information) is not deteriorated or lost, so that the information can be reliably detected.

上記電子媒体上と同様に，紙媒体に印刷された文書に対しても，文書が不正に改ざんされたりコピーされることを防ぐために，文字以外の視覚的に目障りではない形式でかつ容易に改ざんが不可能であるような機密情報を印刷文書に埋め込む方法によって可能である。 As with the above electronic media, documents printed on paper media can be easily tampered with in a form that is not visually unsightly other than text to prevent unauthorized tampering and copying. This is possible by a method of embedding confidential information in a printed document.

印刷物として最も広く利用される白黒の二値の文書に対して埋め込まれた機密情報を検出する方法としては，以下（１）〜（７）のような方法が知られている（例えば，特許文献１参照）。
（１）透かしの埋め込まれた印刷文書をスキャンし，文書画像を得る。
（２）透かし情報として文書画像全体に埋め込まれた各信号ユニットに反応する各フィルタからの出力値を得るフィルタリング処理を行う。
（３）各信号ユニット間の間隔で強く反応したフィルタのピーク値を探索する。探索は，隣接する信号ユニット位置から埋め込み時に使用した信号ユニットの間隔だけ離れた箇所を中心に行う。上記探索をフィルタリング処理結果全体に対して行う。
（４）各フィルタから出力されるピーク値の大小を比較することで信号ユニットを特定し，信号ユニットと対応付けられた情報（例えば，“０”又は“１”のビット値など。）を抽出する。
（５）抽出された情報から，情報の埋め込まれている有効範囲を特定する。
（６）抽出された情報を印刷文書に埋め込む際のフォーマットで情報を結合し，全体の透かし情報を取得する。
（７）全体の透かし情報から多数決演算などを行うことで誤った情報を訂正し，正しい情報を取得することによって機密情報（ビット列）を検出することができる。 The following methods (1) to (7) are known as methods for detecting confidential information embedded in black and white binary documents that are most widely used as printed materials (for example, Patent Documents). 1).
(1) A printed document with a watermark embedded is scanned to obtain a document image.
(2) A filtering process is performed to obtain an output value from each filter that reacts to each signal unit embedded in the entire document image as watermark information.
(3) The peak value of the filter that reacted strongly at the interval between each signal unit is searched. The search is performed centering on a location separated from the adjacent signal unit position by the interval of the signal unit used at the time of embedding. The search is performed on the entire filtering process result.
(4) A signal unit is identified by comparing the magnitudes of peak values output from the filters, and information associated with the signal unit (for example, a bit value of “0” or “1”) is extracted. To do.
(5) The effective range in which the information is embedded is specified from the extracted information.
(6) The information is combined in the format used when the extracted information is embedded in the print document, and the entire watermark information is acquired.
(7) It is possible to detect the confidential information (bit string) by correcting the erroneous information by performing a majority operation from the entire watermark information and acquiring the correct information.

ここで，図１を参照しながら，上記（２）各信号ユニットに反応するフィルタリング処理について説明する。図１は，フィルタリング処理の概略の一例を示す説明図である。 Here, with reference to FIG. 1, (2) the filtering process that reacts to each signal unit will be described. FIG. 1 is an explanatory diagram illustrating an example of an outline of filtering processing.

図１に示すように，印刷文書をスキャンすることで得られた文書画像１７には例えばインクなどによる汚れが存在する。その汚れの一部を拡大したのが文書画像１７ａである。 As shown in FIG. 1, a document image 17 obtained by scanning a printed document has a stain due to, for example, ink. The document image 17a is an enlarged part of the dirt.

上記文書画像１７ａは，複数のブロック１８から構成されている。なお，１つのブロック１８が１画素を示している。 The document image 17 a is composed of a plurality of blocks 18. One block 18 represents one pixel.

次に，上記文書画像１７ａに対してフィルタリング処理する場合，ガボールフィルタＡとガボールフィルタＢが用いられるが，上記ガボールフィルタＡ，Ｂは，ともに１２×１２画素から構成されているが，かかる例に限定されない。 Next, when filtering the document image 17a, the Gabor filter A and the Gabor filter B are used. The Gabor filters A and B are both composed of 12 × 12 pixels. It is not limited.

ガボールフィルタＡ，Ｂによって文書画像１７ａに対してフィルタリング処理が行われると，各ガボールフィルタで反応した出力値を得ることで，文書画像１７ｂと，文書画像１７ｃが各々生成される。 When the filtering process is performed on the document image 17a by the Gabor filters A and B, the document image 17b and the document image 17c are generated by obtaining the output values that are reacted by the Gabor filters.

さらに上記文書画像１７ｂと文書画像１７ｃとが重ね合わさることで，図１に示す文書画像１７ｄが得られる。 Further, the document image 17b shown in FIG. 1 is obtained by superimposing the document image 17b and the document image 17c.

なお，図１に示すように，文書画像上に汚れなどが存在する場合，汚れの部分のガボールフィルタの反応が弱いため，信号ユニットを特定できず，誤った情報を検出してしまう可能性があった。 As shown in FIG. 1, when there is dirt on the document image, the signal unit cannot be identified and erroneous information may be detected because the reaction of the dirty Gabor filter is weak. there were.

そこで，上記問題を解決するために，文書画像全体に埋め込まれる機密情報には同じ情報を繰り返し複数回埋め込むといった冗長性を持たせ，上記（７）で多数決演算をすることによって，誤った情報を訂正することができる。 Therefore, in order to solve the above problem, the confidential information embedded in the entire document image is provided with redundancy such that the same information is repeatedly embedded several times, and the majority information is calculated in the above (7), so that incorrect information is obtained. It can be corrected.

特開２００３−１０１７６２号公報JP 2003-101762 A

しかしながら，文書画像上の劣化が激しい場合，抽出された信号ユニットの多くは信頼度が低く，これらをそのまま用いて多数決演算すると，復号するのに適切な信頼度の高い情報のみを用いることができず，図２に示すように，多数決演算し，最終的に得られる機密情報の中に誤った情報が含まれてしまう問題があった。 However, when the degradation on the document image is severe, many of the extracted signal units have low reliability, and if they are used as they are for the majority operation, only information with high reliability suitable for decoding can be used. However, as shown in FIG. 2, there is a problem that incorrect information is included in the confidential information finally obtained by majority calculation.

また，上記文書画像上に汚れ以外にも例えば，文字や図形等が重なった信号ユニットを上記（２）に示すフィルタリング処理しても，文字や図形等のエッジ成分に強く反応してしまうため，適切な反応とはいえず，多数決演算し最終的に得られる機密情報の中に誤った情報が含まれてしまう問題があった。 In addition to the dirt on the document image, for example, even if the signal processing in which a character or a figure overlaps, the filtering process shown in the above (2) reacts strongly to the edge component of the character or figure. It was not an appropriate reaction, and there was a problem that incorrect information was included in the confidential information that was finally obtained by majority calculation.

本発明は，上記問題点に鑑みてなされたものであり，本発明の目的は，機密情報への復号の精度を向上させることの可能な新規かつ改良された透かし情報検出装置及び透かし情報検出方法を提供することである。 The present invention has been made in view of the above problems, and an object of the present invention is to provide a new and improved watermark information detection apparatus and watermark information detection method capable of improving the accuracy of decryption to confidential information. Is to provide.

上記課題を解決するため，本発明の第１の観点によれば，画像から機密情報を抽出する透かし情報検出装置が提供される。上記透かし情報検出装置は，画像を入力し，入力画像を得る画像入力部と；入力画像をフィルタリングすることで，その入力画像に埋め込まれた１又は２以上のパターンを抽出し，そのパターンに対応するシンボルを判定し，決定するパターン決定部と；上記シンボルの判定の確度を上記入力画像に埋め込まれた各パターンについて算出する確度算出部と；上記算出された判定確度に基づいてパターンを取捨選択し，選択されたパターンから機密情報に復号する情報復号部とを備え，さらに情報復号部は，上記選択されたパターンに対して，確信度演算をさらに実行し，機密情報に復号することを特徴としている。 In order to solve the above problems, according to a first aspect of the present invention, there is provided a watermark information detecting apparatus for extracting confidential information from an image. The watermark information detection apparatus includes an image input unit that inputs an image and obtains the input image; and extracts one or more patterns embedded in the input image by filtering the input image and supports the pattern A pattern determining unit that determines and determines a symbol to be determined; a probability calculating unit that calculates the accuracy of determination of the symbol for each pattern embedded in the input image; and a pattern selection based on the calculated determination accuracy And an information decrypting unit for decrypting the selected pattern into confidential information, and the information decrypting unit further executes a certainty factor calculation on the selected pattern and decrypts it into the confidential information. It is said.

本発明によれば，透かし情報検出装置は，入力画像に埋め込まれた１又は２以上のパターンをフィルタリングによって取得し，対応するシンボルの判定をする。さらに透かし情報検出装置は，そのシンボルであると判定した確度を求め，当該判定確度によってパターンの取捨選択を行い，判定の確度の高いパターンのみを選択し，その選択されたパターンを用いて機密情報に復号している。かかる構成により，文字の画素や汚れなどが原因で入力画像からパターンに対応するシンボルの判定確度が低いパターンを排除してから機密情報に復号することができるため機密情報への復号の精度を向上させることができる。 According to the present invention, the watermark information detection apparatus acquires one or more patterns embedded in an input image by filtering, and determines a corresponding symbol. Further, the watermark information detection apparatus obtains the accuracy of determining that the symbol is determined, performs pattern selection based on the accuracy of the determination, selects only a pattern having a high accuracy of determination, and uses the selected pattern for confidential information. Is decrypted. This configuration improves the accuracy of decryption to confidential information because it can be decrypted into confidential information after eliminating patterns with low accuracy in determining the symbol corresponding to the pattern from the input image due to character pixels and dirt. Can be made.

上記確信度演算では，上記選択されたパターンについて多数決をとる多数決演算が行われるようにしてもよい。
また確信度演算では，上記選択されたパターンに対して，各フィルタごとにフィルタ出力値を累積し求めた複数のフィルタ出力値の累積値の大小を比較してもよく，また確信度演算では，上記選択されたパターンに対して，各フィルタごとに判定確度を累積し求めた複数の判定確度の累積値の大小を比較してもよい。
さらに，上記確信度演算は，上記入力画像に繰り返し埋め込まれたパターンのうち，該繰り返し埋め込む順番が同じパターン同士に対して多数決演算する演算であるように構成してもよい。
また，確信度演算は，上記入力画像に繰り返し埋め込まれたパターンのうち，該繰り返し埋め込む順番が同じパターン同士に対して，各フィルタごとにフィルタ出力値を累積し求めた複数のフィルタ出力値の累積値の大小を比較する演算でもよく，上記確信度演算は，入力画像に繰り返し埋め込まれたパターンのうち，該繰り返し埋め込む順番が同じパターン同士に対して，各フィルタごとに判定確度を累積し求めた複数の判定確度の累積値の大小を比較する演算であるように構成してもよい。 In the certainty factor calculation, a majority vote calculation for taking a majority vote for the selected pattern may be performed.
In the certainty calculation, the accumulated values of a plurality of filter output values obtained by accumulating the filter output values for each filter may be compared with the selected pattern. In the certainty calculation, You may compare the magnitude | size of the accumulated value of several determination accuracy calculated | required by accumulating determination accuracy for every filter with respect to the said selected pattern.
Further, the certainty factor calculation may be configured to perform a majority operation on patterns that are repeatedly embedded in the input image and that have the same repetitive embedding order.
In addition, the certainty calculation is performed by accumulating a plurality of filter output values obtained by accumulating filter output values for each filter with respect to patterns having the same repetitive embedding order among the patterns repeatedly embedded in the input image. The above certainty factor calculation is obtained by accumulating the determination accuracy for each filter for the patterns that are repeatedly embedded in the input image and having the same repetition order. You may comprise so that it may be the calculation which compares the magnitude of the cumulative value of several determination accuracy.

上記課題を解決するために，本発明の別の観点によれば，識別可能な１又は２種以上のパターンを用意し，１つのパターンに対して１つのシンボルを少なくとも与え，機密情報を符号化することで生成される１又は２以上の前記パターンを配置することによって，その機密情報が埋め込まれた画像を，入力画像として読み込む画像入力部と；入力画像からパターンを検出するため，上記１又は２種以上のパターンごとに反応する検出用フィルタを当該パターンと同じ種類だけ用意して入力画像のフィルタリングを行い，入力画像の画像領域において，全種の検出用フィルタによって得られる出力値に基づいてパターンのシンボルを判定し，決定するパターン決定部と；上記全種の検出用フィルタによって得られた各パターンごとの出力値に基づいてシンボル判定の確度を上記入力画像に埋め込まれた各パターンごとに算出する確度算出部と；上記判定確度算出部により算出された判定確度に応じて，その判定確度に対応するパターンを取捨選択し，そのうち選択されたパターンで機密情報に復号する情報復号部とを備え，さらに上記情報復号部は，上記選択されたパターンに対して，確信度演算をさらに実行し，上記機密情報に復号することを特徴としている。なお，本発明にかかるパターンは，例えば，信号ユニット及び／又は複数のパターンをユニット化したユニットパターンなどを例示できる。 In order to solve the above problems, according to another aspect of the present invention, one or more identifiable patterns are prepared, at least one symbol is given to one pattern, and confidential information is encoded. An image input unit that reads an image in which the confidential information is embedded as an input image by arranging one or two or more of the patterns generated as described above; Filter the input image by preparing the same type of detection filter that reacts to each of two or more patterns, and based on the output values obtained by all types of detection filters in the image area of the input image A pattern determining unit for determining and determining a symbol of the pattern; based on the output value for each pattern obtained by the above-described all types of detection filters; An accuracy calculation unit for calculating the accuracy of symbol determination for each pattern embedded in the input image; and selecting a pattern corresponding to the determination accuracy according to the determination accuracy calculated by the determination accuracy calculation unit. An information decrypting unit that decrypts the confidential information with the selected pattern, and the information decrypting unit further executes a certainty factor operation on the selected pattern to decrypt the confidential information. It is characterized by. Examples of the pattern according to the present invention include a signal unit and / or a unit pattern obtained by unitizing a plurality of patterns.

上記判定確度は，全種の検出用フィルタによるフィルタリングで得られた出力値のうち最大値であるように構成してもよく，また上記判定確度は，全種の検出用フィルタによるフィルタリングで得られた出力値の差分値であるように構成してもよい。 The determination accuracy may be configured to be the maximum value among the output values obtained by filtering with all types of detection filters, and the determination accuracy is obtained by filtering with all types of detection filters. The output value may be a difference value.

上記入力画像には，少なくとも機密情報の符号化により構成される１又は２以上のパターンをユニットとしたユニットパターンが複数回繰り返し埋め込まれているように構成してもよい。かかる構成により，機密情報への復号の精度を向上させることができる。 The input image may be configured such that a unit pattern having at least one or more patterns configured by encoding confidential information as a unit is repeatedly embedded several times. With this configuration, it is possible to improve the accuracy of decryption to confidential information.

上記確信度演算は，上記入力画像に繰り返し埋め込まれたユニットパターンのうち，繰り返し埋め込む順番が同じユニットパターン同士について多数決演算する演算であるようにしてもよい。または，１又は２以上のユニットパターンを復号すると得られる機密情報のビット桁数が同じ桁数のビットに相当するユニットパターン同士について，多数決演算する演算であるように構成してもよい。かかる構成により，正しいパターンに訂正した上で機密情報への復号の精度を向上させることができる。
The certainty factor calculation may be an operation for performing majority calculation for unit patterns having the same order of repeated embedding among unit patterns repeatedly embedded in the input image. Or you may comprise so that it may be a calculation which performs majority calculation about the unit patterns in which the number of bit digits of the confidential information obtained by decoding one or two or more unit patterns corresponds to the same number of bits. With this configuration, it is possible to improve the accuracy of decryption to confidential information after correcting to a correct pattern.

上記情報復号部は，判定確度の閾値を算出し，その判定確度の閾値と判定確度とを比較し，その比較した結果に応じて，パターンの取捨選択を決定するように構成してもよい。かかる構成により，多数決演算の際に用いられるパターンに誤ったパターンが存在する可能性を軽減し，より一層精度の高い機密情報への復号が可能となる。 The information decoding unit may be configured to calculate a threshold value of the determination accuracy, compare the threshold value of the determination accuracy with the determination accuracy, and determine the selection of the pattern according to the comparison result. With this configuration, it is possible to reduce the possibility that an erroneous pattern exists in the pattern used in the majority operation, and to decrypt the confidential information with higher accuracy.

上記判定確度の閾値は，情報復号部による取捨選択前までに，予め算出され，保持されているように構成してもよく，また上記判定確度の閾値は，算出された１又は２以上の判定確度の平均値であるように構成してもよい。 The determination accuracy threshold may be calculated and held in advance before selection by the information decoding unit, and the determination accuracy threshold may be one or more calculated determinations. You may comprise so that it may be an average value of accuracy.

上記透かし情報検出装置は，誤り検出部をさらに備え，上記誤り検出部は，情報復号部により復号された機密情報に対して誤り検出符号によって誤りを検出するように構成してもよい。かかる構成により，多数決演算のみの場合よりもより一層機密情報の完全性が向上し，機密情報の正確性を向上させることができる。 The watermark information detecting device further comprises an error detection unit, the error detection unit may be configured to detect an error by the error detection code to the secure information decoded by the information decoding section. With such a configuration, the integrity of confidential information can be further improved and the accuracy of confidential information can be improved as compared with the case where only the majority operation is performed.

上記誤り検出部は，情報復号部により選択されたパターンに対して誤り検出を実行し，その結果，誤りを検出した場合，上記情報復号部は，複数存在する判定確度のうちパターンの取捨選択時に用いた該判定確度とは異なる判定確度に基づいて，パターンの取捨選択を再度行い，そのうち選択されたパターンに対して，最尤判定をさらに実行し，機密情報に復号するように構成してもよい。かかる構成により，パターンに対応するシンボルの判定が印刷環境等に左右されても，適切な判定確度の閾値を変更することで，どんな環境下でも機密情報への復号の精度の向上化が図れる。 The error detection unit performs error detection on the pattern selected by the information decoding unit, and as a result, when the error is detected, the information decoding unit selects a pattern from the plurality of determination accuracy. A configuration may be adopted in which pattern selection is performed again based on a determination accuracy different from the determination accuracy used, a maximum likelihood determination is further performed on the selected pattern, and the confidential information is decrypted. Good. With this configuration, even if the determination of the symbol corresponding to the pattern depends on the printing environment or the like, the accuracy of decryption to confidential information can be improved under any environment by changing the threshold value of the appropriate determination accuracy.

上記情報復号部は，誤り検出部による誤りが検出されなくなるまで，誤り検出部による誤り検出後に最尤判定を繰り返し実行するように構成してもよい。かかる構成により，機密情報への復号の精度を最大限向上させることができる。 The information decoding unit may be configured to repeatedly execute maximum likelihood determination after error detection by the error detection unit until no error is detected by the error detection unit. With this configuration, it is possible to improve the accuracy of decryption to confidential information to the maximum extent.

上記情報復号部は，複数存在する判定確度の閾値を一度ずつ順番に用いるように構成してもよい。かかる構成により，適切な判定確度の閾値を決定することができる。 The information decoding unit may be configured to use a plurality of determination accuracy thresholds one by one in order. With this configuration, it is possible to determine an appropriate determination accuracy threshold.

上記情報復号部は，判定確度の閾値の値が異なるように動的に算出するように構成してもよい。かかる構成により，印刷環境など，どんな環境下でも適切な判定確度の閾値を決定することができる。 The information decoding unit, the threshold value of the determination accuracy may be configured to dynamically calculate differently. With such a configuration, it is possible to determine an appropriate determination accuracy threshold value under any environment such as a printing environment.

上記課題を解決するための本発明の別の観点によれば，画像から機密情報を抽出する透かし情報検出方法が提供される。上記透かし情報検出方法は，画像を入力し，入力画像を得る画像入力工程と；入力画像をフィルタリングすることで，その入力画像に埋め込まれた１又は２以上のパターンを抽出し，そのパターンに対応するシンボルを判定し，決定するパターン決定工程と；シンボルの判定の確度を上記入力画像に埋め込まれた各パターンについて算出する確度算出工程と；算出された判定確度に基づいてパターンを取捨選択し，選択されたパターンから機密情報に復号する情報復号工程とを含み，情報復号工程では，上記選択されたパターンに対して，最尤判定がさらに実行され，機密情報に復号することを特徴としている。 According to another aspect of the present invention for solving the above problem, a watermark information detection method for extracting confidential information from an image is provided. The watermark information detection method includes an image input step of inputting an image and obtaining the input image; filtering the input image to extract one or more patterns embedded in the input image and corresponding to the pattern A pattern determining step for determining and determining a symbol to be determined; a probability calculating step for calculating the accuracy of symbol determination for each pattern embedded in the input image; and selecting a pattern based on the calculated determination accuracy; And an information decrypting step for decrypting the selected pattern into confidential information. In the information decrypting step, a maximum likelihood determination is further performed on the selected pattern to decrypt it into the confidential information.

また，上記課題を解決するための本発明の別の観点によれば，識別可能な１又は２種以上のパターンを用意し，１つのパターンに対して１つのシンボルを少なくとも与え，機密情報を符号化することで生成される１又は２以上のパターンを配置することによって，その機密情報が埋め込まれた画像を，入力画像として読み込む画像入力工程と；入力画像からパターンを検出するため，パターンに反応する検出用フィルタをパターンと同じ種類だけ用意して入力画像のフィルタリングを行い，入力画像の画像領域において，全種の検出用フィルタによって得られる出力値に基づいてパターンのシンボルを判定し，決定するパターン決定工程と；全種の検出用フィルタによって得られた各パターンごとの出力値に基づいてシンボル判定の確度を上記入力画像に埋め込まれた各パターンごとに算出する確度算出工程と；判定確度算出工程で算出された判定確度に応じて，その判定確度に対応するパターンを取捨選択し，そのうち選択されたパターンで機密情報に復号する情報復号工程とを含み，情報復号工程では，上記選択されたパターンに対して，最尤判定がさらに実行され，機密情報に復号することを特徴としている。 According to another aspect of the present invention for solving the above problem, one or more identifiable patterns are prepared, at least one symbol is given to one pattern, and confidential information is encoded. An image input process that reads an image in which the confidential information is embedded as an input image by arranging one or more patterns generated by the conversion; and reacts to the pattern to detect the pattern from the input image Prepare only the same type of detection filter as the pattern to filter the input image, and determine and determine the symbol of the pattern based on the output values obtained by all types of detection filters in the image area of the input image The pattern determination step; and the accuracy of symbol determination is improved based on the output value of each pattern obtained by all types of detection filters. An accuracy calculation step for calculating each pattern embedded in the input image; according to the determination accuracy calculated in the determination accuracy calculation step, a pattern corresponding to the determination accuracy is selected, and the selected pattern is classified as confidential. An information decrypting step for decrypting the information. In the information decrypting step, a maximum likelihood determination is further performed on the selected pattern to decrypt it into confidential information.

以上説明したように，本発明によれば，機密情報に復号する精度を向上させることができる。また印刷環境などの環境下に左右されずに機密情報に復号する精度を向上させることができる。 As described above, according to the present invention, it is possible to improve the accuracy of decrypting confidential information. Further, it is possible to improve the accuracy of decrypting the confidential information without being influenced by an environment such as a printing environment.

以下，本発明の好適な実施の形態について，添付図面を参照しながら詳細に説明する。なお，以下の説明及び添付図面において，略同一の機能及び構成を有する構成要素については，同一符号を付することにより，重複説明を省略する。 DESCRIPTION OF EXEMPLARY EMBODIMENTS Hereinafter, preferred embodiments of the invention will be described in detail with reference to the accompanying drawings. In the following description and the accompanying drawings, components having substantially the same functions and configurations are denoted by the same reference numerals, and redundant description is omitted.

（第１の実施の形態）
図３は，本実施の形態にかかる透かし情報埋め込み装置及び透かし情報検出装置の構成を示す説明図である。 (First embodiment)
FIG. 3 is an explanatory diagram showing the configuration of the watermark information embedding device and the watermark information detecting device according to this embodiment.

（透かし情報埋め込み装置１０）
透かし情報埋め込み装置１０は，文書データと文書に埋め込む機密情報をもとに文書画像を構成し，紙媒体に印刷を行う装置である。透かし情報埋め込み装置１０は，図３に示したように，文書画像形成部１１と，透かし画像形成部１２と，透かし入り文書画像合成部１３と，出力デバイス１４とにより構成されている。文書データ１５は文書作成ツール等により作成されたデータである。機密情報１６は紙媒体に文字以外の形式で埋め込む情報（文字列や画像，音声データ）などである。 (Watermark information embedding device 10)
The watermark information embedding device 10 is a device that forms a document image based on document data and confidential information embedded in the document and prints it on a paper medium. As shown in FIG. 3, the watermark information embedding device 10 includes a document image forming unit 11, a watermark image forming unit 12, a watermarked document image synthesizing unit 13, and an output device 14. The document data 15 is data created by a document creation tool or the like. The confidential information 16 is information (character string, image, audio data) to be embedded in a paper medium in a format other than characters.

文書画像形成部１１では，文書データ１５を紙面に印刷した状態の画像が作成される。具体的には，文書画像中の白画素領域は何も印刷されない部分であり，黒画素領域は黒の塗料が塗布される部分である。なお，本実施の形態では，白い紙面に黒のインク（単色）で印刷を行うことを前提として説明するが，本発明はこれに限定されず，カラー（多色）で印刷を行う場合であっても，同様に本発明を適用可能である。 In the document image forming unit 11, an image in a state where the document data 15 is printed on a paper surface is created. Specifically, the white pixel region in the document image is a portion where nothing is printed, and the black pixel region is a portion where black paint is applied. In this embodiment, the description will be made on the assumption that printing is performed on white paper with black ink (single color). However, the present invention is not limited to this, and printing is performed in color (multicolor). However, the present invention can be similarly applied.

透かし画像形成部１２は，機密情報１６をディジタル化して数値に変換したものをＮ元符号化（Ｎは２以上）し，符号語の各シンボルをあらかじめ用意した信号に割り当てる。信号は任意の大きさの矩形領域中にドットを配列することにより任意の方向と波長を持つ波を表現し，波の方向や波長に対してシンボルを割り当てたものである。透かし画像は，これらの信号がある規則に従って画像上に配置されたものである。 The watermark image forming unit 12 digitizes the confidential information 16 and converts it into a numerical value, performs N-ary coding (N is 2 or more), and assigns each symbol of the code word to a signal prepared in advance. A signal represents a wave having an arbitrary direction and wavelength by arranging dots in a rectangular area of an arbitrary size, and a symbol is assigned to the direction and wavelength of the wave. A watermark image is one in which these signals are arranged on the image according to a certain rule.

透かし入り文書画像合成部１３は，文書画像と透かし画像を重ね合わせて透かし入りの文書画像（例えば，入力画像）を作成する。また，出力デバイス１４は，プリンタなどの出力装置であり，透かし入り文書画像を紙媒体に印刷する。したがって，文書画像形成部１１，透かし画像形成部１２，透かし入り文書画像合成部１３はプリンタドライバの中の一つの機能として実現されていても良い。 The watermarked document image synthesis unit 13 creates a watermarked document image (for example, an input image) by superimposing the document image and the watermark image. The output device 14 is an output device such as a printer, and prints a watermarked document image on a paper medium. Therefore, the document image forming unit 11, the watermark image forming unit 12, and the watermarked document image synthesizing unit 13 may be realized as one function in the printer driver.

印刷文書２０は，元の文書データ１５に対して機密情報１６を埋め込んで印刷されたものであり，物理的に保管・管理される。なお，本実施の形態にかかる印刷文書２０は，紙媒体などである場合を例に挙げて説明するが，かかる例に限定されず，例えば，文書画像からなる画像データそのままの場合でも実施可能である。 The print document 20 is printed with the confidential information 16 embedded in the original document data 15 and is physically stored and managed. The print document 20 according to the present embodiment will be described by taking a paper medium or the like as an example. However, the print document 20 is not limited to this example, and can be implemented, for example, even when image data including a document image is used as it is. is there.

（透かし情報検出装置３０）
透かし情報検出装置３０は，紙媒体に印刷されている文書（例えば印刷文書）を入力画像として取り込み，埋め込まれている機密情報を復元する装置である。透かし情報検出装置３０は，図３に示したように，入力デバイス（画像入力部）３１と，透かし検出部３２とにより構成されている。 (Watermark information detection device 30)
The watermark information detecting device 30 is a device that takes in a document printed on a paper medium (for example, a printed document) as an input image and restores embedded confidential information. As shown in FIG. 3, the watermark information detection apparatus 30 includes an input device (image input unit) 31 and a watermark detection unit 32.

入力デバイス３１は，スキャナなどの入力装置であり，紙に印刷された文書２０を多値階調のグレイ画像として計算機に取り込む。また，透かし検出部（パターン決定部，確度算出部，情報復号部）３２は，入力画像に対してフィルタ処理を行い，埋め込まれた信号を検出する。検出された信号からシンボルを復元し，埋め込まれた機密情報を取り出す。なお，本実施の形態にかかる透かし検出部は，例えば，本発明に係るパターン決定部，確度算出部，または情報復号部のうち少なくとも一つに該当するものとするが，かかる例に限定されない。 The input device 31 is an input device such as a scanner, and takes in the document 20 printed on paper as a multi-value gray image into a computer. Also, the watermark detection unit (pattern determination unit, accuracy calculation unit, information decoding unit) 32 performs a filtering process on the input image and detects an embedded signal. The symbol is restored from the detected signal, and the embedded confidential information is extracted. The watermark detection unit according to the present embodiment corresponds to at least one of the pattern determination unit, the accuracy calculation unit, and the information decoding unit according to the present invention, but is not limited to this example.

以上のように構成される透かし情報埋め込み装置１０及び透かし情報検出装置３０の動作について説明する。まず，図３〜図１３を参照しながら，透かし情報埋め込み装置１０の動作について説明する。 Operations of the watermark information embedding device 10 and the watermark information detection device 30 configured as described above will be described. First, the operation of the watermark information embedding device 10 will be described with reference to FIGS.

（文書画像形成部１１）
文書データ１５はフォント情報やレイアウト情報を含むデータであり，ワープロソフト等で作成されるものとする。文書画像形成部１１は，この文書データ１５を基に，文書が紙に印刷された状態の画像をページごとに作成する。この文書画像は白黒の二値画像であり，画像上で白い画素（値が１の画素）は背景であり，黒い画素（値が０の画素）は文字領域（インクが塗布される領域）であるものとする。 (Document Image Forming Unit 11)
The document data 15 is data including font information and layout information, and is created by word processing software or the like. Based on the document data 15, the document image forming unit 11 creates an image of a document printed on paper for each page. This document image is a black and white binary image. On the image, white pixels (pixels having a value of 1) are backgrounds, and black pixels (pixels having a value of 0) are character regions (regions to which ink is applied). It shall be.

（透かし画像形成部１２）
機密情報１６は文字，音声，画像などの各種データであり，透かし画像形成部ではこの情報から文書画像の背景として重ね合わせる透かし画像を作成する。 (Watermark image forming unit 12)
The confidential information 16 is various data such as characters, sounds, and images, and the watermark image forming unit creates a watermark image to be superimposed as the background of the document image from this information.

図４は，透かし画像形成部１２の処理の流れを示す流れ図である。 FIG. 4 is a flowchart showing a processing flow of the watermark image forming unit 12.

まず，機密情報１６をＮ元符号に変換する（ステップＳ１０１）。Ｎは任意であるが，本実施の形態では説明を容易にするためＮ＝２とする。従って，ステップＳ１０１で生成される符号は２元符号であり，０と１のビット列で表現されるものとする。このステップＳ１０１ではデータをそのまま符号化しても良いし，データを暗号化したものを符号化しても良い。 First, the confidential information 16 is converted into an N-element code (step S101). N is arbitrary, but in the present embodiment, N = 2 for ease of explanation. Accordingly, the code generated in step S101 is a binary code, and is expressed by a bit string of 0 and 1. In this step S101, the data may be encoded as it is, or the encrypted data may be encoded.

次いで，符号語の各シンボルに対して透かし信号を割り当てる（ステップＳ１０２）。透かし信号とはドット（黒画素）の配列（ドットパターン）によって任意の波長と方向を持つ波を表現したものである。透かし信号については，さらに後述する。 Next, a watermark signal is assigned to each symbol of the codeword (step S102). The watermark signal represents a wave having an arbitrary wavelength and direction by an arrangement (dot pattern) of dots (black pixels). The watermark signal will be further described later.

さらに，符号化されたデータのビット列に対応する信号ユニットを透かし画像上に配置する（ステップＳ１０３）。 Further, a signal unit corresponding to the bit string of the encoded data is arranged on the watermark image (step S103).

上記ステップＳ１０２において，符号語の各シンボルに対して割り当てる透かし信号について説明する。図５は透かし信号の一例を示す説明図である。 The watermark signal assigned to each symbol of the code word in step S102 will be described. FIG. 5 is an explanatory diagram showing an example of a watermark signal.

透かし信号の幅と高さをそれぞれＳｗ，Ｓｈとする。ＳｗとＳｈは異なっていても良いが，本実施の形態では説明を容易にするためＳｗ＝Ｓｈとする。長さの単位は画素数であり，図５の例ではＳｗ＝Ｓｈ＝１２である。これらの信号が紙面に印刷されたときの大きさは，透かし画像の解像度に依存しており，例えば透かし画像が６００ｄｐｉ（ｄｏｔｐｅｒｉｎｃｈ：解像度の単位であり，１インチ当たりのドット数）の画像であるとしたならば，図５の透かし信号の幅と高さは，印刷文書上で１２／６００＝０．０２（インチ）となる。 Let the width and height of the watermark signal be Sw and Sh, respectively. Sw and Sh may be different, but in the present embodiment, Sw = Sh is set for ease of explanation. The unit of length is the number of pixels. In the example of FIG. 5, Sw = Sh = 12. The size when these signals are printed on the paper surface depends on the resolution of the watermark image. For example, the watermark image is an image of 600 dpi (dot per inch: unit of resolution, number of dots per inch). 5, the width and height of the watermark signal in FIG. 5 is 12/600 = 0.02 (inch) on the printed document.

以下，幅と高さがＳｗ，Ｓｈの矩形を１つの信号の単位として「信号ユニット」と称する。図５（１）は，ドット間の距離が水平軸に対してａｒｃｔａｎ（３）（ａｒｃｔａｎはｔａｎの逆関数）の方向に密であり，波の伝搬方向はａｒｃｔａｎ（−１／３）である。以下，この信号ユニットをユニットＡと称する。図５（２）はドット間の距離が水平軸に対してａｒｃｔａｎ（−３）の方向に密であり，波の伝搬方向はａｒｃｔａｎ（１／３）である。以下，この信号ユニットをユニットＢと称する。 Hereinafter, a rectangle having a width and a height of Sw and Sh is referred to as a “signal unit” as one signal unit. In FIG. 5A, the distance between dots is dense in the direction of arctan (3) (arctan is an inverse function of tan) with respect to the horizontal axis, and the wave propagation direction is arctan (−1/3). . Hereinafter, this signal unit is referred to as unit A. In FIG. 5B, the distance between dots is dense in the direction of arctan (−3) with respect to the horizontal axis, and the wave propagation direction is arctan (1/3). Hereinafter, this signal unit is referred to as unit B.

図６は，図５（１）の画素値の変化をａｒｃｔａｎ（−１／３）の方向から見た断面図である。図６において，ドットが配列されている部分が波の最小値の腹（振幅が最大となる点）となり，ドットが配列されていない部分は波の最大値の腹となっている。 6, arctan a change in pixel value in FIG. 5 (1) - is a sectional view seen from the direction of the (1/3). In FIG. 6, the portion where the dots are arranged is the antinode of the minimum value of the wave (the point where the amplitude is maximum), and the portion where the dots are not arranged is the antinode of the maximum value of the wave.

また，ドットが密に配列されている領域はそれぞれ１ユニットの中に２つ存在するため，この例では１ユニットあたりの周波数は２となる。波の伝播方向はドットが密に配列されている方向に垂直となるため，ユニットＡの波は水平方向に対してａｒｃｔａｎ（−１／３），ユニットＢの波はａｒｃｔａｎ（１／３）となる。なお，ａｒｃｔａｎ（ａ）の方向とａｃｒｔａｎ（ｂ）の方向が垂直のとき，ａ×ｂ＝−１である。 In addition, since there are two regions in each unit where dots are densely arranged, the frequency per unit is 2 in this example. Since the wave propagation direction is perpendicular to the direction in which the dots are densely arranged, the wave of unit A is arctan (-1/3) with respect to the horizontal direction, and the wave of unit B is arctan (1/3). Become. Note that a × b = −1 when the direction of arctan (a) and the direction of actan (b) are perpendicular.

本実施の形態では，ユニットＡで表現される透かし信号にシンボル０を割り当て，ユニットＢで表現される透かし信号にシンボル１を割り当てる。また，これらをシンボルユニットと称する。 In the present embodiment, symbol 0 is assigned to the watermark signal expressed by unit A, and symbol 1 is assigned to the watermark signal expressed by unit B. These are called symbol units.

透かし信号には図５（１），（２）で示されるもの以外にも，例えば図７（３）〜（５）で示されるようなドット配列が考えられる。図７（３）は，ドット間の距離が水平軸に対してａｒｃｔａｎ（１／３）の方向に密であり，波の伝搬方向はａｒｃｔａｎ（−３）である。以下，この信号ユニットをユニットＣと称する。 In addition to the watermark signals shown in FIGS. 5 (1) and (2), for example, dot arrangements as shown in FIGS. 7 (3) to (5) are conceivable. In FIG. 7 (3), the distance between dots is dense in the direction of arctan (1/3) with respect to the horizontal axis, and the wave propagation direction is arctan (-3). Hereinafter, this signal unit is referred to as unit C.

図７（４）は，ドット間の距離が水平軸に対してａｒｃｔａｎ（−１／３）の方向に密であり，波の伝搬方向はａｒｃｔａｎ（３）である。以下，この信号ユニットをユニットＤと称する。図７（５）は，ドット間の距離が水平軸に対してａｒｃｔａｎ（１）の方向に密であり，波の伝搬方向はａｒｃｔａｎ（−１）である。なお，図７（５）は，ドット間の距離が水平軸に対してａｒｃｔａｎ（−１）の方向に密であり，波の伝搬方向はａｒｃｔａｎ（１）であると考えることもできる。以下，この信号ユニットをユニットＥと称する。 In FIG. 7 (4), the distance between dots is dense in the direction of arctan (−1/3) with respect to the horizontal axis, and the wave propagation direction is arctan (3). Hereinafter, this signal unit is referred to as unit D. In FIG. 7 (5), the distance between the dots is dense in the direction of arctan (1) with respect to the horizontal axis, and the wave propagation direction is arctan (−1). In FIG. 7 (5), it can be considered that the distance between the dots is dense in the direction of arctan (−1) with respect to the horizontal axis, and the wave propagation direction is arctan (1). Hereinafter, this signal unit is referred to as unit E.

このようにして，先に割り当てた組み合わせ以外にも，シンボル０とシンボル１を割り当てるユニットの組合わせのパターンが複数考えられるため，どの透かし信号がどのシンボルに割り当てられているかを秘密にして第三者（不正者）が埋め込まれた信号を簡単に解読できないようにすることもできる。 In this way, in addition to the combinations assigned previously, there can be a plurality of combinations of units to which symbols 0 and 1 are assigned. Therefore, it is possible to keep secret which watermark signal is assigned to which symbol. It is also possible to prevent a person (illegal person) from easily deciphering the embedded signal.

さらに，図４に示したステップＳ１０２で，機密情報を４元符号で符号化した場合には，例えば，ユニットＡに符号語のシンボル０を，ユニットＢにシンボル１を，ユニットＣにシンボル２を，ユニットＤにシンボル３を割り当てることも可能である Furthermore, when the confidential information is encoded with a quaternary code in step S102 shown in FIG. 4, for example, the code word symbol 0 is assigned to unit A, symbol 1 is assigned to unit B, and symbol 2 is assigned to unit C. , It is also possible to assign symbol 3 to unit D

図５，図７に示した透かし信号の一例においては，１ユニット中のドットの数をすべて等しくしているため，これらのユニットを隙間なく並べることにより，透かし画像の見かけの濃淡が均一となる。したがって印刷された紙面上では，単一の濃度を持つグレー画像が背景として埋め込まれているように見える。 In the example of the watermark signal shown in FIGS. 5 and 7, since the number of dots in one unit is all equal, by arranging these units without gaps, the apparent density of the watermark image becomes uniform. . Therefore, it appears that a gray image having a single density is embedded as a background on the printed paper.

このような効果を出すために，例えば，ユニットＥを背景ユニット（シンボルが割り当てられていない信号ユニット）と定義し，これを隙間なく並べて透かし画像の背景とし，シンボルユニット（ユニットＡ，ユニットＢ）を透かし画像に埋め込む場合は，埋め込もうとする位置の背景ユニット（ユニットＥ）とシンボルユニット（ユニットＡ，ユニットＢ）とを入れ替える。 In order to produce such an effect, for example, unit E is defined as a background unit (signal unit to which no symbol is assigned), and this is arranged without any gap as a background of the watermark image, and symbol unit (unit A, unit B) Is embedded in the watermark image, the background unit (unit E) and the symbol unit (unit A, unit B) at the position to be embedded are exchanged.

図８（１）はユニットＥを背景ユニットと定義し，これを隙間なく並べて透かし画像の背景とした場合を示す説明図である。図８（２）は図８（１）の背景画像の中にユニットＡを埋め込んだ一例を示し，図８（３）は図８（１）の背景画像の中にユニットＢを埋め込んだ一例を示している。本実施の形態では，背景ユニットを透かし画像の背景とする方法について説明するが，シンボルユニットのみを配置することによって透かし画像を生成しても良い。 FIG. 8A is an explanatory diagram showing a case where the unit E is defined as a background unit and arranged as a background of a watermark image without gaps. FIG. 8 (2) shows an example in which the unit A is embedded in the background image of FIG. 8 (1), and FIG. 8 (3) shows an example in which the unit B is embedded in the background image of FIG. 8 (1). Show. In the present embodiment, a method of using a background unit as the background of a watermark image will be described. However, a watermark image may be generated by arranging only symbol units.

次いで，符号語の１シンボルを透かし画像に埋め込む方法について，図９を参照しながら説明する。 Next, a method for embedding one symbol of a code word in a watermark image will be described with reference to FIG.

図９は，透かし画像へのシンボル埋め込み方法の一例を示す説明図である。ここでは，例として「０１０１」というビット列を埋め込む場合について説明する。 FIG. 9 is an explanatory diagram illustrating an example of a symbol embedding method in a watermark image. Here, a case where a bit string “0101” is embedded will be described as an example.

図９（１），（２）に示すように，同じシンボルユニットを繰り返し埋め込む。これは文書中の文字が埋め込んだシンボルユニットの上に重なった場合，信号検出時に検出されなくなることを防ぐためであり，シンボルユニットの繰り返し数と配置のパターン（以下，ユニットパターンと称する。）は任意である。 As shown in FIGS. 9A and 9B, the same symbol unit is repeatedly embedded. This is to prevent the character unit in the document from being detected when a signal is detected when it is superimposed on the embedded symbol unit. The symbol unit repetition number and arrangement pattern (hereinafter referred to as a unit pattern) are used. Is optional.

すなわち，ユニットパターンの一例として，図９（１）のように繰り返し数を４（１つのユニットパターン中に４つのシンボルユニットが存在する）にしたり，図９（２）のように繰り返し数を２（１つのユニットパターン中に２つのシンボルユニットが存在する）にしたりすることができ，あるいは，繰り返し数を１（１つのユニットパターン中には１つのシンボルユニットだけが存在する）としてもよい。 That is, as an example of the unit pattern, the repetition number is set to 4 (4 symbol units exist in one unit pattern) as shown in FIG. 9 (1), or the repetition number is set to 2 as shown in FIG. 9 (2). (There are two symbol units in one unit pattern) or the number of repetitions may be one (only one symbol unit exists in one unit pattern).

また，図９（１），（２）は１つのシンボルユニットに対して１つのシンボルが与えられているが，図９（３）のようにシンボルユニットの配置パターンに対してシンボルを与えても良い。 9 (1) and 9 (2), one symbol is given to one symbol unit. However, as shown in FIG. 9 (3), symbols may be given to the symbol unit arrangement pattern. good.

１ページ分の透かし画像の中に何ビットの情報量を埋め込むことができるかは，信号ユニットの大きさ，ユニットパターンの大きさ，文書画像の大きさに依存する。文書画像の水平方向と垂直方向にいくつの信号を埋め込んだかは，既知として信号検出を行っても良いし，入力装置から入力された画像の大きさと信号ユニットの大きさから逆算しても良い。 The number of bits of information that can be embedded in a watermark image for one page depends on the size of the signal unit, the size of the unit pattern, and the size of the document image. The number of signals embedded in the horizontal and vertical directions of the document image may be detected as a known signal, or may be calculated backward from the size of the image input from the input device and the size of the signal unit.

１ページ分の透かし画像の水平方向にＰｗ個，垂直方向にＰｈ個のユニットパターンが埋め込めるとすると，画像中の任意の位置のユニットパターンをＵ（ｘ，ｙ），ｘ＝１〜Ｐｗ，ｙ＝１〜Ｐｈと表現し，Ｕ（ｘ，ｙ）を「ユニットパターン行列」と称することにする。また，１ページに埋め込むことができるビット数を「埋め込みビット数」と称する。埋め込みビット数はＰｗ×Ｐｈである。 If Pw unit patterns in the horizontal direction and Ph units in the vertical direction can be embedded in a watermark image for one page, unit patterns at arbitrary positions in the image are represented by U (x, y), x = 1 to Pw, y = 1 to Ph, and U (x, y) is referred to as a “unit pattern matrix”. In addition, the number of bits that can be embedded in one page is referred to as the “number of embedded bits”. The number of embedded bits is Pw × Ph.

図１０は，機密情報を透かし画像に埋め込む方法について示した流れ図である。 FIG. 10 is a flowchart showing a method for embedding confidential information in a watermark image.

ここでは１枚（１ページ分）の透かし画像に，同じ情報を繰り返し埋め込む場合について説明する。同じ情報を繰り返し埋め込むことにより，透かし画像と文書画像を重ね合わせたときに１つのユニットパターン全体が塗りつぶされるなどして埋め込み情報が消失するような場合でも，埋め込んだ情報を取り出すことを可能とするためである。 Here, a case where the same information is repeatedly embedded in one (one page) watermark image will be described. By repeatedly embedding the same information, it is possible to extract the embedded information even if the embedded information disappears due to, for example, the entire unit pattern being filled when the watermark image and the document image are overlaid. Because.

まず，機密情報１６をＮ元符号に変換する（ステップＳ２０１）。図４のステップＳ１０１と同様である。以下では，符号化されたデータをデータ符号と称し，ユニットパターンの組合わせによりデータ符号を表現したものをデータ符号ユニットＤｕと称する。 First, the confidential information 16 is converted into an N-element code (step S201). This is the same as step S101 in FIG. Hereinafter, the encoded data is referred to as a data code, and the data code expressed by a combination of unit patterns is referred to as a data code unit Du.

次いで，データ符号の符号長（ここではビット数）と埋め込みビット数から，１枚の画像にデータ符号ユニットを何度繰り返し埋め込むことができるかを計算する（ステップＳ２０２）。本実施の形態ではデータ符号の符号長データをユニットパターン行列の第１行に挿入するものとする。なお，データ符号の符号長を固定長として符号長データは透かし画像には埋め込まないようにしても良い。 Next, based on the code length of the data code (here, the number of bits) and the number of embedded bits, how many times the data code unit can be embedded in one image is calculated (step S202). In the present embodiment, it is assumed that the code length data of the data code is inserted into the first row of the unit pattern matrix. The code length of the data code may be fixed and the code length data may not be embedded in the watermark image.

データ符号ユニットを埋め込む回数Ｄｎは，データ符号長をＣｎとして以下の式で計算される。 The number Dn of embedding the data code unit is calculated by the following equation with the data code length as Cn.

ここで剰余をＲｎ（Ｒｎ＝Ｐｗ×（Ｐｈ−１）−Ｄｎ×Ｃｎ）とすると，ユニットパターン行列にはＤｎ回のデータ符号ユニットおよびデータ符号の先頭Ｒｎビット分に相当するユニットパターンを埋め込むことになる。ただし，剰余部分のＲｎビットは必ずしも埋め込まなくても良い。 Here, assuming that the remainder is Rn (Rn = Pw × (Ph−1) −Dn × Cn ), the unit pattern matrix is embedded with Dn data code units and a unit pattern corresponding to the first Rn bits of the data code. become. However, the Rn bit of the remainder part does not necessarily have to be embedded.

図１１の説明では，ユニットパターン行列のサイズを９×１１（１１行９列），データ符号長を１２（図中で０〜１１の番号がついたものがデータ符号の各符号語を表わす）とする。 In the description of FIG. 11, the size of the unit pattern matrix is 9 × 11 (11 rows and 9 columns), and the data code length is 12 (numbers 0 to 11 in the figure indicate each code word of the data code). And

次いで，ユニットパターン行列の第１行目に符号長データを埋め込む（ステップＳ２０３）。図１１の例では符号長を９ビットのデータで表現して１度だけ埋め込んでいる例を説明しているが，ユニットパターン行列の幅Ｐｗが十分大きい場合，データ符号と同様に符号長データを繰り返し埋め込むこともできる。 Next, the code length data is embedded in the first row of the unit pattern matrix (step S203). In the example of FIG. 11, the code length is expressed by 9-bit data and is embedded only once. However, if the unit pattern matrix width Pw is sufficiently large, It can be embedded repeatedly.

さらに，ユニットパターン行列の第２行以降に，データ符号ユニットを繰り返し埋め込む（ステップＳ２０４）。図１１で示すようにデータ符号のＭＳＢ（ｍｏｓｔｓｉｇｎｉｆｉｃａｎｔｂｉｔ）またはＬＳＢ（ｌｅａｓｔｓｉｇｎｉｆｉｃａｎｔｂｉｔ）から順に行方向に埋め込む。図１１の例ではデータ符号ユニットを７回，およびデータ符号の先頭６ビットを埋め込んでいる例を示している。 Further, the data code unit is repeatedly embedded in the second and subsequent rows of the unit pattern matrix (step S204). As shown in FIG. 11, the MSB (most significant bit) or LSB (least significant bit) of the data code is sequentially embedded in the row direction. The example of FIG. 11 shows an example in which the data code unit is embedded seven times and the first 6 bits of the data code are embedded.

データの埋め込み方法は図１１のように行方向に連続になるように埋め込んでも良いし，列方向に連続になるように埋め込んでも良い。 The data embedding method may be embedded so as to be continuous in the row direction as shown in FIG. 11 or may be embedded so as to be continuous in the column direction.

以上，透かし画像形成部１２における，透かし画像について説明した。次いで，透かし情報埋め込み装置１０の透かし入り文書画像合成部１３について説明する。 The watermark image in the watermark image forming unit 12 has been described above. Next, the watermarked document image composition unit 13 of the watermark information embedding device 10 will be described.

（透かし入り文書画像合成部１３）
透かし入り文書画像合成部１３では，文書画像形成部１１で作成した文書画像と，透かし画像形成部で作成した透かし画像を重ね合わせる。透かし入り文書画像の各画素の値は，文書画像と透かし画像の対応する画素値の論理積演算（ＡＮＤ）によって計算する。すなわち，文書画像と透かし画像のどちらかが０（黒）であれば，透かし入り文書画像の画素値は０（黒），それ以外は１（白）となる。 (Watermarked document image composition unit 13)
The watermarked document image composition unit 13 superimposes the document image created by the document image forming unit 11 and the watermark image created by the watermark image forming unit. The value of each pixel of the watermarked document image is calculated by a logical product operation (AND) of the corresponding pixel values of the document image and the watermark image. That is, if either the document image or the watermark image is 0 (black), the pixel value of the watermarked document image is 0 (black), otherwise 1 (white).

図１２は，透かし入り文書画像の一例を示す説明図である。図１３は，図１２の一部を拡大して示した説明図である。ここで，ユニットパターンは図９（１）のパターンを用いている。透かし入り文書画像は，出力デバイス１４により出力される。 FIG. 12 is an explanatory diagram showing an example of a watermarked document image. FIG. 13 is an explanatory view showing a part of FIG. 12 in an enlarged manner. Here, the unit pattern shown in FIG. 9A is used. The watermarked document image is output by the output device 14.

以上，透かし情報埋め込み装置１０の動作について説明した。 The operation of the watermark information embedding device 10 has been described above.

次いで，図３，及び，図１４〜図２２を参照しながら，透かし情報検出装置３０の動作について説明する。 Next, the operation of the watermark information detection apparatus 30 will be described with reference to FIGS. 3 and 14 to 22.

（透かし検出部３２）
図１４は，透かし検出部３２の処理の流れを示す流れ図である。
まず，スキャナなどの入力デバイス３１によって透かし入り文書画像を計算機のメモリ等に入力する（ステップＳ３０１）。この画像を入力画像と称する。入力画像は多値画像であり，以下では２５６階調のグレイ画像として説明する。また入力画像の解像度（入力デバイス３１で読み込むときの解像度）は，上記透かし情報埋め込み装置１０で作成した透かし入り文書画像と異なっていても良いが，ここでは上記透かし情報埋め込み装置１０で作成した画像と同じ解像度であるとして説明する。また，入力画像は回転や伸縮などの補正が行われているものとする。 (Watermark detection unit 32)
FIG. 14 is a flowchart showing a process flow of the watermark detection unit 32.
First, a watermarked document image is input to a memory of a computer or the like by an input device 31 such as a scanner (step S301). This image is referred to as an input image. The input image is a multi-valued image, and will be described below as a gray image with 256 gradations. Further, the resolution of the input image (resolution when read by the input device 31) may be different from the watermarked document image created by the watermark information embedding device 10, but here the image created by the watermark information embedding device 10 is used. It is assumed that the resolution is the same. Also, it is assumed that the input image has been corrected for rotation, expansion and contraction.

次いで，入力画像の大きさと信号ユニットの大きさから，ユニットパターンがいくつ埋め込まれているかを計算する（ステップＳ３０２）。例えば入力画像の大きさがＷ（幅）×Ｈ（高さ）であるとして，信号ユニットの大きさをＳｗ×Ｓｈ，ユニットパターンはＵｗ×Ｕｈ個の信号ユニットから構成されるとすると，入力画像中に埋め込まれているユニットパターンの数（Ｎ＝Ｐｗ×Ｐｈ）は以下のように計算される。 Next, the number of unit patterns embedded is calculated from the size of the input image and the size of the signal unit (step S302). For example, assuming that the size of the input image is W (width) × H (height), the size of the signal unit is Sw × Sh, and the unit pattern is composed of Uw × Uh signal units. The number of unit patterns embedded therein (N = Pw × Ph) is calculated as follows.

ただし，透かし情報埋め込み装置１０と透かし情報検出装置３０で解像度が異なる場合には，それらの解像度の比によって入力画像中の信号ユニットの大きさを正規化した後，上記の計算を行う。 However, when the resolution is different between the watermark information embedding device 10 and the watermark information detection device 30, the above calculation is performed after normalizing the size of the signal unit in the input image based on the ratio of the resolutions.

次いで，ステップＳ３０２で計算したユニットパターン数をもとに入力画像に対してユニットパターンの区切り位置を設定する（ステップＳ３０３）。図１５は入力画像（図１５（１））と，ユニットパターンの区切り位置を設定した後の入力画像（図１５（２））の一例を示している。 Next, a unit pattern separation position is set for the input image based on the number of unit patterns calculated in step S302 (step S303). FIG. 15 shows an example of the input image (FIG. 15 (1)) and the input image (FIG. 15 (2)) after setting the unit pattern separation position.

次いで，ユニットパターンの区切りごとにシンボルユニットの検出を行い，ユニットパターン行列を復元する（ステップＳ３０４）。以下に，信号検出の詳細を説明する。 Next, a symbol unit is detected for each unit pattern delimiter, and a unit pattern matrix is restored (step S304). Details of signal detection will be described below.

図１６は，入力画像中における，図５（１）に示したユニットＡに対応する領域の一例を示した説明図である。図５では信号ユニットは二値画像であるが，ここでは多値画像である。二値画像を印刷した場合，インクのにじみなどが原因で濃淡が連続的に変化し，白と黒のグラデーションとなるため，図１６のようにドットの周囲が白と黒の中間色になる。したがって図１６を波の伝播方向と平行な方向から見た断面図は図１７のようになる。図６が矩形波であるのに対し，図１７は滑らかな波となる。 FIG. 16 is an explanatory diagram showing an example of an area corresponding to the unit A shown in FIG. 5A in the input image. In FIG. 5, the signal unit is a binary image, but here it is a multi-valued image. When a binary image is printed, the density changes continuously due to ink bleeding and the like, resulting in a white and black gradation, so that the periphery of the dot is an intermediate color between white and black as shown in FIG. Therefore, a cross-sectional view of FIG. 16 viewed from a direction parallel to the wave propagation direction is as shown in FIG. 6 is a rectangular wave, whereas FIG. 17 is a smooth wave.

また，実際には紙の厚さの局所的な変化や，印刷文書の汚れ，出力デバイスや画像入力デバイスの不安定性などの要因により，入力画像中には多くの雑音成分が付加されることになるが，ここでは雑音成分のない場合について説明する。しかしながら，ここで説明する方法を用いれば，雑音成分が付加された画像に対しても安定した信号検出を行うことができる。 Also, in reality, many noise components are added to the input image due to factors such as local changes in paper thickness, dirt on the printed document, and instability of the output device and image input device. However, the case where there is no noise component will be described here. However, if the method described here is used, stable signal detection can be performed even for an image to which a noise component is added.

以下では入力画像から信号ユニットを検出するために，波の周波数と方向，および影響範囲を同時に定義できる二次元ウェーブレットフィルタ（検出用フィルタ）を用いる。以下では，二次元ウェーブレットフィルタの一つであるガボールフィルタを用いる例を示すが，ガボールフィルタと同様な性質を持つフィルタであれば，必ずしもガボールフィルタである必要はなく，さらには信号ユニットと同じドットパターンを持つテンプレートを定義してパターンマッチングを行うなどの方法でも良い。 In the following, in order to detect the signal unit from the input image, a two-dimensional wavelet filter (detection filter) capable of simultaneously defining the frequency and direction of the wave and the influence range is used. In the following, an example using a Gabor filter, which is one of the two-dimensional wavelet filters, is shown. However, if it is a filter having the same properties as the Gabor filter, it is not necessarily a Gabor filter, and moreover, the same dot as the signal unit. A method of defining a template having a pattern and performing pattern matching may be used.

以下にガボールフィルタＧ（ｘ，ｙ），ｘ＝０〜ｇｗ−１，ｙ＝０〜ｇｈ−１を示す。ｇｗ，ｇｈはフィルタのサイズであり，ここでは上記透かし情報埋め込み装置１０で埋め込んだ信号ユニットと同じ大きさである。 The Gabor filter G (x, y), x = 0 to gw−1, and y = 0 to gh−1 are shown below. gw and gh are filter sizes, which are the same size as the signal unit embedded by the watermark information embedding apparatus 10 here.

信号検出には透かし画像に埋め込んだ信号ユニットと周波数，波の方向，および大きさが等しいガボールフィルタを，埋め込んだ信号ユニットの種類と同じ数だけ用意する。ここでは図５のユニットＡとユニットＢに対応するガボールフィルタをフィルタＡ，フィルタＢと称する。 For signal detection, the same number of Gabor filters as the types of embedded signal units are prepared with the same frequency, wave direction, and size as the signal units embedded in the watermark image. Here, the Gabor filters corresponding to the units A and B in FIG.

入力画像中の任意の位置でのフィルタ出力値はフィルタと画像間のコンボリューションにより計算する。ガボールフィルタの場合は実数フィルタと虚数フィルタ（虚数フィルタは実数フィルタと半波長分位相がずれたフィルタ）が存在するため，それらの２乗平均値をフィルタ出力値とする。例えば，フィルタＡの実数フィルタと画像間のコンボリューションがＲｃ，虚数フィルタとのコンボリューションがＩｃであったとすると，出力値Ｆ（Ａ）は以下の式で計算する。 The filter output value at an arbitrary position in the input image is calculated by convolution between the filter and the image. In the case of a Gabor filter, there are a real number filter and an imaginary number filter (an imaginary number filter is a filter whose phase is shifted by a half wavelength from the real number filter), and the mean square value thereof is used as a filter output value. For example, if the convolution between the real filter and the image of the filter A is Rc and the convolution between the imaginary filter is Ic, the output value F (A) is calculated by the following equation.

図１８は，ステップＳ３０３によって区切られたユニットパターンＵ（ｘ，ｙ）中に埋め込まれているシンボルユニットがユニットＡであるかユニットＢであるかを判定する方法について説明する説明図である。 FIG. 18 is an explanatory diagram for explaining a method for determining whether the symbol unit embedded in the unit pattern U (x, y) delimited in step S303 is the unit A or the unit B.

ユニットパターンＵ（ｘ，ｙ）に対するシンボル判定ステップを以下のように行う。
（１）フィルタＡの位置を移動しながら，ユニットパターンＵ（ｘ，ｙ）中のすべての位置についてＦ（Ａ）を計算した結果の最大値をユニットパターンＵ（ｘ，ｙ）に対するフィルタＡの出力値とし，これをＦｕ（Ａ，ｘ，ｙ）とする。
（２）ユニットパターンＵ（ｘ，ｙ）に対するフィルタＢの出力値を（１）と同様に計算し，これをＦｕ（Ｂ，ｘ，ｙ）とする。
（３）Ｆｕ（Ａ，ｘ，ｙ）とＦｕ（Ｂ，ｘ，ｙ）を比較し，Ｆｕ（Ａ，ｘ，ｙ）≧Ｆｕ（Ｂ，ｘ，ｙ）であればユニットパターンＵ（ｘ，ｙ）に埋め込まれているシンボルユニットはユニットＡであると判定し，Ｆｕ（Ａ，ｘ，ｙ）＜Ｆｕ（Ｂ，ｘ，ｙ）であればユニットパターンＵ（ｘ，ｙ）に埋め込まれているシンボルユニットはユニットＢであると判定する。
（４）ユニットパターンＵ（ｘ，ｙ）に埋め込まれていると判定されたシンボルユニットの判定の確かさ（ユニット判定確度）をＰ（ｘ，ｙ）とし，フィルタＡ，Ｂの出力値の最大値，またはフィルタＡ，Ｂ出力値の差分値として与える。以下に，フィルタの出力値の最大値の場合，またはフィルタの出力値の差分値の場合のユニット判定確度Ｐ（ｘ，ｙ）を算出するための式を示す。なお，ユニット判定確度は，単に判定確度等と記載する場合もある。 The symbol determination step for the unit pattern U (x, y) is performed as follows.
(1) While moving the position of the filter A, the maximum value of the result of calculating F (A) for all the positions in the unit pattern U (x, y) is the value of the filter A for the unit pattern U (x, y). The output value is set to Fu (A, x, y).
(2) The output value of the filter B for the unit pattern U (x, y) is calculated in the same manner as (1), and this is assumed to be Fu (B, x, y).
(3) Fu (A, x, y) and Fu (B, x, y) are compared, and if Fu (A, x, y) ≧ Fu (B, x, y), unit pattern U (x, y) The symbol unit embedded in y) is determined to be unit A, and if Fu (A, x, y) <Fu (B, x, y), it is embedded in the unit pattern U (x, y). It is determined that the existing symbol unit is unit B.
(4) The symbol unit determination accuracy (unit determination accuracy) determined to be embedded in the unit pattern U (x, y ) is P (x, y), and the maximum output value of the filters A and B Value or the difference value between the output values of filters A and B. The formula for calculating the unit determination accuracy P (x, y) in the case of the maximum value of the output value of the filter or the difference value of the output value of the filter is shown below. The unit determination accuracy may be simply referred to as determination accuracy.

上記（１），（２）において，フィルタを移動するステップ幅は任意であり，ユニットパターン上の代表的な位置での出力値のみを計算してもよい。また，（３）でＦｕ（Ａ，ｘ，ｙ）とＦｕ（Ｂ，ｘ，ｙ）の差の絶対値があらかじめ定めておいた閾値以下であった場合には，“Ｐ（ｘ，ｙ）＝０”にして，判定不能としてもよい。 In the above (1) and (2), the step width for moving the filter is arbitrary, and only the output value at a representative position on the unit pattern may be calculated. If the absolute value of the difference between Fu (A, x, y) and Fu (B, x, y) is less than or equal to a predetermined threshold value in (3), “P (x, y) = 0 ”, and the determination may be impossible.

以上，信号検出（ステップＳ３０４）の詳細について説明した。再び，図１４の流れ図に戻り，以降のステップＳ３０５について説明する。ステップＳ３０５では，ユニットパターン行列のシンボルを連結してデータ符号を再構成し，元の情報を復元する。 The details of the signal detection (step S304) have been described above. Returning to the flowchart of FIG. 14 again, the following step S305 will be described. In step S305, the symbols of the unit pattern matrix are concatenated to reconstruct the data code, and the original information is restored.

図１９は情報復元の一例を示す説明図である。情報復元のステップは以下の通りである。
（１）各ユニットパターンに埋め込まれているシンボルを検出する（図１９に示す（１））。
（２）シンボルを連結してデータ符号を復元する（図１９に示す（２））。
（３）データ符号を復号して埋め込まれた情報を取り出す（図１９に示す（３））。 FIG. 19 is an explanatory diagram showing an example of information restoration. The steps of information restoration are as follows.
(1) A symbol embedded in each unit pattern is detected ((1) shown in FIG. 19).
(2) The symbols are connected to restore the data code ((2) shown in FIG. 19).
(3) Decode the data code to extract the embedded information ((3) shown in FIG. 19).

図２０〜図２２はデータ符号の復元方法の一例を示す説明図である。復元方法は基本的に図１０の逆の処理となる。 20 to 22 are explanatory diagrams illustrating an example of a data code restoration method. The restoration method is basically the reverse process of FIG.

まず，ユニットパターン行列の第１行から符号長データ部分を取り出して，埋め込まれたデータ符号の符号長を得る（ステップＳ４０１）。 First, the code length data portion is extracted from the first row of the unit pattern matrix to obtain the code length of the embedded data code (step S401).

次いで，ユニットパターン行列のサイズとＳ４０１で得たデータ符号の符号長をもとに，データ符号ユニットを埋め込んだ回数Ｄｎおよび剰余Ｒｎを計算する（ステップＳ４０２）。 Next, based on the size of the unit pattern matrix and the code length of the data code obtained in S401, the number Dn of data code units embedded and the remainder Rn are calculated (step S402).

次いで，ユニットパターン行列の２行目以降からステップＳ２０３と逆の方法でデータ符号ユニットを取り出す（ステップＳ４０３）。図２１の例ではＵ（２，１）（２行１列）から順に１２個のユニットパターンごとに分解する。例えば，第１のユニットパターンは，Ｕ（２，１）〜Ｕ（３，３）であり，第２のユニットパターンは，Ｕ（３，４）〜Ｕ（４，６）といったような具合である。第３のユニットパターン以降も同様である。Ｄｎ＝７，Ｒｎ＝６であるため，１２個のユニットパターン（データ符号ユニット）は７回取り出され，剰余として６個（データ符号ユニットの上位６個に相当する）のユニットパターン（Ｕ（１１，４）〜Ｕ（１１，９））が取り出される。また，上記Ｕ（ｘ，ｙ）がＵ（２，１）（２行１列）から順に１２個のユニットパターンに分解されたのと同様に，既に求めた上記ユニット判定確度Ｐ（ｘ，ｙ）も，Ｐ（２，１）から順に１２個のユニットパターンごとに上記Ｕ（ｘ，ｙ）に対応するように分解する。例えば，ユニット判定確度Ｐ（ｘ，ｙ）のうち，第１のデータ符号ユニットに関連づけられるユニット判定確度は，Ｐ（２，１）〜Ｐ（３，３）であり，第２のデータ符号ユニットに関連づけられるユニット判定確度は，Ｐ（３，４）〜Ｐ（４，６）といったような具合である。 Next, data code units are extracted from the second and subsequent rows of the unit pattern matrix by the reverse method of step S203 (step S403). In the example of FIG. 21, each unit pattern is decomposed in order from U (2, 1) (2 rows and 1 column). For example, the first unit pattern is U (2, 1) to U (3, 3), and the second unit pattern is U (3, 4) to U (4, 6). is there. The same applies to the third unit pattern and thereafter. Since Dn = 7 and Rn = 6, 12 unit patterns (data code units) are extracted 7 times, and 6 unit patterns (corresponding to the upper 6 data code units) (U (11 , 4) to U (11, 9)) are taken out. Further, the unit determination accuracy P (x, y) already obtained is similar to the case where U (x, y) is decomposed into 12 unit patterns sequentially from U (2, 1) (2 rows and 1 column). ) Is also decomposed so as to correspond to the above U (x, y) every 12 unit patterns in order from P (2, 1). For example, among the unit determination accuracy P (x, y), the unit determination accuracy associated with the first data code unit is P (2,1) to P (3,3), and the second data code unit. The unit determination accuracy associated with is such as P (3, 4) to P (4, 6).

次いで，ステップＳ４０３で取り出したデータ符号ユニット，ユニット判定確度に対してビット確信度演算を行うことにより，埋め込んだデータ符号を再構成する（ステップＳ４０４）。以下，ビット確信度演算について説明する。 Next, the embedded data code is reconstructed by performing bit certainty calculation on the data code unit and unit determination accuracy extracted in step S403 (step S404). Hereinafter, the bit certainty calculation will be described.

図２２Ａのようにユニットパターン行列の２行１列目から最初に取り出されたデータ符号ユニットをＤｕ（１，１）〜Ｄｕ（１，１２）とし，順次Ｄｕ（２，１）〜Ｄｕ（２，１２），・・・，と表記する。また，剰余部分はＤｕ（８，１）〜Ｄｕ（８，６）とする。上記にて既に求められたユニット判定確度についても，図２２Ｂに示すように，２行１列から順に分解された一連のユニットパターン（Ｐ（２，１）〜Ｐ（３，３），Ｐ（３，４）〜Ｐ（４，６），…）をＰｕ（１，１）〜Ｐｕ（１，１２），Ｐｕ（２，１）〜Ｐｕ（２，１２），…とする。また，剰余部分はＰｕ（８，１）〜Ｐｕ（８，６）とする。ビット確信度演算は各データ符号ユニットの要素ごとに多数決を取るなどして，データ符号の各シンボルの値を決定する処理である。この際，ユニット判定確度に基づいて有効なデータ符号ユニットを選択的に使用することにより，文字領域との重なりや紙面の汚れなどが原因で，任意のデータ符号ユニット中の任意のユニットから正しく信号検出を行えなかった場合（ビット反転エラーなど）でも，最終的に正しくデータ符号を復元することができる。 As shown in FIG. 22A, the data code units first extracted from the second row and the first column of the unit pattern matrix are Du (1,1) to Du (1,12), and Du (2,1) to Du (2 , 12),... Further, the surplus portion is assumed to be Du (8, 1) to Du (8, 6). As shown in FIG. 22B, the unit determination accuracy already obtained in the above is also a series of unit patterns (P (2, 1) to P (3, 3), P ( (3, 4) to P (4, 6),..., Pu (1, 1) to Pu (1, 12), Pu (2, 1) to Pu (2, 12),. Further, the surplus portion is assumed to be Pu (8, 1) to Pu (8, 6). The bit certainty calculation is a process of determining the value of each symbol of the data code by taking a majority vote for each element of each data code unit. At this time, by selectively using an effective data code unit based on the unit determination accuracy, signals from any unit in any data code unit can be correctly signaled due to overlap with the character area or dirt on the paper. Even if the detection cannot be performed (such as a bit inversion error), the data code can be finally restored correctly.

ここで，図２２Ｃを参照しながら，ユニット判定確度に基づいて有効なデータ符号ユニットを選択する処理について説明すると，図２２Ｃに示すように，ユニット判定確度Ｐｕのうち，閾値Ｔ以上のものを抽出する。なお，図２２Ｃに示すように，○印が付されているＰｕ（ｘ，ｙ）は，そのＰｕ（ｘ，ｙ）のユニット判定確度Ｐｕが閾値Ｔよりも小さいことを示している。 Here, with reference to FIG. 22C, the processing for selecting an effective data code unit based on the unit determination accuracy will be described. As shown in FIG. 22C, the unit determination accuracy Pu that is equal to or greater than the threshold T is extracted. To do. As shown in FIG. 22C, Pu (x, y) marked with a circle indicates that the unit determination accuracy Pu of Pu (x, y) is smaller than the threshold T.

例えば，図２２Ｃに示すように，Ｐｕ（４，１）〜Ｐｕ（４，１２）のうち閾値Ｔよりも小さいＰｕ（ｘ，ｙ）を選択すると，Ｐｕ（４，３），Ｐｕ（４，５），Ｐｕ（４，６），Ｐｕ（４，７），Ｐｕ（４，９），Ｐｕ（４，１２）の６つである。 For example, as shown in FIG. 22C, if Pu (x, y) smaller than the threshold T is selected from Pu (4, 1) to Pu (4, 12), Pu (4, 3), Pu (4, 5), Pu (4, 6), Pu (4, 7), Pu (4, 9), Pu (4, 12).

上記閾値Ｔよりも小さいＰｕ（ｘ，ｙ）を全て選択すると，その閾値Ｔよりも小さいＰｕ（ｘ，ｙ）に１対１対応するＤｕ（ｘ，ｙ）を全て選択するとともに，排除（消去）する（図２２Ｃに示す矢印（１））。つまり，閾値Ｔよりも小さいＰｕ（ｘ，ｙ）に対応するＤｕ（ｘ，ｙ）は，信頼度が低く，そのシンボル自体が誤りである可能性が高く，そのＤｕ（ｘ，ｙ）を排除することで，多数決演算の誤り訂正の精度を向上させることができる。 When all Pu (x, y) smaller than the threshold T are selected, all Du (x, y) corresponding to Pu (x, y) smaller than the threshold T are selected and excluded (erased). (Arrow (1) shown in FIG. 22C). That is, Du (x, y) corresponding to Pu (x, y) smaller than the threshold T has low reliability, and the symbol itself is highly likely to be erroneous, and the Du (x, y) is excluded. By doing so, it is possible to improve the accuracy of error correction in the majority operation.

閾値Ｔよりも小さいＰｕ（ｘ，ｙ）に係るＤｕ（ｘ，ｙ）を排除した状態で，図２２Ｄに示すように，多数決演算を実行する。特に，例えば，配列［５］（Ｄｕ（１，６），Ｄｕ（２，６），…，Ｄｕ（８，６））については，３つの中から１又は０が多い方を多数決で選び，図２２Ｄに示す再構成されたビット列に格納する。従来に係るビット列（配列［５］）には，多数決演算しても既に誤っている値が５つ存在するため，誤りを訂正し正確な値をビット列に格納する可能性を向上させることができなかった。 In the state where Du (x, y) related to Pu (x, y) smaller than the threshold T is excluded, the majority operation is executed as shown in FIG. 22D. In particular, for example, for the array [5] (Du (1, 6), Du (2, 6),..., Du (8, 6)), the one having more 1 or 0 is selected from the three by a majority decision. Store in the reconstructed bit string shown in FIG. 22D. In the conventional bit string (array [5]), there are already five erroneous values even after the majority operation, so the possibility of correcting the error and storing the correct value in the bit string can be improved. There wasn't.

多数決演算についてより具体的に説明すると，例えばデータ符号の１ビット目は，Ｄｕ（１，１），Ｄｕ（２，１），Ｄｕ（３，１），…，Ｄｕ（８，１）の信号検出結果のうちユニット判定確度Ｐｕ（Ｐｕ（１，１），Ｐｕ（２，１），Ｐｕ（３，１），…，Ｐｕ（８，１））が閾値Ｔ以上の信号検出結果について，多数決演算し，“１”である方が多い場合には１と判定し，“０”である方が多い場合には０と判定する。なお，上記ユニット判定確度Ｐｕの閾値Ｔは，具体的には以下の式で示すことができる。 More specifically, the majority operation will be described. For example, the first bit of the data code is a signal of Du (1, 1), Du (2, 1), Du (3, 1), ..., Du (8, 1). Of the detection results, the majority of the signal detection results with unit determination accuracy Pu (Pu (1,1), Pu (2,1), Pu (3,1),..., Pu (8,1)) equal to or greater than the threshold value T are determined. When there are more “1” s in the calculation, it is determined as 1 and when there are more “0” s, it is determined as 0. The threshold T of the unit determination accuracy Pu can be specifically expressed by the following equation.

上記と同様に，データ符号の２ビット目は，Ｄｕ（１，２），Ｄｕ（２，２），Ｄｕ（３，２），…，Ｄｕ（８，２）の信号検出結果と，ユニット判定確度Ｐｕ（Ｐｕ（１，２），Ｐｕ（２，２），Ｐｕ（３，２），…，Ｐｕ（８，２））が閾値Ｔ以上の信号検出結果とに基づいて判定し，以降同様に，１２ビット目まで判定していく。 Similarly to the above, the second bit of the data code includes the signal detection results of Du (1, 2), Du (2, 2), Du (3, 2),. The accuracy Pu (Pu (1, 2), Pu (2, 2), Pu (3, 2),..., Pu (8, 2)) is determined based on the signal detection result equal to or greater than the threshold T, and so on. Then, the determination is made up to the 12th bit.

ビット確信度演算は，図１８の信号検出フィルタの出力値を加算することによっても行うこともできる。つまり各フィルタごとにフィルタ出力値の累積値を求め，求めた複数のフィルタ出力値の累積値を比較することによってビット確信度演算を行うことも可能である。これは，例えば図５（１）のユニットＡに０のシンボルが割り当てられ，図５（２）のユニットＢに１のシンボルが割り当てられているものとし，Ｄｕ（ｍ，ｎ）に対するフィルタＡによる出力値の最大値をＤｆ（Ａ，ｍ，ｎ），Ｄｕ（ｍ，ｎ）に対するフィルタＢによる出力値の最大値をＤｆ（Ｂ，ｍ，ｎ）とすると，機密情報となるデータ符号のＮビット目は，以下の条件式によって求められる。 The bit certainty calculation can also be performed by adding the output values of the signal detection filter of FIG. That is, it is possible to calculate the bit certainty factor by obtaining the cumulative value of the filter output value for each filter and comparing the cumulative values of the obtained plural filter output values. This is because, for example, a symbol of 0 is assigned to unit A in FIG. 5 (1), and a symbol of 1 is assigned to unit B in FIG. 5 (2). If the maximum value of the output value is Df (A, m, n), and the maximum value of the output value by the filter B for Du (m, n) is Df (B, m, n), N of the data code serving as confidential information The bit number is obtained by the following conditional expression.

上記式の関係が成立する場合は０と判定し，それ以外の場合は１と判定する。ただし，Ｎ＜Ｒｎの場合はＤｆの加算はｎ＝１〜Ｒｎ＋１までとなる。例えば，データ符号の２ビット目（Ｄｕ（１，２），Ｄｕ（２，２），Ｄｕ（３，２），…，Ｄｕ（８，２））の信号検出結果について，信号検出フィルタの出力値を累積し，上記式の関係が成立すれば０と判定される。なお，ビット確信度演算は，かかる信号検出フィルタの出力値の累積値に基づき実行する場合に限定されず，例えば，ビット確信度演算は，上記フィルタ出力値の累積値のときとほぼ同様な条件式で，各フィルタごとのユニット判定確度を累積し求めた複数のユニット判定確度（又は，判定確度）の累積値を比較することによってもビット確信度演算を実行することが可能である。 When the relationship of the above formula is established, it is determined as 0, and otherwise it is determined as 1 . However, when N <Rn, the addition of Df is n = 1 to Rn + 1. For example, for the signal detection result of the second bit of the data code (Du (1, 2), Du (2, 2), Du (3, 2),..., Du (8, 2)), the output of the signal detection filter If the values are accumulated and the relationship of the above equation is established, it is determined as 0 . The bit certainty calculation is not limited to the case where it is executed based on the accumulated value of the output value of the signal detection filter. It is also possible to execute the bit certainty calculation by comparing the accumulated values of a plurality of unit determination accuracy (or determination accuracy) obtained by accumulating the unit determination accuracy for each filter in the equation.

ここではデータ符号を繰り返し埋め込む場合について説明したが，データを符号化する際に誤り訂正符号などを用いることにより，データ符号ユニットの繰り返しを行わないような方法も実現できる。 Although the case where data codes are repeatedly embedded has been described here, a method that does not repeat data code units can be realized by using an error correction code or the like when encoding data.

また，ビット確信度演算の多数決演算実行後，図２２Ａ等に示すような復号された機密情報のビット列（データ符号）に対して誤り検出符号等を用いることによって，さらにそのビット列の誤り検出をする場合であっても実施可能である。 Further, after executing the majority decision of the bit certainty calculation, an error detection code or the like is used for the decoded confidential information bit string (data code) as shown in FIG. Even if it is a case, it can be implemented.

以上詳細に説明したように，本実施の形態によれば，以下のような優れた効果がある。
（１−１）ドットの配列の違いにより埋め込み情報を表現するため，元の文書のフォント，文字間や行間のピッチに対する変更を伴わない。
（１−２）シンボルを割り当てているドットパターンと，シンボルを割り当てていないドットパターンの濃度（一定区間内のドットの数）が等しいため，人の目には文書の背景に一様な濃度の網掛けがされているように見え，情報の存在が目立たない。
（１−３）シンボルを割り当てているドットパターンと割り当てていないドットパターンを秘密にしておくことで，埋め込まれている情報の解読が困難となる。
（１−４）情報を表わすパターンは細かいドットの集まりで，文書の背景として一面に埋め込まれているため，埋め込みアルゴリズムが公開されたとしても，印刷された文書に対する埋め込み情報の改ざんが困難となる。
（１−５）波（濃淡変化）の方向の違いにより埋め込み信号を検出するため（１画素単位の詳細な検出を行わないので），印刷文書に多少の汚れなどがあった場合でも，安定した情報検出を行うことができる。
（１−６）同じ情報を繰り返し埋め込み，検出時には繰り返し埋め込まれた情報のすべてを利用して情報復元を行うため，大きなフォントの文字によって信号部分が隠されたり，用紙が汚れていたりすることによる部分的な情報の欠落が発生しても，安定して埋め込んだ情報を取り出すことができる。
（１−７）さらに，文書画像内に存在する文字の画素や，汚れなどによって正確に検出できない信号ユニットについては，予め多数決演算の対象から排除するため，多数決演算による誤り訂正，情報抽出等の精度を向上させることができる。なお，上記ユニット判定確度を求める処理にかかる時間はデータ符号再構成処理にかかる時間と比べて極めて短時間であるため，時間的には従来における上記ユニット判定確度を求めずにデータ符号再構成処理を行う場合とさほど変わらない。 As described above in detail, according to the present embodiment, the following excellent effects are obtained.
(1-1) Since the embedded information is expressed by the difference in dot arrangement, the original document font, character spacing, and line spacing are not changed.
(1-2) Since the dot pattern to which symbols are assigned and the dot pattern to which symbols are not assigned have the same density (the number of dots in a certain interval), the human eye has a uniform density on the background of the document. It appears to be shaded and the presence of information is inconspicuous.
(1-3) Keeping the dot pattern to which symbols are assigned and the dot pattern to which symbols are not assigned in secret makes it difficult to decipher the embedded information.
(1-4) A pattern representing information is a collection of fine dots that are embedded as a background of a document, so that even if an embedding algorithm is disclosed, it is difficult to falsify embedded information in a printed document. .
(1-5) Since the embedded signal is detected based on the difference in the direction of the wave (change in shading) (because detailed detection is not performed in units of one pixel), the printed document is stable even if there is some dirt, etc. Information detection can be performed.
(1-6) Since the same information is repeatedly embedded and information is restored using all of the repeatedly embedded information at the time of detection, the signal portion is hidden by large font characters or the paper is dirty. Even if partial information loss occurs, the embedded information can be taken out stably.
(1-7) Further, signal units that cannot be detected accurately due to character pixels or dirt existing in the document image are excluded from the target of the majority operation in advance, so that error correction, information extraction, etc. by the majority operation are performed. Accuracy can be improved. Note that the time required for the process for obtaining the unit determination accuracy is extremely short compared with the time required for the data code reconstruction process. It is not much different from the case of doing.

（第２の実施の形態）
上記第２の実施の形態の透かし情報検出装置によるデータ符号再構成処理では，透かし入りの文書画像から得られるデータを符号化する際に，誤り検出／訂正符号（例えば，ハミング符号，ＢＣＨ符号）などを用いて，情報が正しいか誤っているかを判断できるような符号化形式を用いる点で，第１の実施の形態のデータ符号再構成処理（Ｓ４０４）と相違する。なお，データ列には，チェックサムやＣＲＣ１６（ＣＲＣ１６−ＣＣＩＴＴなど）を付与する方法を用いること等も可能である。以下，第１の実施の形態との相違点について詳細に説明するが，その他の点については，ほぼ同様であるため詳細な説明は省略する。 (Second Embodiment)
In the data code reconstruction process performed by the watermark information detection apparatus according to the second embodiment, an error detection / correction code (for example, a Hamming code or a BCH code) is used when encoding data obtained from a watermarked document image. Is different from the data code reconstruction process (S404) of the first embodiment in that an encoding format that can determine whether the information is correct or incorrect is used. Note that it is possible to use a method of adding a checksum or CRC16 (CRC16-CCITT or the like) to the data string. Hereinafter, differences from the first embodiment will be described in detail, but the other points are substantially the same, and detailed description thereof will be omitted.

第２の実施の形態では，ユニット判定確度Ｐｕの閾値Ｔを求める際に，所定範囲内に存在する複数の値αを用いることで，複数の閾値Ｔを計算し，各々の閾値Ｔを用いてビット確信度演算を実行する。なお，各々の閾値Ｔ（α_ｉ）は，下記の数式の通りとなる。なお，例えば，値αは，“｛０．１，０．２，０．３，０．４，０．５，０．６，０．７，０．８，０．９，１．０｝”等の値を持っているものとする。 In the second embodiment, when the threshold T of the unit determination accuracy Pu is obtained, a plurality of thresholds T are calculated by using a plurality of values α existing within a predetermined range, and each threshold T is used. Perform bit confidence calculation. Each threshold value T (α _i ) is expressed by the following formula. For example, the value α is “{0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0}. It has a value such as “”.

以下に，図２３を参照しながら，第１の実施の形態と相違する第２の実施の形態のデータ再構成処理等の具体的な処理について説明するが，その他のビット確信度演算の部分については第１の実施の形態のビット確信度演算と実質的に同一であるため，詳細な説明は省略する。 Hereinafter, specific processing such as data reconstruction processing according to the second embodiment, which is different from the first embodiment, will be described with reference to FIG. Is substantially the same as the bit certainty calculation of the first embodiment, and a detailed description thereof will be omitted.

図２３に示すように，まず，閾値Ｔ（α_ｉ）を計算する（Ｓ９０１）。なお，ｉの初期値は，例えばｉ＝１であるとする。次に，上記に示した数式から求められる閾値Ｔ（α_ｉ）を用いて，第１の実施の形態で説明したビット確信度演算を行う（Ｓ９０３）。そのビット確信度演算から得られた当該閾値Ｔ（α_ｉ）に対応するデータ符号に対して，透かしの埋め込み側（透かし情報埋め込み装置）で付与した誤り検出符号，誤り検出チェックサム，またはＣＲＣ１６などを用いて誤り検出を実行する（Ｓ９０５）。 As shown in FIG. 23, first, a threshold value T (α _i ) is calculated (S901). It is assumed that the initial value of i is, for example, i = 1. Next, the bit certainty calculation described in the first embodiment is performed using the threshold value T (α _i ) obtained from the above formula (S903). An error detection code, an error detection checksum, CRC16, or the like assigned on the watermark embedding side (watermark information embedding device) to the data code corresponding to the threshold T (α _i ) obtained from the bit certainty calculation Is used to perform error detection (S905).

上記誤り検出の実行によって，誤りが検出された場合（Ｓ９０７），そのデータ符号については，誤りデータ符号として，そのデータ符号を破棄する（排除する）。つまり，誤ったデータ符合を破棄することで，正しいデータ符号のみが存在し，正しいデータ符号だけを得ることができる。 If an error is detected by executing the error detection (S907), the data code is discarded (excluded) as an error data code. That is, by discarding an incorrect data code, only the correct data code exists and only the correct data code can be obtained.

上記誤り検出後の後続処理についてさらに具体的に説明すると，上記Ｔ（α_ｉ）のｉを１インクリメントし（ｉ＝１，２，３，…），閾値Ｔ（α_ｉ＋１）を求め，ユニット判定確度Ｐｕ≧Ｔ（α_ｉ）の関係を満たす信号検出結果についてビット確信度演算した結果，誤り検出を実行する（Ｓ９０５）。当該誤り検出処理によって，さらに誤りが検出された場合（Ｓ９０７），上記説明の通り，誤りのデータ符号を排除していく。その後，さらにｉを１インクリメントし閾値（α_ｉ＋２）を求める（Ｓ９０１）。以降，誤りが検出される限り，ｉがインクリメントすることで値αが範囲外となるまで上記一連の処理が継続する。なお，初めて誤りが検出されなかったデータ符号を得た時点で，そのデータ符号を出力し，その他の残りのビット確信度演算等の処理を省略してもよい。 More specifically, the subsequent processing after the error detection will be described. In this case, _{i of} T (α _i ) is incremented by 1 (i = 1, 2, 3,...), A threshold T (α _{i + 1} ) is obtained, and unit determination is performed. Error detection is performed as a result of the bit certainty calculation for the signal detection result satisfying the relationship of accuracy Pu ≧ T (α _i ) (S905). If more errors are detected by the error detection process (S907), the error data code is eliminated as described above. Thereafter, i is further incremented by 1 to obtain a threshold value (α _{i + 2} ) (S901). Thereafter, as long as an error is detected, the above series of processing continues until i is incremented and the value α is out of range. Note that when a data code for which no error is detected is obtained for the first time, the data code may be output, and the remaining processing such as bit certainty calculation may be omitted.

また，求められた閾値Ｔ（α_ｉ）に基づいて，ビット確信度演算し，誤り検出処理終了後，誤りが検出されないデータ符号であった場合（Ｓ９０７），その閾値Ｔ（α_ｉ）を文書画像に適した閾値として動的に決定する（Ｓ９０９）。つまり，ユニット判定確度Ｐｕは，最尤判定，最尤法等を例示できるように，閾値Ｔ（α_ｉ）を順に変えながら，その都度誤り検出処理を実行し，徐々に適切なユニット判定確度Ｐｕを定めている。以上で，第２の実施の形態に係るデータ符号再構成の一連の処理の説明を終了する。 Also, based on the obtained threshold value T (α _i ), the bit certainty factor is calculated, and after the error detection process, the data code is such that no error is detected (S907), and the threshold value T (α _i ) is recorded in the document A threshold value suitable for the image is dynamically determined (S909). That is, the unit determination accuracy Pu is obtained by executing the error detection process each time while sequentially changing the threshold T (α _i ) so that the maximum likelihood determination, the maximum likelihood method, and the like can be exemplified, and gradually the appropriate unit determination accuracy Pu. Is stipulated. This is the end of the description of the series of data code reconstruction processes according to the second embodiment.

以上詳細に説明したように，本実施の形態によれば，以下のような優れた効果がある。 As described above in detail, according to the present embodiment, the following excellent effects are obtained.

閾値Ｔ（α_ｉ）を動的に変動させ，適切な閾値を決定することで，印刷文書または文書画像上の状況に左右されず，あらゆる環境下で誤った透かし信号を検出する可能性を最大限軽減することができる。 By dynamically changing the threshold value T (α _i ) and determining an appropriate threshold value, it is possible to maximize the possibility of detecting an erroneous watermark signal in any environment regardless of the situation on the printed document or document image. The limit can be reduced.

なお，従来では，文書画像上には印刷やスキャンによる揺らぎや紙質や，汚れなどによって，正しいビット値を示す信号ユニットと誤ったビット値を示す信号ユニットの比率が一定していない。つまり上記汚れや紙質に非常に左右される。したがって，閾値Ｔ（α_ｉ）を固定的な値に設定してしまうと，正しいビット値を示す信号ユニットを雑音として誤った信号ユニットとして破棄してしまう場合や，逆に誤った信号ユニットを正しい信号ユニットとして判定してしまう場合があった。 Conventionally, the ratio between a signal unit indicating a correct bit value and a signal unit indicating an incorrect bit value is not constant on a document image due to fluctuations in printing or scanning, paper quality, dirt, and the like. In other words, it is very dependent on the dirt and paper quality. Therefore, if the threshold value T (α _i ) is set to a fixed value, a signal unit indicating a correct bit value may be discarded as an erroneous signal unit as noise, or conversely, an incorrect signal unit may be correct. In some cases, it was determined as a signal unit.

次に，第１の実施の形態及び第２の実施の形態において，上述した一連の処理は，専用のハードウェアにより行うこともできるし，ソフトウェアにより行うこともできる。一連の処理をソフトウェアによって行う場合には，そのソフトウェアを構成するプログラムが，汎用のコンピュータやマイクロコンピュータ等にインストールされる。 Next, in the first embodiment and the second embodiment, the series of processes described above can be performed by dedicated hardware or software. When a series of processing is performed by software, a program constituting the software is installed in a general-purpose computer or a microcomputer.

プログラムは，コンピュータに内蔵されている記録媒体としてのハードディスクやＲＯＭに予め記録しておくことができる。 The program can be recorded in advance on a hard disk or ROM as a recording medium built in the computer.

あるいはまた，プログラムは，ハードディスクドライブに限らず，フレキシブルディスク，ＣＤ−ＲＯＭ（ＣｏｍｐａｃｔＤｉｓｃＲｅａｄＯｎｌｙＭｅｍｏｒｙ），ＭＯ（ＭａｇｎｅｔｏＯｐｔｉｃａｌ）ディスク，ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ），磁気ディスク，半導体メモリなどのリムーバブル記録媒体に，一時的あるいは永続的に格納（記録）しておくことができる。このようなリムーバブル記録媒体は，いわゆるパッケージソフトウエアとして提供することができる。 Alternatively, the program is not limited to a hard disk drive, but a removable recording medium such as a flexible disk, a CD-ROM (Compact Disc Read Only Memory), an MO (Magneto Optical) disk, a DVD (Digital Versatile Disc), a magnetic disk, and a semiconductor memory. Can be stored (recorded) temporarily or permanently. Such a removable recording medium can be provided as so-called package software.

ここで，本明細書において，コンピュータに各種の処理を行わせるためのプログラムを記述する処理ステップは，必ずしもフローチャートとして記載された順序に沿って時系列に処理する必要はなく，並列的あるいは個別に実行される処理（例えば，並列処理あるいはオブジェクトによる処理）も含むものである。 Here, in this specification, the processing steps for describing a program for causing a computer to perform various processes do not necessarily have to be processed in time series in the order described in the flowchart, but in parallel or individually. This includes processing to be executed (for example, parallel processing or processing by an object).

以上，添付図面を参照しながら本発明の好適な実施形態について説明したが，本発明はかかる例に限定されない。当業者であれば，特許請求の範囲に記載された技術的思想の範疇内において各種の変更例または修正例を想定し得ることは明らかであり，それらについても当然に本発明の技術的範囲に属するものと了解される。 As mentioned above, although preferred embodiment of this invention was described referring an accompanying drawing, this invention is not limited to this example. It is obvious for those skilled in the art that various changes or modifications can be envisaged within the scope of the technical idea described in the claims, and these are naturally within the technical scope of the present invention. It is understood that it belongs.

本実施の形態にかかる透かし情報の検出は，透かし情報が埋め込まれた印刷文書をスキャンし，文書画像を得て，その文書画像から透かし情報を検出する場合を例に挙げて説明したが，かかる例に限定されず，例えば，印刷文書ではなく，電子媒体上で全て処理が行われる場合は，印刷文書をスキャンせず，そのまま透かし情報が埋め込まれた文書画像から透かし情報を検出する場合等でもよい。 The watermark information detection according to the present embodiment has been described by taking an example in which a printed document in which watermark information is embedded is scanned, a document image is obtained, and the watermark information is detected from the document image. For example, when all processing is performed on an electronic medium instead of a printed document, even when the watermark information is detected from a document image in which the watermark information is embedded without scanning the printed document. Good.

本発明は，印刷された機密情報入り文書又は機密情報入り文書画像から機密情報を検出する技術に適用可能である。 The present invention is applicable to a technique for detecting confidential information from a printed document including confidential information or a document image including confidential information.

フィルタリングの処理結果の概略の一例を示す説明図である。It is explanatory drawing which shows an example of the outline of the process result of filtering. 従来にかかるデータ再構成処理の概略の一例を示す説明図である。It is explanatory drawing which shows an example of the outline of the data reconstruction process concerning the former. 第１の実施の形態にかかる透かし情報埋め込み装置及び透かし情報検出装置の構成を示す説明図である。It is explanatory drawing which shows the structure of the watermark information embedding apparatus and watermark information detection apparatus concerning 1st Embodiment. 透かし画像形成部１２の処理の流れを示す流れ図である。4 is a flowchart showing a processing flow of a watermark image forming unit 12. 透かし信号の一例を示す説明図であり，（１）はユニットＡを，（２）はユニットＢを示している。It is explanatory drawing which shows an example of a watermark signal, (1) has shown the unit A, (2) has shown the unit B. 図５（１）の画素値の変化をａｒｃｔａｎ（１／３）の方向から見た断面図である。It is sectional drawing which looked at the change of the pixel value of Fig.5 (1) from the direction of arctan (1/3). 透かし信号の一例を示す説明図であり，（３）はユニットＣを，（４）はユニットＤを，（５）はユニットＥを示している。It is explanatory drawing which shows an example of a watermark signal, (3) shows the unit C, (4) shows the unit D, (5) shows the unit E. 背景画像の説明図であり，（１）はユニットＥを背景ユニットと定義し，これを隙間なく並べた透かし画像の背景とした場合を示し，（２）は（１）の背景画像の中にユニットＡを埋め込んだ一例を示し，（３）は（１）の背景画像の中にユニットＢを埋め込んだ一例を示している。It is explanatory drawing of a background image, (1) shows the case where unit E is defined as a background unit and this is used as the background of a watermark image arranged without gaps, and (2) is in the background image of (1) An example in which the unit A is embedded is shown, and (3) shows an example in which the unit B is embedded in the background image of (1). 透かし画像へのシンボル埋め込み方法の一例を示す説明図である。It is explanatory drawing which shows an example of the symbol embedding method to a watermark image. 機密情報を透かし画像に埋め込む方法について示した流れ図である。5 is a flowchart illustrating a method for embedding confidential information in a watermark image. 透かし検出部３２の処理の流れを示す流れ図である。4 is a flowchart showing a processing flow of a watermark detection unit 32. 透かし入り文書画像の一例を示す説明図である。It is explanatory drawing which shows an example of a watermarked document image. 図１２の一部を拡大して示した説明図である。It is explanatory drawing which expanded and showed a part of FIG. 透かし検出部３２の処理の流れを示す流れ図である。4 is a flowchart showing a processing flow of a watermark detection unit 32. 図１５は入力画像（図１５（１））と，ユニットパターンの区切り位置を設定した後の入力画像（図１５（２））の一例を示している。FIG. 15 shows an example of the input image (FIG. 15 (1)) and the input image (FIG. 15 (2)) after setting the unit pattern separation positions. 入力画像中におけるユニットＡに対応する領域の一例を示した説明図である。It is explanatory drawing which showed an example of the area | region corresponding to the unit A in an input image. 図１６を波の伝播方向と平行な方向から見た断面図である。It is sectional drawing which looked at FIG. 16 from the direction parallel to the propagation direction of a wave. ユニットパターンＵ（ｘ，ｙ）中に埋め込まれているシンボルユニットがユニットＡであるかユニットＢであるかを判定する方法について説明する説明図である。It is explanatory drawing explaining the method to determine whether the symbol unit embedded in the unit pattern U (x, y) is the unit A or the unit B. FIG. 情報復元の一例を示す説明図であるIt is explanatory drawing which shows an example of information restoration データ符号の復元方法の一例を示す説明図である。It is explanatory drawing which shows an example of the decompression | restoration method of a data code. データ符号の復元方法の一例を示す説明図である。It is explanatory drawing which shows an example of the decompression | restoration method of a data code. データ符号の復元方法の一例を示す説明図である。It is explanatory drawing which shows an example of the decompression | restoration method of a data code. データ符号の復元方法の一例を示す説明図である。It is explanatory drawing which shows an example of the decompression | restoration method of a data code. データ符号の復元方法の一例を示す説明図である。It is explanatory drawing which shows an example of the decompression | restoration method of a data code. データ符号の復元方法の一例を示す説明図である。It is explanatory drawing which shows an example of the decompression | restoration method of a data code. 第２の実施の形態のデータ再構成処理の概略の一例を示すフローチャートである。It is a flowchart which shows an example of the outline of the data reconstruction process of 2nd Embodiment.

Explanation of symbols

１０透かし情報埋め込み装置
１１文書画像形成部
１２透かし画像形成部
１３透かし入り文書画像合成部
１４出力デバイス DESCRIPTION OF SYMBOLS 10 Watermark information embedding apparatus 11 Document image formation part 12 Watermark image formation part 13 Watermarked document image composition part 14 Output device

Claims

A watermark information detection device that extracts confidential information from an image:
An image input unit for inputting the image and obtaining an input image;
Extracting the embedded pattern by filtering the input image using a detection filter prepared for the same type as the pattern that reacts to one or more types of patterns embedded in the input image; A pattern determination unit for determining and determining a symbol corresponding to the pattern;
An accuracy calculation unit that calculates the accuracy of the symbol determination for each pattern embedded in the input image based on an output value for each pattern obtained by the detection filter ;
Comparing the calculated determination accuracy with a predetermined threshold value, selecting a pattern corresponding to the determination accuracy indicating a value larger than the threshold value, and decrypting the confidential information using the selected pattern ; Prepared,
The information decoding unit further executes a certainty factor operation for determining a symbol value constituting the confidential information encoded from one or more symbol values corresponding to the selected pattern , and based on the result, the confidentiality calculation is performed. A watermark information detection apparatus for decoding information.

Prepare one or more types of identifiable patterns, give at least one symbol to one pattern, and place one or more types of the patterns generated by encoding confidential information An image input unit that reads the image in which the confidential information is embedded as an input image;
To detect the pattern from the input image, it performs filtering of the input image for detection filter responsive to the pattern prepared by the same kind as the pattern, in the image region of the input image, by the detection filter A pattern determining unit that determines and determines a symbol of the pattern based on an output value obtained;
A probability calculating unit for calculating for each pattern embedded the accuracy of symbol decision in the input image based on the output value of each pattern obtained by the detection filter;
The determination accuracy calculated by the previous Ki確 calculator with a predetermined threshold value, select the pattern corresponding to the determination accuracy indicating a value greater than the threshold value, decoding the secret information using the selected pattern An information decoding unit to perform;
With
The information decoding unit further executes a certainty factor operation for determining a symbol value constituting the confidential information encoded from one or more symbol values corresponding to the selected pattern , and based on the result, the confidentiality calculation is performed. A watermark information detection apparatus for decoding information.

3. The watermark information detection apparatus according to claim 1, wherein in the certainty factor calculation, a majority vote calculation for taking a majority vote is performed on the selected pattern. 4.

The magnitude of the cumulative value of a plurality of filter output values obtained by accumulating the filter output values for each filter is compared with the selected pattern in the certainty factor calculation. 2. The watermark information detection apparatus according to 2.

3. The certainty factor calculation compares the magnitudes of accumulated values of a plurality of determination accuracy obtained by accumulating determination accuracy for each filter with respect to the selected pattern. The watermark information detection apparatus described.

The watermark information detection apparatus according to claim 1, wherein the determination accuracy is a maximum value among output values obtained by filtering using one or more types of the detection filters.

When two or more types of detection filters are used , the determination accuracy is a difference between output values obtained by filtering with at least two of the two or more types of detection filters. The watermark information detection apparatus according to claim 1, wherein the watermark information detection apparatus is calculated using a value.

A unit pattern having at least one or more patterns formed by encoding the confidential information as a unit is repeatedly embedded in the input image a plurality of times. , 4, 5, 6, or 7. The watermark information detection device according to any one of items 7 to 7.

The information decoding unit calculates a threshold value of the determination accuracy using the calculated determination accuracy of one or more, 2, 2, 3, 4, 5, 6, 7, Or the watermark information detection apparatus of any one of 8 items | items.

The watermark information detection apparatus according to claim 9, wherein the threshold value of the determination accuracy is calculated and held in advance before selection of the pattern by the information decoding unit.

11. The watermark information detection apparatus according to claim 9, wherein the threshold value of the determination accuracy is an average value of the calculated one or more determination accuracy.

The watermark information detection apparatus further includes an error detection unit,
The error detection unit detects an error with an error detection code for the confidential information decrypted by the information decryption unit. The watermark information detection apparatus according to any one of 8, 9, 10, or 11.

The information decoding unit has a first threshold and a second threshold different from each other as the predetermined threshold used for comparison with the determination accuracy,
The error detection unit performs error detection on the pattern selected by the information decoding unit as a result of the comparison between the determination accuracy and the first threshold, and as a result, detects an error,
The information decryption unit compares the determination accuracy with the second threshold value and performs the selection of the pattern again, and further executes the certainty factor calculation for the selected pattern to obtain the confidential information . The watermark information detection apparatus according to claim 12 , wherein the watermark information detection apparatus performs decoding.

The information decoding unit is configured to error by the error detecting unit is no longer detected, characterized in that said repeatedly executing the certainty calculation after the error detection by the error detection unit, either one of claims 12 or 13 wherein 2. The watermark information detection apparatus according to item 1.

The information decoding unit uses the plurality of determination accuracy thresholds one by one in order one by one. 15. The watermark information detection apparatus according to any one of items 12, 13, or 14.

The information decoding unit dynamically calculates the threshold value of the determination accuracy to be different from each other, characterized in that: The watermark information detection device according to any one of 11, 12, 13, 14, or 15 items.

A watermark information detection method for extracting confidential information from an image comprising:
An image input step of inputting the image by an image input means to obtain an input image;
A pattern determining step of extracting one or more patterns embedded in the input image by filtering the input image by a pattern determining means , determining and determining a symbol corresponding to the pattern;
An accuracy calculation step of calculating the accuracy of the symbol determination for each pattern based on the output value of each pattern obtained by the detection filter by the accuracy calculation means;
Information by the information decoding means, a determination accuracy of the calculated and compared to a predetermined threshold, select the pattern corresponding to the determination accuracy indicating a value greater than the threshold value, to decode the secret information using the selected pattern Decryption process,
In the information decoding step, a certainty factor calculation for determining a symbol value constituting the confidential information encoded from one or more symbol values corresponding to the selected pattern is further executed, and based on the result, the confidentiality calculation is performed. A method for detecting watermark information, comprising decoding information.

By preparing one or more identifiable patterns, providing at least one symbol for one pattern, and placing one or more of the patterns generated by encoding confidential information An image input step of reading an image in which the confidential information is embedded as an input image by an image input means ;
In order to detect the pattern from the input image, the pattern determination means prepares only the same type of detection filter that reacts to the pattern and performs filtering of the input image, and in the image area of the input image, wherein determining the symbol of the pattern based on the output value obtained by the detection filter, and determining the pattern determination step;
An accuracy calculation step of calculating the accuracy of symbol determination for each pattern embedded in the input image based on the output value for each pattern obtained by the detection filter by the accuracy calculation means ;
The information decoding means compares the determination accuracy calculated in the determination accuracy calculation step with a predetermined threshold, selects a pattern corresponding to the determination accuracy indicating a value larger than the threshold, and uses the selected pattern to An information decoding step for decoding information,
In the information decoding step, a certainty factor calculation for determining a symbol value constituting the confidential information encoded from one or more symbol values corresponding to the selected pattern is further executed, and based on the result, the confidentiality calculation is performed. A method for detecting watermark information, comprising decoding information.