JP4224999B2

JP4224999B2 - Image processing apparatus and image processing method

Info

Publication number: JP4224999B2
Application number: JP2002222044A
Authority: JP
Inventors: 哲二郎近藤; 成司和田; 淳一石橋; 和志吉川
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-07-30
Filing date: 2002-07-30
Publication date: 2009-02-18
Anticipated expiration: 2022-07-30
Also published as: JP2004064564A

Description

【０００１】
【発明の属する技術分野】
本発明は、たとえば動画像圧縮装置などに用いられる動き検出装置等の画像処理装置および画像処理方法に関するものである。
【０００２】
【従来の技術】
画像処理装置においては、動画像圧縮を効率よく行うための主要技術の１つとして、画像の動きを示す動きベクトルを求める動き検出がある。この動きベクトルを求める手法はいくつか提案されているが、主な手法の１つとしてブロックマッチングアルゴリズムと呼ばれる手法がある。
【０００３】
図１は、ブロックマッチングアルゴリズムを採用した従来の画像処理装置における動き検出装置の構成例を示すブロック図である。
【０００４】
この動き検出装置１は、フレームメモリ２，３、および動きベクトル検出部４を有している。
動き検出装置１においては、入力端子ＴINから画像信号が入力されると、１画面の情報がフレームメモリ２に格納される。
次の画面情報が入力されると、先ほどの（前回に入力された）フレームメモリ２の情報がフレームメモリ３に格納され、現在（今回）入力された情報がフレームメモリ２に格納される。
すなわち、カレントフレームＦｃの情報がフレームメモリ２に、参照フレームＦｒの情報がフレームメモリ３に格納されていることになる。
次に、カレントフレームＦｃ、参照フレームＦｒの情報が動きベクトル検出部４に送られる。そして、動きベクトル検出部４でブロック分けされて動きベクトル（Ｖｘ，Ｖｙ）が検出されて、端子ＴOUT から出力される。
【０００５】
図２は、ブロックマッチングアルゴリズムの概要を説明するための図である。以下に、アルゴリズムの概要を図２に関連付けて説明する。
【０００６】
このアルゴリズムにおいては、カレントフレームＦｃ内の注目画素Ｆｃ（ｘ，ｙ）における動きベクトルは、注目画素Ｆｃ（ｘ，ｙ）を中心としてある基準ブロック範囲（Ｌ×Ｌ）の画素と、参照フレームＦｒ内のサーチエリアＳＲ内の前記ブロック範囲（Ｌ×Ｌ）と同じブロック範囲内の画素とで対応する画素との差分絶対値和を演算する。
サーチエリアＳＲ内で抽出するブロック範囲を一画素ずつ移動させながら上述の演算を繰り返し、全てのブロックの中で差分絶対値和が最も小さいブロックの中心位置と注目画素位置との差分ベクトルを解（動きベクトル）とする。
【０００７】
次に、図３に関連付けてカレントフレームＦｃ内にある画素Ｆｃ（ｘ，ｙ）の動きベクトルを検出する処理手順を詳細に説明する。
【０００８】
ステップＳＴ１
ステップＳＴ１においては、処理開始ＳＴ０後、注目画素の位置（ｘ，ｙ）から参照フレーム内の同位置を基準としたサーチエリアＳＲを決定する。
【０００９】
ステップＳＴ２
ステップＳＴ２においては、演算結果の最小値を格納する変数ｍｉｎの初期化のために、演算式の最大値を代入する。１画素を８ビット、ブロック内の画素数を１６とすると、２⁸ ×１６＝４０９６をｍｉｎに代入する。
【００１０】
ステップＳＴ３
ステップＳＴ３においては、サーチエリアＳＲ内のブロックをカウントするカウンタ変数ｎを１に初期化する。
【００１１】
ステップＳＴ４
ステップＳＴ４においては、演算結果を代入する変数ｓｕｍを０に初期化する。
【００１２】
ステップＳＴ５
ステップＳＴ５においては、基準ブロックの範囲をＬ×Ｌ、カレントフレームＦｃのあるブロック内の画素をＦｃ（ｉ，ｊ）、参照フレームＦｒのサーチエリアＳＲ内のｋ番目のブロック内の画素をＦｒｋ（ｉ，ｊ）とすると、対応する画素との差分絶対値和、すなわち次の数１に示す演算を行い、演算結果をｓｕｍに代入する。
【００１３】
【数１】
$$\mathrm{sum} = \sum_{i=1}^{L} \sum_{j=1}^{L} \left| F_c(i, j) - F_{rk}(i, j) \right|$$
【００１４】
ステップＳＴ６
ステップＳＴ６においては、演算した差分絶対値和ｓｕｍと差分絶対値和の最小値ｍｉｎとの大小関係の判別を行う。演算した差分絶対値和ｓｕｍが小さい場合にはステップＳＴ７へ、大きい場合（等しいを含む）には演算結果が最小値ではないので更新手続きのステップＳＴ７をスキップしてステップＳＴ８へ進む。
【００１５】
ステップＳＴ７
ステップＳＴ７においては、最小値ｍｉｎを演算結果ｓｕｍに更新し、動きベクトル番号としてブロックのカウント値ｎを設定する。
【００１６】
ステップＳＴ８
ステップＳＴ８においては、ブロックのカウント値ｎがサーチエリアＳＲ内のブロック総数、つまり最後のブロックならば終了なのでステップＳＴ１０へ、最後のブロックではなければ、ＳＴ９へ進む。
【００１７】
ステップＳＴ９
ステップＳＴ９においては、ブロックのカウント値ｎをｎ＋１にインクリメントして、演算を繰り返すためにステップＳＴ４へ進む。
【００１８】
ステップＳＴ１０
ステップＳＴ１０においては、動き番号に格納されているブロック番号のブロックの中心画素と（ｘ，ｙ）から動きベクトルを求めて出力する。
【００１９】
【発明が解決しようとする課題】
上述したブロックマッチングアルゴリズムは、式（１）の演算量が非常に膨大となっており、ＭＰＥＧ等の画像圧縮処理の大半の時間がこれに費やされるという不利益がある。
【００２０】
本発明は、かかる事情に鑑みてなされたものであり、その目的は、僅かな演算量のみでマッチング処理を行うことができ、しかも動きベクトル等を精度良く検出することを可能とする、画像処理装置および画像処理方法を提供することにある。
【００２１】
【課題を解決するための手段】
上記目的を達成するため、本発明の第１の観点に係る画像処理装置は、入力画像データを所定数の画素を含むブロックにブロック化し、上記ブロック化された画像データ毎に特徴量を抽出する特徴量抽出手段と、上記特徴量をアドレスとして、上記ブロック化された画像データの位置情報を格納する格納手段と、注目位置の上記ブロックの上記特徴量に基づいて、上記格納手段に格納された上記画像データの位置情報と上記特徴量とのマッチング処理を行い、マッチングした結果、規定範囲外にある場合には、近傍の特徴量を選択して再度マッチング処理を行うマッチング手段とを有する。
【００２２】
第１の観点では、上記特徴量には周辺画素値を含む。
【００２３】
第１の観点では、上記画素値は複数のビットで表され、上記マッチング手段は、フレーム毎にバラツキを含む画像の場合には、上記複数ビットの所定ビットを除いてマッチング処理を行う。
好適には、上記複数ビットの下位側ビットをマスクしてマッチング処理を行う。
また、好適には、複数ビットのビット数を少なくして再量子化を行う。
【００２４】
第１の観点では、上記特徴量には適応的量子化（ＡＤＲＣ）に基づく量子化コードを含む。
【００２５】
第１の観点では、上記マッチング手段は、マッチング結果に基づいて注目ブロックの動き検出を行い、検出した動き情報の値が規定範囲外にあるときに、近傍の特徴量を選択して再度マッチング処理を行う。
【００２６】
第１の観点では、上記マッチング手段は、マッチング処理を行う際の近傍の特徴量として、少なくとも特徴量をｎビット表現した際の所定ビットを反転させたものを対象とする。
【００２７】
第１の観点では、上記マッチング手段は、上記近傍の特徴量の選択において注目画素のパターンに応じてビット反転の形態を変更する。
【００２８】
本発明の第２の観点に係る画像処理装置は、画像データのカレントフレームの情報と参照フレームの情報とのマッチング処理を行う画像処理装置であって、上記参照フレームの情報に基づいて注目画素を中心とした所定のブロック範囲の画素値を含む特徴量をアドレスとして変換し、変換後の位置座標を含む情報を格納する格納手段と、上記カレントフレームの情報に含まれる注目画素の特徴量を、特徴量アドレスとして上記格納手段の格納情報を読み取り、カレントフレーム内の注目画素と上記格納手段から読み込んだ特徴量アドレスに含まれる位置座標との距離をそれぞれ演算し、各演算結果同士のマッチング処理を行い、特徴量が規定範囲外にある場合には、近傍の特徴量を選択して再度マッチング処理を行うマッチング手段とを有する。
【００２９】
第２の観点では、上記マッチング手段は、カレントフレーム内の注目画素と上記格納手段から読み込んだ特徴量アドレスに含まれる位置座標との距離をそれぞれ演算し、複数の候補の中から距離が最小である位置情報に基づいた差分座標を注目画素の動き情報として検出し、検出した動き情報の値が想定される動き情報の規定値より小さいときは、検出した動き情報を正しい情報として判断し、検出した動き情報の値が想定される動き情報の規定値以上の場合は、想定外の動き情報として判断して、カレントフレーム内の画素と上記格納手段で近い特徴量アドレスを生成し、近傍画素の位置情報に基づいて再度マッチング処理を行う。
【００３０】
本発明の第３の観点に係る画像処理装置は、画像データのカレントフレームの情報と参照フレームの情報とのマッチング処理を行う画像処理装置であって、上記カレントフレームの情報を受けて適応的量子化（ＡＤＲＣ）に基づく量子化コードをカレントフレームの特徴量として生成し、上記参照フレームの情報を受けて適応的量子化（ＡＤＲＣ）に基づく量子化コードを参照フレームの特徴量として生成する特徴量生成手段と、上記特徴量生成手段による参照フレームの特徴量をアドレスとして変換して、変換後の情報を格納する格納手段と、上記特徴量生成手段によるカレントフレームの特徴量を、特徴量アドレスとして上記格納手段の格納情報を読み取り、カレントフレーム内の注目画素と上記格納手段から読み込んだ特徴量アドレスに含まれる上記ＡＤＲＣの量子化コードのマッチング処理を行い、特徴量が一致しない場合には、近傍の特徴量を選択して再度マッチング処理を行うマッチング手段とを有する。
【００３１】
本発明の第４の観点に係る画像処理方法は、入力画像データを所定数の画素を含むブロックにブロック化する第１のステップと、上記ブロック化された画像データ毎に特徴量を抽出する第２のステップと、上記抽出された特徴量をアドレスとして、上記ブロック化された画像データの位置情報を格納手段に格納する第３のステップと、注目位置の上記ブロックの上記特徴量に基づいて、上記格納手段に格納された上記画像データの位置情報と上記特徴量とのマッチング処理を行い、マッチングした結果、規定範囲外にある場合には、近傍の特徴量を選択して再度マッチング処理を行う第４のステップとを有する。
【００３２】
本発明の第５の観点に係る画像処理方法は、画像データのカレントフレームの情報と参照フレームの情報とのマッチング処理を行う画像処理方法であって、上記参照フレームの情報に基づいて注目画素を中心とした所定のブロック範囲の画素値を含む特徴量をアドレスとして変換し、変換後の位置座標を含む情報を格納手段に格納する第１のステップと、上記カレントフレームの情報に含まれる注目画素の特徴量を、特徴量アドレスとして上記格納手段の格納情報を読み取り、カレントフレーム内の注目画素と上記格納手段から読み込んだ特徴量アドレスに含まれる位置座標との距離をそれぞれ演算する第２のステップと、各演算結果同士のマッチング処理を行い、特徴量が規定範囲外にある場合には、近傍の特徴量を選択して再度マッチング処理を行う第３のステップとを有する。
【００３３】
第５の観点では、上記第３のステップにおいては、上記演算結果の複数の候補の中から距離が最小である位置情報に基づいた差分座標を注目画素の動き情報として検出し、検出した動き情報の値が想定される動き情報の規定値より小さいときは、検出した動き情報を正しい情報として判断し、検出した動き情報の値が想定される動き情報の規定値以上の場合は、想定外の動き情報として判断して、カレントフレーム内の画素と上記格納手段で近い特徴量アドレスを生成し、近傍画素の位置情報に基づいて再度マッチング処理を行う。
【００３４】
本発明の第６の観点に係る画像処理方法は、画像データのカレントフレームの情報と参照フレームの情報とのマッチング処理を行う画像処理方法であって、上記カレントフレームの情報を受けて適応的量子化（ＡＤＲＣ）に基づく量子化コードをカレントフレームの特徴量として生成する第１のステップと、上記参照フレームの情報を受けて適応的量子化（ＡＤＲＣ）に基づく量子化コードを参照フレームの特徴量として生成する第２のステップと、上記第２のステップによる参照フレームの特徴量をアドレスとして変換し、変換後の情報を格納手段に格納する第３のステップと、上記第１のステップによるカレントフレームの特徴量を、特徴量アドレスとして上記格納手段の格納情報を読み取り、カレントフレーム内の注目画素と上記格納手段から読み込んだ特徴量アドレスに含まれる上記ＡＤＲＣの量子化コードのマッチング処理を行い、マッチングの結果、特徴量が一致しない場合には、近傍の特徴量を選択して再度マッチング処理を行う第４のステップとを有する。
【００３５】
本発明によれば、入力画像データが、特徴抽出手段において、所定数の画素を含むブロックにブロック化され、ブロック化された画像データ毎に特徴量が抽出され格納手段およびマッチング手段に供給される。
格納手段においては、特徴量をアドレスとして、ブロック化された画像データの位置情報が格納される。
マッチング手段において、注目位置のブロックの上記特徴量に基づいて、格納手段に格納された上記画像データの位置情報と特徴量とのマッチング処理が行われる。
そして、マッチングした結果、規定範囲外にある場合には、近傍の特徴量が選択されて再度マッチング処理が行われる。
【００３６】
また、本発明によれば、格納手段において、参照フレームの情報に基づいて注目画素を中心とした所定のブロック範囲の画素値を含む特徴量がアドレスとして変換され、変換後の位置座標を含む情報が格納される。
マッチング手段においては、カレントフレームの情報に含まれる注目画素の特徴量を、特徴量アドレスとして格納手段の格納情報が読み取られる。
マッチング手段においては、カレントフレーム内の注目画素と格納手段から読み込んだ特徴量アドレスに含まれる位置座標との距離がそれぞれ演算される。
マッチング手段においては、演算結果の複数の候補の中から距離が最小である位置情報に基づいた差分座標が注目画素の動き情報として検出される。
そして、検出した動き情報の値が想定される動き情報の規定値より小さいときは、検出した動き情報を正しい情報として判断される。
一方、検出した動き情報の値が想定される動き情報の規定値以上の想定外の動き情報として判断されて、カレントフレーム内の画素と格納手段で近い特徴量アドレスが生成され、近傍画素の位置情報に基づいて再度マッチング処理が行われる。
【００３７】
また、本発明によれば、特徴量生成手段において、カレントフレームの情報を受けて適応的量子化（ＡＤＲＣ）に基づく量子化コードがカレントフレームの特徴量として生成され、マッチング手段に出力される。
また、特徴量生成手段において、参照フレームの情報を受けて適応的量子化（ＡＤＲＣ）に基づく量子化コードが参照フレームの特徴量として生成されて、格納手段に出力される。
格納手段においては、特徴量生成手段による参照フレームの特徴量がアドレスとして変換されて、変換後の情報が格納される。
マッチング手段においては、特徴量生成手段によるカレントフレームの特徴量を、特徴量アドレスとして格納手段の格納情報が読み取られる。
マッチング手段においては、カレントフレーム内の注目画素と格納手段から読み込んだ特徴量アドレスに含まれるＡＤＲＣの量子化コードのマッチング処理が行われる。
そして、マッチング手段において、ＡＤＲＣの量子化コードのマッチング処理の結果、一致した場合に注目画素の動き情報として検出される。
一方、特徴量が一致しない場合には、近傍の特徴量が選択されて再度マッチング処理が行われる。
【００３８】
【発明の実施の形態】
以下、本発明の実施の形態について、図面を参照しながら詳細に説明する。
【００３９】
第１実施形態
図４は、本発明に係る画像処理装置の要部である動き検出装置の第１の実施形態を示すブロック図である。
【００４０】
本動き検出装置は、特徴量をアドレスとして位置情報を格納する動き検出メモリ（以下、ＭＥメモリという）を設け、周辺画素値を特徴量としてマッチング処理を行うことにより、僅かな演算で動きベクトルを精度良く推定することを可能とするものである。
以下、本動き検出装置の具体的な構成および機能について、図面を参照しながら詳細に説明する。
【００４１】
本動き検出装置１０は、第１のフレームメモリ１１、第２のフレームメモリ１２、ＭＥメモリ１３、およびアドレス制御部１４を有している。
なお、第１のフレームメモリ１１により本発明に係る特徴量抽出手段が構成され、ＭＥメモリ１３により本発明に係る格納手段が構成され、アドレス制御部１４により本発明に係るマッチング手段が構成される。
【００４２】
第１のフレームメモリ１１は、入力端子ＴINから入力された画像信号の１画面の情報を格納する。
第１のフレームメモリ１１は、次の画面情報が入力されると先に格納した画面情報、すなわちカレントフレームＦｃの情報を格納し、カレントフレームＦｃの情報を第２のフレームメモリ１２、およびアドレス制御部１４に出力する。
また、第１のフレームメモリ１１は、カレントフレームＦｃの情報とともに、注目画素の特徴量、つまりアドレス情報をアドレス制御部１４に供給する。
【００４３】
第２のフレームメモリ１２は、第１のフレームメモリ１１に格納されていた以前（たとえば１回前）の画面情報を参照フレームＦｒの情報として格納する。
【００４４】
ＭＥメモリ１３は、第２のフレームメモリ１２に格納されている参照フレームＦｒの情報に基づいて、注目画素を中心としたあるブロック範囲の画素値である特徴量をアドレスとして変換し、変換後の位置座標を含む情報を格納する。
【００４５】
アドレス制御部１４は、第１のフレームメモリ１１から供給されたカレントフレームＦｃの情報に含まれる注目画素の特徴量を、特徴量アドレスとしてＭＥメモリ１３の格納情報を読み取り、マッチング処理として、カレントフレーム内の注目画素とＭＥメモリ１３から読み込んだ特徴量アドレス（位置座標）との距離を演算し、複数の候補の中から距離が最小である位置情報に基づいた差分座標を注目画素の動きベクトル（Ｖｘ，Ｖｙ）として検出する。
アドレス制御部１４は、検出した動きベクトルＭの値が想定される動きベクトルの最大値Ｍmax より小さいときは（規定範囲内にあるとき）、正しい動きベクトルとして判断し、端子ＴOUT から出力する。
アドレス制御部１４は、検出した動きベクトルＭの値が想定される動きベクトルの最大値Ｍmax より大きいときは（規定範囲外にあるとき（等しい場合も含む））、対応していない位置の動きベクトルと判断し、カレントフレーム内の画素とＭＥメモリ１３で近い特徴量のアドレスを生成して、近傍画素の位置情報に基づいて（近傍の特徴量を選択して）再度マッチング処理を行う。
なお、アドレス制御部１４は、たとえば近傍の特徴量の選択においてビット反転を行うが、このとき注目画素のパターンに応じてビット反転の形態を変更する。
このアドレス制御部１４における処理については後でさらに詳述する。
【００４６】
以下、本実施形態の特徴である特徴量アドレス方式を採用したＭＥメモリ１３の構成および機能について、図５および図６に関連付けて、さらに詳細に説明する。
図５は、特徴量アドレス方式を採用したＭＥメモリの構成例を示す図である。また、図６は、参照フレームの情報をＭＥメモリに格納する手順を説明するためのフローチャートである。
【００４７】
従来のメモリの場合、画素の位置情報をアドレスとして画素値を格納するものであるが、本ＭＥメモリ１３の場合、特徴量をアドレスとして、特徴量毎にその特徴量を持つ画素の位置情報を順次フラグアドレスＦＲＧＡ１，２．．．、つまり図５のＢ、Ｃに格納していく。
本実施形態においては、１つのセルＭＥ−Ｂ１は、位置情報分の記憶容量を備えているものとする。同時に、フラグアドレスＦＲＧＡ０には、その特徴量に格納した数をインクリメントして格納しておくものとする。
特徴量としては、注目画素を中心としたあるブロック内の画素値とする。たとえば、ブロック範囲を３×３、垂直方向をｉ、水平方向をｊ、位置（ｉ，ｊ）の画素値をＬ（ｉ，ｊ）とすると、この場合の特徴量は、次の数２のようになる。
【００４８】
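【数２】
$$\{\, L(i-1, j-1),\ L(i-1, j),\ L(i-1, j+1),\ L(i, j-1),\ L(i, j),\ L(i, j+1),\ L(i+1, j-1),\ L(i+1, j),\ L(i+1, j+1) \,\}$$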
【００４９】
次に、参照フレームの情報をＭＥメモリに格納する手順を、図６のフローチャートに関連付けて説明する。
参照フレームＦｒの情報がフレームメモリ１２に格納されると処理を開始する。
【００５０】
ステップＳＴ１０１
ステップＳＴ１０１においては、ＭＥメモリ内の全データを０に初期化する。０を書き込むか、リセット信号をＯｎする。
【００５１】
ステップＳＴ１０２
ステップＳＴ１０２においては、１フレームメモリ内の画素をカウントするカウンタ変数ｎを０に初期化する。
【００５２】
ステップＳＴ１０３
ステップＳＴ１０３においては、図４のフレームメモリ１１から注目画素Ｌｎを中心としたあるブロック範囲の画素値を特徴量（特徴量アドレス）とする。
【００５３】
ステップＳＴ１０４
ステップＳＴ１０４においては、特徴量アドレスをステップＳＴ１０３での特徴量、フラグアドレスを０とした場合のＭＥメモリ１３の内容ＭＥメモリ（特徴量、０）を読み込みフラグアドレスに設定する。
【００５４】
ステップＳＴ１０５
ステップＳＴ１０５においては、特徴量アドレスをステップＳＴ１０３での特徴量、フラグアドレスを０とした場合のＭＥメモリ１３の内容ＭＥメモリ（特徴量、０）をインクリメントする。
【００５５】
ステップＳＴ１０６
ステップＳＴ１０６においては、ＭＥメモリ１３の内容ＭＥメモリ（特徴量、フラグアドレス＋１）の内容に、注目画素Ｌｎの位置情報を書き込む。
【００５６】
ステップＳＴ１０７
ステップＳＴ１０７においては、カウント変数ｎをインクリメントする。
【００５７】
ステップＳＴ１０８
ステップＳＴ１０８においては、注目画素Ｌｎがフレーム内の最後の画素かの判別を行う。最後の画素ではなければ、ステップＳＴ１０３へ進んで次の画素に関して同処理を繰り返す。
また、最後の画素ならば、処理を終了するために、ステップＳＴ１０９へ進む。
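The storage procedure of steps ST101 to ST108 can be sketched in Python as follows. This is only an illustration under stated assumptions: the ME memory is modeled as a dictionary keyed by the 3×3 block of pixel values, element 0 of each entry plays the role of flag address 0 (the count), border pixels are skipped, and the names `feature_at` and `build_me_memory` are hypothetical and do not appear in the patent.

```python
from collections import defaultdict


def feature_at(frame, x, y):
    """3x3 block of pixel values centered on (x, y), used as the feature address."""
    return tuple(frame[y + j][x + i] for j in (-1, 0, 1) for i in (-1, 0, 1))


def build_me_memory(ref):
    """Store, for every feature found in the reference frame, the positions having it."""
    # ST101: every entry starts with a count of 0 at flag address 0.
    me = defaultdict(lambda: [0])
    height, width = len(ref), len(ref[0])
    # ST102, ST107, ST108: visit the pixels one by one (borders skipped by assumption).
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            cell = me[feature_at(ref, x, y)]  # ST103, ST104: address by feature
            cell[0] += 1                      # ST105: increment the stored count
            cell.append((x, y))               # ST106: write the position information
    return me
```

With this layout, looking up a feature costs one dictionary access, and `entry[0]` gives the number of candidate positions stored in `entry[1:]`.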
【００５８】
次に、本実施形態に係る動きベクトル検出の処理手順を、図４および図７に関連付けて説明する。
なお、図７は、本実施形態に係る動きベクトル検出の処理手順を説明するためのフローチャートである。
【００５９】
ステップＳＴ２０１
ステップＳＴ２０１においては、フレームメモリ１１，１２にそれぞれカレントフレームＦｃ、参照フレームＦｒの情報が格納された後に、参照フレームの情報を特徴量アドレスに変換しながらＭＥメモリ１３に格納する。詳細は、上述（ステップＳＴ１００〜ＳＴ１０９）している。
【００６０】
ステップＳＴ２０２
ステップＳＴ２０２においては、１フレームの画素をカウントするカウント変数ｎを０に初期化する。
【００６１】
ステップＳＴ２０３
ステップＳＴ２０３においては、第１のフレームメモリ１１内の注目画素Ｌｎの特徴量は、その画素を中心としたあるブロック範囲の画素値なので、それらを特徴量として、アドレス制御部１４に送る。
【００６２】
ステップＳＴ２０４
ステップＳＴ２０４においては、アドレス制御部１４は、受け取った特徴量を特徴量アドレスとして、ＭＥメモリ１３から内容ＭＥメモリ（特徴量アドレス、０）の値を読み込み、候補数を意味する変数ｋｎに代入する。
また、候補数カウンタを意味する変数ｋを１に、距離の最小値を意味する変数ｍｉｎを∞に、距離を意味する変数Ｌを０に初期化する。
【００６３】
ステップＳＴ２０５
ステップＳＴ２０５においては、カレントフレーム内の注目画素ＬｎとＭＥメモリ１３から読み込んだＭＥメモリ内容（特徴量アドレス、ｋ）＝位置座標との距離を演算して、変数Ｌに代入する。
【００６４】
ステップＳＴ２０６
ステップＳＴ２０６においては、ステップＳＴ２０５で求まった距離Ｌと距離の最小値ｍｉｎとの大小判別を行う。
その結果、ｍｉｎ＞Ｌならば、距離の最小値Ｌを更新するためにステップＳＴ２０７へ、ｍｉｎ≦Ｌならば、更新ステップをスキップして、ステップＳＴ２０８へ進む。
【００６５】
ステップＳＴ２０７
ステップＳＴ２０７においては、距離の最小値ｍｉｎをＬに更新する。その際のフラグアドレス値、つまりｋを変数ａｎｓに格納しておく。
【００６６】
ステップＳＴ２０８
ステップＳＴ２０８においては、候補カウンタが候補数であるかの判別を行い、候補数である場合はステップＳＴ２１０へ、まだ候補がある場合は、ステップＳＴ２０９へ進む。
【００６７】
ステップＳＴ２０９
ステップＳＴ２０９においては、候補カウンタｋをインクリメント後、ステップＳＴ２０５へ進む。
【００６８】
ステップＳＴ２１０
ステップＳＴ２１０においては、カレントフレーム内の画素Ｌｎと距離が最少である位置情報、つまりＭＥメモリ（特徴量アドレス番号０，ａｎｓ）の値を読み込み、差分座標を動きベクトルＭとする。
【００６９】
ステップＳＴ２１５
ステップＳＴ２１５においては、該当する特徴量アドレスで求まる動きベクトルＭの値が、想定される動きベクトルの最大値Ｍｍａｘより小さいとき（Ｍ＜Ｍmax ）、正しい動きベクトルと判断され、ＳＴ２１１に進む。
該当する特徴量アドレスで求まる動きベクトルＭの値が、想定される動きベクトルの最大値Ｍmax 以上のとき（Ｍ≧Ｍmax ）、対応しない場所の動きベクトルと判断され、ＳＴ２１６に進む。
【００７０】
ステップＳＴ２１６
ステップＳＴ２１６においては、カレントフレーム内の画素ＬｎとＭＥメモリ１３で近い特徴量のアドレスを生成して、その特徴量のアドレス（特徴量アドレス番号ｐ，ｋ）＝位置座標との距離（近傍特徴量アドレスの候補数ｐｎ，候補数ｋｎ）を演算して、変数Ｌに代入する。
【００７１】
ステップＳＴ２１７
ステップＳＴ２１７においては、ステップＳＴ２０５で求まった距離Ｌと距離の最小値ｍｉｎとの大小判別を行い、ｍｉｎ＞Ｌならば、距離の最小値Ｌを更新するためにステップＳＴ２０７へ、ｍｉｎ≦Ｌならば、更新ステップをスキップして、ステップＳＴ２０８へ進む。
【００７２】
ステップＳＴ２１８
ステップＳＴ２１８においては、距離の最小値ｍｉｎをＬに更新する。その際のフラグアドレス値、つまりｋを変数ａｎｓに格納しておく。
【００７３】
ステップＳＴ２１９
ステップＳＴ２１９においては、候補カウンタが候補数であるかの判別を行い、候補数である場合はステップＳＴ２１０へ、まだ候補がある場合は、ステップＳＴ２２０へ進む。
【００７４】
ステップＳＴ２２０
ステップＳＴ２２０においては、候補カウンタｋをインクリメント後、ステップＳＴ２０５へ進む。
【００７５】
ステップＳＴ２２１
ステップＳＴ２２１においては、カレントフレーム内の画素Ｌｎと距離が最少である位置情報、つまりＭＥメモリ（特徴量アドレス番号０，ａｎｓ）の値を読み込み、差分座標を動きベクトルＭとする。
【００７６】
ステップＳＴ２２２，ＳＴ２２３
ステップＳＴ２２２およびＳＴ２２３においては、該当する特徴量アドレスで求まる動きベクトルＭの値が、想定される動きベクトルの最大値Ｍmax より小さいとき（Ｍ＜Ｍmax ）、正しい動きベクトルと判断され、ＳＴ２１１に進む。
該当する特徴量アドレスで求まる動きベクトルＭの値が、想定される動きベクトルの最大値Ｍmax 以上のとき（Ｍ≧Ｍmax ）、対応しない場所の動きベクトルと判断され、別の近傍特徴量アドレスの生成のためＳＴ２１６に進む。
【００７７】
ステップＳＴ２１１
ステップＳＴ２１１においては、注目画素の動きベクトルを出力する。
【００７８】
ステップＳＴ２１２
ステップＳＴ２１２においては、画素のカウンタ変数ｎをインクリメントする。
【００７９】
ステップＳＴ２１３
ステップＳＴ２１３においては、注目画素がカレントフレーム内の最後の画素であるかの判別を行い、最後の画素であれば終了のためステップＳＴ２１４へ、違う場合は、次の画素の動きベクトルを求めるためにステップＳＴ２０３へ進む。
【００８０】
次に、近傍特徴量の算出について説明する。
動き物体のある部分の特徴量が隣接フレームで少量変化することがありうる。本実施形態においては、その対応として、特徴量が少々変化しても、近傍の特徴量から似た特徴量を当てはめて移動距離を求める手法を採用する。
特徴量空間の各要素間は直交の関係にあるため、ある特徴量は図８のｄ−ａ１のように要素の個数を軸とした特徴量空間で定義される。すなわち、ｄ−ａ１で示す座標に特徴量を抽出した画素の空間座標をリンクさせる。
ある特徴量に対して近傍の特徴量は、図８中ｄ−ａ２で示すように、各特徴量軸ｘ，ｙ，ｚの値に対してある範囲の振れ幅を許容する領域として定義される。本実施形態に係る特徴量は、上述したように、注目画素を中心としたあるブロック内の画素値である。たとえば、ブロック範囲を３×３、垂直方向をｉ、水平方向をｊ位置（１，ｊ）の画素値をＬ（１，ｊ）とすると、この場合の特徴量は、式（２）で与えられる。
【００８１】
近傍特徴量の例としては、図９のＳＴ３０１の特徴量の１ビット反転を挙げることができる。特徴量が注目画素を中心としたあるブロック内の画素値であるので、以下の数３のように起こりうる変化は画素値に差分ｑｎを足した値とする。
【００８２】
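【数３】
$$\{\, L(i-1, j-1) + q_1,\ L(i-1, j) + q_2,\ L(i-1, j+1) + q_3,\ L(i, j-1) + q_4,\ L(i, j) + q_5,\ L(i, j+1) + q_6,\ L(i+1, j-1) + q_7,\ L(i+1, j) + q_8,\ L(i+1, j+1) + q_9 \,\}$$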
【００８３】
ｑ１からｑ９までのどれか１つの値を変えることで、図７のステップＳＴ２１６の近傍特徴量アドレスの候補ができる。
式（３）のＬどれか１つの値のビット反転の場合、近傍特徴量アドレス数Ｐｎ＝９となる。
近傍特徴量の定義の仕方として、近い値から求めて、それでも解が求まらない場合に差ｑｎを大きくして行く。このループは図７のステップＳＴ２１６からステップＳＴ２２３間のループに相当する。
この場合、特徴量の２ビット反転（図９のＳＴ３０２）のように、式（３）のｑｎ値を変え個数を増やす方向で変化を与える。以上のループを回しても、演算内容はアドレス参照と差分演算と分岐のみなので、演算量が大幅に増大することはない。
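The generation of neighboring feature addresses described above can be illustrated with a short Python sketch. It assumes the feature is a tuple of pixel values and that "bit inversion" means flipping one chosen bit in each selected element; which bit is flipped (`bit`) and the function name `neighbour_features` are assumptions made for illustration, not details given in the text.

```python
from itertools import combinations


def neighbour_features(feature, flips=1, bit=0):
    """Yield copies of `feature` with `bit` inverted in exactly `flips` elements."""
    for indices in combinations(range(len(feature)), flips):
        candidate = list(feature)
        for k in indices:
            candidate[k] ^= 1 << bit  # invert the chosen bit of this element
        yield tuple(candidate)


# One changed element out of nine gives 9 neighboring addresses, as in the text.
one_flip = list(neighbour_features((0,) * 9, flips=1))
# Widening the search to two changed elements gives C(9, 2) = 36 addresses.
two_flips = list(neighbour_features((0,) * 9, flips=2))
```

Each candidate address costs only a memory lookup and a distance comparison, which is why widening the neighborhood step by step keeps the added work small.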
【００８４】
画素値としては、たとえば１画素＝８ビットとした場合、コンピュータグラフィックス（ＣＧ）のような画像はフルビット（８ビット）情報でマッチング処理を行えるが、自然画像の場合は、フレーム毎にバラツキを含むので、複数ビットのうち所定ビットを除いて、マッチング処理を行うことが望ましい。具体的には、下位数ビットをマスクして使用してもよいし、ビット数を少なくして再量子化しても良い。
つまり、非線形／線形な量子化におけるビット数を削減する（量子化ビット数を少なくする）ことが望ましい。
【００８５】
以上説明したように、本第１の実施形態によれば、カレントフレームＦｃの情報を格納し、カレントフレームＦｃの情報とともに、注目画素の特徴量であるアドレス情報を出力する第１のフレームメモリ１１と、第１のフレームメモリ１１に格納されていた以前の画面情報を参照フレームＦｒの情報として格納する第２のフレームメモリと、第２のフレームメモリ１２に格納されている参照フレームＦｒの情報に基づいて、注目画素を中心としたあるブロック範囲の画素値を含む特徴量をアドレスとして変換し、変換後の位置情報を含む情報を格納するＭＥメモリ１３と、第１のフレームメモリ１１から供給されたカレントフレームＦｃの情報に含まれる注目画素の特徴量を、特徴量アドレスとしてＭＥメモリ１３の格納情報を読み取り、カレントフレーム内の注目画素とＭＥメモリ１３から読み込んだ特徴量アドレス（位置座標）との距離を演算し、複数の候補の中から距離が最小である位置情報に基づいた差分座標を注目画素の動きベクトル（Ｖｘ，Ｖｙ）として検出し、検出した動きベクトルＭの値が想定される動きベクトルの最大値Ｍmax より小さいときは（規定範囲内にあるとき）、正しい動きベクトルとして判断し、端子ＴOUT から出力し、検出した動きベクトルＭの値が想定される動きベクトルの最大値Ｍmax より大きいときは（規定範囲外にあるとき（等しい場合も含む））、対応していない位置の動きベクトルと判断し、カレントフレーム内の画素とＭＥメモリ１３で近い特徴量のアドレスを生成して、近傍画素の位置情報に基づいて再度マッチング処理を行うアドレス制御部１４とを設けたので、以下の効果を得ることができる。
すなわち、本第１の実施形態においては、ブロックエリア内の空間パターン情報を特徴量とし、候補数だけの距離演算比較をするだけなので、従来の手法よりも僅かな演算量で、かつ、精度の高い動きベクトル検出が可能となる利点がある。
【００８６】
なお、候補数が多くなる場合は、ＭＥメモリ１３に格納する情報を１フレームの全情報ではなく、ある程度のエリアに区分してもよい。
【００８７】
第２実施形態
図１０は、本発明に係る画像処理装置としての動き検出装置の第２の実施形態を示すブロック図である。
【００８８】
本第２の実施形態が上述した第１の実施形態と異なる点は、特徴量を求める特徴量生成手段としてのクラス生成部１５を設けることによって、好ましい特徴量でマッチングが可能となる点である。
【００８９】
クラス生成部１５は、第１のフレームメモリ１１のカレントフレームＦｃの情報を受けてＡＤＲＣに基づく量子化コードをカレントフレームの特徴量として生成してアドレス制御部１４Ａに出力する。
また、クラス生成部１５は、第２のフレームメモリ１２の参照フレームＦｒの情報を受けてＡＤＲＣに基づく量子化コードを参照フレームの特徴量として生成してＭＥメモリ１３に出力する。
【００９０】
マッチング手段としてのアドレス制御部１４Ａは、カレントフレームの特徴量を、特徴量アドレスとしてＭＥメモリ１３の格納情報を読み取り、カレントフレーム内の注目画素とＭＥメモリ１３から読み込んだ特徴量アドレスに含まれるＡＤＲＣの量子化コードのマッチングを行うことにより注目画素の動きベクトルを検出する。
【００９１】
このように、本第２の実施形態に係るクラス生成部１５での特徴量の生成としてＡＤＲＣを用いる。ＡＤＲＣ（ＡｄａｐｔｉｖｅＤｙｎａｍｉｃＲａｎｇｅＣｏｄｉｎｇ）は、ＶＴＲ（ＶｉｄｅｏＴａｐｅＲｅｃｏｒｄｅｒ）向け高性能符号化用に開発された適応的量子化法であるが、信号レベルの局所的なパターンを短い語長で効率的に表現できるので、この第２の実施形態では、ＡＤＲＣを空間クラス分類のコード発生に使用している。
【００９２】
ＡＤＲＣは、空間クラスタップのダイナミックレンジをＤＲ、ビット割り当てをｎ、空間クラスタップの画素のデータレベルをＬ、再量子化コードをＱとして、以下の数４により、最大値ＭＡＸと最小値ＭＩＮとの間を指定されたビット長で均等に分割して再量子化を行うアルゴリズムである。
【００９３】
【数４】
$$DR = MAX - MIN + 1, \qquad Q = \left[ \frac{(L - MIN + 0.5) \times 2^{n}}{DR} \right]$$
【００９４】
ただし、〔〕は切り捨て処理を意味する。マッチング処理フローは、上述した図７の説明の特徴量をＡＤＲＣの量子化コードとしたものと等価なので省略する。
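As a rough illustration of ADRC requantization, the Python sketch below assumes the commonly published form of the method, DR = MAX − MIN + 1 and Q = ⌊(L − MIN + 0.5) × 2ⁿ / DR⌋. That exact form, the sample pixel values, and the function name `adrc_code` are assumptions for illustration and are not taken from the figures of this document.

```python
def adrc_code(taps, n=1):
    """Requantize each tap value to n bits relative to the local dynamic range."""
    lo, hi = min(taps), max(taps)
    dr = hi - lo + 1                      # dynamic range of the class taps
    levels = 1 << n                       # number of quantization levels, 2**n
    # int() truncates toward zero, which equals floor for these non-negative values.
    return [int((v - lo + 0.5) * levels / dr) for v in taps]


# A line of pixels and the same pattern with small level changes (hypothetical values).
reference_line = [100, 200, 190, 110, 195]
current_line = [101, 199, 191, 109, 196]
code_ref = adrc_code(reference_line)
code_cur = adrc_code(current_line)
```

Because the code depends only on each value's position inside the local range, small level shifts between frames tend to leave it unchanged, whereas raw pixel values with masked low bits can still differ.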
【００９５】
空間クラスタップの取り方の一例として、ブロックサイズが３×３の場合は、図１１（Ａ）に示すように全画素を使用してもよいし、図１１（Ｂ）に示すように十文字で構成してもよく、クラスコードに与えられる情報量の制限の中で決定すればよい。
同様にして、ブロックサイズが５×５の場合の一例としては、図１１（Ｃ），（Ｄ）に示すような形態が採用可能である。
図１１（Ｃ）の例は十文字で構成する場合であり、図１１（Ｄ）の例は十文字で構成し、さらに端部の画素を使用する場合である。
【００９６】
例として、ブロックサイズが図１１（Ａ）に示すように３×３の場合のクラスコードは以下のように表すことができる。
【００９７】
【数５】

【００９８】
次に、周辺画素値よりもＡＤＲＣ量子化コードを用いた方が優れていることを、図１２（Ａ），（Ｂ）に関連付けて説明する。
図１２（Ａ），（Ｂ）は、分かりやすいように画像のある１ラインが参照フレームからカレントフレームに移動した際の画素値を表示している。また、図１３は輝度値の１０進数表記と１６進数表記との対応関係を示している。
【００９９】
通常、自然画像の場合は、同じ絵柄（パターン）が移動しても、同じ画素値になる可能性は低く、図１２（Ａ），（Ｂ）に示すように、画素レベルがずれてしまう。
この場合、同じパターンとして正しく検出できるかがポイントとなる。周辺画素値を特徴量として用いた場合で、ノイズ成分の影響を抑えるために、下位ビットをマスクした場合のコード結果を載せている。
記載しているように、同じパターンであるにもかかわらず誤検出することがある。
これに対して、ＡＤＲＣでの量子化コードは、信号レベルの局所的なパターンを短い語長で効率的に表現できることから微小なレベル変動に強く、同じコード結果が得られることが分かる。
具体的には、参照フレームのある１ラインのＡＤＲＣコードは「０１１０１」であり、カレントフレームのある１ラインのＡＤＲＣコードも「０１１０１」であり、両者が一致する。
【０１００】
本第２の実施形態における近傍特徴量の算出について説明する。
動き物体のある部分の特徴量が隣接フレームで少量変化することがありうる。本実施形態においては、その対応として、特徴量が少々変化しても、近傍の特徴量から似た特徴量を当てはめて移動距離を求める手法を採用する。
特徴量空間の各要素間は直交の関係にあるため、ある特徴量は図８のｄ−ａ１のように要素の個数を軸とした特徴量空間で定義される。すなわち、ｄ−ａ１で示す座標に特徴量を抽出した画素の空間座標をリンクさせる。
ある特徴量に対して近傍の特徴量は、図８中ｄ−ａ２で示すように、各特徴量軸ｘ，ｙ，ｚの値に対してある範囲の振れ幅を許容する領域として定義される。本実施形態に係る特徴量は、上述したように、注目画素を中心としたあるブロック内の画素値である。たとえば、ブロック範囲を３×３、垂直方向をｉ、水平方向をｊ位置（１，ｊ）の画素値をＬ（１，ｊ）とすると、この場合の特徴量は、式（５）で与えられる。
【０１０１】
近傍特徴量の例としては、図９のＳＴ３０１の特徴量の１ビット反転を挙げることができる。特徴量が注目画素を中心としたあるブロック内の画素値であるので、以下の数６のように起こりうる変化は画素値に差分ｑｎを足した値とする。
【０１０２】
【数６】

【０１０３】
ｑ１からｑ９までのどれか１つの値を変えることで、図７のステップＳＴ２１６の近傍特徴量アドレスの候補ができる。
式（６）のＬどれか１つの値のビット反転の場合、近傍特徴量アドレス数Ｐｎ＝９となる。
近傍特徴量の定義の仕方として、近い値から求めて、それでも解が求まらない場合に差ｑｎを大きくして行く。このループは図７のステップＳＴ２１６からステップＳＴ２２３間のループに相当する。
この場合、特徴量の２ビット反転（図９のＳＴ３０２）のように、式（３）のｑｎ値を変え個数を増やす方向で変化を与える。以上のループを回しても、演算内容はアドレス参照と差分演算と分岐のみなので、演算量が大幅に増大することはない。
【０１０４】
以上説明したように、本第２の実施形態によれば、ＡＤＲＣの量子化コードを特徴量とすることによって従来よりも精度の高い動きベクトル検出が可能となる利点がある。
【０１０５】
【発明の効果】
本発明によれば、ブロックエリア内の空間パターン情報を特徴量とし、候補数だけの距離演算比較をするだけなので、従来の手法よりも僅かな演算量で、かつ、マッチング演算がブロック内の差分絶対値和ではなく、ブロック内の画素値をそのまま使用しており、すなわち空間特徴量を現しているために、精度の高い動きベクトル検出が可能となる利点がある。
【０１０６】
また、本発明によれば、ブロックエリア内の空間パターン情報としてＡＤＲＣの量子化コードを特徴量とすることによって精度の高い動きベクトル検出が可能となる。
【図面の簡単な説明】
【図１】ブロックマッチングアルゴリズムを採用した従来の動き検出装置の構成例を示すブロック図である。
【図２】ブロックマッチングアルゴリズムの概要を説明するための図である。
【図３】カレントフレームＦｃ内にある画素Ｆｃ（ｘ，ｙ）の動きベクトルを検出する処理手順を説明するためのフローチャートである。
【図４】本発明に係る動き検出装置の第１の実施形態を示すブロック図である。
【図５】本実施形態に係る特徴量アドレス方式における動きメモリの構造を説明するための図である。
【図６】本実施形態に係る特徴量アドレス方式における動きメモリへの格納手順を説明するための図である。
【図７】本実施形態に係る特徴量アドレス方式における動き検出の動作を説明するためのフローチャートである。
【図８】近傍特徴量の生成規範例を説明するための図である。
【図９】近傍特徴量の生成規範例を説明するための図である。
【図１０】本発明に係る動き検出装置の第２の実施形態を示すブロック図である。
【図１１】クラスタップのとり方の一例を示す図である。
【図１２】周辺画素値よりもＡＤＲＣ量子化コードを用いた方が優れていることを説明するための図である。
【図１３】輝度値の１０進数と１６進数との対応関係を示す図である。
【符号の説明】
１０，１０Ａ…動き検出装置、１１…第１のフレームメモリ、１２…第２のフレームメモリ、１３…動きベクトルメモリ（ＭＥメモリ）、１４，１４Ａ…アドレス制御部、１５…クラス生成部。

[0001]
[Technical Field of the Invention]
The present invention relates to an image processing apparatus, such as a motion detection apparatus used in a moving image compression apparatus, and to an image processing method.
[0002]
[Prior art]
In image processing apparatuses, one of the main techniques for compressing moving images efficiently is motion detection, which obtains a motion vector indicating the motion of an image. Several methods for obtaining the motion vector have been proposed; one of the main ones is the block matching algorithm.
[0003]
FIG. 1 is a block diagram illustrating a configuration example of a motion detection apparatus in a conventional image processing apparatus that employs a block matching algorithm.
[0004]
The motion detection device 1 includes frame memories 2 and 3 and a motion vector detection unit 4.
In the motion detection device 1, information of one screen is stored in the frame memory 2 when an image signal is input from the input terminal TIN.
When the next screen information is input, the information previously held in the frame memory 2 (the previous input) is stored in the frame memory 3, and the newly input (current) information is stored in the frame memory 2.
That is, information on the current frame Fc is stored in the frame memory 2, and information on the reference frame Fr is stored in the frame memory 3.
Next, the information on the current frame Fc and the reference frame Fr is sent to the motion vector detection unit 4. The motion vector detection unit 4 divides the frames into blocks, detects the motion vector (Vx, Vy), and outputs it from the terminal TOUT.
[0005]
FIG. 2 is a diagram for explaining the outline of the block matching algorithm. The outline of the algorithm is described below with reference to FIG. 2.
[0006]
In this algorithm, the motion vector of the target pixel Fc(x, y) in the current frame Fc is obtained as follows. The sum of absolute differences is calculated between the pixels of a reference block range (L × L) centered on the target pixel Fc(x, y) and the corresponding pixels of a block of the same size (L × L) inside the search area SR of the reference frame Fr.
This calculation is repeated while the block extracted from the search area SR is shifted one pixel at a time. Among all the blocks, the one with the smallest sum of absolute differences is selected, and the difference vector between its center position and the target pixel position is taken as the solution (the motion vector).
[0007]
Next, the processing procedure for detecting the motion vector of the pixel Fc(x, y) in the current frame Fc will be described in detail with reference to FIG. 3.
[0008]
Step ST1
In step ST1, after the process starts at ST0, the search area SR is determined from the position (x, y) of the target pixel, using the same position in the reference frame as its base.
[0009]
Step ST2
In step ST2, the variable min, which holds the minimum calculation result, is initialized with the maximum value the expression can take. If one pixel is 8 bits and a block contains 16 pixels, 2⁸ × 16 = 4096 is assigned to min.
[0010]
Step ST3
In step ST3, a counter variable n for counting blocks in the search area SR is initialized to 1.
[0011]
Step ST4
In step ST4, a variable sum for substituting the calculation result is initialized to zero.
[0012]
Step ST5
In step ST5, let the reference block range be L × L, let Fc(i, j) be a pixel in the block of the current frame Fc, and let Frk(i, j) be a pixel in the k-th block within the search area SR of the reference frame Fr. The sum of absolute differences between corresponding pixels, that is, the calculation shown in Expression 1 below, is performed and the result is assigned to sum.
[0013]
[Expression 1]
$$\mathrm{sum} = \sum_{i=1}^{L} \sum_{j=1}^{L} \left| F_c(i, j) - F_{rk}(i, j) \right|$$
[0014]
Step ST6
In step ST6, the calculated sum of absolute differences, sum, is compared with the current minimum, min. If sum is smaller, the process proceeds to step ST7. If sum is larger than or equal to min, the result is not a new minimum, so the update in step ST7 is skipped and the process proceeds to step ST8.
[0015]
Step ST7
In step ST7, the minimum value min is updated to the calculation result sum, and the block count value n is recorded as the motion vector number.
[0016]
Step ST8
In step ST8, if the block count value n equals the total number of blocks in the search area SR, that is, if this is the last block, the search is finished and the process proceeds to step ST10. Otherwise, the process proceeds to step ST9.
[0017]
Step ST9
In step ST9, the count value n of the block is incremented to n + 1, and the process proceeds to step ST4 to repeat the calculation.
[0018]
Step ST10
In step ST10, the motion vector is obtained from (x, y) and the center pixel of the block whose number is stored as the motion vector number, and is output.
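Steps ST1 to ST10 can be summarized in a short Python sketch. It is an illustration only: frames are assumed to be lists of rows of 8-bit values, the search area is taken as a square of ±`search` pixels, candidate blocks that would leave the frame are skipped, and the function name `block_matching` and its parameters are hypothetical.

```python
def block_matching(cur, ref, x, y, L=3, search=2):
    """Return the motion vector (vx, vy) of pixel (x, y) by exhaustive SAD search."""
    height, width = len(cur), len(cur[0])
    r = L // 2
    best = 256 * L * L                    # ST2: upper bound of the sum for 8-bit pixels
    best_vector = (0, 0)
    for dy in range(-search, search + 1):             # ST3, ST8, ST9: every block in SR
        for dx in range(-search, search + 1):
            cx, cy = x + dx, y + dy                   # center of the candidate block
            if not (r <= cx < width - r and r <= cy < height - r):
                continue                              # block would leave the frame
            total = 0                                 # ST4: reset the running sum
            for j in range(-r, r + 1):                # ST5: sum of absolute differences
                for i in range(-r, r + 1):
                    total += abs(cur[y + j][x + i] - ref[cy + j][cx + i])
            if total < best:                          # ST6: strictly smaller only
                best, best_vector = total, (dx, dy)   # ST7: remember the best block
    return best_vector                                # ST10: block center minus (x, y)
```

For every target pixel the inner loops touch L × L pixels for each of the (2 × `search` + 1)² candidate blocks, which is the cost the following section identifies as the problem.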
[0019]
[Problems to be solved by the invention]
The block matching algorithm described above has a disadvantage that the amount of calculation of Expression (1) is very large, and most of the time for image compression processing such as MPEG is spent on this.
[0020]
The present invention has been made in view of such circumstances, and its object is to provide an image processing apparatus and an image processing method that can perform matching processing with only a small amount of calculation and can detect motion vectors and the like with high accuracy.
[0021]
[Means for Solving the Problems]
To achieve the above object, an image processing apparatus according to a first aspect of the present invention includes: feature amount extraction means that divides input image data into blocks each containing a predetermined number of pixels and extracts a feature amount for each block of image data; storage means that stores the position information of the blocked image data using the feature amount as an address; and matching means that, based on the feature amount of the block at a position of interest, performs matching processing between the feature amount and the position information of the image data stored in the storage means and, when the matching result is outside a specified range, selects a nearby feature amount and performs the matching processing again.
[0022]
In the first aspect, the feature amount includes a peripheral pixel value.
[0023]
In the first aspect, the pixel value is represented by a plurality of bits, and when the image contains variation from frame to frame, the matching means performs the matching processing with predetermined bits of the plurality of bits excluded.
Preferably, the lower-order bits of the plurality of bits are masked to perform the matching process.
Preferably, requantization is performed with a reduced number of bits.
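The two options mentioned here, masking low-order bits and requantizing to fewer bits, can be sketched in Python as follows. The sketch assumes 8-bit pixel values; the number of bits dropped or kept and the function names `mask_low_bits` and `requantize` are illustrative assumptions, not values given in the text.

```python
def mask_low_bits(pixel, drop=2):
    """Clear the `drop` least significant bits of an 8-bit pixel value."""
    return pixel & ~((1 << drop) - 1) & 0xFF


def requantize(pixel, bits=4):
    """Linearly requantize an 8-bit pixel value to `bits` bits."""
    return pixel >> (8 - bits)


# Two slightly different readings of the same pixel map to the same value.
same_after_mask = mask_low_bits(129) == mask_low_bits(130)
```

Either way, small frame-to-frame fluctuations in the low-order bits no longer change the feature used as the address, at the price of a coarser feature.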
[0024]
In the first aspect, the feature amount includes a quantization code based on adaptive quantization (ADRC).
[0025]
In the first aspect, the matching means detects the motion of the block of interest based on the matching result and, when the value of the detected motion information is outside the specified range, selects a nearby feature amount and performs the matching processing again.
[0026]
In the first aspect, as the nearby feature amounts used in the matching processing, the matching means targets at least those obtained by inverting predetermined bits of the feature amount expressed in n bits.
[0027]
In the first aspect, the matching unit changes a bit inversion mode according to a pattern of a target pixel in selecting the feature quantity in the vicinity.
[0028]
An image processing apparatus according to a second aspect of the present invention performs matching processing between information on a current frame of image data and information on a reference frame, and includes: storage means that, based on the information on the reference frame, converts a feature amount including the pixel values of a predetermined block range centered on a pixel of interest into an address and stores information including the position coordinates after conversion; and matching means that reads the stored information of the storage means using the feature amount of the pixel of interest included in the information on the current frame as a feature amount address, calculates the distance between the pixel of interest in the current frame and each position coordinate included in the feature amount address read from the storage means, performs matching processing among the calculation results, and, when the feature amount is outside a specified range, selects a nearby feature amount and performs the matching processing again.
[0029]
In the second aspect, the matching means calculates the distance between the pixel of interest in the current frame and each position coordinate included in the feature amount address read from the storage means, and detects, as the motion information of the pixel of interest, the difference coordinates based on the position information with the smallest distance among the plurality of candidates. When the value of the detected motion information is smaller than the specified value assumed for the motion information, the detected motion information is judged to be correct. When the value is equal to or greater than the specified value, it is judged to be unexpected motion information; a feature amount address in the storage means that is close to that of the pixel in the current frame is generated, and the matching processing is performed again based on the position information of neighboring pixels.
[0030]
An image processing apparatus according to a third aspect of the present invention performs matching processing between information on a current frame of image data and information on a reference frame, and includes: feature amount generation means that receives the information on the current frame and generates a quantization code based on adaptive quantization (ADRC) as a feature amount of the current frame, and receives the information on the reference frame and generates a quantization code based on adaptive quantization (ADRC) as a feature amount of the reference frame; storage means that converts the feature amount of the reference frame generated by the feature amount generation means into an address and stores the converted information; and matching means that reads the stored information of the storage means using the feature amount of the current frame generated by the feature amount generation means as a feature amount address, performs matching processing between the pixel of interest in the current frame and the ADRC quantization code included in the feature amount address read from the storage means, and, when the feature amounts do not match, selects a nearby feature amount and performs the matching processing again.
[0031]
An image processing method according to a fourth aspect of the present invention includes: a first step of dividing input image data into blocks each containing a predetermined number of pixels; a second step of extracting a feature amount for each block of image data; a third step of storing the position information of the blocked image data in storage means using the extracted feature amount as an address; and a fourth step of performing, based on the feature amount of the block at a position of interest, matching processing between the feature amount and the position information of the image data stored in the storage means and, when the matching result is outside a specified range, selecting a nearby feature amount and performing the matching processing again.
[0032]
An image processing method according to a fifth aspect of the present invention performs matching processing between information on a current frame of image data and information on a reference frame, and includes: a first step of converting, based on the information on the reference frame, a feature amount including the pixel values of a predetermined block range centered on a pixel of interest into an address and storing information including the position coordinates after conversion in storage means; a second step of reading the stored information of the storage means using the feature amount of the pixel of interest included in the information on the current frame as a feature amount address, and calculating the distance between the pixel of interest in the current frame and each position coordinate included in the feature amount address read from the storage means; and a third step of performing matching processing among the calculation results and, when the feature amount is outside a specified range, selecting a nearby feature amount and performing the matching processing again.
[0033]
In the fifth aspect, in the third step, the difference coordinates based on the position information with the smallest distance among the plurality of candidates in the calculation results are detected as the motion information of the pixel of interest. When the value of the detected motion information is smaller than the specified value assumed for the motion information, the detected motion information is judged to be correct. When the value is equal to or greater than the specified value, it is judged to be unexpected motion information; a feature amount address in the storage means that is close to that of the pixel in the current frame is generated, and the matching processing is performed again based on the position information of neighboring pixels.
[0034]
An image processing method according to a sixth aspect of the present invention performs matching processing between information on a current frame of image data and information on a reference frame, and includes: a first step of receiving the information on the current frame and generating a quantization code based on adaptive quantization (ADRC) as a feature amount of the current frame; a second step of receiving the information on the reference frame and generating a quantization code based on adaptive quantization (ADRC) as a feature amount of the reference frame; a third step of converting the feature amount of the reference frame obtained in the second step into an address and storing the converted information in storage means; and a fourth step of reading the stored information of the storage means using the feature amount of the current frame obtained in the first step as a feature amount address, performing matching processing between the pixel of interest in the current frame and the ADRC quantization code included in the feature amount address read from the storage means, and, when the feature amounts do not match as a result of the matching, selecting a nearby feature amount and performing the matching processing again.
[0035]
According to the present invention, the feature extraction unit divides the input image data into blocks each including a predetermined number of pixels, extracts a feature amount for each block of image data, and supplies it to the storage unit and the matching unit.
The storage means stores the position information of the blocked image data with the feature quantity as an address.
In the matching unit, based on the feature amount of the block at the target position, the matching process between the position information of the image data stored in the storage unit and the feature amount is performed.
If the result of matching is outside the specified range, a nearby feature amount is selected and matching processing is performed again.
[0036]
Further, according to the present invention, in the storage means, the feature quantity including the pixel values in the predetermined block range centered on the target pixel is converted as an address based on the reference frame information, and the information including the converted position coordinates is stored.
In the matching means, the storage information of the storage means is read with the feature quantity of the pixel of interest included in the current frame information as the feature quantity address.
In the matching means, the distance between the pixel of interest in the current frame and the position coordinate included in the feature amount address read from the storage means is calculated.
In the matching means, the difference coordinate based on the position information with the shortest distance is detected as the motion information of the pixel of interest from among the plurality of calculation result candidates.
And when the value of the detected motion information is smaller than the assumed value of the assumed motion information, the detected motion information is determined as correct information.
On the other hand, when the value of the detected motion information is equal to or greater than the assumed value of the motion information, it is determined to be unexpected motion information, a feature amount address close to that of the pixel in the current frame is generated for the storage means, and matching processing is performed again based on the position information of neighboring pixels.
[0037]
Further, according to the present invention, the feature value generation means receives the information of the current frame, generates a quantization code based on adaptive quantization (ADRC) as the feature value of the current frame, and outputs it to the matching means.
In addition, the feature quantity generation means receives information on the reference frame, generates a quantization code based on adaptive quantization (ADRC) as a feature quantity of the reference frame, and outputs it to the storage means.
In the storage unit, the feature amount of the reference frame by the feature amount generation unit is converted as an address, and the converted information is stored.
In the matching unit, the storage information of the storage unit is read using the feature amount of the current frame by the feature amount generation unit as the feature amount address.
In the matching means, the target pixel in the current frame is matched with the ADRC quantization code included in the feature amount address read from the storage means.
Then, in the matching means, when the ADRC quantization codes match as a result of the matching processing, the result is detected as the motion information of the pixel of interest.
On the other hand, if the feature amounts do not match, a nearby feature amount is selected and matching processing is performed again.
[0038]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
[0039]
First embodiment
FIG. 4 is a block diagram showing a first embodiment of a motion detection apparatus which is a main part of the image processing apparatus according to the present invention.
[0040]
This motion detection apparatus is provided with a motion detection memory (hereinafter referred to as ME memory) that stores position information using feature amounts as addresses, and performs matching processing using peripheral pixel values as feature amounts, which makes it possible to estimate a motion vector with high accuracy using only a small amount of calculation.
Hereinafter, a specific configuration and function of the motion detection device will be described in detail with reference to the drawings.
[0041]
The motion detection apparatus 10 includes a first frame memory 11, a second frame memory 12, an ME memory 13, and an address control unit 14.
The first frame memory 11 constitutes a feature quantity extraction unit according to the present invention, the ME memory 13 constitutes a storage unit according to the present invention, and the address control unit 14 constitutes a matching unit according to the present invention.
[0042]
The first frame memory 11 stores information of one screen of the image signal input from the input terminal TIN.
When the next screen information is input, the first frame memory 11 stores the previously stored screen information, that is, the information of the current frame Fc, and supplies the information of the current frame Fc to the second frame memory 12 and the address control unit 14.
Further, the first frame memory 11 supplies the feature amount of the target pixel, that is, address information, to the address control unit 14 together with information on the current frame Fc.
[0043]
The second frame memory 12 stores the earlier (for example, immediately preceding) screen information that was held in the first frame memory 11 as information of the reference frame Fr.
[0044]
Based on the information of the reference frame Fr stored in the second frame memory 12, the ME memory 13 converts a feature amount, which is the pixel values in a block range centered on the pixel of interest, as an address, and stores information including the position coordinates.
[0045]
The address control unit 14 reads the information stored in the ME memory 13 using, as the feature amount address, the feature amount of the target pixel included in the information of the current frame Fc supplied from the first frame memory 11. As the matching process, it calculates the distance between the target pixel in the current frame and each position coordinate read from the ME memory 13 at that feature amount address, and detects the difference coordinate based on the position information with the smallest distance among the plurality of candidates as the motion vector (Vx, Vy).
When the value of the detected motion vector M is smaller than the maximum value Mmax of the assumed motion vector (that is, within the specified range), the address control unit 14 determines that it is a correct motion vector and outputs it from the terminal TOUT.
When the value of the detected motion vector M is equal to or greater than the maximum value Mmax of the assumed motion vector (that is, outside the specified range), the address control unit 14 determines that it is a motion vector at a non-corresponding location, generates for the ME memory 13 an address of a feature amount close to that of the pixel in the current frame, and performs the matching process again based on the position information of the neighboring pixels (that is, it selects a neighboring feature amount).
When selecting a neighboring feature amount, the address control unit 14 performs, for example, bit inversion. At this time, the bit inversion mode is changed according to the pattern of the target pixel.
The processing in the address control unit 14 will be described in detail later.
[0046]
Hereinafter, the configuration and function of the ME memory 13 adopting the feature address method that is a feature of the present embodiment will be described in more detail with reference to FIGS. 5 and 6.
FIG. 5 is a diagram illustrating a configuration example of an ME memory that employs a feature address method. FIG. 6 is a flowchart for explaining a procedure for storing reference frame information in the ME memory.
[0047]
In a conventional memory, pixel values are stored using the position information of each pixel as an address. In the ME memory 13, by contrast, the feature amount is used as an address, and for each feature amount the position information of the pixels having that feature amount is stored sequentially at flag addresses FRGA1, 2, ..., that is, in B, C, ... of FIG. 5.
In the present embodiment, it is assumed that one cell ME-B1 has enough storage capacity to hold position information. At the same time, the number of entries stored for that feature amount is incremented and stored at flag address FRGA0.
The feature amount is the pixel values in a certain block centered on the target pixel. For example, if the block range is 3 × 3, the vertical direction is i, the horizontal direction is j, and the pixel value at position (i, j) is L(i, j), the feature amount in this case is given by Expression 2 below.
[0048]
[Expression 2]
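Read with the definitions above, the 3 × 3 feature amount is the set of nine pixel values surrounding the target pixel (i, j). A sketch of that reading follows; the row-by-row ordering of the elements is an assumption, not something stated in the text.

```latex
\{\, L(i-1,j-1),\ L(i-1,j),\ L(i-1,j+1),\ L(i,j-1),\ L(i,j),\ L(i,j+1),\ L(i+1,j-1),\ L(i+1,j),\ L(i+1,j+1) \,\}
```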

[0049]
Next, the procedure for storing the reference frame information in the ME memory will be described with reference to the flowchart of FIG.
When the information of the reference frame Fr is stored in the frame memory 12, the process is started.
[0050]
Step ST101
In step ST101, all data in the ME memory is initialized to 0, either by writing 0 or by asserting a reset signal.
[0051]
Step ST102
In step ST102, a counter variable n that counts pixels in one frame memory is initialized to zero.
[0052]
Step ST103
In step ST103, the pixel value in a certain block range centered on the target pixel Ln from the frame memory 11 of FIG. 4 is set as a feature amount (feature amount address).
[0053]
Step ST104
In step ST104, the content of the ME memory 13 at the feature amount address equal to the feature amount obtained in step ST103 and at flag address 0, that is, ME memory (feature amount, 0), is read and set as the flag address.
[0054]
Step ST105
In step ST105, the content ME memory (feature amount, 0) of the ME memory 13, at the feature amount address equal to the feature amount obtained in step ST103 and at flag address 0, is incremented.
[0055]
Step ST106
In step ST106, the position information of the pixel of interest Ln is written into ME memory (feature amount, flag address + 1) of the ME memory 13.
[0056]
Step ST107
In step ST107, the count variable n is incremented.
[0057]
Step ST108
In step ST108, it is determined whether the target pixel Ln is the last pixel in the frame. If it is not the last pixel, the process proceeds to step ST103 and the same process is repeated for the next pixel.
If it is the last pixel, the process proceeds to step ST109 to end the process.
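The storing procedure of steps ST101 to ST108 can be sketched as follows. This is a minimal illustration, not the apparatus itself: the names `block_feature` and `build_me_memory` are hypothetical, a Python dict stands in for the feature-addressed memory, frames are lists of rows, and border pixels are skipped (an assumption, since the text does not say how block ranges at the frame edge are handled). Each cell keeps the entry count at index 0, playing the role of flag address 0, followed by the stored positions.

```python
def block_feature(frame, i, j, radius=1):
    """Pixel values of the block centered on (i, j), flattened row by row."""
    return tuple(
        frame[i + di][j + dj]
        for di in range(-radius, radius + 1)
        for dj in range(-radius, radius + 1)
    )


def build_me_memory(ref_frame, radius=1):
    """Store reference-frame positions under their feature amount (ST101-ST108)."""
    me_memory = {}                                   # ST101: memory starts empty
    height, width = len(ref_frame), len(ref_frame[0])
    for i in range(radius, height - radius):         # ST102/ST107/ST108: pixel loop
        for j in range(radius, width - radius):
            feature = block_feature(ref_frame, i, j, radius)   # ST103
            cell = me_memory.setdefault(feature, [0])
            flag = cell[0]                           # ST104: read the entry count
            cell[0] = flag + 1                       # ST105: increment the count
            cell.append((i, j))                      # ST106: lands at index flag + 1
    return me_memory


if __name__ == "__main__":
    flat = [[7] * 4 for _ in range(3)]
    print(build_me_memory(flat))
```

For a uniform 3 × 4 frame, both interior pixels share one feature amount, so a single cell holds the count 2 followed by the two positions.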
[0058]
Next, a motion vector detection processing procedure according to this embodiment will be described with reference to FIGS.
FIG. 7 is a flowchart for explaining the processing procedure of motion vector detection according to this embodiment.
[0059]
Step ST201
In step ST201, information on the current frame Fc and the reference frame Fr is stored in the frame memories 11 and 12, respectively, and the information on the reference frame is then stored in the ME memory 13 while being converted into feature amount addresses. The details are as described above (steps ST100 to ST109).
[0060]
Step ST202
In step ST202, a count variable n for counting pixels of one frame is initialized to zero.
[0061]
Step ST203
In step ST203, the pixel values in a certain block range centered on the target pixel Ln of the current frame held in the first frame memory 11 are sent to the address control unit 14 as the feature amount.
[0062]
Step ST204
In step ST204, the address control unit 14 reads the value of the content ME memory (feature value address, 0) from the ME memory 13 using the received feature value as the feature value address, and substitutes the value into the variable kn indicating the number of candidates.
In addition, a variable k meaning a candidate number counter is initialized to 1, a variable min meaning a minimum distance is set to ∞, and a variable L meaning a distance is initialized to 0.
[0063]
Step ST205
In step ST205, the distance between the target pixel Ln in the current frame and the ME memory contents (feature amount address, k) = position coordinates read from the ME memory 13 is calculated and substituted into the variable L.
[0064]
Step ST206
In step ST206, the size discrimination between the distance L obtained in step ST205 and the minimum distance min is performed.
As a result, if min > L, the process proceeds to step ST207 to update the minimum distance value min, and if min ≦ L, the update step is skipped and the process proceeds to step ST208.
[0065]
Step ST207
In step ST207, the minimum distance min is updated to L. The flag address value at that time, that is, k is stored in the variable ans.
[0066]
Step ST208
In step ST208, it is determined whether the candidate counter is the number of candidates. If it is the number of candidates, the process proceeds to step ST210, and if there are still candidates, the process proceeds to step ST209.
[0067]
Step ST209
In step ST209, after incrementing the candidate counter k, the process proceeds to step ST205.
[0068]
Step ST210
In step ST210, position information having the minimum distance from the pixel Ln in the current frame, that is, the value of the ME memory (feature amount address number 0, ans) is read, and the difference coordinate is set as the motion vector M.
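The candidate loop of steps ST204 to ST210 amounts to a nearest-position search. The sketch below is an illustration under stated assumptions: `nearest_candidate` is a hypothetical name, positions are (row, column) tuples, the distance is taken to be Euclidean, and the difference coordinate is computed as candidate minus target. None of these choices is fixed by the text.

```python
import math


def nearest_candidate(target_pos, candidates):
    """Return (motion_vector, distance) for the closest candidate, or None if empty."""
    best, best_dist = None, math.inf                 # ST204: min starts at infinity
    for pos in candidates:                           # ST205-ST209: loop over k
        dist = math.hypot(pos[0] - target_pos[0], pos[1] - target_pos[1])
        if dist < best_dist:                         # ST206: update only if min > L
            best, best_dist = pos, dist              # ST207: remember the best entry
    if best is None:
        return None
    vector = (best[0] - target_pos[0], best[1] - target_pos[1])   # ST210
    return vector, best_dist


if __name__ == "__main__":
    print(nearest_candidate((5, 5), [(9, 9), (5, 7), (0, 0)]))
```

Because the comparison is strict, a later candidate at the same distance does not replace an earlier one, which matches the min ≦ L branch skipping the update.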
[0069]
Step ST215
In step ST215, when the value of the motion vector M obtained from the corresponding feature quantity address is smaller than the maximum value Mmax of the assumed motion vector (M <Mmax), it is determined as a correct motion vector, and the process proceeds to ST211.
When the value of the motion vector M obtained from the corresponding feature amount address is equal to or greater than the maximum value Mmax of the assumed motion vector (M ≧ Mmax), it is determined as a motion vector at a non-corresponding location, and the process proceeds to ST216.
[0070]
Step ST216
In step ST216, a feature amount address close to that of the pixel Ln in the current frame is generated for the ME memory 13, and the distance between the pixel Ln and the ME memory contents (feature amount address number p, k) = position coordinates is calculated and substituted into the variable L. Here, the number of neighborhood feature amount address candidates is pn and the number of candidates is kn.
[0071]
Step ST217
In step ST217, the distance L obtained in step ST205 is compared with the minimum distance min. If min > L, the process proceeds to step ST207 to update the minimum distance, and if min ≦ L, the update step is skipped and the process proceeds to step ST208.
[0072]
Step ST218
In step ST218, the minimum distance value min is updated to L. The flag address value at that time, that is, k is stored in the variable ans.
[0073]
Step ST219
In step ST219, it is determined whether the candidate counter has reached the number of candidates. If it has, the process proceeds to step ST210, and if there are still candidates, the process proceeds to step ST220.
[0074]
Step ST220
In step ST220, after incrementing the candidate counter k, the process proceeds to step ST205.
[0075]
Step ST221
In step ST221, the position information having the minimum distance from the pixel Ln in the current frame, that is, the value of the ME memory (feature amount address number 0, ans) is read, and the difference coordinate is set as the motion vector M.
[0076]
Steps ST222 and ST223
In steps ST222 and ST223, when the value of the motion vector M obtained from the corresponding feature quantity address is smaller than the maximum value Mmax of the assumed motion vector (M <Mmax), it is determined as a correct motion vector, and the process proceeds to ST211.
When the value of the motion vector M obtained from the corresponding feature amount address is equal to or greater than the maximum value Mmax of the assumed motion vector (M ≧ Mmax), it is determined as a motion vector at a non-corresponding location, and another neighborhood feature amount address is generated. Therefore, the process proceeds to ST216.
[0077]
Step ST211
In step ST211, the motion vector of the target pixel is output.
[0078]
Step ST212
In step ST212, the counter variable n of the pixel is incremented.
[0079]
Step ST213
In step ST213, it is determined whether the pixel of interest is the last pixel in the current frame. If it is the last pixel, the process proceeds to step ST214 and ends. If not, the process returns to step ST203 to obtain the motion vector of the next pixel.
[0080]
Next, calculation of the neighborhood feature amount will be described.
It is possible that the feature amount of a certain part of the moving object changes a little in adjacent frames. In the present embodiment, as a countermeasure, a method is adopted in which a moving distance is obtained by applying a similar feature amount from neighboring feature amounts even if the feature amount changes slightly.
Since the elements in the feature amount space are orthogonal to each other, a certain feature amount is defined in the feature amount space with the number of elements as an axis, as indicated by d-a1 in FIG. That is, the spatial coordinates of the pixel from which the feature amount is extracted are linked to the coordinates indicated by d-a1.
A feature quantity in the vicinity of a certain feature quantity is defined as a region that allows a certain range of fluctuation for the values of the feature quantity axes x, y, and z, as indicated by d-a2 in FIG. As described above, the feature amount according to the present embodiment is the pixel values in a certain block centered on the target pixel. For example, if the block range is 3 × 3, the vertical direction is i, the horizontal direction is j, and the pixel value at position (i, j) is L(i, j), the feature amount in this case is given by Expression (2).
[0081]
As an example of the neighborhood feature amount, 1-bit inversion of the feature amount, shown as ST301 in FIG. 9, can be given. Since the feature amount is the pixel values in a certain block centered on the pixel of interest, a possible change is expressed, as in Equation 3 below, by adding a difference qn to each pixel value.
[0082]
[Equation 3]

[0083]
By changing any one of the values from q1 to q9, the neighborhood feature amount address candidates in step ST216 of FIG. 7 can be made.
When one of the values of L in Expression (3) is bit-inverted, the number of neighborhood feature amount addresses is Pn = 9.
As a way of defining the neighborhood feature amount, the search starts from close values, and the difference qn is increased when no solution is obtained. This loop corresponds to the loop between step ST216 and step ST223 in FIG. 7.
In this case, as in the 2-bit inversion of the feature amount (ST302 in FIG. 9), the values of qn in Expression (3) are changed and their number is increased. Even if the above loop is iterated, the calculation consists only of address reference, difference calculation, and branching, so the amount of calculation does not increase significantly.
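The neighborhood search can be sketched as below. This is a simplified illustration, not the flow of FIG. 7 itself: `neighbor_features` and `detect_with_neighbors` are hypothetical names, the memory is modeled as a dict from feature tuples to position lists, the motion vector is compared with Mmax by its Euclidean length, and the difference is widened by inverting a progressively higher bit of one element rather than by the 2-bit inversion of FIG. 9. All of these are assumptions.

```python
import math


def neighbor_features(feature, bit=0):
    """Features obtained by inverting one bit of exactly one element."""
    mask = 1 << bit
    return [
        feature[:k] + (feature[k] ^ mask,) + feature[k + 1:]
        for k in range(len(feature))
    ]


def detect_with_neighbors(target_pos, feature, me_memory, m_max, max_bit=2):
    """Try the exact feature first, then neighbors with a growing difference."""

    def best_vector(keys):
        best, best_len = None, math.inf
        for key in keys:                              # loop over feature addresses
            for pos in me_memory.get(key, []):        # loop over stored positions
                vec = (pos[0] - target_pos[0], pos[1] - target_pos[1])
                length = math.hypot(vec[0], vec[1])
                if length < best_len:
                    best, best_len = vec, length
        return best, best_len

    vec, length = best_vector([feature])
    if vec is not None and length < m_max:            # ST215: accept if M < Mmax
        return vec
    for bit in range(max_bit + 1):                    # widen the difference step by step
        vec, length = best_vector(neighbor_features(feature, bit))   # ST216-ST221
        if vec is not None and length < m_max:        # ST222/ST223
            return vec
    return None                                       # no acceptable match found


if __name__ == "__main__":
    memory = {(10, 10, 10): [(0, 50)], (10, 11, 10): [(3, 4)]}
    print(detect_with_neighbors((3, 3), (10, 10, 10), memory, m_max=5))
```

For a nine-element feature, `neighbor_features` yields nine candidates per bit position, in line with Pn = 9 for a single inverted element.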
[0084]
As for the pixel value, when, for example, 1 pixel = 8 bits, an image such as computer graphics (CG) can be matched using full-bit (8-bit) information, but a natural image varies from frame to frame, so it is desirable to perform the matching process with predetermined bits excluded from the plurality of bits. Specifically, the lower few bits may be masked, or the number of bits may be reduced by requantization.
In other words, it is desirable to reduce the number of bits in nonlinear / linear quantization (reduce the number of quantization bits).
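The two options mentioned above, masking lower bits and reducing the bit count, can be illustrated as follows. The function names and the default of dropping 2 bits or keeping 4 bits are hypothetical choices for illustration; the text does not specify how many bits to exclude.

```python
def mask_low_bits(pixel, drop=2, bits=8):
    """Zero the lowest `drop` bits of a `bits`-wide pixel value."""
    keep_mask = ((1 << bits) - 1) & ~((1 << drop) - 1)
    return pixel & keep_mask


def requantize(pixel, keep=4, bits=8):
    """Linearly requantize a `bits`-wide pixel value to `keep` bits."""
    return pixel >> (bits - keep)


if __name__ == "__main__":
    print(mask_low_bits(183), requantize(183))   # prints "180 11"
```

With either option, small frame-to-frame fluctuations in the low-order bits no longer change the feature amount address, at the cost of more positions sharing one address.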
[0085]
As described above, the first embodiment provides: the first frame memory 11, which stores the information of the current frame Fc and outputs address information, that is, the feature amount of the target pixel, together with the information of the current frame Fc; the second frame memory 12, which stores the previous screen information held in the first frame memory 11 as information of the reference frame Fr; the ME memory 13, which, based on the information of the reference frame Fr stored in the second frame memory 12, converts a feature amount including pixel values in a certain block range centered on the target pixel as an address and stores information including the position information; and the address control unit 14. The address control unit 14 reads the information stored in the ME memory 13 using, as the feature amount address, the feature amount of the target pixel included in the information of the current frame Fc supplied from the first frame memory 11, calculates the distance between the target pixel in the current frame and each position coordinate read from the ME memory 13 at that feature amount address, and detects the difference coordinate based on the position information with the minimum distance among the plurality of candidates as the motion vector (Vx, Vy). When the value of the detected motion vector M is smaller than the maximum value Mmax of the assumed motion vector (within the specified range), it determines that the motion vector is correct and outputs it from the terminal TOUT. When the value is equal to or greater than Mmax (outside the specified range), it determines that the motion vector belongs to a non-corresponding location, generates for the ME memory 13 an address of a feature amount close to that of the pixel in the current frame, and performs the matching process again based on the position information of the neighboring pixels.
The following effects can therefore be obtained.
That is, in the first embodiment, the spatial pattern information in the block area is used as the feature amount, and only distance calculations and comparisons are performed for the number of candidates. There is therefore an advantage that highly accurate motion vector detection is possible with a smaller amount of calculation than the conventional method.
[0086]
When the number of candidates increases, the information stored in the ME memory 13 may be divided into a certain area instead of all information of one frame.
[0087]
Second embodiment
FIG. 10 is a block diagram showing a second embodiment of a motion detection apparatus as an image processing apparatus according to the present invention.
[0088]
The second embodiment differs from the first embodiment described above in that a class generation unit 15 is provided as feature amount generation means for obtaining the feature amount, which enables matching with a more suitable feature amount.
[0089]
The class generation unit 15 receives information on the current frame Fc of the first frame memory 11, generates a quantization code based on ADRC as a feature amount of the current frame, and outputs the generated feature code to the address control unit 14A.
Further, the class generation unit 15 receives the information of the reference frame Fr of the second frame memory 12, generates a quantization code based on ADRC as a feature amount of the reference frame, and outputs it to the ME memory 13.
[0090]
The address control unit 14A as a matching unit reads the information stored in the ME memory 13 using the feature amount of the current frame as the feature amount address, and the target pixel in the current frame and the ADRC included in the feature amount address read from the ME memory 13 The motion vector of the pixel of interest is detected by matching the quantization codes.
[0091]
As described above, ADRC is used for generating the feature amount in the class generation unit 15 according to the second embodiment. ADRC (Adaptive Dynamic Range Coding) is an adaptive quantization method developed for high performance coding for VTR (Video Tape Recorder), but it efficiently expresses local patterns at the signal level with a short word length. Therefore, in this second embodiment, ADRC is used for code generation of space class classification.
[0092]
ADRC is an algorithm that performs requantization by equally dividing the range between the maximum value MAX and the minimum value MIN with a specified bit length, as expressed by Expression 4 below, where DR is the dynamic range of the space class tap, n is the bit allocation, L is the data level of a pixel of the space class tap, and Q is the requantization code.
[0093]
[Expression 4]

[0094]
Here, [ ] denotes truncation. The matching processing flow is equivalent to the flow described with reference to FIG. 7 with the feature amount replaced by the ADRC quantization code, so its description is omitted.
[0095]
As an example of how to take a space class tap, when the block size is 3 × 3, all pixels may be used as shown in FIG. 11A, or another configuration may be used as shown in FIG. 11B; the taps may be determined within the limit of the amount of information given to the class code.
Similarly, as an example in the case where the block size is 5 × 5, the forms shown in FIGS. 11C and 11D can be adopted.
The example of FIG. 11C is a case where the taps form a cross shape, and the example of FIG. 11D is a case where the taps form a cross shape and the end pixels are also used.
[0096]
As an example, the class code when the block size is 3 × 3 as shown in FIG. 11A can be expressed as follows.
[0097]
[Equation 5]
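One plausible way to form a class code from the per-pixel ADRC codes is to concatenate them into a single integer. The sketch below does this with the first tap in the most significant position; both the ordering and the name `class_code` are assumptions, since the expression itself is not reproduced in this text.

```python
def class_code(codes, n_bits=1):
    """Concatenate n_bits-wide codes into one integer, first code most significant."""
    value = 0
    for q in codes:
        value = (value << n_bits) | q    # shift in the next requantization code
    return value


if __name__ == "__main__":
    print(class_code([0, 1, 1, 0, 1]))   # prints 13, that is, binary 01101
```

With nine taps at 1 bit each, the class code fits in 9 bits, which keeps the feature amount address space small.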

[0098]
Next, it will be described with reference to FIGS. 12A and 12B that the ADRC quantization code is superior to the peripheral pixel value.
FIGS. 12A and 12B show, for ease of understanding, pixel values when one line of an image moves from the reference frame to the current frame. FIG. 13 shows the correspondence between the decimal notation and the hexadecimal notation of the luminance value.
[0099]
Normally, in a natural image, even when the same pattern moves, it is unlikely that the pixel values remain identical, and the pixel levels shift as shown in FIGS. 12A and 12B.
In this case, the point is whether the pattern can be correctly detected as the same pattern. For the case where peripheral pixel values are used as the feature amount, the figures also include the code result obtained when the lower bits are masked to suppress the influence of noise components.
As shown there, false detection may occur even though the pattern is the same.
On the other hand, it can be seen that the ADRC quantization code can effectively express a local pattern of signal levels with a short word length, and thus is resistant to minute level fluctuations, and the same code result can be obtained.
Specifically, the ADRC code of the line in the reference frame is “01101”, and the ADRC code of the line in the current frame is also “01101”, so they match.
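The comparison can be reproduced with made-up numbers. In the sketch below, the two pixel lines are hypothetical values chosen for illustration (they are not the values of FIG. 12), and the ADRC form DR = MAX − MIN + 1, Q = [(L − MIN + 0.5) × 2^n / DR] is assumed. The same pattern with slightly shifted levels yields identical 1-bit ADRC codes, while the lower-bit-masked pixel values differ.

```python
def adrc_code(tap, n_bits=1):
    """ADRC code string for a tap (one digit per pixel; intended for small n_bits)."""
    lo, hi = min(tap), max(tap)
    dr = hi - lo + 1
    return "".join(str(int((v - lo + 0.5) * (1 << n_bits) / dr)) for v in tap)


def masked(tap, drop=2):
    """Pixel values with the lowest `drop` bits cleared."""
    return [v & ~((1 << drop) - 1) for v in tap]


ref_line = [100, 120, 118, 101, 119]   # hypothetical line in the reference frame
cur_line = [103, 124, 121, 105, 122]   # same pattern, levels shifted slightly

if __name__ == "__main__":
    print(adrc_code(ref_line), adrc_code(cur_line))   # prints "01101 01101"
    print(masked(ref_line) == masked(cur_line))       # prints "False"
```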
[0100]
The calculation of the neighborhood feature amount in the second embodiment will be described.
It is possible that the feature amount of a certain part of the moving object changes a little in adjacent frames. In the present embodiment, as a countermeasure, a method is adopted in which a moving distance is obtained by applying a similar feature amount from neighboring feature amounts even if the feature amount changes slightly.
Since the elements in the feature amount space are orthogonal to each other, a certain feature amount is defined in the feature amount space with the number of elements as an axis, as indicated by d-a1 in FIG. That is, the spatial coordinates of the pixel from which the feature amount is extracted are linked to the coordinates indicated by d-a1.
A feature quantity in the vicinity of a certain feature quantity is defined as a region that allows a certain range of fluctuation for the values of the feature quantity axes x, y, and z, as indicated by d-a2 in FIG. As described above, the feature amount according to the present embodiment is the pixel values in a certain block centered on the target pixel. For example, if the block range is 3 × 3, the vertical direction is i, the horizontal direction is j, and the pixel value at position (i, j) is L(i, j), the feature amount in this case is given by Expression (5).
[0101]
As an example of the neighborhood feature amount, 1-bit inversion of the feature amount, shown as ST301 in FIG. 9, can be given. Since the feature amount is the pixel values in a certain block centered on the pixel of interest, a possible change is expressed, as in Expression 6 below, by adding a difference qn to each pixel value.
[0102]
[Formula 6]

[0103]
By changing any one of the values from q1 to q9, the neighborhood feature amount address candidates in step ST216 of FIG. 7 can be obtained.
When one of the values of L in Expression (6) is bit-inverted, the number of neighborhood feature amount addresses is Pn = 9.
As a way of defining the neighborhood feature amount, the search starts from close values, and the difference qn is increased when no solution is obtained. This loop corresponds to the loop between step ST216 and step ST223 in FIG. 7.
In this case, as in the 2-bit inversion of the feature amount (ST302 in FIG. 9), the values of qn in Expression (6) are changed and their number is increased. Even if the above loop is iterated, the calculation consists only of address reference, difference calculation, and branching, so the amount of calculation does not increase significantly.
[0104]
As described above, according to the second embodiment, there is an advantage that a motion vector can be detected with higher accuracy than before by using an ADRC quantization code as a feature amount.
[0105]
[Effects of the Invention]
According to the present invention, the spatial pattern information in the block area is used as the feature amount, and only distance calculations and comparisons are performed for the number of candidates, so the amount of calculation is smaller than that of the conventional method. In addition, the matching operation does not use the sum of absolute differences within the block but uses the pixel values in the block as they are, that is, it expresses the spatial feature, so there is an advantage that highly accurate motion vector detection is possible.
[0106]
Further, according to the present invention, it is possible to detect a motion vector with high accuracy by using an ADRC quantization code as a feature amount as spatial pattern information in a block area.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration example of a conventional motion detection apparatus that employs a block matching algorithm.
FIG. 2 is a diagram for explaining an outline of a block matching algorithm.
FIG. 3 is a flowchart for explaining a processing procedure for detecting a motion vector of a pixel Fc (x, y) in a current frame Fc.
FIG. 4 is a block diagram showing a first embodiment of a motion detection apparatus according to the present invention.
FIG. 5 is a diagram for explaining the structure of a motion memory in the feature quantity addressing method according to the present embodiment.
FIG. 6 is a diagram for explaining a storing procedure in a motion memory in the feature quantity addressing method according to the present embodiment.
FIG. 7 is a flowchart for explaining an operation of motion detection in the feature quantity address method according to the embodiment.
FIG. 8 is a diagram for explaining an example of generating a neighborhood feature quantity.
FIG. 9 is a diagram for explaining an example of generating a neighborhood feature quantity.
FIG. 10 is a block diagram showing a second embodiment of the motion detection apparatus according to the present invention.
FIG. 11 is a diagram illustrating an example of how to take a class tap.
FIG. 12 is a diagram for explaining that it is better to use an ADRC quantization code than a peripheral pixel value.
FIG. 13 is a diagram illustrating a correspondence relationship between a decimal number and a hexadecimal number of a luminance value.
[Explanation of symbols]
10, 10A ... motion detection apparatus, 11 ... first frame memory, 12 ... second frame memory, 13 ... motion vector memory (ME memory), 14, 14A ... address control unit, 15 ... class generation unit.

Claims

An image processing apparatus that performs matching processing between information of a current frame and information of a reference frame of image data, comprising:
storage means for storing, based on the information of the reference frame, for each pixel in a predetermined block range centered on the position in the reference frame corresponding to a target pixel in the current frame, the position coordinates of the pixels having each pixel value, using the pixel value, which is the feature amount, as an address; and
matching means for reading the stored information of the storage means using the feature amount of the target pixel in the current frame as a feature amount address, calculating the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage means, and thereby performing the matching processing,
wherein the matching means:
calculates the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage means using the feature amount of the target pixel as the feature amount address, and detects, as motion information of the target pixel, the difference between the position coordinates of the target pixel and the position coordinates having the smallest distance among the plurality of candidates;
determines that the detected motion information is correct information when the value of the detected motion information is smaller than an assumed value of the motion information; and
when the value of the detected motion information is greater than or equal to the assumed value of the motion information, determines the detected motion information to be unexpected motion information,
takes, as the feature amount of a neighboring pixel close to the target pixel, a value obtained by adding a difference in position coordinates to each pixel value in a predetermined block range centered on the target pixel, takes, as a candidate for the feature amount of a neighboring pixel close to the target pixel, a value in which any one of the position coordinate differences has been changed, calculates, using the candidate feature amount as a feature amount address, the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage means, performs the matching processing again, and performs the detection processing for the motion information and the determination processing on the value of the detected motion information.

The image processing apparatus according to claim 1, wherein, when obtaining a candidate for the feature amount of a neighboring pixel close to the target pixel, the matching means sets the difference in position coordinates so that the candidate takes a value close to the feature amount of the target pixel, and sets the difference to a larger value when the value of the detected motion information is greater than a predetermined value of the assumed motion information.
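The determination and re-matching recited in the two claims above can be sketched roughly as follows. This is an illustrative reading under stated assumptions, not a restatement of the claims: the feature amount is modeled as a tuple of integers, a neighboring candidate is formed by changing exactly one element by a difference `delta`, the difference starts small and is widened step by step, the value of the motion information is taken as the larger of the absolute x and y components, and a missing address is treated as a failed match. All function and parameter names are hypothetical.

```python
def nearest_motion(memory, feature, position):
    """Vector to the nearest position stored under `feature`, or None."""
    candidates = memory.get(feature, [])
    if not candidates:
        return None
    px, py = position
    bx, by = min(candidates, key=lambda c: (c[0] - px) ** 2 + (c[1] - py) ** 2)
    return (bx - px, by - py)


def magnitude(motion):
    """Value of the motion information (assumed: largest absolute component)."""
    return max(abs(motion[0]), abs(motion[1]))


def neighbor_features(feature, delta):
    """Yield features that differ from `feature` in exactly one element."""
    for i, value in enumerate(feature):
        for step in (-delta, delta):
            yield feature[:i] + (value + step,) + feature[i + 1:]


def detect_with_retry(memory, feature, position, assumed_max, max_delta=3):
    """Accept a motion below the assumed value; otherwise retry with neighbors."""
    motion = nearest_motion(memory, feature, position)
    if motion is not None and magnitude(motion) < assumed_max:
        return motion  # judged correct
    # Unexpected motion: try nearby features, smallest difference first.
    for delta in range(1, max_delta + 1):
        for candidate in neighbor_features(feature, delta):
            motion = nearest_motion(memory, candidate, position)
            if motion is not None and magnitude(motion) < assumed_max:
                return motion
    return None


memory = {(5, 5): [(10, 10)], (5, 6): [(3, 2)]}
print(detect_with_retry(memory, (5, 5), (2, 2), assumed_max=4))  # prints (1, 0)
```

In the example, the exact feature (5, 5) yields a motion of (8, 8), which is not below the assumed value 4, so the neighboring feature (5, 6) is tried and its motion (1, 0) is accepted.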

An image processing apparatus that performs matching processing between information of a current frame and information of a reference frame of image data, comprising:
feature amount generating means for receiving the information of the current frame and generating, as a feature amount, a quantization code based on adaptive quantization (ADRC) associated with pattern information of each pixel in a predetermined block centered on a target pixel of the current frame, and for receiving the information of the reference frame and generating, as a feature amount, a quantization code based on adaptive quantization (ADRC) associated with pattern information of each pixel in a predetermined block range centered on the position in the reference frame corresponding to the target pixel in the current frame;
storage means for storing, based on the reference frame information from the feature amount generating means, for each pixel in a predetermined block range centered on the position in the reference frame corresponding to the target pixel in the current frame, the position coordinates of the pixels having each quantization code, using the quantization code, which is the feature amount, as an address; and
matching means for reading the stored information of the storage means using the feature amount of the target pixel in the current frame as a feature amount address, calculating the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage means, and thereby performing the matching processing,
wherein the matching means:
calculates the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage means using the feature amount of the target pixel as the feature amount address, and detects, as motion information of the target pixel, the difference between the position coordinates of the target pixel and the position coordinates having the smallest distance among the plurality of candidates;
determines that the detected motion information is correct information when the value of the detected motion information is smaller than an assumed value of the motion information; and
when the value of the detected motion information is greater than or equal to the assumed value of the motion information, determines the detected motion information to be unexpected motion information,
takes, as the feature amount of a neighboring pixel close to the target pixel, a value obtained by adding a difference in position coordinates to each pixel value in a predetermined block range centered on the target pixel, takes, as a candidate for the feature amount of a neighboring pixel close to the target pixel, a value in which any one of the position coordinate differences has been changed, calculates, using the candidate feature amount as a feature amount address, the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage means, performs the matching processing again, and performs the detection processing for the motion information and the determination processing on the value of the detected motion information.

An image processing method for performing matching processing between information of a current frame and information of a reference frame of image data, comprising:
a first step of storing, based on the information of the reference frame, for each pixel in a predetermined block range centered on the position in the reference frame corresponding to a target pixel in the current frame, the position coordinates of the pixels having each pixel value, using the pixel value, which is the feature amount, as an address; and
a second step of reading the stored information of the storage unit using the feature amount of the target pixel in the current frame as a feature amount address, calculating the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage unit, and thereby performing the matching processing,
wherein, in the second step:
the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage unit using the feature amount of the target pixel as the feature amount address is calculated, and the difference between the position coordinates of the target pixel and the position coordinates having the smallest distance among the plurality of candidates is detected as motion information of the target pixel;
when the value of the detected motion information is smaller than an assumed value of the motion information, the detected motion information is determined to be correct information; and
when the value of the detected motion information is greater than or equal to the assumed value of the motion information, the detected motion information is determined to be unexpected motion information,
a value obtained by adding a difference in position coordinates to each pixel value in a predetermined block range centered on the target pixel is taken as the feature amount of a neighboring pixel close to the target pixel, a value in which any one of the position coordinate differences has been changed is taken as a candidate for the feature amount of a neighboring pixel close to the target pixel, the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage unit is calculated using the candidate feature amount as a feature amount address, the matching processing is performed again, and the detection processing for the motion information and the determination processing on the value of the detected motion information are performed.

An image processing method for performing matching processing between information of a current frame and information of a reference frame of image data, comprising:
a first step of receiving the information of the current frame and generating, as a feature amount, a quantization code based on adaptive quantization (ADRC) associated with pattern information of each pixel in a predetermined block centered on a target pixel of the current frame;
a second step of receiving the information of the reference frame and generating, as a feature amount, a quantization code based on adaptive quantization (ADRC) associated with pattern information of each pixel in a predetermined block range centered on the position in the reference frame corresponding to the target pixel in the current frame;
a third step of storing, based on the reference frame information from the second step, for each pixel in a predetermined block range centered on the position in the reference frame corresponding to the target pixel in the current frame, the position coordinates of the pixels having each quantization code, using the quantization code, which is the feature amount, as an address; and
a fourth step of reading the stored information of the storage unit using the feature amount of the target pixel in the current frame as a feature amount address, calculating the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage unit, and thereby performing the matching processing,
wherein, in the fourth step:
the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage unit using the feature amount of the target pixel as the feature amount address is calculated, and the difference between the position coordinates of the target pixel and the position coordinates having the smallest distance among the plurality of candidates is detected as motion information of the target pixel;
when the value of the detected motion information is smaller than an assumed value of the motion information, the detected motion information is determined to be correct information; and
when the value of the detected motion information is greater than or equal to the assumed value of the motion information, the detected motion information is determined to be unexpected motion information,
a value obtained by adding a difference in position coordinates to each pixel value in a predetermined block range centered on the target pixel is taken as the feature amount of a neighboring pixel close to the target pixel, a value in which any one of the position coordinate differences has been changed is taken as a candidate for the feature amount of a neighboring pixel close to the target pixel, the distance between the position coordinates of the target pixel in the current frame and each of the position coordinates included in the feature amount address read from the storage unit is calculated using the candidate feature amount as a feature amount address, the matching processing is performed again, and the detection processing for the motion information and the determination processing on the value of the detected motion information are performed.