JP4596212B2 - Image processing apparatus and method, recording medium, and program

Info

Publication number: JP4596212B2
Application number: JP2001181395A
Authority: JP
Inventors: 哲二郎近藤; 淳一石橋; 貴志沢尾; 直樹藤原; 隆浩永野; 成司和田; 徹三宅
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2001-06-15
Filing date: 2001-06-15
Publication date: 2010-12-08
Anticipated expiration: 2021-06-15
Also published as: CN100458849C; JP2002373336A; CN1969297A

Description

【０００１】
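【Technical Field of the Invention】
The present invention relates to an image processing apparatus and method, a recording medium, and a program, and more particularly to an image processing apparatus and method, a recording medium, and a program that take into account the difference between the signal detected by a sensor and the real world.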
【０００２】
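【Prior Art】
Class classification adaptive processing is one technique for generating a higher-resolution image from an input image. In one example, the coefficients used to generate an image of higher resolution in the spatial direction are generated in advance, and the higher-resolution image is then generated from those coefficients.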
【０００３】
図１は、SD（Standard Definition（標準精細度））画像からHD（High Definition（高精細度））画像を生成するクラス分類適応処理において使用される係数を生成する、従来の画像処理装置の構成を示すブロック図である。
【０００４】
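The frame memory 11 stores the input image, which is an HD image, in units of frames. The frame memory 11 supplies the stored HD image to the weighted averaging unit 12 and the corresponding pixel acquisition unit 16.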
【０００５】
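The weighted averaging unit 12 generates an SD image by quarter weighted averaging of the HD image stored in the frame memory 11, and supplies the generated SD image to the frame memory 13.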
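The quarter weighted averaging that produces the SD (student) image from the HD (teacher) image can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes that every non-overlapping 2x2 block of HD pixels is averaged with equal weights, which the text does not state explicitly. The function name `quarter_average` and the sample values are hypothetical.

```python
def quarter_average(hd):
    """Downsample an HD frame (list of rows) by averaging each 2x2 block.

    Assumption: equal weights of 1/4 per pixel; the source only says
    "quarter weighted averaging".
    """
    h, w = len(hd), len(hd[0])
    if h % 2 or w % 2:
        raise ValueError("frame dimensions must be even")
    return [
        [
            (hd[y][x] + hd[y][x + 1] + hd[y + 1][x] + hd[y + 1][x + 1]) / 4
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]


# Illustrative 4x4 "HD" frame; the resulting "SD" frame is 2x2.
hd_frame = [
    [10, 20, 30, 40],
    [30, 40, 50, 60],
    [0, 0, 8, 8],
    [4, 4, 8, 8],
]
sd_frame = quarter_average(hd_frame)
```

Each SD pixel therefore covers four HD pixels, which is why the mapping stage later produces a "quadruple-density" image from the SD input.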
【０００６】
フレームメモリ１３は、加重平均部１２から供給されたSD画像をフレーム単位で記憶し、記憶しているSD画像をクラス分類部１４および予測タップ取得部１５に供給する。
【０００７】
クラス分類部１４は、クラスタップ取得部２１および波形分類部２２で構成され、フレームメモリ１３に記憶されているSD画像の、注目している画素である注目画素をクラス分類する。クラスタップ取得部２１は、フレームメモリ１３から、注目画素に対応するSD画像の画素である、所定の数のクラスタップを取得し、取得したクラスタップを波形分類部２２に供給する。
【０００８】
図２は、クラスタップ取得部２１が取得するクラスタップを説明する図である。クラスタップ取得部２１は、図２に示すように、所定の位置の１１個のクラスタップを取得する。
【０００９】
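The waveform classification unit 22 classifies the pixel of interest into one of a plurality of classes based on the class taps, and supplies the class number corresponding to that class to the prediction tap acquisition unit 15. Based on the 11 class taps, the waveform classification unit 22 classifies the pixel of interest into one of 2048 classes.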
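Since 2048 equals 2 to the power of 11, one plausible reading is that each of the 11 class taps contributes one bit to the class number. The sketch below follows that reading; the thresholding rule (compare each tap with the midpoint of the local minimum and maximum) is an assumption for illustration and is not specified in the text. The name `classify` is hypothetical.

```python
def classify(class_taps):
    """Map 11 tap values to a class number in 0..2047, one bit per tap.

    Assumption: a tap contributes 1 when it is at or above the midpoint
    of the taps' minimum and maximum, otherwise 0.
    """
    if len(class_taps) != 11:
        raise ValueError("expected 11 class taps")
    mid = (min(class_taps) + max(class_taps)) / 2
    code = 0
    for value in class_taps:
        code = (code << 1) | (1 if value >= mid else 0)
    return code


# Only the last tap is bright, so only the least significant bit is set.
edge_class = classify([0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 255])
# Only the first tap is bright, so only the most significant bit is set.
other_class = classify([255, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0])
```

With one bit per tap, the class number captures the coarse waveform shape around the pixel of interest while ignoring its absolute brightness.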
【００１０】
予測タップ取得部１５は、クラス番号を基に、フレームメモリ１３から分類されたクラスに対応する、SD画像の画素である、所定の数の予測タップを取得し、取得した予測タップおよびクラス番号を対応画素取得部１６に供給する。
【００１１】
図３は、予測タップ取得部１５が取得する予測タップを説明する図である。予測タップ取得部１５は、図３に示すように、所定の位置の９個の予測タップを取得する。
【００１２】
対応画素取得部１６は、予測タップおよびクラス番号を基に、フレームメモリ１１から、予測すべき画素値に対応するHD画像の画素を取得し、予測タップ、クラス番号、および取得した予測すべき画素値に対応するHD画像の画素を正規方程式生成部１７に供給する。
【００１３】
正規方程式生成部１７は、予測タップ、クラス番号、および取得した予測すべき画素値を基に、各クラスに対応し、予測タップおよび予測すべき画素値の関係に対応する正規方程式を生成し、各クラスに対応する、生成した正規方程式を係数計算部１８に供給する。
【００１４】
係数計算部１８は、正規方程式生成部１７から供給された正規方程式を解いて、各クラスに対応する係数セットを計算し、クラス番号と共に、計算した係数セットを係数セットメモリ１９に供給する。
【００１５】
係数セットメモリ１９は、クラス番号を基に、算出された係数セットをクラスに対応させて記憶する。
【００１６】
図４は、クラス分類適応処理の概略を説明する図である。クラス分類適応処理において、HD画像である教師画像から、４分の１加重平均の処理により、対応するSD画像を生成する。生成されたSD画像は、生徒画像と称する。
【００１７】
次に、HD画像である教師画像、および対応するSD画像である生徒画像を基に、SD画像からHD画像を生成するための係数セットが生成される。係数セットは、線形予測などにより、SD画像からHD画像を生成するための係数で構成される。
【００１８】
このように生成された係数セットおよびSD画像から、線形予測などにより、４倍密画像が生成される。係数セットおよび入力画像から、より高密度な画像などを生成する処理をマッピングとも称する。
【００１９】
生成された４倍密画像、および対応するHD画像を基に、SNRの比較、または目視による定性評価が行われる。
【００２０】
特定の教師画像、および対応する生徒画像から生成された係数セットは、特定の教師画像、および対応する生徒画像のセルフの係数セットと称する。セルフの係数セットを使用したマッピングは、セルフマッピングと称する。複数の他の教師画像、および対応する生徒画像から生成された係数セットは、クロスの係数セットと称する。
【００２１】
一方、静止している所定の背景の前で移動する前景である物体をビデオカメラで撮像して得られる画像には、物体の移動速度が比較的速い場合、動きボケが生じ、背景と前景の混ざり合いが生ずる。
【００２２】
従来のクラス分類適応処理においては、図５に示すように、前景、背景、並びに前景および背景の混ざり合いが生じている部分の全てに対して、以上のような学習の処理により、１つの係数セットが生成され、この係数セットを基に、マッピングの処理が実行される。
【００２３】
図６のフローチャートを参照して、SD画像からHD画像を生成する処理において使用される係数を生成する、従来の学習の処理を説明する。ステップＳ１１において、画像処理装置は、生徒画像に未処理の画素があるか否かを判定し、生徒画像に未処理の画素があると判定された場合、ステップＳ１２に進み、ラスタースキャン順に、生徒画像から注目画素を取得する。
【００２４】
ステップＳ１３において、クラス分類部１４のクラスタップ取得部２１は、フレームメモリ１３に記憶されている生徒画像から、注目画素に対応するクラスタップを取得する。ステップＳ１４において、クラス分類部１４の波形分類部２２は、クラスタップを基に、注目画素をクラス分類する。ステップＳ１５において、予測タップ取得部１５は、分類されたクラスを基に、フレームメモリ１３に記憶されている生徒画像から、注目画素に対応する予測タップを取得する。
【００２５】
ステップＳ１６において、対応画素取得部１６は、分類されたクラスを基に、フレームメモリ１１に記憶されている教師画像から、予測すべき画素値に対応する画素を取得する。
【００２６】
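In step S17, the normal equation generation unit 17 adds the prediction taps and the pixel value of the pixel corresponding to the pixel value to be predicted into the matrix of the classified class, and the procedure returns to step S11, where the image processing apparatus again determines whether any unprocessed pixel remains. The per-class matrix into which the prediction taps and the pixel values to be predicted are accumulated corresponds to the normal equation for calculating the coefficients of that class.
【００２７】
If it is determined in step S11 that no unprocessed pixel remains in the student image, the procedure proceeds to step S18, where the normal equation generation unit 17 supplies the per-class matrices, in which the prediction taps and the pixel values to be predicted have been set, to the coefficient calculation unit 18. The coefficient calculation unit 18 solves each per-class matrix and calculates a coefficient set for each class.
【００２８】
In step S19, the coefficient calculation unit 18 outputs the calculated per-class coefficients to the coefficient set memory 19. The coefficient set memory 19 stores a coefficient set for each class, and the processing ends.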
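The learning loop of steps S11 to S19 amounts to a per-class least-squares fit. The following is a minimal sketch under stated assumptions: the normal equation is taken to be the usual form (sum of x xᵀ) w = (sum of x y), where x is the prediction tap vector and y the teacher pixel; the toy data uses 2 taps instead of the 9 prediction taps of the text; and all function names are hypothetical.

```python
from collections import defaultdict


def new_accumulator(n):
    # A accumulates sum(x x^T), b accumulates sum(x * y) for one class.
    return {"A": [[0.0] * n for _ in range(n)], "b": [0.0] * n}


def add_sample(acc, taps, target):
    """Step S17: add one (prediction taps, teacher pixel) pair to the class matrix."""
    n = len(taps)
    for i in range(n):
        acc["b"][i] += taps[i] * target
        for j in range(n):
            acc["A"][i][j] += taps[i] * taps[j]


def solve(acc):
    """Step S18: solve A w = b by Gaussian elimination with partial pivoting."""
    n = len(acc["b"])
    m = [row[:] + [rhs] for row, rhs in zip(acc["A"], acc["b"])]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular normal equation; not enough samples")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    w = [0.0] * n
    for i in reversed(range(n)):
        s = m[i][n] - sum(m[i][j] * w[j] for j in range(i + 1, n))
        w[i] = s / m[i][i]
    return w


def learn(samples, n_taps):
    """samples: iterable of (class_number, prediction_taps, teacher_pixel)."""
    accs = defaultdict(lambda: new_accumulator(n_taps))
    for cls, taps, target in samples:
        add_sample(accs[cls], taps, target)
    # Step S19: one coefficient set per class.
    return {cls: solve(acc) for cls, acc in accs.items()}


# Toy data: class 0 follows y = 0.5*x0 + 0.5*x1, class 1 follows y = x0.
samples = [
    (0, [2.0, 4.0], 3.0), (0, [1.0, 0.0], 0.5), (0, [0.0, 6.0], 3.0),
    (1, [5.0, 1.0], 5.0), (1, [2.0, 3.0], 2.0), (1, [7.0, 0.0], 7.0),
]
coefficient_sets = learn(samples, n_taps=2)
```

Accumulating into a matrix per class means the student image is scanned only once, and the cost of solving depends on the number of taps rather than the number of pixels.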
【００２９】
図７は、クラス分類適応処理により、SD画像からHD画像を生成する従来の画像処理装置の構成を示すブロック図である。
【００３０】
フレームメモリ３１は、SD画像である入力画像を、フレーム単位で記憶する。
フレームメモリ３１は、記憶しているSD画像をマッピング部３２に供給する。
【００３１】
マッピング部３２に入力されたSD画像は、クラス分類部４１および予測タップ取得部４２に供給される。
【００３２】
クラス分類部４１は、クラスタップ取得部５１および波形分類部５２で構成され、フレームメモリ３１に記憶されているSD画像の、注目している画素である、注目画素をクラス分類する。クラスタップ取得部５１は、フレームメモリ３１から注目画素に対応する、所定の数のクラスタップを取得し、取得したクラスタップを波形分類部５２に供給する。
【００３３】
波形分類部５２は、クラスタップを基に、所定の数のクラスのうちの、１つのクラスに注目画素を分類し、分類されたクラスに対応するクラス番号を予測タップ取得部４２に供給する。
【００３４】
予測タップ取得部４２は、クラス番号を基に、フレームメモリ３１に記憶されている入力画像から、分類されたクラスに対応する、所定の数の予測タップを取得し、取得した予測タップおよびクラス番号を予測演算部４３に供給する。
【００３５】
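Based on the class number, the prediction calculation unit 43 acquires the coefficient set corresponding to the class from the coefficient sets stored in the coefficient set memory 33. Using that coefficient set and the prediction taps, the prediction calculation unit 43 predicts a pixel value of the predicted image by linear prediction, and supplies the predicted pixel value to the frame memory 34.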
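The linear prediction performed by the prediction calculation unit 43 is a weighted sum of the prediction taps with the coefficients of the selected class. A minimal sketch, assuming the coefficient sets are held in a dictionary keyed by class number (a hypothetical layout; the text only says they are stored per class):

```python
def predict_pixel(coefficient_sets, class_number, prediction_taps):
    """Return sum(w_i * x_i) using the coefficient set of the given class."""
    weights = coefficient_sets[class_number]
    if len(weights) != len(prediction_taps):
        raise ValueError("coefficient set and prediction taps differ in length")
    return sum(w * x for w, x in zip(weights, prediction_taps))


# Illustrative coefficient sets with 2 taps each (the text uses 9 taps).
example_sets = {0: [0.5, 0.5], 1: [1.0, 0.0]}
predicted = predict_pixel(example_sets, 0, [10.0, 20.0])
```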
【００３６】
フレームメモリ３４は、予測演算部４３から供給された予測された画素値を記憶し、予測された画素値が設定されたHD画像を出力する。
【００３７】
図８は、入力画像の画素値、およびクラス分類適応処理により生成された出力画像の画素値を示す図である。図８に示すように、クラス分類適応処理により生成される画像は、SD画像の帯域制限で失われた波形を含む。その意味で、クラス分類適応処理による、より高解像度の画像の生成の処理は、解像度を創造していると言える。
【００３８】
図９のフローチャートを参照して、クラス分類適応処理を実行する画像処理装置による、SD画像からHD画像を生成する、従来の画像の創造の処理を説明する。
【００３９】
ステップＳ３１において、画像処理装置は、入力画像に未処理の画素があるか否かを判定し、入力画像に未処理の画素があると判定された場合、ステップＳ３２に進み、マッピング部３２は、係数セットメモリ３３に記憶されている係数セットを取得する。ステップＳ３３において、画像処理装置は、ラスタースキャン順に、入力画像から注目画素を取得する。
【００４０】
ステップＳ３４において、クラス分類部４１のクラスタップ取得部５１は、フレームメモリ３１に記憶されている入力画像から、注目画素に対応するクラスタップを取得する。ステップＳ３５において、クラス分類部４１の波形分類部５２は、クラスタップを基に、注目画素を１つのクラスにクラス分類する。
【００４１】
ステップＳ３６において、予測タップ取得部４２は、分類されたクラスを基に、フレームメモリ３１に記憶されている入力画像から、注目画素に対応する予測タップを取得する。
【００４２】
ステップＳ３７において、予測演算部４３は、分類されたクラスに対応する係数セット、および予測タップを基に、線形予測により、予測画像の画素値を予測する。
【００４３】
ステップＳ３８において、予測演算部４３は、予測された画素値をフレームメモリ３４に出力する。フレームメモリ３４は、予測演算部４３から供給された画素値を記憶する。手続きは、ステップＳ３１に戻り、未処理の画素があるか否かの判定を繰り返す。
【００４４】
ステップＳ３１において、入力画像に未処理の画素がないと判定された場合、ステップＳ３９に進み、フレームメモリ３４は、予測値が設定された、記憶している予測画像を出力して、処理は終了する。
【００４５】
【発明が解決しようとする課題】
静止している背景の前で物体が移動するとき、移動する物体の画像自身の混ざり合いによる動きボケのみならず、背景の画像と移動する物体の画像との混ざり合いが生じる。従来、背景の画像と移動する物体の画像との混ざり合いに対応して画像を処理することは、考えられていなかった。
【００４６】
本発明はこのような状況に鑑みてなされたものであり、背景の画像と移動する物体の画像との混ざり合いに対応して画像を処理することができるようにすることを目的とする。
【００４７】
【課題を解決するための手段】
本発明の画像処理装置は、時間積分効果を有する所定数の画素を有する撮像素子によって取得された所定数の画素からなる入力画像を処理する画像処理装置において、入力画像とその１つ前及び後に取得された入力画像との間で、同じ位置の画素の画素値の差分の絶対値と予め定められた閾値とを比較することで、入力画像とその１つ前の入力画像との間の画素の動きの有無及び入力画像とその１つ後の入力画像との間の画素の動きの有無を判定し、その判定結果に基づいて、入力画像の、前景オブジェクトを構成する前景オブジェクト成分からなる前景領域と、背景オブジェクトを構成する背景オブジェクト成分からなる背景領域を少なくとも特定し、特定結果を示す領域特定情報を出力する領域特定手段と、学習用データとして入力される入力画像を用いて、前景領域の画像と背景領域の画像のそれぞれについて、入力画像を教師画像とし、入力画像を低解像度にした画像を生徒画像として、教師画像の所定の画素を、それに対応する生徒画像の画素周辺の複数の画素と予測係数の線形１次式で表し、線形１次式で表される予測画素値と教師画像の所定の画素の画素値との誤差が最小となるような予測係数を予め求める予測係数生成手段と、予測処理用データとして入力される入力画像の前景領域の画像と背景領域の画像のそれぞれについて、予め求めた予測係数と入力画像の画素周辺の複数の画素との線形１次式を演算することにより、入力画像の前景領域の画像と背景領域の画像を高解像度の出力画像に変換する変換手段とを含むことを特徴とする。
【００４８】
領域特定手段には、入力画像の１つ前と２つ前及び１つ後と２つ後に取得された入力画像との間の画素の動きの有無の判定結果も用いて、前景オブジェクト成分と背景オブジェクト成分が混合されてなる混合領域のうちの、時間の経過に対応して背景オブジェクト成分から前景オブジェクト成分になる領域であるカバードバックグラウンド領域と、時間の経過に対応して前景オブジェクト成分から背景オブジェクト成分になる領域であるアンカバードバックグラウンド領域をさらに特定させ、予測係数生成手段には、学習用データとして入力される入力画像を用いて、カバードバックグラウンド領域およびアンカバードバックグラウンド領域の画像についても予測係数をそれぞれ予め求めさせ、変換手段には、予測処理用データとして入力される入力画像のカバードバックグラウンド領域およびアンカバードバックグラウンド領域の画像のそれぞれについても、高解像度の出力画像に変換させることができる。
【００４９】
予測処理用データとして入力される入力画像の前景領域の画像、背景領域の画像、カバードバックグラウンド領域の画像、およびアンカバードバックグラウンド領域の画像を高解像度に変換してそれぞれ得られた出力画像を合成して出力する合成手段をさらに備えることができる。
【００５１】
本発明の画像処理方法は、時間積分効果を有する所定数の画素を有する撮像素子によって取得された所定数の画素からなる入力画像を処理する画像処理方法において、入力画像とその１つ前及び後に取得された入力画像との間で、同じ位置の画素の画素値の差分の絶対値と予め定められた閾値とを比較することで、入力画像とその１つ前の入力画像との間の画素の動きの有無及び入力画像とその１つ後の入力画像との間の画素の動きの有無を判定し、その判定結果に基づいて、入力画像の、前景オブジェクトを構成する前景オブジェクト成分からなる前景領域と、背景オブジェクトを構成する背景オブジェクト成分からなる背景領域を少なくとも特定し、特定結果を示す領域特定情報を出力する領域特定ステップと、学習用データとして入力される入力画像を用いて、前景領域の画像と背景領域の画像のそれぞれについて、入力画像を教師画像とし、入力画像を低解像度にした画像を生徒画像として、教師画像の所定の画素を、それに対応する生徒画像の画素周辺の複数の画素と予測係数の線形１次式で表し、線形１次式で表される予測画素値と教師画像の所定の画素の画素値との誤差が最小となるような予測係数を予め求める予測係数生成ステップと、予測処理用データとして入力される入力画像の前景領域の画像と背景領域の画像のそれぞれについて、予め求めた予測係数と入力画像の画素周辺の複数の画素との線形１次式を演算することにより、入力画像の前景領域の画像と背景領域の画像を高解像度の出力画像に変換する変換ステップとを含むことを特徴とする。
【００５５】
本発明の記録媒体は、コンピュータに、時間積分効果を有する所定数の画素を有する撮像素子によって取得された所定数の画素からなる入力画像とその１つ前及び後に取得された入力画像との間で、同じ位置の画素の画素値の差分の絶対値と予め定められた閾値とを比較することで、入力画像とその１つ前の入力画像との間の画素の動きの有無及び入力画像とその１つ後の入力画像との間の画素の動きの有無を判定し、その判定結果に基づいて、入力画像の、前景オブジェクトを構成する前景オブジェクト成分からなる前景領域と、背景オブジェクトを構成する背景オブジェクト成分からなる背景領域を少なくとも特定し、特定結果を示す領域特定情報を出力する領域特定ステップと、学習用データとして入力される入力画像を用いて、前景領域の画像と背景領域の画像のそれぞれについて、入力画像を教師画像とし、入力画像を低解像度にした画像を生徒画像として、教師画像の所定の画素を、それに対応する生徒画像の画素周辺の複数の画素と予測係数の線形１次式で表し、線形１次式で表される予測画素値と教師画像の所定の画素の画素値との誤差が最小となるような予測係数を予め求める予測係数生成ステップと、予測処理用データとして入力される入力画像の前景領域の画像と背景領域の画像のそれぞれについて、予め求めた予測係数と入力画像の画素周辺の複数の画素との線形１次式を演算することにより、入力画像の前景領域の画像と背景領域の画像を高解像度の出力画像に変換する変換ステップとを実行させるためのプログラムを記録したコンピュータ読み取り可能なものである。
【００５９】
本発明のプログラムは、時間積分効果を有する所定数の画素を有する撮像素子によって取得された所定数の画素からなる入力画像を処理させるコンピュータに、入力画像とその１つ前及び後に取得された入力画像との間で、同じ位置の画素の画素値の差分の絶対値と予め定められた閾値とを比較することで、入力画像とその１つ前の入力画像との間の画素の動きの有無及び入力画像とその１つ後の入力画像との間の画素の動きの有無を判定し、その判定結果に基づいて、入力画像の、前景オブジェクトを構成する前景オブジェクト成分からなる前景領域と、背景オブジェクトを構成する背景オブジェクト成分からなる背景領域を少なくとも特定し、特定結果を示す領域特定情報を出力する領域特定ステップと、学習用データとして入力される入力画像を用いて、前景領域の画像と背景領域の画像のそれぞれについて、入力画像を教師画像とし、入力画像を低解像度にした画像を生徒画像として、教師画像の所定の画素を、それに対応する生徒画像の画素周辺の複数の画素と予測係数の線形１次式で表し、線形１次式で表される予測画素値と教師画像の所定の画素の画素値との誤差が最小となるような予測係数を予め求める予測係数生成ステップと、予測処理用データとして入力される入力画像の前景領域の画像と背景領域の画像のそれぞれについて、予め求めた予測係数と入力画像の画素周辺の複数の画素との線形１次式を演算することにより、入力画像の前景領域の画像と背景領域の画像を高解像度の出力画像に変換する変換ステップとを実行させるためのものである。
【００６３】
本発明の画像処理装置および方法、記録媒体、並びにプログラムにおいては、入力画像とその１つ前及び後に取得された入力画像との間で、同じ位置の画素の画素値の差分の絶対値と予め定められた閾値とを比較することで、入力画像とその１つ前の入力画像との間の画素の動きの有無及び入力画像とその１つ後の入力画像との間の画素の動きの有無が判定され、その判定結果に基づいて、入力画像の、前景オブジェクトを構成する前景オブジェクト成分からなる前景領域と、背景オブジェクトを構成する背景オブジェクト成分からなる背景領域が少なくとも特定され、特定結果を示す領域特定情報が出力され、学習用データとして入力される入力画像を用いて、前景領域の画像と背景領域の画像のそれぞれについて、入力画像を教師画像とし、入力画像を低解像度にした画像を生徒画像として、教師画像の所定の画素を、それに対応する生徒画像の画素周辺の複数の画素と予測係数の線形１次式で表し、線形１次式で表される予測画素値と教師画像の所定の画素の画素値との誤差が最小となるような予測係数が予め求められ、予測処理用データとして入力される入力画像の前景領域の画像と背景領域の画像のそれぞれについて、予め求めた予測係数と入力画像の画素周辺の複数の画素との線形１次式を演算することにより、入力画像の前景領域の画像と背景領域の画像が高解像度の出力画像に変換される。
【００６４】
【発明の実施の形態】
図１０は、本発明に係る画像処理装置の一実施の形態の構成を示すブロック図である。CPU（Central Processing Unit）７１は、ROM（Read Only Memory）７２、または記憶部７８に記憶されているプログラムに従って各種の処理を実行する。RAM（Random Access Memory）７３には、CPU７１が実行するプログラムやデータなどが適宜記憶される。これらのCPU７１、ROM７２、およびRAM７３は、バス７４により相互に接続されている。
【００６５】
CPU７１にはまた、バス７４を介して入出力インタフェース７５が接続されている。入出力インタフェース７５には、キーボード、マウス、マイクロホンなどよりなる入力部７６、ディスプレイ、スピーカなどよりなる出力部７７が接続されている。CPU７１は、入力部７６から入力される指令に対応して各種の処理を実行する。そして、CPU７１は、処理の結果得られた画像や音声等を出力部７７に出力する。
【００６６】
入出力インタフェース７５に接続されている記憶部７８は、例えばハードディスクなどで構成され、CPU７１が実行するプログラムや各種のデータを記憶する。通信部７９は、インターネット、その他のネットワークを介して外部の装置と通信する。この例の場合、通信部７９はセンサの出力を取り込む取得部として働く。
【００６７】
また、通信部７９を介してプログラムを取得し、記憶部７８に記憶してもよい。
【００６８】
入出力インタフェース７５に接続されているドライブ８０は、磁気ディスク９１、光ディスク９２、光磁気ディスク９３、または半導体メモリ９４などが装着されたとき、それらを駆動し、そこに記録されているプログラムやデータなどを取得する。取得されたプログラムやデータは、必要に応じて記憶部７８に転送され、記憶される。
【００６９】
図１１は、本発明に係る画像処理装置の機能の構成を示すブロック図である。
【００７０】
なお、画像処理装置の各機能をハードウェアで実現するか、ソフトウェアで実現するかは問わない。つまり、本明細書の各ブロック図は、ハードウェアのブロック図と考えても、ソフトウェアによる機能ブロック図と考えても良い。
【００７１】
ここで、動きボケとは、撮像の対象となる、現実世界におけるオブジェクトの動きと、センサの撮像の特性とにより生じる、動いているオブジェクトに対応する画像に含まれている歪みをいう。
【００７２】
この明細書では、撮像の対象となる、現実世界におけるオブジェクトに対応する画像を、画像オブジェクトと称する。
【００７３】
画像処理装置に供給された入力画像は、オブジェクト抽出部１０１、領域特定部１０３、混合比算出部１０４、および前景背景分離部１０５に供給される。
【００７４】
オブジェクト抽出部１０１は、入力画像に含まれる前景のオブジェクトに対応する画像オブジェクトを粗く抽出して、抽出した画像オブジェクトを動き検出部１０２に供給する。オブジェクト抽出部１０１は、例えば、入力画像に含まれる前景のオブジェクトに対応する画像オブジェクトの輪郭を検出することで、前景のオブジェクトに対応する画像オブジェクトを粗く抽出する。
【００７５】
オブジェクト抽出部１０１は、入力画像に含まれる背景のオブジェクトに対応する画像オブジェクトを粗く抽出して、抽出した画像オブジェクトを動き検出部１０２に供給する。オブジェクト抽出部１０１は、例えば、入力画像と、抽出された前景のオブジェクトに対応する画像オブジェクトとの差から、背景のオブジェクトに対応する画像オブジェクトを粗く抽出する。
【００７６】
また、例えば、オブジェクト抽出部１０１は、内部に設けられている背景メモリに記憶されている背景の画像と、入力画像との差から、前景のオブジェクトに対応する画像オブジェクト、および背景のオブジェクトに対応する画像オブジェクトを粗く抽出するようにしてもよい。
【００７７】
動き検出部１０２は、例えば、ブロックマッチング法、勾配法、位相相関法、およびペルリカーシブ法などの手法により、粗く抽出された前景のオブジェクトに対応する画像オブジェクトの動きベクトルを算出して、算出した動きベクトルおよび動きベクトルの位置情報（動きベクトルに対応する画素の位置を特定する情報）を領域特定部１０３および動きボケ除去部１０６に供給する。
【００７８】
動き検出部１０２が出力する動きベクトルには、動き量vに対応する情報が含まれている。
【００７９】
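Alternatively, for example, the motion detection unit 102 may output a motion vector for each image object to the motion blur removal unit 106, together with pixel position information that specifies the pixels of the image object.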
【００８０】
動き量vは、動いているオブジェクトに対応する画像の位置の変化を画素間隔を単位として表す値である。例えば、前景に対応するオブジェクトの画像が、あるフレームを基準として次のフレームにおいて４画素分離れた位置に表示されるように移動しているとき、前景に対応するオブジェクトの画像の動き量vは、４とされる。
【００８１】
領域特定部１０３は、入力された画像の画素のそれぞれを、前景領域、背景領域、または混合領域のいずれかに特定し、画素毎に前景領域、背景領域、または混合領域のいずれかに属するかを示す情報（以下、領域情報と称する）を混合比算出部１０４、前景背景分離部１０５、および動きボケ除去部１０６に供給する。前景領域、背景領域、または混合領域の詳細は、後述する。
【００８２】
混合比算出部１０４は、入力画像、および領域特定部１０３から供給された領域情報を基に、混合領域に含まれる画素に対応する混合比（以下、混合比αと称する）を算出して、算出した混合比を前景背景分離部１０５に供給する。
【００８３】
混合比αは、後述する式（３）に示されるように、画素値における、背景のオブジェクトに対応する画像の成分（以下、背景の成分とも称する）の割合を示す値である。
【００８４】
前景背景分離部１０５は、領域特定部１０３から供給された領域情報、および混合比算出部１０４から供給された混合比αを基に、前景のオブジェクトに対応する画像の成分（以下、前景の成分とも称する）のみから成る前景成分画像と、背景の成分のみから成る背景成分画像とに入力画像を分離して、前景成分画像を動きボケ除去部１０６に供給し、背景成分画像を補正部１０７に供給する。
【００８５】
動きボケ除去部１０６は、動きベクトルからわかる動き量vおよび領域情報を基に、前景成分画像に含まれる１以上の画素を示す処理単位を決定する。処理単位は、動きボケの量の調整の処理の対象となる１群の画素を指定するデータである。
【００８６】
動きボケ除去部１０６は、前景背景分離部１０５から供給された前景成分画像、動き検出部１０２から供給された動きベクトルおよびその位置情報、並びに処理単位を基に、前景成分画像に含まれる動きボケを除去して、動きボケを除去した前景成分画像を動きボケ除去画像処理部１０８に出力する。
【００８７】
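The correction unit 107 corrects the pixel values of the pixels corresponding to the mixed area in the background component image. These pixel values are calculated by removing the foreground components from the pixel values of the mixed-area pixels before separation. Consequently, compared with the pixel values of the adjacent background-area pixels, they are reduced in accordance with the mixing ratio α.
【００８８】
The correction unit 107 corrects this drop in gain, which corresponds to the mixing ratio α, in the pixel values of the pixels corresponding to the mixed area in the background component image, and supplies the corrected background component image to the motion-blur-removed image processing unit 108.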
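One way to picture the gain correction: after the foreground components are removed, a mixed-area pixel of the background component image holds only the fraction α of the full background value, so dividing by α restores the level of the neighboring background pixels. The text states only that the gain drop corresponding to α is corrected; division by α is an assumed form of that correction, and `correct_background` is a hypothetical name.

```python
def correct_background(component_value, alpha):
    """Undo the gain drop of a mixed-area background component.

    Assumption: the separated value equals alpha * B, so B = value / alpha.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1] for a mixed-area pixel")
    return component_value / alpha


# A pixel with alpha = 1/2 over a background of 80 holds 40 after separation.
restored = correct_background(40.0, 0.5)
```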
【００８９】
動きボケ除去画像処理部１０８は、クラス分類適応処理により、動きボケが除去された前景成分画像、および補正された背景成分画像を個々に処理する。
【００９０】
例えば、動きボケ除去画像処理部１０８は、動きボケが除去された前景成分画像、および補正された背景成分画像毎に、より高解像度の画像を生成するクラス分類適応処理で使用される係数を生成する。
【００９１】
例えば、動きボケ除去画像処理部１０８は、動きボケが除去された前景成分画像、および補正された背景成分画像毎にクラス分類適応処理を適用して、より高解像度の画像を創造する。
【００９２】
次に、図１２乃至図２７を参照して、画像処理装置に供給される入力画像について説明する。
【００９３】
図１２は、センサによる撮像を説明する図である。センサは、例えば、固体撮像素子であるCCD（Charge-Coupled Device）エリアセンサを備えたCCDビデオカメラなどで構成される。現実世界における、前景に対応するオブジェクトは、現実世界における、背景に対応するオブジェクトと、センサとの間を、例えば、図中の左側から右側に水平に移動する。
【００９４】
センサは、前景に対応するオブジェクトを、背景に対応するオブジェクトと共に撮像する。センサは、撮像した画像を１フレーム単位で出力する。例えば、センサは、１秒間に３０フレームから成る画像を出力する。センサの露光時間は、１／３０秒とすることができる。露光時間は、センサが入力された光の電荷への変換を開始してから、入力された光の電荷への変換を終了するまでの期間である。以下、露光時間をシャッタ時間とも称する。
【００９５】
図１３は、画素の配置を説明する図である。図１３中において、Ａ乃至Ｉは、個々の画素を示す。画素は、画像に対応する平面上に配置されている。１つの画素に対応する１つの検出素子は、センサ上に配置されている。センサが画像を撮像するとき、１つの検出素子は、画像を構成する１つの画素に対応する画素値を出力する。例えば、検出素子のＸ方向の位置は、画像上の横方向の位置に対応し、検出素子のＹ方向の位置は、画像上の縦方向の位置に対応する。
【００９６】
図１４に示すように、例えば、CCDである検出素子は、シャッタ時間に対応する期間、入力された光を電荷に変換して、変換された電荷を蓄積する。電荷の量は、入力された光の強さと、光が入力されている時間にほぼ比例する。検出素子は、シャッタ時間に対応する期間において、入力された光から変換された電荷を、既に蓄積されている電荷に加えていく。すなわち、検出素子は、シャッタ時間に対応する期間、入力される光を積分して、積分された光に対応する量の電荷を蓄積する。検出素子は、時間に対して、積分効果があるとも言える。
【００９７】
検出素子に蓄積された電荷は、図示せぬ回路により、電圧値に変換され、電圧値は更にデジタルデータなどの画素値に変換されて出力される。従って、センサから出力される個々の画素値は、前景または背景に対応するオブジェクトの空間的に広がりを有するある部分を、シャッタ時間について積分した結果である、１次元の空間に射影された値を有する。
【００９８】
画像処理装置は、このようなセンサの蓄積の動作により、出力信号に埋もれてしまった有意な情報、例えば、混合比αを抽出する。
【００９９】
図１５は、動いている前景に対応するオブジェクトと、静止している背景に対応するオブジェクトとを撮像して得られる画像を説明する図である。図１５（Ａ）は、動きを伴う前景に対応するオブジェクトと、静止している背景に対応するオブジェクトとを撮像して得られる画像を示している。図１５（Ａ）に示す例において、前景に対応するオブジェクトは、画面に対して水平に左から右に動いている。
【０１００】
図１５（Ｂ）は、図１５（Ａ）に示す画像の１つのラインに対応する画素値を時間方向に展開したモデル図である。図１５（Ｂ）の横方向は、図１５（Ａ）の空間方向Ｘに対応している。
【０１０１】
背景領域の画素は、背景の成分、すなわち、背景のオブジェクトに対応する画像の成分のみから、その画素値が構成されている。前景領域の画素は、前景の成分、すなわち、前景のオブジェクトに対応する画像の成分のみから、その画素値が構成されている。
【０１０２】
混合領域の画素は、背景の成分、および前景の成分から、その画素値が構成されている。混合領域は、背景の成分、および前景の成分から、その画素値が構成されているので、歪み領域ともいえる。混合領域は、更に、カバードバックグラウンド領域およびアンカバードバックグラウンド領域に分類される。
【０１０３】
カバードバックグラウンド領域は、前景領域に対して、前景のオブジェクトの進行方向の前端部に対応する位置の混合領域であり、時間の経過に対応して背景成分が前景に覆い隠される領域をいう。
【０１０４】
これに対して、アンカバードバックグラウンド領域は、前景領域に対して、前景のオブジェクトの進行方向の後端部に対応する位置の混合領域であり、時間の経過に対応して背景成分が現れる領域をいう。
【０１０５】
このように、前景領域、背景領域、またはカバードバックグラウンド領域若しくはアンカバードバックグラウンド領域を含む画像が、領域特定部１０３、混合比算出部１０４、および前景背景分離部１０５に入力画像として入力される。
【０１０６】
図１６は、以上のような、背景領域、前景領域、混合領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を説明する図である。図１５に示す画像に対応する場合、背景領域は、静止部分であり、前景領域は、動き部分であり、混合領域のカバードバックグラウンド領域は、背景から前景に変化する部分であり、混合領域のアンカバードバックグラウンド領域は、前景から背景に変化する部分である。
【０１０７】
図１７は、静止している前景に対応するオブジェクトおよび静止している背景に対応するオブジェクトを撮像した画像における、隣接して１列に並んでいる画素の画素値を時間方向に展開したモデル図である。例えば、隣接して１列に並んでいる画素として、画面の１つのライン上に並んでいる画素を選択することができる。
【０１０８】
図１７に示すF01乃至F04の画素値は、静止している前景のオブジェクトに対応する画素の画素値である。図１７に示すB01乃至B04の画素値は、静止している背景のオブジェクトに対応する画素の画素値である。
【０１０９】
図１７における縦方向は、図中の上から下に向かって時間が経過する。図１７中の矩形の上辺の位置は、センサが入力された光の電荷への変換を開始する時刻に対応し、図１７中の矩形の下辺の位置は、センサが入力された光の電荷への変換を終了する時刻に対応する。すなわち、図１７中の矩形の上辺から下辺までの距離は、シャッタ時間に対応する。
【０１１０】
以下において、シャッタ時間とフレーム間隔とが同一である場合を例に説明する。
【０１１１】
図１７における横方向は、図１５で説明した空間方向Xに対応する。より具体的には、図１７に示す例において、図１７中の”F01”と記載された矩形の左辺から”B04”と記載された矩形の右辺までの距離は、画素のピッチの８倍、すなわち、連続している８つの画素の間隔に対応する。
【０１１２】
前景のオブジェクトおよび背景のオブジェクトが静止している場合、シャッタ時間に対応する期間において、センサに入力される光は変化しない。
【０１１３】
ここで、シャッタ時間に対応する期間を２つ以上の同じ長さの期間に分割する。例えば、仮想分割数を４とすると、図１７に示すモデル図は、図１８に示すモデルとして表すことができる。仮想分割数は、前景に対応するオブジェクトのシャッタ時間内での動き量vなどに対応して設定される。例えば、４である動き量vに対応して、仮想分割数は、４とされ、シャッタ時間に対応する期間は４つに分割される。
【０１１４】
図中の最も上の行は、シャッタが開いて最初の、分割された期間に対応する。
図中の上から２番目の行は、シャッタが開いて２番目の、分割された期間に対応する。図中の上から３番目の行は、シャッタが開いて３番目の、分割された期間に対応する。図中の上から４番目の行は、シャッタが開いて４番目の、分割された期間に対応する。
【０１１５】
以下、動き量vに対応して分割されたシャッタ時間をシャッタ時間/vとも称する。
【０１１６】
前景に対応するオブジェクトが静止しているとき、センサに入力される光は変化しないので、前景の成分F01/vは、画素値F01を仮想分割数で除した値に等しい。同様に、前景に対応するオブジェクトが静止しているとき、前景の成分F02/vは、画素値F02を仮想分割数で除した値に等しく、前景の成分F03/vは、画素値F03を仮想分割数で除した値に等しく、前景の成分F04/vは、画素値F04を仮想分割数で除した値に等しい。
【０１１７】
背景に対応するオブジェクトが静止しているとき、センサに入力される光は変化しないので、背景の成分B01/vは、画素値B01を仮想分割数で除した値に等しい。同様に、背景に対応するオブジェクトが静止しているとき、背景の成分B02/vは、画素値B02を仮想分割数で除した値に等しく、B03/vは、画素値B03を仮想分割数で除した値に等しく、B04/vは、画素値B04を仮想分割数で除した値に等しい。
【０１１８】
すなわち、前景に対応するオブジェクトが静止している場合、シャッタ時間に対応する期間において、センサに入力される前景のオブジェクトに対応する光が変化しないので、シャッタが開いて最初の、シャッタ時間/vに対応する前景の成分F01/vと、シャッタが開いて２番目の、シャッタ時間/vに対応する前景の成分F01/vと、シャッタが開いて３番目の、シャッタ時間/vに対応する前景の成分F01/vと、シャッタが開いて４番目の、シャッタ時間/vに対応する前景の成分F01/vとは、同じ値となる。F02/v乃至F04/vも、F01/vと同様の関係を有する。
【０１１９】
背景に対応するオブジェクトが静止している場合、シャッタ時間に対応する期間において、センサに入力される背景のオブジェクトに対応する光は変化しないので、シャッタが開いて最初の、シャッタ時間/vに対応する背景の成分B01/vと、シャッタが開いて２番目の、シャッタ時間/vに対応する背景の成分B01/vと、シャッタが開いて３番目の、シャッタ時間/vに対応する背景の成分B01/vと、シャッタが開いて４番目の、シャッタ時間/vに対応する背景の成分B01/vとは、同じ値となる。B02/v乃至B04/vも、同様の関係を有する。
【０１２０】
次に、前景に対応するオブジェクトが移動し、背景に対応するオブジェクトが静止している場合について説明する。
【０１２１】
図１９は、前景に対応するオブジェクトが図中の右側に向かって移動する場合の、カバードバックグラウンド領域を含む、１つのライン上の画素の画素値を時間方向に展開したモデル図である。図１９において、前景の動き量vは、４である。１フレームは短い時間なので、前景に対応するオブジェクトが剛体であり、等速で移動していると仮定することができる。図１９において、前景に対応するオブジェクトの画像は、あるフレームを基準として次のフレームにおいて４画素分右側に表示されるように移動する。
【０１２２】
図１９において、最も左側の画素乃至左から４番目の画素は、前景領域に属する。図１９において、左から５番目乃至左から７番目の画素は、カバードバックグラウンド領域である混合領域に属する。図１９において、最も右側の画素は、背景領域に属する。
【０１２３】
前景に対応するオブジェクトが時間の経過と共に背景に対応するオブジェクトを覆い隠すように移動しているので、カバードバックグラウンド領域に属する画素の画素値に含まれる成分は、シャッタ時間に対応する期間のある時点で、背景の成分から、前景の成分に替わる。
【０１２４】
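For example, the pixel value M enclosed by the bold frame in Fig. 19 is expressed by equation (1).
【０１２５】
M=B02/v+B02/v+F07/v+F06/v （１）
【０１２６】
For example, the fifth pixel from the left contains a background component for one shutter time/v and foreground components for three shutter times/v, so its mixing ratio α is 1/4. The sixth pixel from the left contains background components for two shutter times/v and foreground components for two shutter times/v, so its mixing ratio α is 1/2. The seventh pixel from the left contains background components for three shutter times/v and a foreground component for one shutter time/v, so its mixing ratio α is 3/4.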
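The covered background model of Fig. 19 can be reproduced with a short simulation. The sketch assumes, as the text does, a rigid foreground moving right at constant speed with motion amount v = 4, so that the foreground edge advances one pixel per shutter time/v. Component labels are arbitrary indices chosen for illustration and do not follow the F01, B01 numbering of the figure; the function names are hypothetical.

```python
from fractions import Fraction


def covered_line_model(n_pixels, v, edge):
    """Return, per pixel, the v components accumulated during one shutter time.

    edge: index of the right-most pixel covered by the foreground during
    the first shutter time/v (0-based). Each entry is ("F", k) or ("B", k).
    """
    model = []
    for x in range(n_pixels):
        slots = []
        for t in range(v):
            if x <= edge + t:
                # The same foreground component shifts one pixel per slot.
                slots.append(("F", x - t))
            else:
                slots.append(("B", x))
        model.append(slots)
    return model


def mixing_ratio(slots):
    """alpha = share of background components in the pixel value."""
    background = sum(1 for kind, _ in slots if kind == "B")
    return Fraction(background, len(slots))


# Eight pixels as in Fig. 19: pixels 1-4 foreground, 5-7 mixed, 8 background.
model = covered_line_model(n_pixels=8, v=4, edge=3)
alphas = [mixing_ratio(slots) for slots in model]
```

The sixth pixel (index 5) comes out as two background slots followed by two different foreground components, which is the structure of equation (1).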
【０１２７】
前景に対応するオブジェクトが、剛体であり、前景の画像が次のフレームにおいて４画素右側に表示されるように等速で移動すると仮定できるので、例えば、図１９中の左から４番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F07/vは、図１９中の左から５番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F07/vは、図１９中の左から６番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１９中の左から７番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【０１２８】
前景に対応するオブジェクトが、剛体であり、前景の画像が次のフレームにおいて４画素右側に表示されるように等速で移動すると仮定できるので、例えば、図１９中の左から３番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分F06/vは、図１９中の左から４番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F06/vは、図１９中の左から５番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１９中の左から６番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【０１２９】
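Since the object corresponding to the foreground can be assumed to be a rigid body moving at constant speed such that the foreground image is displayed four pixels to the right in the next frame, the foreground component F05/v of the second pixel from the left in Fig. 19 for the first shutter time/v after the shutter opens is, for example, equal to the foreground component of the third pixel from the left in Fig. 19 for the second shutter time/v after the shutter opens. Similarly, the foreground component F05/v is equal to the foreground component of the fourth pixel from the left in Fig. 19 for the third shutter time/v after the shutter opens, and to the foreground component of the fifth pixel from the left in Fig. 19 for the fourth shutter time/v after the shutter opens.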
【０１３０】
前景に対応するオブジェクトが、剛体であり、前景の画像が次のフレームにおいて４画素右側に表示されるように等速で移動すると仮定できるので、例えば、図１９中の最も左側の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分F04/vは、図１９中の左から２番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F04/vは、図１９中の左から３番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１９中の左から４番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【０１３１】
動いているオブジェクトに対応する前景の領域は、このように動きボケを含むので、歪み領域とも言える。
【０１３２】
図２０は、前景が図中の右側に向かって移動する場合の、アンカバードバックグラウンド領域を含む、１つのライン上の画素の画素値を時間方向に展開したモデル図である。図２０において、前景の動き量vは、４である。１フレームは短い時間なので、前景に対応するオブジェクトが剛体であり、等速で移動していると仮定することができる。図２０において、前景に対応するオブジェクトの画像は、あるフレームを基準として次のフレームにおいて４画素分右側に移動する。
【０１３３】
図２０において、最も左側の画素乃至左から４番目の画素は、背景領域に属する。図２０において、左から５番目乃至左から７番目の画素は、アンカバードバックグラウンドである混合領域に属する。図２０において、最も右側の画素は、前景領域に属する。
【０１３４】
背景に対応するオブジェクトを覆っていた前景に対応するオブジェクトが時間の経過と共に背景に対応するオブジェクトの前から取り除かれるように移動しているので、アンカバードバックグラウンド領域に属する画素の画素値に含まれる成分は、シャッタ時間に対応する期間のある時点で、前景の成分から、背景の成分に替わる。
【０１３５】
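For example, the pixel value M' enclosed by the bold frame in Fig. 20 is expressed by equation (2).
【０１３６】
M'=F02/v+F01/v+B26/v+B26/v （２）
【０１３７】
For example, the fifth pixel from the left contains background components for three shutter times/v and a foreground component for one shutter time/v, so its mixing ratio α is 3/4. The sixth pixel from the left contains background components for two shutter times/v and foreground components for two shutter times/v, so its mixing ratio α is 1/2. The seventh pixel from the left contains a background component for one shutter time/v and foreground components for three shutter times/v, so its mixing ratio α is 1/4.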
【０１３８】
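Generalizing equations (1) and (2), the pixel value M is expressed by equation (3).
【０１３９】
【Equation 1】
$$M = \alpha \cdot B + \sum_{i} F_i / v \qquad （３）$$
Here, α is the mixing ratio, B is the pixel value of the background, and Fi/v is a foreground component.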
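Equation (3) can be checked numerically against equation (1). In the sketch below the pixel values assigned to B02, F07 and F06 are made-up numbers chosen only for illustration, and `mixed_pixel_value` is a hypothetical helper name.

```python
def mixed_pixel_value(alpha, background, foreground_components, v):
    """Equation (3): M = alpha * B + sum(F_i) / v."""
    return alpha * background + sum(foreground_components) / v


# Illustrative values (not from the source) for the pixel of equation (1).
v = 4
B02, F07, F06 = 100.0, 40.0, 60.0

# Equation (1), written out term by term.
m_direct = B02 / v + B02 / v + F07 / v + F06 / v

# Equation (3) with alpha = 1/2, since two of the four slots are background.
m_general = mixed_pixel_value(0.5, B02, [F07, F06], v)
```

The two background terms B02/v + B02/v of equation (1) collapse into α·B with α = 2/4, which is how the general form absorbs any number of background slots.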
【０１４０】
前景に対応するオブジェクトが剛体であり、等速で動くと仮定でき、かつ、動き量vが４であるので、例えば、図２０中の左から５番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F01/vは、図２０中の左から６番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、F01/vは、図２０中の左から７番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図２０中の左から８番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【０１４１】
前景に対応するオブジェクトが剛体であり、等速で動くと仮定でき、かつ、仮想分割数が４であるので、例えば、図２０中の左から６番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F02/vは、図２０中の左から７番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F02/vは、図２０中の左から８番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分に等しい。
【０１４２】
前景に対応するオブジェクトが剛体であり、等速で動くと仮定でき、かつ、動き量vが４であるので、例えば、図２０中の左から７番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F03/vは、図２０中の左から８番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。
【０１４３】
図１８乃至図２０の説明において、仮想分割数は、４であるとして説明したが、仮想分割数は、動き量vに対応する。動き量vは、一般に、前景に対応するオブジェクトの移動速度に対応する。例えば、前景に対応するオブジェクトが、あるフレームを基準として次のフレームにおいて４画素分右側に表示されるように移動しているとき、動き量vは、４とされる。動き量vに対応し、仮想分割数は、４とされる。同様に、例えば、前景に対応するオブジェクトが、あるフレームを基準として次のフレームにおいて６画素分左側に表示されるように移動しているとき、動き量vは、６とされ、仮想分割数は、６とされる。
【０１４４】
図２１および図２２に、以上で説明した、前景領域、背景領域、カバードバックグラウンド領域若しくはアンカバードバックグラウンド領域から成る混合領域と、分割されたシャッタ時間に対応する前景の成分および背景の成分との関係を示す。
【０１４５】
図２１は、静止している背景の前を移動しているオブジェクトに対応する前景を含む画像から、前景領域、背景領域、および混合領域の画素を抽出した例を示す。図２１に示す例において、前景に対応するオブジェクトは、画面に対して水平に移動している。
【０１４６】
フレーム#n+1は、フレーム#nの次のフレームであり、フレーム#n+2は、フレーム#n+1の次のフレームである。
【０１４７】
フレーム#n乃至フレーム#n+2のいずれかから抽出した、前景領域、背景領域、および混合領域の画素を抽出して、動き量vを４として、抽出された画素の画素値を時間方向に展開したモデルを図２２に示す。
【０１４８】
前景領域の画素値は、前景に対応するオブジェクトが移動するので、シャッタ時間/vの期間に対応する、４つの異なる前景の成分から構成される。例えば、図２２に示す前景領域の画素のうち最も左側に位置する画素は、F01/v,F02/v,F03/v、およびF04/vから構成される。すなわち、前景領域の画素は、動きボケを含んでいる。
【０１４９】
背景に対応するオブジェクトが静止しているので、シャッタ時間に対応する期間において、センサに入力される背景に対応する光は変化しない。この場合、背景領域の画素値は、動きボケを含まない。
【０１５０】
カバードバックグラウンド領域若しくはアンカバードバックグラウンド領域から成る混合領域に属する画素の画素値は、前景の成分と、背景の成分とから構成される。
【０１５１】
次に、オブジェクトに対応する画像が動いているとき、複数のフレームにおける、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデルについて説明する。例えば、オブジェクトに対応する画像が画面に対して水平に動いているとき、隣接して１列に並んでいる画素として、画面の１つのライン上に並んでいる画素を選択することができる。
【０１５２】
図２３は、静止している背景に対応するオブジェクトを撮像した画像の３つのフレームの、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデル図である。フレーム#nは、フレーム#n-1の次のフレームであり、フレーム#n+1は、フレーム#nの次のフレームである。他のフレームも同様に称する。
【０１５３】
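The pixel values B01 to B12 shown in Fig. 23 are the pixel values of the pixels corresponding to the stationary background object. Since the object corresponding to the background is stationary, the pixel values of corresponding pixels do not change from frame #n-1 to frame #n+1. For example, the pixel in frame #n and the pixel in frame #n+1 located at the position of the pixel having the pixel value B05 in frame #n-1 each have the pixel value B05.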
【０１５４】
図２４は、静止している背景に対応するオブジェクトと共に図中の右側に移動する前景に対応するオブジェクトを撮像した画像の３つのフレームの、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデル図である。図２４に示すモデルは、カバードバックグラウンド領域を含む。
【０１５５】
図２４において、前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、前景の動き量vは、４であり、仮想分割数は、４である。
【０１５６】
例えば、図２４中のフレーム#n-1の最も左側の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F12/vとなり、図２４中の左から２番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F12/vとなる。図２４中の左から３番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２４中の左から４番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F12/vとなる。
【０１５７】
図２４中のフレーム#n-1の最も左側の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分は、F11/vとなり、図２４中の左から２番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分も、F11/vとなる。図２４中の左から３番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F11/vとなる。
【０１５８】
図２４中のフレーム#n-1の最も左側の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分は、F10/vとなり、図２４中の左から２番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分も、F10/vとなる。図２４中のフレーム#n-1の最も左側の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F09/vとなる。
【０１５９】
背景に対応するオブジェクトが静止しているので、図２４中のフレーム#n-1の左から２番目の画素の、シャッタが開いて最初のシャッタ時間/vの背景の成分は、B01/vとなる。図２４中のフレーム#n-1の左から３番目の画素の、シャッタが開いて最初および２番目のシャッタ時間/vの背景の成分は、B02/vとなる。図２４中のフレーム#n-1の左から４番目の画素の、シャッタが開いて最初乃至３番目のシャッタ時間/vの背景の成分は、B03/vとなる。
【０１６０】
図２４中のフレーム#n-1において、最も左側の画素は、前景領域に属し、左側から２番目乃至４番目の画素は、カバードバックグラウンド領域である混合領域に属する。
【０１６１】
図２４中のフレーム#n-1の左から５番目の画素乃至１２番目の画素は、背景領域に属し、その画素値は、それぞれ、B04乃至B11となる。
【０１６２】
図２４中のフレーム#nの左から１番目の画素乃至５番目の画素は、前景領域に属する。フレーム#nの前景領域における、シャッタ時間/vの前景の成分は、F05/v乃至F12/vのいずれかである。
【０１６３】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図２４中のフレーム#nの左から５番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F12/vとなり、図２４中の左から６番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F12/vとなる。図２４中の左から７番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２４中の左から８番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F12/vとなる。
【０１６４】
図２４中のフレーム#nの左から５番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分は、F11/vとなり、図２４中の左から６番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分も、F11/vとなる。図２４中の左から７番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F11/vとなる。
【０１６５】
図２４中のフレーム#nの左から５番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分は、F10/vとなり、図２４中の左から６番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分も、F10/vとなる。図２４中のフレーム#nの左から５番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F09/vとなる。
【０１６６】
背景に対応するオブジェクトが静止しているので、図２４中のフレーム#nの左から６番目の画素の、シャッタが開いて最初のシャッタ時間/vの背景の成分は、B05/vとなる。図２４中のフレーム#nの左から７番目の画素の、シャッタが開いて最初および２番目のシャッタ時間/vの背景の成分は、B06/vとなる。図２４中のフレーム#nの左から８番目の画素の、シャッタが開いて最初乃至３番目の、シャッタ時間/vの背景の成分は、B07/vとなる。
【０１６７】
図２４中のフレーム#nにおいて、左側から６番目乃至８番目の画素は、カバードバックグラウンド領域である混合領域に属する。
【０１６８】
図２４中のフレーム#nの左から９番目の画素乃至１２番目の画素は、背景領域に属し、画素値は、それぞれ、B08乃至B11となる。
【０１６９】
図２４中のフレーム#n+1の左から１番目の画素乃至９番目の画素は、前景領域に属する。フレーム#n+1の前景領域における、シャッタ時間/vの前景の成分は、F01/v乃至F12/vのいずれかである。
【０１７０】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図２４中のフレーム#n+1の左から９番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F12/vとなり、図２４中の左から１０番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F12/vとなる。図２４中の左から１１番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２４中の左から１２番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F12/vとなる。
【０１７１】
図２４中のフレーム#n+1の左から９番目の画素の、シャッタが開いて２番目のシャッタ時間/vの期間の前景の成分は、F11/vとなり、図２４中の左から１０番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分も、F11/vとなる。図２４中の左から１１番目の画素の、シャッタが開いて４番目の、シャッタ時間/vの前景の成分は、F11/vとなる。
【０１７２】
図２４中のフレーム#n+1の左から９番目の画素の、シャッタが開いて３番目の、シャッタ時間/vの前景の成分は、F10/vとなり、図２４中の左から１０番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分も、F10/vとなる。図２４中のフレーム#n+1の左から９番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F09/vとなる。
【０１７３】
背景に対応するオブジェクトが静止しているので、図２４中のフレーム#n+1の左から１０番目の画素の、シャッタが開いて最初のシャッタ時間/vの背景の成分は、B09/vとなる。図２４中のフレーム#n+1の左から１１番目の画素の、シャッタが開いて最初および２番目のシャッタ時間/vの背景の成分は、B10/vとなる。図２４中のフレーム#n+1の左から１２番目の画素の、シャッタが開いて最初乃至３番目の、シャッタ時間/vの背景の成分は、B11/vとなる。
【０１７４】
図２４中のフレーム#n+1において、左側から１０番目乃至１２番目の画素は、カバードバックグラウンド領域である混合領域に対応する。
【０１７５】
図２５は、図２４に示す画素値から前景の成分を抽出した画像のモデル図である。
【０１７６】
図２６は、静止している背景と共に図中の右側に移動するオブジェクトに対応する前景を撮像した画像の３つのフレームの、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデル図である。図２６において、アンカバードバックグラウンド領域が含まれている。
【０１７７】
図２６において、前景に対応するオブジェクトは、剛体であり、かつ等速で移動していると仮定できる。前景に対応するオブジェクトが、次のフレームにおいて４画素分右側に表示されるように移動しているので、動き量vは、４である。
【０１７８】
例えば、図２６中のフレーム#n-1の最も左側の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分は、F13/vとなり、図２６中の左から２番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F13/vとなる。図２６中の左から３番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２６中の左から４番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F13/vとなる。
【０１７９】
図２６中のフレーム#n-1の左から２番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F14/vとなり、図２６中の左から３番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F14/vとなる。図２６中の左から３番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分は、F15/vとなる。
【０１８０】
背景に対応するオブジェクトが静止しているので、図２６中のフレーム#n-1の最も左側の画素の、シャッタが開いて２番目乃至４番目の、シャッタ時間/vの背景の成分は、B25/vとなる。図２６中のフレーム#n-1の左から２番目の画素の、シャッタが開いて３番目および４番目の、シャッタ時間/vの背景の成分は、B26/vとなる。図２６中のフレーム#n-1の左から３番目の画素の、シャッタが開いて４番目のシャッタ時間/vの背景の成分は、B27/vとなる。
【０１８１】
図２６中のフレーム#n-1において、最も左側の画素乃至３番目の画素は、アンカバードバックグラウンド領域である混合領域に属する。
【０１８２】
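The fourth to twelfth pixels from the left in frame #n-1 in Fig. 26 belong to the foreground area. The foreground components for shutter time/v in the foreground area of frame #n-1 are any of F13/v to F24/v.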
【０１８３】
図２６中のフレーム#nの最も左側の画素乃至左から４番目の画素は、背景領域に属し、画素値は、それぞれ、B25乃至B28となる。
【０１８４】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図２６中のフレーム#nの左から５番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F13/vとなり、図２６中の左から６番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F13/vとなる。図２６中の左から７番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２６中の左から８番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F13/vとなる。
【０１８５】
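In frame #n in Fig. 26, the foreground component of the sixth pixel from the left for the first shutter time/v after the shutter opens is F14/v, and the foreground component of the seventh pixel from the left in Fig. 26 for the second shutter time/v after the shutter opens is also F14/v. The foreground component of the seventh pixel from the left in Fig. 26 for the first shutter time/v after the shutter opens is F15/v.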
【０１８６】
背景に対応するオブジェクトが静止しているので、図２６中のフレーム#nの左から５番目の画素の、シャッタが開いて２番目乃至４番目のシャッタ時間/vの背景の成分は、B29/vとなる。図２６中のフレーム#nの左から６番目の画素の、シャッタが開いて３番目および４番目のシャッタ時間/vの背景の成分は、B30/vとなる。図２６中のフレーム#nの左から７番目の画素の、シャッタが開いて４番目のシャッタ時間/vの背景の成分は、B31/vとなる。
【０１８７】
図２６中のフレーム#nにおいて、左から５番目の画素乃至７番目の画素は、アンカバードバックグラウンド領域である混合領域に属する。
【０１８８】
図２６中のフレーム#nの左から８番目の画素乃至１２番目の画素は、前景領域に属する。フレーム#nの前景領域における、シャッタ時間/vの期間に対応する値は、F13/v乃至F20/vのいずれかである。
【０１８９】
図２６中のフレーム#n+1の最も左側の画素乃至左から８番目の画素は、背景領域に属し、画素値は、それぞれ、B25乃至B32となる。
【０１９０】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図２６中のフレーム#n+1の左から９番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F13/vとなり、図２６中の左から１０番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F13/vとなる。図２６中の左から１１番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２６中の左から１２番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F13/vとなる。
【０１９１】
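In frame #n+1 in Fig. 26, the foreground component of the tenth pixel from the left for the first shutter time/v after the shutter opens is F14/v, and the foreground component of the eleventh pixel from the left in Fig. 26 for the second shutter time/v after the shutter opens is also F14/v. The foreground component of the eleventh pixel from the left in Fig. 26 for the first shutter time/v after the shutter opens is F15/v.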
【０１９２】
背景に対応するオブジェクトが静止しているので、図２６中のフレーム#n+1の左から９番目の画素の、シャッタが開いて２番目乃至４番目の、シャッタ時間/vの背景の成分は、B33/vとなる。図２６中のフレーム#n+1の左から１０番目の画素の、シャッタが開いて３番目および４番目のシャッタ時間/vの背景の成分は、B34/vとなる。図２６中のフレーム#n+1の左から１１番目の画素の、シャッタが開いて４番目のシャッタ時間/vの背景の成分は、B35/vとなる。
【０１９３】
図２６中のフレーム#n+1において、左から９番目の画素乃至１１番目の画素は、アンカバードバックグラウンド領域である混合領域に属する。
【０１９４】
図２６中のフレーム#n+1の左から１２番目の画素は、前景領域に属する。フレーム#n+1の前景領域における、シャッタ時間/vの前景の成分は、F13/v乃至F16/vのいずれかである。
【０１９５】
図２７は、図２６に示す画素値から前景の成分を抽出した画像のモデル図である。
【０１９６】
図２８は、前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域に属する画素毎に分割された画像と、画素の画素値を時間方向に展開したモデル図との対応を示す図である。
【０１９７】
図２８に示すように、領域特定部１０３は、入力画像の前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を特定する。
【０１９８】
図２９は、前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景の成分、アンカバードバックグラウンド領域の前景の成分、およびアンカバードバックグラウンド領域の背景の成分に分離された入力画像と、画素の画素値を時間方向に展開したモデル図との対応を示す図である。
【０１９９】
図２９に示すように、入力画像は、領域特定部１０３により、前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を特定される。入力画像は、前景背景分離部１０５により、特定された前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域、および混合比算出部１０４により検出された混合比αを基に、前景領域の画像、カバードバックグラウンド領域の前景の成分、およびアンカバードバックグラウンド領域の前景の成分からなる前景成分画像、並びに背景領域の画像、カバードバックグラウンド領域の背景の成分、およびアンカバードバックグラウンド領域の背景の成分からなる背景成分画像に分離される。
【０２００】
分離された前景成分画像、および背景成分画像は、それぞれの画像毎に、処理される。
【０２０１】
前景背景分離部１０５は、入力画像を、領域情報および混合比αを基に、前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域の背景成分画像に分離するようにしてもよい。
【０２０２】
図３０は、前景領域、背景領域、および混合領域に分割された画像の例を示す図である。領域特定部１０３は、入力画像の、前景領域、背景領域、および混合領域を特定する。画像処理装置は、前景領域、背景領域、および混合領域を示す領域情報を基に、入力画像を、前景領域の画像、背景領域の画像、および混合領域の画像に分割することができる。
【０２０３】
図３１に示すように、前景背景分離部１０５は、領域特定部１０３から供給された領域情報、および混合比算出部１０４から供給された混合比αを基に、混合領域の画像を、混合領域の前景成分画像および混合領域の背景成分画像に分離する。
【０２０４】
図３２に示すように、分離された背景成分画像は、混合領域の画素値が補正され、分離された前景成分画像は、動きボケが除去される。
【０２０５】
図３３に示すように、入力画像は、領域に分割され、前景の成分と背景の成分とに分離される。分離された入力画像は、前景成分画像および背景成分画像に合成される。
【０２０６】
前景成分画像に含まれる動きボケは、除去される。背景成分画像は、混合領域に対応する画素値が補正される。
【０２０７】
動きボケが除去された前景成分画像、および補正された背景成分画像は、個々に処理される。
【０２０８】
図３４は、本発明に係る画像処理装置の画像の処理を説明するフローチャートである。
【０２０９】
ステップＳ１０１において、領域特定部１０３は、動き検出部１０２から供給された動きベクトルおよびその位置情報、並びに入力画像を基に、入力画像の前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を特定する。領域特定の処理の詳細は、後述する。
【０２１０】
ステップＳ１０２において、混合比算出部１０４は、領域特定部１０３から供給された領域情報および入力画像を基に、混合比αを算出する。混合比算出部１０４の混合比αを算出する処理の詳細は、後述する。
【０２１１】
ステップＳ１０３において、前景背景分離部１０５は、領域特定部１０３から供給された領域情報、および混合比算出部１０４から供給された混合比αを基に、入力画像を、前景の成分からなる前景成分画像、および背景の成分からなる背景成分画像に分離する。前景背景分離部１０５の画像の分離の処理の詳細は、後述する。
【０２１２】
ステップＳ１０４において、動きボケ除去部１０６は、動き検出部１０２から供給された動きベクトルおよびその位置情報、並びに領域特定部１０３から供給された領域情報を基に、前景背景分離部１０５から供給された前景成分画像の動きボケを除去する。
【０２１３】
ステップＳ１０５において、補正部１０７は、前景背景分離部１０５から供給された背景成分画像の混合領域に対応する画素値を補正する。
【０２１４】
ステップＳ１０６において、動きボケ除去画像処理部１０８は、動きボケが除去された前景成分画像、および補正された背景成分画像毎に、画像の処理を実行して、処理は終了する。動きボケ除去画像処理部１０８が実行する画像処理の詳細は、後述する。
【０２１５】
このように、本発明に係る画像処理装置は、入力画像を、前景成分画像および背景成分画像に分離し、前景成分画像から動きボケを除去して、動きボケが除去された前景成分画像、および背景成分画像毎に画像処理を実行する。
【０２１６】
以下、領域特定部１０３、混合比算出部１０４、前景背景分離部１０５、動きボケ除去部１０６、および動きボケ除去画像処理部１０８のそれぞれの構成について説明する。
【０２１７】
図３５は、領域特定部１０３の構成の一例を示すブロック図である。図３５に構成を示す領域特定部１０３は、動きベクトルを利用しない。フレームメモリ２０１は、入力された画像をフレーム単位で記憶する。フレームメモリ２０１は、処理の対象がフレーム#nであるとき、フレーム#nの２つ前のフレームであるフレーム#n-2、フレーム#nの１つ前のフレームであるフレーム#n-1、フレーム#n、フレーム#nの１つ後のフレームであるフレーム#n+1、およびフレーム#nの２つ後のフレームであるフレーム#n+2を記憶する。
【０２１８】
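The still/motion determination unit 202-1 reads from the frame memory 201 the pixel value of the pixel in frame #n+2 and the pixel value of the pixel in frame #n+1 that are located at the same image position as the pixel of frame #n targeted for area specification, and calculates the absolute value of their difference. The still/motion determination unit 202-1 determines whether this absolute difference is greater than a preset threshold Th and, if so, supplies a still/motion determination indicating motion to the area determination unit 203-1. If the absolute difference between the pixel value in frame #n+2 and the pixel value in frame #n+1 is less than or equal to the threshold Th, the still/motion determination unit 202-1 supplies a still/motion determination indicating stillness to the area determination unit 203-1.
【０２１９】
The still/motion determination unit 202-2 reads from the frame memory 201 the pixel value of the pixel in frame #n+1 located at the same image position as the pixel of frame #n targeted for area specification, and the pixel value of the target pixel in frame #n, and calculates the absolute value of their difference. If this absolute difference is greater than the preset threshold Th, the still/motion determination unit 202-2 supplies a still/motion determination indicating motion to the area determination units 203-1 and 203-2. If the absolute difference is less than or equal to the threshold Th, it supplies a still/motion determination indicating stillness to the area determination units 203-1 and 203-2.
【０２２０】
The still/motion determination unit 202-3 reads from the frame memory 201 the pixel value of the target pixel in frame #n and the pixel value of the pixel in frame #n-1 located at the same image position, and calculates the absolute value of their difference. If this absolute difference is greater than the preset threshold Th, the still/motion determination unit 202-3 supplies a still/motion determination indicating motion to the area determination units 203-2 and 203-3. If the absolute difference is less than or equal to the threshold Th, it supplies a still/motion determination indicating stillness to the area determination units 203-2 and 203-3.
【０２２１】
The still/motion determination unit 202-4 reads from the frame memory 201 the pixel value of the pixel in frame #n-1 and the pixel value of the pixel in frame #n-2 that are located at the same image position as the pixel of frame #n targeted for area specification, and calculates the absolute value of their difference. If this absolute difference is greater than the preset threshold Th, the still/motion determination unit 202-4 supplies a still/motion determination indicating motion to the area determination unit 203-3. If the absolute difference is less than or equal to the threshold Th, it supplies a still/motion determination indicating stillness to the area determination unit 203-3.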
【０２２２】
領域判定部２０３−１は、静動判定部２０２−１から供給された静動判定が静止を示し、かつ、静動判定部２０２−２から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素がアンカバードバックグラウンド領域に属すると判定し、領域の判定される画素に対応するアンカバードバックグラウンド領域判定フラグに、アンカバードバックグラウンド領域に属することを示す”１”を設定する。
【０２２３】
領域判定部２０３−１は、静動判定部２０２−１から供給された静動判定が動きを示すか、または、静動判定部２０２−２から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素がアンカバードバックグラウンド領域に属しないと判定し、領域の判定される画素に対応するアンカバードバックグラウンド領域判定フラグに、アンカバードバックグラウンド領域に属しないことを示す”０”を設定する。
【０２２４】
領域判定部２０３−１は、このように”１”または”０”が設定されたアンカバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０２２５】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が静止を示し、かつ、静動判定部２０２−３から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素が静止領域に属すると判定し、領域の判定される画素に対応する静止領域判定フラグに、静止領域に属することを示す”１”を設定する。
【０２２６】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が動きを示すか、または、静動判定部２０２−３から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素が静止領域に属しないと判定し、領域の判定される画素に対応する静止領域判定フラグに、静止領域に属しないことを示す”０”を設定する。
【０２２７】
領域判定部２０３−２は、このように”１”または”０”が設定された静止領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０２２８】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が動きを示し、かつ、静動判定部２０２−３から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素が動き領域に属すると判定し、領域の判定される画素に対応する動き領域判定フラグに、動き領域に属することを示す”１”を設定する。
【０２２９】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が静止を示すか、または、静動判定部２０２−３から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素が動き領域に属しないと判定し、領域の判定される画素に対応する動き領域判定フラグに、動き領域に属しないことを示す”０”を設定する。
【０２３０】
領域判定部２０３−２は、このように”１”または”０”が設定された動き領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０２３１】
領域判定部２０３−３は、静動判定部２０２−３から供給された静動判定が動きを示し、かつ、静動判定部２０２−４から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素がカバードバックグラウンド領域に属すると判定し、領域の判定される画素に対応するカバードバックグラウンド領域判定フラグに、カバードバックグラウンド領域に属することを示す”１”を設定する。
【０２３２】
領域判定部２０３−３は、静動判定部２０２−３から供給された静動判定が静止を示すか、または、静動判定部２０２−４から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素がカバードバックグラウンド領域に属しないと判定し、領域の判定される画素に対応するカバードバックグラウンド領域判定フラグに、カバードバックグラウンド領域に属しないことを示す”０”を設定する。
【０２３３】
領域判定部２０３−３は、このように”１”または”０”が設定されたカバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０２３４】
判定フラグ格納フレームメモリ２０４は、領域判定部２０３−１から供給されたアンカバードバックグラウンド領域判定フラグ、領域判定部２０３−２から供給された静止領域判定フラグ、領域判定部２０３−２から供給された動き領域判定フラグ、および領域判定部２０３−３から供給されたカバードバックグラウンド領域判定フラグをそれぞれ記憶する。
【０２３５】
判定フラグ格納フレームメモリ２０４は、記憶しているアンカバードバックグラウンド領域判定フラグ、静止領域判定フラグ、動き領域判定フラグ、およびカバードバックグラウンド領域判定フラグを合成部２０５に供給する。合成部２０５は、判定フラグ格納フレームメモリ２０４から供給された、アンカバードバックグラウンド領域判定フラグ、静止領域判定フラグ、動き領域判定フラグ、およびカバードバックグラウンド領域判定フラグを基に、各画素が、アンカバードバックグラウンド領域、静止領域、動き領域、およびカバードバックグラウンド領域のいずれかに属することを示す領域情報を生成し、判定フラグ格納フレームメモリ２０６に供給する。
【０２３６】
判定フラグ格納フレームメモリ２０６は、合成部２０５から供給された領域情報を記憶すると共に、記憶している領域情報を出力する。
【０２３７】
次に、領域特定部１０３の処理の例を図３６乃至図４０を参照して説明する。
【０２３８】
前景に対応するオブジェクトが移動しているとき、オブジェクトに対応する画像の画面上の位置は、フレーム毎に変化する。図３６に示すように、フレーム#nにおいて、Yn(x,y)で示される位置に位置するオブジェクトに対応する画像は、次のフレームであるフレーム#n+1において、Yn+1(x,y)に位置する。
【０２３９】
前景のオブジェクトに対応する画像の動き方向に隣接して１列に並ぶ画素の画素値を時間方向に展開したモデル図を図３７に示す。例えば、前景のオブジェクトに対応する画像の動き方向が画面に対して水平であるとき、図３７におけるモデル図は、１つのライン上の隣接する画素の画素値を時間方向に展開したモデルを示す。
【０２４０】
図３７において、フレーム#nにおけるラインは、フレーム#n+1におけるラインと同一である。
【０２４１】
フレーム#nにおいて、左から２番目の画素乃至１３番目の画素に含まれているオブジェクトに対応する前景の成分は、フレーム#n+1において、左から６番目乃至１７番目の画素に含まれる。
【０２４２】
フレーム#nにおいて、カバードバックグラウンド領域に属する画素は、左から１１番目乃至１３番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から２番目乃至４番目の画素である。フレーム#n+1において、カバードバックグラウンド領域に属する画素は、左から１５番目乃至１７番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から６番目乃至８番目の画素である。
【０２４３】
図３７に示す例において、フレーム#nに含まれる前景の成分が、フレーム#n+1において４画素移動しているので、動き量vは、４である。仮想分割数は、動き量vに対応し、４である。
【０２４４】
次に、注目しているフレームの前後における混合領域に属する画素の画素値の変化について説明する。
【０２４５】
図３８に示す、背景が静止し、前景の動き量vが４であるフレーム#nにおいて、カバードバックグラウンド領域に属する画素は、左から１５番目乃至１７番目の画素である。動き量vが４であるので、１つ前のフレーム#n-1において、左から１５番目乃至１７番目の画素は、背景の成分のみを含み、背景領域に属する。また、更に１つ前のフレーム#n-2において、左から１５番目乃至１７番目の画素は、背景の成分のみを含み、背景領域に属する。
【０２４６】
ここで、背景に対応するオブジェクトが静止しているので、フレーム#n-1の左から１５番目の画素の画素値は、フレーム#n-2の左から１５番目の画素の画素値から変化しない。同様に、フレーム#n-1の左から１６番目の画素の画素値は、フレーム#n-2の左から１６番目の画素の画素値から変化せず、フレーム#n-1の左から１７番目の画素の画素値は、フレーム#n-2の左から１７番目の画素の画素値から変化しない。
【０２４７】
すなわち、フレーム#nにおけるカバードバックグラウンド領域に属する画素に対応する、フレーム#n-1およびフレーム#n-2の画素は、背景の成分のみから成り、画素値が変化しないので、その差の絶対値は、ほぼ０の値となる。従って、フレーム#nにおける混合領域に属する画素に対応する、フレーム#n-1およびフレーム#n-2の画素に対する静動判定は、静動判定部２０２−４により、静止と判定される。
【０２４８】
フレーム#nにおけるカバードバックグラウンド領域に属する画素は、前景の成分を含むので、フレーム#n-1における背景の成分のみから成る場合と、画素値が異なる。従って、フレーム#nにおける混合領域に属する画素、および対応するフレーム#n-1の画素に対する静動判定は、静動判定部２０２−３により、動きと判定される。
【０２４９】
このように、領域判定部２０３−３は、静動判定部２０２−３から動きを示す静動判定の結果が供給され、静動判定部２０２−４から静止を示す静動判定の結果が供給されたとき、対応する画素がカバードバックグラウンド領域に属すると判定する。
【０２５０】
図３９に示す、背景が静止し、前景の動き量vが４であるフレーム#nにおいて、アンカバードバックグラウンド領域に含まれる画素は、左から２番目乃至４番目の画素である。動き量vが４であるので、１つ後のフレーム#n+1において、左から２番目乃至４番目の画素は、背景の成分のみを含み、背景領域に属する。また、更に１つ後のフレーム#n+2において、左から２番目乃至４番目の画素は、背景の成分のみを含み、背景領域に属する。
【０２５１】
ここで、背景に対応するオブジェクトが静止しているので、フレーム#n+2の左から２番目の画素の画素値は、フレーム#n+1の左から２番目の画素の画素値から変化しない。同様に、フレーム#n+2の左から３番目の画素の画素値は、フレーム#n+1の左から３番目の画素の画素値から変化せず、フレーム#n+2の左から４番目の画素の画素値は、フレーム#n+1の左から４番目の画素の画素値から変化しない。
【０２５２】
すなわち、フレーム#nにおけるアンカバードバックグラウンド領域に属する画素に対応する、フレーム#n+1およびフレーム#n+2の画素は、背景の成分のみから成り、画素値が変化しないので、その差の絶対値は、ほぼ０の値となる。従って、フレーム#nにおける混合領域に属する画素に対応する、フレーム#n+1およびフレーム#n+2の画素に対する静動判定は、静動判定部２０２−１により、静止と判定される。
【０２５３】
フレーム#nにおけるアンカバードバックグラウンド領域に属する画素は、前景の成分を含むので、フレーム#n+1における背景の成分のみから成る場合と、画素値が異なる。従って、フレーム#nにおける混合領域に属する画素、および対応するフレーム#n+1の画素に対する静動判定は、静動判定部２０２−２により、動きと判定される。
【０２５４】
このように、領域判定部２０３−１は、静動判定部２０２−２から動きを示す静動判定の結果が供給され、静動判定部２０２−１から静止を示す静動判定の結果が供給されたとき、対応する画素がアンカバードバックグラウンド領域に属すると判定する。
【０２５５】
図４０は、フレーム#nにおける領域特定部１０３の判定条件を示す図である。フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-2の画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素とが静止と判定され、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素と、フレーム#nの画素とが動きと判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素がカバードバックグラウンド領域に属すると判定する。
【０２５６】
フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素と、フレーム#nの画素とが静止と判定され、フレーム#nの画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素とが静止と判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素が静止領域に属すると判定する。
【０２５７】
フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素と、フレーム#nの画素とが動きと判定され、フレーム#nの画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素とが動きと判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素が動き領域に属すると判定する。
【０２５８】
フレーム#nの画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素とが動きと判定され、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+2の画素とが静止と判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素がアンカバードバックグラウンド領域に属すると判定する。
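The determination conditions of FIG. 40 can be sketched in Python as follows. This is a minimal illustration, not the apparatus itself: the function names `is_moving` and `classify_pixel`, the string labels, and the use of a Python set for the four flags are assumptions made for this example. As in the description, the four flags are set independently, so a pixel can carry more than one of them.

```python
def is_moving(a, b, th):
    """Stationary/moving determination: 'moving' when |a - b| exceeds Th."""
    return abs(a - b) > th


def classify_pixel(p_m2, p_m1, p_0, p_p1, p_p2, th):
    """Return the region flags raised for one pixel of frame #n.

    p_m2 .. p_p2 are the pixel values at the same image position in
    frames #n-2, #n-1, #n, #n+1 and #n+2 (hypothetical argument names).
    """
    mov_m2_m1 = is_moving(p_m2, p_m1, th)  # role of unit 202-4
    mov_m1_0 = is_moving(p_m1, p_0, th)    # role of unit 202-3
    mov_0_p1 = is_moving(p_0, p_p1, th)    # role of unit 202-2
    mov_p1_p2 = is_moving(p_p1, p_p2, th)  # role of unit 202-1

    flags = set()
    if not mov_m1_0 and not mov_0_p1:
        flags.add("still")        # stationary on both sides of frame #n
    if mov_m1_0 and mov_0_p1:
        flags.add("moving")       # moving on both sides of frame #n
    if not mov_m2_m1 and mov_m1_0:
        flags.add("covered")      # background until #n-1, then changed
    if mov_0_p1 and not mov_p1_p2:
        flags.add("uncovered")    # changed until #n, background afterwards
    return flags
```

For example, with a threshold of 5, the pixel values 10, 10, 40, 80, 120 over frames #n-2 to #n+2 raise both the moving-region flag and the covered-background flag.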
【０２５９】
図４１は、領域特定部１０３の領域の特定の結果の例を示す図である。図４１（Ａ）において、カバードバックグラウンド領域に属すると判定された画素は、白で表示されている。図４１（Ｂ）において、アンカバードバックグラウンド領域に属すると判定された画素は、白で表示されている。
【０２６０】
図４１（Ｃ）において、動き領域に属すると判定された画素は、白で表示されている。図４１（Ｄ）において、静止領域に属すると判定された画素は、白で表示されている。
【０２６１】
図４２は、判定フラグ格納フレームメモリ２０６が出力する領域情報の内、混合領域を示す領域情報を画像として示す図である。図４２において、カバードバックグラウンド領域またはアンカバードバックグラウンド領域に属すると判定された画素、すなわち混合領域に属すると判定された画素は、白で表示されている。判定フラグ格納フレームメモリ２０６が出力する混合領域を示す領域情報は、混合領域、および前景領域内のテクスチャの無い部分に囲まれたテクスチャの有る部分を示す。
【０２６２】
次に、図４３のフローチャートを参照して、領域特定部１０３の領域特定の処理を説明する。ステップＳ２０１において、フレームメモリ２０１は、判定の対象となるフレーム#nを含むフレーム#n-2乃至フレーム#n+2の画像を取得する。
【０２６３】
ステップＳ２０２において、静動判定部２０２−３は、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、静止か否かを判定し、静止と判定された場合、ステップＳ２０３に進み、静動判定部２０２−２は、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止か否かを判定する。
【０２６４】
ステップＳ２０３において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止と判定された場合、ステップＳ２０４に進み、領域判定部２０３−２は、領域の判定される画素に対応する静止領域判定フラグに、静止領域に属することを示す”１”を設定する。領域判定部２０３−２は、静止領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２０５に進む。
【０２６５】
ステップＳ２０２において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きと判定された場合、または、ステップＳ２０３において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きと判定された場合、フレーム#nの画素が静止領域には属さないので、ステップＳ２０４の処理はスキップされ、手続きは、ステップＳ２０５に進む。
【０２６６】
ステップＳ２０５において、静動判定部２０２−３は、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きか否かを判定し、動きと判定された場合、ステップＳ２０６に進み、静動判定部２０２−２は、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きか否かを判定する。
【０２６７】
ステップＳ２０６において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きと判定された場合、ステップＳ２０７に進み、領域判定部２０３−２は、領域の判定される画素に対応する動き領域判定フラグに、動き領域に属することを示す”１”を設定する。領域判定部２０３−２は、動き領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２０８に進む。
【０２６８】
ステップＳ２０５において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、静止と判定された場合、または、ステップＳ２０６において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止と判定された場合、フレーム#nの画素が動き領域には属さないので、ステップＳ２０７の処理はスキップされ、手続きは、ステップＳ２０８に進む。
【０２６９】
ステップＳ２０８において、静動判定部２０２−４は、フレーム#n-2の画素とフレーム#n-1の同一位置の画素とで、静止か否かを判定し、静止と判定された場合、ステップＳ２０９に進み、静動判定部２０２−３は、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きか否かを判定する。
【０２７０】
ステップＳ２０９において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きと判定された場合、ステップＳ２１０に進み、領域判定部２０３−３は、領域の判定される画素に対応するカバードバックグラウンド領域判定フラグに、カバードバックグラウンド領域に属することを示す”１”を設定する。領域判定部２０３−３は、カバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２１１に進む。
【０２７１】
ステップＳ２０８において、フレーム#n-2の画素とフレーム#n-1の同一位置の画素とで、動きと判定された場合、または、ステップＳ２０９において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、静止と判定された場合、フレーム#nの画素がカバードバックグラウンド領域には属さないので、ステップＳ２１０の処理はスキップされ、手続きは、ステップＳ２１１に進む。
【０２７２】
ステップＳ２１１において、静動判定部２０２−２は、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きか否かを判定し、動きと判定された場合、ステップＳ２１２に進み、静動判定部２０２−１は、フレーム#n+1の画素とフレーム#n+2の同一位置の画素とで、静止か否かを判定する。
【０２７３】
ステップＳ２１２において、フレーム#n+1の画素とフレーム#n+2の同一位置の画素とで、静止と判定された場合、ステップＳ２１３に進み、領域判定部２０３−１は、領域の判定される画素に対応するアンカバードバックグラウンド領域判定フラグに、アンカバードバックグラウンド領域に属することを示す”１”を設定する。領域判定部２０３−１は、アンカバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２１４に進む。
【０２７４】
ステップＳ２１１において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止と判定された場合、または、ステップＳ２１２において、フレーム#n+1の画素とフレーム#n+2の同一位置の画素とで、動きと判定された場合、フレーム#nの画素がアンカバードバックグラウンド領域には属さないので、ステップＳ２１３の処理はスキップされ、手続きは、ステップＳ２１４に進む。
【０２７５】
ステップＳ２１４において、領域特定部１０３は、フレーム#nの全ての画素について領域を特定したか否かを判定し、フレーム#nの全ての画素について領域を特定していないと判定された場合、手続きは、ステップＳ２０２に戻り、他の画素について、領域特定の処理を繰り返す。
【０２７６】
ステップＳ２１４において、フレーム#nの全ての画素について領域を特定したと判定された場合、ステップＳ２１５に進み、合成部２０５は、判定フラグ格納フレームメモリ２０４に記憶されているアンカバードバックグラウンド領域判定フラグ、およびカバードバックグラウンド領域判定フラグを基に、混合領域を示す領域情報を生成し、更に、各画素が、アンカバードバックグラウンド領域、静止領域、動き領域、およびカバードバックグラウンド領域のいずれかに属することを示す領域情報を生成し、生成した領域情報を判定フラグ格納フレームメモリ２０６に設定し、処理は終了する。
【０２７７】
このように、領域特定部１０３は、フレームに含まれている画素のそれぞれについて、動き領域、静止領域、アンカバードバックグラウンド領域、またはカバードバックグラウンド領域に属することを示す領域情報を生成することができる。
【０２７８】
なお、領域特定部１０３は、アンカバードバックグラウンド領域およびカバードバックグラウンド領域に対応する領域情報に論理和を適用することにより、混合領域に対応する領域情報を生成して、フレームに含まれている画素のそれぞれについて、動き領域、静止領域、または混合領域に属することを示すフラグから成る領域情報を生成するようにしてもよい。
【０２７９】
前景に対応するオブジェクトがテクスチャを有す場合、領域特定部１０３は、より正確に動き領域を特定することができる。
【０２８０】
領域特定部１０３は、動き領域を示す領域情報を前景領域を示す領域情報として、また、静止領域を示す領域情報を背景領域を示す領域情報として出力することができる。
【０２８１】
なお、背景に対応するオブジェクトが静止しているとして説明したが、背景領域に対応する画像が動きを含んでいても上述した領域を特定する処理を適用することができる。例えば、背景領域に対応する画像が一様に動いているとき、領域特定部１０３は、この動きに対応して画像全体をシフトさせ、背景に対応するオブジェクトが静止している場合と同様に処理する。また、背景領域に対応する画像が局所毎に異なる動きを含んでいるとき、領域特定部１０３は、動きに対応した画素を選択して、上述の処理を実行する。
【０２８２】
図４４は、領域特定部１０３の構成の他の例を示すブロック図である。図４４に示す領域特定部１０３は、動きベクトルを使用しない。背景画像生成部３０１は、入力画像に対応する背景画像を生成し、生成した背景画像を２値オブジェクト画像抽出部３０２に供給する。背景画像生成部３０１は、例えば、入力画像に含まれる背景のオブジェクトに対応する画像オブジェクトを抽出して、背景画像を生成する。
【０２８３】
前景のオブジェクトに対応する画像の動き方向に隣接して１列に並ぶ画素の画素値を時間方向に展開したモデル図の例を図４５に示す。例えば、前景のオブジェクトに対応する画像の動き方向が画面に対して水平であるとき、図４５におけるモデル図は、１つのライン上の隣接する画素の画素値を時間方向に展開したモデルを示す。
【０２８４】
図４５において、フレーム#nにおけるラインは、フレーム#n-1およびフレーム#n+1におけるラインと同一である。
【０２８５】
フレーム#nにおいて、左から６番目の画素乃至１７番目の画素に含まれているオブジェクトに対応する前景の成分は、フレーム#n-1において、左から２番目乃至１３番目の画素に含まれ、フレーム#n+1において、左から１０番目乃至２１番目の画素に含まれる。
【０２８６】
フレーム#n-1において、カバードバックグラウンド領域に属する画素は、左から１１番目乃至１３番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から２番目乃至４番目の画素である。フレーム#nにおいて、カバードバックグラウンド領域に属する画素は、左から１５番目乃至１７番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から６番目乃至８番目の画素である。フレーム#n+1において、カバードバックグラウンド領域に属する画素は、左から１９番目乃至２１番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から１０番目乃至１２番目の画素である。
【０２８７】
フレーム#n-1において、背景領域に属する画素は、左から１番目の画素、および左から１４番目乃至２１番目の画素である。フレーム#nにおいて、背景領域に属する画素は、左から１番目乃至５番目の画素、および左から１８番目乃至２１番目の画素である。フレーム#n+1において、背景領域に属する画素は、左から１番目乃至９番目の画素である。
【０２８８】
背景画像生成部３０１が生成する、図４５の例に対応する背景画像の例を図４６に示す。背景画像は、背景のオブジェクトに対応する画素から構成され、前景のオブジェクトに対応する画像の成分を含まない。
【０２８９】
２値オブジェクト画像抽出部３０２は、背景画像および入力画像の相関を基に、２値オブジェクト画像を生成し、生成した２値オブジェクト画像を時間変化検出部３０３に供給する。
【０２９０】
図４７は、２値オブジェクト画像抽出部３０２の構成を示すブロック図である。相関値演算部３２１は、背景画像生成部３０１から供給された背景画像および入力画像の相関を演算し、相関値を生成して、生成した相関値をしきい値処理部３２２に供給する。
【０２９１】
相関値演算部３２１は、例えば、図４８（Ａ）に示すように、Ｘ₄を中心とした３×３の背景画像の中のブロックと、図４８（Ｂ）に示すように、背景画像の中のブロックに対応するＹ₄を中心とした３×３の入力画像の中のブロックに、式（４）を適用して、Ｙ₄に対応する相関値を算出する。
【０２９２】
【数２】

【数３】

【数４】

【０２９３】
相関値演算部３２１は、このように各画素に対応して算出された相関値をしきい値処理部３２２に供給する。
【０２９４】
また、相関値演算部３２１は、例えば、図４９（Ａ）に示すように、Ｘ₄を中心とした３×３の背景画像の中のブロックと、図４９（Ｂ）に示すように、背景画像の中のブロックに対応するＹ₄を中心とした３×３の入力画像の中のブロックに、式（７）を適用して、Ｙ₄に対応する差分絶対値を算出するようにしてもよい。
【０２９５】
【数５】

【０２９６】
相関値演算部３２１は、このように算出された差分絶対値を相関値として、しきい値処理部３２２に供給する。
【０２９７】
しきい値処理部３２２は、相関画像の画素値としきい値th0とを比較して、相関値がしきい値th0以下である場合、２値オブジェクト画像の画素値に1を設定し、相関値がしきい値th0より大きい場合、２値オブジェクト画像の画素値に0を設定して、0または1が画素値に設定された２値オブジェクト画像を出力する。しきい値処理部３２２は、しきい値th0を予め記憶するようにしてもよく、または、外部から入力されたしきい値th0を使用するようにしてもよい。
【０２９８】
図５０は、図４５に示す入力画像のモデルに対応する２値オブジェクト画像の例を示す図である。２値オブジェクト画像において、背景画像と相関の高い画素には、画素値に0が設定される。
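The correlation and thresholding steps can be sketched as follows. Equation (4) is not reproduced in this text, so the sketch assumes it is a normalized cross-correlation over the 3×3 blocks centred on X₄ and Y₄; that form, the function names, and the handling of flat blocks are all assumptions of this example rather than statements about the apparatus.

```python
import math


def block_correlation(xs, ys):
    """Normalized cross-correlation of two equally sized pixel blocks.

    xs: block of the background image, ys: block of the input image.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = math.sqrt(sum((x - mean_x) ** 2 for x in xs)
                    * sum((y - mean_y) ** 2 for y in ys))
    # Flat blocks have zero variance; returning 0.0 here is an arbitrary
    # choice made for this sketch.
    return num / den if den else 0.0


def binary_object_pixel(bg_block, in_block, th0):
    """1 when the correlation is th0 or less (object), otherwise 0."""
    return 1 if block_correlation(bg_block, in_block) <= th0 else 0
```

A block that matches the background yields a correlation close to 1 and therefore the pixel value 0, which agrees with the statement that pixels highly correlated with the background image are set to 0.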
【０２９９】
図５１は、時間変化検出部３０３の構成を示すブロック図である。フレームメモリ３４１は、フレーム#nの画素について領域を判定するとき、２値オブジェクト画像抽出部３０２から供給された、フレーム#n-1、フレーム#n、およびフレーム#n+1の２値オブジェクト画像を記憶する。
【０３００】
領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#n-1、フレーム#n、およびフレーム#n+1の２値オブジェクト画像を基に、フレーム#nの各画素について領域を判定して、領域情報を生成し、生成した領域情報を出力する。
【０３０１】
図５２は、領域判定部３４２の判定を説明する図である。フレーム#nの２値オブジェクト画像の注目している画素が0であるとき、領域判定部３４２は、フレーム#nの注目している画素が背景領域に属すると判定する。
【０３０２】
フレーム#nの２値オブジェクト画像の注目している画素が1であり、フレーム#n-1の２値オブジェクト画像の対応する画素が1であり、フレーム#n+1の２値オブジェクト画像の対応する画素が1であるとき、領域判定部３４２は、フレーム#nの注目している画素が前景領域に属すると判定する。
【０３０３】
フレーム#nの２値オブジェクト画像の注目している画素が1であり、フレーム#n-1の２値オブジェクト画像の対応する画素が0であるとき、領域判定部３４２は、フレーム#nの注目している画素がカバードバックグラウンド領域に属すると判定する。
【０３０４】
フレーム#nの２値オブジェクト画像の注目している画素が1であり、フレーム#n+1の２値オブジェクト画像の対応する画素が0であるとき、領域判定部３４２は、フレーム#nの注目している画素がアンカバードバックグラウンド領域に属すると判定する。
【０３０５】
図５３は、図４５に示す入力画像のモデルに対応する２値オブジェクト画像について、時間変化検出部３０３の判定した例を示す図である。時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの対応する画素が0なので、フレーム#nの左から１番目乃至５番目の画素を背景領域に属すると判定する。
【０３０６】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの画素が1であり、フレーム#n+1の対応する画素が0なので、左から６番目乃至９番目の画素をアンカバードバックグラウンド領域に属すると判定する。
【０３０７】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの画素が1であり、フレーム#n-1の対応する画素が1であり、フレーム#n+1の対応する画素が1なので、左から１０番目乃至１３番目の画素を前景領域に属すると判定する。
【０３０８】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの画素が1であり、フレーム#n-1の対応する画素が0なので、左から１４番目乃至１７番目の画素をカバードバックグラウンド領域に属すると判定する。
【０３０９】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの対応する画素が0なので、左から１８番目乃至２１番目の画素を背景領域に属すると判定する。
【０３１０】
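Next, the region specifying processing of the region specifying unit 103 is described with reference to the flowchart of FIG. 54. In step S301, the background image generation unit 301 of the region specifying unit 103 generates a background image from the input image, for example by extracting the image object corresponding to the background object contained in the input image, and supplies the generated background image to the binary object image extraction unit 302.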
【０３１１】
ステップＳ３０２において、２値オブジェクト画像抽出部３０２は、例えば、図４８を参照して説明した演算により、入力画像と背景画像生成部３０１から供給された背景画像との相関値を演算する。ステップＳ３０３において、２値オブジェクト画像抽出部３０２は、例えば、相関値としきい値th0とを比較することにより、相関値およびしきい値th0から２値オブジェクト画像を演算する。
【０３１２】
ステップＳ３０４において、時間変化検出部３０３は、領域判定の処理を実行して、処理は終了する。
【０３１３】
図５５のフローチャートを参照して、ステップＳ３０４に対応する領域判定の処理の詳細を説明する。ステップＳ３２１において、時間変化検出部３０３の領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#nにおいて、注目する画素が0であるか否かを判定し、フレーム#nにおいて、注目する画素が0であると判定された場合、ステップＳ３２２に進み、フレーム#nの注目する画素が背景領域に属すると設定して、処理は終了する。
【０３１４】
ステップＳ３２１において、フレーム#nにおいて、注目する画素が1であると判定された場合、ステップＳ３２３に進み、時間変化検出部３０３の領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n-1において、対応する画素が0であるか否かを判定し、フレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n-1において、対応する画素が0であると判定された場合、ステップＳ３２４に進み、フレーム#nの注目する画素がカバードバックグラウンド領域に属すると設定して、処理は終了する。
【０３１５】
ステップＳ３２３において、フレーム#nにおいて、注目する画素が0であるか、または、フレーム#n-1において、対応する画素が1であると判定された場合、ステップＳ３２５に進み、時間変化検出部３０３の領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n+1において、対応する画素が0であるか否かを判定し、フレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n+1において、対応する画素が0であると判定された場合、ステップＳ３２６に進み、フレーム#nの注目する画素がアンカバードバックグラウンド領域に属すると設定して、処理は終了する。
【０３１６】
ステップＳ３２５において、フレーム#nにおいて、注目する画素が0であるか、または、フレーム#n+1において、対応する画素が１であると判定された場合、ステップＳ３２７に進み、時間変化検出部３０３の領域判定部３４２は、フレーム#nの注目する画素を前景領域と設定して、処理は終了する。
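The decision order of steps S321 to S327 can be sketched as a small function. The name `decide_region`, the helper `bits`, and the string labels are hypothetical; the example input reproduces the binary object images of the model in FIG. 45 (object at pixels 2 to 13 in frame #n-1, 6 to 17 in frame #n, and 10 to 21 in frame #n+1).

```python
def decide_region(prev_bit, cur_bit, next_bit):
    """Region of one pixel of frame #n from three binary object images."""
    if cur_bit == 0:
        return "background"   # steps S321, S322
    if prev_bit == 0:
        return "covered"      # steps S323, S324
    if next_bit == 0:
        return "uncovered"    # steps S325, S326
    return "foreground"       # step S327


def bits(first, last, width=21):
    """Binary line with 1 at pixels first..last (1-based, inclusive)."""
    return [1 if first <= i <= last else 0 for i in range(1, width + 1)]


prev_img, cur_img, next_img = bits(2, 13), bits(6, 17), bits(10, 21)
regions = [decide_region(p, c, n)
           for p, c, n in zip(prev_img, cur_img, next_img)]
```

With this input, pixels 1 to 5 and 18 to 21 are background, 6 to 9 are uncovered background, 10 to 13 are foreground, and 14 to 17 are covered background, matching the result described for FIG. 53.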
【０３１７】
このように、領域特定部１０３は、入力された画像と対応する背景画像との相関値を基に、入力画像の画素が前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域のいずれかに属するかを特定して、特定した結果に対応する領域情報を生成することができる。
【０３１８】
図５６は、領域特定部１０３の他の構成を示すブロック図である。図５６に示す領域特定部１０３は、動き検出部１０２から供給される動きベクトルとその位置情報を使用する。図４４に示す場合と同様の部分には、同一の番号を付してあり、その説明は省略する。
【０３１９】
ロバスト化部３６１は、２値オブジェクト画像抽出部３０２から供給された、Ｎ個のフレームの２値オブジェクト画像を基に、ロバスト化された２値オブジェクト画像を生成して、時間変化検出部３０３に出力する。
【０３２０】
図５７は、ロバスト化部３６１の構成を説明するブロック図である。動き補償部３８１は、動き検出部１０２から供給された動きベクトルとその位置情報を基に、Ｎ個のフレームの２値オブジェクト画像の動きを補償して、動きが補償された２値オブジェクト画像をスイッチ３８２に出力する。
【０３２１】
図５８および図５９の例を参照して、動き補償部３８１の動き補償について説明する。例えば、フレーム#nの領域を判定するとき、図５８に例を示すフレーム#n-1、フレーム#n、およびフレーム#n+1の２値オブジェクト画像が入力された場合、動き補償部３８１は、動き検出部１０２から供給された動きベクトルを基に、図５９に例を示すように、フレーム#n-1の２値オブジェクト画像、およびフレーム#n+1の２値オブジェクト画像を動き補償して、動き補償された２値オブジェクト画像をスイッチ３８２に供給する。
【０３２２】
スイッチ３８２は、１番目のフレームの動き補償された２値オブジェクト画像をフレームメモリ３８３−１に出力し、２番目のフレームの動き補償された２値オブジェクト画像をフレームメモリ３８３−２に出力する。同様に、スイッチ３８２は、３番目乃至Ｎ−１番目のフレームの動き補償された２値オブジェクト画像のそれぞれをフレームメモリ３８３−３乃至フレームメモリ３８３−（Ｎ−１）のいずれかに出力し、Ｎ番目のフレームの動き補償された２値オブジェクト画像をフレームメモリ３８３−Ｎに出力する。
【０３２３】
フレームメモリ３８３−１は、１番目のフレームの動き補償された２値オブジェクト画像を記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−１に出力する。フレームメモリ３８３−２は、２番目のフレームの動き補償された２値オブジェクト画像を記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−２に出力する。
【０３２４】
同様に、フレームメモリ３８３−３乃至フレームメモリ３８３−（Ｎ−１）のそれぞれは、３番目のフレーム乃至Ｎ−１番目のフレームの動き補償された２値オブジェクト画像のいずれかを記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−３乃至重み付け部３８４−（Ｎ−１）のいずれかに出力する。フレームメモリ３８３−Ｎは、Ｎ番目のフレームの動き補償された２値オブジェクト画像を記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−Ｎに出力する。
【０３２５】
重み付け部３８４−１は、フレームメモリ３８３−１から供給された１番目のフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みw1を乗じて、積算部３８５に供給する。重み付け部３８４−２は、フレームメモリ３８３−２から供給された２番目のフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みw2を乗じて、積算部３８５に供給する。
【０３２６】
同様に、重み付け部３８４−３乃至重み付け部３８４−（Ｎ−１）のそれぞれは、フレームメモリ３８３−３乃至フレームメモリ３８３−（Ｎ−１）のいずれかから供給された３番目乃至Ｎ−１番目のいずれかのフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みw3乃至重みw(N-1)のいずれかを乗じて、積算部３８５に供給する。重み付け部３８４−Ｎは、フレームメモリ３８３−Ｎから供給されたＮ番目のフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みwNを乗じて、積算部３８５に供給する。
【０３２７】
積算部３８５は、１乃至Ｎ番目のフレームの動き補償され、それぞれ重みw1乃至wNのいずれかが乗じられた、２値オブジェクト画像の対応する画素値を積算して、積算された画素値を予め定めたしきい値th0と比較することにより２値オブジェクト画像を生成する。
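The weighting and accumulation performed by the weighting units 384-1 to 384-N and the accumulation unit 385 can be sketched as below. The text does not state the direction of the threshold comparison, so treating "accumulated value at or above the threshold" as 1 is an assumption, as are the function and parameter names.

```python
def robust_binary_image(frames, weights, threshold):
    """Combine N motion-compensated binary images into one.

    frames:    list of N binary images, each a list of 0/1 values
    weights:   list of N weights (w1 .. wN)
    threshold: value compared with the weighted sum (assumed: >= gives 1)
    """
    width = len(frames[0])
    result = []
    for x in range(width):
        # Weighted sum of the corresponding pixel over all N frames.
        acc = sum(w * frame[x] for w, frame in zip(weights, frames))
        result.append(1 if acc >= threshold else 0)
    return result
```

With equal weights and a threshold of 2, three frames behave like a majority vote, so a pixel set to 1 in only one frame by noise is suppressed.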
【０３２８】
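In this way, the robustification unit 361 generates a robust binary object image from the N binary object images and supplies it to the temporal change detection unit 303. Consequently, even when the input image contains noise, the region specifying unit 103 configured as shown in FIG. 56 can specify regions more accurately than the configuration shown in FIG. 44.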
【０３２９】
次に、図５６に構成を示す領域特定部１０３の領域特定の処理について、図６０のフローチャートを参照して説明する。ステップＳ３４１乃至ステップＳ３４３の処理は、図５４のフローチャートで説明したステップＳ３０１乃至ステップＳ３０３とそれぞれ同様なのでその説明は省略する。
【０３３０】
ステップＳ３４４において、ロバスト化部３６１は、ロバスト化の処理を実行する。
【０３３１】
ステップＳ３４５において、時間変化検出部３０３は、領域判定の処理を実行して、処理は終了する。ステップＳ３４５の処理の詳細は、図５５のフローチャートを参照して説明した処理と同様なのでその説明は省略する。
【０３３２】
次に、図６１のフローチャートを参照して、図６０のステップＳ３４４の処理に対応する、ロバスト化の処理の詳細について説明する。ステップＳ３６１において、動き補償部３８１は、動き検出部１０２から供給される動きベクトルとその位置情報を基に、入力された２値オブジェクト画像の動き補償の処理を実行する。ステップＳ３６２において、フレームメモリ３８３−１乃至３８３−Ｎのいずれかは、スイッチ３８２を介して供給された動き補償された２値オブジェクト画像を記憶する。
【０３３３】
ステップＳ３６３において、ロバスト化部３６１は、Ｎ個の２値オブジェクト画像が記憶されたか否かを判定し、Ｎ個の２値オブジェクト画像が記憶されていないと判定された場合、ステップＳ３６１に戻り、２値オブジェクト画像の動き補償の処理および２値オブジェクト画像の記憶の処理を繰り返す。
【０３３４】
ステップＳ３６３において、Ｎ個の２値オブジェクト画像が記憶されたと判定された場合、ステップＳ３６４に進み、重み付け部３８４−１乃至３８４−Ｎのそれぞれは、Ｎ個の２値オブジェクト画像のそれぞれにw1乃至wNのいずれかの重みを乗じて、重み付けする。
【０３３５】
ステップＳ３６５において、積算部３８５は、重み付けされたＮ個の２値オブジェクト画像を積算する。
【０３３６】
ステップＳ３６６において、積算部３８５は、例えば、予め定められたしきい値th1との比較などにより、積算された画像から２値オブジェクト画像を生成して、処理は終了する。
【０３３７】
このように、図５６に構成を示す領域特定部１０３は、ロバスト化された２値オブジェクト画像を基に、領域情報を生成することができる。
【０３３８】
以上のように、領域特定部１０３は、フレームに含まれている画素のそれぞれについて、動き領域、静止領域、アンカバードバックグラウンド領域、またはカバードバックグラウンド領域に属することを示す領域情報を生成することができる。
【０３３９】
図６２は、混合比算出部１０４の構成の一例を示すブロック図である。推定混合比処理部４０１は、入力画像を基に、カバードバックグラウンド領域のモデルに対応する演算により、画素毎に推定混合比を算出して、算出した推定混合比を混合比決定部４０３に供給する。
【０３４０】
推定混合比処理部４０２は、入力画像を基に、アンカバードバックグラウンド領域のモデルに対応する演算により、画素毎に推定混合比を算出して、算出した推定混合比を混合比決定部４０３に供給する。
【０３４１】
前景に対応するオブジェクトがシャッタ時間内に等速で動いていると仮定できるので、混合領域に属する画素の混合比αは、以下の性質を有する。すなわち、混合比αは、画素の位置の変化に対応して、直線的に変化する。画素の位置の変化を１次元とすれば、混合比αの変化は、直線で表現することができ、画素の位置の変化を２次元とすれば、混合比αの変化は、平面で表現することができる。
【０３４２】
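Because the period of one frame is short, the assumption holds that the object corresponding to the foreground is a rigid body moving at constant velocity.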
【０３４３】
この場合、混合比αの傾きは、前景のシャッタ時間内での動き量vの逆比となる。
【０３４４】
理想的な混合比αの例を図６３に示す。理想的な混合比αの混合領域における傾きlは、動き量vの逆数として表すことができる。
【０３４５】
図６３に示すように、理想的な混合比αは、背景領域において、１の値を有し、前景領域において、０の値を有し、混合領域において、０を越え１未満の値を有する。
【０３４６】
図６４の例において、フレーム#nの左から７番目の画素の画素値C06は、フレーム#n-1の左から７番目の画素の画素値P06を用いて、式（８）で表すことができる。
【０３４７】
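【数６】
$$ C06 = B06/v + B06/v + F01/v + F02/v = P06/v + P06/v + F01/v + F02/v = \frac{2}{v}\cdot P06 + \sum_{i=1}^{2} F_i/v \tag{8} $$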
【０３４８】
式（８）において、画素値C06を混合領域の画素の画素値Mと、画素値P06を背景領域の画素の画素値Bと表現する。すなわち、混合領域の画素の画素値Mおよび背景領域の画素の画素値Bは、それぞれ、式（９）および式（１０）のように表現することができる。
【０３４９】
M=C06 （９）
B=P06 （１０）
【０３５０】
式（８）中の2/vは、混合比αに対応する。動き量vが４なので、フレーム#nの左から７番目の画素の混合比αは、0.5となる。
【０３５１】
以上のように、注目しているフレーム#nの画素値Cを混合領域の画素値と見なし、フレーム#nの前のフレーム#n-1の画素値Pを背景領域の画素値と見なすことで、混合比αを示す式（３）は、式（１１）のように書き換えられる。
【０３５２】
C=α・P+f （１１）
式（１１）のfは、注目している画素に含まれる前景の成分の和Σ_iFi/vである。
式（１１）に含まれる変数は、混合比αおよび前景の成分の和fの２つである。
【０３５３】
同様に、アンカバードバックグラウンド領域における、動き量vが４であり、時間方向の仮想分割数が４である、画素値を時間方向に展開したモデルを図６５に示す。
【０３５４】
アンカバードバックグラウンド領域において、上述したカバードバックグラウンド領域における表現と同様に、注目しているフレーム#nの画素値Cを混合領域の画素値と見なし、フレーム#nの後のフレーム#n+1の画素値Nを背景領域の画素値と見なすことで、混合比αを示す式（３）は、式（１２）のように表現することができる。
【０３５５】
C=α・N+f （１２）
【０３５６】
なお、背景のオブジェクトが静止しているとして説明したが、背景のオブジェクトが動いている場合においても、背景の動き量vに対応させた位置の画素の画素値を利用することにより、式（８）乃至式（１２）を適用することができる。
例えば、図６４において、背景に対応するオブジェクトの動き量vが２であり、仮想分割数が２であるとき、背景に対応するオブジェクトが図中の右側に動いているとき、式（１０）における背景領域の画素の画素値Bは、画素値P04とされる。
【０３５７】
式（１１）および式（１２）は、それぞれ２つの変数を含むので、そのままでは混合比αを求めることができない。ここで、画像は一般的に空間的に相関が強いので近接する画素同士でほぼ同じ画素値となる。
【０３５８】
そこで、前景成分は、空間的に相関が強いので、前景の成分の和fを前または後のフレームから導き出せるように式を変形して、混合比αを求める。
【０３５９】
図６６のフレーム#nの左から７番目の画素の画素値Mcは、式（１３）で表すことができる。
【０３６０】
【数７】
$$ Mc = \frac{2}{v}\cdot B06 + \sum_{i=11}^{12} F_i/v \tag{13} $$
式（１３）の右辺第１項の2/vは、混合比αに相当する。式（１３）の右辺第２項は、後のフレーム#n+1の画素値を利用して、式（１４）のように表すこととする。
【０３６１】
【数８】
$$ \sum_{i=11}^{12} F_i/v = \beta\cdot\sum_{i=7}^{10} F_i/v \tag{14} $$
【０３６２】
ここで、前景の成分の空間相関を利用して、式（１５）が成立するとする。
【０３６３】
F=F05=F06=F07=F08=F09=F10=F11=F12 （１５）
式（１４）は、式（１５）を利用して、式（１６）のように置き換えることができる。
【０３６４】
【数９】
$$ \sum_{i=11}^{12} F_i/v = \frac{2}{v}\cdot F = \beta\cdot\frac{4}{v}\cdot F \tag{16} $$
【０３６５】
結果として、βは、式（１７）で表すことができる。
【０３６６】
β=2/4 （１７）
【０３６７】
一般的に、式（１５）に示すように混合領域に関係する前景の成分が等しいと仮定すると、混合領域の全ての画素について、内分比の関係から式（１８）が成立する。
【０３６８】
β=1-α （１８）
【０３６９】
式（１８）が成立するとすれば、式（１１）は、式（１９）に示すように展開することができる。
【０３７０】
【数１０】
$$ C = \alpha\cdot P + f = \alpha\cdot P + (1-\alpha)\cdot N \tag{19} $$
【０３７１】
同様に、式（１８）が成立するとすれば、式（１２）は、式（２０）に示すように展開することができる。
【０３７２】
【数１１】
$$ C = \alpha\cdot N + f = \alpha\cdot N + (1-\alpha)\cdot P \tag{20} $$
【０３７３】
式（１９）および式（２０）において、C，N、およびPは、既知の画素値なので、式（１９）および式（２０）に含まれる変数は、混合比αのみである。式（１９）および式（２０）における、C，N、およびPの関係を図６７に示す。Cは、混合比αを算出する、フレーム#nの注目している画素の画素値である。Nは、注目している画素と空間方向の位置が対応する、フレーム#n+1の画素の画素値である。Pは、注目している画素と空間方向の位置が対応する、フレーム#n-1の画素の画素値である。
【０３７４】
従って、式（１９）および式（２０）のそれぞれに１つの変数が含まれることとなるので、３つのフレームの画素の画素値を利用して、混合比αを算出することができる。式（１９）および式（２０）を解くことにより、正しい混合比αが算出されるための条件は、混合領域に関係する前景の成分が等しい、すなわち、前景のオブジェクトが静止しているとき撮像された前景の画像オブジェクトにおいて、前景のオブジェクトの動きの方向に対応する、画像オブジェクトの境界に位置する画素であって、動き量vの２倍の数の連続している画素の画素値が、一定であることである。
【０３７５】
以上のように、カバードバックグラウンド領域に属する画素の混合比αは、式（２１）により算出され、アンカバードバックグラウンド領域に属する画素の混合比αは、式（２２）により算出される。
【０３７６】
α=(C-N)/(P-N) （２１）
α=(C-P)/(N-P) （２２）
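Equations (21) and (22) translate directly into code. The sketch below uses hypothetical function names and returns `None` when the denominator is zero, a case the text does not discuss; that guard is an assumption of this example.

```python
def alpha_covered(c, p, n):
    """Equation (21): alpha = (C - N) / (P - N), covered background model.

    c: pixel value C of frame #n, p: value P of frame #n-1,
    n: value N of frame #n+1, all at the corresponding position.
    """
    if p == n:
        return None  # mixture ratio is undetermined when P equals N
    return (c - n) / (p - n)


def alpha_uncovered(c, p, n):
    """Equation (22): alpha = (C - P) / (N - P), uncovered background model."""
    if n == p:
        return None  # mixture ratio is undetermined when N equals P
    return (c - p) / (n - p)
```

As a check, take a covered-background pixel with background value P = 100, foreground value N = 20, and true mixture ratio 0.5. Equation (19) gives C = 0.5·100 + 0.5·20 = 60, and `alpha_covered(60, 100, 20)` recovers (60 − 20)/(100 − 20) = 0.5.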
【０３７７】
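FIG. 68 is a block diagram showing the configuration of the estimated mixture ratio processing unit 401. The frame memory 421 stores the input image frame by frame, and supplies the frame one before the frame currently being input as the input image to the frame memory 422 and the mixture ratio arithmetic unit 423.
【０３７８】
The frame memory 422 stores the input image frame by frame, and supplies the frame one before the frame being supplied from the frame memory 421 to the mixture ratio arithmetic unit 423.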
【０３７９】
従って、入力画像としてフレーム#n+1が混合比演算部４２３に入力されているとき、フレームメモリ４２１は、フレーム#nを混合比演算部４２３に供給し、フレームメモリ４２２は、フレーム#n-1を混合比演算部４２３に供給する。
【０３８０】
混合比演算部４２３は、式（２１）に示す演算により、フレーム#nの注目している画素の画素値C、注目している画素と空間的位置が対応する、フレーム#n+1の画素の画素値N、および注目している画素と空間的位置が対応する、フレーム#n-1の画素の画素値Pを基に、注目している画素の推定混合比を算出して、算出した推定混合比を出力する。例えば、背景が静止しているとき、混合比演算部４２３は、フレーム#nの注目している画素の画素値C、注目している画素とフレーム内の位置が同じ、フレーム#n+1の画素の画素値N、および注目している画素とフレーム内の位置が同じ、フレーム#n-1の画素の画素値Pを基に、注目している画素の推定混合比を算出して、算出した推定混合比を出力する。
【０３８１】
このように、推定混合比処理部４０１は、入力画像を基に、推定混合比を算出して、混合比決定部４０３に供給することができる。
【０３８２】
なお、推定混合比処理部４０２は、推定混合比処理部４０１が式（２１）に示す演算により、注目している画素の推定混合比を算出するのに対して、式（２２）に示す演算により、注目している画素の推定混合比を算出する部分が異なることを除き、推定混合比処理部４０１と同様なので、その説明は省略する。
【０３８３】
図６９は、推定混合比処理部４０１により算出された推定混合比の例を示す図である。図６９に示す推定混合比は、等速で動いているオブジェクトに対応する前景の動き量vが１１である場合の結果を、１ラインに対して示すものである。
【０３８４】
推定混合比は、混合領域において、図６３に示すように、ほぼ直線的に変化していることがわかる。
【０３８５】
図６２に戻り、混合比決定部４０３は、領域特定部１０３から供給された、混合比αの算出の対象となる画素が、前景領域、背景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域のいずれかに属するかを示す領域情報を基に、混合比αを設定する。混合比決定部４０３は、対象となる画素が前景領域に属する場合、０を混合比αに設定し、対象となる画素が背景領域に属する場合、１を混合比αに設定し、対象となる画素がカバードバックグラウンド領域に属する場合、推定混合比処理部４０１から供給された推定混合比を混合比αに設定し、対象となる画素がアンカバードバックグラウンド領域に属する場合、推定混合比処理部４０２から供給された推定混合比を混合比αに設定する。混合比決定部４０３は、領域情報を基に設定した混合比αを出力する。
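The selection performed by the mixture ratio determination unit 403 can be sketched as a simple dispatch on the region label. The function name and the string labels are hypothetical and chosen only for this illustration.

```python
def decide_alpha(region, est_covered, est_uncovered):
    """Select the mixture ratio for one pixel from its region label.

    est_covered:   estimate from processing unit 401 (covered model)
    est_uncovered: estimate from processing unit 402 (uncovered model)
    """
    if region == "foreground":
        return 0.0
    if region == "background":
        return 1.0
    if region == "covered":
        return est_covered
    if region == "uncovered":
        return est_uncovered
    raise ValueError(f"unknown region label: {region!r}")
```

Both estimates are computed for every pixel, and the region information decides which one, if either, is kept.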
【０３８６】
図７０は、混合比算出部１０４の他の構成を示すブロック図である。選択部４４１は、領域特定部１０３から供給された領域情報を基に、カバードバックグラウンド領域に属する画素および、これに対応する前および後のフレームの画素を推定混合比処理部４４２に供給する。選択部４４１は、領域特定部１０３から供給された領域情報を基に、アンカバードバックグラウンド領域に属する画素および、これに対応する前および後のフレームの画素を推定混合比処理部４４３に供給する。
【０３８７】
推定混合比処理部４４２は、選択部４４１から入力された画素値を基に、式（２１）に示す演算により、カバードバックグラウンド領域に属する、注目している画素の推定混合比を算出して、算出した推定混合比を選択部４４４に供給する。
【０３８８】
推定混合比処理部４４３は、選択部４４１から入力された画素値を基に、式（２２）に示す演算により、アンカバードバックグラウンド領域に属する、注目している画素の推定混合比を算出して、算出した推定混合比を選択部４４４に供給する。
【０３８９】
選択部４４４は、領域特定部１０３から供給された領域情報を基に、対象となる画素が前景領域に属する場合、０である推定混合比を選択して、混合比αに設定し、対象となる画素が背景領域に属する場合、１である推定混合比を選択して、混合比αに設定する。選択部４４４は、対象となる画素がカバードバックグラウンド領域に属する場合、推定混合比処理部４４２から供給された推定混合比を選択して混合比αに設定し、対象となる画素がアンカバードバックグラウンド領域に属する場合、推定混合比処理部４４３から供給された推定混合比を選択して混合比αに設定する。選択部４４４は、領域情報を基に選択して設定した混合比αを出力する。
【０３９０】
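In this way, the mixture ratio calculation unit 104 having the alternative configuration shown in FIG. 70 can calculate the mixture ratio α for each pixel contained in the image and output the calculated mixture ratio α.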
【０３９１】
図７１のフローチャートを参照して、図６２に構成を示す混合比算出部１０４の混合比αの算出の処理を説明する。ステップＳ４０１において、混合比算出部１０４は、領域特定部１０３から供給された領域情報を取得する。ステップＳ４０２において、推定混合比処理部４０１は、カバードバックグラウンド領域に対応するモデルにより推定混合比の演算の処理を実行し、算出した推定混合比を混合比決定部４０３に供給する。混合比推定の演算の処理の詳細は、図７２のフローチャートを参照して、後述する。
【０３９２】
ステップＳ４０３において、推定混合比処理部４０２は、アンカバードバックグラウンド領域に対応するモデルにより推定混合比の演算の処理を実行し、算出した推定混合比を混合比決定部４０３に供給する。
【０３９３】
ステップＳ４０４において、混合比算出部１０４は、フレーム全体について、混合比αを推定したか否かを判定し、フレーム全体について、混合比αを推定していないと判定された場合、ステップＳ４０２に戻り、次の画素について混合比αを推定する処理を実行する。
【０３９４】
ステップＳ４０４において、フレーム全体について、混合比αを推定したと判定された場合、ステップＳ４０５に進み、混合比決定部４０３は、画素が、前景領域、背景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域のいずれかに属するかを示す、領域特定部１０３から供給された領域情報を基に、混合比αを設定する。混合比決定部４０３は、対象となる画素が前景領域に属する場合、０を混合比αに設定し、対象となる画素が背景領域に属する場合、１を混合比αに設定し、対象となる画素がカバードバックグラウンド領域に属する場合、推定混合比処理部４０１から供給された推定混合比を混合比αに設定し、対象となる画素がアンカバードバックグラウンド領域に属する場合、推定混合比処理部４０２から供給された推定混合比を混合比αに設定し、処理は終了する。
【０３９５】
このように、混合比算出部１０４は、領域特定部１０３から供給された領域情報、および入力画像を基に、各画素に対応する特徴量である混合比αを算出することができる。
【０３９６】
図７０に構成を示す混合比算出部１０４の混合比αの算出の処理は、図７１のフローチャートで説明した処理と同様なので、その説明は省略する。
【０３９７】
次に、図７１のステップＳ４０２に対応する、カバードバックグラウンド領域に対応するモデルによる混合比推定の処理を図７２のフローチャートを参照して説明する。
【０３９８】
ステップＳ４２１において、混合比演算部４２３は、フレームメモリ４２１から、フレーム#nの注目画素の画素値Cを取得する。
【０３９９】
ステップＳ４２２において、混合比演算部４２３は、フレームメモリ４２２から、注目画素に対応する、フレーム#n-1の画素の画素値Pを取得する。
【０４００】
ステップＳ４２３において、混合比演算部４２３は、入力画像に含まれる注目画素に対応する、フレーム#n+1の画素の画素値Nを取得する。
【０４０１】
ステップＳ４２４において、混合比演算部４２３は、フレーム#nの注目画素の画素値C、フレーム#n-1の画素の画素値P、およびフレーム#n+1の画素の画素値Nを基に、推定混合比を演算する。
【０４０２】
ステップＳ４２５において、混合比演算部４２３は、フレーム全体について、推定混合比を演算する処理を終了したか否かを判定し、フレーム全体について、推定混合比を演算する処理を終了していないと判定された場合、ステップＳ４２１に戻り、次の画素について推定混合比を算出する処理を繰り返す。
【０４０３】
ステップＳ４２５において、フレーム全体について、推定混合比を演算する処理を終了したと判定された場合、処理は終了する。
【０４０４】
このように、推定混合比処理部４０１は、入力画像を基に、推定混合比を演算することができる。
【０４０５】
図７１のステップＳ４０３におけるアンカバードバックグラウンド領域に対応するモデルによる混合比推定の処理は、アンカバードバックグラウンド領域のモデルに対応する式を利用した、図７２のフローチャートに示す処理と同様なので、その説明は省略する。
【０４０６】
なお、図７０に示す推定混合比処理部４４２および推定混合比処理部４４３は、図７２に示すフローチャートと同様の処理を実行して推定混合比を演算するので、その説明は省略する。
【０４０７】
また、背景に対応するオブジェクトが静止しているとして説明したが、背景領域に対応する画像が動きを含んでいても上述した混合比αを求める処理を適用することができる。例えば、背景領域に対応する画像が一様に動いているとき、推定混合比処理部４０１は、背景の動きに対応して画像全体をシフトさせ、背景に対応するオブジェクトが静止している場合と同様に処理する。また、背景領域に対応する画像が局所毎に異なる背景の動きを含んでいるとき、推定混合比処理部４０１は、混合領域に属する画素に対応する画素として、背景の動きに対応した画素を選択して、上述の処理を実行する。
【０４０８】
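The mixture ratio calculation unit 104 may also execute, for all pixels, only the mixture ratio estimation based on the model corresponding to the covered background region, and output the calculated estimated mixture ratio as the mixture ratio α. In this case, the mixture ratio α indicates the proportion of the background components for pixels belonging to the covered background region, and the proportion of the foreground components for pixels belonging to the uncovered background region. If, for pixels belonging to the uncovered background region, the absolute value of the difference between the mixture ratio α calculated in this way and 1 is computed and set as the mixture ratio α, the image processing apparatus can obtain a mixture ratio α indicating the proportion of the background components for those pixels as well.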
【０４０９】
なお、同様に、混合比算出部１０４は、全ての画素について、アンカバードバックグラウンド領域に対応するモデルによる混合比推定の処理のみを実行して、算出された推定混合比を混合比αとして出力するようにしてもよい。
【０４１０】
次に、混合比算出部１０４の他の処理について説明する。
【０４１１】
シャッタ時間内において、前景に対応するオブジェクトが等速で動くことによる、画素の位置の変化に対応して、混合比αが直線的に変化する性質を利用して、空間方向に、混合比αと前景の成分の和fとを近似した式を立てることができる。混合領域に属する画素の画素値および背景領域に属する画素の画素値の組の複数を利用して、混合比αと前景の成分の和fとを近似した式を解くことにより、混合比αを算出する。
【０４１２】
混合比αの変化を、直線として近似すると、混合比αは、式（２３）で表される。
【０４１３】
α=il+p （２３）
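In equation (23), i is the index in the spatial direction with the position of the pixel of interest set to 0. l is the slope of the straight line of the mixture ratio α. p is the intercept of the straight line of the mixture ratio α and is also the mixture ratio α of the pixel of interest. In equation (23), the index i is known, whereas the slope l and the intercept p are unknown.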
【０４１４】
インデックスi、傾きl、および切片pの関係を図７３に示す。
【０４１５】
混合比αを式（２３）のように近似することにより、複数の画素に対して複数の異なる混合比αは、２つの変数で表現される。図７３に示す例において、５つの画素に対する５つの混合比は、２つの変数である傾きlおよび切片pにより表現される。
【０４１６】
図７４に示す平面で混合比αを近似すると、画像の水平方向および垂直方向の２つの方向に対応する動きvを考慮したとき、式（２３）を平面に拡張して、混合比αは、式（２４）で表される。
【０４１７】
α=jm+kq+p （２４）
式（２４）において、jは、注目している画素の位置を０とした水平方向のインデックスであり、kは、垂直方向のインデックスである。mは、混合比αの面の水平方向の傾きであり、qは、混合比αの面の垂直方向の傾きである。pは、混合比αの面の切片である。
【０４１８】
例えば、図６４に示すフレーム#nにおいて、C05乃至C07について、それぞれ、式（２５）乃至式（２７）が成立する。
【０４１９】
C05=α05・B05/v+f05 （２５）
C06=α06・B06/v+f06 （２６）
C07=α07・B07/v+f07 （２７）
【０４２０】
前景の成分が近傍で一致する、すなわち、F01乃至F03が等しいとして、F01乃至F03をFcに置き換えると式（２８）が成立する。
【０４２１】
f(x)=(1-α(x))・Fc （２８）
式（２８）において、xは、空間方向の位置を表す。
【０４２２】
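Substituting equation (24) for α(x), equation (28) can be expressed as equation (29).
【０４２３】
f(x)=(1-(jm+kq+p))・Fc=j・(-m・Fc)+k・(-q・Fc)+((1-p)・Fc)=js+kt+u （２９）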
【０４２４】
式（２９）において、（-m・Fc）、（-q・Fc）、および(1-p)・Fcは、式（３０）乃至式（３２）に示すように置き換えられている。
【０４２５】
s=-m・Fc （３０）
t=-q・Fc （３１）
u=(1-p)・Fc （３２）
【０４２６】
式（２９）において、jは、注目している画素の位置を０とした水平方向のインデックスであり、kは、垂直方向のインデックスである。
【０４２７】
このように、前景に対応するオブジェクトがシャッタ時間内において等速に移動し、前景に対応する成分が近傍において一定であるという仮定が成立するので、前景の成分の和は、式（２９）で近似される。
【０４２８】
なお、混合比αを直線で近似する場合、前景の成分の和は、式（３３）で表すことができる。
【０４２９】
f(x)=is+u （３３）
【０４３０】
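Replacing the mixture ratio α and the sum of the foreground components in equation (13) by means of equations (24) and (29), the pixel value M is expressed by equation (34).
【０４３１】
M=(jm+kq+p)・B+js+kt+u=jB・m+kB・q+B・p+j・s+k・t+u （３４）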
【０４３２】
式（３４）において、未知の変数は、混合比αの面の水平方向の傾きm、混合比αの面の垂直方向の傾きq、混合比αの面の切片p、s、t、およびuの６つである。
【０４３３】
注目している画素の近傍の画素に対応させて、式（３４）に示す正規方程式に、画素値Mまたは画素値Bを設定し、画素値Mまたは画素値Bが設定された複数の正規方程式を最小自乗法で解いて、混合比αを算出する。
【０４３４】
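For example, when the horizontal index j of the pixel of interest is set to 0, the vertical index k is set to 0, and the pixel value M or the pixel value B is set in the normal equation shown in equation (34) for the 3×3 pixels in the neighborhood of the pixel of interest, equations (35) to (43) are obtained.
M_{-1,-1}=(-1)・B_{-1,-1}・m+(-1)・B_{-1,-1}・q+B_{-1,-1}・p+(-1)・s+(-1)・t+u （３５）
M_{0,-1}=(0)・B_{0,-1}・m+(-1)・B_{0,-1}・q+B_{0,-1}・p+(0)・s+(-1)・t+u （３６）
M_{+1,-1}=(+1)・B_{+1,-1}・m+(-1)・B_{+1,-1}・q+B_{+1,-1}・p+(+1)・s+(-1)・t+u （３７）
M_{-1,0}=(-1)・B_{-1,0}・m+(0)・B_{-1,0}・q+B_{-1,0}・p+(-1)・s+(0)・t+u （３８）
M_{0,0}=(0)・B_{0,0}・m+(0)・B_{0,0}・q+B_{0,0}・p+(0)・s+(0)・t+u （３９）
M_{+1,0}=(+1)・B_{+1,0}・m+(0)・B_{+1,0}・q+B_{+1,0}・p+(+1)・s+(0)・t+u （４０）
M_{-1,+1}=(-1)・B_{-1,+1}・m+(+1)・B_{-1,+1}・q+B_{-1,+1}・p+(-1)・s+(+1)・t+u （４１）
M_{0,+1}=(0)・B_{0,+1}・m+(+1)・B_{0,+1}・q+B_{0,+1}・p+(0)・s+(+1)・t+u （４２）
M_{+1,+1}=(+1)・B_{+1,+1}・m+(+1)・B_{+1,+1}・q+B_{+1,+1}・p+(+1)・s+(+1)・t+u （４３）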
【０４３５】
注目している画素の水平方向のインデックスjが０であり、垂直方向のインデックスkが０であるので、注目している画素の混合比αは、式（２４）より、j=0およびk=0のときの値、すなわち、切片pに等しい。
【０４３６】
従って、式（３５）乃至式（４３）の９つの式を基に、最小自乗法により、水平方向の傾きm、垂直方向の傾きq、切片p、s、t、およびuのそれぞれの値を算出し、切片pを混合比αとして出力すればよい。
【０４３７】
次に、最小自乗法を適用して混合比αを算出するより具体的な手順を説明する。
【０４３８】
When the index j and the index k are expressed by a single index x, the relationship among the index j, the index k, and the index x is given by equation (44).
【０４３９】
x=(j+1)・3+(k+1) （４４）
【０４４０】
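The horizontal slope m, the vertical slope q, the intercept p, and s, t, and u are expressed as the variables w0, w1, w2, w3, w4, and w5, respectively, and jB, kB, B, j, k, and 1 are expressed as a0, a1, a2, a3, a4, and a5, respectively. Taking the error ex into account, equations (35) to (43) can be rewritten as equation (45).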
【０４４１】
【数１２】
$$ M_x = \sum_{y=0}^{5} a_y\cdot w_y + e_x \tag{45} $$
式（４５）において、xは、０乃至８の整数のいずれかの値である。
【０４４２】
式（４５）から、式（４６）を導くことができる。
【０４４３】
【数１３】
$$ e_x = M_x - \sum_{y=0}^{5} a_y\cdot w_y \tag{46} $$
【０４４４】
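Here, in order to apply the method of least squares, the sum of squared errors E is defined as shown in equation (47).
【０４４５】
【数１４】
$$ E = \sum_{x=0}^{8} e_x^{2} \tag{47} $$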
【０４４６】
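For the error to be minimized, the partial derivative of the sum of squared errors E with respect to the variable wv must be 0. Here, v is any integer from 0 to 5. Accordingly, wy is determined so as to satisfy equation (48).
【０４４６】
【数１５】
$$ \frac{\partial E}{\partial w_v} = 2\cdot\sum_{x=0}^{8} e_x\cdot\frac{\partial e_x}{\partial w_v} = -2\cdot\sum_{x=0}^{8} e_x\cdot a_v = 0 \tag{48} $$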
【０４４８】
式（４８）に式（４６）を代入すると、式（４９）を得る。
【０４４９】
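【数１６】
$$ \sum_{x=0}^{8}\left(a_v\cdot\sum_{y=0}^{5} a_y\cdot w_y\right) = \sum_{x=0}^{8} a_v\cdot M_x \tag{49} $$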
【０４５０】
式（４９）のvに０乃至５の整数のいずれか１つを代入して得られる６つの式に、例えば、掃き出し法（Gauss-Jordanの消去法）などを適用して、wyを算出する。上述したように、w0は水平方向の傾きmであり、w1は垂直方向の傾きqであり、w2は切片pであり、w3はsであり、w4はtであり、w5はuである。
【０４５１】
以上のように、画素値Mおよび画素値Bを設定した式に、最小自乗法を適用することにより、水平方向の傾きm、垂直方向の傾きq、切片p、s、t、およびuを求めることができる。
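The least-squares procedure can be sketched in plain Python as follows. The sketch builds the six normal equations from the nine equations of the 3×3 neighborhood, with each row holding a0 to a5 = jB, kB, B, j, k, 1, and solves them by Gauss-Jordan elimination. The function names, the dictionary input keyed by (j, k), and the partial pivoting are assumptions of this example and are not taken from the text.

```python
def solve_gauss_jordan(a, b):
    """Solve the square system a w = b by Gauss-Jordan elimination."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: use the row with the largest entry in this column.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("normal equations are singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pv = aug[col][col]
        aug[col] = [v / pv for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc
                          for vr, vc in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]


def estimate_alpha_plane(mixed, background):
    """Estimate the mixture ratio of the centre pixel of a 3x3 block.

    mixed, background: dicts mapping (j, k), with j and k in {-1, 0, 1},
    to the pixel value M and the background pixel value B.
    """
    rows, targets = [], []
    for k in (-1, 0, 1):
        for j in (-1, 0, 1):
            b = background[(j, k)]
            rows.append([j * b, k * b, b, j, k, 1.0])  # a0 .. a5
            targets.append(mixed[(j, k)])
    n = 6
    # Normal equations: sum_x a_v * a_y * w_y = sum_x a_v * M_x
    lhs = [[sum(r[v] * r[y] for r in rows) for y in range(n)]
           for v in range(n)]
    rhs = [sum(r[v] * m for r, m in zip(rows, targets)) for v in range(n)]
    m, q, p, s, t, u = solve_gauss_jordan(lhs, rhs)
    return p  # the intercept p is the mixture ratio of the centre pixel
```

The background values must vary within the block. If B were constant over all nine pixels, the columns jB and j would be proportional and the normal equations would be singular, which the sketch reports as an error.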
【０４５２】
式（３５）乃至式（４３）に対応する説明において、混合領域に含まれる画素の画素値をMとし、背景領域に含まれる画素の画素値をBとして説明したが、注目している画素が、カバードバックグラウンド領域に含まれる場合、またはアンカバードバックグラウンド領域に含まれる場合のそれぞれに対して、正規方程式を立てる必要がある。
【０４５３】
例えば、図６４に示す、フレーム#nのカバードバックグラウンド領域に含まれる画素の混合比αを求める場合、フレーム#nの画素のC04乃至C08、およびフレーム#n-1の画素の画素値P04乃至P08が、正規方程式に設定される。
【０４５４】
図６５に示す、フレーム#nのアンカバードバックグラウンド領域に含まれる画素の混合比αを求める場合、フレーム#nの画素のC28乃至C32、およびフレーム#n+1の画素の画素値N28乃至N32が、正規方程式に設定される。
【０４５５】
また、例えば、図７５に示す、カバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、以下の式（５０）乃至式（５８）が立てられる。混合比αを算出する画素の画素値は、Mc5である。
Mc1=(-1)・Bc1・m+(-1)・Bc1・q+Bc1・p+(-1)・s+(-1)・t+u （５０）
Mc2=(0)・Bc2・m+(-1)・Bc2・q+Bc2・p+(0)・s+(-1)・t+u （５１）
Mc3=(+1)・Bc3・m+(-1)・Bc3・q+Bc3・p+(+1)・s+(-1)・t+u （５２）
Mc4=(-1)・Bc4・m+(0)・Bc4・q+Bc4・p+(-1)・s+(0)・t+u （５３）
Mc5=(0)・Bc5・m+(0)・Bc5・q+Bc5・p+(0)・s+(0)・t+u （５４）
Mc6=(+1)・Bc6・m+(0)・Bc6・q+Bc6・p+(+1)・s+(0)・t+u （５５）
Mc7=(-1)・Bc7・m+(+1)・Bc7・q+Bc7・p+(-1)・s+(+1)・t+u （５６）
Mc8=(0)・Bc8・m+(+1)・Bc8・q+Bc8・p+(0)・s+(+1)・t+u （５７）
Mc9=(+1)・Bc9・m+(+1)・Bc9・q+Bc9・p+(+1)・s+(+1)・t+u （５８）
【０４５６】
フレーム#nのカバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、式（５０）乃至式（５８）において、フレーム#nの画素に対応する、フレーム#n-1の画素の背景領域の画素の画素値Bc1乃至Bc9が使用される。
【０４５７】
図７５に示す、アンカバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、以下の式（５９）乃至式（６７）が立てられる。混合比αを算出する画素の画素値は、Mu5である。
Mu1=(-1)・Bu1・m+(-1)・Bu1・q+Bu1・p+(-1)・s+(-1)・t+u （５９）
Mu2=(0)・Bu2・m+(-1)・Bu2・q+Bu2・p+(0)・s+(-1)・t+u （６０）
Mu3=(+1)・Bu3・m+(-1)・Bu3・q+Bu3・p+(+1)・s+(-1)・t+u （６１）
Mu4=(-1)・Bu4・m+(0)・Bu4・q+Bu4・p+(-1)・s+(0)・t+u （６２）
Mu5=(0)・Bu5・m+(0)・Bu5・q+Bu5・p+(0)・s+(0)・t+u （６３）
Mu6=(+1)・Bu6・m+(0)・Bu6・q+Bu6・p+(+1)・s+(0)・t+u （６４）
Mu7=(-1)・Bu7・m+(+1)・Bu7・q+Bu7・p+(-1)・s+(+1)・t+u （６５）
Mu8=(0)・Bu8・m+(+1)・Bu8・q+Bu8・p+(0)・s+(+1)・t+u （６６）
Mu9=(+1)・Bu9・m+(+1)・Bu9・q+Bu9・p+(+1)・s+(+1)・t+u （６７）
【０４５８】
フレーム#nのアンカバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、式（５９）乃至式（６７）において、フレーム#nの画素に対応する、フレーム#n+1の画素の背景領域の画素の画素値Bu1乃至Bu9が使用される。
【０４５９】
図７６は、推定混合比処理部４０１の構成を示すブロック図である。推定混合比処理部４０１に入力された画像は、遅延部５０１および足し込み部５０２に供給される。
【０４６０】
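The delay unit 501 delays the input image by one frame and supplies it to the adding unit 502. When frame #n is being input to the adding unit 502 as the input image, the delay unit 501 supplies frame #n-1 to the adding unit 502.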
【０４６１】
足し込み部５０２は、混合比αを算出する画素の近傍の画素の画素値、およびフレーム#n-1の画素値を、正規方程式に設定する。例えば、足し込み部５０２は、式（５０）乃至式（５８）に基づいて、正規方程式に画素値Mc1乃至Mc9および画素値Bc1乃至Bc9を設定する。足し込み部５０２は、画素値が設定された正規方程式を演算部５０３に供給する。
【０４６２】
演算部５０３は、足し込み部５０２から供給された正規方程式を掃き出し法などにより解いて推定混合比を求め、求められた推定混合比を出力する。
【０４６３】
このように、推定混合比処理部４０１は、入力画像を基に、推定混合比を算出して、混合比決定部４０３に供給することができる。
【０４６４】
なお、推定混合比処理部４０２は、推定混合比処理部４０１と同様の構成を有するので、その説明は省略する。
【０４６５】
図７７は、推定混合比処理部４０１により算出された推定混合比の例を示す図である。図７７に示す推定混合比は、等速で動いているオブジェクトに対応する前景の動きvが１１であり、７×７画素のブロックを単位として方程式を生成して算出された結果を、１ラインに対して示すものである。
【０４６６】
推定混合比は、混合領域において、図６３に示すように、ほぼ直線的に変化していることがわかる。
【０４６７】
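The mixture ratio determining unit 403 sets the mixture ratio based on the region information supplied from the region specifying unit 103, which indicates whether the pixel for which the mixture ratio is calculated belongs to the foreground region, the background region, the covered background region, or the uncovered background region. The mixture ratio determining unit 403 sets the mixture ratio to 0 when the target pixel belongs to the foreground region, to 1 when it belongs to the background region, to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401 when it belongs to the covered background region, and to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 402 when it belongs to the uncovered background region. The mixture ratio determining unit 403 outputs the mixture ratio set based on the region information.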
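The selection rule of the mixture ratio determining unit 403 can be sketched as a small Python function. The function name and the string labels for the four regions are hypothetical and chosen only for illustration.

```python
def decide_mixture_ratio(region, est_covered, est_uncovered):
    """Pick the mixture ratio for one pixel from its region label.

    region: "foreground", "background", "covered" or "uncovered" (assumed labels)
    est_covered: estimate from the covered-background model (unit 401)
    est_uncovered: estimate from the uncovered-background model (unit 402)
    """
    if region == "foreground":
        return 0.0
    if region == "background":
        return 1.0
    if region == "covered":
        return est_covered
    if region == "uncovered":
        return est_uncovered
    raise ValueError("unknown region label: %r" % (region,))
```

Both estimates are computed for every pixel and the region information only decides which one is kept, which mirrors the order of steps S502 to S505.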
【０４６８】
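With reference to the flowchart of FIG. 78, the mixture ratio calculation processing of the mixture ratio calculating unit 104 is described for the case where the estimated mixture ratio processing unit 401 has the configuration shown in FIG. 76. In step S501, the mixture ratio calculating unit 104 acquires the region information supplied from the region specifying unit 103. In step S502, the estimated mixture ratio processing unit 401 executes mixture ratio estimation using the model corresponding to the covered background region and supplies the estimated mixture ratio to the mixture ratio determining unit 403. Details of the mixture ratio estimation processing are described later with reference to the flowchart of FIG. 79.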
【０４６９】
ステップＳ５０３において、推定混合比処理部４０２は、アンカバードバックグラウンド領域に対応するモデルによる混合比推定の処理を実行し、推定混合比を混合比決定部４０３に供給する。
【０４７０】
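In step S504, the mixture ratio calculating unit 104 determines whether the mixture ratio has been estimated for the entire frame. If it is determined that the mixture ratio has not been estimated for the entire frame, the processing returns to step S502 and the mixture ratio estimation is executed for the next pixel.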
【０４７１】
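If it is determined in step S504 that the mixture ratio has been estimated for the entire frame, the processing proceeds to step S505, where the mixture ratio determining unit 403 sets the mixture ratio based on the region information supplied from the region specifying unit 103, which indicates whether the pixel for which the mixture ratio is calculated belongs to the foreground region, the background region, the covered background region, or the uncovered background region. The mixture ratio determining unit 403 sets the mixture ratio to 0 when the target pixel belongs to the foreground region, to 1 when it belongs to the background region, to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401 when it belongs to the covered background region, and to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 402 when it belongs to the uncovered background region, and the processing ends.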
【０４７２】
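In this way, the mixture ratio calculating unit 104 can calculate the mixture ratio α, which is a feature quantity corresponding to each pixel, based on the region information supplied from the region specifying unit 103 and the input image.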
【０４７３】
混合比αを利用することにより、動いているオブジェクトに対応する画像に含まれる動きボケの情報を残したままで、画素値に含まれる前景の成分と背景の成分とを分離することが可能になる。
【０４７４】
また、混合比αに基づいて画像を合成すれば、実世界を実際に撮影し直したような動いているオブジェクトのスピードに合わせた正しい動きボケを含む画像を作ることが可能になる。
【０４７５】
次に、図７８のステップＳ５０２に対応する、カバードバックグラウンド領域に対応するモデルによる混合比推定の処理を図７９のフローチャートを参照して説明する。
【０４７６】
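In step S521, the adding unit 502 sets the pixel values contained in the input image and the pixel values contained in the image supplied from the delay circuit 501 in the normal equation corresponding to the model of the covered background region.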
【０４７７】
ステップＳ５２２において、推定混合比処理部４０１は、対象となる画素についての設定が終了したか否かを判定し、対象となる画素についての設定が終了していないと判定された場合、ステップＳ５２１に戻り、正規方程式への画素値の設定の処理を繰り返す。
【０４７８】
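If it is determined in step S522 that the setting of pixel values for the target pixel has been completed, the processing proceeds to step S523, where the calculating unit 503 calculates the estimated mixture ratio based on the normal equation in which the pixel values have been set, and outputs the estimated mixture ratio thus obtained.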
【０４７９】
このように、推定混合比処理部４０１は、入力画像を基に、推定混合比を演算することができる。
【０４８０】
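The mixture ratio estimation processing using the model corresponding to the uncovered background region in step S503 of FIG. 78 is the same as the processing shown in the flowchart of FIG. 79, except that it uses the normal equation corresponding to the model of the uncovered background region, so its description is omitted.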
【０４８１】
なお、背景に対応するオブジェクトが静止しているとして説明したが、背景領域に対応する画像が動きを含んでいても上述した混合比を求める処理を適用することができる。例えば、背景領域に対応する画像が一様に動いているとき、推定混合比処理部４０１は、この動きに対応して画像全体をシフトさせ、背景に対応するオブジェクトが静止している場合と同様に処理する。また、背景領域に対応する画像が局所毎に異なる動きを含んでいるとき、推定混合比処理部４０１は、混合領域に属する画素に対応する画素として、動きに対応した画素を選択して、上述の処理を実行する。
【０４８２】
次に、前景背景分離部１０５について説明する。図８０は、前景背景分離部１０５の構成の一例を示すブロック図である。前景背景分離部１０５に供給された入力画像は、分離部６０１、スイッチ６０２、およびスイッチ６０４に供給される。カバードバックグラウンド領域を示す情報、およびアンカバードバックグラウンド領域を示す、領域特定部１０３から供給された領域情報は、分離部６０１に供給される。前景領域を示す領域情報は、スイッチ６０２に供給される。背景領域を示す領域情報は、スイッチ６０４に供給される。
【０４８３】
混合比算出部１０４から供給された混合比αは、分離部６０１に供給される。
【０４８４】
分離部６０１は、カバードバックグラウンド領域を示す領域情報、アンカバードバックグラウンド領域を示す領域情報、および混合比αを基に、入力画像から前景の成分を分離して、分離した前景の成分を合成部６０３に供給するとともに、入力画像から背景の成分を分離して、分離した背景の成分を合成部６０５に供給する。
【０４８５】
スイッチ６０２は、前景領域を示す領域情報を基に、前景に対応する画素が入力されたとき、閉じられ、入力画像に含まれる前景に対応する画素のみを合成部６０３に供給する。
【０４８６】
スイッチ６０４は、背景領域を示す領域情報を基に、背景に対応する画素が入力されたとき、閉じられ、入力画像に含まれる背景に対応する画素のみを合成部６０５に供給する。
【０４８７】
合成部６０３は、分離部６０１から供給された前景に対応する成分、スイッチ６０２から供給された前景に対応する画素を基に、前景成分画像を合成し、合成した前景成分画像を出力する。前景領域と混合領域とは重複しないので、合成部６０３は、例えば、前景に対応する成分と、前景に対応する画素とに論理和の演算を適用して、前景成分画像を合成する。
【０４８８】
合成部６０３は、前景成分画像の合成の処理の最初に実行される初期化の処理において、内蔵しているフレームメモリに全ての画素値が０である画像を格納し、前景成分画像の合成の処理において、前景成分画像を格納（上書き）する。従って、合成部６０３が出力する前景成分画像の内、背景領域に対応する画素には、画素値として０が格納されている。
【０４８９】
合成部６０５は、分離部６０１から供給された背景に対応する成分、スイッチ６０４から供給された背景に対応する画素を基に、背景成分画像を合成して、合成した背景成分画像を出力する。背景領域と混合領域とは重複しないので、合成部６０５は、例えば、背景に対応する成分と、背景に対応する画素とに論理和の演算を適用して、背景成分画像を合成する。
【０４９０】
合成部６０５は、背景成分画像の合成の処理の最初に実行される初期化の処理において、内蔵しているフレームメモリに全ての画素値が０である画像を格納し、背景成分画像の合成の処理において、背景成分画像を格納（上書き）する。従って、合成部６０５が出力する背景成分画像の内、前景領域に対応する画素には、画素値として０が格納されている。
【０４９１】
図８１は、前景背景分離部１０５に入力される入力画像、並びに前景背景分離部１０５から出力される前景成分画像および背景成分画像を示す図である。
【０４９２】
図８１（Ａ）は、表示される画像の模式図であり、図８１（Ｂ）は、図８１（Ａ）に対応する前景領域に属する画素、背景領域に属する画素、および混合領域に属する画素を含む１ラインの画素を時間方向に展開したモデル図を示す。
【０４９３】
図８１（Ａ）および図８１（Ｂ）に示すように、前景背景分離部１０５から出力される背景成分画像は、背景領域に属する画素、および混合領域の画素に含まれる背景の成分から構成される。
【０４９４】
図８１（Ａ）および図８１（Ｂ）に示すように、前景背景分離部１０５から出力される前景成分画像は、前景領域に属する画素、および混合領域の画素に含まれる前景の成分から構成される。
【０４９５】
混合領域の画素の画素値は、前景背景分離部１０５により、背景の成分と、前景の成分とに分離される。分離された背景の成分は、背景領域に属する画素と共に、背景成分画像を構成する。分離された前景の成分は、前景領域に属する画素と共に、前景成分画像を構成する。
【０４９６】
このように、前景成分画像は、背景領域に対応する画素の画素値が０とされ、前景領域に対応する画素および混合領域に対応する画素に意味のある画素値が設定される。同様に、背景成分画像は、前景領域に対応する画素の画素値が０とされ、背景領域に対応する画素および混合領域に対応する画素に意味のある画素値が設定される。
【０４９７】
次に、分離部６０１が実行する、混合領域に属する画素から前景の成分、および背景の成分を分離する処理について説明する。
【０４９８】
図８２は、図中の左から右に移動するオブジェクトに対応する前景を含む、２つのフレームの前景の成分および背景の成分を示す画像のモデルである。図８２に示す画像のモデルにおいて、前景の動き量vは４であり、仮想分割数は、４とされている。
【０４９９】
フレーム#nにおいて、最も左の画素、および左から１４番目乃至１８番目の画素は、背景の成分のみから成り、背景領域に属する。フレーム#nにおいて、左から２番目乃至４番目の画素は、背景の成分および前景の成分を含み、アンカバードバックグラウンド領域に属する。フレーム#nにおいて、左から１１番目乃至１３番目の画素は、背景の成分および前景の成分を含み、カバードバックグラウンド領域に属する。フレーム#nにおいて、左から５番目乃至１０番目の画素は、前景の成分のみから成り、前景領域に属する。
【０５００】
フレーム#n+1において、左から１番目乃至５番目の画素、および左から１８番目の画素は、背景の成分のみから成り、背景領域に属する。フレーム#n+1において、左から６番目乃至８番目の画素は、背景の成分および前景の成分を含み、アンカバードバックグラウンド領域に属する。フレーム#n+1において、左から１５番目乃至１７番目の画素は、背景の成分および前景の成分を含み、カバードバックグラウンド領域に属する。フレーム#n+1において、左から９番目乃至１４番目の画素は、前景の成分のみから成り、前景領域に属する。
【０５０１】
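FIG. 83 is a diagram explaining the processing for separating the foreground components from pixels belonging to the covered background region. In FIG. 83, α1 through α18 are the mixture ratios corresponding to the respective pixels in frame #n. In FIG. 83, the 15th through 17th pixels from the left belong to the covered background region.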
【０５０２】
フレーム#nの左から１５番目の画素の画素値C15は、式（６８）で表される。
【０５０３】
C15=α15・P15+f15 （６８）
ここで、α15は、フレーム#nの左から１５番目の画素の混合比である。P15は、フレーム#n-1の左から１５番目の画素の画素値である。
【０５０４】
式（６８）を基に、フレーム#nの左から１５番目の画素の前景の成分の和f15は、式（６９）で表される。
【０５０５】
f15=C15-α15・P15 （６９）
【０５０６】
同様に、フレーム#nの左から１６番目の画素の前景の成分の和f16は、式（７０）で表され、フレーム#nの左から１７番目の画素の前景の成分の和f17は、式（７１）で表される。
【０５０７】
f16=C16-α16・P16 （７０）
f17=C17-α17・P17 （７１）
【０５０８】
このように、カバードバックグラウンド領域に属する画素の画素値Cに含まれる前景の成分fcは、式（７２）で計算される。
【０５０９】
fc=C-α・P （７２）
Pは、１つ前のフレームの、対応する画素の画素値である。
【０５１０】
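FIG. 84 is a diagram explaining the processing for separating the foreground components from pixels belonging to the uncovered background region. In FIG. 84, α1 through α18 are the mixture ratios corresponding to the respective pixels in frame #n. In FIG. 84, the 2nd through 4th pixels from the left belong to the uncovered background region.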
【０５１１】
フレーム#nの左から２番目の画素の画素値C02は、式（７３）で表される。
【０５１２】
C02=α2・N02+f02 （７３）
ここで、α2は、フレーム#nの左から２番目の画素の混合比である。N02は、フレーム#n+1の左から２番目の画素の画素値である。
【０５１３】
式（７３）を基に、フレーム#nの左から２番目の画素の前景の成分の和f02は、式（７４）で表される。
【０５１４】
f02=C02-α2・N02 （７４）
【０５１５】
同様に、フレーム#nの左から３番目の画素の前景の成分の和f03は、式（７５）で表され、フレーム#nの左から４番目の画素の前景の成分の和f04は、式（７６）で表される。
【０５１６】
f03=C03-α3・N03 （７５）
f04=C04-α4・N04 （７６）
【０５１７】
このように、アンカバードバックグラウンド領域に属する画素の画素値Cに含まれる前景の成分fuは、式（７７）で計算される。
【０５１８】
fu=C-α・N （７７）
Nは、１つ後のフレームの、対応する画素の画素値である。
【０５１９】
このように、分離部６０１は、領域情報に含まれる、カバードバックグラウンド領域を示す情報、およびアンカバードバックグラウンド領域を示す情報、並びに画素毎の混合比αを基に、混合領域に属する画素から前景の成分、および背景の成分を分離することができる。
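Equations (72) and (77) can be sketched together in Python as follows. The function name and region labels are hypothetical. The background component is taken as α times the corresponding pixel of the previous frame (covered background) or the following frame (uncovered background), and the foreground component is the remainder.

```python
def separate_mixed_pixel(c, alpha, region, prev_pixel, next_pixel):
    """Split a mixed-region pixel value into (foreground, background) parts.

    c: pixel value in frame #n
    alpha: mixture ratio of that pixel
    region: "covered" or "uncovered" (assumed labels)
    prev_pixel: corresponding pixel value P in frame #n-1
    next_pixel: corresponding pixel value N in frame #n+1
    """
    if region == "covered":
        reference = prev_pixel   # equation (72): fc = C - alpha * P
    elif region == "uncovered":
        reference = next_pixel   # equation (77): fu = C - alpha * N
    else:
        raise ValueError("pixel is not in a mixed region: %r" % (region,))
    background = alpha * reference
    foreground = c - background
    return foreground, background
```

The two returned values always sum to the original pixel value, so no information is lost by the separation itself.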
【０５２０】
図８５は、以上で説明した処理を実行する分離部６０１の構成の一例を示すブロック図である。分離部６０１に入力された画像は、フレームメモリ６２１に供給され、混合比算出部１０４から供給されたカバードバックグラウンド領域およびアンカバードバックグラウンド領域を示す領域情報、並びに混合比αは、分離処理ブロック６２２に入力される。
【０５２１】
フレームメモリ６２１は、入力された画像をフレーム単位で記憶する。フレームメモリ６２１は、処理の対象がフレーム#nであるとき、フレーム#nの１つ前のフレームであるフレーム#n-1、フレーム#n、およびフレーム#nの１つ後のフレームであるフレーム#n+1を記憶する。
【０５２２】
フレームメモリ６２１は、フレーム#n-1、フレーム#n、およびフレーム#n+1の対応する画素を分離処理ブロック６２２に供給する。
【０５２３】
分離処理ブロック６２２は、カバードバックグラウンド領域およびアンカバードバックグラウンド領域を示す領域情報、並びに混合比αを基に、フレームメモリ６２１から供給されたフレーム#n-1、フレーム#n、およびフレーム#n+1の対応する画素の画素値に図８３および図８４を参照して説明した演算を適用して、フレーム#nの混合領域に属する画素から前景の成分および背景の成分を分離して、フレームメモリ６２３に供給する。
【０５２４】
分離処理ブロック６２２は、アンカバード領域処理部６３１、カバード領域処理部６３２、合成部６３３、および合成部６３４で構成されている。
【０５２５】
アンカバード領域処理部６３１の乗算器６４１は、混合比αを、フレームメモリ６２１から供給されたフレーム#n+1の画素の画素値に乗じて、スイッチ６４２に出力する。スイッチ６４２は、フレームメモリ６２１から供給されたフレーム#nの画素（フレーム#n+1の画素に対応する）がアンカバードバックグラウンド領域であるとき、閉じられ、乗算器６４１から供給された混合比αを乗じた画素値を演算器６４３および合成部６３４に供給する。スイッチ６４２から出力されるフレーム#n+1の画素の画素値に混合比αを乗じた値は、フレーム#nの対応する画素の画素値の背景の成分に等しい。
【０５２６】
演算器６４３は、フレームメモリ６２１から供給されたフレーム#nの画素の画素値から、スイッチ６４２から供給された背景の成分を減じて、前景の成分を求める。演算器６４３は、アンカバードバックグラウンド領域に属する、フレーム#nの画素の前景の成分を合成部６３３に供給する。
【０５２７】
カバード領域処理部６３２の乗算器６５１は、混合比αを、フレームメモリ６２１から供給されたフレーム#n-1の画素の画素値に乗じて、スイッチ６５２に出力する。スイッチ６５２は、フレームメモリ６２１から供給されたフレーム#nの画素（フレーム#n-1の画素に対応する）がカバードバックグラウンド領域であるとき、閉じられ、乗算器６５１から供給された混合比αを乗じた画素値を演算器６５３および合成部６３４に供給する。スイッチ６５２から出力されるフレーム#n-1の画素の画素値に混合比αを乗じた値は、フレーム#nの対応する画素の画素値の背景の成分に等しい。
【０５２８】
演算器６５３は、フレームメモリ６２１から供給されたフレーム#nの画素の画素値から、スイッチ６５２から供給された背景の成分を減じて、前景の成分を求める。演算器６５３は、カバードバックグラウンド領域に属する、フレーム#nの画素の前景の成分を合成部６３３に供給する。
【０５２９】
合成部６３３は、フレーム#nの、演算器６４３から供給された、アンカバードバックグラウンド領域に属する画素の前景の成分、および演算器６５３から供給された、カバードバックグラウンド領域に属する画素の前景の成分を合成して、フレームメモリ６２３に供給する。
【０５３０】
合成部６３４は、フレーム#nの、スイッチ６４２から供給された、アンカバードバックグラウンド領域に属する画素の背景の成分、およびスイッチ６５２から供給された、カバードバックグラウンド領域に属する画素の背景の成分を合成して、フレームメモリ６２３に供給する。
【０５３１】
フレームメモリ６２３は、分離処理ブロック６２２から供給された、フレーム#nの混合領域の画素の前景の成分と、背景の成分とをそれぞれに記憶する。
【０５３２】
フレームメモリ６２３は、記憶しているフレーム#nの混合領域の画素の前景の成分、および記憶しているフレーム#nの混合領域の画素の背景の成分を出力する。
【０５３３】
特徴量である混合比αを利用することにより、画素値に含まれる前景の成分と背景の成分とを完全に分離することが可能になる。
【０５３４】
合成部６０３は、分離部６０１から出力された、フレーム#nの混合領域の画素の前景の成分と、前景領域に属する画素とを合成して前景成分画像を生成する。合成部６０５は、分離部６０１から出力された、フレーム#nの混合領域の画素の背景の成分と、背景領域に属する画素とを合成して背景成分画像を生成する。
【０５３５】
図８６は、図８２のフレーム#nに対応する、前景成分画像の例と、背景成分画像の例を示す図である。
【０５３６】
図８６（Ａ）は、図８２のフレーム#nに対応する、前景成分画像の例を示す。最も左の画素、および左から１４番目の画素は、前景と背景が分離される前において、背景の成分のみから成っていたので、画素値が０とされる。
【０５３７】
左から２番目乃至４番目の画素は、前景と背景とが分離される前において、アンカバードバックグラウンド領域に属し、背景の成分が０とされ、前景の成分がそのまま残されている。左から１１番目乃至１３番目の画素は、前景と背景とが分離される前において、カバードバックグラウンド領域に属し、背景の成分が０とされ、前景の成分がそのまま残されている。左から５番目乃至１０番目の画素は、前景の成分のみから成るので、そのまま残される。
【０５３８】
図８６（Ｂ）は、図８２のフレーム#nに対応する、背景成分画像の例を示す。最も左の画素、および左から１４番目の画素は、前景と背景とが分離される前において、背景の成分のみから成っていたので、そのまま残される。
【０５３９】
左から２番目乃至４番目の画素は、前景と背景とが分離される前において、アンカバードバックグラウンド領域に属し、前景の成分が０とされ、背景の成分がそのまま残されている。左から１１番目乃至１３番目の画素は、前景と背景とが分離される前において、カバードバックグラウンド領域に属し、前景の成分が０とされ、背景の成分がそのまま残されている。左から５番目乃至１０番目の画素は、前景と背景とが分離される前において、前景の成分のみから成っていたので、画素値が０とされる。
【０５４０】
次に、図８７に示すフローチャートを参照して、前景背景分離部１０５による前景と背景との分離の処理を説明する。ステップＳ６０１において、分離部６０１のフレームメモリ６２１は、入力画像を取得し、前景と背景との分離の対象となるフレーム#nを、その前のフレーム#n-1およびその後のフレーム#n+1と共に記憶する。
【０５４１】
ステップＳ６０２において、分離部６０１の分離処理ブロック６２２は、混合比算出部１０４から供給された領域情報を取得する。ステップＳ６０３において、分離部６０１の分離処理ブロック６２２は、混合比算出部１０４から供給された混合比αを取得する。
【０５４２】
ステップＳ６０４において、アンカバード領域処理部６３１は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、アンカバードバックグラウンド領域に属する画素の画素値から、背景の成分を抽出する。
【０５４３】
ステップＳ６０５において、アンカバード領域処理部６３１は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、アンカバードバックグラウンド領域に属する画素の画素値から、前景の成分を抽出する。
【０５４４】
ステップＳ６０６において、カバード領域処理部６３２は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、カバードバックグラウンド領域に属する画素の画素値から、背景の成分を抽出する。
【０５４５】
ステップＳ６０７において、カバード領域処理部６３２は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、カバードバックグラウンド領域に属する画素の画素値から、前景の成分を抽出する。
【０５４６】
ステップＳ６０８において、合成部６３３は、ステップＳ６０５の処理で抽出されたアンカバードバックグラウンド領域に属する画素の前景の成分と、ステップＳ６０７の処理で抽出されたカバードバックグラウンド領域に属する画素の前景の成分とを合成する。合成された前景の成分は、合成部６０３に供給される。更に、合成部６０３は、スイッチ６０２を介して供給された前景領域に属する画素と、分離部６０１から供給された前景の成分とを合成して、前景成分画像を生成する。
【０５４７】
ステップＳ６０９において、合成部６３４は、ステップＳ６０４の処理で抽出されたアンカバードバックグラウンド領域に属する画素の背景の成分と、ステップＳ６０６の処理で抽出されたカバードバックグラウンド領域に属する画素の背景の成分とを合成する。合成された背景の成分は、合成部６０５に供給される。更に、合成部６０５は、スイッチ６０４を介して供給された背景領域に属する画素と、分離部６０１から供給された背景の成分とを合成して、背景成分画像を生成する。
【０５４８】
ステップＳ６１０において、合成部６０３は、前景成分画像を出力する。ステップＳ６１１において、合成部６０５は、背景成分画像を出力し、処理は終了する。
【０５４９】
このように、前景背景分離部１０５は、領域情報および混合比αを基に、入力画像から前景の成分と、背景の成分とを分離し、前景の成分のみから成る前景成分画像、および背景の成分のみから成る背景成分画像を出力することができる。
【０５５０】
次に、前景成分画像からの動きボケの除去について説明する。
【０５５１】
図８８は、動きボケ除去部１０６の構成の一例を示すブロック図である。動き検出部１０２から供給された動きベクトルとその位置情報、および領域特定部１０３から供給された領域情報は、処理単位決定部８０１およびモデル化部８０２に供給される。前景背景分離部１０５から供給された前景成分画像は、足し込み部８０４に供給される。
【０５５２】
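Based on the motion vector, its positional information, and the region information, the processing unit determining unit 801 generates a processing unit and supplies the generated processing unit, together with the motion vector, to the modeling unit 802. The processing unit determining unit 801 also supplies the generated processing unit to the adding unit 804.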
【０５５３】
処理単位決定部８０１が生成する処理単位は、図８９に例を示すように、前景成分画像のカバードバックグラウンド領域に対応する画素から始まり、アンカバードバックグラウンド領域に対応する画素までの動き方向に並ぶ連続する画素、またはアンカバードバックグラウンド領域に対応する画素から始まり、カバードバックグラウンド領域に対応する画素までの動き方向に並ぶ連続する画素を示す。処理単位は、例えば、左上点（処理単位で指定される画素であって、画像上で最も左または最も上に位置する画素の位置）および右下点の２つのデータから成る。
【０５５４】
モデル化部８０２は、動きベクトルおよび入力された処理単位を基に、モデル化を実行する。より具体的には、例えば、モデル化部８０２は、処理単位に含まれる画素の数、画素値の時間方向の仮想分割数、および画素毎の前景の成分の数に対応する複数のモデルを予め記憶しておき、処理単位、および画素値の時間方向の仮想分割数を基に、図９０に示すような、画素値と前景の成分との対応を指定するモデルを選択するようにしても良い。
【０５５５】
例えば、処理単位に対応する画素の数が１２でありシャッタ時間内の動き量vが５であるときにおいては、モデル化部８０２は、仮想分割数を５とし、最も左に位置する画素が１つの前景の成分を含み、左から２番目の画素が２つの前景の成分を含み、左から３番目の画素が３つの前景の成分を含み、左から４番目の画素が４つの前景の成分を含み、左から５番目の画素が５つの前景の成分を含み、左から６番目の画素が５つの前景の成分を含み、左から７番目の画素が５つの前景の成分を含み、左から８番目の画素が５つの前景の成分を含み、左から９番目の画素が４つの前景の成分を含み、左から１０番目の画素が３つの前景の成分を含み、左から１１番目の画素が２つの前景の成分を含み、左から１２番目の画素が１つの前景の成分を含み、全体として８つの前景の成分から成るモデルを選択する。
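The component counts listed above follow mechanically from the number of pixels and the amount of movement v. The Python sketch below builds the 0/1 table that relates foreground components to pixels and reproduces those counts. The function names are hypothetical, and the table assumes that each component contributes to v consecutive pixels starting at its own position, ordered from left to right.

```python
def build_model(num_pixels, v):
    """Return a[i][j] = 1 if foreground component i contributes to pixel j.

    The number of foreground components is num_pixels - v + 1.
    """
    num_components = num_pixels - v + 1
    return [[1 if i <= j < i + v else 0 for j in range(num_pixels)]
            for i in range(num_components)]


def components_per_pixel(a):
    """Count how many foreground components each pixel contains."""
    return [sum(column) for column in zip(*a)]
```

With 12 pixels and v equal to 5, `build_model` yields 8 components and `components_per_pixel` returns the sequence 1, 2, 3, 4, 5, 5, 5, 5, 4, 3, 2, 1 described in the text.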
【０５５６】
なお、モデル化部８０２は、予め記憶してあるモデルから選択するのではなく、動きベクトル、および処理単位が供給されたとき、動きベクトル、および処理単位を基に、モデルを生成するようにしてもよい。
【０５５７】
モデル化部８０２は、選択したモデルを方程式生成部８０３に供給する。
【０５５８】
方程式生成部８０３は、モデル化部８０２から供給されたモデルを基に、方程式を生成する。図９０に示す前景成分画像のモデルを参照して、前景の成分の数が８であり、処理単位に対応する画素の数が１２であり、動き量vが５であり、仮想分割数が５であるときの、方程式生成部８０３が生成する方程式について説明する。
【０５５９】
前景成分画像に含まれるシャッタ時間/vに対応する前景成分がF01/v乃至F08/vであるとき、F01/v乃至F08/vと画素値C01乃至C12との関係は、式（７８）乃至式（８９）で表される。
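【０５６０】
C01=F01/v （７８）
C02=F02/v+F01/v （７９）
C03=F03/v+F02/v+F01/v （８０）
C04=F04/v+F03/v+F02/v+F01/v （８１）
C05=F05/v+F04/v+F03/v+F02/v+F01/v （８２）
C06=F06/v+F05/v+F04/v+F03/v+F02/v （８３）
C07=F07/v+F06/v+F05/v+F04/v+F03/v （８４）
C08=F08/v+F07/v+F06/v+F05/v+F04/v （８５）
C09=F08/v+F07/v+F06/v+F05/v （８６）
C10=F08/v+F07/v+F06/v （８７）
C11=F08/v+F07/v （８８）
C12=F08/v （８９）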
【０５６１】
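The equation generating unit 803 transforms the generated equations to produce further equations. The equations generated by the equation generating unit 803 are shown as equations (90) through (101).
C01=1・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v+0・F06/v+0・F07/v+0・F08/v （９０）
C02=1・F01/v+1・F02/v+0・F03/v+0・F04/v+0・F05/v+0・F06/v+0・F07/v+0・F08/v （９１）
C03=1・F01/v+1・F02/v+1・F03/v+0・F04/v+0・F05/v+0・F06/v+0・F07/v+0・F08/v （９２）
C04=1・F01/v+1・F02/v+1・F03/v+1・F04/v+0・F05/v+0・F06/v+0・F07/v+0・F08/v （９３）
C05=1・F01/v+1・F02/v+1・F03/v+1・F04/v+1・F05/v+0・F06/v+0・F07/v+0・F08/v （９４）
C06=0・F01/v+1・F02/v+1・F03/v+1・F04/v+1・F05/v+1・F06/v+0・F07/v+0・F08/v （９５）
C07=0・F01/v+0・F02/v+1・F03/v+1・F04/v+1・F05/v+1・F06/v+1・F07/v+0・F08/v （９６）
C08=0・F01/v+0・F02/v+0・F03/v+1・F04/v+1・F05/v+1・F06/v+1・F07/v+1・F08/v （９７）
C09=0・F01/v+0・F02/v+0・F03/v+0・F04/v+1・F05/v+1・F06/v+1・F07/v+1・F08/v （９８）
C10=0・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v+1・F06/v+1・F07/v+1・F08/v （９９）
C11=0・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v+0・F06/v+1・F07/v+1・F08/v （１００）
C12=0・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v+0・F06/v+0・F07/v+1・F08/v （１０１）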
【０５６２】
式（９０）乃至式（１０１）は、式（１０２）として表すこともできる。
【０５６３】
【数１７】
\[ C_j = \sum_{i=01}^{08} a_{ij}\cdot F_i/v \] （１０２）
式（１０２）において、jは、画素の位置を示す。この例において、jは、１乃至１２のいずれか１つの値を有する。また、iは、前景値の位置を示す。この例において、iは、１乃至８のいずれか１つの値を有する。aijは、iおよびjの値に対応して、０または１の値を有する。
【０５６４】
誤差を考慮して表現すると、式（１０２）は、式（１０３）のように表すことができる。
【０５６５】
【数１８】
\[ C_j = \sum_{i=01}^{08} a_{ij}\cdot F_i/v + e_j \] （１０３）
式（１０３）において、ejは、注目画素Cjに含まれる誤差である。
【０５６６】
式（１０３）は、式（１０４）に書き換えることができる。
【０５６７】
【数１９】
\[ e_j = C_j - \sum_{i=01}^{08} a_{ij}\cdot F_i/v \] （１０４）
【０５６８】
ここで、最小自乗法を適用するため、誤差の自乗和Eを式（１０５）に示すように定義する。
【０５６９】
【数２０】
\[ E = \sum_{j=01}^{12} e_j^2 \] （１０５）
【０５７０】
誤差が最小になるためには、誤差の自乗和Eに対する、変数Fkによる偏微分の値が０になればよい。式（１０６）を満たすようにFkを求める。
【０５７１】
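【数２１】
\[ \frac{\partial E}{\partial F_k} = 2\cdot\sum_{j=01}^{12} e_j\cdot\frac{\partial e_j}{\partial F_k} = 2\cdot\sum_{j=01}^{12}\left\{\left(C_j - \sum_{i=01}^{08} a_{ij}\cdot F_i/v\right)\cdot\left(-a_{kj}/v\right)\right\} = 0 \] （１０６）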
【０５７２】
式（１０６）において、動き量vは固定値であるから、式（１０７）を導くことができる。
【０５７３】
【数２２】
\[ \sum_{j=01}^{12} a_{kj}\cdot\left(C_j - \sum_{i=01}^{08} a_{ij}\cdot F_i/v\right) = 0 \] （１０７）
【０５７４】
式（１０７）を展開して、移項すると、式（１０８）を得る。
【０５７５】
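【数２３】
\[ \sum_{j=01}^{12}\left(a_{kj}\cdot\sum_{i=01}^{08} a_{ij}\cdot F_i\right) = v\cdot\sum_{j=01}^{12} a_{kj}\cdot C_j \] （１０８）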
【０５７６】
式（１０８）のkに１乃至８の整数のいずれか１つを代入して得られる８つの式に展開する。得られた８つの式を、行列により１つの式により表すことができる。この式を正規方程式と呼ぶ。
【０５７７】
このような最小自乗法に基づく、方程式生成部８０３が生成する正規方程式の例を式（１０９）に示す。
【０５７８】
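【数２４】
\[ \begin{pmatrix} 5&4&3&2&1&0&0&0\\ 4&5&4&3&2&1&0&0\\ 3&4&5&4&3&2&1&0\\ 2&3&4&5&4&3&2&1\\ 1&2&3&4&5&4&3&2\\ 0&1&2&3&4&5&4&3\\ 0&0&1&2&3&4&5&4\\ 0&0&0&1&2&3&4&5 \end{pmatrix} \begin{pmatrix} F_{01}\\ F_{02}\\ F_{03}\\ F_{04}\\ F_{05}\\ F_{06}\\ F_{07}\\ F_{08} \end{pmatrix} = v\cdot \begin{pmatrix} \sum_{i=01}^{05} C_i\\ \sum_{i=02}^{06} C_i\\ \sum_{i=03}^{07} C_i\\ \sum_{i=04}^{08} C_i\\ \sum_{i=05}^{09} C_i\\ \sum_{i=06}^{10} C_i\\ \sum_{i=07}^{11} C_i\\ \sum_{i=08}^{12} C_i \end{pmatrix} \] （１０９）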
【０５７９】
式（１０９）をA・F=v・Cと表すと、C,A,vが既知であり、Fは未知である。また、A,vは、モデル化の時点で既知だが、Cは、足し込み動作において画素値を入力することで既知となる。
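As a rough illustration of how A・F=v・C could be assembled and solved, the Python sketch below builds the 0/1 coefficients a_ij, forms both sides according to equation (108), and solves for F by Gauss-Jordan elimination. The function names are hypothetical, and the coefficient layout assumes that foreground value Fi contributes to v consecutive pixels starting at pixel i, counted from the left.

```python
def solve_linear(a, b):
    """Solve a.x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def remove_motion_blur(c, v):
    """Recover foreground values F from blurred pixel values c (one processing unit)."""
    n_pix = len(c)
    n_fg = n_pix - v + 1
    # a[i][j] = 1 when foreground value i contributes to pixel j (assumed layout).
    a = [[1.0 if i <= j < i + v else 0.0 for j in range(n_pix)]
         for i in range(n_fg)]
    # Left side of (108): A[k][i] = sum_j a_kj * a_ij.
    big_a = [[sum(a[k][j] * a[i][j] for j in range(n_pix)) for i in range(n_fg)]
             for k in range(n_fg)]
    # Right side of (108): v * sum_j a_kj * C_j.
    rhs = [v * sum(a[k][j] * c[j] for j in range(n_pix)) for k in range(n_fg)]
    return solve_linear(big_a, rhs)
```

Because A depends only on the model, it could be prepared once per processing-unit length and amount of movement, with only the right side recomputed as pixel values are added.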
【０５８０】
最小自乗法に基づく正規方程式により前景成分を算出することにより、画素Cに含まれている誤差を分散させることができる。
【０５８１】
方程式生成部８０３は、このように生成された正規方程式を足し込み部８０４に供給する。
【０５８２】
足し込み部８０４は、処理単位決定部８０１から供給された処理単位を基に、前景成分画像に含まれる画素値Cを、方程式生成部８０３から供給された行列の式に設定する。足し込み部８０４は、画素値Cを設定した行列を演算部８０５に供給する。
【０５８３】
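The calculating unit 805 calculates the foreground components Fi/v from which motion blur has been removed by processing based on a solution method such as the sweep-out method (Gauss-Jordan elimination), calculates Fi for each integer i from 1 to 8, these being the foreground pixel values with motion blur removed, and outputs a foreground component image with motion blur removed that is made up of the pixel values Fi, an example of which is shown in FIG. 91.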
【０５８４】
なお、図９１に示す動きボケが除去された前景成分画像において、C03乃至C10のそれぞれにF01乃至F08のそれぞれが設定されているのは、画面に対する前景成分画像の位置を変化させないためであり、任意の位置に対応させることができる。
【０５８５】
また、例えば、図９２に示すように、処理単位に対応する画素の数が８であり、動き量vが４であるとき、動きボケ除去部１０６は、式（１１０）に示す行列の式を生成する。
【０５８６】
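【数２５】
\[ \begin{pmatrix} 4&3&2&1&0\\ 3&4&3&2&1\\ 2&3&4&3&2\\ 1&2&3&4&3\\ 0&1&2&3&4 \end{pmatrix} \begin{pmatrix} F_{01}\\ F_{02}\\ F_{03}\\ F_{04}\\ F_{05} \end{pmatrix} = v\cdot \begin{pmatrix} \sum_{i=01}^{04} C_i\\ \sum_{i=02}^{05} C_i\\ \sum_{i=03}^{06} C_i\\ \sum_{i=04}^{07} C_i\\ \sum_{i=05}^{08} C_i \end{pmatrix} \] （１１０）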
【０５８７】
動きボケ除去部１０６は、このように処理単位の長さに対応した数の式を立てて、動きボケの量が調整された画素値であるFiを算出する。同様に、例えば、処理単位に含まれる画素の数が１００あるとき、１００個の画素に対応する式を生成して、Fiを算出する。
【０５８８】
以上のように、動きボケ除去部１０６は、動き量vおよび処理単位に対応して、式を生成し、生成した式に前景成分画像の画素値を設定して、動きボケが除去された前景成分画像を算出する。
【０５８９】
次に、図９３のフローチャートを参照して、動きボケ除去部１０６による前景成分画像に含まれる動きボケの除去の処理を説明する。
【０５９０】
ステップＳ８０１において、動きボケ除去部１０６の処理単位決定部８０１は、動きベクトルおよび領域情報を基に、処理単位を生成し、生成した処理単位をモデル化部８０２に供給する。
【０５９１】
ステップＳ８０２において、動きボケ除去部１０６のモデル化部８０２は、動き量vおよび処理単位に対応して、モデルの選択や生成を行う。ステップＳ８０３において、方程式生成部８０３は、選択されたモデルを基に、正規方程式を作成する。
【０５９２】
ステップＳ８０４において、足し込み部８０４は、作成された正規方程式に前景成分画像の画素値を設定する。ステップＳ８０５において、足し込み部８０４は、処理単位に対応する全ての画素の画素値の設定を行ったか否かを判定し、処理単位に対応する全ての画素の画素値の設定を行っていないと判定された場合、ステップＳ８０４に戻り、正規方程式への画素値の設定の処理を繰り返す。
【０５９３】
ステップＳ８０５において、処理単位の全ての画素の画素値の設定を行ったと判定された場合、ステップＳ８０６に進み、演算部８０５は、足し込み部８０４から供給された画素値が設定された正規方程式を基に、動きボケを除去した前景の画素値を算出して、処理は終了する。
【０５９４】
このように、動きボケ除去部１０６は、動きベクトルおよび領域情報を基に、動きボケを含む前景画像から動きボケを除去することができる。
【０５９５】
すなわち、サンプルデータである画素値に含まれる動きボケを除去することができる。
【０５９６】
次に、補正部１０７による背景成分画像の補正について説明する。
【０５９７】
図９４は、図９０に例を示す前景成分画像のモデルに対応する、背景成分画像のモデルの例を示す図である。
【０５９８】
図９４に示すように、元の入力画像の混合領域に対応する、背景成分画像の画素の画素値は、前景の成分が除去されているので、元の入力画像の背景領域に対応する画素に比較して、混合比αに対応して、少ない数の背景の成分により構成されている。
【０５９９】
例えば、図９４に例を示す背景成分画像において、画素値C01は、４つの背景の成分B02/Vで構成され、画素値C02は、３つの背景の成分B03/Vで構成され、画素値C03は、２つの背景の成分B04/Vで構成され、画素値C04は、１つの背景の成分B05/Vで構成される。
【０６００】
また、図９４に例を示す背景成分画像において、画素値C09は、１つの背景の成分B10/Vで構成され、画素値C10は、２つの背景の成分B11/Vで構成され、画素値C11は、３つの背景の成分B12/Vで構成され、画素値C12は、４つの背景の成分B13/Vで構成される。
【０６０１】
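Because the pixel values of the pixels corresponding to the mixed region of the original input image are thus made up of fewer background components than the pixels corresponding to the background region of the original input image, the image corresponding to the mixed region of the background component image is, for example, darker than the image of the background region.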
【０６０２】
補正部１０７は、このような、背景成分画像の混合領域に対応する画素の画素値のそれぞれに、混合比αに対応する定数を乗じて、背景成分画像の混合領域に対応する画素の画素値を補正する。
【０６０３】
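For example, when the background component image shown in FIG. 94 is input, the correction unit 107 multiplies pixel value C01 by 5/4, pixel value C02 by 5/3, pixel value C11 by 5/3, and pixel value C12 by 5/4. To align the pixel positions with the motion-blur-removed foreground component image shown by example in FIG. 91, the correction unit 107 sets pixel values C03 through C10 to 0.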
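The gains 5/4 and 5/3 in this example equal v divided by the number of background components remaining in the pixel. The Python sketch below applies that rule to one line of pixels. It is only an illustration: the function name is hypothetical, and the zeroed span is assumed to be C03 through C10, the positions to which F01 through F08 are assigned in FIG. 91.

```python
def correct_background_line(values, bg_counts, v, foreground_span):
    """Correct one line of a background component image.

    values: pixel values of the background component image
    bg_counts: number of background components (out of v) in each pixel
    v: amount of movement within the shutter time
    foreground_span: (first, last) 0-based indices occupied by the
        motion-blur-removed foreground; these pixels are set to 0
    """
    first, last = foreground_span
    corrected = []
    for idx, (value, count) in enumerate(zip(values, bg_counts)):
        if first <= idx <= last or count == 0:
            corrected.append(0.0)
        else:
            # Scale up to the level of a pixel made of v background components.
            corrected.append(value * v / count)
    return corrected
```

A pixel holding 4 of 5 background components is scaled by 5/4, which restores the brightness it would have had as a pure background pixel.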
【０６０４】
補正部１０７は、図９５に例を示す、混合領域に対応する画素の画素値を補正した背景成分画像を出力する。
【０６０５】
このように、補正部１０７は、背景成分画像の混合領域に対応する画素の画素値を補正すると共に、動きボケが除去された前景成分画像との画素の位置を整合させる。
【０６０６】
図９６は、空間方向に、より高解像度な画像を生成するクラス分類適応処理において使用される係数セットを生成する動きボケ除去画像処理部１０８の構成を示すブロック図である。例えば、図９６に構成を示す動きボケ除去画像処理部１０８は、入力されたHD画像を基に、SD画像からHD画像を生成するクラス分類適応処理において使用される係数セットを生成する。
【０６０７】
背景成分教師画像フレームメモリ１００１は、補正部１０７から供給された、教師画像の補正された背景成分画像を記憶する。背景成分教師画像フレームメモリ１００１は、記憶している教師画像の背景成分画像を加重平均部１００３−１および学習部１００６−１に供給する。
【０６０８】
前景成分教師画像フレームメモリ１００２は、動きボケ除去部１０６から供給された、教師画像の動きボケが除去された前景成分画像を記憶する。前景成分教師画像フレームメモリ１００２は、記憶している教師画像の前景成分画像を加重平均部１００３−２および学習部１００６−２に供給する。
【０６０９】
加重平均部１００３−１は、背景成分教師画像フレームメモリ１００１から供給された、例えば、HD画像である教師画像の背景成分画像を４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像を背景成分生徒画像フレームメモリ１００４に供給する。
【０６１０】
例えば、加重平均部１００３−１は、図９７に示すように、教師画像の２×２（横×縦）の４つの画素（同図において、白丸で示す部分）を１単位とし、各単位の４つの画素の画素値を加算して、加算された結果を４で除算する。加重平均部１００３−１は、このように、４分の１加重平均された結果を、各単位の中心に位置する生徒画像の画素（同図において、黒丸で示す部分）に設定する。
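The quarter weighted averaging described here amounts to replacing each 2×2 block of the teacher image by its mean. A minimal Python sketch follows, with a hypothetical function name and the image represented as a list of rows. Any odd trailing row or column is simply dropped, which is an assumption the text does not address.

```python
def quarter_average(teacher):
    """Generate a student image by averaging each 2x2 block of the teacher image."""
    height = len(teacher)
    width = len(teacher[0])
    student = []
    for y in range(0, height - 1, 2):
        row = []
        for x in range(0, width - 1, 2):
            total = (teacher[y][x] + teacher[y][x + 1]
                     + teacher[y + 1][x] + teacher[y + 1][x + 1])
            row.append(total / 4)
        student.append(row)
    return student
```

The resulting student image has half the width and half the height of the teacher image.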
【０６１１】
背景成分生徒画像フレームメモリ１００４は、加重平均部１００３−１から供給された、教師画像の背景成分画像に対応する、生徒画像を記憶する。背景成分生徒画像フレームメモリ１００４は、記憶している、教師画像の背景成分画像に対応する生徒画像を学習部１００６−１に供給する。
【０６１２】
加重平均部１００３−２は、前景成分教師画像フレームメモリ１００２から供給された、例えば、HD画像である教師画像の前景成分画像を４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像を前景成分生徒画像フレームメモリ１００５に供給する。
【０６１３】
前景成分生徒画像フレームメモリ１００５は、加重平均部１００３−２から供給された、教師画像の前景成分画像に対応する、SD画像である生徒画像を記憶する。前景成分生徒画像フレームメモリ１００５は、記憶している、教師画像の前景成分画像に対応する生徒画像を学習部１００６−２に供給する。
【０６１４】
学習部１００６−１は、背景成分教師画像フレームメモリ１００１から供給された教師画像の背景成分画像、および背景成分生徒画像フレームメモリ１００４から供給された、教師画像の背景成分画像に対応する生徒画像を基に、背景成分画像に対応する係数セットを生成し、生成した係数セットを係数セットメモリ１００７に供給する。
【０６１５】
学習部１００６−２は、前景成分教師画像フレームメモリ１００２から供給された教師画像の前景成分画像、および前景成分生徒画像フレームメモリ１００５から供給された、教師画像の前景成分画像に対応する生徒画像を基に、前景成分画像に対応する係数セットを生成し、生成した係数セットを係数セットメモリ１００７に供給する。
【０６１６】
係数セットメモリ１００７は、学習部１００６−１から供給された背景成分画像に対応する係数セット、および学習部１００６−２から供給された前景成分画像に対応する係数セットを記憶する。
【０６１７】
以下、学習部１００６−１および学習部１００６−２を個々に区別する必要がないとき、単に学習部１００６と称する。
【０６１８】
図９８は、学習部１００６の構成を示すブロック図である。
【０６１９】
クラス分類部１０３１は、クラスタップ取得部１０５１および波形分類部１０５２で構成され、入力された生徒画像の、注目している画素である、注目画素をクラス分類する。クラスタップ取得部１０５１は、注目画素に対応する、生徒画像の画素である、所定の数のクラスタップを取得し、取得したクラスタップを波形分類部１０５２に供給する。
【０６２０】
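For example, in FIG. 97, let X_ij denote the pixel of the student image (indicated by a black circle in the figure) that is i-th from the top and j-th from the left. The class tap acquiring unit 1051 then acquires a class tap made up of nine pixels in total: the pixel of interest X_ij itself and the eight pixels adjacent to it at the upper left, top, upper right, left, right, lower left, bottom, and lower right, namely X_(i-1)(j-1), X_(i-1)j, X_(i-1)(j+1), X_i(j-1), X_i(j+1), X_(i+1)(j-1), X_(i+1)j, and X_(i+1)(j+1). This class tap is supplied to the waveform classifying unit 1052.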
【０６２１】
なお、この場合、クラスタップは、３×３画素でなる正方形状のブロックで構成されることとなるが、クラス分類用ブロックの形状は、正方形である必要はなく、その他、例えば、長方形や、十文字形、その他の任意な形とすることが可能である。また、クラスタップを構成する画素数も、３×３の９画素に限定されるものではない。
【０６２２】
波形分類部１０５２は、入力信号を、その特徴に基づいていくつかのクラスに分類する、クラス分類処理を実行して、クラスタップを基に、注目画素を１つのクラスに分類する。波形分類部１０５２は、例えば、注目画素を５１２のクラスのうちの１つのクラスに分類し、分類されたクラスに対応するクラス番号を予測タップ取得部１０３２に供給する。
【０６２３】
ここで、クラス分類処理について簡単に説明する。
【０６２４】
いま、例えば、図９９（Ａ）に示すように、ある注目画素と、それに隣接する３つの画素により、２×２画素でなるクラスタップを構成し、また、各画素は、１ビットで表現される（０または１のうちのいずれかのレベルをとる）ものとする。この場合、注目画素を含む２×２の４画素のブロックは、各画素のレベル分布により、図９９（Ｂ）に示すように、１６（＝（２¹）⁴）パターンに分類することができる。従って、いまの場合、注目画素は、１６のパターンに分類することができ、このようなパターン分けが、クラス分類処理であり、クラス分類部１０３１において行われる。
【０６２５】
なお、クラス分類処理は、画像（クラスタップ）のアクティビティ（画像の複雑さ）（変化の激しさ）などをも考慮して行うようにすることが可能である。
【０６２６】
ここで、通常、各画素には、例えば８ビット程度が割り当てられる。また、本実施の形態においては、上述したように、クラスタップは、３×３の９画素で構成される。従って、このようなクラスタップを対象にクラス分類処理を行ったのでは、（２⁸）⁹という膨大な数のクラスに分類されることになる。
【０６２７】
そこで、本実施の形態においては、波形分類部１０５２において、クラスタップに対して、ＡＤＲＣ処理が施され、これにより、クラスタップを構成する画素のビット数を小さくすることで、クラス数を削減する。
【０６２８】
説明を簡単にするため、図１００（Ａ）に示すように、直線上に並んだ４画素で構成されるクラスタップを考えると、ＡＤＲＣ処理においては、その画素値の最大値ＭＡＸと最小値ＭＩＮが検出される。そして、ＤＲ＝ＭＡＸ−ＭＩＮを、クラスタップで構成されるブロックの局所的なダイナミックレンジとし、このダイナミックレンジＤＲに基づいて、クラスタップのブロックを構成する画素の画素値がＫビットに再量子化される。
【０６２９】
即ち、ブロック内の各画素値から、最小値ＭＩＮを減算し、その減算値をＤＲ／２^Kで除算する。そして、その結果得られる除算値に対応するコード（ＡＤＲＣコード）に変換される。具体的には、例えば、Ｋ＝２とした場合、図１００（Ｂ）に示すように、除算値が、ダイナミックレンジＤＲを４（＝２²）等分して得られるいずれの範囲に属するかが判定され、除算値が、最も下のレベルの範囲、下から２番目のレベルの範囲、下から３番目のレベルの範囲、または最も上のレベルの範囲に属する場合には、それぞれ、例えば、００Ｂ，０１Ｂ，１０Ｂ、または１１Ｂなどの２ビットにコード化される（Ｂは２進数であることを表す）。そして、復号側においては、ＡＤＲＣコード００Ｂ，０１Ｂ，１０Ｂ、または１１Ｂは、ダイナミックレンジＤＲを４等分して得られる最も下のレベルの範囲の中心値Ｌ₀₀、下から２番目のレベルの範囲の中心値Ｌ₀₁、下から３番目のレベルの範囲の中心値Ｌ₁₀、または最も上のレベルの範囲の中心値Ｌ₁₁に変換され、その値に、最小値ＭＩＮが加算されることで復号が行われる。
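The requantization step can be sketched in Python as below. The function names are hypothetical, a flat block (DR equal to 0) is mapped to all-zero codes as an assumption, and packing the codes into a single class number is shown as one plausible way to derive a class from the codes rather than the method the text prescribes.

```python
def adrc_codes(taps, k):
    """Requantize class-tap pixel values to k-bit ADRC codes."""
    lo = min(taps)
    dr = max(taps) - lo          # local dynamic range DR = MAX - MIN
    levels = 1 << k
    if dr == 0:
        return [0] * len(taps)   # assumption: flat block gets code 0
    step = dr / levels           # DR / 2^K
    # The maximum value would land on index 2^K, so clamp it to the top range.
    return [min(int((t - lo) / step), levels - 1) for t in taps]


def class_number(codes, k):
    """Pack the k-bit codes into one integer class number."""
    number = 0
    for code in codes:
        number = (number << k) | code
    return number
```

With nine taps and K equal to 1 this packing gives 2^9 = 512 possible class numbers, which matches the 512 classes mentioned for the waveform classifying unit, although the text does not state the value of K used there.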
【０６３０】
ここで、このようなＡＤＲＣ処理はノンエッジマッチングと呼ばれる。
【０６３１】
なお、ＡＤＲＣ処理については、本件出願人が先に出願した、例えば、特開平３−５３７７８号公報などに、その詳細が開示されている。
【０６３２】
クラスタップを構成する画素に割り当てられているビット数より少ないビット数で再量子化を行うＡＤＲＣ処理を施すことにより、上述したように、クラス数を削減することができ、このようなＡＤＲＣ処理が、波形分類部１０５２において行われる。
【０６３３】
なお、本実施の形態では、波形分類部１０５２において、ＡＤＲＣコードに基づいて、クラス分類処理が行われるが、クラス分類処理は、その他、例えば、ＤＰＣＭ（予測符号化）や、ＢＴＣ（Block Truncation Coding）、ＶＱ（ベクトル量子化）、ＤＣＴ（離散コサイン変換）、アダマール変換などを施したデータを対象に行うようにすることも可能である。
【０６３４】
予測タップ取得部１０３２は、クラス番号を基に、生徒画像の画素から、クラスに対応し、元の画像（教師画像）の予測値を計算するための単位である、予測タップを取得し、取得した予測タップおよびクラス番号を対応画素取得部１０３３に供給する。
【０６３５】
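For example, in FIG. 97, let the pixel values of the 2×2 block of four pixels in the original image (teacher image) centered on the student-image pixel X_ij (indicated by a black circle in the figure) be denoted Y_ij(1), Y_ij(2), Y_ij(3), and Y_ij(4), ordered from left to right and from top to bottom. To calculate the coefficients needed to compute the predicted values of pixels Y_ij(1) through Y_ij(4), the prediction tap acquiring unit 1032 acquires, for example, a square prediction tap made up of the 3×3 block of nine pixels centered on the student-image pixel X_ij: X_(i-1)(j-1), X_(i-1)j, X_(i-1)(j+1), X_i(j-1), X_ij, X_i(j+1), X_(i+1)(j-1), X_(i+1)j, and X_(i+1)(j+1).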
【０６３６】
具体的には、例えば、図９７において四角形で囲む、教師画像における画素Ｙ₃₃（１）乃至Ｙ₃₃（４）の４画素の予測値の計算に必要な係数を算出するには、画素Ｘ₂₂，Ｘ₂₃，Ｘ₂₄，Ｘ₃₂，Ｘ₃₃，Ｘ₃₄，Ｘ₄₂，Ｘ₄₃，Ｘ₄₄により、予測タップが構成される（この場合の注目画素は、Ｘ₃₃となる）。
【０６３７】
対応画素取得部１０３３は、予測タップおよびクラス番号を基に、予測すべき画素値に対応する教師画像の画素の画素値を取得し、予測タップ、クラス番号、および取得した予測すべき画素値に対応する教師画像の画素の画素値を正規方程式生成部１０３４に供給する。
【０６３８】
例えば、対応画素取得部１０３３は、教師画像における画素Ｙ₃₃（１）乃至Ｙ₃₃（４）の４画素の予測値の計算に必要な係数を算出するとき、予測すべき画素値に対応する教師画像の画素として、画素Ｙ₃₃（１）乃至Ｙ₃₃（４）の画素値を取得する。
【０６３９】
正規方程式生成部１０３４は、予測タップ、クラス番号、および取得した予測すべき画素値を基に、予測タップおよび予測すべき画素値の関係に対応する、適応処理において使用される係数セットを算出するための正規方程式を生成し、クラス番号と共に、生成した正規方程式を係数計算部１０３５に供給する。
【０６４０】
係数計算部１０３５は、正規方程式生成部１０３４から供給された正規方程式を解いて、分類されたクラスに対応する、適応処理において使用される係数セットを計算する。係数計算部１０３５は、クラス番号と共に、計算した係数セットを係数セットメモリ１００７に供給する。
【０６４１】
正規方程式生成部１０３４は、このような正規方程式に対応する行列を生成し、係数計算部１０３５は、生成された行列を基に、係数セットを計算するようにしてもよい。
【０６４２】
ここで、適応処理について説明する。
【０６４３】
例えば、いま、教師画像の画素値ｙの予測値Ｅ［ｙ］を、その周辺の幾つかの画素の画素値（以下、適宜、生徒データという）ｘ₁，ｘ₂，・・・と、所定の予測係数ｗ₁，ｗ₂，・・・の線形結合により規定される線形１次結合モデルにより求めることを考える。この場合、予測値Ｅ［ｙ］は、次式で表すことができる。
【０６４４】
Ｅ［ｙ］＝ｗ₁ｘ₁＋ｗ₂ｘ₂＋・・・（１１１）
【０６４５】
そこで、一般化するために、予測係数ｗの集合でなる行列Ｗ、生徒データの集合でなる行列Ｘ、および予測値Ｅ［ｙ］の集合でなる行列Ｙ’を、
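【数２６】
\[ X = \begin{pmatrix} x_{11} & x_{12} & \cdots & x_{1n}\\ x_{21} & x_{22} & \cdots & x_{2n}\\ \vdots & \vdots & \ddots & \vdots\\ x_{m1} & x_{m2} & \cdots & x_{mn} \end{pmatrix},\quad W = \begin{pmatrix} w_1\\ w_2\\ \vdots\\ w_n \end{pmatrix},\quad Y' = \begin{pmatrix} E[y_1]\\ E[y_2]\\ \vdots\\ E[y_m] \end{pmatrix} \]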
で定義すると、次のような観測方程式が成立する。
【０６４６】
ＸＷ＝Ｙ’ （１１２）
【０６４７】
そして、この観測方程式に最小自乗法を適用して、元の画像の画素値ｙに近い予測値Ｅ［ｙ］を求めることを考える。この場合、元の画像の画素値（以下、適宜、教師データという）ｙの集合でなる行列Ｙ、および元の画像の画素値ｙに対する予測値Ｅ［ｙ］の残差ｅの集合でなる行列Ｅを、
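【数２７】
\[ E = \begin{pmatrix} e_1\\ e_2\\ \vdots\\ e_m \end{pmatrix},\quad Y = \begin{pmatrix} y_1\\ y_2\\ \vdots\\ y_m \end{pmatrix} \]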
で定義すると、式（１１２）から、次のような残差方程式が成立する。
【０６４８】
ＸＷ＝Ｙ＋Ｅ（１１３）
【０６４９】
この場合、元の画像の画素値ｙに近い予測値Ｅ［ｙ］を求めるための予測係数ｗ_iは、自乗誤差
【数２８】
\[ \sum_{i=1}^{m} e_i^2 \]
を最小にすることで求めることができる。
【０６５０】
従って、上述の自乗誤差を予測係数ｗ_iで微分したものが０になる場合、即ち、次式を満たす予測係数ｗ_iが、元の画像の画素値ｙに近い予測値Ｅ［ｙ］を求めるため最適値ということになる。
【０６５１】
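【数２９】
\[ e_1\frac{\partial e_1}{\partial w_i} + e_2\frac{\partial e_2}{\partial w_i} + \cdots + e_m\frac{\partial e_m}{\partial w_i} = 0 \quad (i = 1, 2, \ldots, n) \] （１１４）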
【０６５２】
そこで、まず、式（１１３）を、予測係数ｗ_iで微分することにより、次式が成立する。
【０６５３】
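【数３０】
\[ \frac{\partial e_i}{\partial w_1} = x_{i1},\quad \frac{\partial e_i}{\partial w_2} = x_{i2},\quad \ldots,\quad \frac{\partial e_i}{\partial w_n} = x_{in} \quad (i = 1, 2, \ldots, m) \] （１１５）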
【０６５４】
式（１１４）および（１１５）より、式（１１６）が得られる。
【０６５５】
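【数３１】
\[ \sum_{i=1}^{m} e_i x_{i1} = 0,\quad \sum_{i=1}^{m} e_i x_{i2} = 0,\quad \ldots,\quad \sum_{i=1}^{m} e_i x_{in} = 0 \] （１１６）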
【０６５６】
さらに、式（１１３）の残差方程式における生徒データｘ、予測係数ｗ、教師データｙ、および残差ｅの関係を考慮すると、式（１１６）から、次のような正規方程式を得ることができる。
【０６５７】
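【数３２】
\[ \begin{cases} \left(\sum_{i=1}^{m} x_{i1}x_{i1}\right)w_1 + \left(\sum_{i=1}^{m} x_{i1}x_{i2}\right)w_2 + \cdots + \left(\sum_{i=1}^{m} x_{i1}x_{in}\right)w_n = \sum_{i=1}^{m} x_{i1}y_i\\ \left(\sum_{i=1}^{m} x_{i2}x_{i1}\right)w_1 + \left(\sum_{i=1}^{m} x_{i2}x_{i2}\right)w_2 + \cdots + \left(\sum_{i=1}^{m} x_{i2}x_{in}\right)w_n = \sum_{i=1}^{m} x_{i2}y_i\\ \qquad\vdots\\ \left(\sum_{i=1}^{m} x_{in}x_{i1}\right)w_1 + \left(\sum_{i=1}^{m} x_{in}x_{i2}\right)w_2 + \cdots + \left(\sum_{i=1}^{m} x_{in}x_{in}\right)w_n = \sum_{i=1}^{m} x_{in}y_i \end{cases} \] （１１７）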
【０６５８】
式（１１７）の正規方程式は、求めるべき予測係数ｗの数と同じ数だけたてることができ、従って、式（１１７）を解くことで、最適な予測係数ｗを求めることができる。なお、式（１１７）を解くにあたっては、例えば、掃き出し法（Gauss-Jordanの消去法）などを適用することが可能である。
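The accumulation and solution of the normal equation for one class can be sketched in Python as follows. This is a minimal illustration with hypothetical function names. It accumulates the sums of products of student data and teacher data sample by sample, then solves for the prediction coefficients w by Gauss-Jordan elimination.

```python
def solve_linear(a, b):
    """Solve a.x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def learn_coefficients(samples):
    """Learn prediction coefficients for one class.

    samples: list of (student_taps, teacher_value) pairs of that class
    """
    n = len(samples[0][0])
    xtx = [[0.0] * n for _ in range(n)]   # sums of x_ia * x_ib
    xty = [0.0] * n                       # sums of x_ia * y_i
    for taps, y in samples:
        for a in range(n):
            xty[a] += taps[a] * y
            for b in range(n):
                xtx[a][b] += taps[a] * taps[b]
    return solve_linear(xtx, xty)


def predict(taps, w):
    """Linear first-order prediction E[y] = w1*x1 + w2*x2 + ..."""
    return sum(wi * xi for wi, xi in zip(w, taps))
```

A class needs at least as many linearly independent samples as it has coefficients; otherwise the accumulated matrix is singular and this simple solver fails with a division by zero.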
【０６５９】
以上のようにして、クラスごとに最適な予測係数ｗを求め、さらに、その予測係数ｗを用い、式（１１１）により、教師画像の画素値ｙに近い予測値Ｅ［ｙ］を求めるのが適応処理である。
【０６６０】
正規方程式生成部１０３４は、クラスごとに最適な予測係数ｗを算出するための正規方程式を生成し、係数計算部１０３５は、生成された正規方程式を基に、予測係数ｗを算出する。
【０６６１】
なお、適応処理は、間引かれた画像には含まれていない、元の画像に含まれる成分が再現される点で、補間処理とは異なる。即ち、適応処理は、式（１１１）だけを見る限りは、いわゆる補間フィルタを用いての補間処理と同一であるが、その補間フィルタのタップ係数に相当する予測係数ｗが、教師データｙを用いての、いわば学習により求められるため、元の画像に含まれる成分を再現することができる。このことから、適応処理は、いわば画像の創造作用がある処理ということができる。
【０６６２】
図１０１は、図９６に構成を示す動きボケ除去画像処理部１０８が生成する係数セットを説明する図である。入力画像は、領域特定部１０３により、前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域が特定される。
【０６６３】
領域が特定され、混合比算出部１０４により混合比αが検出された入力画像は、前景背景分離部１０５により、前景成分画像、および背景成分画像に分離される。
【０６６４】
分離された前景成分画像は、動きボケ除去部１０６により、動きボケが除去される。分離された背景成分画像の混合領域に対応する画素値は、補正部１０７により、前景成分画像の動きボケの除去に対応して補正される。
【０６６５】
動きボケ除去画像処理部１０８は、動きボケが除去された前景成分画像、および補正された背景成分画像を基に、前景成分画像に対応する係数セット、および背景成分画像に対応する係数セットを個々に算出する。
【０６６６】
すなわち、学習部１００６−１は、分離され、補正された背景成分画像を基に、背景成分画像に対応する係数セットを算出し、学習部１００６−２は、分離され、動きボケが除去された前景成分画像を基に、前景成分画像に対応する係数セットを算出する。
【０６６７】
背景成分画像に対応する係数セットは、分離され、補正された背景成分画像に適用する、画素値を予測するクラス分類適応処理において、背景成分画像に対応する画像の画素値の予測に使用される。
【０６６８】
前景成分画像に対応する係数セットは、入力画像から分離され、動きボケが除去された前景成分画像に適用する、画素値を予測するクラス分類適応処理において、前景成分画像に対応する画像の画素値の予測に使用される。
【０６６９】
前景成分画像に対応する予測画像は、動きボケが付加される。背景成分画像に対応する予測画像は、前景成分画像の動きボケの付加に対応して、補正される。
【０６７０】
補正された背景成分画像に対応する予測画像、および動きボケが付加された前景成分画像に対応する予測画像は、合成され、１つの予測画像とされる。
【０６７１】
図１０２のフローチャートを参照して、図９６に構成を示す動きボケ除去画像処理部１０８による、クラス分類適応処理による画素値の予測に使用される係数セットを生成する学習の処理を説明する。
【０６７２】
ステップＳ１００１において、加重平均部１００３−１および加重平均部１００３−２は、背景成分画像に対応する生徒画像、および前景成分画像に対応する生徒画像を生成する。すなわち、加重平均部１００３−１は、背景成分教師画像フレームメモリ１００１に記憶されている、教師画像の背景成分画像を、例えば、４分の１加重平均して、教師画像の背景成分画像に対応する生徒画像を生成する。
【０６７３】
加重平均部１００３−２は、前景成分教師画像フレームメモリ１００２に記憶されている、教師画像の前景成分画像を、例えば、４分の１加重平均して、教師画像の前景成分画像に対応する生徒画像を生成する。
【０６７４】
ステップＳ１００２において、学習部１００６−１は、背景成分教師画像フレームメモリ１００１に記憶されている教師画像の背景成分画像、および背景成分生徒画像フレームメモリ１００４に記憶されている、教師画像の背景成分画像に対応する生徒画像を基に、背景成分画像に対応する係数セットを生成する。ステップＳ１００２における係数セットの生成の処理の詳細は、図１０３のフローチャートを参照して後述する。
【０６７５】
ステップＳ１００３において、学習部１００６−２は、前景成分教師画像フレームメモリ１００２に記憶されている教師画像の前景成分画像、および前景成分生徒画像フレームメモリ１００５に記憶されている、教師画像の前景成分画像に対応する生徒画像を基に、前景成分画像に対応する係数セットを生成する。
【０６７６】
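In step S1004, the learning units 1006-1 and 1006-2 output the coefficient set corresponding to the background component image and the coefficient set corresponding to the foreground component image, respectively, to the coefficient set memory 1007. The coefficient set memory 1007 stores the coefficient set corresponding to the background component image and the coefficient set corresponding to the foreground component image separately, and the processing ends.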
【０６７７】
このように、図９６に構成を示す動きボケ除去画像処理部１０８は、背景成分画像に対応する係数セット、および前景成分画像に対応する係数セットを生成することができる。
【０６７８】
なお、ステップＳ１００２およびステップＳ１００３の処理を、シリアルに実行しても、パラレルに実行しても良いことは勿論である。
【０６７９】
次に、図１０３のフローチャートを参照して、ステップＳ１００２の処理に対応する、学習部１００６−１が実行する背景成分画像に対応する係数セットの生成の処理を説明する。
【０６８０】
ステップＳ１０２１において、学習部１００６−１は、背景成分画像に対応する生徒画像に未処理の画素があるか否かを判定し、背景成分画像に対応する生徒画像に未処理の画素があると判定された場合、ステップＳ１０２２に進み、ラスタースキャン順に、背景成分画像に対応する生徒画像から注目画素を取得する。
【０６８１】
ステップＳ１０２３において、クラス分類部１０３１のクラスタップ取得部１０５１は、背景成分生徒画像フレームメモリ１００４に記憶されている生徒画像から、注目画素に対応するクラスタップを取得する。ステップＳ１０２４において、クラス分類部１０３１の波形分類部１０５２は、クラスタップに対して、ＡＤＲＣ処理を適用し、これにより、クラスタップを構成する画素のビット数を小さくして、注目画素をクラス分類する。ステップＳ１０２５において、予測タップ取得部１０３２は、分類されたクラスを基に、背景成分生徒画像フレームメモリ１００４に記憶されている生徒画像から、注目画素に対応する予測タップを取得する。
【０６８２】
ステップＳ１０２６において、対応画素取得部１０３３は、分類されたクラスを基に、背景成分教師画像フレームメモリ１００１に記憶されている教師画像の背景成分画像から、予測すべき画素値に対応する画素を取得する。
【０６８３】
ステップＳ１０２７において、正規方程式生成部１０３４は、分類されたクラスを基に、クラス毎の行列に、予測タップおよび予測すべき画素値に対応する画素の画素値を足し込み、ステップＳ１０２１に戻り、学習部１００６−１は、未処理の画素があるか否かの判定を繰り返す。予測タップおよび予測すべき画素値に対応する画素の画素値を足し込まれるクラス毎の行列は、クラス毎の係数セットを計算するための正規方程式に対応している。
【０６８４】
ステップＳ１０２１において、生徒画像に未処理の画素がないと判定された場合、ステップＳ１０２８に進み、正規方程式生成部１０３４は、予測タップおよび予測すべき画素値に対応する画素の画素値が設定された、クラス毎の行列を係数計算部１０３５に供給する。係数計算部１０３５は、予測タップおよび予測すべき画素値に対応する画素の画素値が設定された、クラス毎の行列を解いて、背景成分画像に対応する、クラス毎の係数セットを計算する。
【０６８５】
なお、係数計算部１０３５は、線形予測により画素値を予測するための係数セットに限らず、非線形予測により画素値を予測するための係数セットを計算するようにしてもよい。
【０６８６】
ステップＳ１０２９において、係数計算部１０３５は、背景成分画像に対応する、クラス毎の係数セットを係数セットメモリ１００７に出力し、処理は終了する。
【０６８７】
このように、学習部１００６−１は、背景成分画像に対応する係数セットを生成することができる。
【０６８８】
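The processing by the learning unit 1006-2 for generating the coefficient set corresponding to the foreground component image, which corresponds to step S1003, is the same as the processing described with reference to the flowchart of FIG. 103, except that it uses the foreground component image stored in the foreground component teacher image frame memory 1002 and the student image corresponding to the foreground component image stored in the foreground component student image frame memory 1005, so its description is omitted.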
【０６８９】
このように、図９６に構成を示す動きボケ除去画像処理部１０８は、補正された背景成分画像に対応する係数セット、および動きボケが除去された前景成分画像に対応する係数セットを個々に生成することができる。
【０６９０】
図１０４は、クラス分類適応処理を実行して、空間方向に、より高解像度な画像を生成する動きボケ除去画像処理部１０８の構成を示すブロック図である。例えば、図１０４に構成を示す動きボケ除去画像処理部１０８は、SD画像である入力画像を基に、クラス分類適応処理を実行して、HD画像を生成する。
【０６９１】
背景成分画像フレームメモリ１１０１は、補正部１０７から供給された、補正された背景成分画像を記憶する。背景成分画像フレームメモリ１１０１は、記憶している背景成分画像をマッピング部１１０３−１に供給する。
【０６９２】
前景成分画像フレームメモリ１１０２は、動きボケ除去部１０６から供給された、動きボケが除去された前景成分画像を記憶する。前景成分画像フレームメモリ１１０２は、記憶している前景成分画像をマッピング部１１０３−２に供給する。
【０６９３】
マッピング部１１０３−１は、係数セットメモリ１１０４に記憶されている、背景成分画像に対応する係数セットを基に、クラス分類適応処理により、背景成分画像フレームメモリ１１０１に記憶されている背景成分画像に対応する予測画像を生成する。マッピング部１１０３−１は、生成した予測画像を補正部１１０５に供給する。
【０６９４】
補正部１１０５は、動きボケ付加部１１０６が付加する動きボケに対応して、背景成分画像の混合領域に対応する、予測画像の所定の画素の画素値に0を設定するか、または付加される動きボケに対応する所定の値で、予測画像の所定の画素の画素値を除算する。補正部１１０５は、このように補正された予測画像を合成部１１０７に供給する。
【０６９５】
マッピング部１１０３−２は、係数セットメモリ１１０４に記憶されている、前景成分画像に対応する係数セットを基に、クラス分類適応処理により、前景成分画像フレームメモリ１１０２に記憶されている前景成分画像に対応する予測画像を生成する。マッピング部１１０３−２は、生成した予測画像を動きボケ付加部１１０６に供給する。
【０６９６】
動きボケ付加部１１０６は、所望の動きボケ調整量v'、例えば、入力画像の動き量vの半分の値の動きボケ調整量v'や、動き量vと無関係の値の動きボケ調整量v'を与えることで、予測画像に動きボケを付加する。動きボケ付加部１１０６は、動きボケが除去された前景成分画像の予測画像の画素値Fiを動きボケ調整量v'で除すことにより、前景の成分Fi/v'を算出して、前景の成分Fi/v'の和を算出して、動きボケが付加された画素値を生成する。
【０６９７】
例えば、図１０５に示す予測画像が入力され、動きボケ調整量v'が３のとき、図１０６に示すように、画素値C02は、（F01）/v'とされ、画素値C03は、（F01+F02）/v'とされ、画素値C04は、（F01+F02+F03）/v'とされ、画素値C05は、（F02+F03+F04）/v'とされる。
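The addition of motion blur in this example is a running sum of up to v' neighbouring blur-free values divided by v'. A minimal Python sketch follows. The function name is hypothetical, and the output is assumed to start at the first pixel that receives a foreground component (C02 in the example).

```python
def add_motion_blur(f, v_adj):
    """Blur foreground values F01, F02, ... with motion blur adjustment amount v'.

    Output pixel j is the sum of the values F(j-v'+1) .. F(j) that exist,
    divided by v'. The output is len(f) + v' - 1 pixels long.
    """
    n_out = len(f) + v_adj - 1
    blurred = []
    for j in range(n_out):
        first = max(0, j - v_adj + 1)
        last = min(len(f), j + 1)
        blurred.append(sum(f[first:last]) / v_adj)
    return blurred
```

For four values F01 through F04 and v' equal to 3, the first four outputs are F01/v', (F01+F02)/v', (F01+F02+F03)/v', and (F02+F03+F04)/v', in agreement with the pixel values C02 through C05 given above.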
【０６９８】
動きボケ付加部１１０６は、このように動きボケを付加した、前景成分画像の予測画像を合成部１１０７に供給する。
【０６９９】
合成部１１０７は、補正部１１０５から供給された、補正された背景成分画像に対応する予測画像、および動きボケ付加部１１０６から供給された、動きボケが付加された前景成分画像に対応する予測画像を合成し、合成された予測画像をフレームメモリ１１０８に供給する。
【０７００】
フレームメモリ１１０８は、合成部１１０７から供給された予測画像を記憶すると共に、記憶している画像を出力画像として出力する。
【０７０１】
以下、マッピング部１１０３−１およびマッピング部１１０３−２を個々に区別する必要がないとき、単にマッピング部１１０３と称する。
【０７０２】
図１０７は、マッピング部１１０３の構成を示すブロック図である。
【０７０３】
マッピング処理部１１３１は、クラス分類処理を実行するクラス分類部１１４１、並びに適応処理を実行する予測タップ取得部１１４２および予測演算部１１４３で構成されている。
【０７０４】
クラス分類部１１４１は、クラスタップ取得部１１５１および波形分類部１１５２で構成され、背景成分画像、または前景成分画像のいずれか一方の入力画像の、注目している画素である、注目画素をクラス分類する。
【０７０５】
クラスタップ取得部１１５１は、入力画像の注目画素に対応する、所定の数のクラスタップを取得し、取得したクラスタップを波形分類部１１５２に供給する。例えば、クラスタップ取得部１１５１は、９個のクラスタップを取得し、取得したクラスタップを波形分類部１１５２に供給する。
【０７０６】
波形分類部１１５２は、クラスタップに対して、ＡＤＲＣ処理を適用し、これにより、クラスタップを構成する画素のビット数を小さくして、注目画素を所定の数のクラスのうちの１つのクラスに分類し、分類されたクラスに対応するクラス番号を予測タップ取得部１１４２に供給する。例えば、波形分類部１１５２は、注目画素を５１２のクラスのうちの１つのクラスに分類し、分類されたクラスに対応するクラス番号を予測タップ取得部１１４２に供給する。
【０７０７】
予測タップ取得部１１４２は、クラス番号を基に、入力画像から、クラスに対応する、所定の数の予測タップを取得し、取得した予測タップおよびクラス番号を予測演算部１１４３に供給する。
【０７０８】
予測演算部１１４３は、クラス番号を基に、係数セットメモリ１１０４に記憶されている背景成分画像に対応する係数セット、および前景成分画像に対応する係数セットから、入力画像に対応し、クラスに対応する係数セットを取得する。
予測演算部１１４３は、入力画像に対応し、クラスに対応する係数セット、および予測タップを基に、線形予測により予測画像の画素値を予測する。予測演算部１１４３は、予測した画素値をフレームメモリ１１３２に供給する。
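The lookup and linear prediction performed by the prediction calculating unit 1143 can be sketched in Python as below. The function name, the dictionary used as a stand-in for the coefficient set memory 1104, and the "background"/"foreground" keys are all hypothetical.

```python
def predict_pixel(component, class_no, taps, coefficient_memory):
    """Predict one output pixel value by linear prediction.

    component: "background" or "foreground" (which coefficient sets to use)
    class_no: class number from the waveform classification
    taps: prediction-tap pixel values from the input image
    coefficient_memory: dict mapping (component, class_no) to a coefficient list
    """
    weights = coefficient_memory[(component, class_no)]
    if len(weights) != len(taps):
        raise ValueError("coefficient set and prediction tap differ in length")
    return sum(w * x for w, x in zip(weights, taps))
```

Keeping separate coefficient sets for the background and foreground component images means the same class number can map to different coefficients depending on which image is being processed.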
【０７０９】
Note that the prediction calculating unit 1143 may instead predict the pixel values of the predicted image by non-linear prediction.
【０７１０】
フレームメモリ１１３２は、マッピング処理部１１３１から供給された、予測された画素値を記憶し、予測された画素値からなる画像を出力する。
【０７１１】
次に、図１０８のフローチャートを参照して、図１０４に構成を示す動きボケ除去画像処理部１０８の画像の創造の処理を説明する。
【０７１２】
ステップＳ１１０１において、マッピング部１１０３−１は、係数セットメモリ１１０４に記憶されている、背景成分画像に対応する係数セットを基に、クラス分類適応処理により、背景成分画像フレームメモリ１１０１に記憶されている背景成分画像に対応する画像を予測する。背景成分画像に対応する画像の予測の処理の詳細は、図１０９のフローチャートを参照して後述する。
【０７１３】
ステップＳ１１０２において、マッピング部１１０３−２は、係数セットメモリ１１０４に記憶されている、前景成分画像に対応する係数セットを基に、クラス分類適応処理により、前景成分画像フレームメモリ１１０２に記憶されている前景成分画像に対応する画像を予測する。
【０７１４】
ステップＳ１１０３において、補正部１１０５は、背景成分画像に対応する予測された画像を補正する。
【０７１５】
ステップＳ１１０４において、動きボケ付加部１１０６は、前景成分画像に対応する予測された画像に動きボケを付加する。
【０７１６】
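In step S1105, the synthesizing unit 1107 synthesizes the predicted image corresponding to the background component image and the predicted image corresponding to the foreground component image. The synthesizing unit 1107 supplies the synthesized image to the frame memory 1108. The frame memory 1108 stores the image supplied from the synthesizing unit 1107.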
【０７１７】
ステップＳ１１０６において、フレームメモリ１１０８は、記憶している、合成された画像を出力し、処理は終了する。
【０７１８】
このように、図１０４に構成を示す動きボケ除去画像処理部１０８を有する画像処理装置は、背景成分画像に対応する予測画像を生成し、動きボケが除去された前景成分画像に対応する予測画像を個々に生成することができる。
【０７１９】
なお、ステップＳ１１０１およびステップＳ１１０２の処理を、シリアルに実行しても、パラレルに実行しても良いことは勿論である。
【０７２０】
図１０９のフローチャートを参照して、ステップＳ１１０１に対応する、マッピング部１１０３−１による背景成分画像に対応する画像の予測の処理を説明する。
【０７２１】
ステップＳ１１２１において、マッピング部１１０３−１は、背景成分画像に未処理の画素があるか否かを判定し、背景成分画像に未処理の画素があると判定された場合、ステップＳ１１２２に進み、マッピング処理部１１３１は、係数セットメモリ１１０４に記憶されている、背景成分画像に対応する係数セットを取得する。ステップＳ１１２３において、マッピング処理部１１３１は、ラスタースキャン順に、背景成分画像フレームメモリ１１０１に記憶されている背景成分画像から注目画素を取得する。
【０７２２】
ステップＳ１１２４において、クラス分類部１１４１のクラスタップ取得部１１５１は、背景成分画像フレームメモリ１１０１に記憶されている背景成分画像から、注目画素に対応するクラスタップを取得する。ステップＳ１１２５において、クラス分類部１１４１の波形分類部１１５２は、クラスタップに対して、ＡＤＲＣ処理を適用し、これにより、クラスタップを構成する画素のビット数を小さくして、注目画素をクラス分類する。ステップＳ１１２６において、予測タップ取得部１１４２は、分類されたクラスを基に、背景成分画像フレームメモリ１１０１に記憶されている背景成分画像から、注目画素に対応する予測タップを取得する。
【０７２３】
ステップＳ１１２７において、予測演算部１１４３は、背景成分画像および分類されたクラスに対応する係数セット、および予測タップを基に、線形予測により、予測画像の画素値を予測する。
【０７２４】
なお、予測演算部１１４３は、線形予測に限らず、非線形予測により予測画像の画素値を予測するようにしてもよい。
【０７２５】
ステップＳ１１２８において、予測演算部１１４３は、予測された画素値をフレームメモリ１１３２に出力する。フレームメモリ１１３２は、予測演算部１１４３から供給された画素値を記憶する。手続きは、ステップＳ１１２１に戻り、未処理の画素があるか否かの判定を繰り返す。
【０７２６】
ステップＳ１１２１において、背景成分画像に未処理の画素がないと判定された場合、ステップＳ１１２９に進み、フレームメモリ１１３２は、記憶されている背景成分画像に対応する予測画像を出力して、処理は終了する。
【０７２７】
このように、マッピング部１１０３−１は、補正された背景成分画像を基に、背景成分画像に対応する画像を予測することができる。
【０７２８】
ステップＳ１１０２に対応する、マッピング部１１０３−２による、前景成分画像に対応する予測画像の生成の処理は、前景成分画像フレームメモリ１１０２に記憶されている前景成分画像、および前景成分画像に対応する係数セットを使用することを除いて、図１０９のフローチャートを参照して説明した処理と同様なので、その説明は省略する。
【０７２９】
このように、図１０４に構成を示す動きボケ除去画像処理部１０８は、背景成分画像に対応する予測画像を生成し、動きボケが除去された前景成分画像に対応する予測画像を個々に生成することができる。
【０７３０】
図１１０は、画像処理装置の機能の他の構成を示すブロック図である。図１１に示す画像処理装置が領域特定と混合比αの算出を順番に行うのに対して、図１１０に示す画像処理装置は、領域特定と混合比αの算出を並行して行う。
【０７３１】
図１１のブロック図に示す機能と同様の部分には同一の番号を付してあり、その説明は省略する。
【０７３２】
入力画像は、オブジェクト抽出部１０１、領域特定部１０３、混合比算出部１５０１、および前景背景分離部１５０２に供給される。
【０７３３】
混合比算出部１５０１は、入力画像を基に、画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比を、入力画像に含まれる画素のそれぞれに対して算出し、算出した画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比を前景背景分離部１５０２に供給する。
【０７３４】
図１１１は、混合比算出部１５０１の構成の一例を示すブロック図である。
【０７３５】
図１１１に示す推定混合比処理部４０１は、図６２に示す推定混合比処理部４０１と同じである。図１１１に示す推定混合比処理部４０２は、図６２に示す推定混合比処理部４０２と同じである。
【０７３６】
推定混合比処理部４０１は、入力画像を基に、カバードバックグラウンド領域のモデルに対応する演算により、画素毎に推定混合比を算出して、算出した推定混合比を出力する。
【０７３７】
推定混合比処理部４０２は、入力画像を基に、アンカバードバックグラウンド領域のモデルに対応する演算により、画素毎に推定混合比を算出して、算出した推定混合比を出力する。
【０７３８】
前景背景分離部１５０２は、混合比算出部１５０１から供給された、画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比、並びに領域特定部１０３から供給された領域情報を基に、入力画像を、背景成分画像、および前景成分画像に分離し、分離された画像を動きボケ除去画像処理部１０８に供給する。
【０７３９】
図１１２は、前景背景分離部１５０２の構成の一例を示すブロック図である。
【０７４０】
図８０に示す動きボケ除去部１０６と同様の部分には同一の番号を付してあり、その説明は省略する。
【０７４１】
選択部１５２１は、領域特定部１０３から供給された領域情報を基に、混合比算出部１５０１から供給された、画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比のいずれか一方を選択して、選択した推定混合比を混合比αとして分離部６０１に供給する。
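The role of the selection unit 1521 can be illustrated with a short sketch. The region labels and the function name are assumptions for this example. The behaviour for pixels outside the mixed region is not stated in this passage, so the sketch returns `None` there instead of guessing a value.

```python
def select_mixture_ratio(region, estimate_if_covered, estimate_if_uncovered):
    """Pick one of the two estimated mixture ratios according to the region information.

    `region` is one of "covered", "uncovered", "foreground", "background"
    (hypothetical labels for the region information).
    """
    if region == "covered":
        return estimate_if_covered
    if region == "uncovered":
        return estimate_if_uncovered
    return None  # assumption: no mixture ratio is selected outside the mixed region


alpha = select_mixture_ratio("covered", 0.3, 0.7)
```

Because both estimates are computed for every pixel before the region is known, region specification and mixture-ratio estimation can run in parallel, and the selection is a cheap per-pixel lookup afterwards.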
【０７４２】
分離部６０１は、選択部１５２１から供給された混合比αおよび領域情報を基に、混合領域に属する画素の画素値から前景の成分および背景の成分を抽出し、アンカバードバックグラウンド領域の背景の成分、アンカバードバックグラウンド領域の前景の成分、カバードバックグラウンド領域の背景の成分、およびカバードバックグラウンド領域の前景の成分に分離する。
【０７４３】
分離部６０１は、図８５に示す構成と同じ構成とすることができる。
【０７４４】
このように、図１１０に構成を示す画像処理装置は、背景成分画像、および前景成分画像毎に、それぞれの性質に対応して処理を実行することができる。
【０７４５】
以上のように、本発明の画像処理装置においては、背景成分画像および前景成分画像に入力画像が分離され、分離された画像に適した処理が実行されるので、例えば、不自然な画像を生成することなく、より解像度の高い画像が生成される。
【０７４６】
図１１３は、画像処理装置の機能の他の構成を示すブロック図である。
【０７４７】
図１１に示す場合と同様の部分には同一の番号を付してあり、その説明は省略する。
【０７４８】
オブジェクト抽出部１０１は、入力画像に含まれる前景のオブジェクトに対応する画像オブジェクトを粗く抽出して、抽出した画像オブジェクトを動き検出部１０２に供給する。
【０７４９】
動き検出部１０２は、例えば、ブロックマッチング法、勾配法、位相相関法、およびペルリカーシブ法などの手法により、粗く抽出された前景のオブジェクトに対応する画像オブジェクトの動きベクトルを算出して、算出した動きベクトルおよび動きベクトルの位置情報を領域特定部１０３に供給する。
【０７５０】
領域特定部１０３は、入力された画像の画素のそれぞれを、前景領域、背景領域、またはアンカバードバックグラウンド領域、若しくはカバードバックグラウンド領域からなる混合領域のいずれかに特定し、領域情報を混合比算出部１０４、および前景背景分離部２００１に供給する。
【０７５１】
混合比算出部１０４は、入力画像、および領域特定部１０３から供給された領域情報を基に、混合領域に含まれる画素に対応する混合比αを算出して、算出した混合比を前景背景分離部２００１に供給する。
【０７５２】
前景背景分離部２００１は、領域特定部１０３から供給された領域情報、および混合比算出部１０４から供給された混合比αを基に、前景のオブジェクトに対応する画像の成分と、背景の成分のみから成る背景成分画像とを分離して、背景領域の画像、アンカバードバックグラウンド領域の背景の成分のみからなる画像（以下、アンカバードバックグラウンド領域の背景成分画像と称する）、アンカバードバックグラウンド領域の前景の成分のみからなる画像（以下、アンカバードバックグラウンド領域の前景成分画像と称する）、カバードバックグラウンド領域の背景の成分のみからなる画像（以下、カバードバックグラウンド領域の背景成分画像と称する）、カバードバックグラウンド領域の前景の成分のみからなる画像（以下、カバードバックグラウンド領域の前景成分画像と称する）、および前景領域の画像を分離画像処理部２００２に供給する。
【０７５３】
分離画像処理部２００２は、前景背景分離部２００１から供給された、背景領域の画像、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、カバードバックグラウンド領域の前景成分画像、および前景領域の画像をそれぞれ処理する。
【０７５４】
例えば、分離画像処理部２００２は、背景領域の画像、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、カバードバックグラウンド領域の前景成分画像、および前景領域の画像毎に、より高解像度の画像を生成するクラス分類適応処理で使用される係数を生成する。
【０７５５】
例えば、分離画像処理部２００２は、背景領域の画像、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、カバードバックグラウンド領域の前景成分画像、および前景領域の画像毎にクラス分類適応処理を適用して、より高解像度の画像を創造する。
【０７５６】
図１１４は、図１１３に構成を示す分離画像処理部２００２の処理を説明する図である。入力画像は、領域特定部１０３により、前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域が特定される。
【０７５７】
領域が特定され、混合比算出部１０４により混合比αが検出された入力画像は、前景背景分離部２００１により、前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域の背景成分画像に分離される。
【０７５８】
分離画像処理部２００２は、分離された前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域の背景成分画像を基に、前景領域の画像に対応する係数セット、背景領域の画像に対応する係数セット、カバードバックグラウンド領域の前景成分画像に対応する係数セット、カバードバックグラウンド領域の背景成分画像に対応する係数セット、アンカバードバックグラウンド領域の前景成分画像に対応する係数セット、およびアンカバードバックグラウンド領域の背景成分画像に対応する係数セットを個々に算出する。
【０７５９】
背景領域に対応する係数セットは、画素値を予測するクラス分類適応処理において、背景領域の画素値の予測に使用される。アンカバードバックグラウンド領域の背景成分画像に対応する係数セットは、画素値を予測するクラス分類適応処理において、アンカバードバックグラウンド領域の背景成分画像に対応する画素値の予測に使用される。アンカバードバックグラウンド領域の前景成分画像に対応する係数セットは、画素値を予測するクラス分類適応処理において、アンカバードバックグラウンド領域の前景成分画像に対応する画素値の予測に使用される。
【０７６０】
カバードバックグラウンド領域の背景成分画像に対応する係数セットは、画素値を予測するクラス分類適応処理において、カバードバックグラウンド領域の背景成分画像に対応する画素値の予測に使用される。カバードバックグラウンド領域の前景成分画像に対応する係数セットは、画素値を予測するクラス分類適応処理において、カバードバックグラウンド領域の前景成分画像に対応する画素値の予測に使用される。
【０７６１】
前景領域に対応する係数セットは、画素値を予測するクラス分類適応処理において、前景領域の画素値の予測に使用される。
【０７６２】
背景領域の画像に対応する予測画像、アンカバードバックグラウンド領域の背景成分画像に対応する予測画像、アンカバードバックグラウンド領域の前景成分画像に対応する予測画像、カバードバックグラウンド領域の背景成分画像に対応する予測画像、カバードバックグラウンド領域の前景成分画像に対応する予測画像、および前景領域の画像に対応する予測画像は、合成され、１つの予測画像とされる。
【０７６３】
図１１５は、前景背景分離部２００１の構成の一例を示すブロック図である。
前景背景分離部２００１に供給された入力画像は、分離部２１０１、スイッチ２１０２、およびスイッチ２１０３に供給される。カバードバックグラウンド領域を示す情報、およびアンカバードバックグラウンド領域を示す、領域特定部１０３から供給された領域情報は、分離部２１０１に供給される。前景領域を示す領域情報は、スイッチ２１０２に供給される。背景領域を示す領域情報は、スイッチ２１０３に供給される。
【０７６４】
混合比算出部１０４から供給された混合比αは、分離部２１０１に供給される。
【０７６５】
分離部２１０１は、カバードバックグラウンド領域を示す領域情報、および混合比αを基に、入力画像のカバードバックグラウンド領域から前景の成分を分離するとともに、背景の成分を分離して、分離された前景の成分より構成されるカバードバックグラウンド領域の前景成分画像、および分離された背景の成分より構成されるカバードバックグラウンド領域の背景成分画像を出力する。
【０７６６】
分離部２１０１は、アンカバードバックグラウンド領域を示す領域情報、および混合比αを基に、入力画像のアンカバードバックグラウンド領域から前景の成分を分離するとともに、背景の成分を分離して、分離された前景の成分より構成されるアンカバードバックグラウンド領域の前景成分画像、および分離された背景の成分より構成されるアンカバードバックグラウンド領域の背景成分画像を出力する。
【０７６７】
スイッチ２１０２は、前景領域を示す領域情報を基に、前景領域に対応する画素が入力されたとき、閉じられ、前景領域の画像を出力する。
【０７６８】
スイッチ２１０３は、背景領域を示す領域情報を基に、背景領域に対応する画素が入力されたとき、閉じられ、背景領域の画像を出力する。
【０７６９】
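FIG. 116 is a block diagram showing an example of the configuration of the separation unit 2101. The image input to the separation unit 2101 is supplied to the frame memory 2121. The region information indicating the covered background region and the uncovered background region, supplied from the region specifying unit 103, and the mixture ratio α, supplied from the mixture ratio calculation unit 104, are input to the separation processing block 2122.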
【０７７０】
フレームメモリ２１２１は、入力された画像をフレーム単位で記憶する。フレームメモリ２１２１は、処理の対象がフレーム#nであるとき、フレーム#nの１つ前のフレームであるフレーム#n-1、フレーム#n、およびフレーム#nの１つ後のフレームであるフレーム#n+1を記憶する。
【０７７１】
フレームメモリ２１２１は、フレーム#n-1、フレーム#n、およびフレーム#n+1の対応する画素を分離処理ブロック２１２２に供給する。
【０７７２】
分離処理ブロック２１２２は、カバードバックグラウンド領域およびアンカバードバックグラウンド領域を示す領域情報、並びに混合比αを基に、フレームメモリ２１２１から供給されたフレーム#n-1、フレーム#n、およびフレーム#n+1の対応する画素の画素値に図８３および図８４を参照して説明した演算を適用して、フレーム#nの混合領域に属する画素から前景の成分および背景の成分を分離する。
【０７７３】
分離処理ブロック２１２２は、アンカバード領域処理部２１３１、およびカバード領域処理部２１３２で構成されている。
【０７７４】
アンカバード領域処理部２１３１の乗算器２１４１は、混合比αを、フレームメモリ２１２１から供給されたフレーム#n+1の画素の画素値に乗じて、スイッチ２１４２に出力する。スイッチ２１４２は、フレームメモリ２１２１から供給されたフレーム#nの画素（フレーム#n+1の画素に対応する）がアンカバードバックグラウンド領域であるとき、閉じられ、乗算器２１４１から供給された混合比αを乗じた画素値を演算器２１４３に供給する。スイッチ２１４２から出力されるフレーム#n+1の画素の画素値に混合比αを乗じた値は、フレーム#nの対応する画素の画素値の背景の成分に等しく、アンカバードバックグラウンド領域の背景成分画像として出力される。
【０７７５】
演算器２１４３は、フレームメモリ２１２１から供給されたフレーム#nの画素の画素値から、スイッチ２１４２から供給された背景の成分を減じて、前景の成分を求める。演算器２１４３は、アンカバードバックグラウンド領域に属する、フレーム#nの画素の前景成分画像を出力する。
【０７７６】
カバード領域処理部２１３２の乗算器２１５１は、混合比αを、フレームメモリ２１２１から供給されたフレーム#n-1の画素の画素値に乗じて、スイッチ２１５２に出力する。スイッチ２１５２は、フレームメモリ２１２１から供給されたフレーム#nの画素（フレーム#n-1の画素に対応する）がカバードバックグラウンド領域であるとき、閉じられ、乗算器２１５１から供給された混合比αを乗じた画素値を演算器２１５３に供給する。スイッチ２１５２から出力されるフレーム#n-1の画素の画素値に混合比αを乗じた値は、フレーム#nの対応する画素の画素値の背景の成分に等しく、カバードバックグラウンド領域の背景成分画像として出力される。
【０７７７】
演算器２１５３は、フレームメモリ２１２１から供給されたフレーム#nの画素の画素値から、スイッチ２１５２から供給された背景の成分を減じて、前景の成分を求める。演算器２１５３は、カバードバックグラウンド領域に属する、フレーム#nの画素の前景成分画像を出力する。
【０７７８】
特徴量である混合比αを利用することにより、画素値に含まれる前景の成分と背景の成分とを完全に分離することが可能になる。
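The arithmetic performed by the uncovered region processing unit 2131 and the covered region processing unit 2132 can be summarized in a few lines. The sketch below follows the description above: the background component is the mixture ratio α times the corresponding pixel of frame #n-1 (covered background region) or frame #n+1 (uncovered background region), and the foreground component is the pixel value of frame #n minus that background component. The function name and region labels are hypothetical.

```python
def separate_mixed_pixel(region, alpha, prev_pixel, cur_pixel, next_pixel):
    """Split a mixed-region pixel of frame #n into (foreground, background) components."""
    if region == "covered":
        reference = prev_pixel   # frame #n-1, as in multiplier 2151
    elif region == "uncovered":
        reference = next_pixel   # frame #n+1, as in multiplier 2141
    else:
        raise ValueError("pixel does not belong to the mixed region")
    background = alpha * reference        # multiplier output
    foreground = cur_pixel - background   # arithmetic unit 2143 / 2153
    return foreground, background


fg_cov, bg_cov = separate_mixed_pixel("covered", 0.25, prev_pixel=80, cur_pixel=100, next_pixel=0)
fg_unc, bg_unc = separate_mixed_pixel("uncovered", 0.5, prev_pixel=0, cur_pixel=90, next_pixel=60)
```

By construction the two components always sum back to the original pixel value of frame #n.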
【０７７９】
図１１７は、空間方向に、より高解像度な画像を生成するクラス分類適応処理において使用される係数セットを生成する分離画像処理部２００２の構成を示すブロック図である。例えば、図１１７に構成を示す分離画像処理部２００２は、入力されたHD画像を基に、SD画像からHD画像を生成するクラス分類適応処理において使用される係数セットを生成する。
【０７８０】
背景領域教師画像フレームメモリ２２０１は、前景背景分離部２００１から供給された、教師画像の背景領域の画像を記憶する。背景領域教師画像フレームメモリ２２０１は、記憶している教師画像の背景領域の画像を加重平均部２２０７−１および学習部２２１４−１に供給する。
【０７８１】
アンカバードバックグラウンド領域背景成分教師画像フレームメモリ２２０２は、前景背景分離部２００１から供給された、教師画像のアンカバードバックグラウンド領域の背景成分画像を記憶する。アンカバードバックグラウンド領域背景成分教師画像フレームメモリ２２０２は、記憶している教師画像のアンカバードバックグラウンド領域の背景成分画像を加重平均部２２０７−２および学習部２２１４−２に供給する。
【０７８２】
アンカバードバックグラウンド領域前景成分教師画像フレームメモリ２２０３は、前景背景分離部２００１から供給された、教師画像のアンカバードバックグラウンド領域の前景成分画像を記憶する。アンカバードバックグラウンド領域前景成分教師画像フレームメモリ２２０３は、記憶している教師画像のアンカバードバックグラウンド領域の前景成分画像を加重平均部２２０７−３および学習部２２１４−３に供給する。
【０７８３】
カバードバックグラウンド領域背景成分教師画像フレームメモリ２２０４は、前景背景分離部２００１から供給された、教師画像のカバードバックグラウンド領域の背景成分画像を記憶する。カバードバックグラウンド領域背景成分教師画像フレームメモリ２２０４は、記憶している教師画像のカバードバックグラウンド領域の背景成分画像を加重平均部２２０７−４および学習部２２１４−４に供給する。
【０７８４】
カバードバックグラウンド領域前景成分教師画像フレームメモリ２２０５は、前景背景分離部２００１から供給された、教師画像のカバードバックグラウンド領域の前景成分画像を記憶する。カバードバックグラウンド領域前景成分教師画像フレームメモリ２２０５は、記憶している教師画像のカバードバックグラウンド領域の前景成分画像を加重平均部２２０７−５および学習部２２１４−５に供給する。
【０７８５】
前景領域教師画像フレームメモリ２２０６は、前景背景分離部２００１から供給された、教師画像の前景領域の画像を記憶する。前景領域教師画像フレームメモリ２２０６は、記憶している教師画像の前景領域の画像を加重平均部２２０７−６および学習部２２１４−６に供給する。
【０７８６】
加重平均部２２０７−１は、背景領域教師画像フレームメモリ２２０１から供給された、例えば、HD画像である教師画像の背景領域の画像を４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像を背景領域生徒画像フレームメモリ２２０８に供給する。
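A student image can be produced from a teacher image roughly as follows. The text specifies only a "one-quarter weighted average"; this sketch assumes that means each SD pixel is the equal-weight mean of a 2x2 block of HD pixels, and that the image has even width and height. `quarter_average` is a hypothetical name.

```python
def quarter_average(hd_image):
    """Generate an SD student image by averaging each 2x2 block of the HD teacher image.

    Assumptions: equal weights of 1/4, even image dimensions, image as a list of rows.
    """
    sd_image = []
    for y in range(0, len(hd_image) - 1, 2):
        row = []
        for x in range(0, len(hd_image[y]) - 1, 2):
            block_sum = (hd_image[y][x] + hd_image[y][x + 1]
                         + hd_image[y + 1][x] + hd_image[y + 1][x + 1])
            row.append(block_sum / 4)
        sd_image.append(row)
    return sd_image


student = quarter_average([[1, 3, 10, 10],
                           [5, 7, 10, 10]])
```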
【０７８７】
背景領域生徒画像フレームメモリ２２０８は、加重平均部２２０７−１から供給された、教師画像の背景領域の画像に対応する、生徒画像を記憶する。背景領域生徒画像フレームメモリ２２０８は、記憶している、教師画像の背景領域の画像に対応する生徒画像を学習部２２１４−１に供給する。
【０７８８】
加重平均部２２０７−２は、アンカバードバックグラウンド領域背景成分教師画像フレームメモリ２２０２から供給された、HD画像である教師画像のアンカバードバックグラウンド領域の背景成分画像を、例えば、４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像をアンカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２０９に供給する。
【０７８９】
アンカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２０９は、加重平均部２２０７−２から供給された、教師画像のアンカバードバックグラウンド領域の背景成分画像に対応する、SD画像である生徒画像を記憶する。アンカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２０９は、記憶している、教師画像のアンカバードバックグラウンド領域の背景成分画像に対応する生徒画像を学習部２２１４−２に供給する。
【０７９０】
加重平均部２２０７−３は、アンカバードバックグラウンド領域前景成分教師画像フレームメモリ２２０３から供給された、HD画像である教師画像のアンカバードバックグラウンド領域の前景成分画像を、例えば、４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像をアンカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１０に供給する。
【０７９１】
アンカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１０は、加重平均部２２０７−３から供給された、教師画像のアンカバードバックグラウンド領域の前景成分画像に対応する、SD画像である生徒画像を記憶する。アンカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１０は、記憶している、教師画像のアンカバードバックグラウンド領域の前景成分画像に対応する生徒画像を学習部２２１４−３に供給する。
【０７９２】
加重平均部２２０７−４は、カバードバックグラウンド領域背景成分教師画像フレームメモリ２２０４から供給された、教師画像のカバードバックグラウンド領域の背景成分画像を、例えば、４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像をカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２１１に供給する。
【０７９３】
カバードバックグラウンド領域背景成分生徒画像フレームメモリ２２１１は、加重平均部２２０７−４から供給された、教師画像のカバードバックグラウンド領域の背景成分画像に対応する、SD画像である生徒画像を記憶する。カバードバックグラウンド領域背景成分生徒画像フレームメモリ２２１１は、記憶している、教師画像のカバードバックグラウンド領域の背景成分画像に対応する生徒画像を学習部２２１４−４に供給する。
【０７９４】
加重平均部２２０７−５は、カバードバックグラウンド領域前景成分教師画像フレームメモリ２２０５から供給された、教師画像のカバードバックグラウンド領域の前景成分画像を、例えば、４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像をカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１２に供給する。
【０７９５】
カバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１２は、加重平均部２２０７−５から供給された、教師画像のカバードバックグラウンド領域の前景成分画像に対応する、SD画像である生徒画像を記憶する。カバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１２は、記憶している、教師画像のカバードバックグラウンド領域の前景成分画像に対応する生徒画像を学習部２２１４−５に供給する。
【０７９６】
加重平均部２２０７−６は、前景領域教師画像フレームメモリ２２０６から供給された、例えば、HD画像である教師画像の前景領域の画像を４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像を前景領域生徒画像フレームメモリ２２１３に供給する。
【０７９７】
前景領域生徒画像フレームメモリ２２１３は、加重平均部２２０７−６から供給された、教師画像の前景領域の画像に対応する、SD画像である生徒画像を記憶する。前景領域生徒画像フレームメモリ２２１３は、記憶している、教師画像の前景領域の画像に対応する生徒画像を学習部２２１４−６に供給する。
【０７９８】
学習部２２１４−１は、背景領域教師画像フレームメモリ２２０１から供給された教師画像の背景領域の画像、および背景領域生徒画像フレームメモリ２２０８から供給された、教師画像の背景領域の画像に対応する生徒画像を基に、背景領域に対応する係数セットを生成し、生成した係数セットを係数セットメモリ２２１５に供給する。
【０７９９】
学習部２２１４−２は、アンカバードバックグラウンド領域背景成分教師画像フレームメモリ２２０２から供給された教師画像のアンカバードバックグラウンド領域の背景成分画像、およびアンカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２０９から供給された、教師画像のアンカバードバックグラウンド領域の背景成分画像に対応する生徒画像を基に、アンカバードバックグラウンド領域の背景成分画像に対応する係数セットを生成し、生成した係数セットを係数セットメモリ２２１５に供給する。
【０８００】
学習部２２１４−３は、アンカバードバックグラウンド領域前景成分教師画像フレームメモリ２２０３から供給された教師画像のアンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１０から供給された、教師画像のアンカバードバックグラウンド領域の前景成分画像に対応する生徒画像を基に、アンカバードバックグラウンド領域の前景成分画像に対応する係数セットを生成し、生成した係数セットを係数セットメモリ２２１５に供給する。
【０８０１】
学習部２２１４−４は、カバードバックグラウンド領域背景成分教師画像フレームメモリ２２０４から供給された教師画像のカバードバックグラウンド領域の背景成分画像、およびカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２１１から供給された、教師画像のカバードバックグラウンド領域の背景成分画像に対応する生徒画像を基に、カバードバックグラウンド領域の背景成分画像に対応する係数セットを生成し、生成した係数セットを係数セットメモリ２２１５に供給する。
【０８０２】
学習部２２１４−５は、カバードバックグラウンド領域前景成分教師画像フレームメモリ２２０５から供給された教師画像のカバードバックグラウンド領域の前景成分画像、およびカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１２から供給された、教師画像のカバードバックグラウンド領域の前景成分画像に対応する生徒画像を基に、カバードバックグラウンド領域の前景成分画像に対応する係数セットを生成し、生成した係数セットを係数セットメモリ２２１５に供給する。
【０８０３】
学習部２２１４−６は、前景領域教師画像フレームメモリ２２０６から供給された教師画像の前景領域の画像、および前景領域生徒画像フレームメモリ２２１３から供給された、教師画像の前景領域の画像に対応する生徒画像を基に、前景領域に対応する係数セットを生成し、生成した係数セットを係数セットメモリ２２１５に供給する。
【０８０４】
係数セットメモリ２２１５は、学習部２２１４−１から供給された背景領域に対応する係数セット、学習部２２１４−２から供給されたアンカバードバックグラウンド領域の背景成分画像に対応する係数セット、学習部２２１４−３から供給されたアンカバードバックグラウンド領域の前景成分画像に対応する係数セット、学習部２２１４−４から供給されたカバードバックグラウンド領域の背景成分画像に対応する係数セット、学習部２２１４−５から供給されたカバードバックグラウンド領域の前景成分画像に対応する係数セット、および学習部２２１４−６から供給された前景領域に対応する係数セットを記憶する。
【０８０５】
なお、学習部２２１４−１乃至２２１４−６は、学習部１００６と同様の構成を有するので、その説明は省略する。
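Since the learning units are described only by reference to the learning unit 1006, the following is a rough, self-contained illustration of learning a coefficient set per class by solving normal equations, in the spirit of the normal-equation approach of the conventional apparatus described at the start of this document. All names and the data layout are assumptions. The solver is plain Gauss-Jordan elimination and is not taken from the source.

```python
def solve_normal_equation(samples):
    """Least-squares weights from (prediction_taps, teacher_pixel) pairs of one class."""
    n = len(samples[0][0])
    # Augmented matrix [sum(x_i * x_j) | sum(x_i * y)] of the normal equation.
    m = [[0.0] * (n + 1) for _ in range(n)]
    for taps, y in samples:
        for i in range(n):
            for j in range(n):
                m[i][j] += taps[i] * taps[j]
            m[i][n] += taps[i] * y
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("not enough independent samples for this class")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def learn_coefficient_sets(training):
    """training: iterable of (class_number, prediction_taps, teacher_pixel)."""
    by_class = {}
    for class_number, taps, teacher_pixel in training:
        by_class.setdefault(class_number, []).append((taps, teacher_pixel))
    return {cls: solve_normal_equation(s) for cls, s in by_class.items()}


# Toy data generated from the weights [0.5, 0.25] for class 0.
learned = learn_coefficient_sets([(0, (1, 0), 0.5), (0, (0, 1), 0.25), (0, (2, 2), 1.5)])
```

In the configuration of FIG. 117, one such learning pass would be run separately for each of the six separated images, yielding six independent groups of coefficient sets.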
【０８０６】
図１１８は、クラス分類適応処理を実行して、空間方向に、より高解像度な画像を生成する分離画像処理部２００２の構成を示すブロック図である。例えば、図１１８に構成を示す分離画像処理部２００２は、SD画像である入力画像を基に、クラス分類適応処理を実行して、HD画像を生成する。
【０８０７】
背景領域フレームメモリ２３０１は、前景背景分離部２００１から供給された、背景領域に属する画素からなる背景領域の画像を記憶する。背景領域フレームメモリ２３０１は、記憶している背景領域の画像をマッピング部２３０７−１に供給する。
【０８０８】
アンカバードバックグラウンド領域背景成分画像フレームメモリ２３０２は、前景背景分離部２００１から供給された、アンカバードバックグラウンド領域の背景成分画像を記憶する。アンカバードバックグラウンド領域背景成分画像フレームメモリ２３０２は、記憶しているアンカバードバックグラウンド領域の背景成分画像をマッピング部２３０７−２に供給する。
【０８０９】
アンカバードバックグラウンド領域前景成分画像フレームメモリ２３０３は、前景背景分離部２００１から供給された、アンカバードバックグラウンド領域の前景成分画像を記憶する。アンカバードバックグラウンド領域前景成分画像フレームメモリ２３０３は、記憶しているアンカバードバックグラウンド領域の前景成分画像をマッピング部２３０７−３に供給する。
【０８１０】
カバードバックグラウンド領域背景成分画像フレームメモリ２３０４は、前景背景分離部２００１から供給された、カバードバックグラウンド領域の背景成分画像を記憶する。カバードバックグラウンド領域背景成分画像フレームメモリ２３０４は、記憶しているカバードバックグラウンド領域の背景成分画像をマッピング部２３０７−４に供給する。
【０８１１】
カバードバックグラウンド領域前景成分画像フレームメモリ２３０５は、前景背景分離部２００１から供給された、カバードバックグラウンド領域の前景成分画像を記憶する。カバードバックグラウンド領域前景成分画像フレームメモリ２３０５は、記憶しているカバードバックグラウンド領域の前景成分画像をマッピング部２３０７−５に供給する。
【０８１２】
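The foreground region frame memory 2306 stores the foreground region image, consisting of pixels belonging to the foreground region, supplied from the foreground/background separation unit 2001. The foreground region frame memory 2306 supplies the stored foreground region image to the mapping unit 2307-6.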
【０８１３】
マッピング部２３０７−１は、係数セットメモリ２３０８に記憶されている、背景領域に対応する係数セットを基に、クラス分類適応処理により、背景領域フレームメモリ２３０１に記憶されている背景領域の画像に対応する予測画像を生成する。マッピング部２３０７−１は、生成した予測画像を合成部２３０９に供給する。
【０８１４】
マッピング部２３０７−２は、係数セットメモリ２３０８に記憶されている、アンカバードバックグラウンド領域の背景成分画像に対応する係数セットを基に、クラス分類適応処理により、アンカバードバックグラウンド領域背景成分画像フレームメモリ２３０２に記憶されている、アンカバードバックグラウンド領域の背景成分画像に対応する予測画像を生成する。マッピング部２３０７−２は、生成した予測画像を合成部２３０９に供給する。
【０８１５】
マッピング部２３０７−３は、係数セットメモリ２３０８に記憶されている、アンカバードバックグラウンド領域の前景成分画像に対応する係数セットを基に、クラス分類適応処理により、アンカバードバックグラウンド領域前景成分画像フレームメモリ２３０３に記憶されている、アンカバードバックグラウンド領域の前景成分画像に対応する予測画像を生成する。マッピング部２３０７−３は、生成した予測画像を合成部２３０９に供給する。
【０８１６】
マッピング部２３０７−４は、係数セットメモリ２３０８に記憶されている、カバードバックグラウンド領域の背景成分画像に対応する係数セットを基に、クラス分類適応処理により、カバードバックグラウンド領域背景成分画像フレームメモリ２３０４に記憶されている、カバードバックグラウンド領域の背景成分画像に対応する予測画像を生成する。マッピング部２３０７−４は、生成した予測画像を合成部２３０９に供給する。
【０８１７】
マッピング部２３０７−５は、係数セットメモリ２３０８に記憶されている、カバードバックグラウンド領域の前景成分画像に対応する係数セットを基に、クラス分類適応処理により、カバードバックグラウンド領域前景成分画像フレームメモリ２３０５に記憶されている、カバードバックグラウンド領域の前景成分画像に対応する予測画像を生成する。マッピング部２３０７−５は、生成した予測画像を合成部２３０９に供給する。
【０８１８】
マッピング部２３０７−６は、係数セットメモリ２３０８に記憶されている、前景領域に対応する係数セットを基に、クラス分類適応処理により、前景領域フレームメモリ２３０６に記憶されている前景領域の画像に対応する予測画像を生成する。マッピング部２３０７−６は、生成した予測画像を合成部２３０９に供給する。
【０８１９】
合成部２３０９は、マッピング部２３０７−１から供給された背景領域の画像に対応する予測画像、マッピング部２３０７−２から供給されたアンカバードバックグラウンド領域の背景成分画像に対応する予測画像、マッピング部２３０７−３から供給されたアンカバードバックグラウンド領域の前景成分画像に対応する予測画像、マッピング部２３０７−４から供給されたカバードバックグラウンド領域の背景成分画像に対応する予測画像、マッピング部２３０７−５から供給されたカバードバックグラウンド領域の前景成分画像に対応する予測画像、およびマッピング部２３０７−６から供給された前景領域の画像に対応する予測画像を合成し、合成された予測画像をフレームメモリ２３１０に供給する。
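One way to picture the work of the combining unit 2309 is shown below. The text says only that the six predicted images are combined into one. This sketch assumes that each predicted image holds values only at its own pixels (`None` elsewhere) and that, within a mixed region, the predicted foreground component and the predicted background component are added. Both points are assumptions for illustration, as is the name `combine_predictions`.

```python
def combine_predictions(predicted_images):
    """Combine per-region predicted images into a single output image.

    Assumption: pixels an image does not cover are None, and overlapping
    component images (mixed regions) are summed.
    """
    height, width = len(predicted_images[0]), len(predicted_images[0][0])
    output = [[0.0] * width for _ in range(height)]
    for image in predicted_images:
        for y in range(height):
            for x in range(width):
                if image[y][x] is not None:
                    output[y][x] += image[y][x]
    return output


combined = combine_predictions([
    [[10, None, None]],   # background region
    [[None, 4, None]],    # covered background region, background component
    [[None, 6, None]],    # covered background region, foreground component
    [[None, None, 20]],   # foreground region
])
```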
【０８２０】
フレームメモリ２３１０は、合成部２３０９から供給された予測画像を記憶すると共に、記憶している画像を出力画像として出力する。
【０８２１】
なお、マッピング部２３０７−１乃至２３０７−６は、マッピング部１１０３と同様の構成を有するので、その説明は省略する。
【０８２２】
図１１９乃至図１２４に示す画像を参照して、図１１８に構成を示す分離画像処理部２００２を有する本発明の画像処理装置の処理の結果の例を説明する。
【０８２３】
例に示す結果を生成する処理において、本発明の画像処理装置のクラス分類適応処理におけるクラスの数の総和は、従来のクラス分類適応処理におけるクラスの数とほぼ同一である。すなわち、従来のクラス分類適応処理におけるクラスの数は、２０４８とし、各領域の画像に対応する、本発明の画像処理装置のクラス分類適応処理におけるクラスの数は、５１２とした。
【０８２４】
また、従来のクラス分類適応処理における予測タップの数、および本発明の画像処理装置の各領域のクラス分類適応処理における予測タップの数は、９個とし、同一とした。
【０８２５】
図１１９乃至図１２１を参照して、カバードバックグラウンド領域における予測の結果を説明する。
【０８２６】
図１１９（Ａ）は、教師画像の混合領域における画像の例を示す図である。図１１９（Ｂ）は、教師画像の混合領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０８２７】
図１２０（Ａ）は、図１１９に示す教師画像に対応する、従来のクラス分類適応処理により生成された、混合領域の画像の例を示す図である。図１２０（Ｂ）は、図１１９に示す教師画像に対応する、従来のクラス分類適応処理により生成された、混合領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０８２８】
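FIG. 121(A) shows an example of a mixed-region image, corresponding to the teacher image shown in FIG. 119, generated by the separated image processing unit 2002 whose configuration is shown in FIG. 118. FIG. 121(B) shows the change in pixel value with respect to position in the spatial direction for the mixed-region image, corresponding to the teacher image shown in FIG. 119, generated by the separated image processing unit 2002 whose configuration is shown in FIG. 118.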
【０８２９】
従来のクラス分類適応処理により生成された、混合領域における画像の画素値は、教師画像に比較して、階段状に変化し、生成された実際の画像においても、段階的に変化していることが、目視により確認できる。
【０８３０】
これに対して、図１１８に構成を示す分離画像処理部２００２により生成された、混合領域における画像の画素値は、従来に比較して、より滑らかに変化し、教師画像により近い変化を示す。分離画像処理部２００２により生成された画像を目視により確認しても、従来に比較して、滑らかな画像であることが確認できる。
【０８３１】
図１１８に構成を示す分離画像処理部２００２により生成された、混合領域における画像は、入力画像を前景領域、混合領域、または背景領域に分割して、生成された画像に比較しても、より滑らかに変化している。
【０８３２】
図１２２乃至図１２４を参照して、画素の位置に対して画素値がほぼ直線的に変化している前景領域における予測の結果を説明する。
【０８３３】
図１２２（Ａ）は、画素値がほぼ直線的に変化している、教師画像の前景領域における画像の例を示す図である。図１２２（Ｂ）は、画素値がほぼ直線的に変化している、教師画像の前景領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０８３４】
図１２３（Ａ）は、従来のクラス分類適応処理により生成された、図１２２の画像に対応する、前景領域の画像の例を示す図である。図１２３（Ｂ）は、従来のクラス分類適応処理により生成された、図１２２の画像に対応する、前景領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０８３５】
図１２４（Ａ）は、図１１８に構成を示す分離画像処理部２００２により生成された、図１２２の画像に対応する、前景領域の画像の例を示す図である。図１２４（Ｂ）は、図１１８に構成を示す分離画像処理部２００２により生成された、図１２２の画像に対応する、前景領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０８３６】
従来のクラス分類適応処理により生成された、前景領域における画像の画素値は、混合領域と同様に、教師画像に比較して、階段状に変化し、実際の画像においても、段階的に変化していることが、目視により確認できる。
【０８３７】
これに対して、図１１８に構成を示す分離画像処理部２００２により生成された、前景領域における画像の画素値は、従来に比較して、より滑らかに変化し、教師画像に極めて近い値となる。分離画像処理部２００２により生成された画像の目視による確認においては、教師画像との違いが認められなかった。
【０８３８】
図１２５は、図１１３に構成を示す画像処理装置の画像の処理を説明するフローチャートである。
【０８３９】
ステップＳ２００１において、領域特定部１０３は、動き検出部１０２から供給された動きベクトルおよびその位置情報、並びに入力画像を基に、入力画像の前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を特定する。
【０８４０】
ステップＳ２００２において、混合比算出部１０４は、領域特定部１０３から供給された領域情報および入力画像を基に、混合比αを算出する。
【０８４１】
ステップＳ２００３において、前景背景分離部２００１は、領域特定部１０３から供給された領域情報、および混合比算出部１０４から供給された混合比αを基に、入力画像を、前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域の背景成分画像に分離する。前景背景分離部２００１の画像の分離の処理の詳細は、後述する。
【０８４２】
ステップＳ２００４において、分離画像処理部２００２は、分離された、前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域の背景成分画像毎に、画像の処理を実行して、処理は終了する。分離画像処理部２００２が実行する画像処理の詳細は、後述する。
【０８４３】
このように、本発明に係る画像処理装置は、入力画像を、前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域の背景成分画像に分離し、分離された、前景領域の画像、背景領域の画像、カバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域の背景成分画像毎に画像処理を実行する。
【０８４４】
次に、図１２６に示すフローチャートを参照して、前景背景分離部２００１による前景と背景との分離の処理を説明する。ステップＳ２１０１において、分離部２１０１のフレームメモリ２１２１は、入力画像を取得し、前景と背景との分離の対象となるフレーム#nを、その前のフレーム#n-1およびその後のフレーム#n+1と共に記憶する。
【０８４５】
ステップＳ２１０２において、分離部２１０１の分離処理ブロック２１２２は、領域特定部１０３から供給された領域情報を取得する。ステップＳ２１０３において、分離部２１０１の分離処理ブロック２１２２は、混合比算出部１０４から供給された混合比αを取得する。
【０８４６】
ステップＳ２１０４において、アンカバード領域処理部２１３１は、領域情報および混合比αを基に、フレームメモリ２１２１から供給された、アンカバードバックグラウンド領域に属する画素の画素値から、背景の成分を抽出し、アンカバードバックグラウンド領域の背景成分画像として出力する。
【０８４７】
ステップＳ２１０５において、アンカバード領域処理部２１３１は、領域情報および混合比αを基に、フレームメモリ２１２１から供給された、アンカバードバックグラウンド領域に属する画素の画素値から、前景の成分を抽出し、アンカバードバックグラウンド領域の前景成分画像として出力する。
【０８４８】
ステップＳ２１０６において、カバード領域処理部２１３２は、領域情報および混合比αを基に、フレームメモリ２１２１から供給された、カバードバックグラウンド領域に属する画素の画素値から、背景の成分を抽出し、カバードバックグラウンド領域の背景成分画像として出力する。
【０８４９】
ステップＳ２１０７において、カバード領域処理部２１３２は、領域情報および混合比αを基に、フレームメモリ２１２１から供給された、カバードバックグラウンド領域に属する画素の画素値から、前景の成分を抽出し、カバードバックグラウンド領域の前景成分画像として出力し、処理は終了する。
【０８５０】
このように、前景背景分離部２００１は、領域情報および混合比αを基に、入力画像から前景の成分と、背景の成分とを分離し、前景の成分のみから成る前景成分画像、および背景の成分のみから成る背景成分画像を出力することができる。
【０８５１】
図１２７のフローチャートを参照して、図１１７に構成を示す分離画像処理部２００２による、クラス分類適応処理による画素値の予測に使用される係数セットを生成する学習の処理を説明する。
【０８５２】
ステップＳ２２０１において、加重平均部２２０７−１乃至２２０７−６は、背景領域の画像、前景領域の画像、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、およびカバードバックグラウンド領域の前景成分画像の生徒画像を生成する。すなわち、加重平均部２２０７−１は、背景領域教師画像フレームメモリ２２０１に記憶されている、教師画像の背景領域の画像を、例えば、４分の１加重平均して、教師画像の背景領域の画像に対応する生徒画像を生成する。
【０８５３】
加重平均部２２０７−２は、アンカバードバックグラウンド領域背景成分教師画像フレームメモリ２２０２に記憶されている、教師画像のアンカバードバックグラウンド領域の背景成分画像を、例えば、４分の１加重平均して、教師画像のアンカバードバックグラウンド領域の背景成分画像に対応する生徒画像を生成する。
【０８５４】
加重平均部２２０７−３は、アンカバードバックグラウンド領域前景成分教師画像フレームメモリ２２０３に記憶されている、教師画像のアンカバードバックグラウンド領域の前景成分画像を、例えば、４分の１加重平均して、教師画像のアンカバードバックグラウンド領域の前景成分画像に対応する生徒画像を生成する。
【０８５５】
加重平均部２２０７−４は、カバードバックグラウンド領域背景成分教師画像フレームメモリ２２０４に記憶されている、教師画像のカバードバックグラウンド領域の背景成分画像を、例えば、４分の１加重平均して、教師画像のカバードバックグラウンド領域の背景成分画像に対応する生徒画像を生成する。
【０８５６】
加重平均部２２０７−５は、カバードバックグラウンド領域前景成分教師画像フレームメモリ２２０５に記憶されている、教師画像のカバードバックグラウンド領域の前景成分画像を、例えば、４分の１加重平均して、教師画像のカバードバックグラウンド領域の前景成分画像に対応する生徒画像を生成する。
【０８５７】
加重平均部２２０７−６は、前景領域教師画像フレームメモリ２２０６に記憶されている、教師画像の前景領域の画像を、例えば、４分の１加重平均して、教師画像の前景領域の画像に対応する生徒画像を生成する。
【０８５８】
ステップＳ２２０２において、学習部２２１４−１は、背景領域教師画像フレームメモリ２２０１に記憶されている教師画像の背景領域の画像、および背景領域生徒画像フレームメモリ２２０８に記憶されている、教師画像の背景領域の画像に対応する生徒画像を基に、背景領域に対応する係数セットを生成する。
【０８５９】
ステップＳ２２０３において、学習部２２１４−２は、アンカバードバックグラウンド領域背景成分教師画像フレームメモリ２２０２に記憶されている、教師画像のアンカバードバックグラウンド領域の背景成分画像、およびアンカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２０９に記憶されている、教師画像のアンカバードバックグラウンド領域の背景成分画像に対応する生徒画像を基に、アンカバードバックグラウンド領域の背景成分画像に対応する係数セットを生成する。
【０８６０】
ステップＳ２２０４において、学習部２２１４−３は、アンカバードバックグラウンド領域前景成分教師画像フレームメモリ２２０３に記憶されている、教師画像のアンカバードバックグラウンド領域の前景成分画像、およびアンカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１０に記憶されている、教師画像のアンカバードバックグラウンド領域の前景成分画像に対応する生徒画像を基に、アンカバードバックグラウンド領域の前景成分画像に対応する係数セットを生成する。
【０８６１】
ステップＳ２２０５において、学習部２２１４−４は、カバードバックグラウンド領域背景成分教師画像フレームメモリ２２０４に記憶されている、教師画像のカバードバックグラウンド領域の背景成分画像、およびカバードバックグラウンド領域背景成分生徒画像フレームメモリ２２１１に記憶されている、教師画像のカバードバックグラウンド領域の背景成分画像に対応する生徒画像を基に、カバードバックグラウンド領域の背景成分画像に対応する係数セットを生成する。
【０８６２】
ステップＳ２２０６において、学習部２２１４−５は、カバードバックグラウンド領域前景成分教師画像フレームメモリ２２０５に記憶されている、教師画像のカバードバックグラウンド領域の前景成分画像、およびカバードバックグラウンド領域前景成分生徒画像フレームメモリ２２１２に記憶されている、教師画像のカバードバックグラウンド領域の前景成分画像に対応する生徒画像を基に、カバードバックグラウンド領域の前景成分画像に対応する係数セットを生成する。
【０８６３】
ステップＳ２２０７において、学習部２２１４−６は、前景領域教師画像フレームメモリ２２０６に記憶されている教師画像の前景領域の画像、および前景領域生徒画像フレームメモリ２２１３に記憶されている、教師画像の前景領域の画像に対応する生徒画像を基に、前景領域に対応する係数セットを生成する。
【０８６４】
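In step S2208, the learning units 2214-1 through 2214-6 each output their result to the coefficient set memory 2215: the coefficient set corresponding to the background region, to the background component image of the uncovered background region, to the foreground component image of the uncovered background region, to the background component image of the covered background region, to the foreground component image of the covered background region, or to the foreground region, respectively. The coefficient set memory 2215 stores the coefficient sets corresponding to the background region, the foreground region, the background component image of the uncovered background region, the foreground component image of the uncovered background region, the background component image of the covered background region, and the foreground component image of the covered background region, and the processing ends.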
【０８６５】
このように、図１１７に構成を示す分離画像処理部２００２は、背景領域の画像に対応する係数セット、アンカバードバックグラウンド領域の背景成分画像に対応する係数セット、アンカバードバックグラウンド領域の前景成分画像に対応する係数セット、カバードバックグラウンド領域の背景成分画像に対応する係数セット、カバードバックグラウンド領域の前景成分画像に対応する係数セット、および前景領域の画像に対応する係数セットを生成することができる。
【０８６６】
ステップＳ２２０２乃至ステップＳ２２０７の処理の詳細は、図１０３のフローチャートを参照して説明した処理と同様なので、その説明は省略する。
【０８６７】
なお、ステップＳ２２０２乃至ステップＳ２２０７の処理を、シリアルに実行しても、パラレルに実行しても良いことは勿論である。
【０８６８】
次に、図１２８のフローチャートを参照して、図１１８に構成を示す分離画像処理部２００２の画像の創造の処理を説明する。
【０８６９】
ステップＳ２３０１において、マッピング部２３０７−１は、係数セットメモリ２３０８に記憶されている、背景領域に対応する係数セットを基に、クラス分類適応処理により、背景領域フレームメモリ２３０１に記憶されている背景領域の画像に対応する画像を予測する。
【０８７０】
ステップＳ２３０２において、マッピング部２３０７−２は、係数セットメモリ２３０８に記憶されている、アンカバードバックグラウンド領域の背景成分画像に対応する係数セットを基に、クラス分類適応処理により、アンカバードバックグラウンド領域背景成分画像フレームメモリ２３０２に記憶されている、アンカバードバックグラウンド領域の背景成分画像に対応する画像を予測する。
【０８７１】
ステップＳ２３０３において、マッピング部２３０７−３は、係数セットメモリ２３０８に記憶されている、アンカバードバックグラウンド領域の前景成分画像に対応する係数セットを基に、クラス分類適応処理により、アンカバードバックグラウンド領域前景成分画像フレームメモリ２３０３に記憶されている、アンカバードバックグラウンド領域の前景成分画像に対応する画像を予測する。
【０８７２】
ステップＳ２３０４において、マッピング部２３０７−４は、係数セットメモリ２３０８に記憶されている、カバードバックグラウンド領域の背景成分画像に対応する係数セットを基に、クラス分類適応処理により、カバードバックグラウンド領域背景成分画像フレームメモリ２３０４に記憶されている、カバードバックグラウンド領域の背景成分画像に対応する画像を予測する。
【０８７３】
ステップＳ２３０５において、マッピング部２３０７−５は、係数セットメモリ２３０８に記憶されている、カバードバックグラウンド領域の前景成分画像に対応する係数セットを基に、クラス分類適応処理により、カバードバックグラウンド領域前景成分画像フレームメモリ２３０５に記憶されている、カバードバックグラウンド領域の前景成分画像に対応する画像を予測する。
【０８７４】
ステップＳ２３０６において、マッピング部２３０７−６は、係数セットメモリ２３０８に記憶されている、前景領域に対応する係数セットを基に、クラス分類適応処理により、前景領域フレームメモリ２３０６に記憶されている前景領域の画像に対応する画像を予測する。
【０８７５】
ステップＳ２３０７において、合成部２３０９は、背景領域の画像に対応する予測画像、アンカバードバックグラウンド領域の背景成分画像に対応する予測画像、アンカバードバックグラウンド領域の前景成分画像に対応する予測画像、カバードバックグラウンド領域の背景成分画像に対応する予測画像、カバードバックグラウンド領域の前景成分画像に対応する予測画像、および前景領域に対応する予測画像を合成する。合成部２３０９は、合成された画像をフレームメモリ２３１０に供給する。フレームメモリ２３１０は、合成部２３０９から供給された画像を記憶する。
【０８７６】
ステップＳ２３０８において、フレームメモリ２３１０は、記憶している、合成された画像を出力し、処理は終了する。
【０８７７】
このように、図１１８に構成を示す分離画像処理部２００２を有する画像処理装置は、分離された、背景領域の画像、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、カバードバックグラウンド領域の前景成分画像、および前景領域の画像毎に、予測画像を生成することができる。
【０８７８】
ステップＳ２３０１乃至ステップＳ２３０６の処理の詳細は、図１０９のフローチャートを参照して説明した処理と同様なので、その説明は省略する。
【０８７９】
なお、ステップＳ２３０１乃至ステップＳ２３０６の処理を、シリアルに実行しても、パラレルに実行しても良いことは勿論である。
【０８８０】
図１２９は、入力画像を分離して、分離された画像毎に処理する画像処理装置の機能の他の構成を示すブロック図である。図１１３に示す画像処理装置が領域特定と混合比αの算出を順番に行うのに対して、図１２９に示す画像処理装置は、領域特定と混合比αの算出を並行して行う。
【０８８１】
図１１３のブロック図に示す機能と同様の部分には同一の番号を付してあり、その説明は省略する。
【０８８２】
入力画像は、オブジェクト抽出部１０１、領域特定部１０３、混合比算出部１５０１、および前景背景分離部２５０１に供給される。
【０８８３】
混合比算出部１５０１は、入力画像を基に、画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比を、入力画像に含まれる画素のそれぞれに対して算出し、算出した画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比を前景背景分離部２５０１に供給する。
【０８８４】
前景背景分離部２５０１は、混合比算出部１５０１から供給された、画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比、並びに領域特定部１０３から供給された領域情報を基に、入力画像を、背景領域の画像、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、カバードバックグラウンド領域の前景成分画像、および前景領域の画像に分離し、分離された画像を分離画像処理部２００２に供給する。
【０８８５】
図１３０は、前景背景分離部２５０１の構成の一例を示すブロック図である。
【０８８６】
図１１５に示す前景背景分離部２００１と同様の部分には同一の番号を付してあり、その説明は省略する。
【０８８７】
選択部２５２１は、領域特定部１０３から供給された領域情報を基に、混合比算出部１５０１から供給された、画素がカバードバックグラウンド領域に属すると仮定した場合における推定混合比、および画素がアンカバードバックグラウンド領域に属すると仮定した場合における推定混合比のいずれか一方を選択して、選択した推定混合比を混合比αとして分離部２１０１に供給する。
【０８８８】
分離部２１０１は、選択部２５２１から供給された混合比αおよび領域情報を基に、混合領域に属する画素の画素値から前景の成分および背景の成分を抽出し、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、およびカバードバックグラウンド領域の前景成分画像に分離する。
【０８８９】
分離部２１０１は、図１１６に示す構成と同じ構成とすることができる。
【０８９０】
このように、図１２９に構成を示す画像処理装置は、背景領域の画像、アンカバードバックグラウンド領域の背景成分画像、アンカバードバックグラウンド領域の前景成分画像、カバードバックグラウンド領域の背景成分画像、カバードバックグラウンド領域の前景成分画像、および前景領域の画像毎に、それぞれの性質に対応して処理を実行することができる。
【０８９１】
図１３１は、画像処理装置の機能のさらに他の構成を示すブロック図である。
【０８９２】
画像処理装置に供給された入力画像は、オブジェクト抽出部１０１、領域特定部１０３、および領域処理部３００１に供給される。
【０８９３】
オブジェクト抽出部１０１は、入力画像に含まれる前景のオブジェクトに対応する画像オブジェクトを粗く抽出して、抽出した画像オブジェクトを動き検出部１０２に供給する。オブジェクト抽出部１０１は、例えば、入力画像に含まれる前景のオブジェクトに対応する画像オブジェクトの輪郭を検出することで、前景のオブジェクトに対応する画像オブジェクトを粗く抽出する。
【０８９４】
オブジェクト抽出部１０１は、入力画像に含まれる背景のオブジェクトに対応する画像オブジェクトを粗く抽出して、抽出した画像オブジェクトを動き検出部１０２に供給する。オブジェクト抽出部１０１は、例えば、入力画像と、抽出された前景のオブジェクトに対応する画像オブジェクトとの差から、背景のオブジェクトに対応する画像オブジェクトを粗く抽出する。
【０８９５】
動き検出部１０２は、例えば、ブロックマッチング法、勾配法、位相相関法、およびペルリカーシブ法などの手法により、粗く抽出された前景のオブジェクトに対応する画像オブジェクトの動きベクトルを算出して、算出した動きベクトルおよび動きベクトルの位置情報を領域特定部１０３に供給する。
【０８９６】
領域特定部１０３は、入力された画像の画素のそれぞれを、前景領域、背景領域、または混合領域のいずれかに特定し、画素毎に前景領域、背景領域、または混合領域のいずれかに属するかを示す領域情報を領域処理部３００１に供給する。
【０８９７】
領域処理部３００１は、領域特定部１０３から供給された領域情報を基に、前景領域、背景領域、または混合領域毎に、入力画像を分割し、分割された入力画像毎に画像処理を実行する。例えば、領域処理部３００１は、前景領域、背景領域、または混合領域毎に、入力画像を分割し、分割された入力画像毎に、より高解像度の画像を生成するクラス分類適応処理で使用される係数を生成する。
【０８９８】
例えば、領域処理部３００１は、前景領域、背景領域、または混合領域毎に、入力画像を分割し、分割された入力画像毎にクラス分類適応処理を適用して、より高解像度の画像を創造する。
【０８９９】
図１３２は、図１３１に構成を示す領域処理部３００１の処理を説明する図である。領域処理部３００１は、背景領域に対応する係数セット、アンカバードバックグラウンド領域に対応する係数セット、前景領域に対応する係数セット、およびカバードバックグラウンド領域に対応する係数セットを個々に算出する。
【０９００】
背景領域に対応する係数セットは、画素値を予測するクラス分類適応処理において、背景領域の画素値の予測に使用される。アンカバードバックグラウンド領域に対応する係数セットは、画素値を予測するクラス分類適応処理において、アンカバードバックグラウンド領域の画素値の予測に使用される。
【０９０１】
カバードバックグラウンド領域に対応する係数セットは、画素値を予測するクラス分類適応処理において、カバードバックグラウンド領域の画素値の予測に使用される。前景領域に対応する係数セットは、画素値を予測するクラス分類適応処理において、前景領域の画素値の予測に使用される。
【０９０２】
背景画像に対応する予測画像、アンカバードバックグラウンド領域に対応する予測画像、カバードバックグラウンド領域に対応する予測画像、および前景画像に対応する予測画像は、合成され、１つの予測画像とされる。
【０９０３】
図１３３は、空間方向に、より高解像度な画像を生成するクラス分類適応処理において使用される係数セットを生成する領域処理部３００１の構成を示すブロック図である。教師画像フレームメモリ３１０１は、例えば、HD画像である入力画像を、フレーム単位で記憶する。教師画像フレームメモリ３１０１は、記憶している入力画像を領域分割部３１０２に供給する。
【０９０４】
領域分割部３１０２は、領域特定部１０３から供給された領域情報を基に、背景領域、前景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域に教師画像を分割する。
【０９０５】
領域分割部３１０２は、分割された教師画像である、教師画像の背景領域に属する画素からなる画像を背景領域教師画像フレームメモリ３１０３に供給し、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像をアンカバードバックグラウンド領域教師画像フレームメモリ３１０４に供給し、教師画像のカバードバックグラウンド領域に属する画素からなる画像をカバードバックグラウンド領域教師画像フレームメモリ３１０５に供給し、教師画像の前景領域に属する画素からなる画像を前景領域教師画像フレームメモリ３１０６に供給する。
【０９０６】
背景領域教師画像フレームメモリ３１０３は、領域分割部３１０２から供給された、教師画像の背景領域に属する画素からなる画像を記憶する。背景領域教師画像フレームメモリ３１０３は、記憶している教師画像の背景領域に属する画素からなる画像を加重平均部３１０７−１および学習部３１１２−１に供給する。
【０９０７】
アンカバードバックグラウンド領域教師画像フレームメモリ３１０４は、領域分割部３１０２から供給された、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像を記憶する。アンカバードバックグラウンド領域教師画像フレームメモリ３１０４は、記憶している教師画像のアンカバードバックグラウンド領域に属する画素からなる画像を加重平均部３１０７−２および学習部３１１２−２に供給する。
【０９０８】
カバードバックグラウンド領域教師画像フレームメモリ３１０５は、領域分割部３１０２から供給された、教師画像のカバードバックグラウンド領域に属する画素からなる画像を記憶する。カバードバックグラウンド領域教師画像フレームメモリ３１０５は、記憶している教師画像のカバードバックグラウンド領域に属する画素からなる画像を加重平均部３１０７−３および学習部３１１２−３に供給する。
【０９０９】
前景領域教師画像フレームメモリ３１０６は、領域分割部３１０２から供給された、教師画像の前景領域に属する画素からなる画像を記憶する。前景領域教師画像フレームメモリ３１０６は、記憶している教師画像の前景領域に属する画素からなる画像を加重平均部３１０７−４および学習部３１１２−４に供給する。
【０９１０】
加重平均部３１０７−１は、背景領域教師画像フレームメモリ３１０３から供給された、例えば、HD画像である教師画像の背景領域に属する画素からなる画像を４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像を背景領域生徒画像フレームメモリ３１０８に供給する。
【０９１１】
背景領域生徒画像フレームメモリ３１０８は、加重平均部３１０７−１から供給された、教師画像の背景領域に属する画素からなる画像に対応する、生徒画像を記憶する。背景領域生徒画像フレームメモリ３１０８は、記憶している、教師画像の背景領域に属する画素からなる画像に対応する生徒画像を学習部３１１２−１に供給する。
【０９１２】
加重平均部３１０７−２は、アンカバードバックグラウンド領域教師画像フレームメモリ３１０４から供給された、HD画像である教師画像のアンカバードバックグラウンド領域に属する画素からなる画像を、例えば、４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像をアンカバードバックグラウンド領域生徒画像フレームメモリ３１０９に供給する。
【０９１３】
アンカバードバックグラウンド領域生徒画像フレームメモリ３１０９は、加重平均部３１０７−２から供給された、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像に対応する、SD画像である生徒画像を記憶する。
アンカバードバックグラウンド領域生徒画像フレームメモリ３１０９は、記憶している、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を学習部３１１２−２に供給する。
【０９１４】
加重平均部３１０７−３は、カバードバックグラウンド領域教師画像フレームメモリ３１０５から供給された、教師画像のカバードバックグラウンド領域に属する画素からなる画像を、例えば、４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像をカバードバックグラウンド領域生徒画像フレームメモリ３１１０に供給する。
【０９１５】
カバードバックグラウンド領域生徒画像フレームメモリ３１１０は、加重平均部３１０７−３から供給された、教師画像のカバードバックグラウンド領域に属する画素からなる画像に対応する、SD画像である生徒画像を記憶する。カバードバックグラウンド領域生徒画像フレームメモリ３１１０は、記憶している、教師画像のカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を学習部３１１２−３に供給する。
【０９１６】
加重平均部３１０７−４は、前景領域教師画像フレームメモリ３１０６から供給された、例えば、HD画像である教師画像の前景領域に属する画素からなる画像を４分の１加重平均して、生徒画像であるSD画像を生成し、生成したSD画像を前景領域生徒画像フレームメモリ３１１１に供給する。
【０９１７】
前景領域生徒画像フレームメモリ３１１１は、加重平均部３１０７−４から供給された、教師画像の前景領域に属する画素からなる画像に対応する、SD画像である生徒画像を記憶する。前景領域生徒画像フレームメモリ３１１１は、記憶している、教師画像の前景領域に属する画素からなる画像に対応する生徒画像を学習部３１１２−４に供給する。
【０９１８】
学習部３１１２−１は、背景領域教師画像フレームメモリ３１０３から供給された教師画像の背景領域に属する画素からなる画像、および背景領域生徒画像フレームメモリ３１０８から供給された、教師画像の背景領域に属する画素からなる画像に対応する生徒画像を基に、背景領域に対応する係数セットを生成し、生成した係数セットを係数セットメモリ３１１３に供給する。
【０９１９】
学習部３１１２−２は、アンカバードバックグラウンド領域教師画像フレームメモリ３１０４から供給された教師画像のアンカバードバックグラウンド領域に属する画素からなる画像、およびアンカバードバックグラウンド領域生徒画像フレームメモリ３１０９から供給された、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を基に、アンカバードバックグラウンド領域に対応する係数セットを生成し、生成した係数セットを係数セットメモリ３１１３に供給する。
【０９２０】
学習部３１１２−３は、カバードバックグラウンド領域教師画像フレームメモリ３１０５から供給された教師画像のカバードバックグラウンド領域に属する画素からなる画像、およびカバードバックグラウンド領域生徒画像フレームメモリ３１１０から供給された、教師画像のカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を基に、カバードバックグラウンド領域に対応する係数セットを生成し、生成した係数セットを係数セットメモリ３１１３に供給する。
【０９２１】
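The learning unit 3112-4 generates a coefficient set corresponding to the foreground region, based on the image consisting of pixels belonging to the foreground region of the teacher image, supplied from the foreground region teacher image frame memory 3106, and the corresponding student image, supplied from the foreground region student image frame memory 3111. It supplies the generated coefficient set to the coefficient set memory 3113.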
【０９２２】
係数セットメモリ３１１３は、学習部３１１２−１から供給された背景領域に対応する係数セット、学習部３１１２−２から供給されたアンカバードバックグラウンド領域に対応する係数セット、学習部３１１２−３から供給されたカバードバックグラウンド領域に対応する係数セット、および学習部３１１２−４から供給された前景領域に対応する係数セットを記憶する。
【０９２３】
学習部３１１２−１乃至学習部３１１２−４は、学習部１００６と同様の構成を有するので、その説明は省略する。
【０９２４】
図１３４は、クラス分類適応処理を実行して、空間方向に、より高解像度な画像を生成する領域処理部３００１の構成を示すブロック図である。フレームメモリ３２０１は、例えば、SD画像である入力画像を、フレーム単位で記憶する。フレームメモリ３２０１は、記憶している入力画像を領域分割部３２０２に供給する。
【０９２５】
領域分割部３２０２は、領域特定部１０３から供給された領域情報を基に、背景領域、前景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域毎に入力画像を分割する。すなわち、領域分割部３２０２は、分割された入力画像である、背景領域に属する画素からなる画像を背景領域フレームメモリ３２０３に供給し、アンカバードバックグラウンド領域に属する画素からなる画像をアンカバードバックグラウンド領域フレームメモリ３２０４に供給し、カバードバックグラウンド領域に属する画素からなる画像をカバードバックグラウンド領域フレームメモリ３２０５に供給し、前景領域に属する画素からなる画像を前景領域フレームメモリ３２０６に供給する。
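The division performed by the region dividing unit 3202 can be sketched as a per-pixel routing step. The region labels, the use of `None` for pixels outside a region, and the function name are assumptions chosen for this example.

```python
REGIONS = ("background", "uncovered", "covered", "foreground")


def split_by_region(image, region_map):
    """Divide an image into one image per region, using per-pixel region information.

    Pixels that do not belong to a region are left as None in that region's image.
    """
    parts = {name: [[None] * len(row) for row in image] for name in REGIONS}
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            parts[region_map[y][x]][y][x] = value
    return parts


parts = split_by_region(
    [[1, 2], [3, 4]],
    [["background", "covered"], ["uncovered", "foreground"]],
)
```

Each of the four resulting images would then be written to its own frame memory (3203 through 3206) and processed by its own mapping unit.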
【０９２６】
背景領域フレームメモリ３２０３は、領域分割部３２０２から供給された、背景領域に属する画素からなる画像を記憶する。背景領域フレームメモリ３２０３は、記憶している背景領域に属する画素からなる画像をマッピング部３２０７−１に供給する。
【０９２７】
アンカバードバックグラウンド領域フレームメモリ３２０４は、領域分割部３２０２から供給された、アンカバードバックグラウンド領域に属する画素からなる画像を記憶する。アンカバードバックグラウンド領域フレームメモリ３２０４は、記憶しているアンカバードバックグラウンド領域に属する画素からなる画像をマッピング部３２０７−２に供給する。
【０９２８】
カバードバックグラウンド領域フレームメモリ３２０５は、領域分割部３２０２から供給された、カバードバックグラウンド領域に属する画素からなる画像を記憶する。カバードバックグラウンド領域フレームメモリ３２０５は、記憶しているカバードバックグラウンド領域に属する画素からなる画像をマッピング部３２０７−３に供給する。
【０９２９】
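The foreground region frame memory 3206 stores the image consisting of pixels belonging to the foreground region, supplied from the region dividing unit 3202. The foreground region frame memory 3206 supplies the stored image consisting of pixels belonging to the foreground region to the mapping unit 3207-4.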
【０９３０】
マッピング部３２０７−１は、係数セットメモリ３２０８に記憶されている、背景領域に対応する係数セットを基に、クラス分類適応処理により、背景領域フレームメモリ３２０３に記憶されている背景領域に属する画素からなる画像に対応する予測画像を生成する。マッピング部３２０７−１は、生成した予測画像を合成部３２０９に供給する。
【０９３１】
マッピング部３２０７−２は、係数セットメモリ３２０８に記憶されている、アンカバードバックグラウンド領域に対応する係数セットを基に、クラス分類適応処理により、アンカバードバックグラウンド領域フレームメモリ３２０４に記憶されている、アンカバードバックグラウンド領域に属する画素からなる画像に対応する予測画像を生成する。マッピング部３２０７−２は、生成した予測画像を合成部３２０９に供給する。
【０９３２】
マッピング部３２０７−３は、係数セットメモリ３２０８に記憶されている、カバードバックグラウンド領域に対応する係数セットを基に、クラス分類適応処理により、カバードバックグラウンド領域フレームメモリ３２０５に記憶されている、カバードバックグラウンド領域に属する画素からなる画像に対応する予測画像を生成する。マッピング部３２０７−３は、生成した予測画像を合成部３２０９に供給する。
【０９３３】
マッピング部３２０７−４は、係数セットメモリ３２０８に記憶されている、前景領域に対応する係数セットを基に、クラス分類適応処理により、前景領域フレームメモリ３２０６に記憶されている前景領域に属する画素からなる画像に対応する予測画像を生成する。マッピング部３２０７−４は、生成した予測画像を合成部３２０９に供給する。
【０９３４】
合成部３２０９は、マッピング部３２０７−１から供給された背景領域に属する画素からなる画像に対応する予測画像、マッピング部３２０７−２から供給されたアンカバードバックグラウンド領域に属する画素からなる画像に対応する予測画像、マッピング部３２０７−３から供給されたカバードバックグラウンド領域に属する画素からなる画像に対応する予測画像、およびマッピング部３２０７−４から供給された前景領域に属する画素からなる画像に対応する予測画像を合成し、合成された予測画像をフレームメモリ３２１０に供給する。
【０９３５】
フレームメモリ３２１０は、合成部３２０９から供給された予測画像を記憶すると共に、記憶している画像を出力画像として出力する。
【０９３６】
マッピング部３２０７−１乃至３２０７−４は、マッピング部１１０３と同様の構成を有するので、その説明は省略する。
【０９３７】
図１３５乃至図１４０に示す画像を参照して、図１３４に構成を示す領域処理部３００１を有する本発明の画像処理装置の処理の結果の例を説明する。
【０９３８】
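In the processing that produced the results shown in the examples, the total number of classes in the classification adaptive processing of the image processing apparatus of the present invention is the same as the number of classes in the conventional classification adaptive processing. That is, the number of classes in the conventional classification adaptive processing was set to 2048, and the number of classes in the classification adaptive processing for each of the four regions in the image processing apparatus of the present invention was set to 512 (4 × 512 = 2048).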
【０９３９】
また、従来のクラス分類適応処理における予測タップの数、および本発明の画像処理装置の各領域のクラス分類適応処理における予測タップの数は、９個とし、同一とした。
【０９４０】
図１３５乃至図１３７を参照して、カバードバックグラウンド領域における予測の結果を説明する。
【０９４１】
図１３５（Ａ）は、教師画像の混合領域における画像の例を示す図である。図１３５（Ｂ）は、教師画像の混合領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０９４２】
図１３６（Ａ）は、図１３５に示す教師画像に対応する、従来のクラス分類適応処理により生成された、混合領域の画像の例を示す図である。図１３６（Ｂ）は、図１３５に示す教師画像に対応する、従来のクラス分類適応処理により生成された、混合領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０９４３】
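FIG. 137(A) shows an example of a mixed-region image, corresponding to the teacher image shown in FIG. 135, generated by the region processing unit 3001 whose configuration is shown in FIG. 134. FIG. 137(B) shows the change in pixel value with respect to position in the spatial direction for the mixed-region image, corresponding to the teacher image shown in FIG. 135, generated by the region processing unit 3001 whose configuration is shown in FIG. 134.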
【０９４４】
従来のクラス分類適応処理により生成された、混合領域における画像の画素値は、教師画像に比較して、階段状に変化し、生成された実際の画像においても、段階的に変化していることが、目視により確認できる。
【０９４５】
これに対して、図１３４に構成を示す領域処理部３００１により生成された、混合領域における画像の画素値は、従来に比較して、より滑らかに変化し、教師画像により近い変化を示す。領域処理部３００１により生成された画像を目視により確認しても、従来に比較して、滑らかな画像であることが確認できる。
【０９４６】
図１３８乃至図１４０を参照して、画素の位置に対して画素値がほぼ直線的に変化している前景領域における予測の結果を説明する。
【０９４７】
図１３８（Ａ）は、画素値がほぼ直線的に変化している、教師画像の前景領域における画像の例を示す図である。図１３８（Ｂ）は、画素値がほぼ直線的に変化している、教師画像の前景領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０９４８】
図１３９（Ａ）は、従来のクラス分類適応処理により生成された、図１３８の画像に対応する、前景領域の画像の例を示す図である。図１３９（Ｂ）は、従来のクラス分類適応処理により生成された、図１３８の画像に対応する、前景領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０９４９】
図１４０（Ａ）は、図１３４に構成を示す領域処理部３００１により生成された、図１３８の画像に対応する、前景領域の画像の例を示す図である。図１４０（Ｂ）は、図１３４に構成を示す領域処理部３００１により生成された、図１３８の画像に対応する、前景領域における画像の、空間方向の位置に対応する画素値の変化を示す図である。
【０９５０】
従来のクラス分類適応処理により生成された、前景領域における画像の画素値は、混合領域と同様に、教師画像に比較して、階段状に変化し、実際の画像においても、段階的に変化していることが、目視により確認できる。
【０９５１】
これに対して、図１３４に構成を示す領域処理部３００１により生成された、前景領域における画像の画素値は、従来に比較して、より滑らかに変化し、教師画像に極めて近い値となる。領域処理部３００１により生成された画像の目視による確認においては、教師画像との違いが認められなかった。
【０９５２】
また、所定の画像について、従来のクラス分類適応処理により生成された画像の各領域におけるSN比と、本発明に係る画像処理装置のクラス分類適応処理により生成された画像の各領域におけるSN比とを求めて比較した。
【０９５３】
従来のクラス分類適応処理により生成された画像のカバードバックグラウンド領域におけるSN比は、32.1716dBであり、アンカバードバックグラウンド領域におけるSN比は、31.8744dBであり、前景領域におけるSN比は、31.8835dBであり、背景領域におけるSN比は、31.9985dBであった。
【０９５４】
これに対して、本発明に係る画像処理装置により生成された画像のカバードバックグラウンド領域におけるSN比は、32.1799dBであり、アンカバードバックグラウンド領域におけるSN比は、31.8922dBであり、前景領域におけるSN比は、32.0925dBであり、背景領域におけるSN比は、32.0177dBであった。
【０９５５】
このように、本発明に係る画像処理装置により生成された画像のSN比は、いずれの領域においても、従来のクラス分類適応処理により生成された画像のSN比に比較して高い。
【０９５６】
図１４１は、図１３１に構成を示す画像処理装置の画像の処理を説明するフローチャートである。
【０９５７】
ステップＳ３００１において、領域特定部１０３は、動き検出部１０２から供給された動きベクトルおよびその位置情報を基に、入力画像の前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を特定する。
【０９５８】
ステップＳ３００２において、領域処理部３００１は、入力画像を、特定された前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域に分割して、分割された、前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域毎に、画像の処理を実行して、処理は終了する。
【０９５９】
このように、本発明に係る画像処理装置は、入力画像を、前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域に分割し、分割された、前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域毎に画像処理を実行する。
【０９６０】
図１４２のフローチャートを参照して、図１３３に構成を示す領域処理部３００１による、クラス分類適応処理による画素値の予測に使用される係数セットを生成する学習の処理を説明する。
【０９６１】
ステップＳ３１０１において、領域分割部３１０２は、領域特定部１０３から供給された領域情報を基に、教師画像フレームメモリ３１０１に記憶されている教師画像を領域分割する。すなわち、領域分割部３１０２は、領域分割された教師画像である、教師画像の背景領域に属する画素からなる画像を背景領域教師画像フレームメモリ３１０３に供給する。領域分割部３１０２は、領域分割された教師画像である、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像をアンカバードバックグラウンド領域教師画像フレームメモリ３１０４に供給する。
【０９６２】
領域分割部３１０２は、領域分割された教師画像である、教師画像のカバードバックグラウンド領域に属する画素からなる画像をカバードバックグラウンド領域教師画像フレームメモリ３１０５に供給する。領域分割部３１０２は、領域分割された教師画像である、教師画像の前景領域に属する画素からなる画像を前景領域教師画像フレームメモリ３１０６に供給する。
【０９６３】
ステップＳ３１０２において、加重平均部３１０７−１乃至３１０７−４は、背景領域、前景領域、アンカバードバックグラウンド領域、およびカバードバックグラウンド領域の生徒画像を生成する。すなわち、加重平均部３１０７−１は、背景領域教師画像フレームメモリ３１０３に記憶されている、教師画像の背景領域に属する画素からなる画像を、例えば、４分の１加重平均して、教師画像の背景領域に属する画素からなる画像に対応する生徒画像を生成する。加重平均部３１０７−２は、アンカバードバックグラウンド領域教師画像フレームメモリ３１０４に記憶されている、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像を、例えば、４分の１加重平均して、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を生成する。
【０９６４】
加重平均部３１０７−３は、カバードバックグラウンド領域教師画像フレームメモリ３１０５に記憶されている、教師画像のカバードバックグラウンド領域に属する画素からなる画像を、例えば、４分の１加重平均して、教師画像のカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を生成する。加重平均部３１０７−４は、前景領域教師画像フレームメモリ３１０６に記憶されている、教師画像の前景領域に属する画素からなる画像を、例えば、４分の１加重平均して、教師画像の前景領域に属する画素からなる画像に対応する生徒画像を生成する。
【０９６５】
ステップＳ３１０３において、学習部３１１２−１は、背景領域教師画像フレームメモリ３１０３に記憶されている教師画像の背景領域に属する画素からなる画像、および背景領域生徒画像フレームメモリ３１０８に記憶されている、教師画像の背景領域に属する画素からなる画像に対応する生徒画像を基に、背景領域に対応する係数セットを生成する。
【０９６６】
ステップＳ３１０４において、学習部３１１２−２は、アンカバードバックグラウンド領域教師画像フレームメモリ３１０４に記憶されている、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像、およびアンカバードバックグラウンド領域生徒画像フレームメモリ３１０９に記憶されている、教師画像のアンカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を基に、アンカバードバックグラウンド領域に対応する係数セットを生成する。
【０９６７】
ステップＳ３１０５において、学習部３１１２−３は、カバードバックグラウンド領域教師画像フレームメモリ３１０５に記憶されている、教師画像のカバードバックグラウンド領域に属する画素からなる画像、およびカバードバックグラウンド領域生徒画像フレームメモリ３１１０に記憶されている、教師画像のカバードバックグラウンド領域に属する画素からなる画像に対応する生徒画像を基に、カバードバックグラウンド領域に対応する係数セットを生成する。
【０９６８】
ステップＳ３１０６において、学習部３１１２−４は、前景領域教師画像フレームメモリ３１０６に記憶されている教師画像の前景領域に属する画素からなる画像、および前景領域生徒画像フレームメモリ３１１１に記憶されている、教師画像の前景領域に属する画素からなる画像に対応する生徒画像を基に、前景領域に対応する係数セットを生成する。
【０９６９】
ステップＳ３１０７において、学習部３１１２−１乃至３１１２−４は、それぞれ、背景領域に対応する係数セット、アンカバードバックグラウンド領域に対応する係数セット、カバードバックグラウンド領域に対応する係数セット、または前景領域に対応する係数セットを係数セットメモリ３１１３に供給する。係数セットメモリ３１１３は、背景領域、前景領域、アンカバードバックグラウンド領域、またはカバードバックグラウンド領域のそれぞれに対応する係数セットを記憶して、処理は終了する。
【０９７０】
このように、図１３３に構成を示す領域処理部３００１は、背景領域に対応する係数セット、アンカバードバックグラウンド領域に対応する係数セット、カバードバックグラウンド領域に対応する係数セット、および前景領域に対応する係数セットを生成することができる。
【０９７１】
ステップＳ３１０３乃至ステップＳ３１０６の処理の詳細は、図１０３のフローチャートを参照して説明した処理と同様なので、その説明は省略する。
【０９７２】
なお、ステップＳ３１０３乃至ステップＳ３１０６の処理を、シリアルに実行しても、パラレルに実行しても良いことは勿論である。
【０９７３】
次に、図１４３のフローチャートを参照して、図１３４に構成を示す領域処理部３００１の画像の創造の処理を説明する。
【０９７４】
ステップＳ３２０１において、領域分割部３２０２は、領域特定部１０３から供給された領域情報を基に、背景領域、前景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域に入力画像を分割する。すなわち、領域分割部３２０２は、分割された入力画像である、背景領域に属する画素からなる画像を背景領域フレームメモリ３２０３に供給し、アンカバードバックグラウンド領域に属する画素からなる画像をアンカバードバックグラウンド領域フレームメモリ３２０４に供給し、カバードバックグラウンド領域に属する画素からなる画像をカバードバックグラウンド領域フレームメモリ３２０５に供給し、前景領域に属する画素からなる画像を前景領域フレームメモリ３２０６に供給する。
【０９７５】
ステップＳ３２０２において、マッピング部３２０７−１は、係数セットメモリ３２０８に記憶されている、背景領域に対応する係数セットを基に、クラス分類適応処理により、背景領域フレームメモリ３２０３に記憶されている背景領域に属する画素からなる画像に対応する画像を予測する。
【０９７６】
ステップＳ３２０３において、マッピング部３２０７−２は、係数セットメモリ３２０８に記憶されている、アンカバードバックグラウンド領域に対応する係数セットを基に、クラス分類適応処理により、アンカバードバックグラウンド領域フレームメモリ３２０４に記憶されている、アンカバードバックグラウンド領域に属する画素からなる画像に対応する画像を予測する。
【０９７７】
ステップＳ３２０４において、マッピング部３２０７−３は、係数セットメモリ３２０８に記憶されている、カバードバックグラウンド領域に対応する係数セットを基に、クラス分類適応処理により、カバードバックグラウンド領域フレームメモリ３２０５に記憶されている、カバードバックグラウンド領域に属する画素からなる画像に対応する画像を予測する。
【０９７８】
ステップＳ３２０５において、マッピング部３２０７−４は、係数セットメモリ３２０８に記憶されている、前景領域に対応する係数セットを基に、クラス分類適応処理により、前景領域フレームメモリ３２０６に記憶されている前景領域に属する画素からなる画像に対応する画像を予測する。
【０９７９】
ステップＳ３２０６において、合成部３２０９は、背景領域に属する画素からなる画像に対応する予測画像、アンカバードバックグラウンド領域に属する画素からなる画像に対応する予測画像、カバードバックグラウンド領域に属する画素からなる画像に対応する予測画像、および前景領域に対応する予測画像を合成する。合成部３２０９は、合成された画像をフレームメモリ３２１０に供給する。
フレームメモリ３２１０は、合成部３２０９から供給された画像を記憶する。
【０９８０】
ステップＳ３２０７において、フレームメモリ３２１０は、記憶している、合成された画像を出力し、処理は終了する。
【０９８１】
このように、図１３４に構成を示す領域処理部３００１を有する画像処理装置は、背景領域、アンカバードバックグラウンド領域、カバードバックグラウンド領域、および前景領域毎に、入力画像を分割し、分割された画像毎に予測画像を生成することができる。
【０９８２】
ステップＳ３２０２乃至ステップＳ３２０５の処理の詳細は、図１０９のフローチャートを参照して説明した処理と同様なので、その説明は省略する。
【０９８３】
なお、ステップＳ３２０２乃至ステップＳ３２０５の処理を、シリアルに実行しても、パラレルに実行しても良いことは勿論である。
【０９８４】
また、動きボケ除去画像処理部１０８、分離画像処理部２００２、および領域処理部３００１が実行する処理は、SD画像とHD画像とに対応する係数の生成、またはSD画像からHD画像を生成する処理に限らず、例えば、空間方向により解像度の高い画像を生成するための係数を生成し、空間方向により解像度の高い画像を生成するようにしてもよい。さらに、動きボケ除去画像処理部１０８、分離画像処理部２００２、および領域処理部３００１は、時間方向に、より解像度の高い画像を生成する処理を実行するようにしてもよい。
【０９８５】
なお、動きボケ除去画像処理部１０８、分離画像処理部２００２、および領域処理部３００１は、所定の情報から係数を生成して、生成された係数を基に、クラス分類適応処理を実行するようにしてもよい。
【０９８６】
また、動きボケ除去画像処理部１０８、分離画像処理部２００２、および領域処理部３００１は、クラス分類の処理に基づいて、例えば、所望の大きさへの画像のサイズの変換、RGBなどの色信号の抽出、ノイズの除去、画像の圧縮、または符号化など他の処理を実行するようにしてもよい。例えば、動きボケ除去画像処理部１０８、分離画像処理部２００２、および領域処理部３００１に、分類されたクラスおよびそれぞれの画像に対応する動きベクトルを基に、動きベクトルに沿った方向の圧縮率を低く、動きベクトルに直交する方向の圧縮率を高くして、各領域毎の画像を圧縮させるようにすれば、従来に比較して、画像の劣化が少ないまま、圧縮比を高くすることができる。
【０９８７】
なお、前景となるオブジェクトの動きの方向は左から右として説明したが、その方向に限定されないことは勿論である。
【０９８８】
以上においては、３次元空間と時間軸情報を有する現実空間の画像をビデオカメラを用いて２次元空間と時間軸情報を有する時空間への射影を行った場合を例としたが、本発明は、この例に限らず、より多くの第１の次元の第１の情報を、より少ない第２の次元の第２の情報に射影した場合に適用することが可能である。
【０９８９】
なお、センサは、CCDに限らず、固体撮像素子である、例えば、BBD（Bucket Brigade Device）、CID（Charge Injection Device）、またはCPD（Charge Priming Device）などのセンサでもよく、また、検出素子がマトリックス状に配置されているセンサに限らず、検出素子が１列に並んでいるセンサでもよい。
【０９９０】
本発明の信号処理を行うプログラムを記録した記録媒体は、図１０に示すように、コンピュータとは別に、ユーザにプログラムを提供するために配布される、プログラムが記録されている磁気ディスク９１（フロッピ（登録商標）ディスクを含む）、光ディスク９２（CD-ROM(Compact Disc-Read Only Memory),DVD(Digital Versatile Disc)を含む）、光磁気ディスク９３（ＭＤ（Mini-Disc）（商標）を含む）、もしくは半導体メモリ９４などよりなるパッケージメディアにより構成されるだけでなく、コンピュータに予め組み込まれた状態でユーザに提供される、プログラムが記録されているROM７２や、記憶部７８に含まれるハードディスクなどで構成される。
【０９９１】
なお、本明細書において、記録媒体に記録されるプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。
【０９９２】
【発明の効果】
本発明の画像処理装置および方法、記録媒体、並びにプログラムによれば、入力画像データに基づいて、前景オブジェクトを構成する前景オブジェクト成分、および背景オブジェクトを構成する背景オブジェクト成分が混合されてなる混合領域と、前景オブジェクト成分からなる前景領域、および背景オブジェクトを構成する背景オブジェクト成分からなる背景領域の一方により構成される非混合領域とが特定され、特定結果に対応する領域特定情報が出力され、領域特定情報に対応して、入力画像データの各画素データに対応するクラスが決定されるようにしたので、背景の画像と移動する物体の画像との混ざり合いに対応して画像を処理することができるようになる。
【図面の簡単な説明】
【図１】従来の画像処理装置の構成を示すブロック図である。
【図２】クラスタップを説明する図である。
【図３】予測タップを説明する図である。
【図４】クラス分類適応処理の概略を説明する図である。
【図５】従来の係数セットを説明する図である。
【図６】従来の学習の処理を説明するフローチャートである。
【図７】従来の画像処理装置の構成を示すブロック図である。
【図８】入力画像の画素値、およびクラス分類適応処理により生成された出力画像の画素値を示す図である。
【図９】従来の画像の創造の処理を説明するフローチャートである。
【図１０】本発明に係る画像処理装置の一実施の形態の構成を示すブロック図である。
【図１１】画像処理装置の機能の構成を示すブロック図である。
【図１２】センサによる撮像を説明する図である。
【図１３】画素の配置を説明する図である。
【図１４】検出素子の動作を説明する図である。
【図１５】動いている前景に対応するオブジェクトと、静止している背景に対応するオブジェクトとを撮像して得られる画像を説明する図である。
【図１６】背景領域、前景領域、混合領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を説明する図である。
【図１７】静止している前景に対応するオブジェクトおよび静止している背景に対応するオブジェクトを撮像した画像における、隣接して１列に並んでいる画素の画素値を時間方向に展開したモデル図である。
【図１８】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図１９】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２０】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２１】前景領域、背景領域、および混合領域の画素を抽出した例を示す図である。
【図２２】画素と画素値を時間方向に展開したモデルとの対応を示す図である。
【図２３】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２４】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２５】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２６】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２７】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２８】分割された画像と、画素の画素値を時間方向に展開したモデル図との対応を示す図である。
【図２９】分離された画像と、画素の画素値を時間方向に展開したモデル図との対応を示す図である。
【図３０】分割された画像の例を示す図である。
【図３１】分離された画像の例を示す図である。
【図３２】動きボケが除去された画像と、画素の画素値を時間方向に展開したモデル図との対応を示す図である。
【図３３】本発明に係る画像処理装置の処理を説明する図である。
【図３４】本発明に係る画像処理装置の画像の処理を説明するフローチャートである。
【図３５】領域特定部１０３の構成の一例を示すブロック図である。
【図３６】前景に対応するオブジェクトが移動しているときの画像を説明する図である。
【図３７】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図３８】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図３９】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図４０】領域判定の条件を説明する図である。
【図４１】領域特定部１０３の領域の特定の結果の例を示す図である。
【図４２】領域特定部１０３の領域の特定の結果の例を示す図である。
【図４３】領域特定の処理を説明するフローチャートである。
【図４４】領域特定部１０３の構成の他の一例を示すブロック図である。
【図４５】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図４６】背景画像の例を示す図である。
【図４７】２値オブジェクト画像抽出部３０２の構成を示すブロック図である。
【図４８】相関値の算出を説明する図である。
【図４９】相関値の算出を説明する図である。
【図５０】２値オブジェクト画像の例を示す図である。
【図５１】時間変化検出部３０３の構成を示すブロック図である。
【図５２】領域判定部３４２の判定を説明する図である。
【図５３】時間変化検出部３０３の判定の例を示す図である。
【図５４】領域特定部１０３の領域特定の処理を説明するフローチャートである。
【図５５】領域判定の処理の詳細を説明するフローチャートである。
【図５６】領域特定部１０３のさらに他の構成を示すブロック図である。
【図５７】ロバスト化部３６１の構成を説明するブロック図である。
【図５８】動き補償部３８１の動き補償を説明する図である。
【図５９】動き補償部３８１の動き補償を説明する図である。
【図６０】領域特定の処理を説明するフローチャートである。
【図６１】ロバスト化の処理の詳細を説明するフローチャートである。
【図６２】混合比算出部１０４の構成の一例を示すブロック図である。
【図６３】理想的な混合比αの例を示す図である。
【図６４】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図６５】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図６６】前景の成分の相関を利用した近似を説明する図である。
【図６７】C，N、およびPの関係を説明する図である。
【図６８】推定混合比処理部４０１の構成を示すブロック図である。
【図６９】推定混合比の例を示す図である。
【図７０】混合比算出部１０４の他の構成を示すブロック図である。
【図７１】混合比の算出の処理を説明するフローチャートである。
【図７２】推定混合比の演算の処理を説明するフローチャートである。
【図７３】混合比αを近似する直線を説明する図である。
【図７４】混合比αを近似する平面を説明する図である。
【図７５】混合比αを算出するときの複数のフレームの画素の対応を説明する図である。
【図７６】推定混合比処理部４０１の他の構成を示すブロック図である。
【図７７】推定混合比の例を示す図である。
【図７８】混合比の算出の処理を説明するフローチャートである。
【図７９】カバードバックグラウンド領域に対応するモデルによる混合比推定の処理を説明するフローチャートである。
【図８０】前景背景分離部１０５の構成の一例を示すブロック図である。
【図８１】入力画像、前景成分画像、および背景成分画像を示す図である。
【図８２】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図８３】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図８４】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図８５】分離部６０１の構成の一例を示すブロック図である。
【図８６】分離された前景成分画像、および背景成分画像の例を示す図である。
【図８７】前景と背景との分離の処理を説明するフローチャートである。
【図８８】動きボケ除去部１０６の構成の一例を示すブロック図である。
【図８９】処理単位を説明する図である。
【図９０】前景成分画像の画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図９１】前景成分画像の画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図９２】前景成分画像の画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図９３】動きボケ除去部１０６による前景成分画像に含まれる動きボケの除去の処理を説明するフローチャートである。
【図９４】背景成分画像のモデルを示す図である。
【図９５】補正された背景成分画像のモデルを示す図である。
【図９６】係数セットを生成する動きボケ除去画像処理部１０８の構成を示すブロック図である。
【図９７】教師画像と生徒画像との関係を説明する図である。
【図９８】学習部１００６の構成を示すブロック図である。
【図９９】クラス分類処理を説明する図である。
【図１００】ＡＤＲＣ処理を説明する図である。
【図１０１】動きボケ除去画像処理部１０８が生成する係数セットを説明する図である。
【図１０２】動きボケ除去画像処理部１０８による、係数セットを生成する学習の処理を説明するフローチャートである。
【図１０３】背景成分画像に対応する係数セットの生成の処理を説明するフローチャートである。
【図１０４】クラス分類適応処理を実行して、空間方向に、より高解像度な画像を生成する動きボケ除去画像処理部１０８の構成を示すブロック図である。
【図１０５】動きボケが除去された前景成分画像のモデルを示す図である。
【図１０６】動きボケが付加された前景成分画像のモデルを示す図である。
【図１０７】マッピング部１１０３の構成を示すブロック図である。
【図１０８】動きボケ除去画像処理部１０８の画像の創造の処理を説明するフローチャートである。
【図１０９】背景成分画像に対応する画像の予測の処理を説明するフローチャートである。
【図１１０】画像処理装置の機能の他の構成を示すブロック図である。
【図１１１】混合比算出部１５０１の構成の一例を示すブロック図である。
【図１１２】前景背景分離部１５０２の構成の一例を示すブロック図である。
【図１１３】画像処理装置の機能の他の構成を示すブロック図である。
【図１１４】分離画像処理部２００２の処理を説明する図である。
【図１１５】前景背景分離部２００１の構成の一例を示すブロック図である。
【図１１６】分離部２１０１の構成の一例を示すブロック図である。
【図１１７】係数セットを生成する分離画像処理部２００２の構成を示すブロック図である。
【図１１８】空間方向に、より高解像度な画像を生成する分離画像処理部２００２の構成を示すブロック図である。
【図１１９】教師画像の混合領域における画像の例を示す図である。
【図１２０】従来のクラス分類適応処理により生成された、混合領域の画像の例を示す図である。
【図１２１】分離画像処理部２００２により生成された、混合領域の画像の例を示す図である。
【図１２２】教師画像の前景領域における画像の例を示す図である。
【図１２３】従来のクラス分類適応処理により生成された、前景領域の画像の例を示す図である。
【図１２４】分離画像処理部２００２により生成された、前景領域の画像の例を示す図である。
【図１２５】図１１３に構成を示す画像処理装置の画像の処理を説明するフローチャートである。
【図１２６】前景背景分離部２００１による前景と背景との分離の処理を説明するフローチャートである。
【図１２７】分離画像処理部２００２による、係数セットを生成する学習の処理を説明するフローチャートである。
【図１２８】分離画像処理部２００２の画像の創造の処理を説明するフローチャートである。
【図１２９】画像処理装置の機能のさらに他の構成を示すブロック図である。
【図１３０】前景背景分離部２５０１の構成の一例を示すブロック図である。
【図１３１】画像処理装置の機能のさらに他の構成を示すブロック図である。
【図１３２】領域処理部３００１の処理を説明する図である。
【図１３３】係数セットを生成する領域処理部３００１の構成を示すブロック図である。
【図１３４】空間方向に、より高解像度な画像を生成する領域処理部３００１の構成を示すブロック図である。
【図１３５】教師画像の混合領域における画像の例を示す図である。
【図１３６】従来のクラス分類適応処理により生成された、混合領域の画像の例を示す図である。
【図１３７】領域処理部３００１により生成された、混合領域の画像の例を示す図である。
【図１３８】教師画像の前景領域における画像の例を示す図である。
【図１３９】従来のクラス分類適応処理により生成された、前景領域の画像の例を示す図である。
【図１４０】領域処理部３００１により生成された、前景領域の画像の例を示す図である。
【図１４１】本発明に係る画像処理装置の画像の処理を説明するフローチャートである。
【図１４２】領域処理部３００１による、係数セットを生成する学習の処理を説明するフローチャートである。
【図１４３】領域処理部３００１の画像の創造の処理を説明するフローチャートである。
【符号の説明】
７１ CPU，７２ ROM，７３ RAM，７６入力部，７７出力部，７８記憶部，７９通信部，９１磁気ディスク，９２光ディスク，９３光磁気ディスク，９４半導体メモリ，１０１オブジェクト抽出部，１０２動き検出部，１０３領域特定部，１０４混合比算出部，１０５前景背景分離部，１０６動きボケ除去部，１０７補正部，１０８動きボケ除去画像処理部，２０１フレームメモリ，２０２−１乃至２０２−４静動判定部，２０３−１乃至２０３−３領域判定部，２０４判定フラグ格納フレームメモリ，２０５合成部，２０６判定フラグ格納フレームメモリ，３０１背景画像生成部，３０２ ２値オブジェクト画像抽出部，３０３時間変化検出部，３２１相関値演算部，３２２しきい値処理部，３４１フレームメモリ，３４２領域判定部，３６１ロバスト化部，３８１動き補償部，３８２スイッチ，３８３−１乃至３８３−Ｎフレームメモリ，３８４−１乃至３８４−Ｎ重み付け部，３８５積算部，４０１推定混合比処理部，４０２推定混合比処理部，４０３混合比決定部，４２１フレームメモリ，４２２フレームメモリ，４２３混合比演算部，４４１選択部，４４２推定混合比処理部，４４３推定混合比処理部，４４４選択部，５０１遅延回路，５０２足し込み部，５０３演算部，６０１分離部，６０２スイッチ，６０３合成部，６０４スイッチ，６０５合成部，６２１フレームメモリ，６２２分離処理ブロック，６２３フレームメモリ，６３１アンカバード領域処理部，６３２カバード領域処理部，６３３合成部，６３４合成部，８０１処理単位決定部，８０２モデル化部，８０３方程式生成部，８０４足し込み部，８０５演算部，１００１背景成分教師画像フレームメモリ，１００２前景成分教師画像フレームメモリ，１００３−１および１００３−２加重平均部，１００４背景成分生徒画像フレームメモリ，１００５前景成分生徒画像フレームメモリ，１００６−１および１００６−２学習部，１００７係数セットメモリ，１０３１クラス分類部，１０３２予測タップ取得部，１０３３対応画素取得部，１０３４正規方程式生成部，１０３５係数計算部，１０５１クラスタップ取得部，１０５２波形分類部，１１０１背景成分画像フレームメモリ，１１０２前景成分画像フレームメモリ，１１０３−１および１１０３−２マッピング部，１１０４係数セットメモリ，１１０５補正部，１１０６動きボケ付加部，１１０７合成部，１１３１マッピング処理部，１１４１クラス分類部，１１４２予測タップ取得部，１１４３予測演算部，１１５１クラスタップ取得部，１１５２波形分類部，１５０１混合比算出部，１５０２前景背景分離部，１５２１選択部，２００１前景背景分離部，２００２分離画像処理部，２１０１分離部，２１０２スイッチ，２１０３スイッチ，２２０１背景領域教師画像フレームメモリ，２２０２アンカバードバックグラウンド領域背景成分教師画像フレームメモリ，２２０３アンカバードバックグラウンド領域前景成分教師画像フレームメモリ，２２０４カバードバックグラウンド領域背景成分教師画像フレームメモリ，２２０５カバードバックグラウンド領域前景成分教師画像フレームメモリ，２２０６前景領域教師画像フレームメモリ，２２０７−１乃至２２０７−６加重平均部，２２０８背景領域生徒画像フレームメモリ，２２０９アンカバードバックグラウンド領域背景成分生徒画像フレームメモリ，２２１０アンカバードバックグラウンド領域前景成分生徒画像フレームメモリ，２２１１カバードバックグラウンド領域背景成分生徒画像フレームメモリ，２２１２カバードバックグラウンド領域前景成分生徒画像フレームメモリ，２２１３前景領域生徒画像フレームメモリ，２２１４−１乃至２２１４−６学習部，２２１５係数セットメモリ，２３０１背景領域フレームメモリ，２３０２アンカバードバックグラウンド領域背景成分画像フレームメモリ，２３０３アンカバードバックグラウンド領域前景成分画像フレームメモリ，２３０４カバードバックグラウンド領域背景成分画像フレームメモリ，２３０５カバードバックグラウンド領域前景成分画像フレームメモリ，２３０６前景領域フレームメモリ，２３０７−１乃至２３０７−６マッピング部，２３０８係数セットメモリ，２３０９合成部，２５０１前景背景分離部，２５２１選択部，３００１領域処理部，３１０２領域分割部，３１０３背景領域教師画像フレームメモリ，３１０４アンカバードバックグラウンド領域教師画像フレームメモリ，３１０５カバードバックグラウンド領域教師画像フレームメモリ，３１０６前景領域教師画像フレームメモリ，３１０７−１乃至３１０７−４加重平均部，３１０８背景領域生徒画像フレームメモリ，３１０９アンカバードバックグラウンド領域生徒画像フレームメモリ，３１１０カバードバックグラウンド領域生徒画像フレームメモリ，３１１１前景領域生徒画像フレームメモリ，３１１２−１乃至３１１２−４学習部，３１１３係数セットメモリ，３２０２領域分割部，３２０３背景領域フレームメモリ，３２０４アンカバードバックグラウンド領域フレームメモリ，３２０５カバードバックグラウンド領域フレームメモリ，３２０６前景領域フレームメモリ，３２０７−１乃至３２０７−４マッピング部，３２０８係数セットメモリ，３２０９合成部
[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an image processing apparatus and method, a recording medium, and a program, and more particularly, to an image processing apparatus and method, a recording medium, and a program that take into consideration a difference between a signal detected by a sensor and the real world.
[0002]
[Prior art]
One of the processes for generating a higher-resolution image from an input image is class classification adaptive processing. In one example of class classification adaptive processing, the coefficients used to generate a higher-resolution image in the spatial direction are generated in advance, and a higher-resolution image is then generated in the spatial direction based on those coefficients.
[0003]
FIG. 1 is a block diagram showing the configuration of a conventional image processing apparatus that generates the coefficients used in class classification adaptive processing for generating HD (High Definition) images from SD (Standard Definition) images.
[0004]
The frame memory 11 stores an input image that is an HD image in units of frames.
The frame memory 11 supplies the stored HD image to the weighted average unit 12 and the corresponding pixel acquisition unit 16.
[0005]
The weighted average unit 12 generates an SD image by taking a quarter weighted average of the HD image stored in the frame memory 11, and supplies the generated SD image to the frame memory 13.
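The quarter weighted average can be pictured with a short Python sketch. It assumes that "quarter weighted average" means replacing each 2x2 block of HD pixels with its equally weighted mean; the text does not state the weights, and the function name `quarter_average` is hypothetical.

```python
def quarter_average(hd):
    """Downsample an HD frame (a list of rows) to SD.

    Assumption: each SD pixel is the equally weighted mean of one
    2x2 block of HD pixels. The patent text does not give the weights.
    """
    height, width = len(hd), len(hd[0])
    if height % 2 or width % 2:
        raise ValueError("frame dimensions must be even")
    sd = []
    for y in range(0, height, 2):
        row = []
        for x in range(0, width, 2):
            total = hd[y][x] + hd[y][x + 1] + hd[y + 1][x] + hd[y + 1][x + 1]
            row.append(total / 4.0)
        sd.append(row)
    return sd
```

Under this assumption the SD frame has half the width and half the height of the HD frame, so one SD pixel stands for four HD pixels.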
[0006]
The frame memory 13 stores the SD image supplied from the weighted average unit 12 in units of frames, and supplies the stored SD image to the class classification unit 14 and the prediction tap acquisition unit 15.
[0007]
The class classification unit 14 includes a class tap acquisition unit 21 and a waveform classification unit 22, and classifies a target pixel that is a pixel of interest in an SD image stored in the frame memory 13. The class tap acquisition unit 21 acquires a predetermined number of class taps that are pixels of the SD image corresponding to the target pixel from the frame memory 13, and supplies the acquired class taps to the waveform classification unit 22.
[0008]
FIG. 2 is a diagram illustrating class taps acquired by the class tap acquisition unit 21. As shown in FIG. 2, the class tap acquisition unit 21 acquires eleven class taps at predetermined positions.
[0009]
The waveform classification unit 22 classifies the target pixel into one of a plurality of classes based on the class tap, and supplies a class number corresponding to the classified class to the prediction tap acquisition unit 15. The waveform classification unit 22 classifies the target pixel into one of 2048 classes based on 11 class taps.
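The text does not say at this point how the waveform classification unit 22 derives the class number. One scheme that yields exactly 2048 classes from 11 taps is 1-bit ADRC (the figure list mentions ADRC processing), sketched below in Python. The function name, the midpoint threshold, and the bit order are assumptions made for illustration.

```python
def adrc_class_number(taps):
    """Map class taps to a class number with 1-bit ADRC (illustrative).

    Each tap is requantized to one bit by comparing it with the midpoint
    of the local dynamic range; the bits are packed into an integer.
    With 11 taps the result lies in the range 0..2047.
    """
    lo, hi = min(taps), max(taps)
    threshold = (lo + hi) / 2.0
    number = 0
    for value in taps:
        # First tap becomes the most significant bit (an assumption).
        number = (number << 1) | (1 if value >= threshold else 0)
    return number
```

Because every tap contributes one bit, the number of classes is 2 to the power of the number of class taps, which is 2048 for 11 taps.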
[0010]
Based on the class number, the prediction tap acquisition unit 15 acquires from the frame memory 13 a predetermined number of prediction taps, which are pixels of the SD image corresponding to the classified class, and supplies the acquired prediction taps and the class number to the corresponding pixel acquisition unit 16.
[0011]
FIG. 3 is a diagram illustrating the prediction tap acquired by the prediction tap acquisition unit 15. As shown in FIG. 3, the prediction tap acquisition unit 15 acquires nine prediction taps at predetermined positions.
[0012]
Based on the prediction taps and the class number, the corresponding pixel acquisition unit 16 acquires from the frame memory 11 the pixel of the HD image corresponding to the pixel value to be predicted, and supplies the prediction taps, the class number, and the acquired HD-image pixel corresponding to the pixel value to be predicted to the normal equation generation unit 17.
[0013]
Based on the prediction taps, the class number, and the acquired pixel value to be predicted, the normal equation generation unit 17 generates, for each class, a normal equation that corresponds to the relationship between the prediction taps and the pixel value to be predicted, and supplies the generated normal equation for each class to the coefficient calculation unit 18.
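For reference, the normal equation for one class can be written out under the usual least-squares reading of linear prediction. The symbols are notation introduced here, not taken from the text: $x_{ki}$ is the $i$-th prediction tap of the $k$-th sample in the class, $y_k$ is the corresponding HD pixel value, $w_i$ are the coefficients, and $n$ is the number of prediction taps.

```latex
% Linear prediction and its squared error over the samples k of one class
\hat{y}_k = \sum_{i=1}^{n} w_i x_{ki}, \qquad
E = \sum_{k} \Bigl( y_k - \sum_{i=1}^{n} w_i x_{ki} \Bigr)^{2}

% Setting \partial E / \partial w_j = 0 for each j gives the normal equation
\sum_{i=1}^{n} \Bigl( \sum_{k} x_{kj} x_{ki} \Bigr) w_i
  = \sum_{k} x_{kj} y_k , \qquad j = 1, \dots, n
```

Solving this $n \times n$ system for each class yields that class's coefficient set.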
[0014]
The coefficient calculation unit 18 solves the normal equation supplied from the normal equation generation unit 17 to calculate a coefficient set corresponding to each class, and supplies the calculated coefficient set to the coefficient set memory 19 together with the class number.
[0015]
The coefficient set memory 19 stores the calculated coefficient set in association with the class based on the class number.
[0016]
FIG. 4 is a diagram for explaining the outline of the class classification adaptation process. In the class classification adaptation process, a corresponding SD image is generated from the teacher image, which is an HD image, by a quarter-weighted average process. The generated SD image is referred to as a student image.
[0017]
Next, a coefficient set for generating an HD image from the SD image is generated based on the teacher image that is an HD image and the student image that is a corresponding SD image. The coefficient set includes coefficients for generating an HD image from an SD image by linear prediction or the like.
[0018]
A quadruple density image is generated from the coefficient set and the SD image generated in this way by linear prediction or the like. The process of generating a higher-density image or the like from the coefficient set and the input image is also referred to as mapping.
[0019]
Based on the generated quadruple density image and the corresponding HD image, SNR comparison or visual qualitative evaluation is performed.
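The text does not define how the SNR is computed. The Python sketch below assumes a peak signal-to-noise ratio with a peak value of 255 (8-bit pixels); both the definition and the name `snr_db` are assumptions for illustration.

```python
import math


def snr_db(reference, predicted, peak=255.0):
    """Peak SNR in dB between two equally sized images (lists of rows).

    Assumption: SNR = 10 * log10(peak^2 / mean squared error).
    """
    errors = [
        (r - p) ** 2
        for ref_row, pred_row in zip(reference, predicted)
        for r, p in zip(ref_row, pred_row)
    ]
    mse = sum(errors) / len(errors)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak * peak / mse)
```

A higher value means the generated quadruple density image is closer to the corresponding HD image.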
[0020]
The coefficient set generated from the specific teacher image and the corresponding student image is referred to as the self coefficient set of the specific teacher image and the corresponding student image. Mapping using a self coefficient set is called self-mapping. A coefficient set generated from a plurality of other teacher images and corresponding student images is referred to as a cross coefficient set.
[0021]
On the other hand, in an image obtained by using a video camera to capture a foreground object moving in front of a predetermined stationary background, motion blur occurs when the object moves relatively fast, and the background and the foreground are mixed.
[0022]
In the conventional class classification adaptive processing, as shown in FIG. 5, a single coefficient set is generated by the learning process described above for all of the foreground, the background, and the portions where the foreground and the background are mixed, and the mapping process is executed based on that coefficient set.
[0023]
A conventional learning process for generating the coefficients used in the process of generating an HD image from an SD image will be described with reference to the flowchart of FIG. 6. In step S11, the image processing apparatus determines whether there are unprocessed pixels in the student image. If it is determined that there are unprocessed pixels in the student image, the process proceeds to step S12, and the target pixel is acquired from the student image in raster scan order.
[0024]
In step S13, the class tap acquisition unit 21 of the class classification unit 14 acquires the class taps corresponding to the target pixel from the student image stored in the frame memory 13. In step S14, the waveform classification unit 22 of the class classification unit 14 classifies the target pixel based on the class taps. In step S15, the prediction tap acquisition unit 15 acquires the prediction taps corresponding to the target pixel from the student image stored in the frame memory 13 based on the classified class.
[0025]
In step S16, the corresponding pixel acquisition unit 16 acquires the pixel corresponding to the pixel value to be predicted from the teacher image stored in the frame memory 11 based on the classified class.
[0026]
In step S17, based on the classified class, the normal equation generation unit 17 adds the pixel values of the pixels corresponding to the prediction taps and the pixel value to be predicted into the matrix for each class, and the process returns to step S11, where the image processing apparatus again determines whether there are unprocessed pixels. The matrix for each class, into which the pixel values of the pixels corresponding to the prediction taps and the pixel value to be predicted are added, corresponds to the normal equation for calculating the coefficients for each class.
[0027]
When it is determined in step S11 that there are no unprocessed pixels in the student image, the process proceeds to step S18, and the normal equation generation unit 17 supplies the matrix for each class, in which the pixel values of the pixels corresponding to the prediction taps and the pixel value to be predicted have been set, to the coefficient calculation unit 18. The coefficient calculation unit 18 calculates a coefficient set for each class by solving the matrix for each class in which those pixel values have been set.
[0028]
In step S19, the coefficient calculation unit 18 outputs the calculated coefficients for each class to the coefficient set memory 19. The coefficient set memory 19 stores the coefficient set for each class, and the process ends.
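The learning loop of steps S11 to S19 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names are hypothetical, the samples are assumed to be already classified, and Gauss-Jordan elimination is used here as one ordinary way to solve the per-class normal equation.

```python
def learn_coefficients(samples, n_taps):
    """Accumulate and solve one normal equation per class.

    samples: iterable of (class_number, prediction_taps, teacher_value).
    Returns a dict mapping class_number to a list of n_taps coefficients.
    """
    acc = {}  # class -> (A, b) with A = sum of x x^T and b = sum of x y
    for cls, taps, target in samples:
        a, b = acc.setdefault(
            cls, ([[0.0] * n_taps for _ in range(n_taps)], [0.0] * n_taps)
        )
        for i in range(n_taps):  # "adding into the matrix" (step S17)
            b[i] += taps[i] * target
            for j in range(n_taps):
                a[i][j] += taps[i] * taps[j]
    return {cls: solve(a, b) for cls, (a, b) in acc.items()}  # step S18


def solve(a, b):
    """Solve a w = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular normal equation: too few samples")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]
```

A class needs at least as many linearly independent samples as there are prediction taps; otherwise the sketch raises an error instead of returning coefficients.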
[0029]
FIG. 7 is a block diagram showing a configuration of a conventional image processing apparatus that generates an HD image from an SD image by class classification adaptive processing.
[0030]
The frame memory 31 stores an input image that is an SD image in units of frames.
The frame memory 31 supplies the stored SD image to the mapping unit 32.
[0031]
The SD image input to the mapping unit 32 is supplied to the class classification unit 41 and the prediction tap acquisition unit 42.
[0032]
The class classification unit 41 includes a class tap acquisition unit 51 and a waveform classification unit 52, and classifies the target pixel, that is, the pixel of interest in the SD image stored in the frame memory 31. The class tap acquisition unit 51 acquires a predetermined number of class taps corresponding to the target pixel from the frame memory 31, and supplies the acquired class taps to the waveform classification unit 52.
[0033]
The waveform classification unit 52 classifies the target pixel into one class out of a predetermined number of classes based on the class tap, and supplies the class number corresponding to the classified class to the prediction tap acquisition unit 42.
[0034]
Based on the class number, the prediction tap acquisition unit 42 acquires a predetermined number of prediction taps corresponding to the classified class from the input image stored in the frame memory 31, and supplies the acquired prediction taps and the class number to the prediction calculation unit 43.
[0035]
The prediction calculation unit 43 acquires a coefficient set corresponding to the class from the coefficient set stored in the coefficient set memory 33 based on the class number. The prediction calculation unit 43 predicts the pixel value of the predicted image by linear prediction based on the coefficient set corresponding to the class and the prediction tap. The prediction calculation unit 43 supplies the predicted pixel value to the frame memory 34.
[0036]
The frame memory 34 stores the predicted pixel value supplied from the prediction calculation unit 43 and outputs an HD image in which the predicted pixel value is set.
[0037]
FIG. 8 is a diagram illustrating the pixel values of the input image and the pixel values of the output image generated by the class classification adaptive processing. As shown in FIG. 8, the image generated by the class classification adaptive processing includes waveforms that were lost due to the band limitation of the SD image. In that sense, the process of generating a higher-resolution image by class classification adaptive processing can be said to create resolution.
[0038]
With reference to the flowchart of FIG. 9, a conventional image creation process for generating an HD image from an SD image by the image processing apparatus that executes the class classification adaptation process will be described.
[0039]
In step S31, the image processing apparatus determines whether there is an unprocessed pixel in the input image. If it is determined that there is an unprocessed pixel in the input image, the process proceeds to step S32, where the mapping unit 32 acquires the coefficient set stored in the coefficient set memory 33. In step S33, the image processing apparatus acquires the target pixel from the input image in raster scan order.
[0040]
In step S34, the class tap acquisition unit 51 of the class classification unit 41 acquires the class taps corresponding to the target pixel from the input image stored in the frame memory 31. In step S35, the waveform classification unit 52 of the class classification unit 41 classifies the target pixel into one class based on the class taps.
[0041]
In step S36, the prediction tap acquisition unit 42 acquires the prediction taps corresponding to the target pixel from the input image stored in the frame memory 31 based on the classified class.
[0042]
In step S37, the prediction calculation unit 43 predicts the pixel value of the predicted image by linear prediction based on the coefficient set corresponding to the classified class and the prediction tap.
[0043]
In step S38, the prediction calculation unit 43 outputs the predicted pixel value to the frame memory 34. The frame memory 34 stores the pixel value supplied from the prediction calculation unit 43. The procedure returns to step S31 and repeats the determination of whether or not there is an unprocessed pixel.
[0044]
If it is determined in step S31 that there are no unprocessed pixels in the input image, the process proceeds to step S39, where the frame memory 34 outputs the stored predicted image in which the predicted values are set, and the process ends.
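The mapping loop of steps S31 to S39 can be sketched in Python as follows. The tap positions and the classifier are passed in as callables because the text only says they are predetermined; the names `map_image`, `get_class_taps`, `classify`, and `get_prediction_taps` are hypothetical. The sketch predicts one value per target pixel and leaves the SD-to-HD pixel layout aside.

```python
def map_image(sd, coefficient_sets, get_class_taps, classify, get_prediction_taps):
    """Predict one output value per target pixel by linear prediction.

    sd: input image as a list of rows.
    coefficient_sets: dict mapping class number to a list of coefficients.
    The three callables stand in for the class tap acquisition unit,
    the waveform classification unit, and the prediction tap acquisition unit.
    """
    out = []
    for y in range(len(sd)):  # raster scan order (step S33)
        row = []
        for x in range(len(sd[0])):
            cls = classify(get_class_taps(sd, x, y))      # steps S34, S35
            taps = get_prediction_taps(sd, x, y, cls)     # step S36
            weights = coefficient_sets[cls]
            # Step S37: weighted sum of the prediction taps.
            row.append(sum(w * t for w, t in zip(weights, taps)))
        out.append(row)
    return out
```

The only per-pixel arithmetic at mapping time is one weighted sum; everything that depends on the training images is contained in `coefficient_sets`.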
[0045]
[Problems to be solved by the invention]
When an object moves in front of a stationary background, not only motion blur due to mixing of the moving object image itself, but also mixing of the background image and the moving object image occurs. Conventionally, it has not been considered to process an image corresponding to a mixture of a background image and an image of a moving object.
[0046]
An object of the present invention is to make it possible to process an image in accordance with the mixing of a background image and an image of a moving object.
[0047]
[Means for Solving the Problems]
The image processing apparatus of the present invention is an image processing apparatus for processing an input image composed of a predetermined number of pixels acquired by an imaging device having a predetermined number of pixels each having a time integration effect, and is characterized by including: region specifying means that, for the input image and the input images acquired immediately before and immediately after it, compares the absolute value of the difference between the pixel values of pixels at the same position with a predetermined threshold to determine whether there is pixel motion between the input image and the preceding input image and whether there is pixel motion between the input image and the following input image, specifies, based on the determination results, at least a foreground region consisting of foreground object components that make up a foreground object and a background region consisting of background object components that make up a background object, and outputs region specifying information indicating the specified result; prediction coefficient generation means that, using an input image supplied as learning data, for each of the foreground region image and the background region image, takes the input image as a teacher image and a lower-resolution version of the input image as a student image, expresses a predetermined pixel of the teacher image as a first-order linear expression of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtains in advance the prediction coefficients that minimize the error between the predicted pixel value given by that linear expression and the pixel value of the predetermined pixel of the teacher image; and conversion means that converts each of the foreground region image and the background region image of an input image supplied as prediction processing data into a high-resolution output image by evaluating the first-order linear expression of the prediction coefficients obtained in advance and a plurality of pixels around the pixel of the input image.
[0048]
The region specifying means may also use the results of determining the presence or absence of pixel movement among the input images acquired two before, one before, one after, and two after the input image, and may further specify, within the area in which the foreground object components and the background object components are mixed, a covered background area, which is an area that changes from the background object component to the foreground object component as time elapses, and an uncovered background area, which is an area that changes from the foreground object component to the background object component as time elapses. In that case, the prediction coefficient generation means may, using the input image supplied as learning data, also obtain prediction coefficients in advance for each of the covered background area image and the uncovered background area image, and the conversion means may also convert each of the covered background area image and the uncovered background area image of the input image supplied as prediction processing data into a high-resolution output image.
[0049]
The image processing apparatus may further include combining means that combines and outputs the output images obtained by converting to high resolution, respectively, the foreground area image, the background area image, the covered background area image, and the uncovered background area image of the input image supplied as prediction processing data.
[0051]
The image processing method of the present invention processes an input image composed of a predetermined number of pixels acquired by an imaging device having a predetermined number of pixels with a time integration effect, and is characterized by including: a region specifying step of comparing, between the input image and the input images acquired immediately before and after it, the absolute value of the difference between the pixel values of the pixels at the same position with a predetermined threshold value, thereby determining the presence or absence of pixel movement between the input image and the preceding input image and between the input image and the following input image, specifying, based on the determination results, at least a foreground area consisting of the foreground object components that make up a foreground object and a background area consisting of the background object components that make up a background object, and outputting region specifying information indicating the specification result; a prediction coefficient generation step of, using an input image supplied as learning data, for each of the foreground area image and the background area image, taking the input image as a teacher image and a lower-resolution version of the input image as a student image, expressing a predetermined pixel of the teacher image as a linear first-order expression of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtaining in advance the prediction coefficients that minimize the error between the predicted pixel value given by the linear first-order expression and the pixel value of the predetermined pixel of the teacher image; and a conversion step of, for each of the foreground area image and the background area image of an input image supplied as prediction processing data, calculating the linear first-order expression of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image, thereby converting the foreground area image and the background area image of the input image into a high-resolution output image.
[0055]
The recording medium of the present invention is a computer-readable recording medium storing a program that causes a computer to execute: a region specifying step of comparing, between an input image composed of a predetermined number of pixels acquired by an imaging device having a predetermined number of pixels with a time integration effect and the input images acquired immediately before and after it, the absolute value of the difference between the pixel values of the pixels at the same position with a predetermined threshold value, thereby determining the presence or absence of pixel movement between the input image and the preceding input image and between the input image and the following input image, specifying, based on the determination results, at least a foreground area consisting of the foreground object components that make up a foreground object and a background area consisting of the background object components that make up a background object, and outputting region specifying information indicating the specification result; a prediction coefficient generation step of, using an input image supplied as learning data, for each of the foreground area image and the background area image, taking the input image as a teacher image and a lower-resolution version of the input image as a student image, expressing a predetermined pixel of the teacher image as a linear first-order expression of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtaining in advance the prediction coefficients that minimize the error between the predicted pixel value given by the linear first-order expression and the pixel value of the predetermined pixel of the teacher image; and a conversion step of, for each of the foreground area image and the background area image of an input image supplied as prediction processing data, calculating the linear first-order expression of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image, thereby converting the foreground area image and the background area image of the input image into a high-resolution output image.
[0059]
The program of the present invention causes a computer that processes an input image composed of a predetermined number of pixels acquired by an imaging device having a predetermined number of pixels with a time integration effect to execute: a region specifying step of comparing, between the input image and the input images acquired immediately before and after it, the absolute value of the difference between the pixel values of the pixels at the same position with a predetermined threshold value, thereby determining the presence or absence of pixel movement between the input image and the preceding input image and between the input image and the following input image, specifying, based on the determination results, at least a foreground area consisting of the foreground object components that make up a foreground object and a background area consisting of the background object components that make up a background object, and outputting region specifying information indicating the specification result; a prediction coefficient generation step of, using an input image supplied as learning data, for each of the foreground area image and the background area image, taking the input image as a teacher image and a lower-resolution version of the input image as a student image, expressing a predetermined pixel of the teacher image as a linear first-order expression of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtaining in advance the prediction coefficients that minimize the error between the predicted pixel value given by the linear first-order expression and the pixel value of the predetermined pixel of the teacher image; and a conversion step of, for each of the foreground area image and the background area image of an input image supplied as prediction processing data, calculating the linear first-order expression of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image, thereby converting the foreground area image and the background area image of the input image into a high-resolution output image.
[0063]
In the image processing apparatus and method, the recording medium, and the program of the present invention, the absolute value of the difference between the pixel values of the pixels at the same position is compared with a predetermined threshold value between the input image and the input images acquired immediately before and after it, whereby the presence or absence of pixel movement between the input image and the preceding input image and between the input image and the following input image is determined. Based on the determination results, at least a foreground area consisting of the foreground object components that make up a foreground object and a background area consisting of the background object components that make up a background object are specified in the input image, and region specifying information indicating the specification result is output. Using an input image supplied as learning data, for each of the foreground area image and the background area image, the input image is taken as a teacher image and a lower-resolution version of the input image as a student image, a predetermined pixel of the teacher image is expressed as a linear first-order expression of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and the prediction coefficients that minimize the error between the predicted pixel value given by the linear first-order expression and the pixel value of the predetermined pixel of the teacher image are obtained in advance. For each of the foreground area image and the background area image of an input image supplied as prediction processing data, the linear first-order expression of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image is calculated, whereby the foreground area image and the background area image of the input image are converted into a high-resolution output image.
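The prediction coefficient generation and the conversion summarized above can be illustrated by an ordinary least-squares fit. The following is a sketch under stated assumptions, not the apparatus itself: the function names are hypothetical, each row of `taps` stands for the student-image pixels around one corresponding pixel, each entry of `targets` stands for the teacher-image pixel to be predicted, and class classification is omitted. In the scheme described, one such fit would be performed separately for the foreground area image and for the background area image.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting.

    Assumes the matrix is non-singular (enough independent training taps).
    """
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def fit_prediction_coefficients(taps, targets):
    """Coefficients w minimizing the sum of (w . taps[k] - targets[k])**2."""
    n = len(taps[0])
    xtx = [[sum(t[i] * t[j] for t in taps) for j in range(n)] for i in range(n)]
    xty = [sum(t[i] * y for t, y in zip(taps, targets)) for i in range(n)]
    return solve_linear(xtx, xty)  # normal equations (X^T X) w = X^T y


def predict(coefficients, tap):
    """Linear first-order expression of the coefficients and the tap pixels."""
    return sum(w * x for w, x in zip(coefficients, tap))


# Hypothetical training data whose targets follow the weights 0.25, 0.5, 0.25.
taps = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [1.0, 1.0, 1.0]]
targets = [0.25, 0.5, 0.25, 1.0]
coefficients = fit_prediction_coefficients(taps, targets)
```

Solving the normal equations directly keeps the sketch dependency-free; a production implementation would more likely use a numerically robust solver.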
[0064]
DETAILED DESCRIPTION OF THE INVENTION
FIG. 10 is a block diagram showing a configuration of an embodiment of an image processing apparatus according to the present invention. A CPU (Central Processing Unit) 71 executes various processes according to a program stored in a ROM (Read Only Memory) 72 or a storage unit 78. A RAM (Random Access Memory) 73 appropriately stores programs executed by the CPU 71 and data. These CPU 71, ROM 72, and RAM 73 are connected to each other by a bus 74.
[0065]
An input / output interface 75 is also connected to the CPU 71 via a bus 74. The input / output interface 75 is connected to an input unit 76 such as a keyboard, mouse, and microphone, and an output unit 77 such as a display and a speaker. The CPU 71 executes various processes in response to commands input from the input unit 76. Then, the CPU 71 outputs an image, sound, or the like obtained as a result of the processing to the output unit 77.
[0066]
The storage unit 78 connected to the input / output interface 75 is composed of, for example, a hard disk and stores programs executed by the CPU 71 and various data. The communication unit 79 communicates with an external device via the Internet or other networks. In this example, the communication unit 79 serves as an acquisition unit that captures the output of the sensor.
[0067]
A program may be acquired via the communication unit 79 and stored in the storage unit 78.
[0068]
When a magnetic disk 91, an optical disk 92, a magneto-optical disk 93, or a semiconductor memory 94 is mounted, the drive 80 connected to the input / output interface 75 drives it and acquires the programs, data, and the like recorded on it. The acquired programs and data are transferred to and stored in the storage unit 78 as necessary.
[0069]
FIG. 11 is a block diagram showing a functional configuration of the image processing apparatus according to the present invention.
[0070]
It does not matter whether each function of the image processing apparatus is realized by hardware or software. That is, each block diagram in this specification may be considered as a hardware block diagram or a software functional block diagram.
[0071]
Here, the motion blur refers to a distortion included in an image corresponding to a moving object, which is caused by the movement of an object in the real world to be imaged and the imaging characteristics of the sensor.
[0072]
In this specification, an image corresponding to an object in the real world to be imaged is referred to as an image object.
[0073]
The input image supplied to the image processing apparatus is supplied to the object extraction unit 101, the region specifying unit 103, the mixture ratio calculation unit 104, and the foreground / background separation unit 105.
[0074]
The object extraction unit 101 roughly extracts an image object corresponding to a foreground object included in the input image, and supplies the extracted image object to the motion detection unit 102. For example, the object extraction unit 101 detects the outline of the image object corresponding to the foreground object included in the input image, thereby roughly extracting the image object corresponding to the foreground object.
[0075]
The object extraction unit 101 roughly extracts an image object corresponding to a background object included in the input image, and supplies the extracted image object to the motion detection unit 102. For example, the object extraction unit 101 roughly extracts an image object corresponding to the background object from the difference between the input image and the image object corresponding to the extracted foreground object.
[0076]
Further, for example, the object extraction unit 101 may roughly extract the image object corresponding to the foreground object and the image object corresponding to the background object from the difference between the input image and a background image stored in a background memory provided inside the unit.
[0077]
The motion detection unit 102 calculates the motion vector of the image object corresponding to the roughly extracted foreground object by a method such as the block matching method, the gradient method, the phase correlation method, or the pel-recursive method, and supplies the calculated motion vector and the position information of the motion vector (information specifying the position of the pixel corresponding to the motion vector) to the region specifying unit 103 and the motion blur removal unit 106.
[0078]
The motion vector output from the motion detection unit 102 includes information corresponding to the motion amount v.
[0079]
Further, for example, the motion detection unit 102 may output a motion vector for each image object to the motion blur removal unit 106 together with pixel position information for specifying a pixel in the image object.
[0080]
The motion amount v is a value that represents a change in the position of the image corresponding to the moving object in units of pixel intervals. For example, when the image of the object corresponding to the foreground is moved so as to be displayed at a position separated by four pixels in the next frame with reference to a certain frame, the motion amount v of the image of the object corresponding to the foreground is 4.
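As an illustration of how a motion amount such as v = 4 might be obtained by block matching, one of the methods named above, the following sketch searches one image line for the shift that minimizes the sum of absolute differences. It is a simplified one-dimensional example; the function name, the search range, and the sample data are assumptions, and the actual motion detection unit 102 is not specified at this level of detail.

```python
def block_match_1d(prev_line, next_line, start, size, max_shift):
    """Return the horizontal shift (in pixels) that best matches a block.

    The block prev_line[start:start + size] is compared against next_line
    at every shift in [-max_shift, max_shift] using the sum of absolute
    differences (SAD); shifts that leave the line are skipped.
    """
    block = prev_line[start:start + size]
    best_shift, best_sad = 0, None
    for shift in range(-max_shift, max_shift + 1):
        pos = start + shift
        if pos < 0 or pos + size > len(next_line):
            continue
        candidate = next_line[pos:pos + size]
        sad = sum(abs(a - b) for a, b in zip(block, candidate))
        if best_sad is None or sad < best_sad:
            best_shift, best_sad = shift, sad
    return best_shift


# Hypothetical lines: the pattern 9, 7, 5 moves four pixels to the right.
line_in_frame_n = [0, 0, 9, 7, 5, 0, 0, 0, 0, 0, 0, 0]
line_in_next_frame = [0, 0, 0, 0, 0, 0, 9, 7, 5, 0, 0, 0]
shift = block_match_1d(line_in_frame_n, line_in_next_frame, 2, 3, 5)
motion_amount_v = abs(shift)
```

The absolute value of the detected shift plays the role of the motion amount v, expressed in units of the pixel interval.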
[0081]
The region specifying unit 103 assigns each pixel of the input image to one of the foreground area, the background area, or the mixed area, and supplies information indicating which of these areas each pixel belongs to (hereinafter referred to as region information) to the mixture ratio calculation unit 104, the foreground / background separation unit 105, and the motion blur removal unit 106. Details of the foreground area, the background area, and the mixed area will be described later.
[0082]
Based on the input image and the region information supplied from the region specifying unit 103, the mixture ratio calculation unit 104 calculates a mixture ratio (hereinafter referred to as the mixture ratio α) for each pixel included in the mixed area, and supplies the calculated mixture ratio to the foreground / background separation unit 105.
[0083]
The mixture ratio α is a value indicating the proportion of the image component corresponding to the background object (hereinafter also referred to as the background component) in a pixel value, as shown in equation (3) described later.
[0084]
Based on the region information supplied from the region specifying unit 103 and the mixture ratio α supplied from the mixture ratio calculation unit 104, the foreground / background separation unit 105 separates the input image into a foreground component image consisting only of the image components corresponding to the foreground object (hereinafter also referred to as foreground components) and a background component image consisting only of the background components. It supplies the foreground component image to the motion blur removal unit 106 and the background component image to the correction unit 107.
[0085]
The motion blur removal unit 106 determines a processing unit indicating one or more pixels included in the foreground component image based on the motion amount v and the region information that can be known from the motion vector. The processing unit is data that designates a group of pixels to be subjected to a process for adjusting the amount of motion blur.
[0086]
Based on the foreground component image supplied from the foreground / background separation unit 105, the motion vector and its position information supplied from the motion detection unit 102, and the processing unit, the motion blur removal unit 106 removes the motion blur included in the foreground component image and outputs the foreground component image from which the motion blur has been removed to the motion blur removal image processing unit 108.
[0087]
The correction unit 107 corrects the pixel value of the pixel corresponding to the mixed region in the background component image. The pixel value of the pixel corresponding to the mixed region of the background component image is calculated by removing the foreground component from the pixel value of the pixel of the mixed region before being separated. Therefore, the pixel value of the pixel corresponding to the mixed region of the background component image is decreased corresponding to the mixing ratio α as compared with the pixel value of the pixel of the adjacent background region.
[0088]
The correction unit 107 corrects this decrease in gain, which corresponds to the mixture ratio α, in the pixel values of the pixels corresponding to the mixed region in the background component image, and supplies the corrected background component image to the motion blur removal image processing unit 108.
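One plausible form of this gain correction is sketched below. It rests on an assumption not stated in this passage: that the background component remaining in a mixed-area pixel equals α times the value the pixel would have as pure background, so that dividing by α restores the level of the adjacent background area. The function name is hypothetical.

```python
def correct_background_pixel(background_component, alpha):
    """Undo the gain decrease of a mixed-area pixel in the background image.

    Assumes background_component == alpha * full_background_value, so the
    correction is a division by the mixture ratio alpha (0 < alpha <= 1).
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1] for a correctable pixel")
    return background_component / alpha


# Hypothetical pixel: a background value of 80 observed with alpha = 1/4.
full_background_value = 80.0
alpha = 0.25
separated_component = alpha * full_background_value
corrected_value = correct_background_pixel(separated_component, alpha)
```

A pixel with α close to 0 carries almost no background information, so a practical implementation would need some policy for such pixels; none is implied here.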
[0089]
The motion blur removal image processing unit 108 individually processes the foreground component image from which motion blur has been removed and the corrected background component image by the class classification adaptive processing.
[0090]
For example, the motion blur removal image processing unit 108 generates, separately for the foreground component image from which motion blur has been removed and for the corrected background component image, the coefficients used in the class classification adaptive processing that generates a higher-resolution image.
[0091]
For example, the motion blur removal image processing unit 108 applies the class classification adaptive processing separately to the foreground component image from which motion blur has been removed and to the corrected background component image, and thereby creates a higher-resolution image of each.
[0092]
Next, an input image supplied to the image processing apparatus will be described with reference to FIGS.
[0093]
FIG. 12 is a diagram for explaining imaging by a sensor. The sensor is composed of, for example, a CCD video camera equipped with a CCD (Charge-Coupled Device) area sensor which is a solid-state image sensor. The object corresponding to the foreground in the real world moves horizontally between the object corresponding to the background and the sensor in the real world, for example, from the left side to the right side in the drawing.
[0094]
The sensor images an object corresponding to the foreground together with an object corresponding to the background. The sensor outputs the captured image in units of one frame. For example, the sensor outputs an image composed of 30 frames per second. The exposure time of the sensor can be 1/30 second. The exposure time is a period from the start of the conversion of the input light into the electric charge until the end of the conversion of the input light into the electric charge. Hereinafter, the exposure time is also referred to as shutter time.
[0095]
FIG. 13 is a diagram illustrating the arrangement of pixels. In FIG. 13, A to I indicate individual pixels. The pixels are arranged on a plane corresponding to the image. One detection element corresponding to one pixel is arranged on the sensor. When the sensor captures an image, one detection element outputs a pixel value corresponding to one pixel constituting the image. For example, the position of the detection element in the X direction corresponds to the horizontal position on the image, and the position of the detection element in the Y direction corresponds to the vertical position on the image.
[0096]
As shown in FIG. 14, for example, a detection element that is a CCD converts input light into electric charges for a period corresponding to a shutter time, and accumulates the converted electric charges. The amount of charge is approximately proportional to the intensity of the input light and the time during which the light is input. In the period corresponding to the shutter time, the detection element adds the electric charge converted from the input light to the already accumulated electric charge. That is, the detection element integrates the input light for a period corresponding to the shutter time, and accumulates an amount of charge corresponding to the integrated light. It can be said that the detection element has an integration effect with respect to time.
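The integration effect described above can be sketched numerically as follows. This is only an illustration: the discretization of the shutter time into equal sub-intervals, the unit proportionality constant, and the function name are assumptions.

```python
def accumulate_charge(intensity_samples, shutter_time):
    """Approximate the charge accumulated by one detection element.

    intensity_samples holds the input light intensity in equal
    sub-intervals of the shutter time; the charge is taken to be
    proportional (constant 1) to intensity multiplied by time.
    """
    dt = shutter_time / len(intensity_samples)
    charge = 0.0
    for intensity in intensity_samples:
        # Newly converted charge is added to the charge already stored.
        charge += intensity * dt
    return charge


constant_light = [2.0, 2.0, 2.0, 2.0]
charge_short = accumulate_charge(constant_light, 1.0)
charge_long = accumulate_charge(constant_light, 2.0)
```

Doubling either the intensity or the shutter time doubles the accumulated charge, which is the proportionality stated in the text.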
[0097]
The electric charge accumulated in the detection element is converted into a voltage value by a circuit (not shown), and the voltage value is further converted into a pixel value such as digital data and output. Therefore, each pixel value output from the sensor is a value projected onto a one-dimensional space, obtained by integrating a spatially extended part of the object corresponding to the foreground or the background over the shutter time.
[0098]
The image processing apparatus extracts significant information, for example the mixture ratio α, that has been buried in the output signal by this accumulation operation of the sensor.
[0099]
FIG. 15 is a diagram illustrating an image obtained by imaging an object corresponding to a moving foreground and an object corresponding to a stationary background. FIG. 15A shows an image obtained by imaging an object corresponding to a foreground with movement and an object corresponding to a stationary background. In the example shown in FIG. 15A, the object corresponding to the foreground is moving horizontally from the left to the right with respect to the screen.
[0100]
FIG. 15B is a model diagram in which the pixel values corresponding to one line of the image shown in FIG. 15A are expanded in the time direction. The horizontal direction in FIG. 15B corresponds to the spatial direction X in FIG. 15A.
[0101]
The pixel value of the background region pixel is composed of only the background component, that is, the image component corresponding to the background object. The pixel value of the foreground region pixel is composed of only the foreground component, that is, the image component corresponding to the foreground object.
[0102]
The pixel value of the pixel in the mixed area is composed of a background component and a foreground component. Since the pixel value is composed of the background component and the foreground component, the mixed region can be said to be a distortion region. The mixed area is further classified into a covered background area and an uncovered background area.
[0103]
The covered background area is a mixed area at a position corresponding to the front end in the advancing direction of the foreground object with respect to the foreground area, and is an area where the background component is covered with the foreground as time passes.
[0104]
On the other hand, the uncovered background area is a mixed area at a position corresponding to the rear end of the foreground object in its advancing direction, relative to the foreground area, and is an area where the background component appears as time passes.
[0105]
As described above, an image including a foreground area, a background area, and a covered background area or an uncovered background area is input as the input image to the region specifying unit 103, the mixture ratio calculation unit 104, and the foreground / background separation unit 105.
[0106]
FIG. 16 is a diagram illustrating the background area, the foreground area, the mixed area, the covered background area, and the uncovered background area described above. In the case of the image shown in FIG. 15, the background area is the stationary part, the foreground area is the moving part, the covered background area of the mixed area is the part that changes from the background to the foreground, and the uncovered background area of the mixed area is the part that changes from the foreground to the background.
[0107]
FIG. 17 is a model diagram in which the pixel values of pixels arranged adjacently in a single row, in an image obtained by capturing an object corresponding to a stationary foreground and an object corresponding to a stationary background, are expanded in the time direction. For example, pixels arranged on one line of the screen can be selected as the pixels arranged adjacently in a single row.
[0108]
The pixel values F01 to F04 shown in FIG. 17 are pixel values corresponding to the still foreground object. The pixel values B01 to B04 shown in FIG. 17 are the pixel values of the pixels corresponding to the stationary background object.
[0109]
In the vertical direction in FIG. 17, time elapses from the top to the bottom of the figure. The position of the upper side of each rectangle in FIG. 17 corresponds to the time at which the sensor starts converting the input light into charge, and the position of the lower side of each rectangle corresponds to the time at which the sensor finishes converting the input light into charge. That is, the distance from the upper side to the lower side of a rectangle in FIG. 17 corresponds to the shutter time.
[0110]
Hereinafter, a case where the shutter time and the frame interval are the same will be described as an example.
[0111]
The horizontal direction in FIG. 17 corresponds to the spatial direction X described in FIG. More specifically, in the example shown in FIG. 17, the distance from the left side of the rectangle described as “F01” in FIG. 17 to the right side of the rectangle described as “B04” is 8 times the pixel pitch, That is, it corresponds to the interval between eight consecutive pixels.
[0112]
When the foreground object and the background object are stationary, the light input to the sensor does not change during the period corresponding to the shutter time.
[0113]
Here, the period corresponding to the shutter time is divided into two or more periods having the same length. For example, if the number of virtual divisions is 4, the model diagram shown in FIG. 17 can be represented as the model shown in FIG. The virtual division number is set corresponding to the amount of movement v of the object corresponding to the foreground within the shutter time. For example, the number of virtual divisions is 4 corresponding to the motion amount v being 4, and the period corresponding to the shutter time is divided into 4.
[0114]
The top row in the figure corresponds to the first divided period after the shutter opens. The second row from the top corresponds to the second divided period after the shutter opens. The third row from the top corresponds to the third divided period after the shutter opens. The fourth row from the top corresponds to the fourth divided period after the shutter opens.
[0115]
Hereinafter, the shutter time divided in accordance with the motion amount v is also referred to as shutter time / v.
[0116]
Since the light input to the sensor does not change when the object corresponding to the foreground is stationary, the foreground component F01 / v is equal to the pixel value F01 divided by the virtual division number. Similarly, when the object corresponding to the foreground is stationary, the foreground component F02 / v is equal to the pixel value F02 divided by the virtual division number, the foreground component F03 / v is equal to the pixel value F03 divided by the virtual division number, and the foreground component F04 / v is equal to the pixel value F04 divided by the virtual division number.
[0117]
Since the light input to the sensor does not change when the object corresponding to the background is stationary, the background component B01 / v is equal to the value obtained by dividing the pixel value B01 by the virtual division number. Similarly, when the object corresponding to the background is stationary, the background component B02 / v is equal to the value obtained by dividing the pixel value B02 by the virtual division number, and B03 / v is obtained by dividing the pixel value B03 by the virtual division number. B04 / v is equal to a value obtained by dividing the pixel value B04 by the number of virtual divisions.
[0118]
That is, when the object corresponding to the foreground is stationary, the light corresponding to the foreground object input to the sensor does not change during the period corresponding to the shutter time. Consequently, the foreground component F01 / v corresponding to the first shutter time / v after the shutter opens, the foreground component F01 / v corresponding to the second shutter time / v, the foreground component F01 / v corresponding to the third shutter time / v, and the foreground component F01 / v corresponding to the fourth shutter time / v all have the same value. F02 / v to F04 / v have the same relationship as F01 / v.
[0119]
When the object corresponding to the background is stationary, the light corresponding to the background object input to the sensor does not change during the period corresponding to the shutter time. Consequently, the background component B01 / v corresponding to the first shutter time / v after the shutter opens, the background component B01 / v corresponding to the second shutter time / v, the background component B01 / v corresponding to the third shutter time / v, and the background component B01 / v corresponding to the fourth shutter time / v all have the same value. B02 / v to B04 / v have the same relationship.
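The virtual division of a stationary pixel can be written out as a small sketch. The function name and the sample value are hypothetical; the only property taken from the text is that each of the v divided periods contributes the same component, equal to the pixel value divided by the virtual division number.

```python
def divide_stationary_pixel(pixel_value, v):
    """Split a stationary pixel value into v equal components.

    Each entry corresponds to one shutter time / v period; because the
    input light does not change, every component equals pixel_value / v.
    """
    component = pixel_value / v
    return [component] * v


# Hypothetical stationary foreground pixel value F01 = 100 with v = 4.
f01_components = divide_stationary_pixel(100.0, 4)
```

Summing the v components returns the original pixel value, which is what allows the model to treat the pixel value as the accumulation of its components over the shutter time.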
[0120]
Next, a case where the object corresponding to the foreground moves and the object corresponding to the background is stationary will be described.
[0121]
FIG. 19 is a model diagram in which pixel values of pixels on one line including the covered background area are expanded in the time direction when the object corresponding to the foreground moves toward the right side in the drawing. In FIG. 19, the foreground motion amount v is 4. Since one frame is a short time, it can be assumed that the object corresponding to the foreground is a rigid body and is moving at a constant speed. In FIG. 19, the image of the object corresponding to the foreground moves so as to be displayed on the right side by four pixels in the next frame with reference to a certain frame.
[0122]
In FIG. 19, the leftmost pixel through the fourth pixel from the left belong to the foreground area. In FIG. 19, the fifth through seventh pixels from the left belong to the mixed area, which is a covered background area. In FIG. 19, the rightmost pixel belongs to the background area.
[0123]
Since the object corresponding to the foreground moves so as to cover the object corresponding to the background with the passage of time, the components included in the pixel values of the pixels belonging to the covered background area change from background components to foreground components at a certain point in the period corresponding to the shutter time.
[0124]
For example, the pixel value M enclosed by the thick frame in FIG. 19 is expressed by Expression (1).
[0125]
M = B02 / v + B02 / v + F07 / v + F06 / v (1)
[0126]
For example, since the fifth pixel from the left includes a background component corresponding to one shutter time / v and foreground components corresponding to three shutter times / v, the mixture ratio α of the fifth pixel from the left is 1/4. The sixth pixel from the left includes background components corresponding to two shutter times / v and foreground components corresponding to two shutter times / v. Therefore, the mixture ratio α of the sixth pixel from the left is 1/2. The seventh pixel from the left includes background components corresponding to three shutter times / v and a foreground component corresponding to one shutter time / v. Therefore, the mixture ratio α of the seventh pixel from the left is 3/4.
[0127]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed four pixels to the right in the next frame, for example, the foreground component F07 / v of the fourth pixel from the left in FIG. 19 for the first shutter time / v after the shutter opens is equal to the foreground component of the fifth pixel from the left in FIG. 19 for the second shutter time / v after the shutter opens. Similarly, the foreground component F07 / v is equal to the foreground component of the sixth pixel from the left in FIG. 19 for the third shutter time / v after the shutter opens and to the foreground component of the seventh pixel from the left in FIG. 19 for the fourth shutter time / v after the shutter opens.
[0128]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed four pixels to the right in the next frame, for example, the foreground component F06 / v of the third pixel from the left in FIG. 19 for the first shutter time / v after the shutter opens is equal to the foreground component of the fourth pixel from the left in FIG. 19 for the second shutter time / v after the shutter opens. Similarly, the foreground component F06 / v is equal to the foreground component of the fifth pixel from the left in FIG. 19 for the third shutter time / v after the shutter opens and to the foreground component of the sixth pixel from the left in FIG. 19 for the fourth shutter time / v after the shutter opens.
[0129]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed four pixels to the right in the next frame, for example, the foreground component F05 / v of the second pixel from the left in FIG. 19 for the first shutter time / v after the shutter opens is equal to the foreground component of the third pixel from the left in FIG. 19 for the second shutter time / v after the shutter opens. Similarly, the foreground component F05 / v is equal to the foreground component of the fourth pixel from the left in FIG. 19 for the third shutter time / v after the shutter opens and to the foreground component of the fifth pixel from the left in FIG. 19 for the fourth shutter time / v after the shutter opens.
[0130]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed four pixels to the right in the next frame, for example, the foreground component F04 / v of the leftmost pixel in FIG. 19 for the first shutter time / v after the shutter opens is equal to the foreground component of the second pixel from the left in FIG. 19 for the second shutter time / v after the shutter opens. Similarly, the foreground component F04 / v is equal to the foreground component of the third pixel from the left in FIG. 19 for the third shutter time / v after the shutter opens and to the foreground component of the fourth pixel from the left in FIG. 19 for the fourth shutter time / v after the shutter opens.
[0131]
Since the foreground area corresponding to the moving object includes motion blur as described above, it can be said to be a distortion area.
[0132]
FIG. 20 is a model diagram in which pixel values of pixels on one line including the uncovered background area are expanded in the time direction when the foreground moves toward the right side in the drawing. In FIG. 20, the foreground motion amount v is 4. Since one frame is a short time, it can be assumed that the object corresponding to the foreground is a rigid body and is moving at a constant speed. In FIG. 20, the image of the object corresponding to the foreground moves to the right by four pixels in the next frame with reference to a certain frame.
[0133]
In FIG. 20, the leftmost pixel through the fourth pixel from the left belong to the background area. In FIG. 20, the fifth through seventh pixels from the left belong to the mixed area, which is an uncovered background area. In FIG. 20, the rightmost pixel belongs to the foreground area.
[0134]
Since the object corresponding to the foreground, which had covered the object corresponding to the background, moves so as to be removed from in front of the background object with the passage of time, the components included in the pixel values of the pixels belonging to the uncovered background area change from foreground components to background components at a certain point in the period corresponding to the shutter time.
[0135]
For example, the pixel value M' enclosed by the thick frame in FIG. 20 is expressed by Expression (2).
[0136]
M '= F02 / v + F01 / v + B26 / v + B26 / v (2)
[0137]
For example, since the fifth pixel from the left includes background components corresponding to three shutter times / v and a foreground component corresponding to one shutter time / v, the mixture ratio α of the fifth pixel from the left is 3/4. The sixth pixel from the left includes background components corresponding to two shutter times / v and foreground components corresponding to two shutter times / v. Therefore, the mixture ratio α of the sixth pixel from the left is 1/2. Since the seventh pixel from the left includes a background component corresponding to one shutter time / v and foreground components corresponding to three shutter times / v, the mixture ratio α of the seventh pixel from the left is 1/4.
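The mixture ratios given for FIG. 19 and FIG. 20 follow a simple pattern: α is the fraction of the shutter time / v periods during which the background is visible. The following is a minimal sketch of that pattern, assuming the foreground moves to the right by v pixels per frame; the function name `mixture_ratios` and its parameters are hypothetical and do not appear in the specification.

```python
from fractions import Fraction

def mixture_ratios(v, covered):
    """Mixture ratio alpha for the v - 1 mixed-area pixels, left to right.

    Assumes the foreground moves to the right by v pixels per frame, as in
    FIG. 19 (covered background) and FIG. 20 (uncovered background).
    """
    if covered:
        # The k-th mixed pixel shows background for its first k periods.
        return [Fraction(k, v) for k in range(1, v)]
    # Uncovered: the k-th mixed pixel shows background for its last v - k periods.
    return [Fraction(v - k, v) for k in range(1, v)]

covered_ratios = mixture_ratios(4, covered=True)     # fifth to seventh pixels of FIG. 19
uncovered_ratios = mixture_ratios(4, covered=False)  # fifth to seventh pixels of FIG. 20
```

With v = 4 this gives 1/4, 1/2, 3/4 for the covered case and 3/4, 1/2, 1/4 for the uncovered case, matching the values stated above.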
[0138]
When the expressions (1) and (2) are generalized, the pixel value M is expressed by the expression (3).
[0139]
[Expression 1]
M = \alpha \cdot B + \sum_{i} F_i / v \qquad (3)
Here, α is a mixing ratio. B is a background pixel value, and Fi / v is a foreground component.
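As a numerical check, the sketch below assumes that Expression (3) has the form M = α · B + Σ Fi / v, which is consistent with Expressions (1) and (2), and compares it with Expression (1) for the thick-framed pixel of FIG. 19. The pixel values and the function name `mixed_pixel_value` are hypothetical and chosen only for illustration.

```python
def mixed_pixel_value(alpha, background, foreground_components, v):
    """M = alpha * B + sum(Fi / v), the reading of Expression (3) assumed here."""
    return alpha * background + sum(f / v for f in foreground_components)

# Hypothetical values for the components named in Expression (1).
v = 4
B02, F06, F07 = 80.0, 200.0, 120.0

# Expression (1) written out term by term.
m_expr1 = B02 / v + B02 / v + F07 / v + F06 / v

# The same pixel via the generalized form, with mixture ratio alpha = 1/2.
m_expr3 = mixed_pixel_value(0.5, B02, [F07, F06], v)
```

Both evaluations give the same value because the two background terms B02 / v together equal α · B with α = 2/4.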
[0140]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the motion amount v is 4, for example, the foreground component F01 / v of the fifth pixel from the left in FIG. 20 for the first shutter time / v after the shutter opens is equal to the foreground component of the sixth pixel from the left in FIG. 20 for the second shutter time / v after the shutter opens. Similarly, F01 / v is equal to the foreground component of the seventh pixel from the left in FIG. 20 for the third shutter time / v after the shutter opens and to the foreground component of the eighth pixel from the left in FIG. 20 for the fourth shutter time / v after the shutter opens.
[0141]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the number of virtual divisions is 4, for example, the foreground component F02 / v of the sixth pixel from the left in FIG. 20 for the first shutter time / v after the shutter opens is equal to the foreground component of the seventh pixel from the left in FIG. 20 for the second shutter time / v after the shutter opens. Similarly, the foreground component F02 / v is equal to the foreground component of the eighth pixel from the left in FIG. 20 for the third shutter time / v after the shutter opens.
[0142]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the motion amount v is 4, for example, the foreground component F03 / v of the seventh pixel from the left in FIG. 20 for the first shutter time / v after the shutter opens is equal to the foreground component of the eighth pixel from the left in FIG. 20 for the second shutter time / v after the shutter opens.
[0143]
In the description of FIG. 18 to FIG. 20, the number of virtual divisions has been described as 4, but the number of virtual divisions corresponds to the motion amount v. The motion amount v generally corresponds to the moving speed of the object corresponding to the foreground. For example, when the object corresponding to the foreground is moving so as to be displayed four pixels to the right in the next frame with reference to a certain frame, the motion amount v is 4 and, correspondingly, the number of virtual divisions is 4. Similarly, when the object corresponding to the foreground is moving so as to be displayed six pixels to the left in the next frame with reference to a certain frame, the motion amount v is 6 and the number of virtual divisions is 6.
[0144]
FIGS. 21 and 22 show the relationship between the foreground area, the background area, and the mixed area composed of the covered background area or the uncovered background area described above, and the foreground components and background components corresponding to the divided shutter time.
[0145]
FIG. 21 shows an example in which pixels in the foreground area, the background area, and the mixed area are extracted from an image including a foreground corresponding to an object moving in front of a stationary background. In the example shown in FIG. 21, the object corresponding to the foreground is moving horizontally with respect to the screen.
[0146]
Frame # n + 1 is the next frame after frame #n, and frame # n + 2 is the next frame after frame # n + 1.
[0147]
FIG. 22 shows a model in which the pixels in the foreground area, the background area, and the mixed area are extracted from any of frame #n to frame # n + 2, the motion amount v is set to 4, and the pixel values of the extracted pixels are expanded in the time direction.
[0148]
Since the object corresponding to the foreground moves, the pixel value in the foreground area is composed of four different foreground components corresponding to the shutter time / v period. For example, the leftmost pixel among the pixels in the foreground area shown in FIG. 22 is composed of F01 / v, F02 / v, F03 / v, and F04 / v. That is, the pixels in the foreground area include motion blur.
[0149]
Since the object corresponding to the background is stationary, the light corresponding to the background input to the sensor does not change during the period corresponding to the shutter time. In this case, the pixel value in the background area does not include motion blur.
[0150]
The pixel value of the pixel belonging to the mixed area composed of the covered background area or the uncovered background area is composed of a foreground component and a background component.
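The way foreground, background, and mixed pixel values arise can be illustrated by integrating one line of pixels over a shutter time divided into v periods. The following is a minimal sketch under the assumptions stated in the text (rigid foreground, constant speed of one pixel per period to the right, stationary background); the function `expose_line` and the sample values are hypothetical.

```python
def expose_line(foreground, background, start, v):
    """Integrate one line over one shutter time split into v equal periods.

    foreground: pixel values of the rigid foreground object, left to right
    background: pixel values of the stationary background line
    start:      index of the object's left edge when the shutter opens
    v:          motion amount (pixels per frame), also the number of periods
    """
    width = len(background)
    line = [0.0] * width
    for period in range(v):
        left = start + period  # the object shifts right by one pixel per period
        for x in range(width):
            i = x - left
            if 0 <= i < len(foreground):
                line[x] += foreground[i] / v  # foreground component
            else:
                line[x] += background[x] / v  # background component
    return line

line = expose_line([40.0, 80.0, 120.0, 160.0, 200.0], [10.0] * 12, start=0, v=4)
```

In this example, pixel index 3 is covered by the object in all four periods and therefore holds a motion-blurred average of four different foreground values, pixel index 11 holds the unchanged background value, and pixel indices 0 and 5 hold mixtures of foreground and background components.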
[0151]
Next, a model will be described in which, when the image corresponding to the object is moving, the pixel values of pixels that are arranged adjacently in a row in a plurality of frames and are at the same positions on the frames are expanded in the time direction. For example, when the image corresponding to the object moves horizontally with respect to the screen, the pixels arranged on one line of the screen can be selected as the pixels arranged adjacently in a row.
[0152]
FIG. 23 shows pixels arranged adjacently in a row in three frames of an image obtained by imaging an object corresponding to a stationary background, and is a model diagram in which the pixel values of the pixels at the same positions on the frames are expanded in the time direction. Frame #n is the next frame after frame # n-1, and frame # n + 1 is the next frame after frame #n. Other frames are referred to in the same manner.
[0153]
The pixel values B01 to B12 shown in FIG. 23 are pixel values corresponding to the stationary background object. Since the object corresponding to the background is stationary, the pixel value of the corresponding pixel does not change from frame # n-1 to frame # n + 1. For example, the pixel in frame #n and the pixel in frame # n + 1 corresponding to the position of the pixel having a pixel value of B05 in frame # n-1 each have a pixel value of B05.
[0154]
FIG. 24 shows pixels arranged adjacently in a row in three frames of an image obtained by imaging an object corresponding to a foreground moving to the right in the drawing together with an object corresponding to a stationary background, and is a model diagram in which the pixel values of the pixels at the same positions on the frames are expanded in the time direction. The model shown in FIG. 24 includes a covered background area.
[0155]
In FIG. 24, it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image moves so as to be displayed four pixels to the right in the next frame. Therefore, the foreground motion amount v is 4 and the number of virtual divisions is 4.
[0156]
For example, the foreground component of the leftmost pixel of frame # n-1 in FIG. 24 for the first shutter time / v after the shutter opens is F12 / v, and the foreground component of the second pixel from the left in FIG. 24 for the second shutter time / v after the shutter opens is also F12 / v. The foreground component of the third pixel from the left in FIG. 24 for the third shutter time / v after the shutter opens and the foreground component of the fourth pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens are F12 / v.
[0157]
The foreground component of the leftmost pixel of frame # n-1 in FIG. 24 for the second shutter time / v after the shutter opens is F11 / v, and the foreground component of the second pixel from the left in FIG. 24 for the third shutter time / v after the shutter opens is also F11 / v. The foreground component of the third pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens is F11 / v.
[0158]
The foreground component of the leftmost pixel of frame # n-1 in FIG. 24 for the third shutter time / v after the shutter opens is F10 / v, and the foreground component of the second pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens is also F10 / v. The foreground component of the leftmost pixel of frame # n-1 in FIG. 24 for the fourth shutter time / v after the shutter opens is F09 / v.
[0159]
Since the object corresponding to the background is stationary, the background component of the second pixel from the left of frame # n-1 in FIG. 24 for the first shutter time / v after the shutter opens is B01 / v. The background components of the third pixel from the left of frame # n-1 in FIG. 24 for the first and second shutter time / v after the shutter opens are B02 / v. The background components of the fourth pixel from the left of frame # n-1 in FIG. 24 for the first through third shutter time / v after the shutter opens are B03 / v.
[0160]
In frame # n−1 in FIG. 24, the leftmost pixel belongs to the foreground area, and the second to fourth pixels from the left belong to the mixed area, which is a covered background area.
[0161]
The fifth through twelfth pixels from the left of frame # n−1 in FIG. 24 belong to the background area, and the pixel values thereof are B04 through B11, respectively.
[0162]
The first through fifth pixels from the left of frame #n in FIG. 24 belong to the foreground area. The foreground component of the shutter time / v in the foreground area of frame #n is any one of F05 / v to F12 / v.
[0163]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image moves so as to be displayed four pixels to the right in the next frame, the foreground component of the fifth pixel from the left of frame #n in FIG. 24 for the first shutter time / v after the shutter opens is F12 / v, and the foreground component of the sixth pixel from the left in FIG. 24 for the second shutter time / v after the shutter opens is also F12 / v. The foreground component of the seventh pixel from the left in FIG. 24 for the third shutter time / v after the shutter opens and the foreground component of the eighth pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens are F12 / v.
[0164]
The foreground component of the fifth pixel from the left of frame #n in FIG. 24 for the second shutter time / v after the shutter opens is F11 / v, and the foreground component of the sixth pixel from the left in FIG. 24 for the third shutter time / v after the shutter opens is also F11 / v. The foreground component of the seventh pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens is F11 / v.
[0165]
The foreground component of the fifth pixel from the left of frame #n in FIG. 24 for the third shutter time / v after the shutter opens is F10 / v, and the foreground component of the sixth pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens is also F10 / v. The foreground component of the fifth pixel from the left of frame #n in FIG. 24 for the fourth shutter time / v after the shutter opens is F09 / v.
[0166]
Since the object corresponding to the background is stationary, the background component of the sixth pixel from the left of frame #n in FIG. 24 corresponding to the first shutter time / v after the shutter is opened is B05 / v. The background component of the seventh pixel from the left of frame #n in FIG. 24 corresponding to the first and second shutter time / v from when the shutter has opened is B06 / v. The background component of the eighth pixel from the left of frame #n in FIG. 24 corresponding to the first through third shutter time / v from when the shutter has opened is B07 / v.
[0167]
In frame #n in FIG. 24, the sixth through eighth pixels from the left belong to the mixed area, which is a covered background area.
[0168]
The ninth through twelfth pixels from the left of frame #n in FIG. 24 belong to the background area, and the pixel values thereof are B08 through B11, respectively.
[0169]
The first through ninth pixels from the left in frame # n + 1 in FIG. 24 belong to the foreground area. The foreground component of the shutter time / v in the foreground area of frame # n + 1 is any one of F01 / v to F12 / v.
[0170]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image moves so as to be displayed four pixels to the right in the next frame, the foreground component of the ninth pixel from the left of frame # n + 1 in FIG. 24 for the first shutter time / v after the shutter opens is F12 / v, and the foreground component of the tenth pixel from the left in FIG. 24 for the second shutter time / v after the shutter opens is also F12 / v. The foreground component of the eleventh pixel from the left in FIG. 24 for the third shutter time / v after the shutter opens and the foreground component of the twelfth pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens are F12 / v.
[0171]
The foreground component of the ninth pixel from the left of frame # n + 1 in FIG. 24 for the second shutter time / v after the shutter opens is F11 / v, and the foreground component of the tenth pixel from the left in FIG. 24 for the third shutter time / v after the shutter opens is also F11 / v. The foreground component of the eleventh pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens is F11 / v.
[0172]
The foreground component of the ninth pixel from the left of frame # n + 1 in FIG. 24 for the third shutter time / v after the shutter opens is F10 / v, and the foreground component of the tenth pixel from the left in FIG. 24 for the fourth shutter time / v after the shutter opens is also F10 / v. The foreground component of the ninth pixel from the left of frame # n + 1 in FIG. 24 for the fourth shutter time / v after the shutter opens is F09 / v.
[0173]
Since the object corresponding to the background is stationary, the background component of the tenth pixel from the left of frame # n + 1 in FIG. 24 for the first shutter time / v after the shutter opens is B09 / v. The background components of the eleventh pixel from the left of frame # n + 1 in FIG. 24 for the first and second shutter time / v after the shutter opens are B10 / v. The background components of the twelfth pixel from the left of frame # n + 1 in FIG. 24 for the first through third shutter time / v after the shutter opens are B11 / v.
[0174]
In frame # n + 1 in FIG. 24, the tenth through twelfth pixels from the left correspond to the mixed area, which is a covered background area.
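The component assignments described for FIG. 24 can be summarized by a small lookup. The sketch below is inferred from the description above and is an assumption, not part of the specification: the function name `fig24_component`, the indexing rule (F12 as the leading foreground component, background index equal to the pixel position minus one), and the 1-based arguments are all hypothetical. It is only meant for the three frames and twelve pixels of the figure.

```python
def fig24_component(frame_offset, pixel, period, v=4):
    """Label of the component at a 1-based pixel and shutter time / v period.

    frame_offset: 0 for frame #n-1, 1 for frame #n, 2 for frame #n+1.
    The foreground moves right by one pixel per period (covered background).
    """
    t = v * frame_offset + (period - 1)  # periods elapsed since frame #n-1 opened
    k = 11 - t + pixel                   # foreground index that reaches this pixel
    if k <= 12:
        return f"F{k:02d}"               # foreground has arrived at this pixel
    return f"B{pixel - 1:02d}"           # background is still visible

# Sixth pixel of frame #n: one background period, then three foreground periods.
sixth_pixel_frame_n = [fig24_component(1, 6, p) for p in range(1, 5)]
```

For the sixth pixel of frame #n this yields B05, F12, F11, F10, which agrees with the description of the covered background area above.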
[0175]
FIG. 25 is a model diagram of an image obtained by extracting the foreground components from the pixel values shown in FIG. 24.
[0176]
FIG. 26 shows pixels arranged adjacently in a row in three frames of an image obtained by capturing a foreground corresponding to an object moving to the right in the drawing together with a stationary background, and is a model diagram in which the pixel values of the pixels at the same positions on the frames are expanded in the time direction. The model shown in FIG. 26 includes an uncovered background area.
[0177]
In FIG. 26, it can be assumed that the object corresponding to the foreground is a rigid body and is moving at a constant speed. Since the object corresponding to the foreground is moved so as to be displayed on the right side by four pixels in the next frame, the motion amount v is 4.
[0178]
For example, the foreground component of the leftmost pixel of frame # n-1 in FIG. 26 for the first shutter time / v after the shutter opens is F13 / v, and the foreground component of the second pixel from the left in FIG. 26 for the second shutter time / v after the shutter opens is also F13 / v. The foreground component of the third pixel from the left in FIG. 26 for the third shutter time / v after the shutter opens and the foreground component of the fourth pixel from the left in FIG. 26 for the fourth shutter time / v after the shutter opens are F13 / v.
[0179]
The foreground component of the second pixel from the left of frame # n-1 in FIG. 26 for the first shutter time / v after the shutter opens is F14 / v, and the foreground component of the third pixel from the left in FIG. 26 for the second shutter time / v after the shutter opens is also F14 / v. The foreground component of the third pixel from the left in FIG. 26 for the first shutter time / v after the shutter opens is F15 / v.
[0180]
Since the object corresponding to the background is stationary, the background components of the leftmost pixel of frame # n-1 in FIG. 26 for the second through fourth shutter time / v after the shutter opens are B25 / v. The background components of the second pixel from the left of frame # n-1 in FIG. 26 for the third and fourth shutter time / v after the shutter opens are B26 / v. The background component of the third pixel from the left of frame # n-1 in FIG. 26 for the fourth shutter time / v after the shutter opens is B27 / v.
[0181]
In frame # n-1 in FIG. 26, the leftmost pixel through the third pixel belong to the mixed area, which is an uncovered background area.
[0182]
The fourth through twelfth pixels from the left of frame # n-1 in FIG. 26 belong to the foreground area. The foreground component of the shutter time / v in the foreground area of frame # n-1 is any one of F13 / v to F24 / v.
[0183]
The leftmost pixel through the fourth pixel from the left in frame #n in FIG. 26 belong to the background area, and the pixel values are B25 through B28, respectively.
[0184]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image moves so as to be displayed four pixels to the right in the next frame, the foreground component of the fifth pixel from the left of frame #n in FIG. 26 for the first shutter time / v after the shutter opens is F13 / v, and the foreground component of the sixth pixel from the left in FIG. 26 for the second shutter time / v after the shutter opens is also F13 / v. The foreground component of the seventh pixel from the left in FIG. 26 for the third shutter time / v after the shutter opens and the foreground component of the eighth pixel from the left in FIG. 26 for the fourth shutter time / v after the shutter opens are F13 / v.
[0185]
The foreground component of the sixth pixel from the left of frame #n in FIG. 26 for the first shutter time / v after the shutter opens is F14 / v, and the foreground component of the seventh pixel from the left in FIG. 26 for the second shutter time / v after the shutter opens is also F14 / v. The foreground component of the eighth pixel from the left in FIG. 26 for the first shutter time / v after the shutter opens is F15 / v.
[0186]
Since the object corresponding to the background is stationary, the background components of the fifth pixel from the left of frame #n in FIG. 26 corresponding to the second to fourth shutter time / v from when the shutter has opened are B29 / v. The background component of the sixth pixel from the left of frame #n in FIG. 26 corresponding to the third and fourth shutter time / v from when the shutter has opened is B30 / v. The background component of the seventh pixel from the left of frame #n in FIG. 26 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is B31 / v.
[0187]
In frame #n in FIG. 26, the fifth through seventh pixels from the left belong to the mixed area, which is an uncovered background area.
[0188]
The eighth through twelfth pixels from the left of frame #n in FIG. 26 belong to the foreground area. The value corresponding to the period of the shutter time / v in the foreground area of frame #n is any one of F13 / v to F20 / v.
[0189]
The leftmost pixel through the eighth pixel from the left of frame # n + 1 in FIG. 26 belong to the background area, and the pixel values thereof are B25 through B32, respectively.
[0190]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image moves so as to be displayed four pixels to the right in the next frame, the foreground component of the ninth pixel from the left of frame # n + 1 in FIG. 26 for the first shutter time / v after the shutter opens is F13 / v, and the foreground component of the tenth pixel from the left in FIG. 26 for the second shutter time / v after the shutter opens is also F13 / v. The foreground component of the eleventh pixel from the left in FIG. 26 for the third shutter time / v after the shutter opens and the foreground component of the twelfth pixel from the left in FIG. 26 for the fourth shutter time / v after the shutter opens are F13 / v.
[0191]
The foreground component of the tenth pixel from the left of frame # n + 1 in FIG. 26 for the first shutter time / v after the shutter opens is F14 / v, and the foreground component of the eleventh pixel from the left in FIG. 26 for the second shutter time / v after the shutter opens is also F14 / v. The foreground component of the twelfth pixel from the left in FIG. 26 for the first shutter time / v after the shutter opens is F15 / v.
[0192]
Since the object corresponding to the background is stationary, the background components of the ninth pixel from the left of frame # n + 1 in FIG. 26 for the second through fourth shutter time / v after the shutter opens are B33 / v. The background components of the tenth pixel from the left of frame # n + 1 in FIG. 26 for the third and fourth shutter time / v after the shutter opens are B34 / v. The background component of the eleventh pixel from the left of frame # n + 1 in FIG. 26 for the fourth shutter time / v after the shutter opens is B35 / v.
[0193]
In frame # n + 1 in FIG. 26, the ninth through eleventh pixels from the left belong to the mixed area, which is an uncovered background area.
[0194]
The twelfth pixel from the left of frame # n + 1 in FIG. 26 belongs to the foreground area. The foreground component of the shutter time / v in the foreground area of frame # n + 1 is any one of F13 / v to F16 / v.
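The component assignments described for FIG. 26 can be summarized in the same way as a lookup for the uncovered case. The sketch below is inferred from the description above and is an assumption: the function name `fig26_component`, the indexing rule (F13 as the trailing foreground component, background index equal to the pixel position plus 24), and the 1-based arguments are hypothetical. It is only meant for the three frames and twelve pixels of the figure.

```python
def fig26_component(frame_offset, pixel, period, v=4):
    """Label of the component at a 1-based pixel and shutter time / v period.

    frame_offset: 0 for frame #n-1, 1 for frame #n, 2 for frame #n+1.
    The foreground moves right by one pixel per period (uncovered background).
    """
    t = v * frame_offset + (period - 1)  # periods elapsed since frame #n-1 opened
    k = 12 + pixel - t                   # foreground index located at this pixel
    if k >= 13:
        return f"F{k:02d}"               # foreground has not yet left this pixel
    return f"B{24 + pixel:02d}"          # foreground has moved on, background shows

# Fifth pixel of frame #n: one foreground period, then three background periods.
fifth_pixel_frame_n = [fig26_component(1, 5, p) for p in range(1, 5)]
```

For the fifth pixel of frame #n this yields F13, B29, B29, B29, which agrees with the description of the uncovered background area above.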
[0195]
FIG. 27 is a model diagram of an image obtained by extracting the foreground components from the pixel values shown in FIG. 26.
[0196]
FIG. 28 is a diagram illustrating the correspondence between an image divided into pixels belonging to the foreground area, the background area, the covered background area, and the uncovered background area, and a model diagram in which the pixel values of the pixels are expanded in the time direction.
[0197]
As shown in FIG. 28, the area specifying unit 103 specifies the foreground area, the background area, the covered background area, and the uncovered background area of the input image.
[0198]
FIG. 29 is a diagram illustrating the correspondence between an input image separated into an image of the foreground area, an image of the background area, a foreground component image of the covered background area, a background component image of the covered background area, a foreground component image of the uncovered background area, and a background component image of the uncovered background area, and a model diagram in which the pixel values of the pixels are expanded in the time direction.
[0199]
As shown in FIG. 29, the foreground area, the background area, the covered background area, and the uncovered background area of the input image are specified by the area specifying unit 103. Based on the specified foreground area, background area, covered background area, and uncovered background area, and on the mixture ratio α detected by the mixture ratio calculation unit 104, the foreground / background separation unit 105 separates the input image into a foreground component image, consisting of the image of the foreground area, the foreground components of the covered background area, and the foreground components of the uncovered background area, and a background component image, consisting of the image of the background area, the background components of the covered background area, and the background components of the uncovered background area.
[0200]
The separated foreground component image and background component image are processed for each image.
[0201]
Based on the region information and the mixture ratio α, the foreground / background separation unit 105 may instead separate the input image into the foreground region image, the background region image, the foreground component image of the covered background region, the background component image of the covered background region, the foreground component image of the uncovered background region, and the background component image of the uncovered background region.
[0202]
FIG. 30 is a diagram illustrating an example of an image divided into a foreground area, a background area, and a mixed area. The area specifying unit 103 specifies the foreground area, the background area, and the mixed area of the input image. The image processing apparatus can divide the input image into an image in the foreground area, an image in the background area, and an image in the mixed area based on area information indicating the foreground area, the background area, and the mixed area.
[0203]
As shown in FIG. 31, based on the region information supplied from the region specifying unit 103 and the mixture ratio α supplied from the mixture ratio calculation unit 104, the foreground / background separation unit 105 separates the image of the mixed region into a foreground component image of the mixed region and a background component image of the mixed region.
[0204]
As shown in FIG. 32, the pixel value of the mixed region is corrected in the separated background component image, and the motion blur is removed from the separated foreground component image.
[0205]
As shown in FIG. 33, the input image is divided into regions and separated into foreground components and background components. The separated components of the input image are combined into a foreground component image and a background component image.
[0206]
Motion blur included in the foreground component image is removed. In the background component image, the pixel value corresponding to the mixed region is corrected.
[0207]
The foreground component image from which the motion blur is removed and the corrected background component image are individually processed.
[0208]
FIG. 34 is a flowchart illustrating image processing of the image processing apparatus according to the present invention.
[0209]
In step S101, the region specifying unit 103 specifies the foreground region, the background region, the covered background region, and the uncovered background region of the input image, based on the input image and on the motion vector and its position information supplied from the motion detection unit 102. Details of the area specifying process will be described later.
[0210]
In step S102, the mixture ratio calculation unit 104 calculates the mixture ratio α based on the input image and the region information supplied from the region specifying unit 103. Details of the process of calculating the mixture ratio α by the mixture ratio calculation unit 104 will be described later.
[0211]
In step S103, the foreground / background separation unit 105 separates the input image into a foreground component image composed of foreground components and a background component image composed of background components, based on the region information supplied from the region specifying unit 103 and the mixture ratio α supplied from the mixture ratio calculation unit 104. Details of the image separation processing of the foreground / background separation unit 105 will be described later.
[0212]
In step S104, the motion blur removal unit 106 removes motion blur from the foreground component image supplied from the foreground / background separation unit 105, based on the motion vector and its position information supplied from the motion detection unit 102 and the region information supplied from the region specifying unit 103.
[0213]
In step S105, the correction unit 107 corrects the pixel value corresponding to the mixed region of the background component image supplied from the foreground / background separation unit 105.
[0214]
In step S106, the motion blur removal image processing unit 108 performs image processing individually on the foreground component image from which motion blur has been removed and on the corrected background component image, and the processing ends. Details of the image processing executed by the motion blur removal image processing unit 108 will be described later.
[0215]
As described above, the image processing apparatus according to the present invention separates an input image into a foreground component image and a background component image, removes motion blur from the foreground component image, and executes image processing individually on the foreground component image from which motion blur has been removed and on the background component image.
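The order of steps S101 to S106 can be summarized in a short sketch. This is only an illustration of the control flow: the dictionary keys and the callables behind them are hypothetical stand-ins for the units 102 to 108, not interfaces defined in this description.

```python
from typing import Any, Callable, Dict


def process_frame(input_image: Any, units: Dict[str, Callable[..., Any]]) -> Any:
    """Run steps S101 to S106 in order.

    Every entry of `units` is a hypothetical stand-in for one unit of the
    apparatus; the key names are assumptions made for this sketch.
    """
    motion = units["detect_motion"](input_image)                   # motion detection unit 102
    region_info = units["specify_regions"](motion, input_image)    # S101, unit 103
    alpha = units["calc_mixture_ratio"](region_info, input_image)  # S102, unit 104
    fg, bg = units["separate"](input_image, region_info, alpha)    # S103, unit 105
    fg_deblurred = units["remove_blur"](fg, motion, region_info)   # S104, unit 106
    bg_corrected = units["correct"](bg)                            # S105, unit 107
    return units["process"](fg_deblurred, bg_corrected)            # S106, unit 108
```

Passing the units in as callables keeps the sketch independent of how each unit is actually realized.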
[0216]
Hereinafter, the configurations of the area specifying unit 103, the mixture ratio calculation unit 104, the foreground / background separation unit 105, the motion blur removal unit 106, and the motion blur removal image processing unit 108 will be described.
[0217]
FIG. 35 is a block diagram illustrating an example of the configuration of the area specifying unit 103. The area specifying unit 103 having the configuration shown in FIG. 35 does not use a motion vector. The frame memory 201 stores the input image in units of frames. When the processing target is frame #n, the frame memory 201 stores frame #n-2, which is the frame two frames before frame #n; frame #n-1, which is the frame immediately before frame #n; frame #n; frame #n+1, which is the frame immediately after frame #n; and frame #n+2, which is the frame two frames after frame #n.
[0218]
The static motion determination unit 202-1 reads from the frame memory 201 the pixel value of the pixel of frame #n+2 and the pixel value of the pixel of frame #n+1 that are at the same image position as the pixel of frame #n targeted for region specification, and calculates the absolute value of the difference between the read pixel values. The static motion determination unit 202-1 determines whether the absolute value of the difference between the pixel value of frame #n+2 and the pixel value of frame #n+1 is greater than a preset threshold Th. When it determines that the absolute value of the difference is greater than the threshold Th, it supplies a static motion determination indicating motion to the region determination unit 203-1. When it determines that the absolute value of the difference is equal to or less than the threshold Th, it supplies a static motion determination indicating stillness to the region determination unit 203-1.
[0219]
The static motion determination unit 202-2 reads from the frame memory 201 the pixel value of the pixel of frame #n+1 at the same image position as the pixel of frame #n targeted for region specification, and the pixel value of that target pixel of frame #n, and calculates the absolute value of the difference between the pixel values. The static motion determination unit 202-2 determines whether the absolute value of the difference between the pixel value of frame #n+1 and the pixel value of frame #n is greater than a preset threshold Th. When it determines that the absolute value of the difference is greater than the threshold Th, it supplies a static motion determination indicating motion to the region determination unit 203-1 and the region determination unit 203-2. When it determines that the absolute value of the difference is equal to or less than the threshold Th, it supplies a static motion determination indicating stillness to the region determination unit 203-1 and the region determination unit 203-2.
[0220]
The static motion determination unit 202-3 reads from the frame memory 201 the pixel value of the pixel of frame #n targeted for region specification and the pixel value of the pixel of frame #n-1 at the same image position as that target pixel, and calculates the absolute value of the difference between the pixel values. The static motion determination unit 202-3 determines whether the absolute value of the difference between the pixel value of frame #n and the pixel value of frame #n-1 is greater than a preset threshold Th. When it determines that the absolute value of the difference is greater than the threshold Th, it supplies a static motion determination indicating motion to the region determination unit 203-2 and the region determination unit 203-3. When it determines that the absolute value of the difference is equal to or less than the threshold Th, it supplies a static motion determination indicating stillness to the region determination unit 203-2 and the region determination unit 203-3.
[0221]
The static motion determination unit 202-4 reads from the frame memory 201 the pixel value of the pixel of frame #n-1 and the pixel value of the pixel of frame #n-2 that are at the same image position as the pixel of frame #n targeted for region specification, and calculates the absolute value of the difference between the pixel values. The static motion determination unit 202-4 determines whether the absolute value of the difference between the pixel value of frame #n-1 and the pixel value of frame #n-2 is greater than a preset threshold Th. When it determines that the absolute value of the difference is greater than the threshold Th, it supplies a static motion determination indicating motion to the region determination unit 203-3. When it determines that the absolute value of the difference is equal to or less than the threshold Th, it supplies a static motion determination indicating stillness to the region determination unit 203-3.
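The four static motion determinations share one rule: a frame pair is judged as motion when the absolute difference of the two pixel values exceeds Th, and as stillness otherwise. A minimal sketch follows; the function names and the dictionary keyed by frame offset are assumptions made for illustration.

```python
from typing import Dict


def static_motion_determination(value_a: float, value_b: float, th: float) -> str:
    """Return "motion" if |value_a - value_b| > Th, otherwise "still"."""
    return "motion" if abs(value_a - value_b) > th else "still"


def four_determinations(p: Dict[int, float], th: float) -> Dict[str, str]:
    """Apply the rule to the four adjacent frame pairs.

    `p` maps a frame offset (-2 .. +2, relative to frame #n) to the pixel
    value at one fixed image position. Keys of the result name the units.
    """
    return {
        "202-1": static_motion_determination(p[2], p[1], th),    # #n+2 vs #n+1
        "202-2": static_motion_determination(p[1], p[0], th),    # #n+1 vs #n
        "202-3": static_motion_determination(p[0], p[-1], th),   # #n   vs #n-1
        "202-4": static_motion_determination(p[-1], p[-2], th),  # #n-1 vs #n-2
    }
```

A difference exactly equal to Th counts as stillness, matching the "equal to or less than" wording above.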
[0222]
When the static motion determination supplied from the static motion determination unit 202-1 indicates stillness and the static motion determination supplied from the static motion determination unit 202-2 indicates motion, the region determination unit 203-1 determines that the pixel of frame #n targeted for region specification belongs to the uncovered background region, and sets the uncovered background region determination flag corresponding to that pixel to “1”, indicating that the pixel belongs to the uncovered background region.
[0223]
When the static motion determination supplied from the static motion determination unit 202-1 indicates motion, or the static motion determination supplied from the static motion determination unit 202-2 indicates stillness, the region determination unit 203-1 determines that the pixel of frame #n targeted for region specification does not belong to the uncovered background region, and sets the uncovered background region determination flag corresponding to that pixel to “0”, indicating that the pixel does not belong to the uncovered background region.
[0224]
The area determination unit 203-1 supplies the uncovered background area determination flag in which “1” or “0” is set as described above to the determination flag storage frame memory 204.
[0225]
When the static motion determination supplied from the static motion determination unit 202-2 indicates stillness and the static motion determination supplied from the static motion determination unit 202-3 indicates stillness, the region determination unit 203-2 determines that the pixel of frame #n targeted for region specification belongs to the still region, and sets the still region determination flag corresponding to that pixel to “1”, indicating that the pixel belongs to the still region.
[0226]
When the static motion determination supplied from the static motion determination unit 202-2 indicates motion, or the static motion determination supplied from the static motion determination unit 202-3 indicates motion, the region determination unit 203-2 determines that the pixel of frame #n targeted for region specification does not belong to the still region, and sets the still region determination flag corresponding to that pixel to “0”, indicating that the pixel does not belong to the still region.
[0227]
The region determination unit 203-2 supplies the still region determination flag in which “1” or “0” is set as described above to the determination flag storage frame memory 204.
[0228]
When the static motion determination supplied from the static motion determination unit 202-2 indicates motion and the static motion determination supplied from the static motion determination unit 202-3 indicates motion, the region determination unit 203-2 determines that the pixel of frame #n targeted for region specification belongs to the motion region, and sets the motion region determination flag corresponding to that pixel to “1”, indicating that the pixel belongs to the motion region.
[0229]
When the static motion determination supplied from the static motion determination unit 202-2 indicates stillness, or the static motion determination supplied from the static motion determination unit 202-3 indicates stillness, the region determination unit 203-2 determines that the pixel of frame #n targeted for region specification does not belong to the motion region, and sets the motion region determination flag corresponding to that pixel to “0”, indicating that the pixel does not belong to the motion region.
[0230]
The region determination unit 203-2 supplies the motion region determination flag set to “1” or “0” to the determination flag storage frame memory 204 in this way.
[0231]
When the static motion determination supplied from the static motion determination unit 202-3 indicates motion and the static motion determination supplied from the static motion determination unit 202-4 indicates stillness, the region determination unit 203-3 determines that the pixel of frame #n targeted for region specification belongs to the covered background region, and sets the covered background region determination flag corresponding to that pixel to “1”, indicating that the pixel belongs to the covered background region.
[0232]
When the static motion determination supplied from the static motion determination unit 202-3 indicates stillness, or the static motion determination supplied from the static motion determination unit 202-4 indicates motion, the region determination unit 203-3 determines that the pixel of frame #n targeted for region specification does not belong to the covered background region, and sets the covered background region determination flag corresponding to that pixel to “0”, indicating that the pixel does not belong to the covered background region.
[0233]
The area determination unit 203-3 supplies the covered background area determination flag set to “1” or “0” to the determination flag storage frame memory 204 in this way.
[0234]
The determination flag storage frame memory 204 stores the uncovered background region determination flag supplied from the region determination unit 203-1, the still region determination flag supplied from the region determination unit 203-2, the motion region determination flag supplied from the region determination unit 203-2, and the covered background region determination flag supplied from the region determination unit 203-3.
[0235]
The determination flag storage frame memory 204 supplies the stored uncovered background area determination flag, still area determination flag, motion area determination flag, and covered background area determination flag to the synthesis unit 205. Based on the uncovered background area determination flag, the still area determination flag, the motion area determination flag, and the covered background area determination flag supplied from the determination flag storage frame memory 204, the synthesis unit 205 generates area information indicating that each pixel belongs to one of the uncovered background area, the still area, the motion area, and the covered background area, and supplies it to the determination flag storage frame memory 206.
[0236]
The determination flag storage frame memory 206 stores the area information supplied from the synthesis unit 205 and outputs the stored area information.
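The flag logic of the region determination units 203-1 to 203-3 and the combining step of the synthesis unit 205 might be sketched as follows. The flag conditions follow the text. The priority order used in `synthesize` when several flags are "1" at once is an assumption of this sketch, since the description here does not state how overlapping flags are resolved, and all names are hypothetical.

```python
from typing import Dict, Optional


def region_flags(d1: str, d2: str, d3: str, d4: str) -> Dict[str, int]:
    """Compute the four determination flags for one pixel.

    d1..d4 are the outputs ("still" or "motion") of the static motion
    determination units 202-1..202-4.
    """
    return {
        "uncovered": int(d1 == "still" and d2 == "motion"),  # unit 203-1
        "still": int(d2 == "still" and d3 == "still"),       # unit 203-2
        "motion": int(d2 == "motion" and d3 == "motion"),    # unit 203-2
        "covered": int(d3 == "motion" and d4 == "still"),    # unit 203-3
    }


def synthesize(flags: Dict[str, int]) -> Optional[str]:
    """Pick one area label per pixel (role of the synthesis unit 205).

    ASSUMPTION: mixed-area flags take precedence over the motion flag, and
    "covered" is checked before "uncovered". Returns None if no flag is set.
    """
    for name in ("covered", "uncovered", "still", "motion"):
        if flags[name]:
            return name
    return None
```

Only the covered, uncovered, and motion flags can be set together, because each of them excludes the still flag through the shared determinations of units 202-2 and 202-3.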
[0237]
Next, an example of processing of the area specifying unit 103 will be described with reference to FIGS.
[0238]
When the object corresponding to the foreground is moving, the position on the screen of the image corresponding to the object changes from frame to frame. As shown in FIG. 36, the image corresponding to the object, located at the position indicated by Yn(x, y) in frame #n, is located at Yn+1(x, y) in frame #n+1, which is the next frame.
[0239]
FIG. 37 shows a model diagram in which the pixel values of pixels arranged adjacently in a single row along the direction of motion of the image corresponding to the foreground object are expanded in the time direction. For example, when the motion direction of the image corresponding to the foreground object is horizontal with respect to the screen, the model diagram in FIG. 37 shows a model in which pixel values of adjacent pixels on one line are expanded in the time direction.
[0240]
In FIG. 37, the line in frame #n is the same as the line in frame # n + 1.
[0241]
Foreground components corresponding to the objects included in the second through thirteenth pixels from the left in frame #n are included in the sixth through seventeenth pixels from the left in frame # n + 1.
[0242]
In frame #n, the pixels belonging to the covered background area are the 11th to 13th pixels from the left, and the pixels belonging to the uncovered background area are the 2nd to 4th pixels from the left. In frame # n + 1, the pixels belonging to the covered background area are the 15th to 17th pixels from the left, and the pixels belonging to the uncovered background area are the 6th to 8th pixels from the left.
[0243]
In the example shown in FIG. 37, the foreground component included in frame #n has moved four pixels in frame # n + 1, so the amount of motion v is four. The virtual division number corresponds to the motion amount v and is 4.
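The pixel positions quoted for FIG. 37 are consistent with every range shifting by the motion amount v between frame #n and frame #n+1. The small check below uses only the numbers given in the text; the helper name and the dictionary layout are assumptions made for illustration.

```python
from typing import Dict, Tuple

Range = Tuple[int, int]  # (first pixel, last pixel), counted from the left


def shift_range(pixels: Range, v: int) -> Range:
    """Shift a pixel range by the motion amount v."""
    first, last = pixels
    return (first + v, last + v)


# Ranges in frame #n as stated for FIG. 37.
frame_n: Dict[str, Range] = {
    "foreground components": (2, 13),
    "covered": (11, 13),
    "uncovered": (2, 4),
}
v = 4  # motion amount in the FIG. 37 example
frame_n_plus_1 = {name: shift_range(r, v) for name, r in frame_n.items()}
```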
[0244]
Next, changes in pixel values of pixels belonging to the mixed region before and after the frame of interest will be described.
[0245]
In the frame #n shown in FIG. 38 where the background is stationary and the foreground motion amount v is 4, the pixels belonging to the covered background area are the 15th to 17th pixels from the left. Since the motion amount v is 4, in the previous frame # n−1, the fifteenth through seventeenth pixels from the left include only background components and belong to the background area. In frame # n-2, the fifteenth through seventeenth pixels from the left include only background components and belong to the background area.
[0246]
Here, since the object corresponding to the background is stationary, the pixel value of the 15th pixel from the left in frame #n-1 does not change from the pixel value of the 15th pixel from the left in frame #n-2. Similarly, the pixel value of the 16th pixel from the left in frame #n-1 does not change from the pixel value of the 16th pixel from the left in frame #n-2, and the pixel value of the 17th pixel from the left in frame #n-1 does not change from the pixel value of the 17th pixel from the left in frame #n-2.
[0247]
That is, the pixels of frame #n-1 and frame #n-2 corresponding to the pixels belonging to the covered background area in frame #n consist only of background components and their pixel values do not change, so the absolute value of the difference is almost zero. Therefore, the static motion determination unit 202-4 determines the pixels of frame #n-1 and frame #n-2 corresponding to the pixels belonging to the mixed region in frame #n to be still.
[0248]
Since the pixels belonging to the covered background area in frame #n include the foreground components, the pixel values are different from the case of only the background components in frame # n-1. Therefore, the static motion determination for the pixels belonging to the mixed region in frame #n and the corresponding pixels in frame # n-1 is determined as motion by the static motion determination unit 202-3.
[0249]
As described above, when the region determination unit 203-3 is supplied with a static motion determination result indicating motion from the static motion determination unit 202-3 and with a static motion determination result indicating stillness from the static motion determination unit 202-4, it determines that the corresponding pixel belongs to the covered background area.
[0250]
In frame #n shown in FIG. 39 in which the background is stationary and the foreground motion amount v is 4, the pixels included in the uncovered background area are the second through fourth pixels from the left. Since the motion amount v is 4, in the next frame # n + 1, the second through fourth pixels from the left include only background components and belong to the background area. Further, in the next frame # n + 2, the second through fourth pixels from the left include only background components and belong to the background area.
[0251]
Here, since the object corresponding to the background is stationary, the pixel value of the second pixel from the left in frame #n+2 does not change from the pixel value of the second pixel from the left in frame #n+1. Similarly, the pixel value of the third pixel from the left in frame #n+2 does not change from the pixel value of the third pixel from the left in frame #n+1, and the pixel value of the fourth pixel from the left in frame #n+2 does not change from the pixel value of the fourth pixel from the left in frame #n+1.
[0252]
That is, the pixels of frame #n+1 and frame #n+2 corresponding to the pixels belonging to the uncovered background area in frame #n consist only of background components and their pixel values do not change, so the absolute value of the difference is almost zero. Therefore, the static motion determination unit 202-1 determines the pixels of frame #n+1 and frame #n+2 corresponding to the pixels belonging to the mixed region in frame #n to be still.
[0253]
Since the pixels belonging to the uncovered background area in frame #n include the foreground components, the pixel values are different from the case of only the background components in frame # n + 1. Therefore, the static motion determination for the pixels belonging to the mixed region in frame #n and the corresponding pixels in frame # n + 1 is determined as motion by the static motion determination unit 202-2.
[0254]
As described above, when the region determination unit 203-1 is supplied with a static motion determination result indicating motion from the static motion determination unit 202-2 and with a static motion determination result indicating stillness from the static motion determination unit 202-1, it determines that the corresponding pixel belongs to the uncovered background area.
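The behaviour described for the covered and uncovered background areas can be reproduced with a toy one-line model in which each pixel value is the average over v virtual divisions of the shutter time. The sketch below is a simplification under stated assumptions: a uniform foreground value of 200, a uniform stationary background value of 50, and an object nine pixels wide, chosen so that its components span the 2nd to 13th pixels in frame #n as in the FIG. 37 example. The function name and parameters are hypothetical.

```python
from typing import Dict, List


def render_line(width: int, start: int, length: int, v: int,
                fg: float = 200.0, bg: float = 50.0) -> List[float]:
    """Render one line of pixels (1-indexed positions 1..width).

    During virtual division k (0 .. v-1) the object occupies the pixels
    start+k .. start+k+length-1. Each pixel value is the average of the
    foreground or background value seen in the v divisions.
    """
    line = []
    for x in range(1, width + 1):
        hits = sum(1 for k in range(v) if start + k <= x <= start + k + length - 1)
        line.append((hits * fg + (v - hits) * bg) / v)
    return line


# Frames #n-2 .. #n+2: the object advances v = 4 pixels per frame.
frames: Dict[int, List[float]] = {
    t: render_line(width=20, start=2 + 4 * t, length=9, v=4) for t in range(-2, 3)
}
```

In this model the 11th to 13th pixels are mixed in frame #n and pure background in frames #n-1 and #n-2, while the 2nd to 4th pixels are mixed in frame #n and pure background in frames #n+1 and #n+2.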
[0255]
FIG. 40 is a diagram illustrating the determination conditions of the area specifying unit 103 in frame #n. When the pixel of frame #n-2 and the pixel of frame #n-1, both at the same image position as the pixel of frame #n to be determined, are determined to be still, and the pixel of frame #n-1 at that position and the pixel of frame #n are determined to be moving, the area specifying unit 103 determines that the pixel of frame #n to be determined belongs to the covered background area.
[0256]
When the pixel of frame #n-1 at the same image position as the pixel of frame #n to be determined and the pixel of frame #n are determined to be still, and the pixel of frame #n and the pixel of frame #n+1 at that position are determined to be still, the area specifying unit 103 determines that the pixel of frame #n to be determined belongs to the still region.
[0257]
When the pixel of frame #n-1 at the same image position as the pixel of frame #n to be determined and the pixel of frame #n are determined to be moving, and the pixel of frame #n and the pixel of frame #n+1 at that position are determined to be moving, the area specifying unit 103 determines that the pixel of frame #n to be determined belongs to the motion region.
[0258]
When the pixel of frame #n and the pixel of frame #n+1 at the same image position as the pixel of frame #n to be determined are determined to be moving, and the pixel of frame #n+1 and the pixel of frame #n+2 at that position are determined to be still, the area specifying unit 103 determines that the pixel of frame #n to be determined belongs to the uncovered background area.
[0259]
FIG. 41 is a diagram illustrating an example of the result of specifying the area of the area specifying unit 103. In FIG. 41A, pixels that are determined to belong to the covered background area are displayed in white. In FIG. 41B, pixels that are determined to belong to the uncovered background area are displayed in white.
[0260]
In FIG. 41C, pixels that are determined to belong to the motion region are displayed in white. In FIG. 41D, the pixels determined to belong to the still area are displayed in white.
[0261]
FIG. 42 is a diagram illustrating, as an image, region information indicating a mixed region among region information output from the determination flag storage frame memory 206. In FIG. 42, the pixels determined to belong to the covered background area or the uncovered background area, that is, the pixels determined to belong to the mixed area are displayed in white. The area information indicating the mixed area output from the determination flag storage frame memory 206 indicates a mixed area and a portion having a texture surrounded by a portion having no texture in the foreground area.
[0262]
Next, the area specifying process of the area specifying unit 103 will be described with reference to the flowchart of FIG. In step S201, the frame memory 201 acquires images of frames # n-2 to # n + 2 including the frame #n to be determined.
[0263]
In step S202, the static motion determination unit 202-3 determines whether the pixel of frame #n-1 and the pixel at the same position of frame #n are still. If they are determined to be still, the process proceeds to step S203, in which the static motion determination unit 202-2 determines whether the pixel of frame #n and the pixel at the same position of frame #n+1 are still.
[0264]
If it is determined in step S203 that the pixel of frame #n and the pixel at the same position of frame #n+1 are still, the process proceeds to step S204, and the region determination unit 203-2 sets the still region determination flag corresponding to the pixel whose region is being determined to “1”, indicating that the pixel belongs to the still region. The region determination unit 203-2 supplies the still region determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S205.
[0265]
When it is determined in step S202 that the pixel in frame # n-1 and the pixel at the same position in frame #n are in motion, or in step S203, the pixel in frame #n and the same position in frame # n + 1 If the pixel is determined to be moving, the pixel in frame #n does not belong to the still region, so the process of step S204 is skipped, and the procedure proceeds to step S205.
[0266]
In step S205, the static motion determination unit 202-3 determines whether or not the pixel in frame # n-1 and the pixel at the same position in frame #n are in motion, and if it is determined as motion, the process proceeds to step S206. Then, the static motion determination unit 202-2 determines whether or not there is motion between the pixel of frame #n and the pixel at the same position of frame # n + 1.
[0267]
If it is determined in step S206 that the pixel of frame #n and the pixel at the same position of frame #n+1 are moving, the process proceeds to step S207, and the region determination unit 203-2 sets the motion region determination flag corresponding to the pixel whose region is being determined to “1”, indicating that the pixel belongs to the motion region. The region determination unit 203-2 supplies the motion region determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S208.
[0268]
If it is determined in step S205 that the pixel in frame # n-1 and the pixel in the same position in frame #n are still, or in step S206, the pixel in frame #n and the same position in frame # n + 1 If the current pixel is determined to be still, the pixel of frame #n does not belong to the motion region, so the process of step S207 is skipped, and the procedure proceeds to step S208.
[0269]
In step S208, the static motion determination unit 202-4 determines whether the pixel of frame #n-2 and the pixel at the same position of frame #n-1 are still. If they are determined to be still, the process proceeds to step S209, in which the static motion determination unit 202-3 determines whether there is motion between the pixel of frame #n-1 and the pixel at the same position of frame #n.
[0270]
If it is determined in step S209 that the pixel of frame #n-1 and the pixel at the same position of frame #n are moving, the process proceeds to step S210, and the region determination unit 203-3 sets the covered background area determination flag corresponding to the pixel whose region is being determined to “1”, indicating that the pixel belongs to the covered background area. The area determination unit 203-3 supplies the covered background area determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S211.
[0271]
If it is determined in step S208 that the pixel in frame # n-2 and the pixel in the same position in frame # n-1 are in motion, or in step S209, the pixel in frame # n-1 and the pixel in frame #n If it is determined that the pixel at the same position is still, the pixel of frame #n does not belong to the covered background area, so the process of step S210 is skipped, and the procedure proceeds to step S211.
[0272]
In step S211, the static motion determination unit 202-2 determines whether or not there is motion between the pixel in frame #n and the pixel at the same position in frame #n+1. If motion is determined, the process proceeds to step S212, and the static motion determination unit 202-1 determines whether or not the pixel in frame #n+1 and the pixel at the same position in frame #n+2 are still.
[0273]
If it is determined in step S212 that the pixel in frame #n+1 and the pixel at the same position in frame #n+2 are still, the process proceeds to step S213, and the region determination unit 203-1 sets “1”, indicating that the pixel belongs to the uncovered background area, in the uncovered background area determination flag corresponding to the pixel whose region is being determined. The region determination unit 203-1 supplies the uncovered background area determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S214.
[0274]
If it is determined in step S211 that the pixel in frame #n and the pixel at the same position in frame #n+1 are still, or if it is determined in step S212 that there is motion between the pixel in frame #n+1 and the pixel at the same position in frame #n+2, the pixel of frame #n does not belong to the uncovered background area, so the process of step S213 is skipped, and the procedure proceeds to step S214.
[0275]
In step S214, the area specifying unit 103 determines whether or not an area has been specified for all the pixels of frame #n. If it is determined that an area has not yet been specified for all the pixels of frame #n, the procedure returns to step S202 and the area specifying process is repeated for the other pixels.
[0276]
If it is determined in step S214 that an area has been specified for all the pixels of frame #n, the process proceeds to step S215. The synthesis unit 205 generates area information indicating the mixed area based on the uncovered background area determination flag and the covered background area determination flag stored in the determination flag storage frame memory 204, and further generates area information indicating that each pixel belongs to one of the uncovered background area, the stationary area, the motion area, and the covered background area. The synthesis unit 205 sets the generated area information in the determination flag storage frame memory 206, and the process ends.
[0277]
As described above, the region specifying unit 103 can generate region information indicating that each pixel included in the frame belongs to the motion region, the still region, the uncovered background region, or the covered background region.
[0278]
The area specifying unit 103 may also generate area information corresponding to the mixed area by applying a logical sum to the area information corresponding to the uncovered background area and the covered background area, and may generate, for each pixel included in the frame, area information consisting of a flag indicating that the pixel belongs to the motion area, the still area, or the mixed area.
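The per-pixel decisions of steps S205 through S215 can be sketched as follows. This is only an illustration under stated assumptions: the names `is_moving` and `classify_pixel`, the threshold `th`, and the use of an absolute frame difference as the still/motion test are hypothetical, and the still-region test is assumed by symmetry with the motion-region test rather than taken from the steps described here.

```python
def is_moving(a, b, th=10):
    """Assumed still/motion test: absolute difference above a threshold."""
    return abs(a - b) > th


def classify_pixel(p_m2, p_m1, p_0, p_p1, p_p2, th=10):
    """Return region flags for one pixel position of frame #n.

    Arguments are the pixel values at the same position in
    frames #n-2, #n-1, #n, #n+1 and #n+2.
    """
    m_21 = is_moving(p_m2, p_m1, th)  # frame #n-2 vs frame #n-1
    m_10 = is_moving(p_m1, p_0, th)   # frame #n-1 vs frame #n
    m_01 = is_moving(p_0, p_p1, th)   # frame #n   vs frame #n+1
    m_12 = is_moving(p_p1, p_p2, th)  # frame #n+1 vs frame #n+2

    flags = {
        "still": (not m_10) and (not m_01),  # assumed, by symmetry
        "motion": m_10 and m_01,             # steps S205 to S207
        "covered": (not m_21) and m_10,      # steps S208 to S210
        "uncovered": m_01 and (not m_12),    # steps S211 to S213
    }
    # Mixed area = logical sum of the covered and uncovered flags.
    flags["mixed"] = flags["covered"] or flags["uncovered"]
    return flags
```

For instance, `classify_pixel(50, 50, 120, 120, 120)` sets only the covered and mixed flags, because the pixel is still up to frame #n-1, changes at frame #n, and is still afterwards. The flags are returned separately because the text does not state how overlapping flags are prioritized.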
[0279]
When the object corresponding to the foreground has a texture, the area specifying unit 103 can specify the movement area more accurately.
[0280]
The area specifying unit 103 can output area information indicating a motion area as area information indicating a foreground area, and area information indicating a still area as area information indicating a background area.
[0281]
In addition, although the object corresponding to the background has been described as stationary, the above-described processing for specifying the region can be applied even if the image corresponding to the background region includes motion. For example, when the image corresponding to the background area is moving uniformly, the area specifying unit 103 shifts the entire image in accordance with this movement, and performs the same processing as when the object corresponding to the background is stationary. When the image corresponding to the background region includes different motion in each local area, the region specifying unit 103 selects the pixel corresponding to the motion and executes the above-described processing.
[0282]
FIG. 44 is a block diagram illustrating another example of the configuration of the area specifying unit 103. The area specifying unit 103 shown in FIG. 44 does not use a motion vector. The background image generation unit 301 generates a background image corresponding to the input image, and supplies the generated background image to the binary object image extraction unit 302. For example, the background image generation unit 301 extracts an image object corresponding to a background object included in the input image, and generates a background image.
[0283]
FIG. 45 shows an example of a model diagram in which the pixel values of pixels arranged adjacently in a line along the motion direction of the image corresponding to the foreground object are developed in the time direction. For example, when the motion direction of the image corresponding to the foreground object is horizontal with respect to the screen, the model diagram in FIG. 45 shows a model in which the pixel values of adjacent pixels on one line are developed in the time direction.
[0284]
In FIG. 45, the lines in frame #n are the same as the lines in frame # n−1 and frame # n + 1.
[0285]
In frame #n, the foreground components corresponding to the objects included in the sixth through seventeenth pixels from the left are included in the second through thirteenth pixels from the left in frame # n-1. In frame # n + 1, they are included in the 10th to 21st pixels from the left.
[0286]
In frame # n−1, the pixels belonging to the covered background area are the 11th to 13th pixels from the left, and the pixels belonging to the uncovered background area are the 2nd to 4th pixels from the left. In frame #n, the pixels belonging to the covered background area are the 15th to 17th pixels from the left, and the pixels belonging to the uncovered background area are the 6th to 8th pixels from the left. In frame # n + 1, the pixels belonging to the covered background area are the 19th to 21st pixels from the left, and the pixels belonging to the uncovered background area are the 10th to 12th pixels from the left.
[0287]
In frame # n−1, the pixels belonging to the background area are the first pixel from the left and the fourteenth through twenty-first pixels from the left. In frame #n, the pixels belonging to the background area are the first through fifth pixels from the left, and the eighteenth through twenty-first pixels from the left. In frame # n + 1, the pixels belonging to the background area are the first through ninth pixels from the left.
[0288]
FIG. 46 shows an example of a background image generated by the background image generation unit 301 and corresponding to the example of FIG. The background image is composed of pixels corresponding to the background object, and does not include image components corresponding to the foreground object.
[0289]
The binary object image extraction unit 302 generates a binary object image based on the correlation between the background image and the input image, and supplies the generated binary object image to the time change detection unit 303.
[0290]
FIG. 47 is a block diagram showing the configuration of the binary object image extraction unit 302. The correlation value calculation unit 321 calculates the correlation between the background image supplied from the background image generation unit 301 and the input image, generates a correlation value, and supplies the generated correlation value to the threshold processing unit 322.
[0291]
For example, the correlation value calculation unit 321 applies equation (4) to a 3 × 3 block in the background image centered on X4, as shown in FIG. 48 (A), and to the corresponding 3 × 3 block in the input image centered on Y4, as shown in FIG. 48 (B), and calculates the correlation value corresponding to Y4.
[0292]
[Expression 2]

[Equation 3]

[Expression 4]

[0293]
The correlation value calculation unit 321 supplies the correlation value calculated for each pixel in this way to the threshold processing unit 322.
[0294]
In addition, the correlation value calculation unit 321 may, for example, apply equation (7) to a 3 × 3 block in the background image centered on X4, as shown in FIG. 49 (A), and to the corresponding 3 × 3 block in the input image centered on Y4, as shown in FIG. 49 (B), and calculate the absolute difference value corresponding to Y4.
[0295]
[Equation 5]

[0296]
The correlation value calculation unit 321 supplies the difference absolute value calculated in this way to the threshold processing unit 322 as a correlation value.
[0297]
The threshold value processing unit 322 compares the pixel value of the correlation image with the threshold value th0. When the correlation value is equal to or less than the threshold value th0, the threshold value processing unit 322 sets the pixel value of the binary object image to 1; when the correlation value is greater than the threshold value th0, it sets the pixel value of the binary object image to 0. It then outputs the binary object image whose pixel values are set to 0 or 1. The threshold processing unit 322 may store the threshold th0 in advance, or may use the threshold th0 input from the outside.
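The correlation and threshold steps can be illustrated with the following sketch. Equations (4) through (6) are not reproduced in this text, so the sketch assumes a standard normalized cross-correlation over the nine values of each 3 × 3 block. The function names and the handling of flat blocks are hypothetical choices for the example, not part of the described apparatus.

```python
import math


def block_correlation(x, y):
    """Normalized cross-correlation of two equal-length blocks (e.g. 9 values).

    Assumption: this stands in for equation (4), which is not shown here.
    """
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    if den == 0.0:
        # Assumption: a flat block has no defined correlation; treat identical
        # blocks as fully correlated and anything else as uncorrelated.
        return 1.0 if list(x) == list(y) else 0.0
    return num / den


def binarize(correlation, th0):
    """Threshold rule from the text: correlation <= th0 gives 1, otherwise 0."""
    return 1 if correlation <= th0 else 0
```

A block of the input image that matches the background block yields a correlation near 1 and therefore a binary value of 0, while a block that differs strongly yields a low correlation and a binary value of 1.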
[0298]
FIG. 50 is a diagram illustrating an example of a binary object image corresponding to the model of the input image illustrated in FIG. In the binary object image, the pixel value is set to 0 for a pixel having a high correlation with the background image.
[0299]
FIG. 51 is a block diagram illustrating a configuration of the time change detection unit 303. When the area is determined for a pixel of frame #n, the frame memory 341 stores the binary object images of frame #n-1, frame #n, and frame #n+1 supplied from the binary object image extraction unit 302.
[0300]
The area determination unit 342 determines the area of each pixel of frame #n based on the binary object images of frame #n-1, frame #n, and frame #n+1 stored in the frame memory 341, generates area information, and outputs the generated area information.
[0301]
FIG. 52 is a diagram for explaining determination by the region determination unit 342. When the pixel of interest of the binary object image of frame #n is 0, the region determination unit 342 determines that the pixel of interest of frame #n belongs to the background region.
[0302]
When the pixel of interest of the binary object image of frame #n is 1, the corresponding pixel of the binary object image of frame #n-1 is 1, and the corresponding pixel of the binary object image of frame #n+1 is 1, the region determination unit 342 determines that the pixel of interest in frame #n belongs to the foreground region.
[0303]
When the pixel of interest of the binary object image of frame #n is 1 and the corresponding pixel of the binary object image of frame #n-1 is 0, the region determination unit 342 determines that the pixel of interest in frame #n belongs to the covered background area.
[0304]
When the pixel of interest of the binary object image of frame #n is 1 and the corresponding pixel of the binary object image of frame #n+1 is 0, the region determination unit 342 determines that the pixel of interest in frame #n belongs to the uncovered background area.
[0305]
FIG. 53 is a diagram illustrating an example of the determination made by the time change detection unit 303 on the binary object image corresponding to the input image model illustrated in FIG. Since the corresponding pixels of frame #n of the binary object image are 0, the time change detection unit 303 determines that the first to fifth pixels from the left of frame #n belong to the background area.
[0306]
Since the pixels of frame #n of the binary object image are 1 and the corresponding pixels of frame #n+1 are 0, the time change detection unit 303 determines that the sixth to ninth pixels from the left belong to the uncovered background area.
[0307]
Since the pixels of frame #n of the binary object image are 1, the corresponding pixels of frame #n-1 are 1, and the corresponding pixels of frame #n+1 are 1, the time change detection unit 303 determines that the tenth to thirteenth pixels from the left belong to the foreground area.
[0308]
Since the pixels of frame #n of the binary object image are 1 and the corresponding pixels of frame #n-1 are 0, the time change detection unit 303 determines that the 14th to 17th pixels from the left belong to the covered background area.
[0309]
The time change detection unit 303 determines that the 18th to 21st pixels from the left belong to the background area because the corresponding pixel of frame #n of the binary object image is 0.
[0310]
Next, the area specifying process of the area specifying unit 103 will be described with reference to the flowchart of FIG. In step S301, the background image generation unit 301 of the area specifying unit 103 generates a background image based on the input image, for example by extracting an image object corresponding to a background object included in the input image, and supplies the generated background image to the binary object image extraction unit 302.
[0311]
In step S302, the binary object image extraction unit 302 calculates a correlation value between the input image and the background image supplied from the background image generation unit 301, for example, by the calculation described with reference to FIG. In step S303, the binary object image extraction unit 302 calculates a binary object image from the correlation value and the threshold value th0, for example, by comparing the correlation value with the threshold value th0.
[0312]
In step S304, the time change detection unit 303 executes region determination processing, and the processing ends.
[0313]
Details of the area determination processing corresponding to step S304 will be described with reference to the flowchart of FIG. In step S321, the region determination unit 342 of the time change detection unit 303 determines whether or not the pixel of interest is 0 in frame #n stored in the frame memory 341. If it is determined that the pixel of interest is 0 in frame #n, the process proceeds to step S322, the pixel of interest in frame #n is set as belonging to the background area, and the process ends.
[0314]
If it is determined in step S321 that the pixel of interest is 1 in frame #n, the process proceeds to step S323, where the area determination unit 342 of the time change detection unit 303 determines whether or not the pixel of interest is 1 in frame #n stored in the frame memory 341 and the corresponding pixel is 0 in frame #n-1. If it is determined that the pixel of interest is 1 in frame #n and the corresponding pixel is 0 in frame #n-1, the process proceeds to step S324, the pixel of interest in frame #n is set as belonging to the covered background area, and the process ends.
[0315]
If it is determined in step S323 that the pixel of interest is 0 in frame #n or the corresponding pixel is 1 in frame #n-1, the process proceeds to step S325, where the area determination unit 342 of the time change detection unit 303 determines whether or not the pixel of interest is 1 in frame #n stored in the frame memory 341 and the corresponding pixel is 0 in frame #n+1. If it is determined that the pixel of interest is 1 in frame #n and the corresponding pixel is 0 in frame #n+1, the process proceeds to step S326, the pixel of interest in frame #n is set as belonging to the uncovered background area, and the process ends.
[0316]
If it is determined in step S325 that the pixel of interest is 0 in frame #n or the corresponding pixel is 1 in frame #n+1, the process proceeds to step S327, where the area determination unit 342 of the time change detection unit 303 sets the pixel of interest in frame #n as belonging to the foreground area, and the process ends.
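The decision order of steps S321 through S327 can be sketched as a short function. The function names and the string labels are hypothetical and chosen only for this illustration; the inputs are the binary object image values at the same position in frame #n-1, frame #n, and frame #n+1.

```python
def determine_region(prev_bit, cur_bit, next_bit):
    """Classify one pixel of frame #n from three binary object image values."""
    if cur_bit == 0:
        return "background"   # step S321 -> step S322
    if prev_bit == 0:
        return "covered"      # step S323 -> step S324
    if next_bit == 0:
        return "uncovered"    # step S325 -> step S326
    return "foreground"       # step S327


def determine_regions(prev_row, cur_row, next_row):
    """Apply determine_region to every pixel position of one line."""
    return [determine_region(p, c, n) for p, c, n in zip(prev_row, cur_row, next_row)]
```

For a foreground object that moves one pixel to the right per frame, the rows `[0,1,1,1,0,0]`, `[0,0,1,1,1,0]`, and `[0,0,0,1,1,1]` give an uncovered pixel on the trailing (left) edge and a covered pixel on the leading (right) edge of frame #n.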
[0317]
As described above, the area specifying unit 103 can specify, based on the correlation value between the input image and the corresponding background image, which of the foreground area, the background area, the covered background area, and the uncovered background area each pixel of the input image belongs to, and can generate region information corresponding to the specified result.
[0318]
FIG. 56 is a block diagram showing another configuration of the area specifying unit 103. The area specifying unit 103 shown in FIG. 56 uses the motion vector supplied from the motion detection unit 102 and its position information. The same parts as those shown in FIG. 44 are denoted by the same reference numerals, and the description thereof is omitted.
[0319]
The robust unit 361 generates a robust binary object image based on the binary object images of N frames supplied from the binary object image extraction unit 302, and outputs it to the time change detection unit 303.
[0320]
FIG. 57 is a block diagram illustrating the configuration of the robust unit 361. The motion compensation unit 381 compensates for the motion of the binary object image of N frames based on the motion vector supplied from the motion detection unit 102 and its position information, and obtains a binary object image with motion compensated. Output to the switch 382.
[0321]
The motion compensation of the motion compensation unit 381 will be described with reference to the examples of FIGS. For example, when the region of frame #n is to be determined and the binary object images of frame #n-1, frame #n, and frame #n+1 shown in FIG. 58 are input, the motion compensation unit 381 performs motion compensation, based on the motion vector supplied from the motion detection unit 102, on the binary object image of frame #n-1 and the binary object image of frame #n+1 as shown in FIG. It then supplies the motion-compensated binary object images to the switch 382.
[0322]
The switch 382 outputs the motion-compensated binary object image of the first frame to the frame memory 383-1, and outputs the motion-compensated binary object image of the second frame to the frame memory 383-2. Similarly, the switch 382 outputs each of the motion-compensated binary object images of the third to (N-1)th frames to the corresponding one of the frame memories 383-3 to 383-(N-1), and outputs the motion-compensated binary object image of the Nth frame to the frame memory 383-N.
[0323]
The frame memory 383-1 stores the binary object image for which motion compensation has been performed for the first frame, and outputs the stored binary object image to the weighting unit 384-1. The frame memory 383-2 stores the binary object image with motion compensation of the second frame, and outputs the stored binary object image to the weighting unit 384-2.
[0324]
Similarly, each of the frame memories 383-3 to 383-(N-1) stores one of the motion-compensated binary object images of the third to (N-1)th frames, and outputs the stored binary object image to the corresponding one of the weighting units 384-3 to 384-(N-1). The frame memory 383-N stores the motion-compensated binary object image of the Nth frame, and outputs the stored binary object image to the weighting unit 384-N.
[0325]
The weighting unit 384-1 multiplies the pixel value of the motion-compensated binary object image of the first frame supplied from the frame memory 383-1 by a predetermined weight w1 and supplies the result to the integrating unit 385. The weighting unit 384-2 multiplies the pixel value of the motion compensated binary object image of the second frame supplied from the frame memory 383-2 by a predetermined weight w2, and supplies the result to the integrating unit 385.
[0326]
Similarly, each of the weighting units 384-3 to 384-(N-1) multiplies the pixel value of the motion-compensated binary object image of one of the third to (N-1)th frames, supplied from the corresponding one of the frame memories 383-3 to 383-(N-1), by one of the predetermined weights w3 to w(N-1), and supplies the result to the integrating unit 385. The weighting unit 384-N multiplies the pixel value of the motion-compensated binary object image of the Nth frame supplied from the frame memory 383-N by a predetermined weight wN and supplies the result to the integrating unit 385.
[0327]
The integrating unit 385 accumulates the corresponding pixel values of the motion-compensated binary object images of the first to Nth frames, each multiplied by one of the weights w1 to wN, and generates a binary object image by comparing the accumulated pixel values with a predetermined threshold value th0.
[0328]
In this way, the robust unit 361 generates a robust binary object image from the N binary object images and supplies it to the time change detection unit 303. Therefore, the area specifying unit 103 having this configuration can specify the area more accurately than in the case shown in FIG. 44, even if the input image contains noise.
[0329]
Next, the area specifying process of the area specifying unit 103 shown in FIG. 56 will be described with reference to the flowchart of FIG. The processes in steps S341 through S343 are the same as those in steps S301 through S303 described with reference to the flowchart of FIG.
[0330]
In step S344, the robust unit 361 executes a robust process.
[0331]
In step S345, the time change detection unit 303 executes region determination processing, and the processing ends. Details of the processing in step S345 are the same as the processing described with reference to the flowchart in FIG.
[0332]
Next, details of the robust processing corresponding to the processing in step S344 in FIG. 60 will be described with reference to the flowchart in FIG. In step S361, the motion compensation unit 381 performs motion compensation processing on the input binary object image based on the motion vector supplied from the motion detection unit 102 and its position information. In step S362, any of the frame memories 383-1 to 383-N stores the motion compensated binary object image supplied via the switch 382.
[0333]
In step S363, the robust unit 361 determines whether or not N binary object images have been stored. If it is determined that N binary object images have not been stored, the process returns to step S361, and the motion compensation processing and the storage processing of the binary object image are repeated.
[0334]
If it is determined in step S363 that N binary object images have been stored, the process proceeds to step S364, and each of the weighting units 384-1 to 384-N multiplies the corresponding one of the N binary object images by one of the weights w1 to wN.
[0335]
In step S365, the integration unit 385 integrates the weighted N binary object images.
[0336]
In step S366, the integrating unit 385 generates a binary object image from the integrated image, for example, by comparison with a predetermined threshold value th1, and the process ends.
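Steps S364 through S366 amount to a weighted sum of the N motion-compensated binary object images followed by a threshold comparison. The sketch below illustrates this under assumptions: the function name `robust_binary_image` is hypothetical, motion compensation is taken as already applied, and the direction of the comparison (accumulated value at or above the threshold gives 1) is an assumption, since the text only states that the integrated image is compared with a predetermined threshold.

```python
def robust_binary_image(images, weights, threshold):
    """Combine N motion-compensated binary images into one binary image.

    images    -- list of N images, each a list of rows of 0/1 values
    weights   -- list of N weights (w1 to wN in the text)
    threshold -- predetermined threshold for the accumulated values
    """
    if not images or len(images) != len(weights):
        raise ValueError("need one weight per image and at least one image")
    height = len(images[0])
    width = len(images[0][0])
    acc = [[0.0] * width for _ in range(height)]
    for img, w in zip(images, weights):   # weighting (step S364)
        for r in range(height):
            for c in range(width):
                acc[r][c] += w * img[r][c]  # integration (step S365)
    # Assumption: accumulated value >= threshold gives 1 (step S366).
    return [[1 if v >= threshold else 0 for v in row] for row in acc]
```

With equal weights of 1 and a threshold of 2, a pixel is kept as 1 only when at least two of the input images agree, which suppresses isolated noise in a single frame.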
[0337]
As described above, the region specifying unit 103 having the configuration shown in FIG. 56 can generate region information based on the robust binary object image.
[0338]
As described above, the area specifying unit 103 can generate area information indicating that each of the pixels included in the frame belongs to the motion area, the still area, the uncovered background area, or the covered background area.
[0339]
FIG. 62 is a block diagram illustrating an example of the configuration of the mixture ratio calculation unit 104. Based on the input image, the estimated mixture ratio processing unit 401 calculates an estimated mixture ratio for each pixel by an operation corresponding to the model of the covered background area, and supplies the calculated estimated mixture ratio to the mixture ratio determining unit 403.
[0340]
Based on the input image, the estimated mixture ratio processing unit 402 calculates an estimated mixture ratio for each pixel by an operation corresponding to the model of the uncovered background region, and supplies the calculated estimated mixture ratio to the mixture ratio determining unit 403.
[0341]
Since it can be assumed that the object corresponding to the foreground is moving at a constant speed within the shutter time, the mixture ratio α of the pixels belonging to the mixed area has the following property: the mixture ratio α changes linearly in response to changes in the pixel position. If the change in the pixel position is one-dimensional, the change in the mixture ratio α can be expressed by a straight line; if the change in the pixel position is two-dimensional, the change in the mixture ratio α can be expressed by a plane.
[0342]
Since the period of one frame is short, it is assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed.
[0343]
In this case, the gradient of the mixture ratio α is the reciprocal of the motion amount v of the foreground within the shutter time.
[0344]
An example of an ideal mixing ratio α is shown in FIG. The gradient l in the mixing region of the ideal mixing ratio α can be expressed as the reciprocal of the motion amount v.
[0345]
As shown in FIG. 63, the ideal mixing ratio α has a value of 1 in the background area, a value of 0 in the foreground area, and a value exceeding 0 and less than 1 in the mixing area.
[0346]
In the example of FIG. 64, the pixel value C06 of the seventh pixel from the left of frame #n can be expressed by Expression (8) using the pixel value P06 of the seventh pixel from the left of frame #n-1.
[0347]
C06 = 2 / v ・ P06 + Σ_i Fi / v (8)

[0348]
In Expression (8), the pixel value C06 is expressed as the pixel value M of the pixel in the mixed region, and the pixel value P06 is expressed as the pixel value B of the pixel in the background region. That is, the pixel value M of the pixel in the mixed region and the pixel value B of the pixel in the background region can be expressed as Equation (9) and Equation (10), respectively.
[0349]
M = C06 (9)
B = P06 (10)
[0350]
2 / v in equation (8) corresponds to the mixing ratio α. Since the motion amount v is 4, the mixture ratio α of the seventh pixel from the left of the frame #n is 0.5.
[0351]
As described above, by regarding the pixel value C of the frame #n of interest as the pixel value of the mixed region and the pixel value P of frame #n-1, the frame before frame #n, as the pixel value of the background region, Equation (3) indicating the mixing ratio α can be rewritten as Equation (11).
[0352]
C = α ・ P + f (11)
f in Expression (11) is the sum of the foreground components included in the pixel of interest, Σ_i Fi / v.
There are two variables included in equation (11): the mixture ratio α and the sum f of the foreground components.
[0353]
Similarly, FIG. 65 shows a model in which pixel values are expanded in the time direction, in which the amount of motion v is 4 and the number of virtual divisions in the time direction is 4, in the uncovered background area.
[0354]
In the uncovered background area, as in the covered background area described above, by regarding the pixel value C of the frame #n of interest as the pixel value of the mixed area and the pixel value N of frame #n+1, the frame after frame #n, as the pixel value of the background area, Equation (3) indicating the mixture ratio α can be expressed as Equation (12).
[0355]
C = α ・ N + f (12)
[0356]
Although the background object has been described as stationary, Expressions (8) to (12) can also be applied when the background object is moving, by using the pixel value of the pixel at the position corresponding to the background motion amount v.
For example, in FIG. 64, when the motion amount v of the object corresponding to the background is 2, the number of virtual divisions is 2, and the object corresponding to the background is moving to the right in the figure, the pixel value B of the pixel in the background area is set to the pixel value P04.
[0357]
Since Expression (11) and Expression (12) each include two variables, the mixture ratio α cannot be obtained as it is. Here, since an image generally has a strong spatial correlation, adjacent pixels have almost the same pixel value.
[0358]
Therefore, since the foreground components have a strong spatial correlation, the formula is modified so that the sum f of the foreground components can be derived from the previous or subsequent frame to obtain the mixture ratio α.
[0359]
The pixel value Mc of the seventh pixel from the left in frame #n in FIG. 66 can be expressed by Expression (13).
[0360]
[Expression 7]

2 / v in the first term on the right side of Equation (13) corresponds to the mixing ratio α. The second term on the right side of Expression (13) is expressed as Expression (14) using the pixel value of the subsequent frame # n + 1.
[0361]
[Equation 8]

[0362]
Here, Equation (15) is established using the spatial correlation of the foreground components.
[0363]
F = F05 = F06 = F07 = F08 = F09 = F10 = F11 = F12 (15)
Expression (14) can be replaced with Expression (16) using Expression (15).
[0364]
[Equation 9]

[0365]
As a result, β can be expressed by equation (17).
[0366]
β = 2/4 (17)
[0367]
In general, assuming that the foreground components related to the mixed region are equal as shown in Equation (15), Equation (18) is established from the relationship of the internal ratio for all the pixels in the mixed region.
[0368]
β = 1-α (18)
[0369]
If Expression (18) is established, Expression (11) can be expanded as shown in Expression (19).
[0370]
C = α ・ P + f = α ・ P + (1-α) ・ N (19)

[0371]
Similarly, if equation (18) holds, equation (12) can be expanded as shown in equation (20).
[0372]
C = α ・ N + f = α ・ N + (1-α) ・ P (20)

[0373]
In Expression (19) and Expression (20), C, N, and P are known pixel values, and therefore the only variable included in Expression (19) and Expression (20) is the mixture ratio α. FIG. 67 shows the relationship between C, N, and P in Expressions (19) and (20). C is the pixel value of the pixel of interest in frame #n, for which the mixture ratio α is calculated. N is the pixel value of the pixel in frame #n+1 whose position in the spatial direction corresponds to that of the pixel of interest. P is the pixel value of the pixel in frame #n-1 whose position in the spatial direction corresponds to that of the pixel of interest.
[0374]
Accordingly, since Expression (19) and Expression (20) each contain one variable, the mixture ratio α can be calculated using the pixel values of the pixels of three frames. The condition for obtaining the correct mixture ratio α by solving Expressions (19) and (20) is that the foreground components related to the mixed region are equal; that is, in the foreground image object captured while the foreground object is stationary, the pixel values of consecutive pixels, numbering twice the motion amount v and located at the boundary of the image object corresponding to the direction of motion of the foreground object, are constant.
[0375]
As described above, the mixing ratio α of the pixels belonging to the covered background area is calculated by Expression (21), and the mixing ratio α of the pixels belonging to the uncovered background area is calculated by Expression (22).
[0376]
α = (C-N) / (P-N) (21)
α = (C-P) / (N-P) (22)
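Expressions (21) and (22) can be evaluated directly from the three pixel values. The sketch below is illustrative only: the function names are hypothetical, and returning `None` when the denominator is zero is an assumption, since the text does not say how that case is handled.

```python
def estimate_alpha_covered(c, n, p):
    """Expression (21): alpha = (C - N) / (P - N), covered background model."""
    if p == n:
        return None  # assumption: the ratio is undefined when P equals N
    return (c - n) / (p - n)


def estimate_alpha_uncovered(c, n, p):
    """Expression (22): alpha = (C - P) / (N - P), uncovered background model."""
    if n == p:
        return None  # assumption: the ratio is undefined when N equals P
    return (c - p) / (n - p)
```

As a check on the arithmetic, a covered background pixel with P = 200, N = 100, and C = 150 gives (150 - 100) / (200 - 100) = 0.5, which agrees with C = α ・ P + (1-α) ・ N for α = 0.5.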
[0377]
FIG. 68 is a block diagram illustrating a configuration of the estimated mixture ratio processing unit 401. The frame memory 421 stores the input image in units of frames, and supplies the frame one frame before the frame being input as the input image to the frame memory 422 and the mixture ratio calculation unit 423.
[0378]
The frame memory 422 stores the input image in units of frames, and supplies the frame one frame before the frame supplied from the frame memory 421 to the mixture ratio calculation unit 423.
[0379]
Therefore, when frame #n+1 is input to the mixture ratio calculation unit 423 as the input image, the frame memory 421 supplies frame #n to the mixture ratio calculation unit 423, and the frame memory 422 supplies frame #n-1 to the mixture ratio calculation unit 423.
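The two-stage delay formed by the frame memories 421 and 422 can be pictured with the following minimal sketch. The class `FrameMemory` and the generator `three_frame_windows` are hypothetical names introduced here; the sketch only models the ordering of frames, not the actual storage hardware.

```python
class FrameMemory:
    """Stores one frame and hands back the previously stored frame on each push."""

    def __init__(self):
        self._stored = None

    def push(self, frame):
        previous = self._stored
        self._stored = frame
        return previous


def three_frame_windows(frames):
    """Yield (frame #n-1, frame #n, frame #n+1) once both memories are filled."""
    memory_421 = FrameMemory()
    memory_422 = FrameMemory()
    for incoming in frames:                      # plays the role of frame #n+1
        current = memory_421.push(incoming)      # frame #n
        if current is None:
            continue
        previous = memory_422.push(current)      # frame #n-1
        if previous is not None:
            yield previous, current, incoming
```

Feeding the frames in time order yields one triple per input frame after the first two frames have been absorbed by the memories.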
[0380]
The mixture ratio calculation unit 423 calculates the estimated mixture ratio of the pixel of interest by the calculation shown in Expression (21), based on the pixel value C of the pixel of interest in frame #n, the pixel value N of the pixel in frame #n+1 whose spatial position corresponds to that of the pixel of interest, and the pixel value P of the pixel in frame #n-1 whose spatial position corresponds to that of the pixel of interest, and outputs the calculated estimated mixture ratio. For example, when the background is stationary, the mixture ratio calculation unit 423 calculates the estimated mixture ratio of the pixel of interest based on the pixel value C of the pixel of interest in frame #n, the pixel value N of the pixel in frame #n+1 at the same position in the frame as the pixel of interest, and the pixel value P of the pixel in frame #n-1 at the same position in the frame as the pixel of interest, and outputs the calculated estimated mixture ratio.
[0381]
As described above, the estimated mixture ratio processing unit 401 can calculate the estimated mixture ratio based on the input image and supply the estimated mixture ratio to the mixture ratio determining unit 403.
[0382]
The estimated mixture ratio processing unit 402 is the same as the estimated mixture ratio processing unit 401, except that, whereas the estimated mixture ratio processing unit 401 calculates the estimated mixture ratio of the pixel of interest by the calculation shown in Expression (21), the estimated mixture ratio processing unit 402 calculates it by the calculation shown in Expression (22). Its description is therefore omitted.
[0383]
FIG. 69 is a diagram illustrating an example of the estimated mixture ratio calculated by the estimated mixture ratio processing unit 401. The estimated mixture ratio shown in FIG. 69 indicates the result when the foreground motion amount v corresponding to an object moving at a constant speed is 11, for one line.
[0384]
As shown in FIG. 69, it can be seen that the estimated mixture ratio changes almost linearly in the mixed region.
[0385]
Returning to FIG. 62, the mixture ratio determination unit 403 sets the mixture ratio α based on the region information supplied from the region specifying unit 103, which indicates whether the pixel for which the mixture ratio α is to be calculated belongs to the foreground region, the background region, the covered background region, or the uncovered background region. The mixture ratio determination unit 403 sets the mixture ratio α to 0 when the target pixel belongs to the foreground region, and to 1 when the target pixel belongs to the background region. When the target pixel belongs to the covered background region, it sets the mixture ratio α to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401, and when the target pixel belongs to the uncovered background region, it sets the mixture ratio α to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 402. The mixture ratio determination unit 403 outputs the mixture ratio α set based on the region information.
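The selection rule applied by the mixture ratio determination unit 403 can be sketched as below. The integer region labels and the function name `decide_mixture_ratio` are hypothetical; the description only specifies which value is chosen for each of the four regions.

```python
# Hypothetical labels for the four regions carried by the region information.
FOREGROUND, BACKGROUND, COVERED, UNCOVERED = range(4)


def decide_mixture_ratio(region, estimate_covered, estimate_uncovered):
    """Pick the mixture ratio alpha for one pixel from its region label.

    estimate_covered   : estimate from the covered background model (unit 401)
    estimate_uncovered : estimate from the uncovered background model (unit 402)
    """
    if region == FOREGROUND:
        return 0.0
    if region == BACKGROUND:
        return 1.0
    if region == COVERED:
        return estimate_covered
    if region == UNCOVERED:
        return estimate_uncovered
    raise ValueError("unknown region label: %r" % (region,))
```

Both estimates are computed for every pixel, and the region label only decides which of them, if either, is kept.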
[0386]
FIG. 70 is a block diagram showing another configuration of the mixture ratio calculation unit 104. Based on the region information supplied from the region specifying unit 103, the selection unit 441 supplies the pixels belonging to the covered background region and the corresponding pixels of the preceding and following frames to the estimated mixture ratio processing unit 442. Based on the region information supplied from the region specifying unit 103, the selection unit 441 supplies the pixels belonging to the uncovered background region and the corresponding pixels of the preceding and following frames to the estimated mixture ratio processing unit 443.
[0387]
Based on the pixel value input from the selection unit 441, the estimated mixture ratio processing unit 442 calculates the estimated mixture ratio of the pixel of interest belonging to the covered background region by the calculation shown in Expression (21). The calculated estimated mixture ratio is supplied to the selection unit 444.
[0388]
Based on the pixel value input from the selection unit 441, the estimated mixture ratio processing unit 443 calculates an estimated mixture ratio of the pixel of interest belonging to the uncovered background region by the calculation shown in Expression (22). Then, the calculated estimated mixture ratio is supplied to the selection unit 444.
[0389]
Based on the region information supplied from the region specifying unit 103, the selection unit 444 selects the estimated mixture ratio of 0 and sets it as the mixture ratio α when the target pixel belongs to the foreground region, and selects the estimated mixture ratio of 1 and sets it as the mixture ratio α when the target pixel belongs to the background region. When the target pixel belongs to the covered background region, the selection unit 444 selects the estimated mixture ratio supplied from the estimated mixture ratio processing unit 442 and sets it as the mixture ratio α, and when the target pixel belongs to the uncovered background region, it selects the estimated mixture ratio supplied from the estimated mixture ratio processing unit 443 and sets it as the mixture ratio α. The selection unit 444 outputs the mixture ratio α selected and set based on the region information.
[0390]
As described above, the mixture ratio calculation unit 104 having another configuration shown in FIG. 70 can calculate the mixture ratio α for each pixel included in the image and output the calculated mixture ratio α.
[0391]
With reference to the flowchart of FIG. 71, the process by which the mixture ratio calculation unit 104 having the configuration shown in FIG. 62 calculates the mixture ratio α will be described. In step S401, the mixture ratio calculation unit 104 acquires the region information supplied from the region specifying unit 103. In step S402, the estimated mixture ratio processing unit 401 performs the estimated mixture ratio calculation process using the model corresponding to the covered background region, and supplies the calculated estimated mixture ratio to the mixture ratio determination unit 403. Details of the mixture ratio estimation process will be described later with reference to the flowchart of FIG. 72.
[0392]
In step S 403, the estimated mixture ratio processing unit 402 performs an estimated mixture ratio calculation process using a model corresponding to the uncovered background region, and supplies the calculated estimated mixture ratio to the mixture ratio determining unit 403.
[0393]
In step S404, the mixture ratio calculation unit 104 determines whether the mixture ratio α has been estimated for the entire frame. If it is determined that the mixture ratio α has not been estimated for the entire frame, the process returns to step S402, and the process of estimating the mixture ratio α is executed for the next pixel.
[0394]
If it is determined in step S404 that the mixture ratio α has been estimated for the entire frame, the process proceeds to step S405, where the mixture ratio determination unit 403 sets the mixture ratio α based on the region information supplied from the region specifying unit 103, which indicates whether the pixel belongs to the foreground region, the background region, the covered background region, or the uncovered background region. The mixture ratio determination unit 403 sets the mixture ratio α to 0 when the target pixel belongs to the foreground region, and to 1 when the target pixel belongs to the background region. When the target pixel belongs to the covered background region, it sets the mixture ratio α to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401, and when the target pixel belongs to the uncovered background region, it sets the mixture ratio α to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 402. The process then ends.
[0395]
As described above, the mixture ratio calculation unit 104 can calculate the mixture ratio α, which is a feature amount corresponding to each pixel, based on the region information supplied from the region specifying unit 103 and the input image.
[0396]
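The process by which the mixture ratio calculation unit 104 having the configuration shown in FIG. 70 calculates the mixture ratio α is the same as the process described with reference to the flowchart of FIG. 71, and thus its description is omitted.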
[0397]
Next, the mixture ratio estimation process using the model corresponding to the covered background region, which corresponds to step S402 in FIG. 71, will be described with reference to the flowchart of FIG. 72.
[0398]
In step S421, the mixture ratio calculation unit 423 acquires the pixel value C of the target pixel of frame #n from the frame memory 421.
[0399]
In step S422, the mixture ratio calculation unit 423 acquires the pixel value P of the pixel of frame # n−1 corresponding to the target pixel from the frame memory 422.
[0400]
In step S423, the mixture ratio calculation unit 423 acquires the pixel value N of the pixel of frame # n + 1 corresponding to the target pixel included in the input image.
[0401]
In step S424, the mixture ratio calculation unit 423 calculates the estimated mixture ratio based on the pixel value C of the pixel of interest in frame #n, the pixel value P of the pixel in frame #n-1, and the pixel value N of the pixel in frame #n+1.
[0402]
In step S425, the mixture ratio calculation unit 423 determines whether or not the process of calculating the estimated mixture ratio has been completed for the entire frame, and determines that the process of calculating the estimated mixture ratio has not been completed for the entire frame. If so, the process returns to step S421, and the process of calculating the estimated mixture ratio for the next pixel is repeated.
[0403]
If it is determined in step S425 that the process of calculating the estimated mixture ratio has been completed for the entire frame, the process ends.
[0404]
Thus, the estimated mixture ratio processing unit 401 can calculate the estimated mixture ratio based on the input image.
[0405]
The process of estimating the mixture ratio using the model corresponding to the uncovered background region in step S403 of FIG. 71 is the same as the process shown in the flowchart of FIG. 72, except that it uses the expression corresponding to the model of the uncovered background region, and thus its description is omitted.
[0406]
Note that the estimated mixture ratio processing unit 442 and the estimated mixture ratio processing unit 443 shown in FIG. 70 perform the same processing as the flowchart shown in FIG. 72 to calculate the estimated mixture ratio, and thus description thereof is omitted.
[0407]
Although the object corresponding to the background has been described as stationary, the above-described processing for obtaining the mixture ratio α can be applied even if the image corresponding to the background region includes motion. For example, when the image corresponding to the background region moves uniformly, the estimated mixture ratio processing unit 401 shifts the entire image in accordance with the motion of the background and performs the same processing as when the object corresponding to the background is stationary. When the image corresponding to the background region includes background motion that differs from one local area to another, the estimated mixture ratio processing unit 401 selects the pixels corresponding to the background motion as the pixels corresponding to the pixels belonging to the mixed region, and executes the above-described processing.
[0408]
In addition, the mixture ratio calculation unit 104 may execute only the mixture ratio estimation process using the model corresponding to the covered background region for all pixels and output the calculated estimated mixture ratio as the mixture ratio α. In this case, the mixture ratio α indicates the ratio of the background components for pixels belonging to the covered background region, and the ratio of the foreground components for pixels belonging to the uncovered background region. If, for pixels belonging to the uncovered background region, the absolute value of the difference between the mixture ratio α calculated in this way and 1 is calculated and set as the mixture ratio α, the signal processing apparatus can obtain the mixture ratio α indicating the ratio of the background components for pixels belonging to the uncovered background region.
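The conversion described for uncovered background pixels is a one-line calculation, shown here as a hedged sketch. The function name and the boolean flag are hypothetical; only the |α - 1| rule comes from the description.

```python
def background_ratio_from_covered_model(alpha_estimate, in_uncovered_region):
    """Turn a covered-model estimate into a background-component ratio.

    For uncovered background pixels the covered-model estimate expresses the
    foreground ratio, so |alpha - 1| is taken; other pixels pass through.
    """
    if in_uncovered_region:
        return abs(alpha_estimate - 1.0)
    return alpha_estimate
```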
[0409]
Similarly, the mixture ratio calculation unit 104 may execute only the mixture ratio estimation process using the model corresponding to the uncovered background region for all pixels and output the calculated estimated mixture ratio as the mixture ratio α.
[0410]
Next, another process of the mixture ratio calculation unit 104 will be described.
[0411]
Using the property that the mixture ratio α changes linearly with pixel position because the object corresponding to the foreground moves at a constant speed within the shutter time, equations that approximate the mixture ratio α and the sum f of the foreground components in the spatial direction are established. The mixture ratio α is calculated by solving these equations using a plurality of sets of pixel values of pixels belonging to the mixed region and pixel values of pixels belonging to the background region.
[0412]
When the change in the mixing ratio α is approximated as a straight line, the mixing ratio α is expressed by Expression (23).
[0413]
α = il + p (23)
In Expression (23), i is an index in the spatial direction where the position of the pixel of interest is 0. l is the slope of the straight line of the mixing ratio α. p is a straight line intercept of the mixing ratio α and is the mixing ratio α of the pixel of interest. In equation (23), the index i is known, but the slope l and the intercept p are unknown.
[0414]
FIG. 73 shows the relationship between the index i, the slope l, and the intercept p.
[0415]
By approximating the mixture ratio α as shown in Expression (23), a plurality of different mixture ratios α are expressed by two variables for a plurality of pixels. In the example shown in FIG. 73, the five mixing ratios for the five pixels are expressed by two variables, the gradient l and the intercept p.
[0416]
When the mixture ratio α is approximated by a plane as shown in FIG. 74, taking into account the motion v corresponding to the two directions of the image, horizontal and vertical, Expression (23) is extended to a plane, and the mixture ratio α is expressed by Expression (24).
[0417]
α = jm + kq + p (24)
In Expression (24), j is a horizontal index with the position of the pixel of interest being 0, and k is a vertical index. m is the horizontal inclination of the surface of the mixing ratio α, and q is the vertical inclination of the surface of the mixing ratio α. p is an intercept of the surface of the mixing ratio α.
[0418]
For example, in frame #n shown in FIG. 64, equations (25) to (27) are established for C05 to C07, respectively.
[0419]
C05 = α05 ・ B05 / v + f05 (25)
C06 = α06 ・ B06 / v + f06 (26)
C07 = α07 ・ B07 / v + f07 (27)
[0420]
When the foreground components match in the vicinity, that is, F01 to F03 are equal, and F01 to F03 are replaced with Fc, Expression (28) is established.
[0421]
f (x) = (1-α (x)) · Fc (28)
In Expression (28), x represents a position in the spatial direction.
[0422]
When α (x) is replaced with Expression (24), Expression (28) can be expressed as Expression (29).
[0423]
f (x) = (1 - (jm + kq + p)) · Fc
= j · (-m · Fc) + k · (-q · Fc) + ((1-p) · Fc)
= js + kt + u (29)
[0424]
In the equation (29), (−m · Fc), (−q · Fc), and (1-p) · Fc are replaced as shown in the equations (30) to (32).
[0425]
s = -m · Fc (30)
t = -q · Fc (31)
u = (1-p) ・ Fc (32)
[0426]
In Expression (29), j is a horizontal index with the position of the pixel of interest as 0, and k is a vertical index.
[0427]
In this way, since it is assumed that the object corresponding to the foreground moves at a constant speed within the shutter time and the component corresponding to the foreground is constant in the vicinity, the sum of the foreground components is expressed by Equation (29). Approximated.
[0428]
When the mixture ratio α is approximated by a straight line, the sum of the foreground components can be expressed by Expression (33).
[0429]
f (x) = is + u (33)
[0430]
When the sum of the mixture ratio α and the foreground component in Expression (13) is replaced using Expression (24) and Expression (29), the pixel value M is expressed by Expression (34).
[0431]
M = (jm + kq + p) · B + js + kt + u
= jB · m + kB · q + B · p + j · s + k · t + u (34)
[0432]
In Expression (34), there are six unknown variables: the horizontal gradient m of the plane of the mixture ratio α, the vertical gradient q of the plane of the mixture ratio α, the intercept p of the plane of the mixture ratio α, and s, t, and u.
[0433]
The pixel value M or the pixel value B is set in the normal equation shown in Expression (34) for each pixel in the vicinity of the pixel of interest, and the resulting plurality of normal equations, in which the pixel values M and B are set, are solved by the method of least squares to calculate the mixture ratio α.
[0434]
For example, when the horizontal index j of the pixel of interest is set to 0, the vertical index k is set to 0, and the pixel value M or the pixel value B is set in the normal equation shown in Expression (34) for each of the 3 × 3 pixels in the vicinity of the pixel of interest, Expressions (35) to (43) are obtained.

[0435]
Since the horizontal index j of the pixel of interest is 0 and the vertical index k is 0, the mixture ratio α of the pixel of interest is equal to the value of Expression (24) at j = 0 and k = 0, that is, the intercept p.
[0436]
Accordingly, based on the nine Expressions (35) to (43), the values of the horizontal gradient m, the vertical gradient q, and the intercepts p, s, t, and u are calculated by the method of least squares, and it suffices to output the intercept p as the mixture ratio α.
[0437]
Next, a more specific procedure for calculating the mixture ratio α by applying the least square method will be described.
[0438]
When the index j and the index k are expressed by a single index x, the relationship between the index j, the index k, and the index x is expressed by Expression (44).
[0439]
x = (j + 1) ・ 3+ (k + 1) (44)
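As a small illustration of Expression (44), the mapping between the offsets (j, k) in the 3 × 3 block and the single index x can be written as follows. The function names are hypothetical, and the inverse mapping is added here only for convenience; it is not stated in the description.

```python
def block_index(j, k):
    """Expression (44): x = (j + 1) * 3 + (k + 1), with j and k in {-1, 0, +1}."""
    return (j + 1) * 3 + (k + 1)


def block_offsets(x):
    """Inverse of block_index: recover (j, k) from x in 0..8."""
    j_plus_1, k_plus_1 = divmod(x, 3)
    return j_plus_1 - 1, k_plus_1 - 1
```

The pixel of interest (j = 0, k = 0) maps to x = 4, and the nine offsets cover x = 0 to 8 exactly once.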
[0440]
The horizontal gradient m, the vertical gradient q, and the intercepts p, s, t, and u are expressed as variables w0, w1, w2, w3, w4, and w5, respectively, and jB, kB, B, j, k, and 1 are expressed as a0, a1, a2, a3, a4, and a5, respectively. Taking the error ex into account, Expressions (35) to (43) can be rewritten as Expression (45).
[0441]
[Expression 12]
Mx = Σ(y=0..5) ay · wy + ex (45)
In the formula (45), x is an integer value from 0 to 8.
[0442]
From equation (45), equation (46) can be derived.
[0443]
[Expression 13]
ex = Mx - Σ(y=0..5) ay · wy (46)
[0444]
Here, in order to apply the method of least squares, an error sum of squares E is defined as shown in equation (47).
[0445]
[Expression 14]
E = Σ(x=0..8) ex² (47)
[0446]
To minimize the error, the partial derivative of the sum of squared errors E with respect to the variable wv must be 0. Here, v is any one of the integers 0 to 5. Therefore, wy is obtained so as to satisfy Expression (48).
[0447]
[Expression 15]
∂E/∂wv = 2 · Σ(x=0..8) ex · (∂ex/∂wv) = 0 (48)
[0448]
Substituting equation (46) into equation (48) yields equation (49).
[0449]
[Expression 16]
Σ(x=0..8) (av · Σ(y=0..5) ay · wy) = Σ(x=0..8) av · Mx (49)
[0450]
For example, the sweep method (Gauss-Jordan elimination) is applied to the six equations obtained by substituting each of the integers 0 to 5 for v in Expression (49) to calculate wy. As described above, w0 is the horizontal gradient m, w1 is the vertical gradient q, w2 is the intercept p, w3 is s, w4 is t, and w5 is u.
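The least-squares procedure can be sketched in plain Python as follows: build the coefficients jB, kB, B, j, k, 1 for each pixel of the block, accumulate the six normal equations, and solve them by Gauss-Jordan elimination. All function names, the sample layout `(j, k, M, B)`, the partial pivoting, and the singularity threshold are assumptions of this sketch, not details given in the description. Note that if the background values B are identical across the block, the columns jB, kB, B become proportional to j, k, 1 and the system is singular.

```python
def design_row(j, k, b):
    """Coefficients a0..a5 for one pixel: jB, kB, B, j, k, 1."""
    return [j * b, k * b, float(b), float(j), float(k), 1.0]


def normal_equations(rows, m_values):
    """Accumulate sum_x(a_v * a_y) and sum_x(a_v * M_x) over all pixels x."""
    n = len(rows[0])
    lhs = [[sum(r[v] * r[y] for r in rows) for y in range(n)] for v in range(n)]
    rhs = [sum(r[v] * m for r, m in zip(rows, m_values)) for v in range(n)]
    return lhs, rhs


def gauss_jordan(lhs, rhs, tol=1e-12):
    """Solve lhs * w = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(lhs[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < tol:
            raise ValueError("normal equations are singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [value / scale for value in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]


def solve_mixture_plane(samples):
    """samples: iterable of (j, k, M, B). Returns [m, q, p, s, t, u]."""
    samples = list(samples)
    rows = [design_row(j, k, b) for j, k, _, b in samples]
    m_values = [m for _, _, m, _ in samples]
    lhs, rhs = normal_equations(rows, m_values)
    return gauss_jordan(lhs, rhs)
```

The third element of the returned list is the intercept p, which is the estimated mixture ratio of the pixel of interest.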
[0451]
As described above, the horizontal gradient m, the vertical gradient q, and the intercepts p, s, t, and u can be obtained by applying the method of least squares to the equations in which the pixel values M and the pixel values B are set.
[0452]
In the description corresponding to Expressions (35) to (43), the pixel value of a pixel included in the mixed region is denoted M and the pixel value of a pixel included in the background region is denoted B; however, a normal equation must be set up separately for the case where the pixel of interest is included in the covered background region and for the case where it is included in the uncovered background region.
[0453]
For example, when the mixture ratio α of pixels included in the covered background region of frame #n shown in FIG. 64 is obtained, the pixel values C04 to C08 of the pixels of frame #n and the pixel values P04 to P08 of the pixels of frame #n-1 are set in the normal equation.
[0454]
When the mixture ratio α of pixels included in the uncovered background region of frame #n shown in FIG. 65 is obtained, the pixel values C28 to C32 of the pixels of frame #n and the pixel values N28 to N32 of the pixels of frame #n+1 are set in the normal equation.
[0455]
Further, for example, when calculating the mixture ratio α of the pixels included in the covered background region shown in FIG. 75, the following equations (50) to (58) are established. The pixel value of the pixel for calculating the mixture ratio α is Mc5.
Mc1 = (-1) · Bc1 · m + (-1) · Bc1 · q + Bc1 · p + (-1) · s + (-1) · t + u (50)
Mc2 = (0) · Bc2 · m + (-1) · Bc2 · q + Bc2 · p + (0) · s + (-1) · t + u (51)
Mc3 = (+1) · Bc3 · m + (-1) · Bc3 · q + Bc3 · p + (+1) · s + (-1) · t + u (52)
Mc4 = (-1) · Bc4 · m + (0) · Bc4 · q + Bc4 · p + (-1) · s + (0) · t + u (53)
Mc5 = (0) · Bc5 · m + (0) · Bc5 · q + Bc5 · p + (0) · s + (0) · t + u (54)
Mc6 = (+1) · Bc6 · m + (0) · Bc6 · q + Bc6 · p + (+1) · s + (0) · t + u (55)
Mc7 = (-1) · Bc7 · m + (+1) · Bc7 · q + Bc7 · p + (-1) · s + (+1) · t + u (56)
Mc8 = (0) · Bc8 · m + (+1) · Bc8 · q + Bc8 · p + (0) · s + (+1) · t + u (57)
Mc9 = (+1) · Bc9 · m + (+1) · Bc9 · q + Bc9 · p + (+1) · s + (+1) · t + u (58)
[0456]
When the mixture ratio α of a pixel included in the covered background region of frame #n is calculated, the pixel values Bc1 to Bc9 of the background-region pixels of frame #n-1 corresponding to the pixels of frame #n are used in Expressions (50) to (58).
[0457]
When calculating the mixture ratio α of the pixels included in the uncovered background area shown in FIG. 75, the following equations (59) to (67) are established. The pixel value of the pixel for calculating the mixture ratio α is Mu5.
Mu1 = (-1) · Bu1 · m + (-1) · Bu1 · q + Bu1 · p + (-1) · s + (-1) · t + u (59)
Mu2 = (0) · Bu2 · m + (-1) · Bu2 · q + Bu2 · p + (0) · s + (-1) · t + u (60)
Mu3 = (+1) · Bu3 · m + (-1) · Bu3 · q + Bu3 · p + (+1) · s + (-1) · t + u (61)
Mu4 = (-1) · Bu4 · m + (0) · Bu4 · q + Bu4 · p + (-1) · s + (0) · t + u (62)
Mu5 = (0) · Bu5 · m + (0) · Bu5 · q + Bu5 · p + (0) · s + (0) · t + u (63)
Mu6 = (+1) · Bu6 · m + (0) · Bu6 · q + Bu6 · p + (+1) · s + (0) · t + u (64)
Mu7 = (-1) · Bu7 · m + (+1) · Bu7 · q + Bu7 · p + (-1) · s + (+1) · t + u (65)
Mu8 = (0) · Bu8 · m + (+1) · Bu8 · q + Bu8 · p + (0) · s + (+1) · t + u (66)
Mu9 = (+1) · Bu9 · m + (+1) · Bu9 · q + Bu9 · p + (+1) · s + (+1) · t + u (67)
[0458]
When the mixture ratio α of a pixel included in the uncovered background region of frame #n is calculated, the pixel values Bu1 to Bu9 of the background-region pixels of frame #n+1 corresponding to the pixels of frame #n are used in Expressions (59) to (67).
[0459]
FIG. 76 is a block diagram illustrating a configuration of the estimated mixture ratio processing unit 401. The image input to the estimated mixture ratio processing unit 401 is supplied to the delay circuit 501 and the adding unit 502.
[0460]
The delay circuit 501 delays the input image by one frame and supplies it to the adding unit 502. When frame #n is input to the adding unit 502 as the input image, the delay circuit 501 supplies frame #n-1 to the adding unit 502.
[0461]
The adding unit 502 sets the pixel value of the pixel near the pixel for calculating the mixture ratio α and the pixel value of the frame # n−1 in a normal equation. For example, the adding unit 502 sets the pixel values Mc1 to Mc9 and the pixel values Bc1 to Bc9 in the normal equation based on the equations (50) to (58). The adding unit 502 supplies the normal equation in which the pixel value is set to the calculation unit 503.
[0462]
The computing unit 503 solves the normal equation supplied from the adding unit 502 by a sweeping method or the like to obtain an estimated mixture ratio, and outputs the obtained estimated mixture ratio.
[0463]
As described above, the estimated mixture ratio processing unit 401 can calculate the estimated mixture ratio based on the input image and supply the estimated mixture ratio to the mixture ratio determining unit 403.
[0464]
Note that the estimated mixture ratio processing unit 402 has the same configuration as the estimated mixture ratio processing unit 401, and thus description thereof is omitted.
[0465]
FIG. 77 is a diagram illustrating an example of the estimated mixture ratio calculated by the estimated mixture ratio processing unit 401. The estimated mixture ratio shown in FIG. 77 is the result, for one line, of a calculation in which the foreground motion v corresponding to an object moving at a constant speed is 11 and the equations are generated with a block of 7 × 7 pixels as one unit.
[0466]
As shown in FIG. 77, it can be seen that the estimated mixture ratio changes almost linearly in the mixed region.
[0467]
The mixture ratio determination unit 403 sets the mixture ratio based on the region information supplied from the region specifying unit 103, which indicates whether the pixel for which the mixture ratio is to be calculated belongs to the foreground region, the background region, the covered background region, or the uncovered background region. The mixture ratio determination unit 403 sets the mixture ratio to 0 when the target pixel belongs to the foreground region, and to 1 when the target pixel belongs to the background region. When the target pixel belongs to the covered background region, it sets the mixture ratio to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401, and when the target pixel belongs to the uncovered background region, it sets the mixture ratio to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 402. The mixture ratio determination unit 403 outputs the mixture ratio set based on the region information.
[0468]
With reference to the flowchart of FIG. 78, the process by which the mixture ratio calculation unit 104 calculates the mixture ratio when the estimated mixture ratio processing unit 401 has the configuration shown in FIG. 76 will be described. In step S501, the mixture ratio calculation unit 104 acquires the region information supplied from the region specifying unit 103. In step S502, the estimated mixture ratio processing unit 401 executes the mixture ratio estimation process using the model corresponding to the covered background region, and supplies the estimated mixture ratio to the mixture ratio determination unit 403. Details of the mixture ratio estimation process will be described later with reference to the flowchart of FIG. 79.
[0469]
In step S 503, the estimated mixture ratio processing unit 402 executes a mixture ratio estimation process using a model corresponding to the uncovered background region, and supplies the estimated mixture ratio to the mixture ratio determining unit 403.
[0470]
In step S504, the mixture ratio calculation unit 104 determines whether the mixture ratio has been estimated for the entire frame. If it is determined that the mixture ratio has not been estimated for the entire frame, the process returns to step S502, and the process of estimating the mixture ratio is executed for the next pixel.
[0471]
If it is determined in step S504 that the mixture ratio has been estimated for the entire frame, the process proceeds to step S505, where the mixture ratio determination unit 403 sets the mixture ratio based on the region information supplied from the region specifying unit 103, which indicates whether the pixel for which the mixture ratio is to be calculated belongs to the foreground region, the background region, the covered background region, or the uncovered background region. The mixture ratio determination unit 403 sets the mixture ratio to 0 when the target pixel belongs to the foreground region, and to 1 when the target pixel belongs to the background region. When the target pixel belongs to the covered background region, it sets the mixture ratio to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401, and when the target pixel belongs to the uncovered background region, it sets the mixture ratio to the estimated mixture ratio supplied from the estimated mixture ratio processing unit 402. The process then ends.
[0472]
As described above, the mixture ratio calculation unit 104 can calculate the mixture ratio α, which is a feature amount corresponding to each pixel, based on the region information supplied from the region specifying unit 103 and the input image.
[0473]
By using the mixture ratio α, it becomes possible to separate the foreground components and the background components included in a pixel value while retaining the motion blur information included in the image corresponding to the moving object.
[0474]
Also, by compositing images based on the mixture ratio α, it is possible to create an image including correct motion blur that matches the speed of a moving object as if the real world was actually recaptured.
[0475]
Next, the mixture ratio estimation process using the model corresponding to the covered background region, which corresponds to step S502 in FIG. 78, will be described with reference to the flowchart of FIG. 79.
[0476]
In step S521, the adding unit 502 sets the pixel values included in the input image and the pixel values included in the image supplied from the delay circuit 501 in the normal equation corresponding to the model of the covered background region.
[0477]
In step S522, the estimated mixture ratio processing unit 401 determines whether the setting for the target pixel has been completed. If it is determined that the setting for the target pixel has not been completed, the process returns to step S521, and the process of setting pixel values in the normal equation is repeated.
[0478]
If it is determined in step S522 that the setting of pixel values for the target pixel has been completed, the process proceeds to step S523, where the calculation unit 503 calculates the estimated mixture ratio based on the normal equation in which the pixel values are set, and outputs the obtained estimated mixture ratio.
[0479]
Thus, the estimated mixture ratio processing unit 401 can calculate the estimated mixture ratio based on the input image.
[0480]
The mixture ratio estimation process using the model corresponding to the uncovered background region in step S503 of FIG. 78 is the same as the process shown in the flowchart of FIG. 79, except that it uses the normal equation corresponding to the model of the uncovered background region, and thus its description is omitted.
[0481]
Note that although the object corresponding to the background has been described as stationary, the above-described processing for obtaining the mixture ratio can be applied even if the image corresponding to the background region includes motion. For example, when the image corresponding to the background region moves uniformly, the estimated mixture ratio processing unit 401 shifts the entire image in accordance with this motion and performs the same processing as when the object corresponding to the background is stationary. When the image corresponding to the background region includes motion that differs from one local area to another, the estimated mixture ratio processing unit 401 selects the pixels corresponding to the motion as the pixels corresponding to the pixels belonging to the mixed region, and executes the above-described processing.
[0482]
Next, the foreground / background separation unit 105 will be described. FIG. 80 is a block diagram illustrating an example of the configuration of the foreground / background separation unit 105. The input image supplied to the foreground / background separation unit 105 is supplied to the separation unit 601, the switch 602, and the switch 604. The area information supplied from the area specifying unit 103 that indicates the covered background area and the uncovered background area is supplied to the separation unit 601. Area information indicating the foreground area is supplied to the switch 602. Area information indicating the background area is supplied to the switch 604.
[0483]
The mixing ratio α supplied from the mixing ratio calculation unit 104 is supplied to the separation unit 601.
[0484]
The separation unit 601 separates the foreground components from the input image based on the region information indicating the covered background region, the region information indicating the uncovered background region, and the mixing ratio α, and supplies the separated foreground components to the synthesis unit 603. It also separates the background components from the input image and supplies the separated background components to the synthesis unit 605.
[0485]
The switch 602 is closed when a pixel corresponding to the foreground is input based on the region information indicating the foreground region, and supplies only the pixel corresponding to the foreground included in the input image to the combining unit 603.
[0486]
The switch 604 is closed when a pixel corresponding to the background is input based on the region information indicating the background region, and supplies only the pixel corresponding to the background included in the input image to the combining unit 605.
[0487]
The combining unit 603 combines the foreground component image based on the component corresponding to the foreground supplied from the separation unit 601 and the pixel corresponding to the foreground supplied from the switch 602, and outputs the combined foreground component image. Since the foreground area and the mixed area do not overlap, the synthesis unit 603 synthesizes the foreground component image by applying a logical sum operation to the component corresponding to the foreground and the pixel corresponding to the foreground, for example.
[0488]
In the initialization process executed at the beginning of the foreground component image synthesis process, the synthesis unit 603 stores an image in which all pixel values are 0 in the built-in frame memory, and in the foreground component image synthesis process it stores (overwrites) the foreground component image. Accordingly, 0 is stored as the pixel value in the pixels corresponding to the background area in the foreground component image output by the synthesis unit 603.
[0489]
The combining unit 605 combines the background component images based on the components corresponding to the background supplied from the separation unit 601 and the pixels corresponding to the background supplied from the switch 604, and outputs the combined background component image. Since the background area and the mixed area do not overlap, the synthesis unit 605 synthesizes the background component image by applying a logical sum operation to the component corresponding to the background and the pixel corresponding to the background, for example.
[0490]
In the initialization process executed at the beginning of the background component image synthesis process, the synthesis unit 605 stores an image in which all pixel values are 0 in the built-in frame memory, and in the background component image synthesis process it stores (overwrites) the background component image. Accordingly, 0 is stored as the pixel value in the pixels corresponding to the foreground area in the background component image output from the synthesis unit 605.
[0491]
FIG. 81 is a diagram illustrating an input image input to the foreground / background separator 105 and a foreground component image and a background component image output from the foreground / background separator 105.
[0492]
FIG. 81A is a schematic diagram of a displayed image, and FIG. 81B is a model diagram, corresponding to FIG. 81A, in which the pixels of one line, including pixels belonging to the foreground area, pixels belonging to the background area, and pixels belonging to the mixed area, are developed in the time direction.
[0493]
As shown in FIGS. 81A and 81B, the background component image output from the foreground / background separation unit 105 is composed of the pixels belonging to the background area and the background components included in the pixels of the mixed area.
[0494]
As shown in FIGS. 81A and 81B, the foreground component image output from the foreground / background separation unit 105 is composed of the pixels belonging to the foreground area and the foreground components included in the pixels of the mixed area.
[0495]
The pixel values of the pixels in the mixed region are separated into a background component and a foreground component by the foreground / background separation unit 105. The separated background components together with the pixels belonging to the background area constitute a background component image. The separated foreground components together with the pixels belonging to the foreground area constitute a foreground component image.
[0496]
Thus, in the foreground component image, the pixel value of the pixel corresponding to the background area is set to 0, and a meaningful pixel value is set to the pixel corresponding to the foreground area and the pixel corresponding to the mixed area. Similarly, in the background component image, the pixel value of the pixel corresponding to the foreground area is set to 0, and a meaningful pixel value is set to the pixel corresponding to the background area and the pixel corresponding to the mixed area.
[0497]
Next, a process performed by the separation unit 601 to separate the foreground components and the background components from the pixels belonging to the mixed area will be described.
[0498]
FIG. 82 is a model of an image showing foreground components and background components of two frames including a foreground corresponding to an object moving from left to right in the drawing. In the image model shown in FIG. 82, the foreground motion amount v is 4, and the number of virtual divisions is 4.
[0499]
In frame #n, the leftmost pixel and the fourteenth through eighteenth pixels from the left consist only of background components and belong to the background area. In frame #n, the second through fourth pixels from the left include a background component and a foreground component, and belong to the uncovered background area. In frame #n, the eleventh through thirteenth pixels from the left include a background component and a foreground component, and belong to the covered background area. In frame #n, the fifth through tenth pixels from the left consist of only the foreground components and belong to the foreground area.
[0500]
In frame # n + 1, the first through fifth pixels from the left and the eighteenth pixel from the left consist of only the background components, and belong to the background area. In frame # n + 1, the sixth through eighth pixels from the left include a background component and a foreground component, and belong to the uncovered background area. In frame # n + 1, the fifteenth through seventeenth pixels from the left include a background component and a foreground component, and belong to the covered background area. In frame # n + 1, the ninth through fourteenth pixels from the left consist of only the foreground components, and belong to the foreground area.
[0501]
FIG. 83 is a diagram illustrating a process of separating foreground components from pixels belonging to the covered background area. In FIG. 83, α1 to α18 are mixing ratios corresponding to the respective pixels in frame #n. In FIG. 83, the fifteenth through seventeenth pixels from the left belong to the covered background area.
[0502]
The pixel value C15 of the fifteenth pixel from the left in frame #n is expressed by equation (68).
[0503]

Here, α15 is the mixture ratio of the fifteenth pixel from the left in frame #n. P15 is the pixel value of the fifteenth pixel from the left in frame # n-1.
[0504]
Based on Expression (68), the sum f15 of the foreground components of the fifteenth pixel from the left in frame #n is expressed by Expression (69).
[0505]
f15 = C15-α15 ・ P15 (69)
[0506]
Similarly, the sum f16 of the foreground components of the sixteenth pixel from the left in frame #n is expressed by Expression (70), and the sum f17 of the foreground components of the seventeenth pixel from the left in frame #n is expressed by Expression (71).
[0507]
f16 = C16-α16 ・ P16 (70)
f17 = C17-α17 ・ P17 (71)
[0508]
In this way, the foreground component fc included in the pixel value C of the pixel belonging to the covered background area is calculated by Expression (72).
[0509]
fc = C-α ・ P (72)
P is the pixel value of the corresponding pixel in the previous frame.
[0510]
FIG. 84 is a diagram illustrating processing for separating foreground components from pixels belonging to the uncovered background area. In FIG. 84, α1 to α18 are mixing ratios corresponding to the pixels in the frame #n. In FIG. 84, the second through fourth pixels from the left belong to the uncovered background area.
[0511]
The pixel value C02 of the second pixel from the left in frame #n is expressed by Expression (73).
[0512]

Here, α2 is the mixture ratio of the second pixel from the left in frame #n. N02 is the pixel value of the second pixel from the left in frame # n + 1.
[0513]
Based on Expression (73), the sum f02 of the foreground components of the second pixel from the left in frame #n is expressed by Expression (74).
[0514]
f02 = C02-α2 ・ N02 (74)
[0515]
Similarly, the sum f03 of the foreground components of the third pixel from the left in frame #n is expressed by Expression (75), and the sum f04 of the foreground components of the fourth pixel from the left in frame #n is expressed by Expression (76).
[0516]
f03 = C03-α3 ・ N03 (75)
f04 = C04-α4 ・ N04 (76)
[0517]
In this way, the foreground component fu included in the pixel value C of the pixel belonging to the uncovered background area is calculated by Expression (77).
[0518]
fu = C-α ・ N (77)
N is the pixel value of the corresponding pixel in the next frame.
[0519]
As described above, the separation unit 601 can separate the foreground components and the background components from the pixels belonging to the mixed region, based on the information indicating the covered background region and the information indicating the uncovered background region, both included in the region information, and on the mixing ratio α for each pixel.
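Expressions (72) and (77) can be sketched in a few lines of Python. The function name `separate_mixed_pixel`, the string labels `"covered"` and `"uncovered"`, and the numeric values are illustrative assumptions and do not appear in this description. The sketch takes the background component to be α times the pixel value of the corresponding pixel in the previous frame (covered background area) or the next frame (uncovered background area), and the foreground component to be the remainder.

```python
def separate_mixed_pixel(c, alpha, prev_pixel, next_pixel, area):
    """Split a mixed-region pixel value c into (foreground, background).

    area is "covered" or "uncovered" (hypothetical labels).
    Covered background:   background = alpha * P, fc = C - alpha * P   (72)
    Uncovered background: background = alpha * N, fu = C - alpha * N   (77)
    """
    if area == "covered":
        background = alpha * prev_pixel
    elif area == "uncovered":
        background = alpha * next_pixel
    else:
        raise ValueError("area must be 'covered' or 'uncovered'")
    return c - background, background


# Illustrative values only: C = 100, alpha = 0.25, P = 80, N = 40.
fg_c, bg_c = separate_mixed_pixel(100.0, 0.25, 80.0, 40.0, "covered")
fg_u, bg_u = separate_mixed_pixel(100.0, 0.25, 80.0, 40.0, "uncovered")
print(fg_c, bg_c)
print(fg_u, bg_u)
```

In both cases the two returned values add up to the original pixel value, which is the property that allows the mixed pixel to be split without loss.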
[0520]
FIG. 85 is a block diagram illustrating an example of the configuration of the separation unit 601 that executes the processing described above. The image input to the separation unit 601 is supplied to the frame memory 621, and the region information indicating the covered background region and the uncovered background region and the mixture ratio α, supplied from the mixture ratio calculation unit 104, are input to the separation processing block 622.
[0521]
The frame memory 621 stores the input image in units of frames. When the frame to be processed is frame #n, the frame memory 621 stores frame #n-1, which is the frame immediately before frame #n, frame #n, and frame #n+1, which is the frame immediately after frame #n.
[0522]
The frame memory 621 supplies the pixels corresponding to the frame # n−1, the frame #n, and the frame # n + 1 to the separation processing block 622.
[0523]
The separation processing block 622 applies the calculations described with reference to FIGS. 83 and 84 to the pixel values of the corresponding pixels of frame #n-1, frame #n, and frame #n+1 supplied from the frame memory 621, based on the area information indicating the covered background area and the uncovered background area and on the mixing ratio α. It thereby separates the foreground components and the background components from the pixels belonging to the mixed region of frame #n, and supplies them to the frame memory 623.
[0524]
The separation processing block 622 includes an uncovered area processing unit 631, a covered area processing unit 632, a combining unit 633, and a combining unit 634.
[0525]
The multiplier 641 of the uncovered area processing unit 631 multiplies the pixel value of the pixel of frame #n+1 supplied from the frame memory 621 by the mixing ratio α and outputs the result to the switch 642. The switch 642 is closed when the pixel of frame #n supplied from the frame memory 621 (corresponding to the pixel of frame #n+1) belongs to the uncovered background area, and supplies the pixel value multiplied by the mixing ratio α, supplied from the multiplier 641, to the calculator 643 and the synthesis unit 634. The value obtained by multiplying the pixel value of the pixel of frame #n+1 by the mixing ratio α, output from the switch 642, is equal to the background component of the pixel value of the corresponding pixel of frame #n.
[0526]
The calculator 643 subtracts the background component supplied from the switch 642 from the pixel value of the pixel of frame #n supplied from the frame memory 621 to obtain the foreground component. The calculator 643 supplies the foreground component of the pixel of frame #n belonging to the uncovered background area to the synthesis unit 633.
[0527]
The multiplier 651 of the covered area processing unit 632 multiplies the pixel value of the pixel of frame #n-1 supplied from the frame memory 621 by the mixture ratio α and outputs the result to the switch 652. The switch 652 is closed when the pixel of frame #n supplied from the frame memory 621 (corresponding to the pixel of frame #n-1) belongs to the covered background area, and supplies the pixel value multiplied by the mixture ratio α, supplied from the multiplier 651, to the calculator 653 and the synthesis unit 634. The value obtained by multiplying the pixel value of the pixel of frame #n-1 by the mixture ratio α, output from the switch 652, is equal to the background component of the pixel value of the corresponding pixel of frame #n.
[0528]
The calculator 653 subtracts the background component supplied from the switch 652 from the pixel value of the pixel of frame #n supplied from the frame memory 621 to obtain the foreground component. The calculator 653 supplies the foreground component of the pixel of frame #n belonging to the covered background area to the synthesis unit 633.
[0529]
The synthesis unit 633 combines, for frame #n, the foreground components of the pixels belonging to the uncovered background area supplied from the calculator 643 and the foreground components of the pixels belonging to the covered background area supplied from the calculator 653, and supplies the result to the frame memory 623.
[0530]
The synthesis unit 634 combines, for frame #n, the background components of the pixels belonging to the uncovered background area supplied from the switch 642 and the background components of the pixels belonging to the covered background area supplied from the switch 652, and supplies the result to the frame memory 623.
[0531]
The frame memory 623 stores the foreground components and the background components of the pixels in the mixed area of the frame #n supplied from the separation processing block 622, respectively.
[0532]
The frame memory 623 outputs the stored foreground components of the pixels in the mixed area of frame #n and the stored background components of the pixels of the mixed area in frame #n.
[0533]
By using the mixture ratio α, which is a feature amount, it is possible to completely separate the foreground component and the background component included in the pixel value.
[0534]
The synthesizing unit 603 combines the foreground components of the pixels in the mixed area of frame #n output from the separating unit 601 with the pixels belonging to the foreground area to generate a foreground component image. The synthesizing unit 605 synthesizes the background component of the pixel in the mixed area of frame #n output from the separating unit 601 and the pixel belonging to the background area to generate a background component image.
[0535]
FIG. 86 is a diagram illustrating an example of the foreground component image and an example of the background component image corresponding to frame #n in FIG.
[0536]
FIG. 86A shows an example of the foreground component image corresponding to frame #n in FIG. Since the leftmost pixel and the fourteenth pixel from the left consist of only background components before the foreground and the background are separated, the pixel value is set to zero.
[0537]
The second through fourth pixels from the left belonged to the uncovered background area before the foreground and the background were separated; their background components are set to 0, and their foreground components are left as they are. The eleventh through thirteenth pixels from the left belonged to the covered background area before the foreground and the background were separated; their background components are set to 0, and their foreground components are left as they are. The fifth through tenth pixels from the left consist of only the foreground components and are left as they are.
[0538]
FIG. 86B shows an example of a background component image corresponding to frame #n in FIG. The leftmost pixel and the fourteenth pixel from the left are left as they are because they consisted only of the background components before the foreground and the background were separated.
[0539]
The second through fourth pixels from the left belonged to the uncovered background area before the foreground and the background were separated; their foreground components are set to 0, and their background components are left as they are. The eleventh through thirteenth pixels from the left belonged to the covered background area before the foreground and the background were separated; their foreground components are set to 0, and their background components are left as they are. Since the fifth through tenth pixels from the left consisted of only the foreground components before the foreground and the background were separated, their pixel values are set to 0.
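The construction of the two component images for one line of frame #n can be sketched as follows. This is a minimal illustration, assuming the 18-pixel area layout given above for frame #n (leftmost pixel and pixels 14 to 18 background, pixels 2 to 4 uncovered background, pixels 5 to 10 foreground, pixels 11 to 13 covered background). The helper names `area_of` and `component_images` and all pixel values and mixture ratios are hypothetical.

```python
def area_of(position):
    """Area label of the pixel at a 1-based position in the assumed layout."""
    if 2 <= position <= 4:
        return "uncovered"
    if 5 <= position <= 10:
        return "foreground"
    if 11 <= position <= 13:
        return "covered"
    return "background"  # leftmost pixel and pixels 14 to 18


def component_images(prev_line, cur_line, next_line, alphas):
    """Return (foreground component image, background component image)."""
    n = len(cur_line)
    fg_image = [0.0] * n  # both images start as all zeros and are overwritten
    bg_image = [0.0] * n
    for idx in range(n):
        area = area_of(idx + 1)
        c = cur_line[idx]
        if area == "foreground":
            fg_image[idx] = c
        elif area == "background":
            bg_image[idx] = c
        else:
            # Covered pixels refer to frame #n-1, uncovered pixels to frame #n+1.
            ref = prev_line[idx] if area == "covered" else next_line[idx]
            background = alphas[idx] * ref
            bg_image[idx] = background
            fg_image[idx] = c - background
    return fg_image, bg_image


# Made-up data: every pixel of frame #n is 100, its neighbours in time are 40.
cur = [100.0] * 18
prev = [40.0] * 18
nxt = [40.0] * 18
alphas = [0.5] * 18
fg, bg = component_images(prev, cur, nxt, alphas)
print(fg)
print(bg)
```

Pixels outside the foreground and mixed areas keep the initial 0 in the foreground component image, and pixels outside the background and mixed areas keep the initial 0 in the background component image.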
[0540]
Next, foreground / background separation processing by the foreground / background separation unit 105 will be described with reference to a flowchart shown in FIG. In step S601, the frame memory 621 of the separation unit 601 obtains an input image, and stores frame #n, which is the target of the foreground / background separation, together with the preceding frame #n-1 and the subsequent frame #n+1.
[0541]
In step S602, the separation processing block 622 of the separation unit 601 acquires the region information supplied from the mixture ratio calculation unit 104. In step S603, the separation processing block 622 of the separation unit 601 acquires the mixture ratio α supplied from the mixture ratio calculation unit 104.
[0542]
In step S604, the uncovered area processing unit 631 extracts a background component from the pixel values of the pixels belonging to the uncovered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0543]
In step S605, the uncovered area processing unit 631 extracts the foreground components from the pixel values of the pixels belonging to the uncovered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0544]
In step S606, the covered area processing unit 632 extracts a background component from the pixel values of the pixels belonging to the covered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0545]
In step S607, the covered area processing unit 632 extracts the foreground components from the pixel values of the pixels belonging to the covered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0546]
In step S608, the synthesis unit 633 combines the foreground components of the pixels belonging to the uncovered background area extracted in step S605 and the foreground components of the pixels belonging to the covered background area extracted in step S607. The synthesized foreground components are supplied to the synthesis unit 603. Further, the synthesis unit 603 combines the pixels belonging to the foreground area supplied via the switch 602 with the foreground components supplied from the separation unit 601 to generate a foreground component image.
[0547]
In step S609, the synthesis unit 634 combines the background components of the pixels belonging to the uncovered background area extracted in step S604 and the background components of the pixels belonging to the covered background area extracted in step S606. The synthesized background components are supplied to the synthesis unit 605. Furthermore, the synthesis unit 605 combines the pixels belonging to the background area supplied via the switch 604 with the background components supplied from the separation unit 601 to generate a background component image.
[0548]
In step S610, the synthesis unit 603 outputs the foreground component image. In step S611, the synthesis unit 605 outputs a background component image, and the process ends.
[0549]
As described above, the foreground / background separation unit 105 can separate the foreground components and the background components from the input image based on the region information and the mixture ratio α, and can output a foreground component image consisting only of the foreground components and a background component image consisting only of the background components.
[0550]
Next, the removal of motion blur from the foreground component image will be described.
[0551]
FIG. 88 is a block diagram illustrating an example of the configuration of the motion blur removal unit 106. The motion vector and its position information supplied from the motion detection unit 102 and the region information supplied from the region specifying unit 103 are supplied to the processing unit determination unit 801 and the modeling unit 802. The foreground component image supplied from the foreground / background separation unit 105 is supplied to the adding unit 804.
[0552]
The processing unit determination unit 801 generates a processing unit based on the motion vector, its position information, and the region information, and supplies the generated processing unit to the modeling unit 802 together with the motion vector. The processing unit determination unit 801 also supplies the generated processing unit to the adding unit 804.
[0553]
The processing unit generated by the processing unit determination unit 801 indicates, as shown in FIG. 89, the consecutive pixels lined up in the direction of motion, starting from the pixel corresponding to the covered background area of the foreground component image and ending at the pixel corresponding to the uncovered background area, or the consecutive pixels lined up in the direction of motion, starting from the pixel corresponding to the uncovered background area and ending at the pixel corresponding to the covered background area. The processing unit consists of, for example, two pieces of data: an upper left point (the position of the leftmost or uppermost pixel on the image among the pixels specified by the processing unit) and a lower right point.
[0554]
The modeling unit 802 executes modeling based on the motion vector and the input processing unit. More specifically, for example, the modeling unit 802 may store in advance a plurality of models corresponding to the number of pixels included in the processing unit, the number of virtual divisions of the pixel value in the time direction, and the number of foreground components for each pixel, and may select a model that specifies the correspondence between the pixel values and the foreground components, as shown in FIG. 90, based on the processing unit and the number of virtual divisions of the pixel value in the time direction.
[0555]
For example, when the number of pixels corresponding to the processing unit is 12 and the amount of motion v within the shutter time is 5, the modeling unit 802 sets the number of virtual divisions to 5 and selects a model consisting of eight foreground components as a whole, in which the leftmost pixel contains one foreground component, the second pixel from the left contains two foreground components, the third pixel from the left contains three foreground components, the fourth pixel from the left contains four foreground components, the fifth through eighth pixels from the left each contain five foreground components, the ninth pixel from the left contains four foreground components, the tenth pixel from the left contains three foreground components, the eleventh pixel from the left contains two foreground components, and the twelfth pixel from the left contains one foreground component.
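The per-pixel counts in this example follow a simple pattern, which the sketch below reproduces. It assumes, as an inference from the example rather than a statement in this description, that a processing unit of `num_pixels` pixels with motion amount `v` has `num_pixels - v + 1` foreground components and that each component is spread over `v` consecutive pixels. The function name is hypothetical.

```python
def foreground_component_counts(num_pixels, v):
    """Return (number of foreground components, components contained per pixel).

    Assumption: component i (1-based) contributes to pixels i .. i+v-1, so
    pixel j contains the components max(1, j-v+1) .. min(j, num_foreground).
    """
    num_foreground = num_pixels - v + 1
    counts = []
    for j in range(1, num_pixels + 1):
        first = max(1, j - v + 1)
        last = min(j, num_foreground)
        counts.append(last - first + 1)
    return num_foreground, counts


num_foreground, counts = foreground_component_counts(12, 5)
print(num_foreground)
print(counts)
```

For 12 pixels and v = 5 this yields eight foreground components and the counts 1, 2, 3, 4, 5, 5, 5, 5, 4, 3, 2, 1, matching the model described above.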
[0556]
Note that, instead of selecting from models stored in advance, the modeling unit 802 may generate a model based on the motion vector and the processing unit when the motion vector and the processing unit are supplied.
[0557]
The modeling unit 802 supplies the selected model to the equation generation unit 803.
[0558]
The equation generation unit 803 generates an equation based on the model supplied from the modeling unit 802. With reference to the model of the foreground component image shown in FIG. 90, the equation generated by the equation generation unit 803 when the number of foreground components is 8, the number of pixels corresponding to the processing unit is 12, the amount of motion v is 5, and the number of virtual divisions is 5 will be described.
[0559]
When the foreground components corresponding to the shutter time / v included in the foreground component image are F01 / v to F08 / v, the relationship between F01 / v to F08 / v and the pixel values C01 to C12 is expressed by Expressions (78) to (89).
[0560]

[0561]
The equation generation unit 803 generates further equations by transforming the generated equations. The equations generated by the equation generation unit 803 are shown in Expressions (90) to (101).

[0562]
Expressions (90) to (101) can also be expressed as Expression (102).
[0563]
[Expression 17]
$$C_j = \sum_{i=1}^{8} a_{ij} \cdot F_i / v \qquad (102)$$
In Expression (102), j indicates the position of the pixel. In this example, j has any one value of 1 to 12. i indicates the position of the foreground value. In this example, i has any one value of 1 to 8. aij has a value of 0 or 1 corresponding to the values of i and j.
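The role of the coefficients aij can be illustrated with a short sketch that builds them and evaluates the pixel values. Which coefficients are 1 depends on the model of FIG. 90, which is not reproduced in this text; the sketch assumes that foreground value i contributes to the v consecutive pixels i to i+v-1. The function names and the foreground values are hypothetical.

```python
def build_a(num_pixels, v):
    """Coefficient table a[i][j] (0-based) for a processing unit.

    Assumption: foreground value i contributes to v consecutive pixels,
    which gives num_pixels - v + 1 foreground values in total.
    """
    num_foreground = num_pixels - v + 1
    a = [[0] * num_pixels for _ in range(num_foreground)]
    for i in range(num_foreground):
        for j in range(i, i + v):
            a[i][j] = 1
    return a


def pixel_values(a, f, v):
    """C_j = sum over i of a_ij * F_i / v, evaluated for every pixel j."""
    return [sum(a[i][j] * f[i] for i in range(len(f))) / v
            for j in range(len(a[0]))]


a = build_a(12, 5)
f = [10, 20, 30, 40, 50, 60, 70, 80]  # illustrative foreground values F01..F08
c = pixel_values(a, f, 5)
print(c)
```

Each row of `a` contains exactly v ones, reflecting that one foreground value is smeared over v pixels during the shutter time.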
[0564]
When expressed in consideration of the error, Expression (102) can be expressed as Expression (103).
[0565]
[Expression 18]
$$C_j = \sum_{i=1}^{8} a_{ij} \cdot F_i / v + e_j \qquad (103)$$
In Expression (103), ej is an error included in the target pixel Cj.
[0566]
Expression (103) can be rewritten as Expression (104).
[0567]
[Expression 19]
$$e_j = C_j - \sum_{i=1}^{8} a_{ij} \cdot F_i / v \qquad (104)$$
[0568]
Here, in order to apply the method of least squares, an error sum of squares E is defined as shown in Expression (105).
[0569]
[Expression 20]
$$E = \sum_{j=1}^{12} e_j^{2} \qquad (105)$$
[0570]
In order to minimize the error, the partial derivative of the error sum of squares E with respect to the variable Fk should be 0. Fk is obtained so as to satisfy Expression (106).
[0571]
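[Expression 21]
$$\frac{\partial E}{\partial F_k} = 2 \cdot \sum_{j=1}^{12} e_j \cdot \frac{\partial e_j}{\partial F_k} = 2 \cdot \sum_{j=1}^{12} \left\{ \left( C_j - \sum_{i=1}^{8} a_{ij} \cdot F_i / v \right) \cdot \left( -a_{kj} / v \right) \right\} = 0 \qquad (106)$$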
[0572]
In Expression (106), since the motion amount v is a fixed value, Expression (107) can be derived.
[0573]
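[Expression 22]
$$\sum_{j=1}^{12} a_{kj} \cdot \left( C_j - \sum_{i=1}^{8} a_{ij} \cdot F_i / v \right) = 0 \qquad (107)$$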
[0574]
When Expression (107) is expanded and its terms are transposed, Expression (108) is obtained.
[0575]
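[Expression 23]
$$\sum_{j=1}^{12} \left( a_{kj} \cdot \sum_{i=1}^{8} a_{ij} \cdot F_i \right) = v \cdot \sum_{j=1}^{12} a_{kj} \cdot C_j \qquad (108)$$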
[0576]
Expression (108) is expanded into eight equations by substituting each of the integers 1 to 8 for k. The eight equations obtained can be expressed as a single matrix equation. This equation is called a normal equation.
[0577]
An example of a normal equation generated by the equation generation unit 803 based on such a method of least squares is shown in Equation (109).
[0578]
[Expression 24]

[0579]
When Expression (109) is expressed as A · F = v · C, C, A, and v are known, and F is unknown. A and v are known at the time of modeling, whereas C becomes known when the pixel values are input in the adding operation.
[0580]
By calculating the foreground component using a normal equation based on the method of least squares, the error included in the pixel C can be dispersed.
[0581]
The equation generation unit 803 supplies the normal equation generated in this way to the addition unit 804.
[0582]
The addition unit 804 sets the pixel value C included in the foreground component image to the matrix expression supplied from the equation generation unit 803 based on the processing unit supplied from the processing unit determination unit 801. The adding unit 804 supplies a matrix in which the pixel value C is set to the calculation unit 805.
[0583]
The calculation unit 805 calculates the foreground components Fi / v from which motion blur has been removed by processing based on a solution method such as the sweep-out method (Gauss-Jordan elimination), calculates Fi, the foreground pixel values from which motion blur has been removed, for each i that is any one of the integers from 0 to 8, and outputs a foreground component image from which motion blur has been removed, made up of the pixel values Fi, an example of which is shown in FIG.
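The path from the model to the deblurred values can be sketched end to end: build the normal equation A · F = v · C and solve it by Gauss-Jordan elimination. This is a minimal sketch under stated assumptions, not the implementation of the calculation unit 805. It assumes that foreground value i contributes to the v consecutive pixels i to i+v-1, it uses exact rational arithmetic from Python's `fractions` module to avoid rounding, and all function names and test values are hypothetical.

```python
from fractions import Fraction


def build_a(num_pixels, v):
    """a[i][j] = 1 if foreground value i contributes to pixel j (assumed model)."""
    num_foreground = num_pixels - v + 1
    return [[1 if i <= j < i + v else 0 for j in range(num_pixels)]
            for i in range(num_foreground)]


def normal_equation(a, c, v):
    """Return (A, b) with A[k][i] = sum_j a_kj * a_ij and b[k] = v * sum_j a_kj * C_j."""
    n, m = len(a), len(c)
    big_a = [[sum(a[k][j] * a[i][j] for j in range(m)) for i in range(n)]
             for k in range(n)]
    b = [v * sum(a[k][j] * c[j] for j in range(m)) for k in range(n)]
    return big_a, b


def gauss_jordan(big_a, b):
    """Solve big_a * x = b by Gauss-Jordan elimination (assumes a unique solution)."""
    n = len(b)
    rows = [[Fraction(x) for x in row] + [Fraction(rhs)]
            for row, rhs in zip(big_a, b)]
    for col in range(n):
        # Pick a row with a non-zero entry in this column and move it up.
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        pivot_value = rows[col][col]
        rows[col] = [x / pivot_value for x in rows[col]]
        # Clear the column in every other row.
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    return [row[n] for row in rows]


v = 5
a = build_a(12, v)
f_true = [10, 20, 30, 40, 50, 60, 70, 80]  # illustrative unblurred values
c = [Fraction(sum(a[i][j] * f_true[i] for i in range(8)), v) for j in range(12)]
big_a, b = normal_equation(a, c, v)
recovered = gauss_jordan(big_a, b)
print(big_a[0])
print([int(x) for x in recovered])
```

Because the blurred values in this usage example are generated without error from `f_true`, the least-squares solution reproduces `f_true` exactly. With noisy pixel values the same procedure would return the values that minimize the error sum of squares.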
[0584]
Note that, in the foreground component image from which motion blur has been removed shown in FIG. 91, F01 to F08 are set to C03 to C10, respectively, so that the position of the foreground component image with respect to the screen is not changed. They can, however, be made to correspond to arbitrary positions.
[0585]
For example, as illustrated in FIG. 92, when the number of pixels corresponding to the processing unit is 8 and the motion amount v is 4, the motion blur removal unit 106 generates the matrix equation represented by Expression (110).
[0586]
[Expression 25]

[0587]
In this way, the motion blur removal unit 106 generates a number of expressions corresponding to the length of the processing unit and calculates Fi, the pixel values in which the amount of motion blur is adjusted. Similarly, for example, when the number of pixels included in the processing unit is 100, expressions corresponding to 100 pixels are generated and Fi is calculated.
[0588]
As described above, the motion blur removal unit 106 generates expressions corresponding to the motion amount v and the processing unit, sets the pixel values of the foreground component image in the generated expressions, and calculates a foreground component image from which motion blur has been removed.
[0589]
Next, processing for removing motion blur included in the foreground component image by the motion blur removal unit 106 will be described with reference to the flowchart of FIG.
[0590]
In step S801, the processing unit determination unit 801 of the motion blur removal unit 106 generates a processing unit based on the motion vector and the region information, and supplies the generated processing unit to the modeling unit 802.
[0591]
In step S802, the modeling unit 802 of the motion blur removal unit 106 selects or generates a model corresponding to the motion amount v and the processing unit. In step S803, the equation generation unit 803 creates a normal equation based on the selected model.
[0592]
In step S804, the adding unit 804 sets the pixel values of the foreground component image in the created normal equation. In step S805, the adding unit 804 determines whether or not the pixel values of all the pixels corresponding to the processing unit have been set. If it is determined that the pixel values of all the pixels corresponding to the processing unit have not been set, the process returns to step S804, and the process of setting the pixel values in the normal equation is repeated.
[0593]
If it is determined in step S805 that the pixel values of all the pixels in the processing unit have been set, the process advances to step S806, and the calculation unit 805 calculates a normal equation in which the pixel values supplied from the addition unit 804 are set. Based on this, the foreground pixel value from which motion blur is removed is calculated, and the process ends.
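The flow of steps S801 to S806 can be illustrated with a small sketch. Equation (110) is not reproduced in this text, so the blur model used below is an assumption: each observed pixel is taken to be the sum of the components Fi/v of up to v adjacent foreground pixels. All function names are hypothetical, and the elimination routine is one possible way to solve the normal equation.

```python
def blur_normal_equation(observed, v):
    """Build the normal equation a * F = b for the assumed blur model.

    Assumed model: observed[j] is the sum of F[i] / v over every i with
    i <= j < i + v, so len(observed) == len(F) + v - 1.
    Both sides are scaled by v * v so that the matrix entries are integers.
    """
    n = len(observed) - v + 1  # number of unknown foreground values F[i]
    if v < 1 or n < 1:
        raise ValueError("need v >= 1 and at least v observed pixels")
    a = [[0.0] * n for _ in range(n)]
    b = [0.0] * n
    for j, c in enumerate(observed):
        # Unknowns that contribute a component F[i] / v to observed pixel j.
        contributors = range(max(0, j - v + 1), min(j, n - 1) + 1)
        for p in contributors:
            b[p] += v * c
            for q in contributors:
                a[p][q] += 1.0
    return a, b


def solve_linear_system(a, b):
    """Solve a * x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system")
        m[col], m[pivot] = m[pivot], m[col]
        scale = m[col][col]
        m[col] = [x / scale for x in m[col]]
        for r in range(n):
            if r != col:
                factor = m[r][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[r][n] for r in range(n)]


def remove_motion_blur(observed, v):
    """Estimate blur-free foreground values F from blurred pixel values."""
    a, b = blur_normal_equation(observed, v)
    return solve_linear_system(a, b)
```

Under this assumed model, 8 observed pixels and v = 4 give a 5 × 5 system. Whether that system coincides with Equation (110) cannot be confirmed from this text.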
[0594]
As described above, the motion blur removal unit 106 can remove motion blur from the foreground image including motion blur based on the motion vector and the region information.
[0595]
That is, motion blur included in the pixel value that is sample data can be removed.
[0596]
Next, correction of the background component image by the correction unit 107 will be described.
[0597]
FIG. 94 is a diagram illustrating an example of a background component image model corresponding to the foreground component image model illustrated in FIG. 90.
[0598]
As shown in FIG. 94, because the foreground components have been removed, the pixel value of a pixel of the background component image corresponding to the mixed region of the original input image is composed of fewer background components, in accordance with the mixing ratio α, than the pixel value of a pixel corresponding to the background region of the original input image.
[0599]
For example, in the background component image shown in FIG. 94, the pixel value C01 is composed of four background components B02/V, the pixel value C02 is composed of three background components B03/V, the pixel value C03 is composed of two background components B04/V, and the pixel value C04 is composed of one background component B05/V.
[0600]
In the background component image shown in FIG. 94, the pixel value C09 is composed of one background component B10/V, the pixel value C10 is composed of two background components B11/V, the pixel value C11 is composed of three background components B12/V, and the pixel value C12 is composed of four background components B13/V.
[0601]
Thus, since the pixel values of the pixels corresponding to the mixed region of the original input image are composed of fewer background components than the pixels corresponding to the background region of the original input image, the image corresponding to the mixed region of the foreground component image becomes, for example, darker than the image of the background region.
[0602]
The correction unit 107 corrects the pixel values of the pixels corresponding to the mixed region of the background component image by multiplying each of them by a constant corresponding to the mixing ratio α.
[0603]
For example, when the background component image shown in FIG. 94 is input, the correction unit 107 multiplies the pixel value C01 by 5/4, the pixel value C02 by 5/3, the pixel value C11 by 5/3, and the pixel value C12 by 5/4. In order to match the pixel positions with the foreground component image from which motion blur has been removed, shown in FIG. 91, the correction unit 107 sets the pixel values C03 to C11 to 0.
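The gains 5/4 and 5/3 in this example are consistent with multiplying a mixed-region pixel by v divided by the number of background components it contains, with v = 5. That value of v is inferred from the fractions and is not stated in this passage. A minimal sketch under that assumption, with a hypothetical function name:

```python
def correct_mixed_pixel(value, v, n_background):
    """Scale a mixed-region background pixel back to full background level.

    Assumption: the pixel holds n_background components of the form B / v,
    so multiplying by v / n_background restores the level B.
    """
    if not 0 < n_background <= v:
        raise ValueError("n_background must be between 1 and v")
    return value * v / n_background
```

For a background level of 100 and v = 5, a pixel made of four components holds 80, and `correct_mixed_pixel(80.0, 5, 4)` returns 100.0. The sketch does not cover setting C03 to C11 to 0.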
[0604]
The correcting unit 107 outputs a background component image in which the pixel values of the pixels corresponding to the mixed region are corrected as shown in FIG.
[0605]
As described above, the correction unit 107 corrects the pixel value of the pixel corresponding to the mixed region of the background component image and aligns the pixel position with the foreground component image from which the motion blur is removed.
[0606]
FIG. 96 is a block diagram showing a configuration of the motion deblurred image processing unit 108 that generates a coefficient set used in the class classification adaptive processing for generating a higher resolution image in the spatial direction. For example, the motion deblurred image processing unit 108 having the configuration shown in FIG. 96 generates a coefficient set used in the class classification adaptive process for generating an HD image from an SD image based on the input HD image.
[0607]
The background component teacher image frame memory 1001 stores the corrected background component image of the teacher image supplied from the correction unit 107. The background component teacher image frame memory 1001 supplies the stored background component image of the teacher image to the weighted average unit 1003-1 and the learning unit 1006-1.
[0608]
The foreground component teacher image frame memory 1002 stores the foreground component image supplied from the motion blur removal unit 106 from which motion blur of the teacher image has been removed. The foreground component teacher image frame memory 1002 supplies the stored foreground component images of the teacher image to the weighted average unit 1003-2 and the learning unit 1006-2.
[0609]
The weighted average unit 1003-1 generates an SD image, which is a student image, by, for example, quarter weighted averaging the background component image of the teacher image, which is an HD image, supplied from the background component teacher image frame memory 1001, and supplies the generated SD image to the background component student image frame memory 1004.
[0610]
For example, as shown in FIG. 97, the weighted average unit 1003-1 takes four pixels of 2 × 2 (horizontal × vertical) of the teacher image (indicated by white circles in the figure) as one unit, adds the pixel values of the four pixels of each unit, and divides the sum by 4. The weighted average unit 1003-1 then sets the result of this quarter weighted average as the pixel of the student image (indicated by a black circle in the figure) located at the center of each unit.
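The quarter weighted average described here can be sketched as follows. The image is represented as a list of rows, and the requirement of even height and width is an assumption made for simplicity; the function name is hypothetical.

```python
def quarter_average(teacher):
    """Average each non-overlapping 2 x 2 block of the teacher image.

    teacher: list of rows of pixel values (even height and width assumed).
    Returns the student image, half the size in each direction.
    """
    rows, cols = len(teacher), len(teacher[0])
    if rows % 2 or cols % 2:
        raise ValueError("teacher image must have even height and width")
    student = []
    for r in range(0, rows, 2):
        out_row = []
        for c in range(0, cols, 2):
            # Add the four pixel values of the unit and divide the sum by 4.
            total = (teacher[r][c] + teacher[r][c + 1]
                     + teacher[r + 1][c] + teacher[r + 1][c + 1])
            out_row.append(total / 4)
        student.append(out_row)
    return student
```

A 4 × 4 teacher image thus yields a 2 × 2 student image, one student pixel per unit.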
[0611]
The background component student image frame memory 1004 stores a student image corresponding to the background component image of the teacher image supplied from the weighted average unit 1003-1. The background component student image frame memory 1004 supplies the stored student image corresponding to the background component image of the teacher image to the learning unit 1006-1.
[0612]
The weighted average unit 1003-2 generates an SD image, which is a student image, by, for example, quarter weighted averaging the foreground component image of the teacher image, which is an HD image, supplied from the foreground component teacher image frame memory 1002, and supplies the generated SD image to the foreground component student image frame memory 1005.
[0613]
The foreground component student image frame memory 1005 stores a student image, which is an SD image, corresponding to the foreground component image of the teacher image supplied from the weighted average unit 1003-2. The foreground component student image frame memory 1005 supplies the stored student image corresponding to the foreground component image of the teacher image to the learning unit 1006-2.
[0614]
The learning unit 1006-1 generates a coefficient set corresponding to the background component image, based on the background component image of the teacher image supplied from the background component teacher image frame memory 1001 and the student image, supplied from the background component student image frame memory 1004, corresponding to the background component image of the teacher image, and supplies the generated coefficient set to the coefficient set memory 1007.
[0615]
The learning unit 1006-2 generates a coefficient set corresponding to the foreground component image, based on the foreground component image of the teacher image supplied from the foreground component teacher image frame memory 1002 and the student image, supplied from the foreground component student image frame memory 1005, corresponding to the foreground component image of the teacher image, and supplies the generated coefficient set to the coefficient set memory 1007.
[0616]
The coefficient set memory 1007 stores a coefficient set corresponding to the background component image supplied from the learning unit 1006-1 and a coefficient set corresponding to the foreground component image supplied from the learning unit 1006-2.
[0617]
Hereinafter, the learning unit 1006-1 and the learning unit 1006-2 are simply referred to as a learning unit 1006 when it is not necessary to distinguish them individually.
[0618]
FIG. 98 is a block diagram illustrating a configuration of the learning unit 1006.
[0619]
The class classification unit 1031 includes a class tap acquisition unit 1051 and a waveform classification unit 1052, and classifies the pixel of interest (the target pixel) of the input student image. The class tap acquisition unit 1051 acquires a predetermined number of class taps, which are pixels of the student image corresponding to the target pixel, and supplies the acquired class taps to the waveform classification unit 1052.
[0620]
For example, in FIG. 97, when the pixel of the student image that is i-th from the top and j-th from the left (indicated by a black circle in the figure) is denoted X_ij, the class tap acquisition unit 1051 acquires a class tap composed of a total of nine pixels: the eight pixels X_{(i-1)(j-1)}, X_{(i-1)j}, X_{(i-1)(j+1)}, X_{i(j-1)}, X_{i(j+1)}, X_{(i+1)(j-1)}, X_{(i+1)j}, X_{(i+1)(j+1)} adjacent to the target pixel X_ij on the upper left, top, upper right, left, right, lower left, bottom, and lower right, plus the target pixel itself. The class tap is supplied to the waveform classification unit 1052.
[0621]
In this case, the class tap is a square block of 3 × 3 pixels, but the class tap does not have to be square; it can be, for example, a rectangle, a cross, or any other shape. Also, the number of pixels constituting the class tap is not limited to the nine pixels of a 3 × 3 block.
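A 3 × 3 class tap of this kind can be gathered as in the sketch below. Indices are 0-based here, and clamping coordinates at the image border is an assumption, since the text does not say how border pixels are handled. The function name is hypothetical.

```python
def class_tap_3x3(student, i, j):
    """Return the 9 pixels of the 3 x 3 block centered on student[i][j].

    Order: left to right, top to bottom. Coordinates outside the image are
    clamped to the nearest valid pixel (an assumption, not from the text).
    """
    rows, cols = len(student), len(student[0])
    tap = []
    for di in (-1, 0, 1):
        for dj in (-1, 0, 1):
            r = min(max(i + di, 0), rows - 1)
            c = min(max(j + dj, 0), cols - 1)
            tap.append(student[r][c])
    return tap
```

A rectangular or cross-shaped tap would only change the list of offsets visited.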
[0622]
The waveform classification unit 1052 performs a class classification process that classifies the input signal into several classes based on the characteristics thereof, and classifies the pixel of interest into one class based on the class tap. For example, the waveform classification unit 1052 classifies the target pixel into one of 512 classes, and supplies a class number corresponding to the classified class to the prediction tap acquisition unit 1032.
[0623]
Here, the class classification process will be briefly described.
[0624]
Now, for example, as shown in FIG. 99(A), suppose that a class tap of 2 × 2 pixels is constituted by a certain pixel of interest and three pixels adjacent to it, and that each pixel is expressed by 1 bit (takes a level of 0 or 1). In this case, the 2 × 2 four-pixel block including the pixel of interest can be classified into 16 (= (2¹)⁴) patterns according to the level distribution of the pixels, as shown in FIG. 99(B). The pixel of interest can therefore be classified into 16 patterns. Such pattern classification is the class classification process, and it is performed in the class classification unit 1031.
[0625]
The class classification process can also take into account the activity of the image (class tap), that is, the complexity of the image or the intensity of its change.
[0626]
Normally, about 8 bits, for example, are assigned to each pixel. Further, in the present embodiment, as described above, the class tap is composed of the nine pixels of a 3 × 3 block. Therefore, if class classification were performed directly on such a class tap, it would be classified into a huge number of classes, (2⁸)⁹.
[0627]
Therefore, in the present embodiment, the waveform classification unit 1052 performs ADRC processing on the class tap, thereby reducing the number of classes by reducing the number of bits of the pixels constituting the class tap.
[0628]
To simplify the description, consider a class tap composed of four pixels arranged on a straight line, as shown in FIG. 100(A). In the ADRC processing, the maximum value MAX and the minimum value MIN of the pixel values are detected. Then, DR = MAX - MIN is taken as the local dynamic range of the block constituted by the class tap, and the pixel values of the pixels constituting the class tap block are requantized to K bits based on this dynamic range DR.
[0629]
That is, the minimum value MIN is subtracted from each pixel value in the block, and the result is divided by DR/2^K. The quotient is then converted into a corresponding code (ADRC code). Specifically, for example, when K = 2, as shown in FIG. 100(B), it is determined to which of the ranges obtained by dividing the dynamic range DR into 4 (= 2²) equal parts the quotient belongs. When the quotient belongs to the lowest range, the second range from the bottom, the third range from the bottom, or the highest range, it is encoded into 2 bits as 00B, 01B, 10B, or 11B, respectively (B denotes a binary number). On the decoding side, the ADRC code 00B, 01B, 10B, or 11B is converted into the center value L₀₀ of the lowest range obtained by dividing the dynamic range DR into four equal parts, the center value L₀₁ of the second range from the bottom, the center value L₁₀ of the third range from the bottom, or the center value L₁₁ of the highest range, respectively, and decoding is performed by adding the minimum value MIN to that value.
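The requantization just described can be sketched as follows. Two details are assumptions not stated in the text: a pixel equal to MAX is clamped into the top range, and a flat block (DR = 0) is encoded as all zeros. Packing the codes into a single class number is also only one possible convention, and all function names are hypothetical.

```python
def adrc_encode(tap, k):
    """Requantize each pixel of the class tap to k bits using DR = MAX - MIN."""
    lo, hi = min(tap), max(tap)
    dr = hi - lo
    if dr == 0:
        return [0] * len(tap), lo, dr  # flat block: assumed to map to code 0
    levels = 1 << k                    # 2**k equal ranges
    step = dr / levels                 # width of one range, DR / 2**k
    # Subtract MIN, divide by DR / 2**k, and clamp MAX into the top range.
    codes = [min(int((p - lo) / step), levels - 1) for p in tap]
    return codes, lo, dr


def adrc_decode(codes, lo, dr, k):
    """Map each code to the center of its range and add MIN back."""
    step = dr / (1 << k)
    return [lo + (c + 0.5) * step for c in codes]


def class_number(codes, k):
    """Pack the k-bit codes of a tap into one integer (assumed convention)."""
    n = 0
    for c in codes:
        n = (n << k) | c
    return n
```

For the tap [10, 50, 90, 130] with K = 2, the codes are 00B, 01B, 10B, 11B and decode to 25, 55, 85, 115. With K = 1 on a nine-pixel tap there are 2⁹ = 512 possible class numbers, which matches the 512 classes mentioned earlier, although the text does not state which K is used.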
[0630]
Here, such ADRC processing is called non-edge matching.
[0631]
The details of the ADRC processing are disclosed in, for example, Japanese Patent Application Laid-Open No. 3-53778 filed by the applicant of the present application.
[0632]
By performing ADRC processing that performs requantization with a smaller number of bits than the number of bits allocated to the pixels constituting the class tap, the number of classes can be reduced as described above. This is performed in the waveform classification unit 1052.
[0633]
In the present embodiment, the waveform classification unit 1052 performs the class classification process based on the ADRC code. However, the class classification process may also be performed on data that has undergone, for example, DPCM (predictive coding), BTC (Block Truncation Coding), VQ (vector quantization), DCT (discrete cosine transform), or Hadamard transform.
[0634]
Based on the class number, the prediction tap acquisition unit 1032 acquires, from the pixels of the student image, a prediction tap corresponding to the class, which is the unit used to calculate the predicted value of the original image (teacher image), and supplies the acquired prediction tap and the class number to the corresponding pixel acquisition unit 1033.
[0635]
For example, in FIG. 97, let the pixel values of the four pixels of the 2 × 2 block of the original image (teacher image) centered on the pixel X_ij of the student image (indicated by a black circle in the figure) be denoted, from left to right and from top to bottom, Y_ij(1), Y_ij(2), Y_ij(3), Y_ij(4). To calculate the coefficients necessary for calculating the predicted values of the pixels Y_ij(1) to Y_ij(4), the prediction tap acquisition unit 1032 acquires, for example, a square prediction tap composed of the nine pixels X_{(i-1)(j-1)}, X_{(i-1)j}, X_{(i-1)(j+1)}, X_{i(j-1)}, X_ij, X_{i(j+1)}, X_{(i+1)(j-1)}, X_{(i+1)j}, X_{(i+1)(j+1)} of the 3 × 3 block centered on the pixel X_ij of the student image.
[0636]
Specifically, for example, to calculate the coefficients necessary for calculating the predicted values of the four pixels Y₃₃(1) to Y₃₃(4) in the teacher image, which are surrounded by a rectangle in FIG., the prediction tap is composed of the pixels X₂₂, X₂₃, X₂₄, X₃₂, X₃₃, X₃₄, X₄₂, X₄₃, X₄₄ (the pixel of interest in this case is X₃₃).
[0637]
Based on the prediction tap and the class number, the corresponding pixel acquisition unit 1033 acquires the pixel values of the pixels of the teacher image corresponding to the pixel values to be predicted, and supplies the prediction tap, the class number, and the acquired pixel values of the teacher image pixels corresponding to the pixel values to be predicted to the normal equation generation unit 1034.
[0638]
For example, when calculating the coefficients necessary for calculating the predicted values of the four pixels Y₃₃(1) to Y₃₃(4) in the teacher image, the corresponding pixel acquisition unit 1033 acquires the pixel values of the pixels Y₃₃(1) to Y₃₃(4) as the pixels of the teacher image corresponding to the pixel values to be predicted.
[0639]
Based on the prediction tap, the class number, and the acquired pixel values to be predicted, the normal equation generation unit 1034 generates a normal equation for calculating the coefficient set used in the adaptive process, corresponding to the relationship between the prediction tap and the pixel values to be predicted, and supplies the generated normal equation to the coefficient calculation unit 1035 together with the class number.
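The correspondence between a student pixel and its four teacher pixels can be sketched as below. It assumes 0-based indices and the non-overlapping 2 × 2 units of the quarter weighted average, so that student pixel (i, j) corresponds to teacher rows 2i and 2i + 1 and columns 2j and 2j + 1. The function name is hypothetical.

```python
def teacher_block(teacher, i, j):
    """Return [Y(1), Y(2), Y(3), Y(4)] for student pixel (i, j), 0-based.

    The four teacher pixels are listed left to right, top to bottom.
    Assumes each student pixel was made from one non-overlapping 2 x 2 unit.
    """
    r, c = 2 * i, 2 * j
    return [teacher[r][c], teacher[r][c + 1],
            teacher[r + 1][c], teacher[r + 1][c + 1]]
```

Each of the four positions Y(1) to Y(4) would then be paired with the same nine-pixel prediction tap during learning.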
[0640]
The coefficient calculation unit 1035 solves the normal equation supplied from the normal equation generation unit 1034 and calculates a coefficient set used in the adaptive process corresponding to the classified class. The coefficient calculation unit 1035 supplies the calculated coefficient set to the coefficient set memory 1007 together with the class number.
[0641]
The normal equation generation unit 1034 may generate a matrix corresponding to such a normal equation, and the coefficient calculation unit 1035 may calculate a coefficient set based on the generated matrix.
[0642]
Here, the adaptation process will be described.
[0643]
Consider, for example, obtaining the predicted value E[y] of the pixel value y of the teacher image by a linear first-order combination model defined by the linear combination of the pixel values x₁, x₂, ... of several surrounding pixels (hereinafter referred to as student data, as appropriate) and predetermined prediction coefficients w₁, w₂, .... In this case, the predicted value E[y] can be expressed by the following equation.
[0644]
E[y] = w₁x₁ + w₂x₂ + ... (111)
[0645]
To generalize this, if a matrix W consisting of the set of prediction coefficients w, a matrix X consisting of the set of student data, and a matrix Y′ consisting of the set of predicted values E[y] are defined as
[Equation 26]

then the following observation equation holds.
[0646]
XW = Y ′ (112)
[0647]
Next, consider applying the least squares method to this observation equation to obtain a predicted value E[y] close to the pixel value y of the original image. In this case, if a matrix Y consisting of the set of pixel values y of the original image (hereinafter referred to as teacher data, as appropriate) and a matrix E consisting of the set of residuals e of the predicted values E[y] with respect to the pixel values y of the original image are defined as
[Expression 27]

then, from Equation (112), the following residual equation holds.
[0648]
XW = Y + E (113)
[0649]
In this case, the prediction coefficient w_i for obtaining a predicted value E[y] close to the pixel value y of the original image can be obtained by minimizing the following square error.
[Expression 28]

[0650]
Therefore, the prediction coefficient w_i for which the derivative of the above square error with respect to w_i is 0, that is, the prediction coefficient w_i satisfying the following equation, is the optimum value for obtaining a predicted value E[y] close to the pixel value y of the original image.
[0651]
[Expression 29]

[0652]
Therefore, first, differentiating Equation (113) with respect to the prediction coefficient w_i yields the following equation.
[0653]
[30]

[0654]
From the expressions (114) and (115), the expression (116) is obtained.
[0655]
[31]

[0656]
Further, considering the relationship among the student data x, the prediction coefficient w, the teacher data y, and the residual e in the residual equation of Equation (113), the following normal equation can be obtained from Equation (116).
[0657]
[Expression 32]

[0658]
The normal equation of Expression (117) can be established by the same number as the number of prediction coefficients w to be obtained. Therefore, the optimal prediction coefficient w can be obtained by solving Expression (117). In solving the equation (117), for example, a sweeping method (Gauss-Jordan elimination method) or the like can be applied.
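The expression images for this derivation are not reproduced in this text. The following is a reconstruction from the definitions given in the surrounding paragraphs, assuming m samples and n prediction coefficients. The symbols m and n, the double-index notation x_{ij}, and the use of j and k as coefficient indices (to keep them distinct from the sample index i) are assumptions introduced here; this is a sketch of the standard least-squares derivation, not a copy of the original expressions.

```latex
% Matrices of the observation equation XW = Y' and the residual equation XW = Y + E
X = \begin{pmatrix}
x_{11} & x_{12} & \cdots & x_{1n} \\
x_{21} & x_{22} & \cdots & x_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
x_{m1} & x_{m2} & \cdots & x_{mn}
\end{pmatrix},\quad
W = \begin{pmatrix} w_1 \\ w_2 \\ \vdots \\ w_n \end{pmatrix},\quad
Y' = \begin{pmatrix} E[y_1] \\ E[y_2] \\ \vdots \\ E[y_m] \end{pmatrix},\quad
Y = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_m \end{pmatrix},\quad
E = \begin{pmatrix} e_1 \\ e_2 \\ \vdots \\ e_m \end{pmatrix}

% Square error to be minimized
\sum_{i=1}^{m} e_i^{2}

% Derivative of the square error with respect to w_j set to 0
e_1 \frac{\partial e_1}{\partial w_j} + e_2 \frac{\partial e_2}{\partial w_j}
  + \cdots + e_m \frac{\partial e_m}{\partial w_j} = 0
  \qquad (j = 1, 2, \ldots, n) \tag{114}

% From XW = Y + E, i.e. e_i = \sum_{k=1}^{n} x_{ik} w_k - y_i
\frac{\partial e_i}{\partial w_j} = x_{ij}
  \qquad (i = 1, 2, \ldots, m;\; j = 1, 2, \ldots, n) \tag{115}

% Substituting (115) into (114)
\sum_{i=1}^{m} e_i x_{ij} = 0 \qquad (j = 1, 2, \ldots, n) \tag{116}

% Substituting e_i = \sum_{k} x_{ik} w_k - y_i into (116): the normal equation
\sum_{k=1}^{n} \Bigl( \sum_{i=1}^{m} x_{ij} x_{ik} \Bigr) w_k
  = \sum_{i=1}^{m} x_{ij} y_i \qquad (j = 1, 2, \ldots, n) \tag{117}
```

Equation (117) gives n equations in the n unknowns w_1, ..., w_n, which is why as many normal equations can be set up as there are prediction coefficients.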
[0659]
As described above, the adaptive process obtains the optimum prediction coefficient w for each class and then uses that prediction coefficient w in Equation (111) to obtain a predicted value E[y] close to the pixel value y of the teacher image.
[0660]
The normal equation generation unit 1034 generates a normal equation for calculating the optimal prediction coefficient w for each class, and the coefficient calculation unit 1035 calculates the prediction coefficient w based on the generated normal equation.
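The two roles described here, building up the normal equation and solving it, can be sketched for a single class as follows. The sums accumulated are those of the least-squares normal equation (products of student data on the left, products of student and teacher data on the right), and the solver uses the sweeping method (Gauss-Jordan elimination) named in the text. Function names and the singularity threshold are assumptions.

```python
def new_normal_equation(n):
    """Return an empty n x n matrix and right-hand side for one class."""
    return [[0.0] * n for _ in range(n)], [0.0] * n


def add_sample(a, b, x, y):
    """Add one (prediction tap x, teacher value y) pair to the sums."""
    n = len(x)
    for j in range(n):
        b[j] += x[j] * y             # sum of x_j * y
        for k in range(n):
            a[j][k] += x[j] * x[k]   # sum of x_j * x_k


def gauss_jordan_solve(a, b):
    """Solve a * w = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("normal equation is singular; more samples needed")
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        for r in range(n):
            if r != col:
                f = m[r][col]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[col])]
    return [m[r][n] for r in range(n)]
```

With samples generated from y = 2x₁ + 3x₂, the solver recovers the coefficients 2 and 3, provided there are enough independent samples for the matrix to be non-singular.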
[0661]
Note that the adaptive process differs from interpolation in that components that are included in the original image but not in the thinned image are reproduced. That is, as far as Equation (111) alone is concerned, the adaptive process is the same as interpolation using a so-called interpolation filter. However, because the prediction coefficients w, which correspond to the tap coefficients of the interpolation filter, are obtained by learning, so to speak, using the teacher data y, components included in the original image can be reproduced. For this reason, the adaptive process can be said to have an image-creating effect.
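Seen as code, Equation (111) is an ordinary weighted sum over the prediction tap. What distinguishes it from a fixed interpolation filter is that the weights are looked up per class from learned coefficient sets. A minimal sketch, in which the dictionary layout of the coefficient sets is an assumption and the function name is hypothetical:

```python
def predict(coefficient_sets, class_no, prediction_tap):
    """Evaluate E[y] = w1*x1 + w2*x2 + ... with the class's coefficients.

    coefficient_sets: mapping from class number to a list of coefficients w
    (an assumed storage layout for the learned coefficient sets).
    """
    w = coefficient_sets[class_no]
    if len(w) != len(prediction_tap):
        raise ValueError("coefficient set and prediction tap differ in length")
    return sum(wi * xi for wi, xi in zip(w, prediction_tap))
```

The same prediction tap can therefore produce different predicted values depending on the class assigned to the pixel of interest.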
[0662]
FIG. 101 is a diagram for explaining a coefficient set generated by the motion blur removal image processing unit 108 having the configuration shown in FIG. In the input image, the area specifying unit 103 specifies a foreground area, a background area, a covered background area, and an uncovered background area.
[0663]
The input image in which the region is specified and the mixture ratio α is detected by the mixture ratio calculation unit 104 is separated by the foreground / background separation unit 105 into a foreground component image and a background component image.
[0664]
The motion blur of the separated foreground component image is removed by the motion blur removal unit 106. The pixel values corresponding to the mixed region of the separated background component image are corrected by the correction unit 107 in accordance with the removal of motion blur from the foreground component image.
[0665]
The motion blur removal image processing unit 108 individually calculates a coefficient set corresponding to the foreground component image and a coefficient set corresponding to the background component image, based on the foreground component image from which motion blur has been removed and the corrected background component image.
[0666]
That is, the learning unit 1006-1 calculates a coefficient set corresponding to the background component image based on the separated and corrected background component image, and the learning unit 1006-2 calculates a coefficient set corresponding to the foreground component image based on the separated foreground component image from which motion blur has been removed.
[0667]
The coefficient set corresponding to the background component image is used to predict the pixel values of the image corresponding to the background component image in the class classification adaptive process, which predicts pixel values and is applied to the separated and corrected background component image.
[0668]
The coefficient set corresponding to the foreground component image is used to predict the pixel values of the image corresponding to the foreground component image in the class classification adaptive process, which predicts pixel values and is applied to the foreground component image separated from the input image and from which motion blur has been removed.
[0669]
Motion blur is added to the predicted image corresponding to the foreground component image. The predicted image corresponding to the background component image is corrected corresponding to the addition of motion blur of the foreground component image.
[0670]
The predicted image corresponding to the corrected background component image and the predicted image corresponding to the foreground component image to which motion blur is added are combined into one predicted image.
[0671]
A learning process for generating a coefficient set used for pixel value prediction by the class classification adaptive process by the motion deblurred image processing unit 108 shown in FIG. 96 will be described with reference to the flowchart of FIG.
[0672]
In step S1001, the weighted average unit 1003-1 and the weighted average unit 1003-2 generate a student image corresponding to the background component image and a student image corresponding to the foreground component image. That is, the weighted average unit 1003-1 performs, for example, a quarter weighted average of the background component image of the teacher image stored in the background component teacher image frame memory 1001, and generates a student image corresponding to the background component image of the teacher image.
[0673]
The weighted average unit 1003-2 performs, for example, a quarter weighted average of the foreground component image of the teacher image stored in the foreground component teacher image frame memory 1002, and generates a student image corresponding to the foreground component image of the teacher image.
[0674]
In step S1002, the learning unit 1006-1 generates a coefficient set corresponding to the background component image, based on the background component image of the teacher image stored in the background component teacher image frame memory 1001 and the student image, stored in the background component student image frame memory 1004, corresponding to the background component image of the teacher image. Details of the coefficient set generation processing in step S1002 will be described later with reference to the flowchart of FIG.
[0675]
In step S1003, the learning unit 1006-2 generates a coefficient set corresponding to the foreground component image, based on the foreground component image of the teacher image stored in the foreground component teacher image frame memory 1002 and the student image, stored in the foreground component student image frame memory 1005, corresponding to the foreground component image of the teacher image.
[0676]
In step S1004, the learning unit 1006-1 and the learning unit 1006-2 each output a coefficient set corresponding to the background component image or a coefficient set corresponding to the foreground component image to the coefficient set memory 1007. The coefficient set memory 1007 stores the coefficient set corresponding to the background component image or the coefficient set corresponding to the foreground component image, and the process ends.
[0677]
In this way, the motion blur removal image processing unit 108 having the configuration shown in FIG. 96 can generate a coefficient set corresponding to the background component image and a coefficient set corresponding to the foreground component image.
[0678]
Of course, the processing of step S1002 and step S1003 may be executed serially or in parallel.
[0679]
Next, processing for generating a coefficient set corresponding to the background component image executed by the learning unit 1006-1 and corresponding to the processing in step S1002 will be described with reference to the flowchart in FIG.
[0680]
In step S1021, the learning unit 1006-1 determines whether there is an unprocessed pixel in the student image corresponding to the background component image. If it is determined that there is an unprocessed pixel in the student image corresponding to the background component image, the process proceeds to step S1022, and the target pixel is acquired from the student image corresponding to the background component image in raster scan order.
[0681]
In step S1023, the class tap acquisition unit 1051 of the class classification unit 1031 acquires the class tap corresponding to the target pixel from the student image stored in the background component student image frame memory 1004. In step S1024, the waveform classification unit 1052 of the class classification unit 1031 applies ADRC processing to the class tap, thereby reducing the number of bits of the pixels constituting the class tap, and classifies the target pixel. In step S1025, the prediction tap acquisition unit 1032 acquires a prediction tap corresponding to the target pixel from the student image stored in the background component student image frame memory 1004 based on the classified class.
[0682]
In step S1026, the corresponding pixel acquisition unit 1033 acquires, based on the classified class, the pixel corresponding to the pixel value to be predicted from the background component image of the teacher image stored in the background component teacher image frame memory 1001.
[0683]
In step S1027, the normal equation generation unit 1034 adds the pixel values of the prediction tap pixels and the pixel value to be predicted to the matrix for each class, based on the classified class. The process then returns to step S1021, and the learning unit 1006-1 repeats the determination of whether there is an unprocessed pixel. The matrix for each class, to which the pixel values of the prediction tap pixels and the pixel value to be predicted are added, corresponds to the normal equation for calculating the coefficient set for each class.
[0684]
If it is determined in step S1021 that there are no unprocessed pixels in the student image, the process proceeds to step S1028, and the normal equation generation unit 1034 supplies the matrix for each class, in which the pixel values of the prediction tap pixels and the pixel values to be predicted are set, to the coefficient calculation unit 1035. The coefficient calculation unit 1035 calculates a coefficient set for each class corresponding to the background component image by solving the matrix for each class in which the pixel values of the prediction tap pixels and the pixel values to be predicted are set.
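The loop of steps S1021 to S1027 amounts to keeping one set of running sums per class. The sketch below assumes the samples have already been reduced to (class number, prediction tap, teacher pixel value) triples. That input format, the dictionary storage, and the function name are assumptions made for illustration.

```python
from collections import defaultdict


def accumulate_per_class(samples, n):
    """Add every sample into the matrix of its class.

    samples: iterable of (class_no, prediction_tap, teacher_value), where
    prediction_tap has n values. Returns {class_no: (a, b)} with
    a[j][k] = sum of x_j * x_k and b[j] = sum of x_j * y for that class.
    """
    sums = defaultdict(lambda: ([[0.0] * n for _ in range(n)], [0.0] * n))
    for class_no, tap, y in samples:
        a, b = sums[class_no]  # created on first use of the class
        for j in range(n):
            b[j] += tap[j] * y
            for k in range(n):
                a[j][k] += tap[j] * tap[k]
    return dict(sums)
```

Step S1028 then corresponds to solving each class's pair (a, b) for its coefficient set with any linear solver. Classes that received too few samples would yield a singular matrix and need separate handling, which the text does not discuss.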
[0685]
Note that the coefficient calculation unit 1035 is not limited to a coefficient set for predicting a pixel value by linear prediction, and may calculate a coefficient set for predicting a pixel value by nonlinear prediction.
[0686]
In step S1029, the coefficient calculation unit 1035 outputs the coefficient set for each class corresponding to the background component image to the coefficient set memory 1007, and the process ends.
[0687]
Thus, the learning unit 1006-1 can generate a coefficient set corresponding to the background component image.
[0688]
The process by which the learning unit 1006-2 generates a coefficient set corresponding to the foreground component image, corresponding to step S1003, is the same as the process described with reference to the flowchart of FIG. 103, except that it uses the foreground component image stored in the foreground component teacher image frame memory 1002 and the student image corresponding to the foreground component image stored in the foreground component student image frame memory 1005. Its description is therefore omitted.
[0689]
In this way, the motion blur removal image processing unit 108 having the configuration shown in FIG. 96 can individually generate a coefficient set corresponding to the corrected background component image and a coefficient set corresponding to the foreground component image from which motion blur has been removed.
[0690]
FIG. 104 is a block diagram illustrating a configuration of the motion blur removal image processing unit 108 that executes the class classification adaptation process and generates a higher resolution image in the spatial direction. For example, the motion deblurred image processing unit 108 having the configuration shown in FIG. 104 performs class classification adaptation processing based on an input image that is an SD image to generate an HD image.
[0691]
The background component image frame memory 1101 stores the corrected background component image supplied from the correction unit 107. The background component image frame memory 1101 supplies the stored background component image to the mapping unit 1103-1.
[0692]
The foreground component image frame memory 1102 stores the foreground component image from which motion blur has been removed, which is supplied from the motion blur removal unit 106. The foreground component image frame memory 1102 supplies the stored foreground component image to the mapping unit 1103-2.
[0693]
Based on the coefficient set corresponding to the background component image stored in the coefficient set memory 1104, the mapping unit 1103-1 generates, by the class classification adaptation process, a predicted image corresponding to the background component image stored in the background component image frame memory 1101. The mapping unit 1103-1 supplies the generated predicted image to the correction unit 1105.
[0694]
Corresponding to the motion blur added by the motion blur adding unit 1106, the correction unit 1105 either sets to 0 the pixel value of a predetermined pixel of the predicted image corresponding to the mixed region of the background component image, or divides the pixel value of a predetermined pixel of the predicted image by a predetermined value corresponding to the added motion blur. The correction unit 1105 supplies the predicted image corrected in this way to the synthesizing unit 1107.
[0695]
Based on the coefficient set corresponding to the foreground component image stored in the coefficient set memory 1104, the mapping unit 1103-2 generates, by the class classification adaptation process, a predicted image corresponding to the foreground component image stored in the foreground component image frame memory 1102. The mapping unit 1103-2 supplies the generated predicted image to the motion blur adding unit 1106.
[0696]
The motion blur adding unit 1106 adds motion blur to the predicted image by applying a desired motion blur adjustment amount v′, for example, a motion blur adjustment amount v′ that is half the motion amount v of the input image, or a motion blur adjustment amount v′ whose value is unrelated to the motion amount v. The motion blur adding unit 1106 calculates the foreground components Fi/v′ by dividing each pixel value Fi of the predicted image of the foreground component image from which motion blur has been removed by the motion blur adjustment amount v′, and sums the foreground components Fi/v′ to generate a pixel value to which motion blur is added.
[0697]
For example, when the predicted image shown in FIG. 105 is input and the motion blur adjustment amount v′ is 3, as shown in FIG. 106, the pixel value C02 is (F01)/v′, the pixel value C03 is (F01 + F02)/v′, the pixel value C04 is (F01 + F02 + F03)/v′, and the pixel value C05 is (F02 + F03 + F04)/v′.
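The arithmetic of this example can be sketched in a few lines. The sketch below assumes a single row of deblurred foreground pixel values F01, F02, ... moving across the row; the function name `add_motion_blur` and the handling of the trailing edge (partial sums mirroring the leading edge) are assumptions, since the example above lists only C02 through C05.

```python
def add_motion_blur(deblurred, v_adj):
    """Add motion blur to one row of deblurred foreground pixel values.

    deblurred: pixel values F01, F02, ... (blur already removed)
    v_adj:     motion blur adjustment amount v' (a positive integer)
    Each output value is the sum of up to v' consecutive components Fi / v'.
    """
    n = len(deblurred)
    blurred = []
    for j in range(n + v_adj - 1):
        first = max(0, j - v_adj + 1)   # oldest component still covering pixel j
        last = min(j, n - 1)            # newest component covering pixel j
        blurred.append(sum(deblurred[first:last + 1]) / v_adj)
    return blurred
```

With `deblurred = [F01, F02, F03, F04]` and `v_adj = 3`, the first four outputs correspond to C02 = F01/v′, C03 = (F01 + F02)/v′, C04 = (F01 + F02 + F03)/v′, and C05 = (F02 + F03 + F04)/v′.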
[0698]
The motion blur adding unit 1106 supplies the prediction image of the foreground component image to which the motion blur is added in this way to the synthesizing unit 1107.
[0699]
The synthesizing unit 1107 synthesizes the predicted image corresponding to the corrected background component image supplied from the correction unit 1105 with the predicted image corresponding to the foreground component image to which motion blur has been added, supplied from the motion blur adding unit 1106, and supplies the synthesized predicted image to the frame memory 1108.
[0700]
The frame memory 1108 stores the predicted image supplied from the synthesizing unit 1107 and outputs the stored image as an output image.
[0701]
Hereinafter, when there is no need to distinguish between the mapping unit 1103-1 and the mapping unit 1103-2, they are simply referred to as the mapping unit 1103.
[0702]
FIG. 107 is a block diagram showing the configuration of the mapping unit 1103.
[0703]
The mapping processing unit 1131 includes a class classification unit 1141 that executes class classification processing, and a prediction tap acquisition unit 1142 and a prediction calculation unit 1143 that execute adaptive processing.
[0704]
The class classification unit 1141 includes a class tap acquisition unit 1151 and a waveform classification unit 1152, and classifies the pixel of interest in the input image, which is either the background component image or the foreground component image.
[0705]
The class tap acquisition unit 1151 acquires a predetermined number of class taps corresponding to the target pixel of the input image, and supplies the acquired class taps to the waveform classification unit 1152. For example, the class tap acquisition unit 1151 acquires nine class taps and supplies the acquired class taps to the waveform classification unit 1152.
[0706]
The waveform classification unit 1152 applies ADRC processing to the class taps to reduce the number of bits of the pixels constituting the class taps, classifies the target pixel into one of a predetermined number of classes, and supplies the class number corresponding to the classified class to the prediction tap acquisition unit 1142. For example, the waveform classification unit 1152 classifies the pixel of interest into one of 512 classes, and supplies a class number corresponding to the classified class to the prediction tap acquisition unit 1142.
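As an illustration of how nine class taps can yield 512 classes, the sketch below re-quantizes each tap to 1 bit relative to the dynamic range of the taps and packs the bits into a class number (2^9 = 512). The 1-bit depth, the quantization formula, and the function name `adrc_class_number` are assumptions made for this sketch; the text above states only that ADRC reduces the number of bits.

```python
def adrc_class_number(class_taps, bits=1):
    """Classify a pixel from its class taps by ADRC re-quantization.

    class_taps: integer pixel values of the class taps
    bits:       bits kept per tap (assumed to be 1 in this sketch)
    """
    lo, hi = min(class_taps), max(class_taps)
    dynamic_range = hi - lo
    levels = 1 << bits
    class_number = 0
    for tap in class_taps:
        # Re-quantize the tap relative to the local minimum and dynamic range.
        code = ((tap - lo) * levels) // (dynamic_range + 1)
        class_number = (class_number << bits) | code
    return class_number
```

Because the codes depend only on the shape of the local waveform and not on its absolute level, taps that differ by a constant offset fall into the same class.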
[0707]
The prediction tap acquisition unit 1142 acquires a predetermined number of prediction taps corresponding to the class from the input image based on the class number, and supplies the acquired prediction tap and class number to the prediction calculation unit 1143.
[0708]
Based on the class number, the prediction calculation unit 1143 acquires the coefficient set that corresponds to the input image and to the class, from among the coefficient set corresponding to the background component image and the coefficient set corresponding to the foreground component image stored in the coefficient set memory 1104.
The prediction calculation unit 1143 predicts the pixel value of the predicted image by linear prediction based on the coefficient set corresponding to the class and the prediction tap corresponding to the input image. The prediction calculation unit 1143 supplies the predicted pixel value to the frame memory 1132.
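Linear prediction of this kind amounts to a weighted sum of the prediction taps, with the weights taken from the coefficient set selected by the class number. A minimal sketch follows; the dictionary layout of `coefficient_sets` and the function name `predict_pixel` are hypothetical.

```python
def predict_pixel(coefficient_sets, class_number, prediction_taps):
    """Predict one pixel value as a weighted sum of the prediction taps.

    coefficient_sets: mapping from class number to a list of coefficients
                      (hypothetical layout of the coefficient set memory)
    """
    coefficients = coefficient_sets[class_number]
    if len(coefficients) != len(prediction_taps):
        raise ValueError("coefficient set and prediction taps differ in length")
    return sum(w * x for w, x in zip(coefficients, prediction_taps))
```

For example, `predict_pixel({2: [0.5, 0.25, 0.25]}, 2, [100, 50, 50])` evaluates 0.5 × 100 + 0.25 × 50 + 0.25 × 50.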
[0709]
Note that the prediction calculation unit 1143 may predict the pixel value of the predicted image by nonlinear prediction.
[0710]
The frame memory 1132 stores the predicted pixel value supplied from the mapping processing unit 1131 and outputs an image including the predicted pixel value.
[0711]
Next, image creation processing of the motion blur removal image processing unit 108 shown in FIG. 104 will be described with reference to the flowchart of FIG.
[0712]
In step S1101, the mapping unit 1103-1 predicts, by the class classification adaptation process, an image corresponding to the background component image stored in the background component image frame memory 1101, based on the coefficient set corresponding to the background component image stored in the coefficient set memory 1104. Details of the process of predicting the image corresponding to the background component image will be described later with reference to the flowchart of FIG.
[0713]
In step S1102, the mapping unit 1103-2 predicts, by the class classification adaptation process, an image corresponding to the foreground component image stored in the foreground component image frame memory 1102, based on the coefficient set corresponding to the foreground component image stored in the coefficient set memory 1104.
[0714]
In step S1103, the correction unit 1105 corrects the predicted image corresponding to the background component image.
[0715]
In step S1104, the motion blur adding unit 1106 adds motion blur to the predicted image corresponding to the foreground component image.
[0716]
In step S1105, the synthesizing unit 1107 synthesizes the predicted image corresponding to the background component image and the predicted image corresponding to the foreground area. The synthesizing unit 1107 supplies the synthesized image to the frame memory 1108. The frame memory 1108 stores the image supplied from the synthesizing unit 1107.
[0717]
In step S1106, the frame memory 1108 outputs the stored synthesized image, and the process ends.
[0718]
As described above, the image processing apparatus including the motion blur removal image processing unit 108 having the configuration illustrated in FIG. 104 can individually generate a predicted image corresponding to the background component image and a predicted image corresponding to the foreground component image from which motion blur has been removed.
[0719]
Of course, the processing of step S1101 and step S1102 may be executed serially or in parallel.
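Because the two prediction steps read different inputs and do not depend on each other, either execution order gives the same result. The sketch below illustrates this with a thread pool; `predict_background` and `predict_foreground` are hypothetical stand-ins for steps S1101 and S1102, not the actual mapping process.

```python
from concurrent.futures import ThreadPoolExecutor

def predict_background(image):
    # Hypothetical stand-in for step S1101.
    return [p * 2 for p in image]

def predict_foreground(image):
    # Hypothetical stand-in for step S1102.
    return [p + 1 for p in image]

def run_serial(background, foreground):
    """Execute S1101 and then S1102, one after the other."""
    return predict_background(background), predict_foreground(foreground)

def run_parallel(background, foreground):
    """Execute S1101 and S1102 concurrently and wait for both results."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        bg_future = pool.submit(predict_background, background)
        fg_future = pool.submit(predict_foreground, foreground)
        return bg_future.result(), fg_future.result()
```

Steps S1103 and S1104 each consume the output of only one of these steps, so they could be chained onto the corresponding future in the same manner.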
[0720]
With reference to the flowchart of FIG. 109, the process of predicting an image corresponding to the background component image by the mapping unit 1103-1 corresponding to step S1101 will be described.
[0721]
In step S1121, the mapping unit 1103-1 determines whether there are unprocessed pixels in the background component image. If it is determined that there are unprocessed pixels in the background component image, the process proceeds to step S1122, and the mapping processing unit 1131 acquires the coefficient set corresponding to the background component image stored in the coefficient set memory 1104. In step S1123, the mapping processing unit 1131 acquires the target pixel from the background component image stored in the background component image frame memory 1101 in raster scan order.
[0722]
In step S1124, the class tap acquisition unit 1151 of the class classification unit 1141 acquires the class taps corresponding to the target pixel from the background component image stored in the background component image frame memory 1101. In step S1125, the waveform classification unit 1152 of the class classification unit 1141 applies ADRC processing to the class taps, thereby reducing the number of bits of the pixels constituting the class taps, and classifies the target pixel. In step S1126, the prediction tap acquisition unit 1142 acquires the prediction taps corresponding to the target pixel from the background component image stored in the background component image frame memory 1101 based on the classified class.
[0723]
In step S1127, the prediction calculation unit 1143 predicts the pixel value of the predicted image by linear prediction based on the background component image, the coefficient set corresponding to the classified class, and the prediction tap.
[0724]
Note that the prediction calculation unit 1143 is not limited to linear prediction, and may predict pixel values of a predicted image by nonlinear prediction.
[0725]
In step S1128, the prediction calculation unit 1143 outputs the predicted pixel value to the frame memory 1132. The frame memory 1132 stores the pixel value supplied from the prediction calculation unit 1143. The procedure returns to step S1121 and repeats the determination of whether or not there is an unprocessed pixel.
[0726]
If it is determined in step S1121 that there are no unprocessed pixels in the background component image, the process advances to step S1129, the frame memory 1132 outputs the stored predicted image corresponding to the background component image, and the processing ends.
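The loop of steps S1121 through S1129 can be sketched for a single image row as follows. This is a simplified illustration under several assumptions: a one-dimensional row instead of a two-dimensional image, three taps instead of nine, 1-bit ADRC, prediction taps at the same positions as the class taps, and edge pixels handled by clamping. The names `adrc_class` and `map_row` are hypothetical.

```python
def adrc_class(taps):
    """1-bit ADRC class number of integer taps (assumed bit depth)."""
    lo, hi = min(taps), max(taps)
    number = 0
    for tap in taps:
        number = (number << 1) | (((tap - lo) * 2) // (hi - lo + 1))
    return number

def map_row(row, coefficient_sets, radius=1):
    """Predict every pixel of one row of integer pixel values."""
    last = len(row) - 1
    predicted = []                      # plays the role of frame memory 1132
    for x in range(len(row)):           # S1121/S1123: next pixel in raster order
        # S1124: class taps around the target pixel (edges clamped).
        taps = [row[min(max(x + d, 0), last)]
                for d in range(-radius, radius + 1)]
        class_number = adrc_class(taps)                 # S1125
        prediction_taps = taps                          # S1126 (assumed same taps)
        coefficients = coefficient_sets[class_number]   # set acquired in S1122
        # S1127/S1128: linear prediction, then store the predicted value.
        predicted.append(sum(w * p for w, p in zip(coefficients, prediction_taps)))
    return predicted                    # S1129: output the predicted image
```

Running the same function on the foreground component image with the foreground coefficient sets would correspond to the process of the mapping unit 1103-2.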
[0727]
As described above, the mapping unit 1103-1 can predict an image corresponding to the background component image based on the corrected background component image.
[0728]
Except that it uses the foreground component image stored in the foreground component image frame memory 1102 and the coefficient set corresponding to the foreground component image, the process in which the mapping unit 1103-2 generates a predicted image corresponding to the foreground component image, corresponding to step S1102, is the same as the processing described with reference to the flowchart in FIG.
[0729]
As described above, the motion blur removal image processing unit 108 having the configuration illustrated in FIG. 104 can individually generate a predicted image corresponding to the background component image and a predicted image corresponding to the foreground component image from which motion blur has been removed.
[0730]
FIG. 110 is a block diagram illustrating another configuration of the functions of the image processing apparatus. The image processing apparatus shown in FIG. 11 sequentially performs area specification and calculation of the mixture ratio α, whereas the image processing apparatus shown in FIG. 110 performs area specification and calculation of the mixture ratio α in parallel.
[0731]
The same parts as those shown in the block diagram of FIG. 11 are denoted by the same reference numerals, and the description thereof is omitted.
[0732]
The input image is supplied to the object extraction unit 101, the region specifying unit 103, the mixture ratio calculation unit 1501, and the foreground / background separation unit 1502.
[0733]
Based on the input image, the mixture ratio calculation unit 1501 calculates, for each pixel included in the input image, the estimated mixture ratio for the case where the pixel is assumed to belong to the covered background area and the estimated mixture ratio for the case where the pixel is assumed to belong to the uncovered background area, and supplies both calculated estimated mixture ratios to the foreground / background separation unit 1502.
[0734]
FIG. 111 is a block diagram illustrating an example of the configuration of the mixture ratio calculation unit 1501.
[0735]
An estimated mixture ratio processing unit 401 shown in FIG. 111 is the same as the estimated mixture ratio processing unit 401 shown in FIG. The estimated mixture ratio processing unit 402 shown in FIG. 111 is the same as the estimated mixture ratio processing unit 402 shown in FIG.
[0736]
Based on the input image, the estimated mixture ratio processing unit 401 calculates an estimated mixture ratio for each pixel by an operation corresponding to the model of the covered background region, and outputs the calculated estimated mixture ratio.
[0737]
The estimated mixture ratio processing unit 402 calculates an estimated mixture ratio for each pixel by an operation corresponding to the model of the uncovered background area based on the input image, and outputs the calculated estimated mixture ratio.
[0738]
Based on the estimated mixture ratio for the case where the pixel is assumed to belong to the covered background area and the estimated mixture ratio for the case where the pixel is assumed to belong to the uncovered background area, both supplied from the mixture ratio calculation unit 1501, and on the region information supplied from the region specifying unit 103, the foreground / background separation unit 1502 separates the input image into a background component image and a foreground component image, and supplies the separated images to the motion blur removal image processing unit.
[0739]
FIG. 112 is a block diagram illustrating an example of the configuration of the foreground / background separation unit 1502.
[0740]
Parts that are the same as those in the motion blur removal unit 106 shown in FIG. 80 are assigned the same reference numerals, and descriptions thereof are omitted.
[0741]
Based on the region information supplied from the region specifying unit 103, the selection unit 1521 selects either the estimated mixture ratio for the case where the pixel is assumed to belong to the covered background region or the estimated mixture ratio for the case where the pixel is assumed to belong to the uncovered background region, both supplied from the mixture ratio calculation unit 1501, and supplies the selected estimated mixture ratio to the separation unit 601 as the mixture ratio α.
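The selection described here can be sketched as a simple switch on the region label of each pixel. The region label strings and the function name `select_mixture_ratio` are hypothetical, and returning `None` for pixels outside the mixed region is an assumption, since the text covers only the covered and uncovered cases.

```python
COVERED = "covered_background"      # hypothetical region labels
UNCOVERED = "uncovered_background"

def select_mixture_ratio(region, estimate_if_covered, estimate_if_uncovered):
    """Pick the estimated mixture ratio that matches the pixel's region."""
    if region == COVERED:
        return estimate_if_covered
    if region == UNCOVERED:
        return estimate_if_uncovered
    # Assumption: no estimate is selected for pixels outside the mixed region.
    return None
```

Computing both estimates for every pixel and selecting afterwards is what allows the region specification and the mixture ratio calculation to run in parallel.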
[0742]
Based on the mixture ratio α supplied from the selection unit 1521 and the region information, the separation unit 601 extracts the foreground components and the background components from the pixel values of the pixels belonging to the mixed region, and separates them into a background component of the uncovered background region, a foreground component of the uncovered background region, a background component of the covered background region, and a foreground component of the covered background region.
[0743]
The separation unit 601 can have the same configuration as that shown in FIG.
[0744]
As described above, the image processing apparatus having the configuration shown in FIG. 110 can execute processing corresponding to each property for each of the background component image and the foreground component image.
[0745]
As described above, in the image processing apparatus of the present invention, the input image is separated into the background component image and the foreground component image, and processing suited to each separated image is executed, so that, for example, a higher resolution image is generated without producing an unnatural image.
[0746]
FIG. 113 is a block diagram illustrating another configuration of the functions of the image processing apparatus.
[0747]
The same parts as those shown in FIG. 11 are denoted by the same reference numerals, and the description thereof is omitted.
[0748]
The object extraction unit 101 roughly extracts an image object corresponding to a foreground object included in the input image, and supplies the extracted image object to the motion detection unit 102.
[0749]
The motion detection unit 102 calculates the motion vector of the image object corresponding to the coarsely extracted foreground object by a method such as a block matching method, a gradient method, a phase correlation method, or a pel-recursive method, and supplies the calculated motion vector and the position information of the motion vector to the region specifying unit 103.
[0750]
The area specifying unit 103 specifies each pixel of the input image as belonging to one of a foreground area, a background area, and a mixed area consisting of an uncovered background area and a covered background area, and supplies the area information to the mixture ratio calculation unit 104 and the foreground / background separation unit 2001.
[0751]
Based on the input image and the region information supplied from the region specifying unit 103, the mixture ratio calculation unit 104 calculates the mixture ratio α corresponding to the pixels included in the mixed region, and supplies the calculated mixture ratio to the foreground / background separation unit 2001.
[0752]
Based on the region information supplied from the region specifying unit 103 and the mixture ratio α supplied from the mixture ratio calculation unit 104, the foreground / background separation unit 2001 separates the input image into an image of the foreground region, an image of the background region, an image consisting only of the background components of the uncovered background region (hereinafter referred to as the background component image of the uncovered background region), an image consisting only of the foreground components of the uncovered background region (hereinafter referred to as the foreground component image of the uncovered background region), an image consisting only of the background components of the covered background region (hereinafter referred to as the background component image of the covered background region), and an image consisting only of the foreground components of the covered background region (hereinafter referred to as the foreground component image of the covered background region), and supplies the separated images to the separated image processing unit 2002.
[0753]
The separated image processing unit 2002 processes each of the background region image, the background component image of the uncovered background region, the foreground component image of the uncovered background region, the background component image of the covered background region, the foreground component image of the covered background region, and the foreground region image supplied from the foreground / background separation unit 2001.
[0754]
For example, the separated image processing unit 2002 generates, for each of the background region image, the background component image of the uncovered background region, the foreground component image of the uncovered background region, the background component image of the covered background region, the foreground component image of the covered background region, and the foreground region image, the coefficients used in the class classification adaptive process for generating a higher resolution image.
[0755]
For example, the separated image processing unit 2002 applies the class classification adaptive process to each of the background region image, the background component image of the uncovered background region, the foreground component image of the uncovered background region, the background component image of the covered background region, the foreground component image of the covered background region, and the foreground region image to create a higher resolution image.
[0756]
FIG. 114 is a diagram for explaining processing of the separated image processing unit 2002 having the configuration shown in FIG. In the input image, the area specifying unit 103 specifies a foreground area, a background area, a covered background area, and an uncovered background area.
[0757]
The input image, in which the regions have been specified and for which the mixture ratio α has been detected by the mixture ratio calculation unit 104, is separated by the foreground / background separation unit 2001 into the foreground region image, the background region image, the foreground component image of the covered background region, the background component image of the covered background region, the foreground component image of the uncovered background region, and the background component image of the uncovered background region.
[0758]
Based on the separated foreground region image, background region image, foreground component image of the covered background region, background component image of the covered background region, foreground component image of the uncovered background region, and background component image of the uncovered background region, the separated image processing unit 2002 individually calculates a coefficient set corresponding to the foreground region image, a coefficient set corresponding to the background region image, a coefficient set corresponding to the foreground component image of the covered background region, a coefficient set corresponding to the background component image of the covered background region, a coefficient set corresponding to the foreground component image of the uncovered background region, and a coefficient set corresponding to the background component image of the uncovered background region.
[0759]
The coefficient set corresponding to the background region is used for prediction of the pixel value of the background region in the class classification adaptive process for predicting the pixel value. The coefficient set corresponding to the background component image in the uncovered background area is used for prediction of the pixel value corresponding to the background component image in the uncovered background area in the class classification adaptive process for predicting the pixel value. The coefficient set corresponding to the foreground component image in the uncovered background area is used for prediction of the pixel value corresponding to the foreground component image in the uncovered background area in the class classification adaptive process for predicting the pixel value.
[0760]
The coefficient set corresponding to the background component image in the covered background area is used for predicting the pixel value corresponding to the background component image in the covered background area in the class classification adaptation process for predicting the pixel value. The coefficient set corresponding to the foreground component image in the covered background area is used for predicting the pixel value corresponding to the foreground component image in the covered background area in the class classification adaptation process for predicting the pixel value.
[0761]
The coefficient set corresponding to the foreground region is used for predicting the pixel value of the foreground region in the class classification adaptive process for predicting the pixel value.
[0762]
The predicted image corresponding to the background region image, the predicted image corresponding to the background component image of the uncovered background region, the predicted image corresponding to the foreground component image of the uncovered background region, the predicted image corresponding to the background component image of the covered background region, the predicted image corresponding to the foreground component image of the covered background region, and the predicted image corresponding to the foreground region image are combined into one predicted image.
[0763]
FIG. 115 is a block diagram illustrating an example of the configuration of the foreground / background separator 2001.
The input image supplied to the foreground / background separator 2001 is supplied to the separation unit 2101, the switch 2102, and the switch 2103. The area information supplied from the area specifying unit 103 that indicates the covered background area and the uncovered background area is supplied to the separation unit 2101. Area information indicating the foreground area is supplied to the switch 2102. Area information indicating the background area is supplied to the switch 2103.
[0764]
The mixture ratio α supplied from the mixture ratio calculation unit 104 is supplied to the separation unit 2101.
[0765]
Based on the region information indicating the covered background region and the mixture ratio α, the separation unit 2101 separates the foreground components and the background components from the covered background region of the input image, and outputs a foreground component image of the covered background region composed of the separated foreground components and a background component image of the covered background region composed of the separated background components.
[0766]
Based on the area information indicating the uncovered background area and the mixture ratio α, the separation unit 2101 separates the foreground components and the background components from the uncovered background area of the input image, and outputs a foreground component image of the uncovered background area composed of the separated foreground components and a background component image of the uncovered background area composed of the separated background components.
[0767]
Based on the area information indicating the foreground area, the switch 2102 is closed when a pixel corresponding to the foreground area is input, and outputs an image of the foreground area.
[0768]
The switch 2103 is closed when a pixel corresponding to the background area is input based on the area information indicating the background area, and outputs an image of the background area.
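The behavior of the switch 2102 and the switch 2103 can be sketched as passing a pixel through only when its region label matches. The function name `route_by_region`, the label strings, and filling blocked positions with 0 are assumptions made for illustration; the text does not state what an open switch outputs.

```python
def route_by_region(pixels, regions, wanted_region, fill=0):
    """Pass a pixel through when its region matches (switch closed).

    pixels:  pixel values of the input image, in scan order
    regions: region label for each pixel (hypothetical label strings)
    fill:    value emitted while the switch is open (assumed to be 0)
    """
    return [p if r == wanted_region else fill
            for p, r in zip(pixels, regions)]
```

Calling it once with the foreground label and once with the background label mirrors the two switches operating on the same input image.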
[0769]
FIG. 116 is a block diagram illustrating an example of the configuration of the separation unit 2101. The image input to the separation unit 2101 is supplied to the frame memory 2121, and the region information indicating the covered background region and the uncovered background region, and the mixture ratio α supplied from the mixture ratio calculation unit 104, are input to the separation processing block 2122.
[0770]
The frame memory 2121 stores the input image in units of frames. When the object of processing is frame #n, the frame memory 2121 stores frame #n-1, which is the frame immediately before frame #n, frame #n, and frame #n+1, which is the frame immediately after frame #n.
[0771]
The frame memory 2121 supplies the pixels corresponding to frame #n-1, frame #n, and frame #n+1 to the separation processing block 2122.
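A three-frame window of this kind can be sketched with a fixed-length queue. The class name `FrameMemory` and its methods are hypothetical; the sketch assumes frames arrive in temporal order and that frame #n becomes processable once frame #n+1 has been stored.

```python
from collections import deque

class FrameMemory:
    """Holds the three most recent frames: #n-1, #n, and #n+1."""

    def __init__(self):
        self._frames = deque(maxlen=3)   # the oldest frame is dropped automatically

    def push(self, frame):
        self._frames.append(frame)

    def window(self):
        """Return (frame #n-1, frame #n, frame #n+1), or None if not yet full."""
        if len(self._frames) < 3:
            return None
        return tuple(self._frames)
```

The middle element of the returned tuple is the frame being separated; the outer two supply the background references for the covered and uncovered background areas.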
[0772]
Based on the region information indicating the covered background region and the uncovered background region and on the mixture ratio α, the separation processing block 2122 applies the operation described with reference to FIGS. 83 and 84 to the pixel values of the corresponding pixels of frame #n-1, frame #n, and frame #n+1 supplied from the frame memory 2121, and thereby separates the foreground components and the background components from the pixels belonging to the mixed region of frame #n.
[0773]
The separation processing block 2122 includes an uncovered area processing unit 2131 and a covered area processing unit 2132.
[0774]
The multiplier 2141 of the uncovered area processing unit 2131 multiplies the pixel value of the pixel of frame #n+1 supplied from the frame memory 2121 by the mixture ratio α and outputs the result to the switch 2142. The switch 2142 is closed when the pixel of frame #n supplied from the frame memory 2121 (corresponding to the pixel of frame #n+1) belongs to the uncovered background area, and supplies the pixel value multiplied by the mixture ratio α, supplied from the multiplier 2141, to the computing unit 2143. The value obtained by multiplying the pixel value of the pixel of frame #n+1 by the mixture ratio α, output from the switch 2142, is equal to the background component of the pixel value of the corresponding pixel of frame #n, and is output as the background component image of the uncovered background area.
[0775]
The computing unit 2143 subtracts the background component supplied from the switch 2142 from the pixel value of the pixel of frame #n supplied from the frame memory 2121 to obtain the foreground component. The computing unit 2143 outputs the foreground component image of the pixel of frame #n belonging to the uncovered background area.
[0776]
The multiplier 2151 of the covered area processing unit 2132 multiplies the pixel value of the pixel of frame #n-1 supplied from the frame memory 2121 by the mixture ratio α, and outputs the result to the switch 2152. The switch 2152 is closed when the pixel of frame #n supplied from the frame memory 2121 (corresponding to the pixel of frame #n-1) belongs to the covered background area, and supplies the pixel value multiplied by the mixture ratio α, supplied from the multiplier 2151, to the computing unit 2153. The value obtained by multiplying the pixel value of the pixel of frame #n-1 by the mixture ratio α, output from the switch 2152, is equal to the background component of the pixel value of the corresponding pixel of frame #n, and is output as the background component image of the covered background area.
[0777]
The computing unit 2153 subtracts the background component supplied from the switch 2152 from the pixel value of the pixel of frame #n supplied from the frame memory 2121 to obtain the foreground component. The computing unit 2153 outputs the foreground component image of the pixel of frame #n belonging to the covered background area.
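The multiply, switch, and subtract path of the two processing units can be sketched for a single pixel as follows. The function name `separate_mixed_pixel` and the region label strings are hypothetical; the arithmetic follows the description above, with the background component taken as α times the corresponding pixel of frame #n-1 (covered) or frame #n+1 (uncovered).

```python
def separate_mixed_pixel(region, alpha, prev_pixel, cur_pixel, next_pixel):
    """Split one mixed-region pixel of frame #n into (foreground, background).

    prev_pixel, cur_pixel, next_pixel: values of the corresponding pixel
    in frame #n-1, frame #n, and frame #n+1.
    """
    if region == "covered_background":
        background = alpha * prev_pixel   # multiplier 2151 and switch 2152
    elif region == "uncovered_background":
        background = alpha * next_pixel   # multiplier 2141 and switch 2142
    else:
        raise ValueError("pixel does not belong to the mixed region")
    foreground = cur_pixel - background   # computing unit 2153 or 2143
    return foreground, background
```

By construction the two returned components always sum to the original pixel value of frame #n.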
[0778]
By using the mixture ratio α, which is a feature amount, it is possible to completely separate the foreground component and the background component included in the pixel value.
[0779]
FIG. 117 is a block diagram illustrating a configuration of a separated image processing unit 2002 that generates a coefficient set used in the class classification adaptive processing that generates a higher resolution image in the spatial direction. For example, the separated image processing unit 2002 having the configuration shown in FIG. 117 generates a coefficient set used in the class classification adaptive process for generating an HD image from an SD image based on the input HD image.
[0780]
The background area teacher image frame memory 2201 stores the background area image of the teacher image supplied from the foreground / background separation unit 2001. The background area teacher image frame memory 2201 supplies the background area image of the stored teacher image to the weighted average unit 2207-1 and the learning unit 2214-1.
[0781]
The uncovered background area background component teacher image frame memory 2202 stores the background component image of the uncovered background area of the teacher image supplied from the foreground / background separation unit 2001. The uncovered background region background component teacher image frame memory 2202 supplies the stored background component image of the uncovered background region of the teacher image to the weighted average unit 2207-2 and the learning unit 2214-2.
[0782]
The uncovered background area foreground component teacher image frame memory 2203 stores the foreground component image of the uncovered background area of the teacher image supplied from the foreground / background separation unit 2001. The uncovered background area foreground component teacher image frame memory 2203 supplies the stored foreground component image of the uncovered background area of the teacher image to the weighted average unit 2207-3 and the learning unit 2214-3.
[0783]
The covered background area background component teacher image frame memory 2204 stores the background component image of the covered background area of the teacher image supplied from the foreground / background separation unit 2001. The covered background area background component teacher image frame memory 2204 supplies the stored background component image of the covered background area of the teacher image to the weighted average unit 2207-4 and the learning unit 2214-4.
[0784]
The covered background area foreground component teacher image frame memory 2205 stores the foreground component image of the covered background area of the teacher image supplied from the foreground / background separation unit 2001. The covered background area foreground component teacher image frame memory 2205 supplies the stored foreground component image of the covered background area of the teacher image to the weighted average unit 2207-5 and the learning unit 2214-5.
[0785]
The foreground area teacher image frame memory 2206 stores the image of the foreground area of the teacher image supplied from the foreground / background separation unit 2001. The foreground area teacher image frame memory 2206 supplies the stored image of the foreground area of the teacher image to the weighted average unit 2207-6 and the learning unit 2214-6.
[0786]
The weighted average unit 2207-1 performs, for example, a quarter weighted average on the background area image of the teacher image, which is an HD image supplied from the background area teacher image frame memory 2201, to generate an SD image that is a student image. The generated SD image is supplied to the background area student image frame memory 2208.
[0787]
The background area student image frame memory 2208 stores a student image corresponding to the background area image of the teacher image supplied from the weighted average unit 2207-1. The background area student image frame memory 2208 supplies the stored student image corresponding to the background area image of the teacher image to the learning unit 2214-1.
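As an illustration of how a student image might be derived from a teacher image, the sketch below assumes that the "quarter weighted average" means an equal-weight mean over non-overlapping 2x2 blocks of HD pixels, so that the SD image has one quarter as many pixels. That interpretation, the function name `quarter_weighted_average`, and the sample data are assumptions, not taken from the description.

```python
def quarter_weighted_average(hd):
    """Downsample a 2-D list of HD pixel values by averaging 2x2 blocks.

    Assumption: each of the four HD pixels in a block has weight 1/4.
    """
    sd = []
    for y in range(0, len(hd) - 1, 2):
        row = []
        for x in range(0, len(hd[y]) - 1, 2):
            block_sum = (hd[y][x] + hd[y][x + 1]
                         + hd[y + 1][x] + hd[y + 1][x + 1])
            row.append(block_sum / 4.0)
        sd.append(row)
    return sd


# Hypothetical 4x4 teacher (HD) image.
teacher = [
    [10, 20, 30, 40],
    [30, 40, 50, 60],
    [0, 0, 8, 8],
    [0, 0, 8, 8],
]
student = quarter_weighted_average(teacher)
print(student)  # [[25.0, 45.0], [0.0, 8.0]]
```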
[0788]
The weighted average unit 2207-2 performs, for example, a quarter weighted average on the background component image of the uncovered background area of the teacher image, which is an HD image supplied from the uncovered background area background component teacher image frame memory 2202, to generate an SD image that is a student image, and supplies the generated SD image to the uncovered background area background component student image frame memory 2209.
[0789]
The uncovered background area background component student image frame memory 2209 stores a student image which is an SD image corresponding to the background component image of the uncovered background area of the teacher image supplied from the weighted average unit 2207-2. The uncovered background area background component student image frame memory 2209 supplies the stored student image corresponding to the background component image of the uncovered background area of the teacher image to the learning unit 2214-2.
[0790]
The weighted average unit 2207-3 performs, for example, a quarter weighted average on the foreground component image of the uncovered background area of the teacher image, which is an HD image supplied from the uncovered background area foreground component teacher image frame memory 2203, to generate an SD image that is a student image, and supplies the generated SD image to the uncovered background area foreground component student image frame memory 2210.
[0791]
The uncovered background area foreground component student image frame memory 2210 stores a student image which is an SD image corresponding to the foreground component image of the uncovered background area of the teacher image supplied from the weighted average unit 2207-3. The uncovered background area foreground component student image frame memory 2210 supplies the stored student image corresponding to the foreground component image of the uncovered background area of the teacher image to the learning unit 2214-3.
[0792]
The weighted average unit 2207-4 performs, for example, a quarter weighted average on the background component image of the covered background area of the teacher image supplied from the covered background area background component teacher image frame memory 2204 to generate an SD image that is a student image, and supplies the generated SD image to the covered background area background component student image frame memory 2211.
[0793]
The covered background area background component student image frame memory 2211 stores a student image which is an SD image corresponding to the background component image of the covered background area of the teacher image supplied from the weighted average unit 2207-4. The covered background area background component student image frame memory 2211 supplies the stored student image corresponding to the background component image in the covered background area of the teacher image to the learning unit 2214-4.
[0794]
The weighted average unit 2207-5 performs, for example, a quarter weighted average on the foreground component image of the covered background area of the teacher image supplied from the covered background area foreground component teacher image frame memory 2205 to generate an SD image that is a student image, and supplies the generated SD image to the covered background area foreground component student image frame memory 2212.
[0795]
The covered background area foreground component student image frame memory 2212 stores a student image that is an SD image corresponding to the foreground component image in the covered background area of the teacher image supplied from the weighted average unit 2207-5. The covered background area foreground component student image frame memory 2212 supplies the stored student image corresponding to the foreground component image in the covered background area of the teacher image to the learning unit 2214-5.
[0796]
The weighted average unit 2207-6 performs, for example, a quarter weighted average on the image of the foreground area of the teacher image, which is an HD image supplied from the foreground area teacher image frame memory 2206, to generate an SD image that is a student image, and supplies the generated SD image to the foreground area student image frame memory 2213.
[0797]
The foreground area student image frame memory 2213 stores a student image which is an SD image corresponding to the image of the foreground area of the teacher image supplied from the weighted average unit 2207-6. The foreground area student image frame memory 2213 supplies the stored student image corresponding to the foreground area image of the teacher image to the learning unit 2214-6.
[0798]
The learning unit 2214-1 generates a coefficient set corresponding to the background area based on the background area image of the teacher image supplied from the background area teacher image frame memory 2201 and the student image, corresponding to the background area image of the teacher image, supplied from the background area student image frame memory 2208, and supplies the generated coefficient set to the coefficient set memory 2215.
[0799]
The learning unit 2214-2 generates a coefficient set corresponding to the background component image of the uncovered background area based on the background component image of the uncovered background area of the teacher image supplied from the uncovered background area background component teacher image frame memory 2202 and the student image, corresponding to the background component image of the uncovered background area of the teacher image, supplied from the uncovered background area background component student image frame memory 2209, and supplies the generated coefficient set to the coefficient set memory 2215.
[0800]
The learning unit 2214-3 generates a coefficient set corresponding to the foreground component image of the uncovered background area based on the foreground component image of the uncovered background area of the teacher image supplied from the uncovered background area foreground component teacher image frame memory 2203 and the student image, corresponding to the foreground component image of the uncovered background area of the teacher image, supplied from the uncovered background area foreground component student image frame memory 2210, and supplies the generated coefficient set to the coefficient set memory 2215.
[0801]
The learning unit 2214-4 generates a coefficient set corresponding to the background component image of the covered background area based on the background component image of the covered background area of the teacher image supplied from the covered background area background component teacher image frame memory 2204 and the student image, corresponding to the background component image of the covered background area of the teacher image, supplied from the covered background area background component student image frame memory 2211, and supplies the generated coefficient set to the coefficient set memory 2215.
[0802]
The learning unit 2214-5 generates a coefficient set corresponding to the foreground component image of the covered background area based on the foreground component image of the covered background area of the teacher image supplied from the covered background area foreground component teacher image frame memory 2205 and the student image, corresponding to the foreground component image of the covered background area of the teacher image, supplied from the covered background area foreground component student image frame memory 2212, and supplies the generated coefficient set to the coefficient set memory 2215.
[0803]
The learning unit 2214-6 generates a coefficient set corresponding to the foreground area based on the foreground area image of the teacher image supplied from the foreground area teacher image frame memory 2206 and the student image, corresponding to the foreground area image of the teacher image, supplied from the foreground area student image frame memory 2213, and supplies the generated coefficient set to the coefficient set memory 2215.
[0804]
The coefficient set memory 2215 stores the coefficient set corresponding to the background area supplied from the learning unit 2214-1, the coefficient set corresponding to the background component image of the uncovered background area supplied from the learning unit 2214-2, the coefficient set corresponding to the foreground component image of the uncovered background area supplied from the learning unit 2214-3, the coefficient set corresponding to the background component image of the covered background area supplied from the learning unit 2214-4, the coefficient set corresponding to the foreground component image of the covered background area supplied from the learning unit 2214-5, and the coefficient set corresponding to the foreground area supplied from the learning unit 2214-6.
[0805]
Note that the learning units 2214-1 to 2214-6 have the same configuration as the learning unit 1006, and thus description thereof is omitted.
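The per-class learning performed by each learning unit can be sketched as a least-squares fit. The example below assumes that the learning units follow the normal-equation approach described for the conventional apparatus (a normal equation generated per class and then solved for a coefficient set). The function names, the sample format, and the toy data are hypothetical, and the sketch does not handle classes with too few samples to yield a solvable system.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def learn_coefficient_sets(samples, num_taps):
    """Fit one coefficient set per class.

    samples -- iterable of (class_number, prediction_taps, teacher_pixel)
    Returns a dict mapping class_number to a list of num_taps coefficients.
    """
    sums = {}  # class_number -> (X^T X, X^T y) accumulators
    for class_number, taps, teacher_pixel in samples:
        if class_number not in sums:
            sums[class_number] = (
                [[0.0] * num_taps for _ in range(num_taps)],
                [0.0] * num_taps,
            )
        xtx, xty = sums[class_number]
        for i in range(num_taps):
            xty[i] += taps[i] * teacher_pixel
            for j in range(num_taps):
                xtx[i][j] += taps[i] * taps[j]
    # Solve the normal equation (X^T X) w = X^T y for each class.
    return {c: solve_linear_system(xtx, xty) for c, (xtx, xty) in sums.items()}


# Hypothetical toy data: the teacher pixel is the mean of two student taps.
samples = [
    (0, [1.0, 0.0], 0.5),
    (0, [0.0, 1.0], 0.5),
    (0, [2.0, 2.0], 2.0),
]
coefficient_sets = learn_coefficient_sets(samples, num_taps=2)
print(coefficient_sets[0])  # both coefficients are approximately 0.5
```

In the configuration of FIG. 117, a fit of this kind would be run six times, once per separated image, each run filling its own portion of the coefficient set memory 2215.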
[0806]
FIG. 118 is a block diagram illustrating a configuration of the separated image processing unit 2002 that executes the class classification adaptive processing to generate a higher resolution image in the spatial direction. For example, the separated image processing unit 2002 having the configuration illustrated in FIG. 118 performs class classification adaptation processing based on an input image that is an SD image to generate an HD image.
[0807]
The background area frame memory 2301 stores the image of the background area that is supplied from the foreground / background separation unit 2001 and includes pixels belonging to the background area. The background area frame memory 2301 supplies the stored background area image to the mapping unit 2307-1.
[0808]
The uncovered background area background component image frame memory 2302 stores the background component image of the uncovered background area supplied from the foreground / background separation unit 2001. The uncovered background area background component image frame memory 2302 supplies the stored background component image of the uncovered background area to the mapping unit 2307-2.
[0809]
The uncovered background area foreground component image frame memory 2303 stores the foreground component image of the uncovered background area supplied from the foreground / background separation unit 2001. The uncovered background area foreground component image frame memory 2303 supplies the stored uncovered background area foreground component image to the mapping unit 2307-3.
[0810]
The covered background area background component image frame memory 2304 stores the background component image of the covered background area supplied from the foreground / background separation unit 2001. The covered background area background component image frame memory 2304 supplies the stored background component image of the covered background area to the mapping unit 2307-4.
[0811]
The covered background area foreground component image frame memory 2305 stores the foreground component image of the covered background area supplied from the foreground / background separation unit 2001. The covered background area foreground component image frame memory 2305 supplies the stored foreground component image of the covered background area to the mapping unit 2307-5.
[0812]
The foreground area frame memory 2306 stores the image of the foreground area, which is supplied from the foreground / background separation unit 2001 and includes pixels belonging to the foreground area. The foreground area frame memory 2306 supplies the stored image of the foreground area to the mapping unit 2307-6.
[0813]
Based on the coefficient set corresponding to the background area stored in the coefficient set memory 2308, the mapping unit 2307-1 generates, by the class classification adaptive processing, a predicted image corresponding to the image of the background area stored in the background area frame memory 2301. The mapping unit 2307-1 supplies the generated predicted image to the synthesis unit 2309.
[0814]
Based on the coefficient set corresponding to the background component image of the uncovered background area stored in the coefficient set memory 2308, the mapping unit 2307-2 generates, by the class classification adaptive processing, a predicted image corresponding to the background component image of the uncovered background area stored in the uncovered background area background component image frame memory 2302. The mapping unit 2307-2 supplies the generated predicted image to the synthesis unit 2309.
[0815]
Based on the coefficient set corresponding to the foreground component image of the uncovered background area stored in the coefficient set memory 2308, the mapping unit 2307-3 generates, by the class classification adaptive processing, a predicted image corresponding to the foreground component image of the uncovered background area stored in the uncovered background area foreground component image frame memory 2303. The mapping unit 2307-3 supplies the generated predicted image to the synthesis unit 2309.
[0816]
Based on the coefficient set corresponding to the background component image of the covered background area stored in the coefficient set memory 2308, the mapping unit 2307-4 generates, by the class classification adaptive processing, a predicted image corresponding to the background component image of the covered background area stored in the covered background area background component image frame memory 2304. The mapping unit 2307-4 supplies the generated predicted image to the synthesis unit 2309.
[0817]
Based on the coefficient set corresponding to the foreground component image of the covered background area stored in the coefficient set memory 2308, the mapping unit 2307-5 generates, by the class classification adaptive processing, a predicted image corresponding to the foreground component image of the covered background area stored in the covered background area foreground component image frame memory 2305. The mapping unit 2307-5 supplies the generated predicted image to the synthesis unit 2309.
[0818]
Based on the coefficient set corresponding to the foreground area stored in the coefficient set memory 2308, the mapping unit 2307-6 generates, by the class classification adaptive processing, a predicted image corresponding to the image of the foreground area stored in the foreground area frame memory 2306. The mapping unit 2307-6 supplies the generated predicted image to the synthesis unit 2309.
[0819]
The synthesis unit 2309 combines the predicted image corresponding to the background area image supplied from the mapping unit 2307-1, the predicted image corresponding to the background component image of the uncovered background area supplied from the mapping unit 2307-2, the predicted image corresponding to the foreground component image of the uncovered background area supplied from the mapping unit 2307-3, the predicted image corresponding to the background component image of the covered background area supplied from the mapping unit 2307-4, the predicted image corresponding to the foreground component image of the covered background area supplied from the mapping unit 2307-5, and the predicted image corresponding to the image of the foreground area supplied from the mapping unit 2307-6, and supplies the combined predicted image to the frame memory 2310.
[0820]
The frame memory 2310 stores the predicted image supplied from the combining unit 2309 and outputs the stored image as an output image.
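One plausible way to combine the six predicted images can be sketched per pixel. The text says only that the predicted images are combined. The sketch below assumes that a pixel in a mixed area is rebuilt as the sum of its predicted foreground and background components, mirroring the way the components were obtained by subtraction, and that pixels in the foreground and background areas are copied from their own predicted images. The region labels, dictionary keys, and sample values are hypothetical.

```python
def synthesize_pixel(region, predicted):
    """Return one output pixel from the six predicted values at that position."""
    if region == "background":
        return predicted["background"]
    if region == "foreground":
        return predicted["foreground"]
    if region == "uncovered":
        # Assumption: a mixed pixel is the sum of its two predicted components.
        return (predicted["uncovered_background"]
                + predicted["uncovered_foreground"])
    if region == "covered":
        return (predicted["covered_background"]
                + predicted["covered_foreground"])
    raise ValueError(f"unknown region label: {region!r}")


# Hypothetical predicted values at one pixel position.
predicted = {
    "background": 30.0,
    "foreground": 200.0,
    "uncovered_background": 20.0,
    "uncovered_foreground": 70.0,
    "covered_background": 25.0,
    "covered_foreground": 100.0,
}
output_row = [
    synthesize_pixel(region, predicted)
    for region in ("background", "covered", "foreground", "uncovered")
]
print(output_row)  # [30.0, 125.0, 200.0, 90.0]
```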
[0821]
Note that the mapping units 2307-1 to 2307-6 have the same configuration as the mapping unit 1103, and thus description thereof is omitted.
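The prediction step carried out by a mapping unit can be sketched as a class lookup followed by a linear combination of prediction taps. The classification rule used here, 1-bit ADRC (one bit per class tap, set when the tap is at or above the mid-range of the taps), is an assumption made for illustration; it is at least consistent with the conventional example of 11 class taps and 2048 (2 to the 11th power) classes. All function names and sample values are hypothetical.

```python
def adrc_class_number(class_taps):
    """Assumed classifier: 1-bit ADRC over the class taps."""
    lo, hi = min(class_taps), max(class_taps)
    threshold = (lo + hi) / 2.0
    number = 0
    for tap in class_taps:
        # Append one bit per tap, most significant bit first.
        number = (number << 1) | (1 if tap >= threshold else 0)
    return number


def predict_pixel(class_taps, prediction_taps, coefficient_sets):
    """Predict one output pixel as a weighted sum of the prediction taps."""
    coefficients = coefficient_sets[adrc_class_number(class_taps)]
    return sum(w * x for w, x in zip(coefficients, prediction_taps))


# Hypothetical data: three class taps, three prediction taps, one learned class.
class_taps = [10, 200, 30]                  # bits 0, 1, 0 -> class number 2
coefficient_sets = {2: [0.5, 0.25, 0.25]}
predicted_value = predict_pixel(class_taps, [100, 40, 60], coefficient_sets)
print(adrc_class_number(class_taps), predicted_value)  # 2 75.0
```

In the configuration of FIG. 118, each of the six mapping units would read only the coefficient sets learned for its own separated image.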
[0822]
With reference to the images shown in FIGS. 119 to 124, an example of processing results of the image processing apparatus of the present invention having the separated image processing unit 2002 having the configuration shown in FIG. 118 will be described.
[0823]
In the process of generating the result shown in the example, the total number of classes in the class classification adaptation process of the image processing apparatus of the present invention is substantially the same as the number of classes in the conventional class classification adaptation process. That is, the number of classes in the conventional class classification adaptive processing is 2048, and the number of classes in the class classification adaptive processing of the image processing apparatus of the present invention corresponding to the image of each region is 512.
[0824]
In addition, the number of prediction taps in the conventional class classification adaptation process and the number of prediction taps in the class classification adaptation process for each region of the image processing apparatus of the present invention are both set to nine.
[0825]
The prediction results in the covered background area will be described with reference to FIGS.
[0826]
FIG. 119A is a diagram illustrating an example of an image in the mixed region of a teacher image. FIG. 119B is a diagram illustrating the change in pixel value corresponding to the position in the spatial direction of the image in the mixed region of the teacher image.
[0827]
FIG. 120A is a diagram illustrating an example of a mixed region image generated by the conventional class classification adaptation process corresponding to the teacher image illustrated in FIG. 119. FIG. 120B is a diagram showing the change in pixel value corresponding to the position in the spatial direction of the image in the mixed region generated by the conventional class classification adaptation process corresponding to the teacher image shown in FIG. 119.
[0828]
FIG. 121A is a diagram illustrating an example of a mixed region image generated by the separated image processing unit 2002 having the configuration illustrated in FIG. 118 corresponding to the teacher image illustrated in FIG. 119. FIG. 121B is a diagram showing the change in pixel value corresponding to the position in the spatial direction of the image in the mixed region generated by the separated image processing unit 2002 having the configuration shown in FIG. 118 for the same teacher image.
[0829]
The pixel values of the image in the mixed area generated by the conventional class classification adaptation process change stepwise compared with the teacher image, and this stepwise change can also be confirmed visually in the actual generated image.
[0830]
On the other hand, the pixel values of the image in the mixed area generated by the separated image processing unit 2002 having the configuration shown in FIG. 118 change more smoothly than in the conventional case and show a change closer to that of the teacher image. Visual inspection of the image generated by the separated image processing unit 2002 also confirms that it is smoother than the conventional image.
[0831]
The image in the mixed region generated by the separated image processing unit 2002 having the configuration shown in FIG. 118 changes more smoothly than an image generated by dividing the input image into the foreground region, the mixed region, or the background region.
[0832]
With reference to FIGS. 122 to 124, description will be given of a prediction result in the foreground region where the pixel value changes substantially linearly with respect to the pixel position.
[0833]
FIG. 122A is a diagram illustrating an example of an image in the foreground area of the teacher image in which the pixel value changes substantially linearly. FIG. 122B is a diagram illustrating a change in the pixel value corresponding to the position in the spatial direction of the image in the foreground region of the teacher image in which the pixel value changes substantially linearly.
[0834]
FIG. 123A is a diagram illustrating an example of an image of the foreground area corresponding to the image of FIG. 122 generated by the conventional class classification adaptation process. FIG. 123B is a diagram illustrating a change in pixel value corresponding to the position in the spatial direction of the image in the foreground area corresponding to the image of FIG. 122 generated by the conventional class classification adaptation process.
[0835]
FIG. 124A is a diagram showing an example of the foreground region image corresponding to the image of FIG. 122 generated by the separated image processing unit 2002. FIG. 124B is a diagram showing the change in pixel value corresponding to the position in the spatial direction of the image in the foreground area corresponding to the image of FIG. 122 generated by the separated image processing unit 2002.
[0836]
As in the mixed area, the pixel values of the image in the foreground area generated by the conventional class classification adaptation process change stepwise compared with the teacher image, and this stepwise change can also be confirmed visually in the actual image.
[0837]
On the other hand, the pixel values of the image in the foreground area generated by the separated image processing unit 2002 having the configuration shown in FIG. 118 change more smoothly than in the conventional case and are very close to those of the teacher image. In visual inspection of the image generated by the separated image processing unit 2002, no difference from the teacher image was recognized.
[0838]
FIG. 125 is a flowchart for explaining image processing of the image processing apparatus having the configuration shown in FIG.
[0839]
In step S2001, the region specifying unit 103 specifies the foreground region, the background region, the covered background region, and the uncovered background region of the input image based on the motion vector and its position information supplied from the motion detection unit 102 and on the input image.
[0840]
In step S2002, the mixture ratio calculation unit 104 calculates the mixture ratio α based on the region information and the input image supplied from the region specifying unit 103.
[0841]
In step S2003, the foreground / background separation unit 2001 separates the input image into a foreground area image, a background area image, a foreground component image of the covered background area, a background component image of the covered background area, a foreground component image of the uncovered background area, and a background component image of the uncovered background area, based on the region information supplied from the region specifying unit 103 and the mixture ratio α supplied from the mixture ratio calculation unit 104. Details of the image separation processing of the foreground / background separation unit 2001 will be described later.
[0842]
In step S2004, the separated image processing unit 2002 executes image processing on each of the separated images, namely the foreground area image, the background area image, the foreground component image of the covered background area, the background component image of the covered background area, the foreground component image of the uncovered background area, and the background component image of the uncovered background area, and the processing ends. Details of the image processing executed by the separated image processing unit 2002 will be described later.
[0843]
As described above, the image processing apparatus according to the present invention separates an input image into a foreground area image, a background area image, a foreground component image of the covered background area, a background component image of the covered background area, a foreground component image of the uncovered background area, and a background component image of the uncovered background area, and executes image processing on each of the separated images.
[0844]
Next, foreground / background separation processing by the foreground / background separation unit 2001 will be described with reference to the flowchart shown in FIG. 126. In step S2101, the frame memory 2121 of the separation unit 2101 acquires the input image and stores frame #n, which is the target of foreground / background separation, together with the previous frame #n-1 and the subsequent frame #n+1.
[0845]
In step S2102, the separation processing block 2122 of the separation unit 2101 acquires the region information supplied from the region specifying unit 103. In step S2103, the separation processing block 2122 of the separation unit 2101 acquires the mixture ratio α supplied from the mixture ratio calculation unit 104.
[0846]
In step S2104, the uncovered area processing unit 2131 extracts the background component from the pixel values of the pixels belonging to the uncovered background area supplied from the frame memory 2121, based on the area information and the mixture ratio α, and outputs it as the background component image of the uncovered background area.
[0847]
In step S2105, the uncovered area processing unit 2131 extracts the foreground component from the pixel values of the pixels belonging to the uncovered background area supplied from the frame memory 2121, based on the area information and the mixture ratio α, and outputs it as the foreground component image of the uncovered background area.
[0848]
In step S2106, the covered area processing unit 2132 extracts the background component from the pixel values of the pixels belonging to the covered background area supplied from the frame memory 2121, based on the area information and the mixture ratio α, and outputs it as the background component image of the covered background area.
[0849]
In step S2107, the covered area processing unit 2132 extracts the foreground component from the pixel values of the pixels belonging to the covered background area supplied from the frame memory 2121, based on the area information and the mixture ratio α, and outputs it as the foreground component image of the covered background area, and the process ends.
[0850]
As described above, the foreground / background separation unit 2001 separates the foreground component and the background component from the input image based on the region information and the mixture ratio α, and can output a foreground component image consisting only of foreground components and a background component image consisting only of background components.
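Steps S2104 to S2107 can be sketched as a single pass over frame #n. In this sketch the covered case takes its background reference from frame #n-1. The uncovered case is assumed, by symmetry, to take its background reference from frame #n+1, which is an assumption, since this passage does not state the source frame. The function name, the string region labels, and the sample frames are hypothetical.

```python
def separate_mixed_regions(frame_prev, frame_n, frame_next, regions, alpha):
    """Return the four component images of the mixed areas of frame #n.

    regions -- 2-D list of labels; only "covered" and "uncovered" are processed
    alpha   -- 2-D list of mixture ratios, one per pixel
    """
    height, width = len(frame_n), len(frame_n[0])
    names = ("uncovered_background", "uncovered_foreground",
             "covered_background", "covered_foreground")
    out = {name: [[0.0] * width for _ in range(height)] for name in names}
    for y in range(height):
        for x in range(width):
            region = regions[y][x]
            if region == "uncovered":
                reference = frame_next[y][x]  # assumed source frame (#n+1)
            elif region == "covered":
                reference = frame_prev[y][x]  # frame #n-1
            else:
                continue  # foreground and background areas are not mixed
            background = alpha[y][x] * reference
            out[region + "_background"][y][x] = background
            out[region + "_foreground"][y][x] = frame_n[y][x] - background
    return out


# Hypothetical 1x3 frames: one covered, one uncovered, one foreground pixel.
components = separate_mixed_regions(
    frame_prev=[[80.0, 0.0, 0.0]],
    frame_n=[[120.0, 90.0, 50.0]],
    frame_next=[[0.0, 40.0, 0.0]],
    regions=[["covered", "uncovered", "foreground"]],
    alpha=[[0.25, 0.5, 0.0]],
)
print(components["covered_foreground"])    # [[100.0, 0.0, 0.0]]
print(components["uncovered_foreground"])  # [[0.0, 70.0, 0.0]]
```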
[0851]
With reference to the flowchart of FIG. 127, a learning process for generating a coefficient set used for pixel value prediction by the class classification adaptive process by the separated image processing unit 2002 having the configuration shown in FIG. 117 will be described.
[0852]
In step S2201, the weighted average units 2207-1 to 2207-6 generate student images of the background area image, the foreground area image, the background component image of the uncovered background area, the foreground component image of the uncovered background area, the background component image of the covered background area, and the foreground component image of the covered background area. That is, the weighted average unit 2207-1 performs, for example, a quarter weighted average on the background area image of the teacher image stored in the background area teacher image frame memory 2201 to generate a student image corresponding to the background area image of the teacher image.
[0853]
The weighted average unit 2207-2 performs, for example, a one-fourth weighted average of the background component images in the uncovered background area of the teacher image stored in the uncovered background area background component teacher image frame memory 2202. Then, a student image corresponding to the background component image in the uncovered background area of the teacher image is generated.
[0854]
The weighted average unit 2207-3 performs, for example, a quarter weighted average on the foreground component image of the uncovered background area of the teacher image stored in the uncovered background area foreground component teacher image frame memory 2203 to generate a student image corresponding to the foreground component image of the uncovered background area of the teacher image.
[0855]
The weighted average unit 2207-4 applies, for example, a one-quarter weighted average to the background component image of the covered background area of the teacher image stored in the covered background area background component teacher image frame memory 2204, and generates a student image corresponding to the background component image of the covered background area of the teacher image.
[0856]
The weighted average unit 2207-5 applies, for example, a one-quarter weighted average to the foreground component image of the covered background area of the teacher image stored in the covered background area foreground component teacher image frame memory 2205, and generates a student image corresponding to the foreground component image of the covered background area of the teacher image.
[0857]
The weighted average unit 2207-6 applies, for example, a one-quarter weighted average to the foreground area image of the teacher image stored in the foreground area teacher image frame memory 2206, and generates a student image corresponding to the foreground area image of the teacher image.
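As an illustration only, the one-quarter weighted average of step S2201 can be sketched in Python as follows. The sketch assumes that "one-quarter weighted average" means averaging each 2×2 block of teacher (HD) pixels with equal weights of 1/4 to obtain one student (SD) pixel; the function name `quarter_weighted_average` and the list-of-rows image layout are hypothetical and do not appear in the specification.

```python
def quarter_weighted_average(hd):
    """Average each 2x2 block of an HD image (a list of rows) into one SD pixel.

    Assumption: every pixel of the block carries a weight of 1/4.
    A trailing odd row or column, if any, is dropped.
    """
    height, width = len(hd), len(hd[0])
    sd = []
    for y in range(0, height - 1, 2):
        row = []
        for x in range(0, width - 1, 2):
            block_sum = hd[y][x] + hd[y][x + 1] + hd[y + 1][x] + hd[y + 1][x + 1]
            row.append(block_sum / 4.0)
        sd.append(row)
    return sd


# A 4x4 teacher image yields a 2x2 student image.
teacher = [
    [10, 20, 30, 40],
    [30, 40, 50, 60],
    [0, 0, 8, 8],
    [4, 4, 8, 8],
]
student = quarter_weighted_average(teacher)
```

The same function would be applied six times, once per separated teacher image, to mirror the weighted average units 2207-1 to 2207-6.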
[0858]
In step S2202, the learning unit 2214-1 generates a coefficient set corresponding to the background area based on the background area image of the teacher image stored in the background area teacher image frame memory 2201 and the student image, stored in the background area student image frame memory 2208, that corresponds to the background area image of the teacher image.
[0859]
In step S2203, the learning unit 2214-2 generates a coefficient set corresponding to the background component image of the uncovered background area based on the background component image of the uncovered background area of the teacher image stored in the uncovered background area background component teacher image frame memory 2202 and the student image, stored in the uncovered background area background component student image frame memory 2209, that corresponds to the background component image of the uncovered background area of the teacher image.
[0860]
In step S2204, the learning unit 2214-3 generates a coefficient set corresponding to the foreground component image of the uncovered background area based on the foreground component image of the uncovered background area of the teacher image stored in the uncovered background area foreground component teacher image frame memory 2203 and the student image, stored in the uncovered background area foreground component student image frame memory 2210, that corresponds to the foreground component image of the uncovered background area of the teacher image.
[0861]
In step S2205, the learning unit 2214-4 generates a coefficient set corresponding to the background component image of the covered background area based on the background component image of the covered background area of the teacher image stored in the covered background area background component teacher image frame memory 2204 and the student image, stored in the covered background area background component student image frame memory 2211, that corresponds to the background component image of the covered background area of the teacher image.
[0862]
In step S2206, the learning unit 2214-5 generates a coefficient set corresponding to the foreground component image of the covered background area based on the foreground component image of the covered background area of the teacher image stored in the covered background area foreground component teacher image frame memory 2205 and the student image, stored in the covered background area foreground component student image frame memory 2212, that corresponds to the foreground component image of the covered background area of the teacher image.
[0863]
In step S2207, the learning unit 2214-6 generates a coefficient set corresponding to the foreground area based on the foreground area image of the teacher image stored in the foreground area teacher image frame memory 2206 and the student image, stored in the foreground area student image frame memory 2213, that corresponds to the foreground area image of the teacher image.
[0864]
In step S2208, the learning units 2214-1 to 2214-6 respectively output, to the coefficient set memory 2215, the coefficient set corresponding to the background area, the coefficient set corresponding to the background component image of the uncovered background area, the coefficient set corresponding to the foreground component image of the uncovered background area, the coefficient set corresponding to the background component image of the covered background area, the coefficient set corresponding to the foreground component image of the covered background area, and the coefficient set corresponding to the foreground area. The coefficient set memory 2215 stores the coefficient sets corresponding to each of the background area, the foreground area, the background component image of the uncovered background area, the foreground component image of the uncovered background area, the background component image of the covered background area, and the foreground component image of the covered background area, and the process ends.
[0865]
As described above, the separated image processing unit 2002 having the configuration shown in FIG. 117 can generate a coefficient set corresponding to the background area image, a coefficient set corresponding to the background component image of the uncovered background area, a coefficient set corresponding to the foreground component image of the uncovered background area, a coefficient set corresponding to the background component image of the covered background area, a coefficient set corresponding to the foreground component image of the covered background area, and a coefficient set corresponding to the foreground area image.
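The generation of one coefficient set per separated image can be illustrated by the following Python sketch. It assumes a linear prediction in which a teacher pixel is approximated by a weighted sum of student-image prediction taps, and it solves the resulting normal equation by Gaussian elimination. Class classification is omitted for brevity, so the sketch yields one coefficient set per region rather than one per class within each region. The names `solve_linear_system`, `learn_coefficient_set`, and `learn_all_regions` are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a * w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("normal equation is singular; more samples are needed")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    w = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        acc = m[i][n] - sum(m[i][c] * w[c] for c in range(i + 1, n))
        w[i] = acc / m[i][i]
    return w


def learn_coefficient_set(samples):
    """samples: (prediction_taps, teacher_pixel) pairs taken from one region."""
    samples = list(samples)
    n = len(samples[0][0])
    a = [[0.0] * n for _ in range(n)]  # sum of taps[i] * taps[j]
    b = [0.0] * n                      # sum of taps[i] * teacher_pixel
    for taps, teacher_pixel in samples:
        for i in range(n):
            b[i] += taps[i] * teacher_pixel
            for j in range(n):
                a[i][j] += taps[i] * taps[j]
    return solve_linear_system(a, b)


def learn_all_regions(samples_by_region):
    """Return one coefficient set per region name (assumed dictionary layout)."""
    return {name: learn_coefficient_set(s) for name, s in samples_by_region.items()}


# Samples generated from the coefficients (0.5, 0.25) are recovered by learning.
demo = learn_coefficient_set([((1, 0), 0.5), ((0, 1), 0.25), ((2, 2), 1.5)])
```

Keeping a separate accumulation per region is what allows each separated image to receive coefficients suited to its own properties.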
[0866]
Details of the processing in steps S2202 to S2207 are the same as the processing described with reference to the flowchart of FIG. 103, and thus description thereof is omitted.
[0867]
Of course, the processing in steps S2202 to S2207 may be executed serially or in parallel.
[0868]
Next, the image creation processing of the separated image processing unit 2002 shown in FIG. 118 will be described with reference to the flowchart of FIG.
[0869]
In step S2301, the mapping unit 2307-1 predicts, by the class classification adaptive process based on the coefficient set corresponding to the background area stored in the coefficient set memory 2308, an image corresponding to the background area image stored in the background area frame memory 2301.
[0870]
In step S2302, the mapping unit 2307-2 predicts, by the class classification adaptive process based on the coefficient set corresponding to the background component image of the uncovered background area stored in the coefficient set memory 2308, an image corresponding to the background component image of the uncovered background area stored in the uncovered background area background component image frame memory 2302.
[0871]
In step S2303, the mapping unit 2307-3 predicts, by the class classification adaptive process based on the coefficient set corresponding to the foreground component image of the uncovered background area stored in the coefficient set memory 2308, an image corresponding to the foreground component image of the uncovered background area stored in the uncovered background area foreground component image frame memory 2303.
[0872]
In step S2304, the mapping unit 2307-4 predicts, by the class classification adaptive process based on the coefficient set corresponding to the background component image of the covered background area stored in the coefficient set memory 2308, an image corresponding to the background component image of the covered background area stored in the covered background area background component image frame memory 2304.
[0873]
In step S2305, the mapping unit 2307-5 predicts, by the class classification adaptive process based on the coefficient set corresponding to the foreground component image of the covered background area stored in the coefficient set memory 2308, an image corresponding to the foreground component image of the covered background area stored in the covered background area foreground component image frame memory 2305.
[0874]
In step S2306, the mapping unit 2307-6 predicts, by the class classification adaptive process based on the coefficient set corresponding to the foreground area stored in the coefficient set memory 2308, an image corresponding to the foreground area image stored in the foreground area frame memory 2306.
[0875]
In step S2307, the synthesis unit 2309 synthesizes the predicted image corresponding to the background area image, the predicted image corresponding to the background component image of the uncovered background area, the predicted image corresponding to the foreground component image of the uncovered background area, the predicted image corresponding to the background component image of the covered background area, the predicted image corresponding to the foreground component image of the covered background area, and the predicted image corresponding to the foreground area image. The synthesis unit 2309 supplies the synthesized image to the frame memory 2310. The frame memory 2310 stores the image supplied from the synthesis unit 2309.
[0876]
In step S2308, the frame memory 2310 outputs the stored composite image, and the process ends.
[0877]
As described above, the image processing apparatus having the separated image processing unit 2002 with the configuration illustrated in FIG. 118 can generate a predicted image for each of the separated background area image, the background component image of the uncovered background area, the foreground component image of the uncovered background area, the background component image of the covered background area, the foreground component image of the covered background area, and the foreground area image.
[0878]
Details of the processing in steps S2301 to S2306 are the same as the processing described with reference to the flowchart of FIG. 109, and thus description thereof is omitted.
[0879]
Of course, the processing in steps S2301 to S2306 may be executed serially or in parallel.
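Because each of steps S2301 to S2306 operates on its own separated image, the choice between serial and parallel execution can be sketched in Python as follows. The function `process_region` is a hypothetical stand-in for one prediction step (here it merely doubles each pixel), and the use of a thread pool is only one possible way of running the steps concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def process_region(item):
    """Hypothetical stand-in for one of steps S2301 to S2306."""
    name, pixels = item
    return name, [2 * p for p in pixels]


def run_serial(regions):
    """Process the separated images one after another."""
    return dict(process_region(item) for item in regions.items())


def run_parallel(regions, max_workers=6):
    """Process the separated images concurrently; the results are identical."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(process_region, regions.items()))


regions = {
    "background": [1, 2, 3],
    "foreground": [4, 5],
    "covered_background_component": [6],
}
serial_result = run_serial(regions)
parallel_result = run_parallel(regions)
```

Both functions return the same dictionary, which reflects that the order of execution does not affect the result.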
[0880]
FIG. 129 is a block diagram illustrating another configuration of the function of the image processing apparatus that separates input images and processes the separated images. The image processing apparatus shown in FIG. 113 sequentially performs area specification and calculation of the mixture ratio α, whereas the image processing apparatus shown in FIG. 129 performs area specification and calculation of the mixture ratio α in parallel.
[0881]
The same parts as those shown in the block diagram of FIG. 113 are denoted by the same reference numerals, and the description thereof is omitted.
[0882]
The input image is supplied to the object extraction unit 101, the region specifying unit 103, the mixture ratio calculation unit 1501, and the foreground / background separation unit 2501.
[0883]
Based on the input image, the mixture ratio calculation unit 1501 calculates, for each pixel included in the input image, the estimated mixture ratio under the assumption that the pixel belongs to the covered background area and the estimated mixture ratio under the assumption that the pixel belongs to the uncovered background area, and supplies both calculated estimated mixture ratios to the foreground / background separation unit 2501.
[0884]
Based on the estimated mixture ratio under the assumption that the pixel belongs to the covered background area and the estimated mixture ratio under the assumption that the pixel belongs to the uncovered background area, both supplied from the mixture ratio calculation unit 1501, and on the region information supplied from the region specifying unit 103, the foreground / background separation unit 2501 separates the input image into the background area image, the background component image of the uncovered background area, the foreground component image of the uncovered background area, the background component image of the covered background area, the foreground component image of the covered background area, and the foreground area image, and supplies the separated images to the separated image processing unit 2002.
[0885]
FIG. 130 is a block diagram illustrating an example of the configuration of the foreground / background separation unit 2501.
[0886]
The same parts as those in the foreground / background separator 2001 shown in FIG. 115 are denoted by the same reference numerals, and the description thereof is omitted.
[0887]
Based on the region information supplied from the region specifying unit 103, the selection unit 2521 selects one of the two estimated mixture ratios supplied from the mixture ratio calculation unit 1501, namely the estimated mixture ratio under the assumption that the pixel belongs to the covered background area and the estimated mixture ratio under the assumption that the pixel belongs to the uncovered background area, and supplies the selected estimated mixture ratio to the separation unit 2101 as the mixture ratio α.
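The selection performed by the selection unit 2521 can be illustrated by the following Python sketch. The region labels and the function name `select_mixture_ratio` are hypothetical. The values returned for the foreground area (0) and the background area (1) are an assumption based on treating the mixture ratio α as the proportion of the background component in a pixel value; this section states only that one of the two estimates is selected.

```python
COVERED = "covered_background"
UNCOVERED = "uncovered_background"
FOREGROUND = "foreground"
BACKGROUND = "background"


def select_mixture_ratio(region, estimate_if_covered, estimate_if_uncovered):
    """Pick the mixture ratio for one pixel from its region label."""
    if region == COVERED:
        return estimate_if_covered
    if region == UNCOVERED:
        return estimate_if_uncovered
    if region == FOREGROUND:
        return 0.0  # assumption: no background component in a foreground pixel
    if region == BACKGROUND:
        return 1.0  # assumption: a background pixel is entirely background
    raise ValueError("unknown region label: %r" % (region,))


# Both estimates are computed in parallel with region specification;
# the region label then decides which one is used.
alpha = select_mixture_ratio(UNCOVERED, 0.3, 0.7)
```

Computing both estimates in advance is what lets the configuration of FIG. 129 run region specification and mixture ratio calculation in parallel.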
[0888]
Based on the mixture ratio α supplied from the selection unit 2521 and the region information, the separation unit 2101 extracts the foreground components and the background components from the pixel values of the pixels belonging to the mixed area, and separates them into the background component image of the uncovered background area, the foreground component image of the uncovered background area, the background component image of the covered background area, and the foreground component image of the covered background area.
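A minimal Python sketch of separating one mixed-area pixel is given below. It assumes a mixing model, not restated in this section, in which a mixed pixel value C equals α·B + f, where B is the corresponding background pixel value and f is the sum of the foreground components. How B is obtained (for example, from an adjacent frame) is outside the sketch, and the function name `separate_mixed_pixel` is hypothetical.

```python
def separate_mixed_pixel(pixel_value, background_value, alpha):
    """Split a mixed pixel into (foreground_component, background_component).

    Assumed model: pixel_value = alpha * background_value + foreground_component.
    """
    background_component = alpha * background_value
    foreground_component = pixel_value - background_component
    return foreground_component, background_component


# A pixel of value 100 with background value 80 and alpha 0.25
# contains a background component of 20 and a foreground component of 80.
foreground_component, background_component = separate_mixed_pixel(100.0, 80.0, 0.25)
```

The two components sum back to the original pixel value, so the separation loses no information under the assumed model.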
[0889]
The separation unit 2101 can have the same configuration as that shown in FIG.
[0890]
As described above, the image processing apparatus having the configuration shown in FIG. 129 can execute, for each of the background area image, the background component image of the uncovered background area, the foreground component image of the uncovered background area, the background component image of the covered background area, the foreground component image of the covered background area, and the foreground area image, processing suited to the properties of that image.
[0891]
FIG. 131 is a block diagram illustrating still another configuration of the function of the image processing apparatus.
[0892]
The input image supplied to the image processing apparatus is supplied to the object extraction unit 101, the region specifying unit 103, and the region processing unit 3001.
[0893]
The object extraction unit 101 roughly extracts an image object corresponding to a foreground object included in the input image, and supplies the extracted image object to the motion detection unit 102. For example, the object extraction unit 101 detects the outline of the image object corresponding to the foreground object included in the input image, thereby roughly extracting the image object corresponding to the foreground object.
[0894]
The object extraction unit 101 roughly extracts an image object corresponding to a background object included in the input image, and supplies the extracted image object to the motion detection unit 102. For example, the object extraction unit 101 roughly extracts an image object corresponding to the background object from the difference between the input image and the image object corresponding to the extracted foreground object.
[0895]
The motion detection unit 102 calculates the motion vector of the image object corresponding to the roughly extracted foreground object by a method such as a block matching method, a gradient method, a phase correlation method, or a pel-recursive method, and supplies the calculated motion vector and the position information of the motion vector to the region specifying unit 103.
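Of the methods named above, block matching can be sketched in Python as follows. The sketch uses the sum of absolute differences as the matching criterion and an exhaustive search, both of which are assumptions; the specification does not state which criterion or search strategy is used. The function name `block_matching` and its parameters are hypothetical.

```python
def block_matching(prev, curr, y, x, size, search):
    """Return the motion vector (dy, dx) of the block at (y, x) in curr.

    The block of curr is compared with blocks of prev displaced by up to
    `search` pixels; the displacement with the smallest sum of absolute
    differences wins. The block at (y, x) must lie inside curr.
    """
    height, width = len(prev), len(prev[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            sy, sx = y - dy, x - dx  # where the block would have been in prev
            if sy < 0 or sx < 0 or sy + size > height or sx + size > width:
                continue
            sad = sum(
                abs(curr[y + j][x + i] - prev[sy + j][sx + i])
                for j in range(size)
                for i in range(size)
            )
            if best is None or sad < best[0]:
                best = (sad, dy, dx)
    return best[1], best[2]


def make_frame(top, left):
    """Build a 6x6 test frame with a distinctive 2x2 object at (top, left)."""
    frame = [[0] * 6 for _ in range(6)]
    frame[top][left], frame[top][left + 1] = 9, 8
    frame[top + 1][left], frame[top + 1][left + 1] = 7, 6
    return frame


# The object moves two pixels to the right between the two frames.
motion = block_matching(make_frame(1, 1), make_frame(1, 3), 1, 3, 2, 2)
```

The returned pair is the displacement of the block from the previous frame to the current frame, in (vertical, horizontal) order.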
[0896]
The area specifying unit 103 specifies each pixel of the input image as belonging to one of the foreground area, the background area, or the mixed area, and supplies, to the area processing unit 3001, area information indicating to which of the foreground area, the background area, or the mixed area each pixel belongs.
[0897]
The area processing unit 3001 divides the input image into the foreground area, the background area, and the mixed area based on the area information supplied from the area specifying unit 103, and executes image processing on each divided input image. For example, the area processing unit 3001 divides the input image into the foreground area, the background area, and the mixed area, and generates, for each divided input image, the coefficients used in the class classification adaptive process that generates a higher-resolution image.
[0898]
For example, the area processing unit 3001 divides the input image into the foreground area, the background area, and the mixed area, and applies the class classification adaptive process to each divided input image to create a higher-resolution image.
[0899]
FIG. 132 is a diagram for explaining the processing of the area processing unit 3001 having the configuration shown in FIG. The area processing unit 3001 individually calculates a coefficient set corresponding to the background area, a coefficient set corresponding to the uncovered background area, a coefficient set corresponding to the foreground area, and a coefficient set corresponding to the covered background area.
[0900]
The coefficient set corresponding to the background region is used for prediction of the pixel value of the background region in the class classification adaptive process for predicting the pixel value. The coefficient set corresponding to the uncovered background area is used for predicting the pixel value of the uncovered background area in the class classification adaptation process for predicting the pixel value.
[0901]
The coefficient set corresponding to the covered background region is used for predicting the pixel value of the covered background region in the class classification adaptation process for predicting the pixel value. The coefficient set corresponding to the foreground region is used for predicting the pixel value of the foreground region in the class classification adaptive process for predicting the pixel value.
[0902]
The predicted image corresponding to the background image, the predicted image corresponding to the uncovered background region, the predicted image corresponding to the covered background region, and the predicted image corresponding to the foreground image are combined into one predicted image.
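The use of a region-specific coefficient set for pixel value prediction can be illustrated by the following Python sketch. It assumes that the predicted pixel value is the linear combination of the prediction taps and the coefficients of the set selected for the region; class classification within each region is omitted, and the names `predict_pixel` and `predict_for_region` are hypothetical.

```python
def predict_pixel(taps, coefficient_set):
    """Predict one pixel value as the sum of coefficient * tap products."""
    if len(taps) != len(coefficient_set):
        raise ValueError("the number of taps and coefficients must match")
    return sum(w * x for w, x in zip(coefficient_set, taps))


def predict_for_region(region, taps, coefficient_sets):
    """Select the coefficient set for the pixel's region, then predict."""
    return predict_pixel(taps, coefficient_sets[region])


# Hypothetical two-tap coefficient sets, one per region.
coefficient_sets = {
    "background": [1.0, 0.0],
    "uncovered_background": [0.75, 0.25],
    "covered_background": [0.25, 0.75],
    "foreground": [0.5, 0.5],
}
value = predict_for_region("foreground", [10, 20], coefficient_sets)
```

The same taps give a different prediction when the pixel belongs to a different region, which is the purpose of calculating the four coefficient sets individually.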
[0903]
FIG. 133 is a block diagram illustrating a configuration of an area processing unit 3001 that generates a coefficient set used in the class classification adaptation process that generates a higher-resolution image in the spatial direction. The teacher image frame memory 3101 stores, for example, an input image that is an HD image in units of frames. The teacher image frame memory 3101 supplies the stored input image to the area dividing unit 3102.
[0904]
The area dividing unit 3102 divides the teacher image into a background area, a foreground area, a covered background area, or an uncovered background area based on the area information supplied from the area specifying unit 103.
[0905]
The area dividing unit 3102 supplies, as the divided teacher image, an image composed of pixels belonging to the background area of the teacher image to the background area teacher image frame memory 3103, an image composed of pixels belonging to the uncovered background area of the teacher image to the uncovered background area teacher image frame memory 3104, an image composed of pixels belonging to the covered background area of the teacher image to the covered background area teacher image frame memory 3105, and an image composed of pixels belonging to the foreground area of the teacher image to the foreground area teacher image frame memory 3106.
[0906]
The background area teacher image frame memory 3103 stores an image made up of pixels belonging to the background area of the teacher image supplied from the area dividing unit 3102. The background area teacher image frame memory 3103 supplies an image composed of pixels belonging to the background area of the stored teacher image to the weighted average unit 3107-1 and the learning unit 3112-1.
[0907]
The uncovered background area teacher image frame memory 3104 stores an image made up of pixels belonging to the uncovered background area of the teacher image supplied from the area dividing unit 3102. The uncovered background area teacher image frame memory 3104 supplies an image composed of pixels belonging to the uncovered background area of the stored teacher image to the weighted average unit 3107-2 and the learning unit 3112-2.
[0908]
The covered background area teacher image frame memory 3105 stores an image made up of pixels belonging to the covered background area of the teacher image supplied from the area dividing unit 3102. The covered background area teacher image frame memory 3105 supplies an image composed of pixels belonging to the covered background area of the stored teacher image to the weighted average unit 3107-3 and the learning unit 3112-3.
[0909]
The foreground area teacher image frame memory 3106 stores an image made up of pixels belonging to the foreground area of the teacher image supplied from the area dividing unit 3102. The foreground area teacher image frame memory 3106 supplies an image composed of pixels belonging to the foreground area of the stored teacher image to the weighted average unit 3107-4 and the learning unit 3112-4.
[0910]
The weighted average unit 3107-1 applies, for example, a one-quarter weighted average to the image composed of pixels belonging to the background area of the teacher image, which is an HD image, supplied from the background area teacher image frame memory 3103, generates an SD image as a student image, and supplies the generated SD image to the background area student image frame memory 3108.
[0911]
The background area student image frame memory 3108 stores a student image corresponding to an image composed of pixels belonging to the background area of the teacher image supplied from the weighted average unit 3107-1. The background area student image frame memory 3108 supplies the stored student image corresponding to the image composed of the pixels belonging to the background area of the teacher image to the learning unit 3112-1.
[0912]
The weighted average unit 3107-2 applies, for example, a one-quarter weighted average to the image composed of pixels belonging to the uncovered background area of the teacher image, which is an HD image, supplied from the uncovered background area teacher image frame memory 3104, generates an SD image as a student image, and supplies the generated SD image to the uncovered background area student image frame memory 3109.
[0913]
The uncovered background area student image frame memory 3109 stores a student image which is an SD image corresponding to an image made up of pixels belonging to the uncovered background area of the teacher image supplied from the weighted average unit 3107-2.
The uncovered background area student image frame memory 3109 supplies the stored student image corresponding to the image composed of pixels belonging to the uncovered background area of the teacher image to the learning unit 3112-2.
[0914]
The weighted average unit 3107-3 applies, for example, a one-quarter weighted average to the image composed of pixels belonging to the covered background area of the teacher image supplied from the covered background area teacher image frame memory 3105, generates an SD image as a student image, and supplies the generated SD image to the covered background area student image frame memory 3110.
[0915]
The covered background area student image frame memory 3110 stores a student image that is an SD image corresponding to an image made up of pixels belonging to the covered background area of the teacher image supplied from the weighted average unit 3107-3. The covered background area student image frame memory 3110 supplies the stored student image corresponding to the image made up of pixels belonging to the covered background area of the teacher image to the learning unit 3112-3.
[0916]
The weighted average unit 3107-4 applies, for example, a one-quarter weighted average to the image composed of pixels belonging to the foreground area of the teacher image, which is an HD image, supplied from the foreground area teacher image frame memory 3106, generates an SD image as a student image, and supplies the generated SD image to the foreground area student image frame memory 3111.
[0917]
The foreground area student image frame memory 3111 stores a student image which is an SD image corresponding to an image made up of pixels belonging to the foreground area of the teacher image supplied from the weighted average unit 3107-4. The foreground area student image frame memory 3111 supplies the stored student image corresponding to the image including the pixels belonging to the foreground area of the teacher image to the learning unit 3112-4.
[0918]
The learning unit 3112-1 generates a coefficient set corresponding to the background area based on the image composed of pixels belonging to the background area of the teacher image, supplied from the background area teacher image frame memory 3103, and the student image corresponding to that image, supplied from the background area student image frame memory 3108, and supplies the generated coefficient set to the coefficient set memory 3113.
[0919]
The learning unit 3112-2 generates a coefficient set corresponding to the uncovered background area based on the image composed of pixels belonging to the uncovered background area of the teacher image, supplied from the uncovered background area teacher image frame memory 3104, and the student image corresponding to that image, supplied from the uncovered background area student image frame memory 3109, and supplies the generated coefficient set to the coefficient set memory 3113.
[0920]
The learning unit 3112-3 generates a coefficient set corresponding to the covered background area based on the image composed of pixels belonging to the covered background area of the teacher image, supplied from the covered background area teacher image frame memory 3105, and the student image corresponding to that image, supplied from the covered background area student image frame memory 3110, and supplies the generated coefficient set to the coefficient set memory 3113.
[0921]
The learning unit 3112-4 generates a coefficient set corresponding to the foreground area based on the image composed of pixels belonging to the foreground area of the teacher image, supplied from the foreground area teacher image frame memory 3106, and the student image corresponding to that image, supplied from the foreground area student image frame memory 3111, and supplies the generated coefficient set to the coefficient set memory 3113.
[0922]
The coefficient set memory 3113 stores the coefficient set corresponding to the background area supplied from the learning unit 3112-1, the coefficient set corresponding to the uncovered background area supplied from the learning unit 3112-2, the coefficient set corresponding to the covered background area supplied from the learning unit 3112-3, and the coefficient set corresponding to the foreground area supplied from the learning unit 3112-4.
[0923]
The learning units 3112-1 to 3112-4 have the same configuration as the learning unit 1006, and thus description thereof is omitted.
[0924]
FIG. 134 is a block diagram illustrating a configuration of an area processing unit 3001 that executes class classification adaptation processing to generate a higher resolution image in the spatial direction. The frame memory 3201 stores, for example, an input image that is an SD image in units of frames. The frame memory 3201 supplies the stored input image to the area dividing unit 3202.
[0925]
The area dividing unit 3202 divides the input image into the background area, the foreground area, the covered background area, and the uncovered background area based on the area information supplied from the area specifying unit 103. That is, the area dividing unit 3202 supplies, as the divided input image, an image composed of pixels belonging to the background area to the background area frame memory 3203, an image composed of pixels belonging to the uncovered background area to the uncovered background area frame memory 3204, an image composed of pixels belonging to the covered background area to the covered background area frame memory 3205, and an image composed of pixels belonging to the foreground area to the foreground area frame memory 3206.
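The division performed by the area dividing unit 3202 can be sketched in Python as follows. The sketch assumes that the area information is available as a per-pixel map of region labels and that pixels outside a given region are marked with `None` in that region's image; both representations, as well as the name `divide_by_region`, are hypothetical.

```python
REGIONS = ("background", "uncovered_background", "covered_background", "foreground")


def divide_by_region(image, region_map, regions=REGIONS):
    """Split an image into one image per region.

    image and region_map are lists of rows with identical dimensions.
    Pixels that do not belong to a region are None in that region's image.
    """
    divided = {name: [[None] * len(row) for row in image] for name in regions}
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            divided[region_map[y][x]][y][x] = value
    return divided


image = [[10, 20, 30, 40]]
region_map = [["background", "uncovered_background", "foreground", "covered_background"]]
divided = divide_by_region(image, region_map)
```

Each resulting image would then be stored in the frame memory for its region and passed to the corresponding mapping unit.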
[0926]
The background area frame memory 3203 stores an image made up of pixels belonging to the background area supplied from the area dividing unit 3202. The background area frame memory 3203 supplies an image composed of the pixels belonging to the stored background area to the mapping unit 3207-1.
[0927]
The uncovered background area frame memory 3204 stores an image made up of pixels belonging to the uncovered background area supplied from the area dividing unit 3202. The uncovered background area frame memory 3204 supplies the stored image including pixels belonging to the uncovered background area to the mapping unit 3207-2.
[0928]
The covered background area frame memory 3205 stores an image made up of pixels belonging to the covered background area supplied from the area dividing unit 3202. The covered background area frame memory 3205 supplies the stored image including pixels belonging to the covered background area to the mapping unit 3207-3.
[0929]
The foreground area frame memory 3206 stores an image made up of pixels belonging to the foreground area supplied from the area dividing unit 3202. The foreground area frame memory 3206 supplies the stored image made up of pixels belonging to the foreground area to the mapping unit 3207-4.
[0930]
Based on the coefficient set corresponding to the background area stored in the coefficient set memory 3208, the mapping unit 3207-1 generates, by the class classification adaptive process, a predicted image corresponding to the image composed of pixels belonging to the background area stored in the background area frame memory 3203. The mapping unit 3207-1 supplies the generated predicted image to the synthesis unit 3209.
[0931]
Based on the coefficient set corresponding to the uncovered background area stored in the coefficient set memory 3208, the mapping unit 3207-2 generates, by the class classification adaptive process, a predicted image corresponding to the image composed of pixels belonging to the uncovered background area stored in the uncovered background area frame memory 3204. The mapping unit 3207-2 supplies the generated predicted image to the synthesis unit 3209.
[0932]
The mapping unit 3207-3 performs the classified classification adaptation processing based on the coefficient set corresponding to the covered background area stored in the coefficient set memory 3208, and the covered background area frame memory 3205 stores the covered A predicted image corresponding to an image composed of pixels belonging to the background area is generated. The mapping unit 3207-3 supplies the generated predicted image to the synthesis unit 3209.
[0933]
Based on the coefficient set corresponding to the foreground area stored in the coefficient set memory 3208, the mapping unit 3207-4 uses the class classification adaptive processing to determine from the pixels belonging to the foreground area stored in the foreground area frame memory 3206. A predicted image corresponding to the resulting image is generated. The mapping unit 3207-4 supplies the generated predicted image to the synthesis unit 3209.
[0934]
The synthesizing unit 3209 corresponds to a predicted image corresponding to an image composed of pixels belonging to the background area supplied from the mapping unit 3207-1 and an image composed of pixels belonging to the uncovered background area supplied from the mapping unit 3207-2. Corresponding to a predicted image corresponding to an image made up of pixels belonging to the covered background area supplied from the mapping unit 3207-3, and an image made up of pixels belonging to the foreground area supplied from the mapping unit 3207-4. The predicted image is synthesized, and the synthesized predicted image is supplied to the frame memory 3210.
[0935]
The frame memory 3210 stores the prediction image supplied from the synthesis unit 3209 and outputs the stored image as an output image.
[0936]
Since the mapping units 3207-1 to 3207-4 have the same configuration as the mapping unit 1103, description thereof is omitted.
[0937]
An example of processing results of the image processing apparatus of the present invention having the area processing unit 3001 having the configuration shown in FIG. 134 will be described with reference to the images shown in FIGS.
[0938]
In the process of generating the result shown in the example, the total number of classes in the class classification adaptation process of the image processing apparatus of the present invention is the same as the number of classes in the conventional class classification adaptation process. That is, the number of classes in the conventional class classification adaptive processing is 2048, and the number of classes in the class classification adaptive processing of each area of the image processing apparatus of the present invention is 512 (four areas of 512 classes each give 2048 in total).
[0939]
In addition, the number of prediction taps in the conventional class classification adaptation process and the number of prediction taps in the class classification adaptation process of each region of the image processing apparatus of the present invention are both set to nine.
[0940]
A prediction result in the covered background region will be described with reference to FIGS. 135 to 137.
[0941]
FIG. 135A is a diagram illustrating an example of an image in the mixed region of the teacher images. FIG. 135 (B) is a diagram showing changes in pixel values corresponding to positions in the spatial direction of images in the mixed region of teacher images.
[0942]
FIG. 136A is a diagram illustrating an example of a mixed region image generated by the conventional class classification adaptation process corresponding to the teacher image illustrated in FIG. 135. FIG. 136B is a diagram showing a change in pixel value corresponding to the position in the spatial direction of the image in the mixed region generated by the conventional class classification adaptation process corresponding to the teacher image shown in FIG. 135.
[0943]
FIG. 137A is a diagram illustrating an example of a mixed region image generated by the region processing unit 3001 having the configuration illustrated in FIG. 134 corresponding to the teacher image illustrated in FIG. 135. FIG. 137B is a diagram showing a change in pixel value corresponding to the position in the spatial direction of the image in the mixed region generated by the region processing unit 3001 having the configuration shown in FIG. 134, corresponding to the teacher image shown in FIG. 135.
[0944]
The pixel value of the image in the mixed area generated by the conventional class classification adaptation process changes stepwise compared to the teacher image, and the stepwise change can also be confirmed visually in the actual generated image.
[0945]
On the other hand, the pixel value of the image in the mixed region generated by the region processing unit 3001 having the configuration shown in FIG. 134 changes more smoothly than the conventional one and shows a change closer to the teacher image. Even if the image generated by the region processing unit 3001 is visually confirmed, it can be confirmed that the image is smoother than the conventional image.
[0946]
A prediction result in the foreground region in which the pixel value changes substantially linearly with respect to the pixel position will be described with reference to FIGS. 138 to 140.
[0947]
FIG. 138A is a diagram illustrating an example of an image in the foreground area of the teacher image in which the pixel value changes substantially linearly. FIG. 138 (B) is a diagram illustrating a change in the pixel value corresponding to the position in the spatial direction of the image in the foreground region of the teacher image in which the pixel value changes substantially linearly.
[0948]
FIG. 139A is a diagram illustrating an example of an image in the foreground area corresponding to the image in FIG. 138 generated by the conventional class classification adaptation process. FIG. 139B is a diagram illustrating a change in pixel value corresponding to the position in the spatial direction of the image in the foreground region corresponding to the image of FIG. 138 generated by the conventional class classification adaptation process.
[0949]
FIG. 140A is a diagram illustrating an example of an image in the foreground area corresponding to the image in FIG. 138 generated by the area processing unit 3001 having the configuration illustrated in FIG. 134. FIG. 140B is a diagram showing a change in pixel value corresponding to the position in the spatial direction of the image in the foreground region corresponding to the image in FIG. 138 generated by the region processing unit 3001 having the configuration shown in FIG. 134.
[0950]
The pixel values of the image in the foreground area generated by the conventional class classification adaptation process change stepwise compared to the teacher image, as in the mixed area, and the stepwise change can also be confirmed visually in the actual image.
[0951]
On the other hand, the pixel value of the image in the foreground area generated by the area processing unit 3001 having the configuration shown in FIG. 134 changes more smoothly than in the past, and is very close to the teacher image. In visual confirmation of the image generated by the region processing unit 3001, no difference from the teacher image was recognized.
[0952]
Further, for a predetermined image, the SN ratio in each region of the image generated by the conventional class classification adaptive processing and the SN ratio in each region of the image generated by the class classification adaptive processing of the image processing apparatus according to the present invention were calculated and compared.
[0953]
The SN ratio of the image generated by the conventional class classification adaptation process was 32.1716 dB in the covered background area, 31.8744 dB in the uncovered background area, 31.8835 dB in the foreground area, and 31.9985 dB in the background area.
[0954]
On the other hand, the SN ratio of the image generated by the image processing apparatus according to the present invention was 32.1799 dB in the covered background area, 31.8922 dB in the uncovered background area, 32.0925 dB in the foreground area, and 32.0177 dB in the background area.
[0955]
As described above, the SN ratio of the image generated by the image processing apparatus according to the present invention is higher than the SN ratio of the image generated by the conventional classification adaptation process in any region.
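The comparison above can be checked with a few lines of arithmetic. The following is a minimal sketch that only restates the SN ratios quoted in the text; the dictionary layout and the helper name `sn_gains` are illustrative assumptions, not part of the described apparatus.

```python
# SN ratios (dB) quoted in the text, per region.
conventional = {
    "covered background": 32.1716,
    "uncovered background": 31.8744,
    "foreground": 31.8835,
    "background": 31.9985,
}
proposed = {
    "covered background": 32.1799,
    "uncovered background": 31.8922,
    "foreground": 32.0925,
    "background": 32.0177,
}


def sn_gains(before, after):
    """Return the per-region SN-ratio gain in dB (after minus before)."""
    return {region: round(after[region] - before[region], 4) for region in before}


gains = sn_gains(conventional, proposed)
for region, gain in gains.items():
    print(f"{region}: +{gain:.4f} dB")
```

With these figures every region shows a positive gain, and the gain is largest in the foreground area.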
[0956]
FIG. 141 is a flowchart for explaining image processing of the image processing apparatus having the configuration shown in FIG. 131.
[0957]
In step S3001, the region specifying unit 103 specifies the foreground region, the background region, the covered background region, and the uncovered background region of the input image based on the motion vector supplied from the motion detecting unit 102 and its position information.
[0958]
In step S3002, the area processing unit 3001 divides the input image into the specified foreground area, background area, covered background area, and uncovered background area, executes image processing for each of the divided foreground area, background area, covered background area, and uncovered background area, and the process ends.
[0959]
As described above, the image processing apparatus according to the present invention divides an input image into a foreground region, a background region, a covered background region, and an uncovered background region, and executes image processing for each of the divided foreground region, background region, covered background region, and uncovered background region.
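The division of an input image by region can be sketched as follows. This is a simplified illustration under stated assumptions: the region information is modeled as a per-pixel label map, pixels outside a region are marked `None`, and the names `split_by_region` and `REGIONS` are hypothetical.

```python
from typing import Dict, List, Optional

Image = List[List[Optional[int]]]

# Hypothetical region labels standing in for the region information.
REGIONS = ("background", "uncovered", "covered", "foreground")


def split_by_region(image: List[List[int]], labels: List[List[str]]) -> Dict[str, Image]:
    """Split an image into one image per region; pixels outside a region are None."""
    parts: Dict[str, Image] = {
        region: [[None] * len(row) for row in image] for region in REGIONS
    }
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            parts[labels[y][x]][y][x] = value
    return parts


# One-row toy image with one pixel in each region.
image = [[10, 20, 30, 40]]
labels = [["background", "uncovered", "foreground", "covered"]]
parts = split_by_region(image, labels)
print(parts["foreground"])
```

Each resulting image can then be processed independently, which is what allows a separate coefficient set to be used per region.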
[0960]
With reference to the flowchart in FIG. 142, a learning process for generating a coefficient set used for pixel value prediction by the class classification adaptive process by the region processing unit 3001 having the configuration shown in FIG. 133 will be described.
[0961]
In step S3101, the area dividing unit 3102 divides the teacher image stored in the teacher image frame memory 3101 based on the area information supplied from the area specifying unit 103. In other words, the region dividing unit 3102 supplies the background region teacher image frame memory 3103 with an image composed of pixels belonging to the background region of the teacher image, which is a region-divided teacher image. The area dividing unit 3102 supplies an image composed of pixels belonging to the uncovered background area of the teacher image, which is an area-divided teacher image, to the uncovered background area teacher image frame memory 3104.
[0962]
The area dividing unit 3102 supplies an image composed of pixels belonging to the covered background area of the teacher image, which is an area-divided teacher image, to the covered background area teacher image frame memory 3105. The area dividing unit 3102 supplies an image composed of pixels belonging to the foreground area of the teacher image, which is an area-divided teacher image, to the foreground area teacher image frame memory 3106.
[0963]
In step S3102, the weighted average units 3107-1 to 3107-4 generate student images of the background area, the foreground area, the uncovered background area, and the covered background area. That is, the weighted average unit 3107-1 performs, for example, a one-fourth weighted average of the images made up of pixels belonging to the background area of the teacher image stored in the background area teacher image frame memory 3103, and the teacher image A student image corresponding to an image composed of pixels belonging to the background area is generated. The weighted average unit 3107-2 performs, for example, a one-fourth weighted average of images made up of pixels belonging to the uncovered background area of the teacher image stored in the uncovered background area teacher image frame memory 3104. Then, a student image corresponding to an image composed of pixels belonging to the uncovered background area of the teacher image is generated.
[0964]
The weighted average unit 3107-3 performs, for example, a one-fourth weighted average of the images made up of pixels belonging to the covered background area of the teacher image stored in the covered background area teacher image frame memory 3105, and generates a student image corresponding to an image composed of pixels belonging to the covered background area of the teacher image. The weighted average unit 3107-4 performs, for example, a one-fourth weighted average of the images made up of the pixels belonging to the foreground area of the teacher image stored in the foreground area teacher image frame memory 3106, and generates a student image corresponding to an image composed of pixels belonging to the foreground area of the teacher image.
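As an illustration of how a student image might be derived from a teacher image, the sketch below assumes that the one-fourth weighted average means averaging each 2x2 block of teacher pixels with an equal weight of 1/4, which halves the resolution in each direction. The function name is hypothetical, and handling of pixels outside the region is omitted.

```python
from typing import List


def quarter_weighted_average(teacher: List[List[float]]) -> List[List[float]]:
    """Average each 2x2 block with weight 1/4 per pixel (assumed meaning)."""
    height, width = len(teacher), len(teacher[0])
    student = []
    for y in range(0, height - 1, 2):
        row = []
        for x in range(0, width - 1, 2):
            total = (teacher[y][x] + teacher[y][x + 1]
                     + teacher[y + 1][x] + teacher[y + 1][x + 1])
            row.append(total / 4)
        student.append(row)
    return student


# 4x4 toy teacher image reduced to a 2x2 student image.
teacher = [
    [10, 20, 30, 40],
    [30, 40, 50, 60],
    [0, 0, 8, 8],
    [4, 4, 0, 0],
]
student = quarter_weighted_average(teacher)
print(student)
```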
[0965]
In step S3103, the learning unit 3112-1 generates a coefficient set corresponding to the background area based on the image made up of pixels belonging to the background area of the teacher image stored in the background area teacher image frame memory 3103 and the student image, stored in the background area student image frame memory 3108, corresponding to the image made up of pixels belonging to the background area of the teacher image.
[0966]
In step S3104, the learning unit 3112-2 generates a coefficient set corresponding to the uncovered background area based on the image made up of pixels belonging to the uncovered background area of the teacher image stored in the uncovered background area teacher image frame memory 3104 and the student image, stored in the uncovered background area student image frame memory 3109, corresponding to the image made up of pixels belonging to the uncovered background area of the teacher image.
[0967]
In step S3105, the learning unit 3112-3 generates a coefficient set corresponding to the covered background area based on the image made up of pixels belonging to the covered background area of the teacher image stored in the covered background area teacher image frame memory 3105 and the student image, stored in the covered background area student image frame memory 3110, corresponding to the image made up of pixels belonging to the covered background area of the teacher image.
[0968]
In step S3106, the learning unit 3112-4 generates a coefficient set corresponding to the foreground area based on the image made up of pixels belonging to the foreground area of the teacher image stored in the foreground area teacher image frame memory 3106 and the student image, stored in the foreground area student image frame memory 3111, corresponding to the image made up of pixels belonging to the foreground area of the teacher image.
[0969]
In step S3107, the learning units 3112-1 to 3112-4 supply the coefficient set corresponding to the background area, the coefficient set corresponding to the uncovered background area, the coefficient set corresponding to the covered background area, and the coefficient set corresponding to the foreground area, respectively, to the coefficient set memory 3113. The coefficient set memory 3113 stores the coefficient sets corresponding to the background area, the foreground area, the uncovered background area, and the covered background area, and the process ends.
[0970]
As described above, the area processing unit 3001 having the configuration illustrated in FIG. 133 can generate a coefficient set corresponding to the background area, a coefficient set corresponding to the uncovered background area, a coefficient set corresponding to the covered background area, and a coefficient set corresponding to the foreground area.
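The learning for one region can be sketched as a least-squares fit via normal equations, in the spirit of the normal equation generation and coefficient calculation described for the conventional apparatus. This is a simplified sketch under assumptions: a single class, two prediction taps, toy data, and hypothetical function names; the actual learning units handle many classes and nine taps.

```python
from typing import List, Sequence, Tuple


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ w = b by Gaussian elimination with partial pivoting.

    Assumes the system is non-singular (enough independent samples).
    """
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    w = [0.0] * n
    for r in range(n - 1, -1, -1):
        acc = sum(m[r][c] * w[c] for c in range(r + 1, n))
        w[r] = (m[r][n] - acc) / m[r][r]
    return w


def learn_coefficients(samples: Sequence[Tuple[Sequence[float], float]]) -> List[float]:
    """Least-squares coefficients from (prediction taps, teacher pixel) pairs."""
    n = len(samples[0][0])
    a = [[0.0] * n for _ in range(n)]  # sum of tap_i * tap_j
    b = [0.0] * n                      # sum of tap_i * teacher pixel
    for taps, target in samples:
        for i in range(n):
            b[i] += taps[i] * target
            for j in range(n):
                a[i][j] += taps[i] * taps[j]
    return solve_linear(a, b)


# Toy data generated by: teacher = 0.25 * tap0 + 0.75 * tap1
samples = [((1.0, 0.0), 0.25), ((0.0, 1.0), 0.75), ((2.0, 2.0), 2.0)]
coefficients = learn_coefficients(samples)
print(coefficients)
```

Running the same routine once per region, each time on the student and teacher pixels of that region only, would yield one coefficient set per region.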
[0971]
Details of the processing in steps S3103 to S3106 are the same as the processing described with reference to the flowchart of FIG. 103, and thus description thereof is omitted.
[0972]
Of course, the processing in steps S3103 to S3106 may be executed serially or in parallel.
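Because the four learning steps work on separate regions, they share no intermediate state. The sketch below contrasts serial and parallel execution; `learn_region` is a hypothetical placeholder for one of steps S3103 to S3106, and the use of a thread pool is only one possible way to run them concurrently.

```python
from concurrent.futures import ThreadPoolExecutor

REGIONS = ["background", "uncovered background", "covered background", "foreground"]


def learn_region(region: str) -> str:
    """Hypothetical stand-in for one of steps S3103 to S3106."""
    return f"coefficient set for {region}"


# Serial execution: one region after another.
serial = {region: learn_region(region) for region in REGIONS}

# Parallel execution: the four steps are independent, so they can overlap.
with ThreadPoolExecutor(max_workers=4) as pool:
    parallel = dict(zip(REGIONS, pool.map(learn_region, REGIONS)))

print(serial == parallel)
```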
[0973]
Next, an image creation process of the area processing unit 3001 shown in FIG. 134 will be described with reference to the flowchart of FIG.
[0974]
In step S3201, the area dividing unit 3202 divides the input image into a background area, a foreground area, a covered background area, and an uncovered background area based on the area information supplied from the area specifying unit 103. That is, the area dividing unit 3202 supplies, as the divided input image, an image composed of pixels belonging to the background area to the background area frame memory 3203, an image composed of pixels belonging to the uncovered background area to the uncovered background area frame memory 3204, an image composed of pixels belonging to the covered background area to the covered background area frame memory 3205, and an image composed of pixels belonging to the foreground area to the foreground area frame memory 3206.
[0975]
In step S3202, the mapping unit 3207-1 predicts, by the class classification adaptation process based on the coefficient set corresponding to the background area stored in the coefficient set memory 3208, an image corresponding to the image composed of pixels belonging to the background area stored in the background area frame memory 3203.
[0976]
In step S3203, the mapping unit 3207-2 predicts, by the class classification adaptation process based on the coefficient set corresponding to the uncovered background area stored in the coefficient set memory 3208, an image corresponding to the image composed of pixels belonging to the uncovered background area stored in the uncovered background area frame memory 3204.
[0977]
In step S3204, the mapping unit 3207-3 predicts, by the class classification adaptation process based on the coefficient set corresponding to the covered background area stored in the coefficient set memory 3208, an image corresponding to the image composed of pixels belonging to the covered background area stored in the covered background area frame memory 3205.
[0978]
In step S3205, the mapping unit 3207-4 predicts, by the class classification adaptation process based on the coefficient set corresponding to the foreground area stored in the coefficient set memory 3208, an image corresponding to the image composed of pixels belonging to the foreground area stored in the foreground area frame memory 3206.
[0979]
In step S3206, the synthesis unit 3209 synthesizes the predicted image corresponding to the image composed of pixels belonging to the background area, the predicted image corresponding to the image composed of pixels belonging to the uncovered background area, the predicted image corresponding to the image composed of pixels belonging to the covered background area, and the predicted image corresponding to the foreground area. The synthesis unit 3209 supplies the synthesized image to the frame memory 3210. The frame memory 3210 stores the image supplied from the synthesis unit 3209.
[0980]
In step S3207, the frame memory 3210 outputs the stored synthesized image, and the process ends.
[0981]
As described above, the image processing apparatus having the area processing unit 3001 having the configuration illustrated in FIG. 134 can divide the input image into the background area, the uncovered background area, the covered background area, and the foreground area, and generate a predicted image for each of the divided images.
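The prediction and synthesis steps can be sketched as follows. The sketch assumes that each predicted pixel is a linear combination of prediction taps weighted by the coefficient set, and that each output pixel belongs to exactly one region; pixels outside a region are modeled as `None`. All names are hypothetical.

```python
from typing import Dict, List, Optional, Sequence

Image = List[List[Optional[float]]]


def predict_pixel(taps: Sequence[float], coefficients: Sequence[float]) -> float:
    """Assumed linear prediction: weighted sum of the prediction taps."""
    return sum(c * t for c, t in zip(coefficients, taps))


def synthesize(parts: Dict[str, Image]) -> Image:
    """Merge per-region predicted images into one output image.

    Each pixel is taken from the region image in which it is not None.
    """
    first = next(iter(parts.values()))
    output: Image = [[None] * len(row) for row in first]
    for region_image in parts.values():
        for y, row in enumerate(region_image):
            for x, value in enumerate(row):
                if value is not None:
                    output[y][x] = value
    return output


# Toy per-region predicted images for a one-row, four-pixel frame.
predicted_parts = {
    "background": [[1.0, None, None, None]],
    "uncovered background": [[None, 2.0, None, None]],
    "covered background": [[None, None, None, 4.0]],
    "foreground": [[None, None, 3.0, None]],
}
output_image = synthesize(predicted_parts)
print(output_image)
print(predict_pixel((10.0, 20.0), (0.25, 0.75)))
```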
[0982]
The details of the processing in steps S3202 to S3205 are the same as the processing described with reference to the flowchart of FIG. 109, and thus description thereof is omitted.
[0983]
Of course, the processing in steps S3202 to S3205 may be executed serially or in parallel.
[0984]
The processing executed by the motion blur removal image processing unit 108, the separated image processing unit 2002, and the region processing unit 3001 is not limited to generating coefficients corresponding to SD images and HD images or generating HD images from SD images. For example, coefficients for generating an image with even higher resolution in the spatial direction may be generated, and such an image may be generated. Furthermore, the motion blur removal image processing unit 108, the separated image processing unit 2002, and the region processing unit 3001 may execute processing for generating an image with higher resolution in the time direction.
[0985]
Note that the motion blur removal image processing unit 108, the separated image processing unit 2002, and the region processing unit 3001 may generate a coefficient from predetermined information and execute the class classification adaptive process based on the generated coefficient.
[0986]
In addition, the motion blur removal image processing unit 108, the separated image processing unit 2002, and the region processing unit 3001 may execute, based on the class classification processing, other processing such as converting an image size to a desired size, extracting a color signal such as RGB, noise removal, image compression, or encoding. For example, if the motion blur removal image processing unit 108, the separated image processing unit 2002, and the region processing unit 3001 compress the image of each region with a low compression rate in the direction along the motion vector and a high compression rate in the direction orthogonal to the motion vector, based on the classified class and the motion vector corresponding to each image, the compression ratio can be increased while reducing image degradation compared to the conventional case.
[0987]
Note that the direction of movement of the foreground object has been described from left to right, but it is needless to say that the direction is not limited thereto.
[0988]
In the above, the case where an image of the real space having three-dimensional space and time axis information is projected onto a time space having two-dimensional space and time axis information by using a video camera is taken as an example. However, the present invention is not limited to this example, and can be applied to cases where first information of a larger first dimension is projected onto second information of a smaller second dimension.
[0989]
The sensor is not limited to the CCD, and may be a solid-state image sensor, for example, a sensor such as BBD (Bucket Brigade Device), CID (Charge Injection Device), or CPD (Charge Priming Device). The sensor is not limited to a sensor arranged in a matrix, and may be a sensor in which detection elements are arranged in a line.
[0990]
As shown in FIG. 10, the recording medium on which the program for performing signal processing according to the present invention is recorded is constituted not only by package media that are distributed separately from the computer in order to provide the program to the user, such as a disc (including a registered trademark), the optical disc 92 (including CD-ROM (compact disc-read only memory) and DVD (digital versatile disc)), the magneto-optical disc 93 (including MD (mini-disc) (trademark)), or the semiconductor memory 94, but also by the ROM 72 on which the program is recorded, the hard disk included in the storage unit 78, and the like, which are provided to the user in a state of being incorporated in the computer in advance.
[0991]
In the present specification, the steps describing the program recorded on the recording medium include not only processing performed in chronological order according to the described order, but also processing that is executed individually without necessarily being performed in chronological order.
[0992]
[Effect of the invention]
According to the image processing apparatus and method, the recording medium, and the program of the present invention, based on the input image data, a mixed region in which foreground object components constituting a foreground object and background object components constituting a background object are mixed, and a non-mixed region made up of one of a foreground region made up of the foreground object components and a background region made up of the background object components, are specified; region specifying information corresponding to the specifying result is output; and the class corresponding to each piece of pixel data of the input image data is determined in accordance with the region specifying information. Therefore, the image can be processed in accordance with the mixing of the background image and the image of the moving object.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration of a conventional image processing apparatus.
FIG. 2 is a diagram illustrating class taps.
FIG. 3 is a diagram illustrating a prediction tap.
FIG. 4 is a diagram for explaining an outline of class classification adaptation processing.
FIG. 5 is a diagram illustrating a conventional coefficient set.
FIG. 6 is a flowchart illustrating a conventional learning process.
FIG. 7 is a block diagram illustrating a configuration of a conventional image processing apparatus.
FIG. 8 is a diagram illustrating a pixel value of an input image and a pixel value of an output image generated by the class classification adaptation process.
FIG. 9 is a flowchart illustrating a conventional image creation process.
FIG. 10 is a block diagram showing a configuration of an embodiment of an image processing apparatus according to the present invention.
FIG. 11 is a block diagram illustrating a functional configuration of the image processing apparatus.
FIG. 12 is a diagram illustrating imaging by a sensor.
FIG. 13 is a diagram illustrating an arrangement of pixels.
FIG. 14 is a diagram illustrating the operation of a detection element.
FIG. 15 is a diagram illustrating an image obtained by imaging an object corresponding to a moving foreground and an object corresponding to a stationary background.
FIG. 16 is a diagram illustrating a background area, a foreground area, a mixed area, a covered background area, and an uncovered background area.
FIG. 17 is a model diagram in which pixel values of pixels arranged in a row adjacent to each other in an image obtained by capturing an object corresponding to a stationary foreground and an object corresponding to a stationary background are developed in the time direction.
FIG. 18 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 19 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 20 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 21 is a diagram illustrating an example in which pixels in a foreground area, a background area, and a mixed area are extracted.
FIG. 22 is a diagram illustrating a correspondence between a pixel and a model in which pixel values are expanded in the time direction.
FIG. 23 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 24 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 25 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 26 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 27 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 28 is a diagram illustrating a correspondence between a divided image and a model diagram in which pixel values of pixels are developed in the time direction.
FIG. 29 is a diagram illustrating a correspondence between a separated image and a model diagram in which pixel values of pixels are developed in a time direction.
FIG. 30 is a diagram illustrating an example of a divided image.
FIG. 31 is a diagram illustrating an example of a separated image.
FIG. 32 is a diagram illustrating a correspondence between an image from which motion blur is removed and a model diagram in which pixel values of pixels are developed in the time direction.
FIG. 33 is a diagram for explaining processing of the image processing apparatus according to the present invention.
FIG. 34 is a flowchart illustrating image processing of the image processing apparatus according to the present invention.
FIG. 35 is a block diagram illustrating an example of a configuration of a region specifying unit 103.
FIG. 36 is a diagram illustrating an image when an object corresponding to the foreground is moving.
FIG. 37 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 38 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 39 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 40 is a diagram for explaining region determination conditions.
FIG. 41 is a diagram illustrating an example of a result of specifying a region by the region specifying unit 103.
FIG. 42 is a diagram illustrating an example of a result of specifying a region by the region specifying unit 103.
FIG. 43 is a flowchart illustrating an area specifying process.
FIG. 44 is a block diagram illustrating another example of the configuration of the area specifying unit 103.
FIG. 45 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 46 is a diagram illustrating an example of a background image.
FIG. 47 is a block diagram showing a configuration of a binary object image extraction unit 302.
FIG. 48 is a diagram illustrating calculation of correlation values.
FIG. 49 is a diagram illustrating calculation of correlation values.
FIG. 50 is a diagram illustrating an example of a binary object image.
FIG. 51 is a block diagram showing a configuration of a time change detection unit 303.
FIG. 52 is a diagram for explaining determination by an area determination unit 342.
FIG. 53 is a diagram showing an example of determination by a time change detection unit 303.
FIG. 54 is a flowchart for describing region specifying processing by the region specifying unit 103.
FIG. 55 is a flowchart illustrating details of a region determination process.
FIG. 56 is a block diagram showing still another configuration of the area specifying unit 103.
FIG. 57 is a block diagram illustrating a configuration of a robust unit 361.
FIG. 58 is a diagram for explaining motion compensation of a motion compensation unit 381.
FIG. 59 is a diagram for explaining motion compensation by a motion compensation unit 381.
FIG. 60 is a flowchart illustrating an area specifying process.
FIG. 61 is a flowchart illustrating details of robust processing.
FIG. 62 is a block diagram illustrating an example of a configuration of a mixture ratio calculation unit 104.
FIG. 63 is a diagram illustrating an example of an ideal mixing ratio α.
FIG. 64 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 65 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 66 is a diagram for explaining approximation using the correlation of foreground components.
FIG. 67 is a diagram for explaining a relationship between C, N, and P.
FIG. 68 is a block diagram showing a configuration of an estimated mixture ratio processing unit 401.
FIG. 69 is a diagram illustrating an example of an estimated mixture ratio.
FIG. 70 is a block diagram showing another configuration of the mixture ratio calculation unit 104.
FIG. 71 is a flowchart illustrating processing for calculating a mixture ratio.
FIG. 72 is a flowchart illustrating processing for calculating an estimated mixture ratio.
FIG. 73 is a diagram illustrating a straight line approximating the mixture ratio α.
FIG. 74 is a diagram illustrating a plane that approximates the mixture ratio α.
FIG. 75 is a diagram for explaining the correspondence of pixels in a plurality of frames when calculating the mixture ratio α.
FIG. 76 is a block diagram illustrating another configuration of the estimated mixture ratio processing unit 401.
FIG. 77 is a diagram illustrating an example of an estimated mixture ratio.
FIG. 78 is a flowchart illustrating processing for calculating a mixture ratio.
FIG. 79 is a flowchart for describing mixture ratio estimation processing using a model corresponding to a covered background region.
FIG. 80 is a block diagram illustrating an exemplary configuration of a foreground/background separation unit 105.
FIG. 81 is a diagram illustrating an input image, a foreground component image, and a background component image.
FIG. 82 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 83 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 84 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 85 is a block diagram illustrating an example of a configuration of a separation unit 601.
FIG. 86 is a diagram illustrating an example of separated foreground component images and background component images.
FIG. 87 is a flowchart for describing foreground and background separation processing.
FIG. 88 is a block diagram illustrating an example of a configuration of a motion blur removal unit 106.
FIG. 89 is a diagram for describing a processing unit.
FIG. 90 is a model diagram in which pixel values of a foreground component image are expanded in the time direction and a period corresponding to a shutter time is divided.
FIG. 91 is a model diagram in which pixel values of a foreground component image are expanded in a time direction and a period corresponding to a shutter time is divided.
FIG. 92 is a model diagram in which pixel values of a foreground component image are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 93 is a flowchart for describing processing by which the motion blur removal unit 106 removes motion blur included in the foreground component image.
FIG. 94 is a diagram showing a model of a background component image.
FIG. 95 is a diagram showing a model of a corrected background component image.
FIG. 96 is a block diagram illustrating a configuration of a motion blur removal image processing unit that generates a coefficient set.
FIG. 97 is a diagram illustrating a relationship between a teacher image and a student image.
FIG. 98 is a block diagram showing a configuration of a learning unit 1006.
FIG. 99 is a diagram for explaining class classification processing.
FIG. 100 is a diagram for describing ADRC processing.
FIG. 101 is a diagram illustrating a coefficient set generated by a motion blur removal image processing unit.
FIG. 102 is a flowchart for describing learning processing for generating a coefficient set by the motion blur removal image processing unit.
FIG. 103 is a flowchart illustrating processing for generating a coefficient set corresponding to a background component image.
FIG. 104 is a block diagram illustrating a configuration of a motion blur removal image processing unit that executes class classification adaptation processing and generates a higher-resolution image in the spatial direction.
FIG. 105 is a diagram illustrating a model of a foreground component image from which motion blur has been removed.
FIG. 106 is a diagram illustrating a model of a foreground component image to which motion blur is added.
FIG. 107 is a block diagram showing a configuration of a mapping unit 1103.
FIG. 108 is a flowchart for describing image creation processing by the motion blur removal image processing unit.
FIG. 109 is a flowchart for describing prediction processing of an image corresponding to a background component image.
FIG. 110 is a block diagram illustrating another configuration of the functions of the image processing apparatus.
FIG. 111 is a block diagram illustrating an example of a configuration of a mixture ratio calculation unit 1501.
FIG. 112 is a block diagram illustrating an example of a configuration of a foreground/background separation unit 1502.
FIG. 113 is a block diagram illustrating another configuration of the functions of the image processing apparatus.
FIG. 114 is a diagram for describing processing of a separated image processing unit 2002.
FIG. 115 is a block diagram illustrating an example of a configuration of a foreground/background separation unit 2001.
FIG. 116 is a block diagram illustrating an example of a configuration of a separation unit 2101.
FIG. 117 is a block diagram illustrating a configuration of a separated image processing unit 2002 that generates a coefficient set.
FIG. 118 is a block diagram illustrating a configuration of a separated image processing unit 2002 that generates a higher-resolution image in the spatial direction.
FIG. 119 is a diagram illustrating an example of an image in a mixed region of teacher images.
FIG. 120 is a diagram illustrating an example of a mixed region image generated by conventional class classification adaptation processing.
FIG. 121 is a diagram illustrating an example of a mixed region image generated by the separated image processing unit 2002.
FIG. 122 is a diagram illustrating an example of an image in a foreground area of a teacher image.
FIG. 123 is a diagram illustrating an example of an image in the foreground area generated by a conventional class classification adaptation process.
FIG. 124 is a diagram showing an example of an image in the foreground area generated by the separated image processing unit 2002.
FIG. 125 is a flowchart for describing image processing of the image processing apparatus having the configuration shown in FIG. 113.
FIG. 126 is a flowchart for describing foreground/background separation processing by a foreground/background separation unit 2001.
FIG. 127 is a flowchart for describing learning processing for generating a coefficient set by the separated image processing unit 2002.
FIG. 128 is a flowchart for describing image creation processing by the separated image processing unit 2002.
FIG. 129 is a block diagram illustrating still another configuration of the function of the image processing apparatus.
FIG. 130 is a block diagram illustrating an exemplary configuration of a foreground/background separation unit 2501.
FIG. 131 is a block diagram illustrating still another configuration of the function of the image processing apparatus.
FIG. 132 is a diagram for describing processing of an area processing unit 3001.
FIG. 133 is a block diagram illustrating a configuration of an area processing unit 3001 that generates a coefficient set.
FIG. 134 is a block diagram illustrating a configuration of an area processing unit 3001 that generates a higher-resolution image in the spatial direction.
FIG. 135 is a diagram illustrating an example of an image in a mixed region of teacher images.
FIG. 136 is a diagram showing an example of an image of a mixed region generated by conventional class classification adaptation processing.
FIG. 137 is a diagram showing an example of an image of a mixed region generated by an area processing unit 3001.
FIG. 138 is a diagram illustrating an example of an image in a foreground area of a teacher image.
FIG. 139 is a diagram illustrating an example of an image of a foreground area generated by a conventional class classification adaptation process.
FIG. 140 is a diagram illustrating an example of an image of a foreground area generated by the area processing unit 3001.
FIG. 141 is a flowchart illustrating image processing of the image processing apparatus according to the present invention.
FIG. 142 is a flowchart for describing learning processing for generating a coefficient set by the area processing unit 3001.
FIG. 143 is a flowchart for describing image creation processing by the area processing unit 3001.
[Explanation of symbols]
71 CPU, 72 ROM, 73 RAM, 76 input unit, 77 output unit, 78 storage unit, 79 communication unit, 91 magnetic disk, 92 optical disk, 93 magneto-optical disk, 94 semiconductor memory, 101 object extraction unit, 102 motion detection unit, 103 area specifying unit, 104 mixture ratio calculation unit, 105 foreground/background separation unit, 106 motion blur removal unit, 107 correction unit, 108 motion blur removal image processing unit, 201 frame memory, 202-1 to 202-4 static/motion determination unit, 203-1 to 203-3 area determination unit, 204 determination flag storage frame memory, 205 synthesis unit, 206 determination flag storage frame memory, 301 background image generation unit, 302 binary object image extraction unit, 303 time change detection unit, 321 correlation value calculation unit, 322 threshold processing unit, 341 frame memory, 342 area determination unit, 361 robust unit, 381 motion compensation unit, 382 switch, 383-1 to 383-N frame memory, 384-1 to 384-N weighting unit, 385 integration unit, 401 estimated mixture ratio processing unit, 402 estimated mixture ratio processing unit, 403 mixture ratio determination unit, 421 frame memory, 422 frame memory, 423 mixture ratio calculation unit, 441 selection unit, 442 estimated mixture ratio processing unit, 443 estimated mixture ratio processing unit, 444 selection unit, 501 delay circuit, 502 addition unit, 503 operation unit, 601 separation unit, 602 switch, 603 synthesis unit, 604 switch, 605 synthesis unit, 621 frame memory, 622 separation processing block, 623 frame memory, 631 uncovered area processing unit, 632 covered area processing unit, 633 synthesis unit, 634 synthesis unit, 801 processing unit determination unit, 802 modeling unit, 803 equation generation unit, 804 addition unit, 805 calculation unit, 1001 background component teacher image frame memory, 1002 foreground component teacher image frame memory, 1003-1 and 1003-2 weighted average unit, 1004 background component student image frame memory, 1005 foreground component student image frame memory, 1006-1 and 1006-2 learning unit, 1007 coefficient set memory, 1031 class classification unit, 1032 prediction tap acquisition unit, 1033 corresponding pixel acquisition unit, 1034 normal equation generation unit, 1035 coefficient calculation unit, 1051 class tap acquisition unit, 1052 waveform classification unit, 1101 background component image frame memory, 1102 foreground component image frame memory, 1103-1 and 1103-2 mapping unit, 1104 coefficient set memory, 1105 correction unit, 1106 motion blur addition unit, 1107 synthesis unit, 1131 mapping processing unit, 1141 class classification unit, 1142 prediction tap acquisition unit, 1143 prediction calculation unit, 1151 class tap acquisition unit, 1152 waveform classification unit, 1501 mixture ratio calculation unit, 1502 foreground/background separation unit, 1521 selection unit, 2001 foreground/background separation unit, 2002 separated image processing unit, 2101 separation unit, 2102 switch, 2103 switch, 2201 background area teacher image frame memory, 2202 uncovered background area background component teacher image frame memory, 2203 uncovered background area foreground component teacher image frame memory, 2204 covered background area background component teacher image frame memory, 2205 covered background area foreground component teacher image frame memory, 2206 foreground area teacher image frame memory, 2207-1 to 2207-6 weighted average unit, 2208 background area student image frame memory, 2209 uncovered background area background component student image frame memory, 2210 uncovered background area foreground component student image frame memory, 2211 covered background area background component student image frame memory, 2212 covered background area foreground component student image frame memory, 2213 foreground area student image frame memory, 2214-1 to 2214-6 learning unit, 2215 coefficient set memory, 2301 background area frame memory, 2302 uncovered background area background component image frame memory, 2303 uncovered background area foreground component image frame memory, 2304 covered background area background component image frame memory, 2305 covered background area foreground component image frame memory, 2306 foreground area frame memory, 2307-1 to 2307-6 mapping unit, 2308 coefficient set memory, 2309 synthesis unit, 2501 foreground/background separation unit, 2521 selection unit, 3001 area processing unit, 3102 area dividing unit, 3103 background area teacher image frame memory, 3104 uncovered background area teacher image frame memory, 3105 covered background area teacher image frame memory, 3106 foreground area teacher image frame memory, 3107-1 to 3107-4 weighted average unit, 3108 background area student image frame memory, 3109 uncovered background area student image frame memory, 3110 covered background area student image frame memory, 3111 foreground area student image frame memory, 3112-1 to 3112-4 learning unit, 3113 coefficient set memory, 3202 area dividing unit, 3203 background area frame memory, 3204 uncovered background area frame memory, 3205 covered background area frame memory, 3206 foreground area frame memory, 3207-1 to 3207-4 mapping unit, 3208 coefficient set memory, 3209 synthesis unit

Claims

An image processing apparatus for processing an input image composed of a predetermined number of pieces of pixel data acquired by an imaging device having a predetermined number of pixels each having a time integration effect, the apparatus comprising:
region specifying means for determining, by comparing with a predetermined threshold value the absolute value of the difference between the pixel values of pixels at the same position in the input image and in the input images acquired immediately before and after the input image, whether the pixel moves between the input image and the immediately preceding input image and whether the pixel moves between the input image and the immediately following input image, for specifying, based on the determination results, at least a foreground region consisting of foreground object components that constitute a foreground object of the input image and a background region consisting of background object components that constitute a background object, and for outputting region specifying information indicating the result of the specification;
prediction coefficient generating means for, using the input image input as learning data and for each of the image of the foreground region and the image of the background region, taking the input image as a teacher image and an image obtained by lowering the resolution of the input image as a student image, expressing a predetermined pixel of the teacher image as a first-order linear combination of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtaining in advance the prediction coefficients that minimize the error between the predicted pixel value given by the linear combination and the pixel value of the predetermined pixel of the teacher image; and
conversion means for converting each of the image of the foreground region and the image of the background region of the input image input as prediction processing data into a high-resolution output image by computing the first-order linear combination of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image.
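
The static/motion determination recited for the region specifying means can be illustrated with a minimal sketch. It assumes grayscale frames stored as lists of rows, a hypothetical threshold of 10, and hypothetical function names. The claim only states that regions are specified "based on the determination results"; the rule used here (moving on both sides means foreground, still on both sides means background) is an assumption made for illustration.

```python
def is_moving(frame_a, frame_b, y, x, threshold):
    """Return True when the absolute frame difference at (y, x) exceeds the threshold."""
    return abs(frame_a[y][x] - frame_b[y][x]) > threshold


def specify_regions(prev_frame, cur_frame, next_frame, threshold=10):
    """Label each pixel of cur_frame as 'foreground', 'background' or 'other'.

    Assumed rule (not taken from the claim wording): a pixel that moves both
    before and after the current frame is foreground; a pixel that is still
    on both sides is background; anything else is left as 'other'.
    """
    labels = []
    for y in range(len(cur_frame)):
        row = []
        for x in range(len(cur_frame[0])):
            moved_before = is_moving(prev_frame, cur_frame, y, x, threshold)
            moved_after = is_moving(cur_frame, next_frame, y, x, threshold)
            if moved_before and moved_after:
                row.append("foreground")
            elif not moved_before and not moved_after:
                row.append("background")
            else:
                row.append("other")
        labels.append(row)
    return labels
```

The returned label map plays the role of the region specifying information: later stages would use it to choose which coefficient set applies to each pixel.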

The image processing apparatus according to claim 1, wherein the region specifying means further uses the results of determining whether the pixel moves between the input images acquired two before and one before the input image and between the input images acquired one after and two after the input image, and further specifies, within a mixed region in which the foreground object components and the background object components are mixed, a covered background region, which is a region that changes from the background object components to the foreground object components as time elapses, and an uncovered background region, which is a region that changes from the foreground object components to the background object components as time elapses,
the prediction coefficient generating means, using the input image input as learning data, also obtains the prediction coefficients in advance for each of the image of the covered background region and the image of the uncovered background region, and
the conversion means also converts each of the image of the covered background region and the image of the uncovered background region of the input image input as prediction processing data into the high-resolution output image.
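
A sketch of how the additional motion determinations might separate covered and uncovered background pixels follows. The decision rule and its evaluation order are assumptions, since the claim does not spell them out: a pixel is treated as covered background when it is still between the frames two and one before and moving between the frame one before and the current frame, and as uncovered background when it is moving between the current frame and the frame one after and still between the frames one and two after. The function name and threshold are hypothetical.

```python
def classify_pixel(p_m2, p_m1, p_0, p_p1, p_p2, threshold=10):
    """Classify one pixel position from its values in frames n-2 .. n+2.

    The rule and the order of the tests are illustrative assumptions.
    """
    def moving(a, b):
        # Static/motion determination by thresholded absolute difference.
        return abs(a - b) > threshold

    if not moving(p_m2, p_m1) and moving(p_m1, p_0):
        return "covered_background"
    if moving(p_0, p_p1) and not moving(p_p1, p_p2):
        return "uncovered_background"
    if moving(p_m1, p_0) and moving(p_0, p_p1):
        return "foreground"
    if not moving(p_m1, p_0) and not moving(p_0, p_p1):
        return "background"
    return "undetermined"
```

Applying the function at every pixel position yields a four-way label map, so that a separate coefficient set can be learned and applied for each kind of region.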

The image processing apparatus according to claim 2, further comprising a synthesis unit that synthesizes and outputs the output images obtained by converting to high resolution, respectively, the image of the foreground region, the image of the background region, the image of the covered background region, and the image of the uncovered background region of the input image input as prediction processing data.
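
The synthesis can be pictured as a per-pixel selection. The sketch below assumes, purely for illustration, that each region's converted image is held as a full-size array and that a label map at the output resolution is available; the function name and data layout are hypothetical.

```python
def synthesize(labels, converted_by_region):
    """Build one output image by taking each pixel from the image converted for its region.

    labels: 2-D list of region names at output resolution (assumed layout).
    converted_by_region: dict mapping region name to a 2-D list of pixel values.
    """
    return [
        [converted_by_region[label][y][x] for x, label in enumerate(row)]
        for y, row in enumerate(labels)
    ]
```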

An image processing method for processing an input image composed of a predetermined number of pieces of pixel data acquired by an imaging device having a predetermined number of pixels each having a time integration effect, the method comprising:
a region specifying step of determining, by comparing with a predetermined threshold value the absolute value of the difference between the pixel values of pixels at the same position in the input image and in the input images acquired immediately before and after the input image, whether the pixel moves between the input image and the immediately preceding input image and whether the pixel moves between the input image and the immediately following input image, specifying, based on the determination results, at least a foreground region consisting of foreground object components that constitute a foreground object of the input image and a background region consisting of background object components that constitute a background object, and outputting region specifying information indicating the result of the specification;
a prediction coefficient generation step of, using the input image input as learning data and for each of the image of the foreground region and the image of the background region, taking the input image as a teacher image and an image obtained by lowering the resolution of the input image as a student image, expressing a predetermined pixel of the teacher image as a first-order linear combination of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtaining in advance the prediction coefficients that minimize the error between the predicted pixel value given by the linear combination and the pixel value of the predetermined pixel of the teacher image; and
a conversion step of converting each of the image of the foreground region and the image of the background region of the input image input as prediction processing data into a high-resolution output image by computing the first-order linear combination of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image.
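
The prediction coefficient generation step and the conversion step amount to a least-squares fit followed by a linear prediction. The sketch below is a simplified illustration under stated assumptions: it forms and solves the normal equations for a single region and ignores class classification; the function names and the sample layout (a list of student-pixel taps paired with one teacher pixel value) are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the caller's data is not modified.
    m = [list(a[i]) + [b[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("normal equations are singular; more samples are needed")
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(col + 1, n):
            factor = m[row][col] / m[col][col]
            for k in range(col, n + 1):
                m[row][k] -= factor * m[col][k]
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(m[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (m[row][n] - tail) / m[row][row]
    return x


def learn_coefficients(samples):
    """Return coefficients minimizing the squared prediction error.

    samples: list of (taps, teacher_value), where taps are student-image
    pixels around the position corresponding to the teacher pixel.
    """
    n = len(samples[0][0])
    ata = [[0.0] * n for _ in range(n)]  # left side of the normal equations
    aty = [0.0] * n                      # right side of the normal equations
    for taps, teacher in samples:
        for i in range(n):
            aty[i] += taps[i] * teacher
            for j in range(n):
                ata[i][j] += taps[i] * taps[j]
    return solve_linear_system(ata, aty)


def predict(coefficients, taps):
    """Conversion step: first-order linear combination of coefficients and taps."""
    return sum(w * x for w, x in zip(coefficients, taps))
```

In use, `learn_coefficients` would be called once per region (foreground, background, and so on) on samples drawn only from that region, and `predict` would then be applied with the coefficient set that matches the region of each pixel being converted.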

A computer-readable recording medium on which is recorded a program for causing a computer to execute:
a region specifying step of determining, by comparing with a predetermined threshold value the absolute value of the difference between the pixel values of pixels at the same position in an input image, composed of a predetermined number of pieces of pixel data acquired by an imaging device having a predetermined number of pixels each having a time integration effect, and in the input images acquired immediately before and after the input image, whether the pixel moves between the input image and the immediately preceding input image and whether the pixel moves between the input image and the immediately following input image, specifying, based on the determination results, at least a foreground region consisting of foreground object components that constitute a foreground object of the input image and a background region consisting of background object components that constitute a background object, and outputting region specifying information indicating the result of the specification;
a prediction coefficient generation step of, using the input image input as learning data and for each of the image of the foreground region and the image of the background region, taking the input image as a teacher image and an image obtained by lowering the resolution of the input image as a student image, expressing a predetermined pixel of the teacher image as a first-order linear combination of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtaining in advance the prediction coefficients that minimize the error between the predicted pixel value given by the linear combination and the pixel value of the predetermined pixel of the teacher image; and
a conversion step of converting each of the image of the foreground region and the image of the background region of the input image input as prediction processing data into a high-resolution output image by computing the first-order linear combination of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image.

A program for causing a computer that processes an input image composed of a predetermined number of pieces of pixel data acquired by an imaging device having a predetermined number of pixels each having a time integration effect to execute:
a region specifying step of determining, by comparing with a predetermined threshold value the absolute value of the difference between the pixel values of pixels at the same position in the input image and in the input images acquired immediately before and after the input image, whether the pixel moves between the input image and the immediately preceding input image and whether the pixel moves between the input image and the immediately following input image, specifying, based on the determination results, at least a foreground region consisting of foreground object components that constitute a foreground object of the input image and a background region consisting of background object components that constitute a background object, and outputting region specifying information indicating the result of the specification;
a prediction coefficient generation step of, using the input image input as learning data and for each of the image of the foreground region and the image of the background region, taking the input image as a teacher image and an image obtained by lowering the resolution of the input image as a student image, expressing a predetermined pixel of the teacher image as a first-order linear combination of prediction coefficients and a plurality of pixels around the corresponding pixel of the student image, and obtaining in advance the prediction coefficients that minimize the error between the predicted pixel value given by the linear combination and the pixel value of the predetermined pixel of the teacher image; and
a conversion step of converting each of the image of the foreground region and the image of the background region of the input image input as prediction processing data into a high-resolution output image by computing the first-order linear combination of the prediction coefficients obtained in advance and a plurality of pixels around a pixel of the input image.