JP4333183B2

JP4333183B2 - Image processing apparatus and method, recording medium, and program

Info

Publication number: JP4333183B2
Application number: JP2003094660A
Authority: JP
Inventors: 哲二郎近藤; 貴志沢尾; 淳一石橋; 隆浩永野; 直樹藤原; 徹三宅; 成司和田
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2003-03-31
Filing date: 2003-03-31
Publication date: 2009-09-16
Anticipated expiration: 2023-03-31
Also published as: JP2004302823A

Description

【０００１】
【発明の属する技術分野】
本発明は、画像処理装置および方法、記録媒体、並びにプログラムに関し、特に、破綻のない画像を合成できるようにした画像処理装置および方法、記録媒体、並びにプログラムに関する。
【０００２】
【従来の技術】
前景成分画像と、背景成分画像とを１枚の画像に合成する画像合成技術が一般に普及している。すなわち、画像合成技術とは、第１の画像に含まれている前景成分画像と、第２の画像に含まれている背景成分画像を合成することにより、１枚の合成画像を生成すると言うものである。
【０００３】
また、前景成分画像の動きボケを調整して、背景成分画像と合成するものがある。（例えば、特許文献１参照）。
【０００４】
【特許文献１】
特開２００３−１８５６４号公報（特願２００１−１９５６０５）
【０００５】
【発明が解決しようとする課題】
ところで、上述した従来手法を用いて、図１Ａで示されるような合成画像を生成した場合、その前景成分画像と背景成分画像のそれぞれの混合比は、図１Ｂのような分布となる。図１Ｂは、各画素毎の前景成分と背景成分の混合比の分布を色で示している。図１Ｂにおいて、白は前景成分の割合が１で背景成分が０を示し、黒は、逆に、前景成分の割合が０で背景成分が１であることを示している。従って、前景成分と背景成分の割合がそれぞれ1/2の場合、灰色（白と黒の中間色）が表示されていることになる。
【０００６】
図１Ａ，Ｂにおいて、前景成分となっている画像は、水平方向に動きのある画像であるため、背景成分に対して動きボケが生じており、このため、前景成分と背景成分の間の混合領域には、それぞれの成分が混合した混合領域が存在する。
【０００７】
また、図２は、図１Ａ上の前景成分画像の画素と背景成分画像の画素値の分布を示したものであり、水平方向と垂直方向の座標位置上の画素値の分布が示されている。
【０００８】
図１Ａ，Ｂ、または、図２で示されるように、前景成分画像と背景成分画像の境界付近は、階段状に画素値や混合比が変化している。このため、合成された画像には、混合領域において破綻が生じて、見た目に不自然なものとなってしまうという課題があった。
【０００９】
これは、前景成分画像、および、背景成分画像のそれぞれを画素単位で求めていたため境界部分は、図３で示されるように、動きボケが除去された前景成分画像と背景成分画像とを合成してみるとわかるように、形状の連続性が保たれず、階段状にギザギザとした形状になってしまうからである。
【００１０】
すなわち、図４で示されるように、CCD（Charge Coupled Device）などに入射される光を、各画素毎に表現していた。図４では、背景成分画像の光の発する領域を黒色で、前景成分画像の光の発する部分を斜線部で示しており、発せられた光をCCDの各画素に見立てたコップ状の入れ物に充填させることにより、各画素が表現されることを示している。図４の場合、左から４番目までの画素は前景成分画像を、右から６番目までの背景成分画像を、それぞれ示す画素値となっている。つまり、図４においては、各画素単位で前景成分画像と背景成分画像が表現されている。しかしながら、実際の前景成分画像と背景成分画像の境界は、CCDの各画素単位で存在するわけではなく、その境界が画素上となることも考えられる。
【００１１】
例えば、図５で示されるように、実際には、CCDの画素上に前景成分画像と背景成分画像の境界が存在することもある。図５においては、左から５番目の画素の中央位置までが前景成分画像であり、右から６番目の画素の中央位置までが背景成分画像を表しており、左から５番目の画素（右から６番目の画素）の中央位置が境界となっている。この場合、実際には、左から５番目の画素は、前景成分画像と、背景成分画像が、混合した画素値となるはずである。
【００１２】
しかしながら、上述した前景成分画像と背景成分画像は、画素毎に分けられているため、これらを合成し、動きボケを付加しても、その境界部分が階段状となってしまい、違和感のある画像になってしまうと言う課題があった。
【００１３】
本発明はこのような状況に鑑みてなされたものであり、前景成分画像と背景成分画像との境界となる混合領域をより詳細に表現することにより正確に合成画像を生成できるようにするものである。
【００１４】
【課題を解決するための手段】
本発明の画像処理装置は、動きボケの無い前景画像のうち、付加する動きの方向の輪郭部の端部の画素を検出する画素検出手段と、画素検出手段により検出された端部の画素の画素値と、端部の画素の周辺に位置する前景画像内の周辺画素から外挿補間されることにより推定画素値との比を検出し、端部の混合比とする混合比検出手段と、前景画像に対して所定の動きボケを付加した動きボケ付加前景画像を生成する動きボケ付加手段と、動きボケ付加前景画像及び端部の混合比に基いて、前景画像を背景画像に合成する合成手段とを備えることを特徴とする。
【００１５】
本発明の画像処理方法は、動きボケの無い前景画像のうち、付加する動きの方向の輪郭部の端部の画素を検出する画素検出ステップと、画素検出ステップの処理で検出された端部の画素の画素値と、端部の画素の周辺に位置する前景画像内の周辺画素から外挿補間されることにより推定される推定画素値との比を検出し、端部の混合比とする混合比検出ステップと、前景画像に対して所定の動きボケを付加した動きボケ付加前景画像を生成する動きボケ付加ステップと、動きボケ付加前景画像及び端部の混合比に基いて、前景画像を背景画像に合成する合成ステップとを含むことを特徴とする。
【００１６】
本発明の記録媒体のプログラムは、動きボケの無い前景画像のうち、付加する動きの方向の輪郭部の端部の画素を検出する画素検出ステップと、画素検出ステップの処理で検出された端部の画素の画素値と、端部の画素の周辺に位置する前景画像内の周辺画素から外挿補間されることにより推定される推定画素値との比を検出し、端部の混合比とする混合比検出ステップと、前景画像に対して所定の動きボケを付加した動きボケ付加前景画像を生成する動きボケ付加ステップと、動きボケ付加前景画像及び端部の混合比に基いて、前景画像を背景画像に合成する合成ステップとを含むことを特徴とする。
【００１７】
本発明のプログラムは、動きボケの無い前景画像のうち、付加する動きの方向の輪郭部の端部の画素を検出する画素検出ステップと、画素検出ステップの処理で検出された端部の画素の画素値と、端部の画素の周辺に位置する前景画像内の周辺画素から外挿補間されることにより推定される推定画素値との比を検出し、端部の混合比とする混合比検出ステップと、前景画像に対して所定の動きボケを付加した動きボケ付加前景画像を生成する動きボケ付加ステップと、動きボケ付加前景画像及び端部の混合比に基いて、前景画像を背景画像に合成する合成ステップとをコンピュータに実行させることを特徴とする。
【００１８】
本発明の画像処理装置および方法、並びにプログラムにおいては、動きボケの無い前景画像のうち、付加する動きの方向の輪郭部の端部の画素が検出され、検出された端部の画素の画素値と、端部の画素の周辺に位置する前景画像内の周辺画素から外挿補間されることにより推定される推定画素値との比を検出され、端部の混合比とされ、前景画像に対して所定の動きボケを付加した動きボケ付加前景画像が生成され、動きボケ付加前景画像及び端部の混合比に基いて、前景画像が背景画像に合成される。
【００１９】
本発明の画像処理装置は、独立した装置であっても良いし、画像処理を行うブロックであっても良い。
【００２０】
【発明の実施の形態】
以下に本発明の実施の形態を説明するが、請求項に記載の構成要件と、発明の実施の形態における具体例との対応関係を例示すると、次のようになる。この記載は、請求項に記載されている発明をサポートする具体例が、発明の実施の形態に記載されていることを確認するためのものである。従って、発明の実施の形態中には記載されているが、構成要件に対応するものとして、ここには記載されていない具体例があったとしても、そのことは、その具体例が、その構成要件に対応するものではないことを意味するものではない。逆に、具体例が構成要件に対応するものとしてここに記載されていたとしても、そのことは、その具体例が、その構成要件以外の構成要件には対応しないものであることを意味するものでもない。
【００２１】
さらに、この記載は、発明の実施の形態に記載されている具体例に対応する発明が、請求項に全て記載されていることを意味するものではない。換言すれば、この記載は、発明の実施の形態に記載されている具体例に対応する発明であって、この出願の請求項には記載されていない発明の存在、すなわち、将来、分割出願されたり、補正により追加される発明の存在を否定するものではない。
【００２２】
即ち、本発明の画像処理装置は、動きボケの無い前景画像のうち、付加する動きの方向の輪郭部の端部の画素を検出する画素検出手段（例えば、図９７の輪郭近傍画素抽出部１１０１）と、画素検出手段により検出された端部の画素の画素値と、端部の画素の周辺に位置する前景画像内の周辺画素から外挿補間されることにより推定される推定画素値との比を検出し、端部の混合比とする混合比検出手段（例えば、図９７の空間混合比生成部１１０１）と、前景画像に対して所定の動きボケを付加した動きボケ付加前景画像を生成する動きボケ付加手段（例えば、図９７の動きボケ付加部１００５）と、動きボケ付加前景画像及び端部の混合比に基いて、前景画像を背景画像に合成する合成手段（例えば、図９７の画像合成部１０１０）とを備えることを特徴とする。
【００２３】
また、本発明の画像処理方法は、動きボケの無い前景画像のうち、付加する動きの方向の輪郭部の端部の画素を検出する画素検出ステップ（図１００のフローチャートのステップＳ９４１の処理）と、画素検出ステップの処理で検出された端部の画素の画素値と、端部の画素の周辺に位置する前景画像内の周辺画素から外挿補間されることにより推定される推定画素値との比を検出し、端部の混合比とする混合比検出ステップ（図１００のフローチャートのステップＳ９４３の処理）と、前景画像に対して所定の動きボケを付加した動きボケ付加前景画像を生成する動きボケ付加ステップ（図１００のフローチャートのステップＳ９４５の処理）と、動きボケ付加前景画像及び端部の混合比に基いて、前景画像を背景画像に合成する合成ステップ（図１００のフローチャートのステップＳ９４９の処理）とを含むことを特徴とする。
【００２４】
図６は、本発明を適用した画像処理部の一実施の形態の構成を示すブロック図である。
【００２５】
なお、画像処理部の各機能をハードウェアで実現するか、ソフトウェアで実現するかは問わない。つまり、本明細書の各ブロック図は、ハードウェアのブロック図と考えても、ソフトウェアによる機能ブロック図と考えても良い。
【００２６】
ここで、動きボケとは、撮像の対象となる、現実世界におけるオブジェクトの動きと、センサの撮像の特性とにより生じる、動いているオブジェクトに対応する画像に含まれている歪みをいう。
【００２７】
この明細書では、撮像の対象となる、現実世界におけるオブジェクトに対応する画像を、画像オブジェクトと称する。
【００２８】
画像処理部に供給された入力画像は、オブジェクト抽出部１０１、領域特定部１０３、混合比算出部１０４、および前景背景分離部１０５に供給される。
【００２９】
オブジェクト抽出部１０１は、入力画像に含まれる前景のオブジェクトに対応する画像オブジェクトを粗く抽出して、抽出した画像オブジェクトを動き検出部１０２に供給する。オブジェクト抽出部１０１は、例えば、入力画像に含まれる前景のオブジェクトに対応する画像オブジェクトの輪郭を検出することで、前景のオブジェクトに対応する画像オブジェクトを粗く抽出する。
【００３０】
オブジェクト抽出部１０１は、入力画像に含まれる背景のオブジェクトに対応する画像オブジェクトを粗く抽出して、抽出した画像オブジェクトを動き検出部１０２に供給する。オブジェクト抽出部１０１は、例えば、入力画像と、抽出された前景のオブジェクトに対応する画像オブジェクトとの差から、背景のオブジェクトに対応する画像オブジェクトを粗く抽出する。
【００３１】
また、例えば、オブジェクト抽出部１０１は、内部に設けられている背景メモリに記憶されている背景の画像と、入力画像との差から、前景のオブジェクトに対応する画像オブジェクト、および背景のオブジェクトに対応する画像オブジェクトを粗く抽出するようにしてもよい。
【００３２】
動き検出部１０２は、例えば、ブロックマッチング法、勾配法、位相相関法、およびペルリカーシブ法などの手法により、粗く抽出された前景のオブジェクトに対応する画像オブジェクトの動きベクトルを算出して、算出した動きベクトルおよび動きベクトルの位置情報（動きベクトルに対応する画素の位置を特定する情報）を領域特定部１０３、混合比算出部１０４、および動きボケ抽出部１０６に供給する。
【００３３】
動き検出部１０２が出力する動きベクトルには、動き量vに対応する情報が含まれている。
【００３４】
また、例えば、動き検出部１０２は、画像オブジェクトに画素を特定する画素位置情報と共に、画像オブジェクト毎の動きベクトルを動きボケ除去部１０６に出力するようにしてもよい。
【００３５】
動き量vは、動いているオブジェクトに対応する画像の位置の変化を画素間隔を単位として表す値である。例えば、前景に対応するオブジェクトの画像が、あるフレームを基準として次のフレームにおいて４画素分離れた位置に表示されるように移動しているとき、前景に対応するオブジェクトの画像の動き量vは、４とされる。
【００３６】
なお、オブジェクト抽出部１０１および動き検出部１０２は、動いているオブジェクトに対応した動きボケ量の調整を行う場合に必要となる。
【００３７】
領域特定部１０３は、入力された画像の画素のそれぞれを、前景領域、背景領域、または混合領域のいずれかに特定し、画素毎に前景領域、背景領域、または混合領域のいずれかに属するかを示す情報（以下、領域情報と称する）を混合比算出部１０４、前景背景分離部１０５、および動きボケ除去部１０６に供給する。
【００３８】
混合比算出部１０４は、入力画像、および領域特定部１０３から供給された領域情報を基に、混合領域６３に含まれる画素に対応する混合比（以下、混合比αと称する）を算出して、算出した混合比を前景背景分離部１０５に供給する。
【００３９】
混合比αは、後述する式（３）に示されるように、画素値における、背景のオブジェクトに対応する画像の成分（以下、背景の成分とも称する）の割合を示す値である。
【００４０】
前景背景分離部１０５は、領域特定部１０３から供給された領域情報、および混合比算出部１０４から供給された混合比αを基に、前景のオブジェクトに対応する画像の成分（以下、前景の成分とも称する）のみから成る前景成分画像と、背景の成分のみから成る背景成分画像とに入力画像を分離して、前景成分画像を動きボケ除去部１０６および合成部１０７に供給する。従来の混合領域を考慮しないで前景と背景だけを特定し、分離していた方式に比べ正確な前景と背景を得ることが出来る。
【００４１】
動きボケ除去部１０６は、動きベクトルからわかる動き量vおよび領域情報を基に、前景成分画像に含まれる１以上の画素を示す処理単位を決定する。処理単位は、動きボケの量の調整の処理の対象となる１群の画素を指定するデータである。
【００４２】
動きボケ除去部１０６は、前景背景分離部１０５から供給された前景成分画像、動き検出部１０２から供給された動きベクトルおよびその位置情報、並びに処理単位を基に、前景成分画像に含まれる動きボケを除去して、動きボケを除去した前景成分画像を合成部１０７に出力する。動きベクトルとその位置情報は使わないこともある。
【００４３】
合成部１０７は、動き検出部１０２から供給された動きベクトルおよびその位置情報、並びに、領域特定部１０３より入力された領域情報に基づいて、動きボケ除去部１０６により動きボケが除去された前景成分画像を入力された任意の背景画像と合成し、合成画像を生成して出力する。
【００４４】
次に、図７乃至図２２を参照して、画像処理部に供給される入力画像について説明する。
【００４５】
図７は、センサによる撮像を説明する図である。センサは、例えば、固体撮像素子であるCCD（Charge-Coupled Device）エリアセンサを備えたCCDビデオカメラなどで構成される。現実世界における、前景に対応するオブジェクトは、現実世界における、背景に対応するオブジェクトと、センサとの間を、例えば、図中の左側から右側に水平に移動する。
【００４６】
センサは、前景に対応するオブジェクトを、背景に対応するオブジェクトと共に撮像する。センサは、撮像した画像を１フレーム単位で出力する。例えば、センサは、１秒間に３０フレームから成る画像を出力する。センサの露光時間は、１／３０秒とすることができる。露光時間は、センサが入力された光の電荷への変換を開始してから、入力された光の電荷への変換を終了するまでの期間である。以下、露光時間をシャッタ時間とも称する。
【００４７】
図８は、画素の配置を説明する図である。図８中において、Ａ乃至Ｉは、個々の画素を示す。画素は、画像に対応する平面上に配置されている。１つの画素に対応する１つの検出素子は、センサ上に配置されている。センサが画像を撮像するとき、１つの検出素子は、画像を構成する１つの画素に対応する画素値を出力する。例えば、検出素子のＸ方向の位置は、画像上の横方向の位置に対応し、検出素子のＹ方向の位置は、画像上の縦方向の位置に対応する。
【００４８】
図９に示すように、例えば、CCDである検出素子は、シャッタ時間に対応する期間、入力された光を電荷に変換して、変換された電荷を蓄積する。電荷の量は、入力された光の強さと、光が入力されている時間にほぼ比例する。検出素子は、シャッタ時間に対応する期間において、入力された光から変換された電荷を、既に蓄積されている電荷に加えていく。すなわち、検出素子は、シャッタ時間に対応する期間、入力される光を積分して、積分された光に対応する量の電荷を蓄積する。検出素子は、時間に対して、積分効果があるとも言える。
【００４９】
検出素子に蓄積された電荷は、図示せぬ回路により、電圧値に変換され、電圧値は更にデジタルデータなどの画素値に変換されて出力される。従って、センサから出力される個々の画素値は、前景または背景に対応するオブジェクトの空間的に広がりを有するある部分を、シャッタ時間について積分した結果である、１次元の空間に射影された値を有する。
【００５０】
画像処理部は、このようなセンサの蓄積の動作により、出力信号に埋もれてしまった有意な情報、例えば、混合比αを抽出する。画像処理部は、前景の画像オブジェクト自身が混ざり合うことによる生ずる歪みの量、例えば、動きボケの量などを調整する。また、画像処理部は、前景の画像オブジェクトと背景の画像オブジェクトとが混ざり合うことにより生ずる歪みの量を調整する。
【００５１】
図１０は、動いている前景に対応するオブジェクトと、静止している背景に対応するオブジェクトとを撮像して得られる画像を説明する図である。図１０Ａは、動きを伴う前景に対応するオブジェクトと、静止している背景に対応するオブジェクトとを撮像して得られる画像を示している。図１０Ａに示す例において、前景に対応するオブジェクトは、画面に対して水平に左から右に動いている。
【００５２】
図１０Ｂは、図１０Ａに示す画像の１つのラインに対応する画素値を時間方向に展開したモデル図である。図１０Ｂの横方向は、図１０Ａの空間方向Ｘに対応している。
【００５３】
背景領域の画素は、背景の成分、すなわち、背景のオブジェクトに対応する画像の成分のみから、その画素値が構成されている。前景領域の画素は、前景の成分、すなわち、前景のオブジェクトに対応する画像の成分のみから、その画素値が構成されている。
【００５４】
混合領域の画素は、背景の成分、および前景の成分から、その画素値が構成されている。混合領域は、背景の成分、および前景の成分から、その画素値が構成されているので、歪み領域ともいえる。混合領域は、更に、カバードバックグラウンド領域およびアンカバードバックグラウンド領域に分類される。
【００５５】
カバードバックグラウンド領域は、前景領域に対して、前景のオブジェクトの進行方向の前端部に対応する位置の混合領域であり、時間の経過に対応して背景成分が前景に覆い隠される領域をいう。
【００５６】
これに対して、アンカバードバックグラウンド領域は、前景領域に対して、前景のオブジェクトの進行方向の後端部に対応する位置の混合領域であり、時間の経過に対応して背景成分が現れる領域をいう。
【００５７】
このように、前景領域、背景領域、またはカバードバックグラウンド領域若しくはアンカバードバックグラウンド領域を含む画像が、領域特定部１０３、混合比算出部１０４、および前景背景分離部１０５に入力画像として入力される。
【００５８】
図１１は、以上のような、背景領域、前景領域、混合領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を説明する図である。図１１に示す画像に対応する場合、背景領域は、静止部分であり、前景領域は、動き部分であり、混合領域のカバードバックグラウンド領域は、背景から前景に変化する部分であり、混合領域のアンカバードバックグラウンド領域は、前景から背景に変化する部分である。
【００５９】
図１２は、静止している前景に対応するオブジェクトおよび静止している背景に対応するオブジェクトを撮像した画像における、隣接して１列に並んでいる画素の画素値を時間方向に展開したモデル図である。例えば、隣接して１列に並んでいる画素として、画面の１つのライン上に並んでいる画素を選択することができる。
【００６０】
図１２に示すF01乃至F04の画素値は、静止している前景のオブジェクトに対応する画素の画素値である。図１２に示すB01乃至B04の画素値は、静止している背景のオブジェクトに対応する画素の画素値である。
【００６１】
図１２における縦方向は、図中の上から下に向かって時間が経過する。図１２中の矩形の上辺の位置は、センサが入力された光の電荷への変換を開始する時刻に対応し、図１２中の矩形の下辺の位置は、センサが入力された光の電荷への変換を終了する時刻に対応する。すなわち、図１２中の矩形の上辺から下辺までの距離は、シャッタ時間に対応する。
【００６２】
以下において、シャッタ時間とフレーム間隔とが同一である場合を例に説明する。
【００６３】
図１２における横方向は、図１０で説明した空間方向Xに対応する。より具体的には、図１２に示す例において、図１２中の”F01”と記載された矩形の左辺から”B04”と記載された矩形の右辺までの距離は、画素のピッチの８倍、すなわち、連続している８つの画素の間隔に対応する。
【００６４】
前景のオブジェクトおよび背景のオブジェクトが静止している場合、シャッタ時間に対応する期間において、センサに入力される光は変化しない。
【００６５】
ここで、シャッタ時間に対応する期間を２つ以上の同じ長さの期間に分割する。例えば、仮想分割数を４とすると、図１２に示すモデル図は、図１３に示すモデルとして表すことができる。仮想分割数は、前景に対応するオブジェクトのシャッタ時間内での動き量vなどに対応して設定される。例えば、４である動き量vに対応して、仮想分割数は、４とされ、シャッタ時間に対応する期間は４つに分割される。
【００６６】
図中の最も上の行は、シャッタが開いて最初の、分割された期間に対応する。図中の上から２番目の行は、シャッタが開いて２番目の、分割された期間に対応する。図中の上から３番目の行は、シャッタが開いて３番目の、分割された期間に対応する。図中の上から４番目の行は、シャッタが開いて４番目の、分割された期間に対応する。
【００６７】
以下、動き量vに対応して分割されたシャッタ時間をシャッタ時間/vとも称する。
【００６８】
前景に対応するオブジェクトが静止しているとき、センサに入力される光は変化しないので、前景の成分F01/vは、画素値F01を仮想分割数で除した値に等しい。同様に、前景に対応するオブジェクトが静止しているとき、前景の成分F02/vは、画素値F02を仮想分割数で除した値に等しく、前景の成分F03/vは、画素値F03を仮想分割数で除した値に等しく、前景の成分F04/vは、画素値F04を仮想分割数で除した値に等しい。
【００６９】
背景に対応するオブジェクトが静止しているとき、センサに入力される光は変化しないので、背景の成分B01/vは、画素値B01を仮想分割数で除した値に等しい。同様に、背景に対応するオブジェクトが静止しているとき、背景の成分B02/vは、画素値B02を仮想分割数で除した値に等しく、B03/vは、画素値B03を仮想分割数で除した値に等しく、B04/vは、画素値B04を仮想分割数で除した値に等しい。
【００７０】
すなわち、前景に対応するオブジェクトが静止している場合、シャッタ時間に対応する期間において、センサに入力される前景のオブジェクトに対応する光が変化しないので、シャッタが開いて最初の、シャッタ時間/vに対応する前景の成分F01/vと、シャッタが開いて２番目の、シャッタ時間/vに対応する前景の成分F01/vと、シャッタが開いて３番目の、シャッタ時間/vに対応する前景の成分F01/vと、シャッタが開いて４番目の、シャッタ時間/vに対応する前景の成分F01/vとは、同じ値となる。F02/v乃至F04/vも、F01/vと同様の関係を有する。
【００７１】
背景に対応するオブジェクトが静止している場合、シャッタ時間に対応する期間において、センサに入力される背景のオブジェクトに対応する光は変化しないので、シャッタが開いて最初の、シャッタ時間/vに対応する背景の成分B01/vと、シャッタが開いて２番目の、シャッタ時間/vに対応する背景の成分B01/vと、シャッタが開いて３番目の、シャッタ時間/vに対応する背景の成分B01/vと、シャッタが開いて４番目の、シャッタ時間/vに対応する背景の成分B01/vとは、同じ値となる。B02/v乃至B04/vも、同様の関係を有する。
【００７２】
次に、前景に対応するオブジェクトが移動し、背景に対応するオブジェクトが静止している場合について説明する。
【００７３】
図１４は、前景に対応するオブジェクトが図中の右側に向かって移動する場合の、カバードバックグラウンド領域を含む、１つのライン上の画素の画素値を時間方向に展開したモデル図である。図１４において、前景の動き量vは、４である。１フレームは短い時間なので、前景に対応するオブジェクトが剛体であり、等速で移動していると仮定することができる。図１４において、前景に対応するオブジェクトの画像は、あるフレームを基準として次のフレームにおいて４画素分右側に表示されるように移動する。
【００７４】
図１４において、最も左側の画素乃至左から４番目の画素は、前景領域に属する。図１４において、左から５番目乃至左から７番目の画素は、カバードバックグラウンド領域である混合領域に属する。図１４において、最も右側の画素は、背景領域に属する。
【００７５】
前景に対応するオブジェクトが時間の経過と共に背景に対応するオブジェクトを覆い隠すように移動しているので、カバードバックグラウンド領域に属する画素の画素値に含まれる成分は、シャッタ時間に対応する期間のある時点で、背景の成分から、前景の成分に替わる。
【００７６】
例えば、図１４中に太線枠を付した画素値Mは、式（１）で表される。
【００７７】
M=B02/v+B02/v+F07/v+F06/v ・・・（１）
【００７８】
例えば、左から５番目の画素は、１つのシャッタ時間/vに対応する背景の成分を含み、３つのシャッタ時間/vに対応する前景の成分を含むので、左から５番目の画素の混合比αは、1/4である。左から６番目の画素は、２つのシャッタ時間/vに対応する背景の成分を含み、２つのシャッタ時間/vに対応する前景の成分を含むので、左から６番目の画素の混合比αは、1/2である。左から７番目の画素は、３つのシャッタ時間/vに対応する背景の成分を含み、１つのシャッタ時間/vに対応する前景の成分を含むので、左から７番目の画素の混合比αは、3/4である。
【００７９】
前景に対応するオブジェクトが、剛体であり、前景の画像が次のフレームにおいて４画素右側に表示されるように等速で移動すると仮定できるので、例えば、図１４中の左から４番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F07/vは、図１４中の左から５番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F07/vは、図１４中の左から６番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１４中の左から７番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【００８０】
前景に対応するオブジェクトが、剛体であり、前景の画像が次のフレームにおいて４画素右側に表示されるように等速で移動すると仮定できるので、例えば、図１４中の左から３番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分F06/vは、図１４中の左から４番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F06/vは、図１４中の左から５番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１４中の左から６番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【００８１】
前景に対応するオブジェクトが、剛体であり、前景の画像が次のフレームにおいて４画素右側に表示されるように等速で移動すると仮定できるので、例えば、図１４中の左から２番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分F05/vは、図１４中の左から３番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F05/vは、図１４中の左から４番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１４中の左から５番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【００８２】
前景に対応するオブジェクトが、剛体であり、前景の画像が次のフレームにおいて４画素右側に表示されるように等速で移動すると仮定できるので、例えば、図１４中の最も左側の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分F04/vは、図１４中の左から２番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F04/vは、図１４中の左から３番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１４中の左から４番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【００８３】
動いているオブジェクトに対応する前景の領域は、このように動きボケを含むので、歪み領域とも言える。
【００８４】
図１５は、前景が図中の右側に向かって移動する場合の、アンカバードバックグラウンド領域を含む、１つのライン上の画素の画素値を時間方向に展開したモデル図である。図１５において、前景の動き量vは、４である。１フレームは短い時間なので、前景に対応するオブジェクトが剛体であり、等速で移動していると仮定することができる。図１５において、前景に対応するオブジェクトの画像は、あるフレームを基準として次のフレームにおいて４画素分右側に移動する。
【００８５】
図１５において、最も左側の画素乃至左から４番目の画素は、背景領域に属する。図１５において、左から５番目乃至左から７番目の画素は、アンカバードバックグラウンドである混合領域に属する。図１５において、最も右側の画素は、前景領域に属する。
【００８６】
背景に対応するオブジェクトを覆っていた前景に対応するオブジェクトが時間の経過と共に背景に対応するオブジェクトの前から取り除かれるように移動しているので、アンカバードバックグラウンド領域に属する画素の画素値に含まれる成分は、シャッタ時間に対応する期間のある時点で、前景の成分から、背景の成分に替わる。
【００８７】
例えば、図１５中に太線枠を付した画素値M'は、式（２）で表される。
【００８８】
M'=F02/v+F01/v+B26/v+B26/v ・・・（２）
【００８９】
例えば、左から５番目の画素は、３つのシャッタ時間/vに対応する背景の成分を含み、１つのシャッタ時間/vに対応する前景の成分を含むので、左から５番目の画素の混合比αは、3/4である。左から６番目の画素は、２つのシャッタ時間/vに対応する背景の成分を含み、２つのシャッタ時間/vに対応する前景の成分を含むので、左から６番目の画素の混合比αは、1/2である。左から７番目の画素は、１つのシャッタ時間/vに対応する背景の成分を含み、３つのシャッタ時間/vに対応する前景の成分を含むので、左から７番目の画素の混合比αは、1/4である。
【００９０】
式（１）および式（２）をより一般化すると、画素値Mは、式（３）で表される。
【００９１】
【数１】

・・・（３）
ここで、αは、混合比である。Ｂは、背景の画素値であり、Fi/vは、前景の成分である。
【００９２】
前景に対応するオブジェクトが剛体であり、等速で動くと仮定でき、かつ、動き量vが４であるので、例えば、図１５中の左から５番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F01/vは、図１５中の左から６番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、F01/vは、図１５中の左から７番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分と、図１５中の左から８番目の画素の、シャッタが開いて４番目のシャッタ時間/vに対応する前景の成分とに、それぞれ等しい。
【００９３】
前景に対応するオブジェクトが剛体であり、等速で動くと仮定でき、かつ、仮想分割数が４であるので、例えば、図１５中の左から６番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F02/vは、図１５中の左から７番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。同様に、前景の成分F02/vは、図１５中の左から８番目の画素の、シャッタが開いて３番目のシャッタ時間/vに対応する前景の成分に等しい。
【００９４】
前景に対応するオブジェクトが剛体であり、等速で動くと仮定でき、かつ、動き量vが４であるので、例えば、図１５中の左から７番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分F03/vは、図１５中の左から８番目の画素の、シャッタが開いて２番目のシャッタ時間/vに対応する前景の成分に等しい。
【００９５】
図１３乃至図１５の説明において、仮想分割数は、４であるとして説明したが、仮想分割数は、動き量vに対応する。動き量vは、一般に、前景に対応するオブジェクトの移動速度に対応する。例えば、前景に対応するオブジェクトが、あるフレームを基準として次のフレームにおいて４画素分右側に表示されるように移動しているとき、動き量vは、４とされる。動き量vに対応し、仮想分割数は、４とされる。同様に、例えば、前景に対応するオブジェクトが、あるフレームを基準として次のフレームにおいて６画素分左側に表示されるように移動しているとき、動き量vは、６とされ、仮想分割数は、６とされる。
【００９６】
図１６および図１７に、以上で説明した、前景領域、背景領域、カバードバックグラウンド領域若しくはアンカバードバックグラウンド領域から成る混合領域と、分割されたシャッタ時間に対応する前景の成分および背景の成分との関係を示す。
【００９７】
図１６は、静止している背景の前を移動しているオブジェクトに対応する前景を含む画像から、前景領域、背景領域、および混合領域の画素を抽出した例を示す。図１６に示す例において、前景に対応するオブジェクトは、画面に対して水平に移動している。
【００９８】
フレーム#n+1は、フレーム#nの次のフレームであり、フレーム#n+2は、フレーム#n+1の次のフレームである。
【００９９】
フレーム#n乃至フレーム#n+2のいずれかから抽出した、前景領域、背景領域、および混合領域の画素を抽出して、動き量vを４として、抽出された画素の画素値を時間方向に展開したモデルを図１７に示す。
【０１００】
前景領域の画素値は、前景に対応するオブジェクトが移動するので、シャッタ時間/vの期間に対応する、４つの異なる前景の成分から構成される。例えば、図１７に示す前景領域の画素のうち最も左側に位置する画素は、F01/v,F02/v,F03/v、およびF04/vから構成される。すなわち、前景領域の画素は、動きボケを含んでいる。
【０１０１】
背景に対応するオブジェクトが静止しているので、シャッタ時間に対応する期間において、センサに入力される背景に対応する光は変化しない。この場合、背景領域の画素値は、動きボケを含まない。
【０１０２】
カバードバックグラウンド領域若しくはアンカバードバックグラウンド領域から成る混合領域に属する画素の画素値は、前景の成分と、背景の成分とから構成される。
【０１０３】
次に、オブジェクトに対応する画像が動いているとき、複数のフレームにおける、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデルについて説明する。例えば、オブジェクトに対応する画像が画面に対して水平に動いているとき、隣接して１列に並んでいる画素として、画面の１つのライン上に並んでいる画素を選択することができる。
【０１０４】
図１８は、静止している背景に対応するオブジェクトを撮像した画像の３つのフレームの、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデル図である。フレーム#nは、フレーム#n-1の次のフレームであり、フレーム#n+1は、フレーム#nの次のフレームである。他のフレームも同様に称する。
【０１０５】
図１８に示すB01乃至B12の画素値は、静止している背景のオブジェクトに対応する画素の画素値である。背景に対応するオブジェクトが静止しているので、フレーム#n-1乃至フレームn+1において、対応する画素の画素値は、変化しない。例えば、フレーム#n-1におけるB05の画素値を有する画素の位置に対応する、フレーム#nにおける画素、およびフレーム#n+1における画素は、それぞれ、B05の画素値を有する。
【０１０６】
図１９は、静止している背景に対応するオブジェクトと共に図中の右側に移動する前景に対応するオブジェクトを撮像した画像の３つのフレームの、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデル図である。図１９に示すモデルは、カバードバックグラウンド領域を含む。
【０１０７】
図１９において、前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、前景の動き量vは、４であり、仮想分割数は、４である。
【０１０８】
例えば、図１９中のフレーム#n-1の最も左側の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F12/vとなり、図１９中の左から２番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F12/vとなる。図１９中の左から３番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図１９中の左から４番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F12/vとなる。
【０１０９】
図１９中のフレーム#n-1の最も左側の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分は、F11/vとなり、図１９中の左から２番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分も、F11/vとなる。図１９中の左から３番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F11/vとなる。
【０１１０】
図１９中のフレーム#n-1の最も左側の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分は、F10/vとなり、図１９中の左から２番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分も、F10/vとなる。図１９中のフレーム#n-1の最も左側の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F09/vとなる。
【０１１１】
背景に対応するオブジェクトが静止しているので、図１９中のフレーム#n-1の左から２番目の画素の、シャッタが開いて最初のシャッタ時間/vの背景の成分は、B01/vとなる。図１９中のフレーム#n-1の左から３番目の画素の、シャッタが開いて最初および２番目のシャッタ時間/vの背景の成分は、B02/vとなる。図１９中のフレーム#n-1の左から４番目の画素の、シャッタが開いて最初乃至３番目のシャッタ時間/vの背景の成分は、B03/vとなる。
【０１１２】
図１９中のフレーム#n-1において、最も左側の画素は、前景領域に属し、左側から２番目乃至４番目の画素は、カバードバックグラウンド領域である混合領域に属する。
【０１１３】
図１９中のフレーム#n-1の左から５番目の画素乃至１２番目の画素は、背景領域に属し、その画素値は、それぞれ、B04乃至B11となる。
【０１１４】
図１９中のフレーム#nの左から１番目の画素乃至５番目の画素は、前景領域に属する。フレーム#nの前景領域における、シャッタ時間/vの前景の成分は、F05/v乃至F12/vのいずれかである。
【０１１５】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図１９中のフレーム#nの左から５番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F12/vとなり、図１９中の左から６番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F12/vとなる。図１９中の左から７番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図１９中の左から８番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F12/vとなる。
【０１１６】
図１９中のフレーム#nの左から５番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分は、F11/vとなり、図１９中の左から６番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分も、F11/vとなる。図１９中の左から７番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F11/vとなる。
【０１１７】
図１９中のフレーム#nの左から５番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分は、F10/vとなり、図１９中の左から６番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分も、F10/vとなる。図１９中のフレーム#nの左から５番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F09/vとなる。
【０１１８】
背景に対応するオブジェクトが静止しているので、図１９中のフレーム#nの左から６番目の画素の、シャッタが開いて最初のシャッタ時間/vの背景の成分は、B05/vとなる。図１９中のフレーム#nの左から７番目の画素の、シャッタが開いて最初および２番目のシャッタ時間/vの背景の成分は、B06/vとなる。図１９中のフレーム#nの左から８番目の画素の、シャッタが開いて最初乃至３番目の、シャッタ時間/vの背景の成分は、B07/vとなる。
【０１１９】
図１９中のフレーム#nにおいて、左側から６番目乃至８番目の画素は、カバードバックグラウンド領域である混合領域に属する。
【０１２０】
図１９中のフレーム#nの左から９番目の画素乃至１２番目の画素は、背景領域に属し、画素値は、それぞれ、B08乃至B11となる。
【０１２１】
図１９中のフレーム#n+1の左から１番目の画素乃至９番目の画素は、前景領域に属する。フレーム#n+1の前景領域における、シャッタ時間/vの前景の成分は、F01/v乃至F12/vのいずれかである。
【０１２２】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図１９中のフレーム#n+1の左から９番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F12/vとなり、図１９中の左から１０番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F12/vとなる。図１９中の左から１１番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図１９中の左から１２番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F12/vとなる。
【０１２３】
図１９中のフレーム#n+1の左から９番目の画素の、シャッタが開いて２番目のシャッタ時間/vの期間の前景の成分は、F11/vとなり、図１９中の左から１０番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分も、F11/vとなる。図１９中の左から１１番目の画素の、シャッタが開いて４番目の、シャッタ時間/vの前景の成分は、F11/vとなる。
【０１２４】
図１９中のフレーム#n+1の左から９番目の画素の、シャッタが開いて３番目の、シャッタ時間/vの前景の成分は、F10/vとなり、図１９中の左から１０番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分も、F10/vとなる。図１９中のフレーム#n+1の左から９番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F09/vとなる。
【０１２５】
背景に対応するオブジェクトが静止しているので、図１９中のフレーム#n+1の左から１０番目の画素の、シャッタが開いて最初のシャッタ時間/vの背景の成分は、B09/vとなる。図１９中のフレーム#n+1の左から１１番目の画素の、シャッタが開いて最初および２番目のシャッタ時間/vの背景の成分は、B10/vとなる。図１９中のフレーム#n+1の左から１２番目の画素の、シャッタが開いて最初乃至３番目の、シャッタ時間/vの背景の成分は、B11/vとなる。
【０１２６】
図１９中のフレーム#n+1において、左側から１０番目乃至１２番目の画素は、カバードバックグラウンド領域である混合領域に対応する。
【０１２７】
図２０は、図１９に示す画素値から前景の成分を抽出した画像のモデル図である。
【０１２８】
図２１は、静止している背景と共に図中の右側に移動するオブジェクトに対応する前景を撮像した画像の３つのフレームの、隣接して１列に並んでいる画素であって、フレーム上で同一の位置の画素の画素値を時間方向に展開したモデル図である。図２１において、アンカバードバックグラウンド領域が含まれている。
【０１２９】
図２１において、前景に対応するオブジェクトは、剛体であり、かつ等速で移動していると仮定できる。前景に対応するオブジェクトが、次のフレームにおいて４画素分右側に表示されるように移動しているので、動き量vは、４である。
【０１３０】
例えば、図２１中のフレーム#n-1の最も左側の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分は、F13/vとなり、図２１中の左から２番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F13/vとなる。図２１中の左から３番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２１中の左から４番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F13/vとなる。
【０１３１】
図２１中のフレーム#n-1の左から２番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F14/vとなり、図２１中の左から３番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F14/vとなる。図２１中の左から３番目の画素の、シャッタが開いて最初の、シャッタ時間/vの前景の成分は、F15/vとなる。
【０１３２】
背景に対応するオブジェクトが静止しているので、図２１中のフレーム#n-1の最も左側の画素の、シャッタが開いて２番目乃至４番目の、シャッタ時間/vの背景の成分は、B25/vとなる。図２１中のフレーム#n-1の左から２番目の画素の、シャッタが開いて３番目および４番目の、シャッタ時間/vの背景の成分は、B26/vとなる。図２１中のフレーム#n-1の左から３番目の画素の、シャッタが開いて４番目のシャッタ時間/vの背景の成分は、B27/vとなる。
【０１３３】
図２１中のフレーム#n-1において、最も左側の画素乃至３番目の画素は、アンカバードバックグラウンド領域である混合領域に属する。
【０１３４】
図２１中のフレーム#n-1の左から４番目の画素乃至１２番目の画素は、前景領域に属する。フレームの前景の成分は、F13/v乃至F24/vのいずれかである。
【０１３５】
図２１中のフレーム#nの最も左側の画素乃至左から４番目の画素は、背景領域に属し、画素値は、それぞれ、B25乃至B28となる。
【０１３６】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図２１中のフレーム#nの左から５番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F13/vとなり、図２１中の左から６番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F13/vとなる。図２１中の左から７番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２１中の左から８番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F13/vとなる。
【０１３７】
図２１中のフレーム#nの左から６番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F14/vとなり、図２１中の左から７番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F14/vとなる。図２１中の左から８番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F15/vとなる。
【０１３８】
背景に対応するオブジェクトが静止しているので、図２１中のフレーム#nの左から５番目の画素の、シャッタが開いて２番目乃至４番目のシャッタ時間/vの背景の成分は、B29/vとなる。図２１中のフレーム#nの左から６番目の画素の、シャッタが開いて３番目および４番目のシャッタ時間/vの背景の成分は、B30/vとなる。図２１中のフレーム#nの左から７番目の画素の、シャッタが開いて４番目のシャッタ時間/vの背景の成分は、B31/vとなる。
【０１３９】
図２１中のフレーム#nにおいて、左から５番目の画素乃至７番目の画素は、アンカバードバックグラウンド領域である混合領域に属する。
【０１４０】
図２１中のフレーム#nの左から８番目の画素乃至１２番目の画素は、前景領域に属する。フレーム#nの前景領域における、シャッタ時間/vの期間に対応する値は、F13/v乃至F20/vのいずれかである。
【０１４１】
図２１中のフレーム#n+1の最も左側の画素乃至左から８番目の画素は、背景領域に属し、画素値は、それぞれ、B25乃至B32となる。
【０１４２】
前景に対応するオブジェクトが、剛体であり、等速で移動すると仮定でき、前景の画像が次のフレームにおいて４画素右側に表示されるように移動するので、図２１中のフレーム#n+1の左から９番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F13/vとなり、図２１中の左から１０番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F13/vとなる。図２１中の左から１１番目の画素の、シャッタが開いて３番目のシャッタ時間/vの前景の成分、および図２１中の左から１２番目の画素の、シャッタが開いて４番目のシャッタ時間/vの前景の成分は、F13/vとなる。
【０１４３】
図２１中のフレーム#n+1の左から１０番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F14/vとなり、図２１中の左から１１番目の画素の、シャッタが開いて２番目のシャッタ時間/vの前景の成分も、F14/vとなる。図２１中の左から１２番目の画素の、シャッタが開いて最初のシャッタ時間/vの前景の成分は、F15/vとなる。
【０１４４】
背景に対応するオブジェクトが静止しているので、図２１中のフレーム#n+1の左から９番目の画素の、シャッタが開いて２番目乃至４番目の、シャッタ時間/vの背景の成分は、B33/vとなる。図２１中のフレーム#n+1の左から１０番目の画素の、シャッタが開いて３番目および４番目のシャッタ時間/vの背景の成分は、B34/vとなる。図２１中のフレーム#n+1の左から１１番目の画素の、シャッタが開いて４番目のシャッタ時間/vの背景の成分は、B35/vとなる。
【０１４５】
図２１中のフレーム#n+1において、左から９番目の画素乃至１１番目の画素は、アンカバードバックグラウンド領域である混合領域に属する。
【０１４６】
図２１中のフレーム#n+1の左から１２番目の画素は、前景領域に属する。フレーム#n+1の前景領域における、シャッタ時間/vの前景の成分は、F13/v乃至F16/vのいずれかである。
【０１４７】
図２２は、図２１に示す画素値から前景の成分を抽出した画像のモデル図である。
【０１４８】
図６に戻り、領域特定部１０３は、複数のフレームの画素値を用いて、前景領域、背景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域に属することを示すフラグを画素毎に対応付けて、領域情報として、混合比算出部１０４および動きボケ除去部１０６に供給する。
【０１４９】
混合比算出部１０４は、複数のフレームの画素値、および領域情報を基に、混合領域に含まれる画素について画素毎に混合比αを算出し、算出した混合比αを前景背景分離部１０５に供給する。
【０１５０】
前景背景分離部１０５は、複数のフレームの画素値、領域情報、および混合比αを基に、前景の成分のみからなる前景成分画像を抽出して、動きボケ除去部１０６に供給する。
【０１５１】
動きボケ除去部１０６は、前景背景分離部１０５から供給された前景成分画像、動き検出部１０２から供給された動きベクトル、および領域特定部１０３から供給された領域情報を基に、前景成分画像に含まれる動きボケを除去して、動きボケを除去した前景成分画像を出力する。
【０１５２】
図２３のフローチャートを参照して、画像処理部による合成画像生成の処理を説明する。ステップＳ１１において、領域特定部１０３は、入力画像を基に、入力画像の画素毎に前景領域、背景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域のいずれかに属するかを示す領域情報を生成する領域特定の処理を実行する。領域特定の処理の詳細は、後述する。領域特定部１０３は、生成した領域情報を混合比算出部１０４に供給する。
【０１５３】
なお、ステップＳ１１において、領域特定部１０３は、入力画像を基に、入力画像の画素毎に前景領域、背景領域、または混合領域（カバードバックグラウンド領域、またはアンカバードバックグラウンド領域の区別をしない）のいずれかに属するかを示す領域情報を生成するようにしてもよい。この場合において、前景背景分離部１０５および動きボケ除去部１０６は、動きベクトルの方向を基に、混合領域がカバードバックグラウンド領域であるか、またはアンカバードバックグラウンド領域であるかを判定する。例えば、動きベクトルの方向に対応して、前景領域、混合領域、および背景領域と順に並んでいるとき、その混合領域は、カバードバックグラウンド領域と判定され、動きベクトルの方向に対応して、背景領域、混合領域、および前景領域と順に並んでいるとき、その混合領域は、アンカバードバックグラウンド領域と判定される。
【０１５４】
ステップＳ１２において、混合比算出部１０４は、入力画像および領域情報を基に、混合領域に含まれる画素毎に、混合比αを算出する。混合比算出の処理の詳細は、後述する。混合比算出部１０４は、算出した混合比αを前景背景分離部１０５に供給する。
【０１５５】
ステップＳ１３において、前景背景分離部１０５は、領域情報、および混合比αを基に、入力画像から前景の成分を抽出して、前景成分画像として動きボケ除去部１０６に供給する。
【０１５６】
ステップＳ１４において、動きボケ除去部１０６は、動きベクトルおよび領域情報を基に、動き方向に並ぶ連続した画素であって、アンカバードバックグラウンド領域、前景領域、およびカバードバックグラウンド領域のいずれかに属するものの画像上の位置を示す処理単位を生成し、処理単位に対応する前景成分に含まれる動きボケの量を除去する。動きボケの除去の処理の詳細については、後述する。
【０１５７】
ステップＳ１５において、画像処理部は、画面全体について処理を終了したか否かを判定し、画面全体について処理を終了していないと判定された場合、ステップＳ１４に進み、処理単位に対応する前景の成分を対象とした動きボケの除去の処理を繰り返す。
【０１５８】
ステップＳ１５において、画面全体について処理を終了したと判定された場合、ステップＳ１６において、合成部１０７は、前景成分画像と任意の背景画像を合成し、合成画像を生成して出力する。尚、合成画像の生成処理については、詳細を後述する。
【０１５９】
このように、画像処理部は、前景と背景を分離して、前景に含まれる動きボケを除去し、任意の背景画像と合成することができる。
【０１６０】
以下、領域特定部１０３、混合比算出部１０４、前景背景分離部１０５、および動きボケ除去部１０６のそれぞれの構成について説明する。
【０１６１】
図２４は、領域特定部１０３の構成を示すブロック図である。図２４に構成を示す領域特定部１０３は、動きベクトルを利用しない。フレームメモリ２０１は、入力された画像をフレーム単位で記憶する。フレームメモリ２０１は、処理の対象がフレーム#nであるとき、フレーム#nの２つ前のフレームであるフレーム#n-2、フレーム#nの１つ前のフレームであるフレーム#n-1、フレーム#n、フレーム#nの１つ後のフレームであるフレーム#n+1、およびフレーム#nの２つ後のフレームであるフレーム#n+2を記憶する。
【０１６２】
静動判定部２０２−１は、フレーム#nの領域特定の対象である画素の画像上の位置と同一の位置にあるフレーム#n+2の画素の画素値、およびフレーム#nの領域特定の対象である画素の画像上の位置と同一の位置にあるフレーム#n+1の画素の画素値をフレームメモリ２０１から読み出して、読み出した画素値の差の絶対値を算出する。静動判定部２０２−１は、フレーム#n+2の画素値とフレーム#n+1の画素値との差の絶対値が、予め設定している閾値Thより大きいか否かを判定し、差の絶対値が閾値Thより大きいと判定された場合、動きを示す静動判定を領域判定部２０３−１に供給する。フレーム#n+2の画素の画素値とフレーム#n+1の画素の画素値との差の絶対値が閾値Th以下であると判定された場合、静動判定部２０２−１は、静止を示す静動判定を領域判定部２０３−１に供給する。
【０１６３】
静動判定部２０２−２は、フレーム#nの領域特定の対象である画素の画像上の位置と同一の位置にあるフレーム#n+1の画素の画素値、およびフレーム#nの対象となる画素の画素値をフレームメモリ２０１から読み出して、画素値の差の絶対値を算出する。静動判定部２０２−２は、フレーム#n+1の画素値とフレーム#nの画素値との差の絶対値が、予め設定している閾値Thより大きいか否かを判定し、画素値の差の絶対値が、閾値Thより大きいと判定された場合、動きを示す静動判定を領域判定部２０３−１および領域判定部２０３−２に供給する。フレーム#n+1の画素の画素値とフレーム#nの画素の画素値との差の絶対値が、閾値Th以下であると判定された場合、静動判定部２０２−２は、静止を示す静動判定を領域判定部２０３−１および領域判定部２０３−２に供給する。
【０１６４】
静動判定部２０２−３は、フレーム#nの領域特定の対象である画素の画素値、およびフレーム#nの領域特定の対象である画素の画像上の位置と同一の位置にあるフレーム#n-1の画素の画素値をフレームメモリ２０１から読み出して、画素値の差の絶対値を算出する。静動判定部２０２−３は、フレーム#nの画素値とフレーム#n-1の画素値との差の絶対値が、予め設定している閾値Thより大きいか否かを判定し、画素値の差の絶対値が、閾値Thより大きいと判定された場合、動きを示す静動判定を領域判定部２０３−２および領域判定部２０３−３に供給する。フレーム#nの画素の画素値とフレーム#n-1の画素の画素値との差の絶対値が、閾値Th以下であると判定された場合、静動判定部２０２−３は、静止を示す静動判定を領域判定部２０３−２および領域判定部２０３−３に供給する。
【０１６５】
静動判定部２０２−４は、フレーム#nの領域特定の対象である画素の画像上の位置と同一の位置にあるフレーム#n-1の画素の画素値、およびフレーム#nの領域特定の対象である画素の画像上の位置と同一の位置にあるフレーム#n-2の画素の画素値をフレームメモリ２０１から読み出して、画素値の差の絶対値を算出する。静動判定部２０２−４は、フレーム#n-1の画素値とフレーム#n-2の画素値との差の絶対値が、予め設定している閾値Thより大きいか否かを判定し、画素値の差の絶対値が、閾値Thより大きいと判定された場合、動きを示す静動判定を領域判定部２０３−３に供給する。フレーム#n-1の画素の画素値とフレーム#n-2の画素の画素値との差の絶対値が、閾値Th以下であると判定された場合、静動判定部２０２−４は、静止を示す静動判定を領域判定部２０３−３に供給する。
【０１６６】
領域判定部２０３−１は、静動判定部２０２−１から供給された静動判定が静止を示し、かつ、静動判定部２０２−２から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素がアンカバードバックグラウンド領域に属すると判定し、領域の判定される画素に対応するアンカバードバックグラウンド領域判定フラグに、アンカバードバックグラウンド領域に属することを示す”１”を設定する。
【０１６７】
領域判定部２０３−１は、静動判定部２０２−１から供給された静動判定が動きを示すか、または、静動判定部２０２−２から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素がアンカバードバックグラウンド領域に属しないと判定し、領域の判定される画素に対応するアンカバードバックグラウンド領域判定フラグに、アンカバードバックグラウンド領域に属しないことを示す”０”を設定する。
【０１６８】
領域判定部２０３−１は、このように”１”または”０”が設定されたアンカバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０１６９】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が静止を示し、かつ、静動判定部２０２−３から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素が静止領域に属すると判定し、領域の判定される画素に対応する静止領域判定フラグに、静止領域に属することを示す”１”を設定する。
【０１７０】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が動きを示すか、または、静動判定部２０２−３から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素が静止領域に属しないと判定し、領域の判定される画素に対応する静止領域判定フラグに、静止領域に属しないことを示す”０”を設定する。
【０１７１】
領域判定部２０３−２は、このように”１”または”０”が設定された静止領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０１７２】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が動きを示し、かつ、静動判定部２０２−３から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素が動き領域に属すると判定し、領域の判定される画素に対応する動き領域判定フラグに、動き領域に属することを示す”１”を設定する。
【０１７３】
領域判定部２０３−２は、静動判定部２０２−２から供給された静動判定が静止を示すか、または、静動判定部２０２−３から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素が動き領域に属しないと判定し、領域の判定される画素に対応する動き領域判定フラグに、動き領域に属しないことを示す”０”を設定する。
【０１７４】
領域判定部２０３−２は、このように”１”または”０”が設定された動き領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０１７５】
領域判定部２０３−３は、静動判定部２０２−３から供給された静動判定が動きを示し、かつ、静動判定部２０２−４から供給された静動判定が静止を示しているとき、フレーム#nにおける領域特定の対象である画素がカバードバックグラウンド領域に属すると判定し、領域の判定される画素に対応するカバードバックグラウンド領域判定フラグに、カバードバックグラウンド領域に属することを示す”１”を設定する。
【０１７６】
領域判定部２０３−３は、静動判定部２０２−３から供給された静動判定が静止を示すか、または、静動判定部２０２−４から供給された静動判定が動きを示しているとき、フレーム#nにおける領域特定の対象である画素がカバードバックグラウンド領域に属しないと判定し、領域の判定される画素に対応するカバードバックグラウンド領域判定フラグに、カバードバックグラウンド領域に属しないことを示す”０”を設定する。
【０１７７】
領域判定部２０３−３は、このように”１”または”０”が設定されたカバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給する。
【０１７８】
判定フラグ格納フレームメモリ２０４は、領域判定部２０３−１から供給されたアンカバードバックグラウンド領域判定フラグ、領域判定部２０３−２から供給された静止領域判定フラグ、領域判定部２０３−２から供給された動き領域判定フラグ、および領域判定部２０３−３から供給されたカバードバックグラウンド領域判定フラグをそれぞれ記憶する。
【０１７９】
判定フラグ格納フレームメモリ２０４は、記憶しているアンカバードバックグラウンド領域判定フラグ、静止領域判定フラグ、動き領域判定フラグ、およびカバードバックグラウンド領域判定フラグを合成部２０５に供給する。合成部２０５は、判定フラグ格納フレームメモリ２０４から供給された、アンカバードバックグラウンド領域判定フラグ、静止領域判定フラグ、動き領域判定フラグ、およびカバードバックグラウンド領域判定フラグを基に、各画素が、アンカバードバックグラウンド領域、静止領域、動き領域、およびカバードバックグラウンド領域のいずれかに属することを示す領域情報を生成し、判定フラグ格納フレームメモリ２０６に供給する。
【０１８０】
判定フラグ格納フレームメモリ２０６は、合成部２０５から供給された領域情報を記憶すると共に、記憶している領域情報を出力する。
【０１８１】
次に、領域特定部１０３の処理の例を図２５乃至図２９を参照して説明する。
【０１８２】
前景に対応するオブジェクトが移動しているとき、オブジェクトに対応する画像の画面上の位置は、フレーム毎に変化する。図２５に示すように、フレーム#nにおいて、Yn(x,y)で示される位置に位置するオブジェクトに対応する画像は、次のフレームであるフレーム#n+1において、Yn+1(x,y)に位置する。
【０１８３】
前景のオブジェクトに対応する画像の動き方向に隣接して１列に並ぶ画素の画素値を時間方向に展開したモデル図を図２６に示す。例えば、前景のオブジェクトに対応する画像の動き方向が画面に対して水平であるとき、図２６におけるモデル図は、１つのライン上の隣接する画素の画素値を時間方向に展開したモデルを示す。
【０１８４】
図２６において、フレーム#nにおけるラインは、フレーム#n+1におけるラインと同一である。
【０１８５】
フレーム#nにおいて、左から２番目の画素乃至１３番目の画素に含まれているオブジェクトに対応する前景の成分は、フレーム#n+1において、左から６番目乃至１７番目の画素に含まれる。
【０１８６】
フレーム#nにおいて、カバードバックグラウンド領域に属する画素は、左から１１番目乃至１３番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から２番目乃至４番目の画素である。フレーム#n+1において、カバードバックグラウンド領域に属する画素は、左から１５番目乃至１７番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から６番目乃至８番目の画素である。
【０１８７】
図２６に示す例において、フレーム#nに含まれる前景の成分が、フレーム#n+1において４画素移動しているので、動き量vは、４である。仮想分割数は、動き量vに対応し、４である。
【０１８８】
次に、注目しているフレームの前後における混合領域に属する画素の画素値の変化について説明する。
【０１８９】
図２７に示す、背景が静止し、前景の動き量vが４であるフレーム#nにおいて、カバードバックグラウンド領域に属する画素は、左から１５番目乃至１７番目の画素である。動き量vが４であるので、１つ前のフレーム#n-1において、左から１５番目乃至１７番目の画素は、背景の成分のみを含み、背景領域に属する。また、更に１つ前のフレーム#n-2において、左から１５番目乃至１７番目の画素は、背景の成分のみを含み、背景領域に属する。
【０１９０】
ここで、背景に対応するオブジェクトが静止しているので、フレーム#n-1の左から１５番目の画素の画素値は、フレーム#n-2の左から１５番目の画素の画素値から変化しない。同様に、フレーム#n-1の左から１６番目の画素の画素値は、フレーム#n-2の左から１６番目の画素の画素値から変化せず、フレーム#n-1の左から１７番目の画素の画素値は、フレーム#n-2の左から１７番目の画素の画素値から変化しない。
【０１９１】
すなわち、フレーム#nにおけるカバードバックグラウンド領域に属する画素に対応する、フレーム#n-1およびフレーム#n-2の画素は、背景の成分のみから成り、画素値が変化しないので、その差の絶対値は、ほぼ０の値となる。従って、フレーム#nにおける混合領域に属する画素に対応する、フレーム#n-1およびフレーム#n-2の画素に対する静動判定は、静動判定部２０２−４により、静止と判定される。
【０１９２】
フレーム#nにおけるカバードバックグラウンド領域に属する画素は、前景の成分を含むので、フレーム#n-1における背景の成分のみから成る場合と、画素値が異なる。従って、フレーム#nにおける混合領域に属する画素、および対応するフレーム#n-1の画素に対する静動判定は、静動判定部２０２−３により、動きと判定される。
【０１９３】
このように、領域判定部２０３−３は、静動判定部２０２−３から動きを示す静動判定の結果が供給され、静動判定部２０２−４から静止を示す静動判定の結果が供給されたとき、対応する画素がカバードバックグラウンド領域に属すると判定する。
【０１９４】
図２８に示す、背景が静止し、前景の動き量vが４であるフレーム#nにおいて、アンカバードバックグラウンド領域に含まれる画素は、左から２番目乃至４番目の画素である。動き量vが４であるので、１つ後のフレーム#n+1において、左から２番目乃至４番目の画素は、背景の成分のみを含み、背景領域に属する。また、更に１つ後のフレーム#n+2において、左から２番目乃至４番目の画素は、背景の成分のみを含み、背景領域に属する。
【０１９５】
ここで、背景に対応するオブジェクトが静止しているので、フレーム#n+2の左から２番目の画素の画素値は、フレーム#n+1の左から２番目の画素の画素値から変化しない。同様に、フレーム#n+2の左から３番目の画素の画素値は、フレーム#n+1の左から３番目の画素の画素値から変化せず、フレーム#n+2の左から４番目の画素の画素値は、フレーム#n+1の左から４番目の画素の画素値から変化しない。
【０１９６】
すなわち、フレーム#nにおけるアンカバードバックグラウンド領域に属する画素に対応する、フレーム#n+1およびフレーム#n+2の画素は、背景の成分のみから成り、画素値が変化しないので、その差の絶対値は、ほぼ０の値となる。従って、フレーム#nにおける混合領域に属する画素に対応する、フレーム#n+1およびフレーム#n+2の画素に対する静動判定は、静動判定部２０２−１により、静止と判定される。
【０１９７】
フレーム#nにおけるアンカバードバックグラウンド領域に属する画素は、前景の成分を含むので、フレーム#n+1における背景の成分のみから成る場合と、画素値が異なる。従って、フレーム#nにおける混合領域に属する画素、および対応するフレーム#n+1の画素に対する静動判定は、静動判定部２０２−２により、動きと判定される。
【０１９８】
このように、領域判定部２０３−１は、静動判定部２０２−２から動きを示す静動判定の結果が供給され、静動判定部２０２−１から静止を示す静動判定の結果が供給されたとき、対応する画素がアンカバードバックグラウンド領域に属すると判定する。
【０１９９】
図２９は、フレーム#nにおける領域特定部１０３の判定条件を示す図である。フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-2の画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素とが静止と判定され、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素と、フレーム#nの画素とが動きと判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素がカバードバックグラウンド領域に属すると判定する。
【０２００】
フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素と、フレーム#nの画素とが静止と判定され、フレーム#nの画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素とが静止と判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素が静止領域に属すると判定する。
【０２０１】
フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n-1の画素と、フレーム#nの画素とが動きと判定され、フレーム#nの画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素とが動きと判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素が動き領域に属すると判定する。
【０２０２】
フレーム#nの画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素とが動きと判定され、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+1の画素と、フレーム#nの判定の対象となる画素の画像上の位置と同一の位置にあるフレーム#n+2の画素とが静止と判定されたとき、領域特定部１０３は、フレーム#nの判定の対象となる画素がアンカバードバックグラウンド領域に属すると判定する。
【０２０３】
図２６は、領域特定部１０３の領域の特定の結果の例を示す図である。図３０Ａにおいて、カバードバックグラウンド領域に属すると判定された画素は、白で表示されている。図３０Ｂにおいて、アンカバードバックグラウンド領域に属すると判定された画素は、白で表示されている。
【０２０４】
図３０Ｃにおいて、動き領域に属すると判定された画素は、白で表示されている。図３０Ｄにおいて、静止領域に属すると判定された画素は、白で表示されている。
【０２０５】
図３１は、判定フラグ格納フレームメモリ２０６が出力する領域情報の内、混合領域を示す領域情報を画像として示す図である。図３１において、カバードバックグラウンド領域またはアンカバードバックグラウンド領域に属すると判定された画素、すなわち混合領域に属すると判定された画素は、白で表示されている。判定フラグ格納フレームメモリ２０６が出力する混合領域を示す領域情報は、混合領域、および前景領域内のテクスチャの無い部分に囲まれたテクスチャの有る部分を示す。
【０２０６】
次に、図３２のフローチャートを参照して、領域特定部１０３の領域特定の処理を説明する。ステップＳ２０１において、フレームメモリ２０１は、判定の対象となるフレーム#nを含むフレーム#n-2乃至フレーム#n+2の画像を取得する。
【０２０７】
ステップＳ２０２において、静動判定部２０２−３は、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、静止か否かを判定し、静止と判定された場合、ステップＳ２０３に進み、静動判定部２０２−２は、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止か否かを判定する。
【０２０８】
ステップＳ２０３において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止と判定された場合、ステップＳ２０４に進み、領域判定部２０３−２は、領域の判定される画素に対応する静止領域判定フラグに、静止領域に属することを示す”１”を設定する。領域判定部２０３−２は、静止領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２０５に進む。
【０２０９】
ステップＳ２０２において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きと判定された場合、または、ステップＳ２０３において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きと判定された場合、フレーム#nの画素が静止領域には属さないので、ステップＳ２０４の処理はスキップされ、手続きは、ステップＳ２０５に進む。
【０２１０】
ステップＳ２０５において、静動判定部２０２−３は、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きか否かを判定し、動きと判定された場合、ステップＳ２０６に進み、静動判定部２０２−２は、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きか否かを判定する。
【０２１１】
ステップＳ２０６において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きと判定された場合、ステップＳ２０７に進み、領域判定部２０３−２は、領域の判定される画素に対応する動き領域判定フラグに、動き領域に属することを示す”１”を設定する。領域判定部２０３−２は、動き領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２０８に進む。
【０２１２】
ステップＳ２０５において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、静止と判定された場合、または、ステップＳ２０６において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止と判定された場合、フレーム#nの画素が動き領域には属さないので、ステップＳ２０７の処理はスキップされ、手続きは、ステップＳ２０８に進む。
【０２１３】
ステップＳ２０８において、静動判定部２０２−４は、フレーム#n-2の画素とフレーム#n-1の同一位置の画素とで、静止か否かを判定し、静止と判定された場合、ステップＳ２０９に進み、静動判定部２０２−３は、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きか否かを判定する。
【０２１４】
ステップＳ２０９において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、動きと判定された場合、ステップＳ２１０に進み、領域判定部２０３−３は、領域の判定される画素に対応するカバードバックグラウンド領域判定フラグに、カバードバックグラウンド領域に属することを示す”１”を設定する。領域判定部２０３−３は、カバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２１１に進む。
【０２１５】
ステップＳ２０８において、フレーム#n-2の画素とフレーム#n-1の同一位置の画素とで、動きと判定された場合、または、ステップＳ２０９において、フレーム#n-1の画素とフレーム#nの同一位置の画素とで、静止と判定された場合、フレーム#nの画素がカバードバックグラウンド領域には属さないので、ステップＳ２１０の処理はスキップされ、手続きは、ステップＳ２１１に進む。
【０２１６】
ステップＳ２１１において、静動判定部２０２−２は、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、動きか否かを判定し、動きと判定された場合、ステップＳ２１２に進み、静動判定部２０２−１は、フレーム#n+1の画素とフレーム#n+2の同一位置の画素とで、静止か否かを判定する。
【０２１７】
ステップＳ２１２において、フレーム#n+1の画素とフレーム#n+2の同一位置の画素とで、静止と判定された場合、ステップＳ２１３に進み、領域判定部２０３−１は、領域の判定される画素に対応するアンカバードバックグラウンド領域判定フラグに、アンカバードバックグラウンド領域に属することを示す”１”を設定する。領域判定部２０３−１は、アンカバードバックグラウンド領域判定フラグを判定フラグ格納フレームメモリ２０４に供給し、手続きは、ステップＳ２１４に進む。
【０２１８】
ステップＳ２１１において、フレーム#nの画素とフレーム#n+1の同一位置の画素とで、静止と判定された場合、または、ステップＳ２１２において、フレーム#n+1の画素とフレーム#n+2の同一位置の画素とで、動きと判定された場合、フレーム#nの画素がアンカバードバックグラウンド領域には属さないので、ステップＳ２１３の処理はスキップされ、手続きは、ステップＳ２１４に進む。
【０２１９】
ステップＳ２１４において、領域特定部１０３は、フレーム#nの全ての画素について領域を特定したか否かを判定し、フレーム#nの全ての画素について領域を特定していないと判定された場合、手続きは、ステップＳ２０２に戻り、他の画素について、領域特定の処理を繰り返す。
【０２２０】
ステップＳ２１４において、フレーム#nの全ての画素について領域を特定したと判定された場合、ステップＳ２１５に進み、合成部２０５は、判定フラグ格納フレームメモリ２０４に記憶されているアンカバードバックグラウンド領域判定フラグ、およびカバードバックグラウンド領域判定フラグを基に、混合領域を示す領域情報を生成し、更に、各画素が、アンカバードバックグラウンド領域、静止領域、動き領域、およびカバードバックグラウンド領域のいずれかに属することを示す領域情報を生成し、生成した領域情報を判定フラグ格納フレームメモリ２０６に設定し、処理は終了する。
【０２２１】
このように、領域特定部１０３は、フレームに含まれている画素のそれぞれについて、動き領域、静止領域、アンカバードバックグラウンド領域、またはカバードバックグラウンド領域に属することを示す領域情報を生成することができる。
【０２２２】
なお、領域特定部１０３は、アンカバードバックグラウンド領域およびカバードバックグラウンド領域に対応する領域情報に論理和を適用することにより、混合領域に対応する領域情報を生成して、フレームに含まれている画素のそれぞれについて、動き領域、静止領域、または混合領域に属することを示すフラグから成る領域情報を生成するようにしてもよい。
【０２２３】
前景に対応するオブジェクトがテクスチャを有す場合、領域特定部１０３は、より正確に動き領域を特定することができる。
【０２２４】
領域特定部１０３は、動き領域を示す領域情報を前景領域を示す領域情報として、また、静止領域を示す領域情報を背景領域を示す領域情報として出力することができる。
【０２２５】
なお、背景に対応するオブジェクトが静止しているとして説明したが、背景領域に対応する画像が動きを含んでいても上述した領域を特定する処理を適用することができる。例えば、背景領域に対応する画像が一様に動いているとき、領域特定部１０３は、この動きに対応して画像全体をシフトさせ、背景に対応するオブジェクトが静止している場合と同様に処理する。また、背景領域に対応する画像が局所毎に異なる動きを含んでいるとき、領域特定部１０３は、動きに対応した画素を選択して、上述の処理を実行する。
【０２２６】
図３３は、領域特定部１０３の構成を示すブロック図である。図３３に示す領域特定部１０３は、動きベクトルを使用しない。背景画像生成部３０１は、入力画像に対応する背景画像を生成し、生成した背景画像を２値オブジェクト画像抽出部３０２に供給する。背景画像生成部３０１は、例えば、入力画像に含まれる背景のオブジェクトに対応する画像オブジェクトを抽出して、背景画像を生成する。
【０２２７】
前景のオブジェクトに対応する画像の動き方向に隣接して１列に並ぶ画素の画素値を時間方向に展開したモデル図の例を図３４に示す。例えば、前景のオブジェクトに対応する画像の動き方向が画面に対して水平であるとき、図３４におけるモデル図は、１つのライン上の隣接する画素の画素値を時間方向に展開したモデルを示す。
【０２２８】
図３４において、フレーム#nにおけるラインは、フレーム#n-1およびフレーム#n+1におけるラインと同一である。
【０２２９】
フレーム#nにおいて、左から６番目の画素乃至１７番目の画素に含まれているオブジェクトに対応する前景の成分は、フレーム#n-1において、左から２番目乃至１３番目の画素に含まれ、フレーム#n+1において、左から１０番目乃至２１番目の画素に含まれる。
【０２３０】
フレーム#n-1において、カバードバックグラウンド領域に属する画素は、左から１１番目乃至１３番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から２番目乃至４番目の画素である。フレーム#nにおいて、カバードバックグラウンド領域に属する画素は、左から１５番目乃至１７番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から６番目乃至８番目の画素である。フレーム#n+1において、カバードバックグラウンド領域に属する画素は、左から１９番目乃至２１番目の画素であり、アンカバードバックグラウンド領域に属する画素は、左から１０番目乃至１２番目の画素である。
【０２３１】
フレーム#n-1において、背景領域に属する画素は、左から１番目の画素、および左から１４番目乃至２１番目の画素である。フレーム#nにおいて、背景領域に属する画素は、左から１番目乃至５番目の画素、および左から１８番目乃至２１番目の画素である。フレーム#n+1において、背景領域に属する画素は、左から１番目乃至９番目の画素である。
【０２３２】
背景画像生成部３０１が生成する、図３４の例に対応する背景画像の例を図３５に示す。背景画像は、背景のオブジェクトに対応する画素から構成され、前景のオブジェクトに対応する画像の成分を含まない。
【０２３３】
２値オブジェクト画像抽出部３０２は、背景画像および入力画像の相関を基に、２値オブジェクト画像を生成し、生成した２値オブジェクト画像を時間変化検出部３０３に供給する。
【０２３４】
図３６は、２値オブジェクト画像抽出部３０２の構成を示すブロック図である。相関値演算部３２１は、背景画像生成部３０１から供給された背景画像および入力画像の相関を演算し、相関値を生成して、生成した相関値をしきい値処理部３２２に供給する。
【０２３５】
相関値演算部３２１は、例えば、図３７Ａに示すように、Ｘ4を中心とした３×３の背景画像の中のブロックと、図３７Ｂに示すように、背景画像の中のブロックに対応するＹ4を中心とした３×３の入力画像の中のブロックに、式（４）を適用して、Ｙ4に対応する相関値を算出する。
【０２３６】
【数２】

・・・（４）
【０２３７】
【数３】

・・・（５）
【０２３８】
【数４】

・・・（６）
【０２３９】
相関値演算部３２１は、このように各画素に対応して算出された相関値をしきい値処理部３２２に供給する。
【０２４０】
また、相関値演算部３２１は、例えば、図３８Ａに示すように、Ｘ4を中心とした３×３の背景画像の中のブロックと、図３８Ｂに示すように、背景画像の中のブロックに対応するＹ4を中心とした３×３の入力画像の中のブロックに、式（７）を適用して、Ｙ4に対応する差分絶対値和を算出するようにしてもよい。
【０２４１】
【数５】

・・・（７）
【０２４２】
相関値演算部３２１は、このように算出された差分絶対値和を相関値として、しきい値処理部３２２に供給する。
【０２４３】
しきい値処理部３２２は、相関画像の画素値としきい値th0とを比較して、相関値がしきい値th0以下である場合、２値オブジェクト画像の画素値に1を設定し、相関値がしきい値th0より大きい場合、２値オブジェクト画像の画素値に0を設定して、0または1が画素値に設定された２値オブジェクト画像を出力する。しきい値処理部３２２は、しきい値th0を予め記憶するようにしてもよく、または、外部から入力されたしきい値th0を使用するようにしてもよい。
【０２４４】
図３９は、図３４に示す入力画像のモデルに対応する２値オブジェクト画像の例を示す図である。２値オブジェクト画像において、背景画像と相関の高い画素には、画素値に0が設定される。
【０２４５】
図４０は、時間変化検出部３０３の構成を示すブロック図である。フレームメモリ３４１は、フレーム#nの画素について領域を判定するとき、２値オブジェクト画像抽出部３０２から供給された、フレーム#n-1、フレーム#n、およびフレーム#n+1の２値オブジェクト画像を記憶する。
【０２４６】
領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#n-1、フレーム#n、およびフレーム#n+1の２値オブジェクト画像を基に、フレーム#nの各画素について領域を判定して、領域情報を生成し、生成した領域情報を出力する。
【０２４７】
図４１は、領域判定部３４２の判定を説明する図である。フレーム#nの２値オブジェクト画像の注目している画素が0であるとき、領域判定部３４２は、フレーム#nの注目している画素が背景領域に属すると判定する。
【０２４８】
フレーム#nの２値オブジェクト画像の注目している画素が1であり、フレーム#n-1の２値オブジェクト画像の対応する画素が1であり、フレーム#n+1の２値オブジェクト画像の対応する画素が1であるとき、領域判定部３４２は、フレーム#nの注目している画素が前景領域に属すると判定する。
【０２４９】
フレーム#nの２値オブジェクト画像の注目している画素が1であり、フレーム#n-1の２値オブジェクト画像の対応する画素が0であるとき、領域判定部３４２は、フレーム#nの注目している画素がカバードバックグラウンド領域に属すると判定する。
【０２５０】
フレーム#nの２値オブジェクト画像の注目している画素が1であり、フレーム#n+1の２値オブジェクト画像の対応する画素が0であるとき、領域判定部３４２は、フレーム#nの注目している画素がアンカバードバックグラウンド領域に属すると判定する。
【０２５１】
図４２は、図３４に示す入力画像のモデルに対応する２値オブジェクト画像について、時間変化検出部３０３の判定した例を示す図である。時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの対応する画素が0なので、フレーム#nの左から１番目乃至５番目の画素を背景領域に属すると判定する。
【０２５２】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの画素が1であり、フレーム#n+1の対応する画素が0なので、左から６番目乃至９番目の画素をアンカバードバックグラウンド領域に属すると判定する。
【０２５３】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの画素が1であり、フレーム#n-1の対応する画素が1であり、フレーム#n+1の対応する画素が1なので、左から１０番目乃至１３番目の画素を前景領域に属すると判定する。
【０２５４】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの画素が1であり、フレーム#n-1の対応する画素が0なので、左から１４番目乃至１７番目の画素をカバードバックグラウンド領域に属すると判定する。
【０２５５】
時間変化検出部３０３は、２値オブジェクト画像のフレーム#nの対応する画素が0なので、左から１８番目乃至２１番目の画素を背景領域に属すると判定する。
【０２５６】
次に、図４３のフローチャートを参照して、領域判定部１０３の領域特定の処理を説明する。ステップＳ３０１において、領域判定部１０３の背景画像生成部３０１は、入力画像を基に、例えば、入力画像に含まれる背景のオブジェクトに対応する画像オブジェクトを抽出して背景画像を生成し、生成した背景画像を２値オブジェクト画像抽出部３０２に供給する。
【０２５７】
ステップＳ３０２において、２値オブジェクト画像抽出部３０２は、例えば、図３７を参照して説明した演算により、入力画像と背景画像生成部３０１から供給された背景画像との相関値を演算する。ステップＳ３０３において、２値オブジェクト画像抽出部３０２は、例えば、相関値としきい値th0とを比較することにより、相関値およびしきい値th0から２値オブジェクト画像を演算する。
【０２５８】
ステップＳ３０４において、時間変化検出部３０３は、領域判定の処理を実行して、処理は終了する。
【０２５９】
図４４のフローチャートを参照して、ステップＳ３０４に対応する領域判定の処理の詳細を説明する。ステップＳ３２１において、時間変化検出部３０３の領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#nにおいて、注目する画素が0であるか否かを判定し、フレーム#nにおいて、注目する画素が0であると判定された場合、ステップＳ３２２に進み、フレーム#nの注目する画素が背景領域に属すると設定して、処理は終了する。
【０２６０】
ステップＳ３２１において、フレーム#nにおいて、注目する画素が1であると判定された場合、ステップＳ３２３に進み、時間変化検出部３０３の領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n-1において、対応する画素が0であるか否かを判定し、フレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n-1において、対応する画素が0であると判定された場合、ステップＳ３２４に進み、フレーム#nの注目する画素がカバードバックグラウンド領域に属すると設定して、処理は終了する。
【０２６１】
ステップＳ３２３において、フレーム#nにおいて、注目する画素が0であるか、または、フレーム#n-1において、対応する画素が1であると判定された場合、ステップＳ３２５に進み、時間変化検出部３０３の領域判定部３４２は、フレームメモリ３４１に記憶されているフレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n+1において、対応する画素が0であるか否かを判定し、フレーム#nにおいて、注目する画素が1であり、かつ、フレーム#n+1において、対応する画素が0であると判定された場合、ステップＳ３２６に進み、フレーム#nの注目する画素がアンカバードバックグラウンド領域に属すると設定して、処理は終了する。
【０２６２】
ステップＳ３２５において、フレーム#nにおいて、注目する画素が0であるか、または、フレーム#n+1において、対応する画素が１であると判定された場合、ステップＳ３２７に進み、時間変化検出部３０３の領域判定部３４２は、フレーム#nの注目する画素を前景領域と設定して、処理は終了する。
【０２６３】
このように、領域特定部１０３は、入力された画像と対応する背景画像との相関値を基に、入力画像の画素が前景領域、背景領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域のいずれかに属するかを特定して、特定した結果に対応する領域情報を生成することができる。
【０２６４】
図４５は、領域特定部１０３の他の構成を示すブロック図である。図４５に示す領域特定部１０３は、動き検出部１０２から供給される動きベクトルとその位置情報を使用する。図３３に示す場合と同様の部分には、同一の番号を付してあり、その説明は省略する。
【０２６５】
ロバスト化部３６１は、２値オブジェクト画像抽出部３０２から供給された、Ｎ個のフレームの２値オブジェクト画像を基に、ロバスト化された２値オブジェクト画像を生成して、時間変化検出部３０３に出力する。
【０２６６】
図４６は、ロバスト化部３６１の構成を説明するブロック図である。動き補償部３８１は、動き検出部１０２から供給された動きベクトルとその位置情報を基に、Ｎ個のフレームの２値オブジェクト画像の動きを補償して、動きが補償された２値オブジェクト画像をスイッチ３８２に出力する。
【０２６７】
図４７および図４８の例を参照して、動き補償部３８１の動き補償について説明する。例えば、フレーム#nの領域を判定するとき、図４７に例を示すフレーム#n-1、フレーム#n、およびフレーム#n+1の２値オブジェクト画像が入力された場合、動き補償部３８１は、動き検出部１０２から供給された動きベクトルを基に、図４８に例を示すように、フレーム#n-1の２値オブジェクト画像、およびフレーム#n+1の２値オブジェクト画像を動き補償して、動き補償された２値オブジェクト画像をスイッチ３８２に供給する。
【０２６８】
スイッチ３８２は、１番目のフレームの動き補償された２値オブジェクト画像をフレームメモリ３８３−１に出力し、２番目のフレームの動き補償された２値オブジェクト画像をフレームメモリ３８３−２に出力する。同様に、スイッチ３８２は、３番目乃至Ｎ−１番目のフレームの動き補償された２値オブジェクト画像のそれぞれをフレームメモリ３８３−３乃至フレームメモリ３８３−（Ｎ−１）のいずれかに出力し、Ｎ番目のフレームの動き補償された２値オブジェクト画像をフレームメモリ３８３−Ｎに出力する。
【０２６９】
フレームメモリ３８３−１は、１番目のフレームの動き補償された２値オブジェクト画像を記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−１に出力する。フレームメモリ３８３−２は、２番目のフレームの動き補償された２値オブジェクト画像を記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−２に出力する。
【０２７０】
同様に、フレームメモリ３８３−３乃至フレームメモリ３８３−（Ｎ−１）のそれぞれは、３番目のフレーム乃至Ｎ−１番目のフレームの動き補償された２値オブジェクト画像のいずれかを記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−３乃至重み付け部３８４−（Ｎ−１）のいずれかに出力する。フレームメモリ３８３−Ｎは、Ｎ番目のフレームの動き補償された２値オブジェクト画像を記憶し、記憶されている２値オブジェクト画像を重み付け部３８４−Ｎに出力する。
【０２７１】
重み付け部３８４−１は、フレームメモリ３８３−１から供給された１番目のフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みw1を乗じて、積算部３８５に供給する。重み付け部３８４−２は、フレームメモリ３８３−２から供給された２番目のフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みw2を乗じて、積算部３８５に供給する。
【０２７２】
同様に、重み付け部３８４−３乃至重み付け部３８４−（Ｎ−１）のそれぞれは、フレームメモリ３８３−３乃至フレームメモリ３８３−（Ｎ−１）のいずれかから供給された３番目乃至Ｎ−１番目のいずれかのフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みw3乃至重みw(N-1)のいずれかを乗じて、積算部３８５に供給する。重み付け部３８４−Ｎは、フレームメモリ３８３−Ｎから供給されたＮ番目のフレームの動き補償された２値オブジェクト画像の画素値に予め定めた重みwNを乗じて、積算部３８５に供給する。
【０２７３】
積算部３８５は、１乃至Ｎ番目のフレームの動き補償され、それぞれ重みw1乃至wNのいずれかが乗じられた、２値オブジェクト画像の対応する画素値を積算して、積算された画素値を予め定めたしきい値th0と比較することにより２値オブジェクト画像を生成する。
【０２７４】
このように、ロバスト化部３６１は、Ｎ個の２値オブジェクト画像からロバスト化された２値オブジェト画像を生成して、時間変化検出部３０３に供給するので、図４５に構成を示す領域特定部１０３は、入力画像にノイズが含まれていても、図３３に示す場合に比較して、より正確に領域を特定することができる。
【０２７５】
次に、図４５に構成を示す領域特定部１０３の領域特定の処理について、図４９のフローチャートを参照して説明する。ステップＳ３４１乃至ステップＳ３４３の処理は、図４３のフローチャートで説明したステップＳ３０１乃至ステップＳ３０３とそれぞれ同様なのでその説明は省略する。
【０２７６】
ステップＳ３４４において、ロバスト化部３６１は、ロバスト化の処理を実行する。
【０２７７】
ステップＳ３４５において、時間変化検出部３０３は、領域判定の処理を実行して、処理は終了する。ステップＳ３４５の処理の詳細は、図４４のフローチャートを参照して説明した処理と同様なのでその説明は省略する。
【０２７８】
次に、図５０のフローチャートを参照して、図４９のステップＳ３４４の処理に対応する、ロバスト化の処理の詳細について説明する。ステップＳ３６１において、動き補償部３８１は、動き検出部１０２から供給される動きベクトルとその位置情報を基に、入力された２値オブジェクト画像の動き補償の処理を実行する。ステップＳ３６２において、フレームメモリ３８３−１乃至３８３−Ｎのいずれかは、スイッチ３８２を介して供給された動き補償された２値オブジェクト画像を記憶する。
【０２７９】
ステップＳ３６３において、ロバスト化部３６１は、Ｎ個の２値オブジェクト画像が記憶されたか否かを判定し、Ｎ個の２値オブジェクト画像が記憶されていないと判定された場合、ステップＳ３６１に戻り、２値オブジェクト画像の動き補償の処理および２値オブジェクト画像の記憶の処理を繰り返す。
【０２８０】
ステップＳ３６３において、Ｎ個の２値オブジェクト画像が記憶されたと判定された場合、ステップＳ３６４に進み、重み付け部３８４−１乃至３８４−Ｎのそれぞれは、Ｎ個の２値オブジェクト画像のそれぞれにw1乃至wNのいずれかの重みを乗じて、重み付けする。
【０２８１】
ステップＳ３６５において、積算部３８５は、重み付けされたＮ個の２値オブジェクト画像を積算する。
【０２８２】
ステップＳ３６６において、積算部３８５は、例えば、予め定められたしきい値th1との比較などにより、積算された画像から２値オブジェクト画像を生成して、処理は終了する。
【０２８３】
このように、図４５に構成を示す領域特定部１０３は、ロバスト化された２値オブジェクト画像を基に、領域情報を生成することができる。
【０２８４】
以上のように、領域特定部１０３は、フレームに含まれている画素のそれぞれについて、動き領域、静止領域、アンカバードバックグラウンド領域、またはカバードバックグラウンド領域に属することを示す領域情報を生成することができる。
【０２８５】
図５１は、混合比算出部１０４の構成の一例を示すブロック図である。推定混合比処理部４０１は、入力画像を基に、カバードバックグラウンド領域のモデルに対応する演算により、画素毎に推定混合比を算出して、算出した推定混合比を混合比決定部４０３に供給する。
【０２８６】
推定混合比処理部４０２は、入力画像を基に、アンカバードバックグラウンド領域のモデルに対応する演算により、画素毎に推定混合比を算出して、算出した推定混合比を混合比決定部４０３に供給する。
【０２８７】
前景に対応するオブジェクトがシャッタ時間内に等速で動いていると仮定できるので、混合領域に属する画素の混合比αは、以下の性質を有する。すなわち、混合比αは、画素の位置の変化に対応して、直線的に変化する。画素の位置の変化を１次元とすれば、混合比αの変化は、直線で表現することができ、画素の位置の変化を２次元とすれば、混合比αの変化は、平面で表現することができる。
【０２８８】
なお、１フレームの期間は短いので、前景に対応するオブジェクトが剛体であり、等速で移動していると仮定が成り立つ。
【０２８９】
この場合、混合比αの傾きは、前景のシャッタ時間内での動き量vの逆比となる。
【０２９０】
理想的な混合比αの例を図５２に示す。理想的な混合比αの混合領域における傾きlは、動き量vの逆数として表すことができる。
【０２９１】
図５２に示すように、理想的な混合比αは、背景領域において、１の値を有し、前景領域において、０の値を有し、混合領域において、０を越え１未満の値を有する。
【０２９２】
図５３の例において、フレーム#nの左から７番目の画素の画素値C06は、フレーム#n-1の左から７番目の画素の画素値P06を用いて、式（８）で表すことができる。
【０２９３】
【数６】

・・・（８）
【０２９４】
式（８）において、画素値C06を混合領域の画素の画素値Mと、画素値P06を背景領域の画素の画素値Bと表現する。すなわち、混合領域の画素の画素値Mおよび背景領域の画素の画素値Bは、それぞれ、式（９）および式（１０）のように表現することができる。
【０２９５】
M=C06 ・・・（９）
B=P06 ・・・（１０）
【０２９６】
式（８）中の2/vは、混合比αに対応する。動き量vが４なので、フレーム#nの左から７番目の画素の混合比αは、0.5となる。
【０２９７】
以上のように、注目しているフレーム#nの画素値Cを混合領域の画素値と見なし、フレーム#nの前のフレーム#n-1の画素値Pを背景領域の画素値と見なすことで、混合比αを示す式（３）は、式（１１）のように書き換えられる。
【０２９８】
C=α・P+f ・・・（１１）
式（１１）のfは、注目している画素に含まれる前景の成分の和ΣiFi/vである。式（１１）に含まれる変数は、混合比αおよび前景の成分の和fの２つである。
【０２９９】
同様に、アンカバードバックグラウンド領域における、動き量vが４であり、時間方向の仮想分割数が４である、画素値を時間方向に展開したモデルを図５４に示す。
【０３００】
アンカバードバックグラウンド領域において、上述したカバードバックグラウンド領域における表現と同様に、注目しているフレーム#nの画素値Cを混合領域の画素値と見なし、フレーム#nの後のフレーム#n+1の画素値Nを背景領域の画素値と見なすことで、混合比αを示す式（３）は、式（１２）のように表現することができる。
【０３０１】
C=α・N+f ・・・（１２）
【０３０２】
なお、背景のオブジェクトが静止しているとして説明したが、背景のオブジェクトが動いている場合においても、背景の動き量vに対応させた位置の画素の画素値を利用することにより、式（８）乃至式（１２）を適用することができる。例えば、図５３において、背景に対応するオブジェクトの動き量vが２であり、仮想分割数が２であるとき、背景に対応するオブジェクトが図中の右側に動いているとき、式（１０）における背景領域の画素の画素値Bは、画素値P04とされる。
【０３０３】
式（１１）および式（１２）は、それぞれ２つの変数を含むので、そのままでは混合比αを求めることができない。ここで、画像は一般的に空間的に相関が強いので近接する画素同士でほぼ同じ画素値となる。
【０３０４】
そこで、前景成分は、空間的に相関が強いので、前景の成分の和fを前または後のフレームから導き出せるように式を変形して、混合比αを求める。
【０３０５】
図５５のフレーム#nの左から７番目の画素の画素値Mcは、式（１３）で表すことができる。
【０３０６】
【数７】

・・・（１３）
式（１３）の右辺第１項の2/vは、混合比αに相当する。式（１３）の右辺第２項は、後のフレーム#n+1の画素値を利用して、式（１４）のように表すこととする。
【０３０７】
【数８】

・・・（１４）
【０３０８】
ここで、前景の成分の空間相関を利用して、式（１５）が成立するとする。
【０３０９】
F=F05=F06=F07=F08=F09=F10=F11=F12 ・・・（１５）
式（１４）は、式（１５）を利用して、式（１６）のように置き換えることができる。
【０３１０】
【数９】

・・・（１６）
【０３１１】
結果として、βは、式（１７）で表すことができる。
【０３１２】
β=2/4 ・・・（１７）
【０３１３】
一般的に、式（１５）に示すように混合領域に関係する前景の成分が等しいと仮定すると、混合領域の全ての画素について、内分比の関係から式（１８）が成立する。
【０３１４】
β=1-α ・・・（１８）
【０３１５】
式（１８）が成立するとすれば、式（１１）は、式（１９）に示すように展開することができる。
【０３１６】
【数１０】

・・・（１９）
【０３１７】
同様に、式（１８）が成立するとすれば、式（１２）は、式（２０）に示すように展開することができる。
【０３１８】
【数１１】

・・・（２０）
【０３１９】
式（１９）および式（２０）において、C，N、およびPは、既知の画素値なので、式（１９）および式（２０）に含まれる変数は、混合比αのみである。式（１９）および式（２０）における、C，N、およびPの関係を図５６に示す。Cは、混合比αを算出する、フレーム#nの注目している画素の画素値である。Nは、注目している画素と空間方向の位置が対応する、フレーム#n+1の画素の画素値である。Pは、注目している画素と空間方向の位置が対応する、フレーム#n-1の画素の画素値である。
【０３２０】
従って、式（１９）および式（２０）のそれぞれに１つの変数が含まれることとなるので、３つのフレームの画素の画素値を利用して、混合比αを算出することができる。式（１９）および式（２０）を解くことにより、正しい混合比αが算出されるための条件は、混合領域に関係する前景の成分が等しい、すなわち、前景のオブジェクトが静止しているとき撮像された前景の画像オブジェクトにおいて、前景のオブジェクトの動きの方向に対応する、画像オブジェクトの境界に位置する画素であって、動き量vの２倍の数の連続している画素の画素値が、一定であることである。
【０３２１】
以上のように、カバードバックグラウンド領域に属する画素の混合比αは、式（２１）により算出され、アンカバードバックグラウンド領域に属する画素の混合比αは、式（２２）により算出される。
【０３２２】
α=(C-N)/(P-N) ・・・（２１）
α=(C-P)/(N-P) ・・・（２２）
【０３２３】
図５７は、推定混合比処理部４０１の構成を示すブロック図である。フレームメモリ４２１は、入力された画像をフレーム単位で記憶し、入力画像として入力されているフレームから１つ後のフレームをフレームメモリ４２２および混合比演算部４２３に供給する。
【０３２４】
フレームメモリ４２２は、入力された画像をフレーム単位で記憶し、フレームメモリ４２１から供給されているフレームから１つ後のフレームを混合比演算部４２３に供給する。
【０３２５】
従って、入力画像としてフレーム#n+1が混合比演算部４２３に入力されているとき、フレームメモリ４２１は、フレーム#nを混合比演算部４２３に供給し、フレームメモリ４２２は、フレーム#n-1を混合比演算部４２３に供給する。
【０３２６】
混合比演算部４２３は、式（２１）に示す演算により、フレーム#nの注目している画素の画素値C、注目している画素と空間的位置が対応する、フレーム#n+1の画素の画素値N、および注目している画素と空間的位置が対応する、フレーム#n-1の画素の画素値Pを基に、注目している画素の推定混合比を算出して、算出した推定混合比を出力する。例えば、背景が静止しているとき、混合比演算部４２３は、フレーム#nの注目している画素の画素値C、注目している画素とフレーム内の位置が同じ、フレーム#n+1の画素の画素値N、および注目している画素とフレーム内の位置が同じ、フレーム#n-1の画素の画素値Pを基に、注目している画素の推定混合比を算出して、算出した推定混合比を出力する。
【０３２７】
このように、推定混合比処理部４０１は、入力画像を基に、推定混合比を算出して、混合比決定部４０３に供給することができる。
【０３２８】
なお、推定混合比処理部４０２は、推定混合比処理部４０１が式（２１）に示す演算により、注目している画素の推定混合比を算出するのに対して、式（２２）に示す演算により、注目している画素の推定混合比を算出する部分が異なることを除き、推定混合比処理部４０１と同様なので、その説明は省略する。
【０３２９】
図５８は、推定混合比処理部４０１により算出された推定混合比の例を示す図である。図５８に示す推定混合比は、等速で動いているオブジェクトに対応する前景の動き量vが１１である場合の結果を、１ラインに対して示すものである。
【０３３０】
推定混合比は、混合領域において、図５８に示すように、ほぼ直線的に変化していることがわかる。
【０３３１】
図５１に戻り、混合比決定部４０３は、領域特定部１０３から供給された、混合比αの算出の対象となる画素が、前景領域、背景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域のいずれかに属するかを示す領域情報を基に、混合比αを設定する。混合比決定部４０３は、対象となる画素が前景領域に属する場合、０を混合比αに設定し、対象となる画素が背景領域に属する場合、１を混合比αに設定し、対象となる画素がカバードバックグラウンド領域に属する場合、推定混合比処理部４０１から供給された推定混合比を混合比αに設定し、対象となる画素がアンカバードバックグラウンド領域に属する場合、推定混合比処理部４０２から供給された推定混合比を混合比αに設定する。混合比決定部４０３は、領域情報を基に設定した混合比αを出力する。
【０３３２】
図５９は、混合比算出部１０４の他の構成を示すブロック図である。選択部４４１は、領域特定部１０３から供給された領域情報を基に、カバードバックグラウンド領域に属する画素および、これに対応する前および後のフレームの画素を推定混合比処理部４４２に供給する。選択部４４１は、領域特定部１０３から供給された領域情報を基に、アンカバードバックグラウンド領域に属する画素および、これに対応する前および後のフレームの画素を推定混合比処理部４４３に供給する。
【０３３３】
推定混合比処理部４４２は、選択部４４１から入力された画素値を基に、式（２１）に示す演算により、カバードバックグラウンド領域に属する、注目している画素の推定混合比を算出して、算出した推定混合比を選択部４４４に供給する。
【０３３４】
推定混合比処理部４４３は、選択部４４１から入力された画素値を基に、式（２２）に示す演算により、アンカバードバックグラウンド領域に属する、注目している画素の推定混合比を算出して、算出した推定混合比を選択部４４４に供給する。
【０３３５】
選択部４４４は、領域特定部１０３から供給された領域情報を基に、対象となる画素が前景領域に属する場合、０である推定混合比を選択して、混合比αに設定し、対象となる画素が背景領域に属する場合、１である推定混合比を選択して、混合比αに設定する。選択部４４４は、対象となる画素がカバードバックグラウンド領域に属する場合、推定混合比処理部４４２から供給された推定混合比を選択して混合比αに設定し、対象となる画素がアンカバードバックグラウンド領域に属する場合、推定混合比処理部４４３から供給された推定混合比を選択して混合比αに設定する。選択部４４４は、領域情報を基に選択して設定した混合比αを出力する。
【０３３６】
このように、図５９に示す他の構成を有する混合比算出部１０４は、画像の含まれる画素毎に混合比αを算出して、算出した混合比αを出力することができる。
【０３３７】
図６０のフローチャートを参照して、図５１に構成を示す混合比算出部１０４の混合比αの算出の処理を説明する。ステップＳ４０１において、混合比算出部１０４は、領域特定部１０３から供給された領域情報を取得する。ステップＳ４０２において、推定混合比処理部４０１は、カバードバックグラウンド領域に対応するモデルにより推定混合比の演算の処理を実行し、算出した推定混合比を混合比決定部４０３に供給する。混合比推定の演算の処理の詳細は、図６１のフローチャートを参照して、後述する。
【０３３８】
ステップＳ４０３において、推定混合比処理部４０２は、アンカバードバックグラウンド領域に対応するモデルにより推定混合比の演算の処理を実行し、算出した推定混合比を混合比決定部４０３に供給する。
【０３３９】
ステップＳ４０４において、混合比算出部１０４は、フレーム全体について、混合比αを推定したか否かを判定し、フレーム全体について、混合比αを推定していないと判定された場合、ステップＳ４０２に戻り、次の画素について混合比αを推定する処理を実行する。
【０３４０】
ステップＳ４０４において、フレーム全体について、混合比αを推定したと判定された場合、ステップＳ４０５に進み、混合比決定部４０３は、画素が、前景領域、背景領域、カバードバックグラウンド領域、またはアンカバードバックグラウンド領域のいずれかに属するかを示す、領域特定部１０３から供給された領域情報を基に、混合比αを設定する。混合比決定部４０３は、対象となる画素が前景領域に属する場合、０を混合比αに設定し、対象となる画素が背景領域に属する場合、１を混合比αに設定し、対象となる画素がカバードバックグラウンド領域に属する場合、推定混合比処理部４０１から供給された推定混合比を混合比αに設定し、対象となる画素がアンカバードバックグラウンド領域に属する場合、推定混合比処理部４０２から供給された推定混合比を混合比αに設定し、処理は終了する。
【０３４１】
このように、混合比算出部１０４は、領域特定部１０３から供給された領域情報、および入力画像を基に、各画素に対応する特徴量である混合比αを算出することができる。
【０３４２】
図５９に構成を示す混合比算出部１０４の混合比αの算出の処理は、図６０のフローチャートで説明した処理と同様なので、その説明は省略する。
【０３４３】
次に、図６０のステップＳ４０２に対応する、カバードバックグラウンド領域に対応するモデルによる混合比推定の処理を図６１のフローチャートを参照して説明する。
【０３４４】
ステップＳ４２１において、混合比演算部４２３は、フレームメモリ４２１から、フレーム#nの注目画素の画素値Cを取得する。
【０３４５】
ステップＳ４２２において、混合比演算部４２３は、フレームメモリ４２２から、注目画素に対応する、フレーム#n-1の画素の画素値Pを取得する。
【０３４６】
ステップＳ４２３において、混合比演算部４２３は、入力画像に含まれる注目画素に対応する、フレーム#n+1の画素の画素値Nを取得する。
【０３４７】
ステップＳ４２４において、混合比演算部４２３は、フレーム#nの注目画素の画素値C、フレーム#n-1の画素の画素値P、およびフレーム#n+1の画素の画素値Nを基に、推定混合比を演算する。
【０３４８】
ステップＳ４２５において、混合比演算部４２３は、フレーム全体について、推定混合比を演算する処理を終了したか否かを判定し、フレーム全体について、推定混合比を演算する処理を終了していないと判定された場合、ステップＳ４２１に戻り、次の画素について推定混合比を算出する処理を繰り返す。
【０３４９】
ステップＳ４２５において、フレーム全体について、推定混合比を演算する処理を終了したと判定された場合、処理は終了する。
【０３５０】
このように、推定混合比処理部４０１は、入力画像を基に、推定混合比を演算することができる。
【０３５１】
図６０のステップＳ４０３におけるアンカバードバックグラウンド領域に対応するモデルによる混合比推定の処理は、アンカバードバックグラウンド領域のモデルに対応する式を利用した、図６１のフローチャートに示す処理と同様なので、その説明は省略する。
【０３５２】
なお、図５９に示す推定混合比処理部４４２および推定混合比処理部４４３は、図６１に示すフローチャートと同様の処理を実行して推定混合比を演算するので、その説明は省略する。
【０３５３】
また、背景に対応するオブジェクトが静止しているとして説明したが、背景領域に対応する画像が動きを含んでいても上述した混合比αを求める処理を適用することができる。例えば、背景領域に対応する画像が一様に動いているとき、推定混合比処理部４０１は、背景の動きに対応して画像全体をシフトさせ、背景に対応するオブジェクトが静止している場合と同様に処理する。また、背景領域に対応する画像が局所毎に異なる背景の動きを含んでいるとき、推定混合比処理部４０１は、混合領域に属する画素に対応する画素として、背景の動きに対応した画素を選択して、上述の処理を実行する。
【０３５４】
また、混合比算出部１０４は、全ての画素について、カバードバックグラウンド領域に対応するモデルによる混合比推定の処理のみを実行して、算出された推定混合比を混合比αとして出力するようにしてもよい。この場合において、混合比αは、カバードバックグラウンド領域に属する画素について、背景の成分の割合を示し、アンカバードバックグラウンド領域に属する画素について、前景の成分の割合を示す。アンカバードバックグラウンド領域に属する画素について、このように算出された混合比αと１との差分の絶対値を算出して、算出した絶対値を混合比αに設定すれば、分離処理サーバ１１は、アンカバードバックグラウンド領域に属する画素について、背景の成分の割合を示す混合比αを求めることができる。
【０３５５】
なお、同様に、混合比算出部１０４は、全ての画素について、アンカバードバックグラウンド領域に対応するモデルによる混合比推定の処理のみを実行して、算出された推定混合比を混合比αとして出力するようにしてもよい。
【０３５６】
次に、混合比αが直線的に変化する性質を利用して混合比αを算出する混合比算出部１０４について説明する。
【０３５７】
上述したように、式（１１）および式（１２）は、それぞれ２つの変数を含むので、そのままでは混合比αを求めることができない。
【０３５８】
そこで、シャッタ時間内において、前景に対応するオブジェクトが等速で動くことによる、画素の位置の変化に対応して、混合比αが直線的に変化する性質を利用して、空間方向に、混合比αと前景の成分の和fとを近似した式を立てる。混合領域に属する画素の画素値および背景領域に属する画素の画素値の組の複数を利用して、混合比αと前景の成分の和fとを近似した式を解く。
【０３５９】
混合比αの変化を、直線として近似すると、混合比αは、式（２３）で表される。
【０３６０】
α=il+p ・・・（２３）
式（２３）において、iは、注目している画素の位置を０とした空間方向のインデックスである。lは、混合比αの直線の傾きである。pは、混合比αの直線の切片である共に、注目している画素の混合比αである。式（２３）において、インデックスiは、既知であるが、傾きlおよび切片pは、未知である。
【０３６１】
インデックスi、傾きl、および切片pの関係を図６２に示す。
【０３６２】
混合比αを式（２３）のように近似することにより、複数の画素に対して複数の異なる混合比αは、２つの変数で表現される。図６２に示す例において、５つの画素に対する５つの混合比は、２つの変数である傾きlおよび切片pにより表現される。
【０３６３】
図６３に示す平面で混合比αを近似すると、画像の水平方向および垂直方向の２つの方向に対応する動きvを考慮したとき、式（２３）を平面に拡張して、混合比αは、式（２４）で表される。
【０３６４】
α=jm+kq+p ・・・（２４）
式（２４）において、jは、注目している画素の位置を０とした水平方向のインデックスであり、kは、垂直方向のインデックスである。mは、混合比αの面の水平方向の傾きであり、qは、混合比αの面の垂直方向の傾きである。pは、混合比αの面の切片である。
【０３６５】
例えば、図５３に示すフレーム#nにおいて、C05乃至C07について、それぞれ、式（２５）乃至式（２７）が成立する。
【０３６６】
C05=α05・B05/v+f05 ・・・（２５）
C06=α06・B06/v+f06 ・・・（２６）
C07=α07・B07/v+f07 ・・・（２７）
【０３６７】
前景の成分が近傍で一致する、すなわち、F01乃至F03が等しいとして、F01乃至F03をFcに置き換えると式（２８）が成立する。
【０３６８】
f(x)=(1-α(x))・Fc ・・・（２８）
式（２８）において、xは、空間方向の位置を表す。
【０３６９】
α（ｘ）を式（２４）で置き換えると、式（２８）は、式（２９）として表すことができる。
【０３７０】
f(x)=(1-(jm+kq+p))・Fc
=j・（-m・Fc）+k・（-q・Fc）+((1-p)・Fc)
=js+kt+u ・・・（２９）
【０３７１】
式（２９）において、（-m・Fc）、（-q・Fc）、および(1-p)・Fcは、式（３０）乃至式（３２）に示すように置き換えられている。
【０３７２】
s=-m・Fc ・・・（３０）
t=-q・Fc ・・・（３１）
u=(1-p)・Fc ・・・（３２）
【０３７３】
式（２９）において、jは、注目している画素の位置を０とした水平方向のインデックスであり、kは、垂直方向のインデックスである。
【０３７４】
このように、前景に対応するオブジェクトがシャッタ時間内において等速に移動し、前景に対応する成分が近傍において一定であるという仮定が成立するので、前景の成分の和は、式（２９）で近似される。
【０３７５】
なお、混合比αを直線で近似する場合、前景の成分の和は、式（３３）で表すことができる。
【０３７６】
f(x)=is+u ・・・（３３）
【０３７７】
式（１３）の混合比αおよび前景成分の和を、式（２４）および式（２９）を利用して置き換えると、画素値Mは、式（３４）で表される。
【０３７８】
M=(jm+kq+p)・B+js+kt+u
=jB・m+kB・q+B・p+j・s+k・t+u ・・・（３４）
【０３７９】
式（３４）において、未知の変数は、混合比αの面の水平方向の傾きm、混合比αの面の垂直方向の傾きq、混合比αの面の切片p、s、t、およびuの６つである。
【０３８０】
注目している画素の近傍の画素に対応させて、式（３４）に示す正規方程式に、画素値Mまたは画素値Bを設定し、画素値Mまたは画素値Bが設定された複数の正規方程式を最小自乗法で解いて、混合比αを算出する。
【０３８１】
例えば、注目している画素の水平方向のインデックスjを０とし、垂直方向のインデックスkを０とし、注目している画素の近傍の３×３の画素について、式（３４）に示す正規方程式に画素値Mまたは画素値Bを設定すると、式（３５）乃至式（４３）を得る。
【０３８２】
M-1,-1=(-1)・B-1,-1・m+(-1)・B-1,-1・q+B-1,-1・p+(-1)・s+(-1)・t+u・・・（３５）
M0,-1=(0)・B0,-1・m+(-1)・B0,-1・q+B0,-1・p+(0)・s+(-1)・t+u・・・（３６）
M+1,-1=(+1)・B+1,-1・m+(-1)・B+1,-1・q+B+1,-1・p+(+1)・s+(-1)・t+u・・・（３７）
M-1,0=(-1)・B-1,0・m+(0)・B-1,0・q+B-1,0・p+(-1)・s+(0)・t+u・・・（３８）
M0,0=(0)・B0,0・m+(0)・B0,0・q+B0,0・p+(0)・s+(0)・t+u・・・（３９）
M+1,0=(+1)・B+1,0・m+(0)・B+1,0・q+B+1,0・p+(+1)・s+(0)・t+u・・・（４０）
M-1,+1=(-1)・B-1,+1・m+(+1)・B-1,+1・q+B-1,+1・p+(-1)・s+(+1)・t+u・・・（４１）
M0,+1=(0)・B0,+1・m+(+1)・B0,+1・q+B0,+1・p+(0)・s+(+1)・t+u・・・（４２）
M+1,+1=(+1)・B+1,+1・m+(+1)・B+1,+1・q+B+1,+1・p+(+1)・s+(+1)・t+u・・・（４３）
【０３８３】
注目している画素の水平方向のインデックスjが０であり、垂直方向のインデックスkが０であるので、注目している画素の混合比αは、式（２４）より、j=0およびk=0のときの値、すなわち、切片pに等しい。
【０３８４】
従って、式（３５）乃至式（４３）の９つの式を基に、最小自乗法により、水平方向の傾きm、垂直方向の傾きq、切片p、s、t、およびuのそれぞれの値を算出し、切片pを混合比αとして出力すればよい。
【０３８５】
次に、最小自乗法を適用して混合比αを算出するより具体的な手順を説明する。
【０３８６】
インデックスiおよびインデックスkを１つのインデックスxで表現すると、インデックスi、インデックスk、およびインデックスxの関係は、式（４４）で表される。
【０３８７】
x=(j+1)・3+(k+1) ・・・（４４）
【０３８８】
水平方向の傾きm、垂直方向の傾きq、切片p、s、t、およびuをそれぞれ変数w0,w1,w2,w3,w4、およびW5と表現し、jB,kB,B,j,k、および1をそれぞれa0,a1,a2,a3,a4、およびa5と表現する。誤差exを考慮すると、式（３５）乃至式（４３）は、式（４５）に書き換えることができる。
【０３８９】
【数１２】

・・・（４５）
式（４５）において、xは、０乃至８の整数のいずれかの値である。
【０３９０】
式（４５）から、式（４６）を導くことができる。
【０３９１】
【数１３】

・・・（４６）
【０３９２】
ここで、最小自乗法を適用するため、誤差の自乗和Eを式（４７）に示すようにに定義する。
【０３９３】
【数１４】

・・・（４７）
【０３９４】
誤差が最小になるためには、誤差の自乗和Eに対する、変数Wvの偏微分が０になればよい。ここで、vは、０乃至５の整数のいずれかの値である。従って、式（４８）を満たすようにwyを求める。
【０３９５】
【数１５】

・・・（４８）
【０３９６】
式（４８）に式（４６）を代入すると、式（４９）を得る。
【０３９７】
【数１６】

・・・（４９）
【０３９８】
式（４９）のvに０乃至５の整数のいずれか１つを代入して得られる６つの式に、例えば、掃き出し法（Gauss-Jordanの消去法）などを適用して、wyを算出する。上述したように、w0は水平方向の傾きmであり、w1は垂直方向の傾きqであり、w2は切片pであり、w3はsであり、w4はtであり、w5はuである。
【０３９９】
以上のように、画素値Mおよび画素値Bを設定した式に、最小自乗法を適用することにより、水平方向の傾きm、垂直方向の傾きq、切片p、s、t、およびuを求めることができる。
【０４００】
式（３５）乃至式（４３）に対応する説明において、混合領域に含まれる画素の画素値をMとし、背景領域に含まれる画素の画素値をBとして説明したが、注目している画素が、カバードバックグラウンド領域に含まれる場合、またはアンカバードバックグラウンド領域に含まれる場合のそれぞれに対して、正規方程式を立てる必要がある。
【０４０１】
例えば、図５３に示す、フレーム#nのカバードバックグラウンド領域に含まれる画素の混合比αを求める場合、フレーム#nの画素のC04乃至C08、およびフレーム#n-1の画素の画素値P04乃至P08が、正規方程式に設定される。
【０４０２】
図５３に示す、フレーム#nのアンカバードバックグラウンド領域に含まれる画素の混合比αを求める場合、フレーム#nの画素のC28乃至C32、およびフレーム#n+1の画素の画素値N28乃至N32が、正規方程式に設定される。
【０４０３】
また、例えば、図６４に示す、カバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、以下の式（５０）乃至式（５８）が立てられる。混合比αを算出する画素の画素値は、Mc5である。
【０４０４】
Mc1=(-1)・Bc1・m+(-1)・Bc1・q+Bc1・p+(-1)・s+(-1)・t+u・・・（５０）
Mc2=(0)・Bc2・m+(-1)・Bc2・q+Bc2・p+(0)・s+(-1)・t+u・・・（５１）
Mc3=(+1)・Bc3・m+(-1)・Bc3・q+Bc3・p+(+1)・s+(-1)・t+u・・・（５２）
Mc4=(-1)・Bc4・m+(0)・Bc4・q+Bc4・p+(-1)・s+(0)・t+u・・・（５３）
Mc5=(0)・Bc5・m+(0)・Bc5・q+Bc5・p+(0)・s+(0)・t+u・・・（５４）
Mc6=(+1)・Bc6・m+(0)・Bc6・q+Bc6・p+(+1)・s+(0)・t+u・・・（５５）
Mc7=(-1)・Bc7・m+(+1)・Bc7・q+Bc7・p+(-1)・s+(+1)・t+u・・・（５６）
Mc8=(0)・Bc8・m+(+1)・Bc8・q+Bc8・p+(0)・s+(+1)・t+u・・・（５７）
Mc9=(+1)・Bc9・m+(+1)・Bc9・q+Bc9・p+(+1)・s+(+1)・t+u・・・（５８）
【０４０５】
フレーム#nのカバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、式（５０）乃至式（５８）において、フレーム#nの画素に対応する、フレーム#n-1の画素の背景領域の画素の画素値Bc1乃至Bc9が使用される。
【０４０６】
図６４に示す、アンカバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、以下の式（５９）乃至式（６７）が立てられる。混合比αを算出する画素の画素値は、Mu5である。
【０４０７】
Mu1=(-1)・Bu1・m+(-1)・Bu1・q+Bu1・p+(-1)・s+(-1)・t+u・・・（５９）
Mu2=(0)・Bu2・m+(-1)・Bu2・q+Bu2・p+(0)・s+(-1)・t+u・・・（６０）
Mu3=(+1)・Bu3・m+(-1)・Bu3・q+Bu3・p+(+1)・s+(-1)・t+u・・・（６１）
Mu4=(-1)・Bu4・m+(0)・Bu4・q+Bu4・p+(-1)・s+(0)・t+u・・・（６２）
Mu5=(0)・Bu5・m+(0)・Bu5・q+Bu5・p+(0)・s+(0)・t+u・・・（６３）
Mu6=(+1)・Bu6・m+(0)・Bu6・q+Bu6・p+(+1)・s+(0)・t+u・・・（６４）
Mu7=(-1)・Bu7・m+(+1)・Bu7・q+Bu7・p+(-1)・s+(+1)・t+u・・・（６５）
Mu8=(0)・Bu8・m+(+1)・Bu8・q+Bu8・p+(0)・s+(+1)・t+u・・・（６６）
Mu9=(+1)・Bu9・m+(+1)・Bu9・q+Bu9・p+(+1)・s+(+1)・t+u・・・（６７）
【０４０８】
フレーム#nのアンカバードバックグラウンド領域に含まれる画素の混合比αを算出するとき、式（５９）乃至式（６７）において、フレーム#nの画素に対応する、フレーム#n+1の画素の背景領域の画素の画素値Bu1乃至Bu9が使用される。
【０４０９】
図６５は、推定混合比処理部４０１の構成を示すブロック図である。推定混合比処理部４０１に入力された画像は、遅延部５０１および足し込み部５０２に供給される。
【０４１０】
遅延回路２２１は、入力画像を１フレーム遅延させ、足し込み部５０２に供給する。足し込み部５０２に、入力画像としてフレーム#nが入力されているとき、遅延回路２２１は、フレーム#n-1を足し込み部５０２に供給する。
【０４１１】
足し込み部５０２は、混合比αを算出する画素の近傍の画素の画素値、およびフレーム#n-1の画素値を、正規方程式に設定する。例えば、足し込み部５０２は、式（５０）乃至式（５８）に基づいて、正規方程式に画素値Mc1乃至Mc9および画素値Bc1乃至Bc9を設定する。足し込み部５０２は、画素値が設定された正規方程式を演算部５０３に供給する。
【０４１２】
演算部５０３は、足し込み部５０２から供給された正規方程式を掃き出し法などにより解いて推定混合比を求め、求められた推定混合比を出力する。
【０４１３】
このように、推定混合比処理部４０１は、入力画像を基に、推定混合比を算出して、混合比決定部４０３に供給することができる。
【０４１４】
なお、推定混合比処理部４０２は、推定混合比処理部４０１と同様の構成を有するので、その説明は省略する。
【０４１５】
図６６は、推定混合比処理部４０１により算出された推定混合比の例を示す図である。図６６に示す推定混合比は、等速で動いているオブジェクトに対応する前景の動きvが１１であり、７×７画素のブロックを単位として方程式を生成して算出された結果を、１ラインに対して示すものである。
【０４１６】
推定混合比は、混合領域において、図６６に示すように、ほぼ直線的に変化していることがわかる。
【０４１７】
次に、図６５に構成を示す推定混合比処理部４０１による、カバードバックグラウンド領域に対応するモデルによる混合比推定の処理を図６７のフローチャートを参照して説明する。
【０４１８】
ステップＳ５２１において、足し込み部５０２は、入力された画像に含まれる画素値、および遅延回路２２１から供給される画像に含まれる画素値を、カバードバックグラウンド領域のモデルに対応する正規方程式に設定する。
【０４１９】
ステップＳ５２２において、推定混合比処理部４０１は、対象となる画素についての設定が終了したか否かを判定し、対象となる画素についての設定が終了していないと判定された場合、ステップＳ５２１に戻り、正規方程式への画素値の設定の処理を繰り返す。
【０４２０】
ステップＳ５２２において、対象となる画素についての画素値の設定が終了したと判定された場合、ステップＳ５２３に進み、演算部１７３は、画素値が設定された正規方程式を基に、推定混合比を演算して、求められた推定混合比を出力する。
【０４２１】
このように、図６５に構成を示す推定混合比処理部４０１は、入力画像を基に、推定混合比を演算することができる。
【０４２２】
アンカバードバックグラウンド領域に対応するモデルによる混合比推定の処理は、アンカバードバックグラウンド領域のモデルに対応する正規方程式を利用した、図６７のフローチャートに示す処理と同様なので、その説明は省略する。
【０４２３】
なお、背景に対応するオブジェクトが静止しているとして説明したが、背景領域に対応する画像が動きを含んでいても上述した混合比を求める処理を適用することができる。例えば、背景領域に対応する画像が一様に動いているとき、推定混合比処理部４０１は、この動きに対応して画像全体をシフトさせ、背景に対応するオブジェクトが静止している場合と同様に処理する。また、背景領域に対応する画像が局所毎に異なる動きを含んでいるとき、推定混合比処理部４０１は、混合領域に属する画素に対応する画素として、動きに対応した画素を選択して、上述の処理を実行する。
【０４２４】
このように、混合比算出部１０２は、領域特定部１０１から供給された領域情報、および入力画像を基に、各画素に対応する特徴量である混合比αを算出することができる。
【０４２５】
混合比αを利用することにより、動いているオブジェクトに対応する画像に含まれる動きボケの情報を残したままで、画素値に含まれる前景の成分と背景の成分とを分離することが可能になる。
【０４２６】
また、混合比αに基づいて画像を合成すれば、実世界を実際に撮影し直したような動いているオブジェクトのスピードに合わせた正しい動きボケを含む画像を作ることが可能になる。
【０４２７】
次に、前景背景分離部１０５について説明する。図６８は、前景背景分離部１０５の構成の一例を示すブロック図である。前景背景分離部１０５に供給された入力画像は、分離部６０１、スイッチ６０２、およびスイッチ６０４に供給される。カバードバックグラウンド領域を示す情報、およびアンカバードバックグラウンド領域を示す、領域特定部１０３から供給された領域情報は、分離部６０１に供給される。前景領域を示す領域情報は、スイッチ６０２に供給される。背景領域を示す領域情報は、スイッチ６０４に供給される。
【０４２８】
混合比算出部１０４から供給された混合比αは、分離部６０１に供給される。
【０４２９】
分離部６０１は、カバードバックグラウンド領域を示す領域情報、アンカバードバックグラウンド領域を示す領域情報、および混合比αを基に、入力画像から前景の成分を分離して、分離した前景の成分を合成部６０３に供給するとともに、入力画像から背景の成分を分離して、分離した背景の成分を合成部６０５に供給する。
【０４３０】
スイッチ６０２は、前景領域を示す領域情報を基に、前景に対応する画素が入力されたとき、閉じられ、入力画像に含まれる前景に対応する画素のみを合成部６０３に供給する。
【０４３１】
スイッチ６０４は、背景領域を示す領域情報を基に、背景に対応する画素が入力されたとき、閉じられ、入力画像に含まれる背景に対応する画素のみを合成部６０５に供給する。
【０４３２】
合成部６０３は、分離部６０１から供給された前景に対応する成分、スイッチ６０２から供給された前景に対応する画素を基に、前景成分画像を合成し、合成した前景成分画像を出力する。前景領域と混合領域とは重複しないので、合成部６０３は、例えば、前景に対応する成分と、前景に対応する画素とに論理和の演算を適用して、前景成分画像を合成する。
【０４３３】
合成部６０３は、前景成分画像の合成の処理の最初に実行される初期化の処理において、内蔵しているフレームメモリに全ての画素値が０である画像を格納し、前景成分画像の合成の処理において、前景成分画像を格納（上書き）する。従って、合成部６０３が出力する前景成分画像の内、背景領域に対応する画素には、画素値として０が格納されている。
【０４３４】
合成部６０５は、分離部６０１から供給された背景に対応する成分、スイッチ６０４から供給された背景に対応する画素を基に、背景成分画像を合成して、合成した背景成分画像を出力する。背景領域と混合領域とは重複しないので、合成部６０５は、例えば、背景に対応する成分と、背景に対応する画素とに論理和の演算を適用して、背景成分画像を合成する。
【０４３５】
合成部６０５は、背景成分画像の合成の処理の最初に実行される初期化の処理において、内蔵しているフレームメモリに全ての画素値が０である画像を格納し、背景成分画像の合成の処理において、背景成分画像を格納（上書き）する。従って、合成部６０５が出力する背景成分画像の内、前景領域に対応する画素には、画素値として０が格納されている。
【０４３６】
図６９は、前景背景分離部１０５に入力される入力画像、並びに前景背景分離部１０５から出力される前景成分画像および背景成分画像を示す図である。
【０４３７】
図６９は、表示される画像の模式図であり、図７０は、図６９に対応する前景領域に属する画素、背景領域に属する画素、および混合領域に属する画素を含む１ラインの画素を時間方向に展開したモデル図を示す。
【０４３８】
図６９および図７０に示すように、前景背景分離部１０５から出力される背景成分画像は、背景領域に属する画素、および混合領域の画素に含まれる背景の成分から構成される。
【０４３９】
図６９および図７０に示すように、前景背景分離部１０５から出力される前景成分画像は、前景領域に属する画素、および混合領域の画素に含まれる前景の成分から構成される。
【０４４０】
混合領域の画素の画素値は、前景背景分離部１０５により、背景の成分と、前景の成分とに分離される。分離された背景の成分は、背景領域に属する画素と共に、背景成分画像を構成する。分離された前景の成分は、前景領域に属する画素と共に、前景成分画像を構成する。
【０４４１】
このように、前景成分画像は、背景領域に対応する画素の画素値が０とされ、前景領域に対応する画素および混合領域に対応する画素に意味のある画素値が設定される。同様に、背景成分画像は、前景領域に対応する画素の画素値が０とされ、背景領域に対応する画素および混合領域に対応する画素に意味のある画素値が設定される。
【０４４２】
次に、分離部６０１が実行する、混合領域に属する画素から前景の成分、および背景の成分を分離する処理について説明する。
【０４４３】
図７１は、図中の左から右に移動するオブジェクトに対応する前景を含む、２つのフレームの前景の成分および背景の成分を示す画像のモデルである。図７１に示す画像のモデルにおいて、前景の動き量vは４であり、仮想分割数は、４とされている。
【０４４４】
フレーム#nにおいて、最も左の画素、および左から１４番目乃至１８番目の画素は、背景の成分のみから成り、背景領域に属する。フレーム#nにおいて、左から２番目乃至４番目の画素は、背景の成分および前景の成分を含み、アンカバードバックグラウンド領域に属する。フレーム#nにおいて、左から１１番目乃至１３番目の画素は、背景の成分および前景の成分を含み、カバードバックグラウンド領域に属する。フレーム#nにおいて、左から５番目乃至１０番目の画素は、前景の成分のみから成り、前景領域に属する。
【０４４５】
フレーム#n+1において、左から１番目乃至５番目の画素、および左から１８番目の画素は、背景の成分のみから成り、背景領域に属する。フレーム#n+1において、左から６番目乃至８番目の画素は、背景の成分および前景の成分を含み、アンカバードバックグラウンド領域に属する。フレーム#n+1において、左から１５番目乃至１７番目の画素は、背景の成分および前景の成分を含み、カバードバックグラウンド領域に属する。フレーム#n+1において、左から９番目乃至１４番目の画素は、前景の成分のみから成り、前景領域に属する。
【０４４６】
図７２は、カバードバックグラウンド領域に属する画素から前景の成分を分離する処理を説明する図である。図７２において、α１乃至α１８は、フレーム#nにおける画素のぞれぞれに対応する混合比である。図７２において、左から１５番目乃至１７番目の画素は、カバードバックグラウンド領域に属する。
【０４４７】
フレーム#nの左から１５番目の画素の画素値C15は、式（６８）で表される。
【０４４８】
C15=B15/v+F09/v+F08/v+F07/v
=α15・B15+F09/v+F08/v+F07/v
=α15・P15+F09/v+F08/v+F07/v ・・・（６８）
【０４４９】
ここで、α15は、フレーム#nの左から１５番目の画素の混合比である。P15は、フレーム#n-1の左から１５番目の画素の画素値である。
【０４５０】
式（６８）を基に、フレーム#nの左から１５番目の画素の前景の成分の和f15は、式（６９）で表される。
【０４５１】
f15=F09/v+F08/v+F07/v
=C15-α15・P15 ・・・（６９）
【０４５２】
同様に、フレーム#nの左から１６番目の画素の前景の成分の和f16は、式（７０）で表され、フレーム#nの左から１７番目の画素の前景の成分の和f17は、式（７１）で表される。
【０４５３】
f16=C16-α16・P16 ・・・（７０）
f17=C17-α17・P17 ・・・（７１）
【０４５４】
このように、カバードバックグラウンド領域に属する画素の画素値Cに含まれる前景の成分fcは、式（７２）で計算される。
【０４５５】
fc=C-α・P ・・・（７２）
【０４５６】
Pは、１つ前のフレームの、対応する画素の画素値である。
【０４５７】
図７３は、アンカバードバックグラウンド領域に属する画素から前景の成分を分離する処理を説明する図である。図７３において、α１乃至α１８は、フレーム#nにおける画素のぞれぞれに対応する混合比である。図７３において、左から２番目乃至４番目の画素は、アンカバードバックグラウンド領域に属する。
【０４５８】
フレーム#nの左から２番目の画素の画素値C02は、式（７３）で表される。
【０４５９】
C02=B02/v+B02/v+B02/v+F01/v
=α2・B02+F01/v
=α2・N02+F01/v ・・・（７３）
【０４６０】
ここで、α2は、フレーム#nの左から２番目の画素の混合比である。N02は、フレーム#n+1の左から２番目の画素の画素値である。
【０４６１】
式（７３）を基に、フレーム#nの左から２番目の画素の前景の成分の和f02は、式（７４）で表される。
【０４６２】
f02=F01/v
=C02-α2・N02 ・・・（７４）
【０４６３】
同様に、フレーム#nの左から３番目の画素の前景の成分の和f03は、式（７５）で表され、フレーム#nの左から４番目の画素の前景の成分の和f04は、式（７６）で表される。
【０４６４】
f03=C03-α3・N03 ・・・（７５）
f04=C04-α4・N04 ・・・（７６）
【０４６５】
このように、アンカバードバックグラウンド領域に属する画素の画素値Cに含まれる前景の成分fuは、式（７７）で計算される。
【０４６６】
fu=C-α・N ・・・（７７）
【０４６７】
Nは、１つ後のフレームの、対応する画素の画素値である。
【０４６８】
このように、分離部６０１は、領域情報に含まれる、カバードバックグラウンド領域を示す情報、およびアンカバードバックグラウンド領域を示す情報、並びに画素毎の混合比αを基に、混合領域に属する画素から前景の成分、および背景の成分を分離することができる。
【０４６９】
図７４は、以上で説明した処理を実行する分離部６０１の構成の一例を示すブロック図である。分離部６０１に入力された画像は、フレームメモリ６２１に供給され、混合比算出部１０４から供給されたカバードバックグラウンド領域およびアンカバードバックグラウンド領域を示す領域情報、並びに混合比αは、分離処理ブロック６２２に入力される。
【０４７０】
フレームメモリ６２１は、入力された画像をフレーム単位で記憶する。フレームメモリ６２１は、処理の対象がフレーム#nであるとき、フレーム#nの１つ前のフレームであるフレーム#n-1、フレーム#n、およびフレーム#nの１つ後のフレームであるフレーム#n+1を記憶する。
【０４７１】
フレームメモリ６２１は、フレーム#n-1、フレーム#n、およびフレーム#n+1の対応する画素を分離処理ブロック６２２に供給する。
【０４７２】
分離処理ブロック６２２は、カバードバックグラウンド領域およびアンカバードバックグラウンド領域を示す領域情報、並びに混合比αを基に、フレームメモリ６２１から供給されたフレーム#n-1、フレーム#n、およびフレーム#n+1の対応する画素の画素値に図７２および図７３を参照して説明した演算を適用して、フレーム#nの混合領域に属する画素から前景の成分および背景の成分を分離して、フレームメモリ６２３に供給する。
【０４７３】
分離処理ブロック６２２は、アンカバード領域処理部６３１、カバード領域処理部６３２、合成部６３３、および合成部６３４で構成されている。
【０４７４】
アンカバード領域処理部６３１の乗算器６４１は、混合比αを、フレームメモリ６２１から供給されたフレーム#n+1の画素の画素値に乗じて、スイッチ６４２に出力する。スイッチ６４２は、フレームメモリ６２１から供給されたフレーム#nの画素（フレーム#n+1の画素に対応する）がアンカバードバックグラウンド領域であるとき、閉じられ、乗算器６４１から供給された混合比αを乗じた画素値を演算器６４３および合成部６３４に供給する。スイッチ６４２から出力されるフレーム#n+1の画素の画素値に混合比αを乗じた値は、フレーム#nの対応する画素の画素値の背景の成分に等しい。
【０４７５】
演算器６４３は、フレームメモリ６２１から供給されたフレーム#nの画素の画素値から、スイッチ６４２から供給された背景の成分を減じて、前景の成分を求める。演算器６４３は、アンカバードバックグラウンド領域に属する、フレーム#nの画素の前景の成分を合成部６３３に供給する。
【０４７６】
カバード領域処理部６３２の乗算器６５１は、混合比αを、フレームメモリ６２１から供給されたフレーム#n-1の画素の画素値に乗じて、スイッチ６５２に出力する。スイッチ６５２は、フレームメモリ６２１から供給されたフレーム#nの画素（フレーム#n-1の画素に対応する）がカバードバックグラウンド領域であるとき、閉じられ、乗算器６５１から供給された混合比αを乗じた画素値を演算器６５３および合成部６３４に供給する。スイッチ６５２から出力されるフレーム#n-1の画素の画素値に混合比αを乗じた値は、フレーム#nの対応する画素の画素値の背景の成分に等しい。
【０４７７】
演算器６５３は、フレームメモリ６２１から供給されたフレーム#nの画素の画素値から、スイッチ６５２から供給された背景の成分を減じて、前景の成分を求める。演算器６５３は、カバードバックグラウンド領域に属する、フレーム#nの画素の前景の成分を合成部６３３に供給する。
【０４７８】
合成部６３３は、フレーム#nの、演算器６４３から供給された、アンカバードバックグラウンド領域に属する画素の前景の成分、および演算器６５３から供給された、カバードバックグラウンド領域に属する画素の前景の成分を合成して、フレームメモリ６２３に供給する。
【０４７９】
合成部６３４は、フレーム#nの、スイッチ６４２から供給された、アンカバードバックグラウンド領域に属する画素の背景の成分、およびスイッチ６５２から供給された、カバードバックグラウンド領域に属する画素の背景の成分を合成して、フレームメモリ６２３に供給する。
【０４８０】
フレームメモリ６２３は、分離処理ブロック６２２から供給された、フレーム#nの混合領域の画素の前景の成分と、背景の成分とをそれぞれに記憶する。
【０４８１】
フレームメモリ６２３は、記憶しているフレーム#nの混合領域の画素の前景の成分、および記憶しているフレーム#nの混合領域の画素の背景の成分を出力する。
【０４８２】
特徴量である混合比αを利用することにより、画素値に含まれる前景の成分と背景の成分とを完全に分離することが可能になる。
【０４８３】
図７５は、図７１のフレーム#nに対応する、前景成分画像の例と、背景成分画像の例を示す図である。
【０４８４】
図７５Ａは、図７１のフレーム#nに対応する、前景成分画像の例を示す。最も左の画素、および左から１４番目の画素は、前景と背景が分離される前において、背景の成分のみから成っていたので、画素値が０とされる。
【０４８５】
左から２番目乃至４番目の画素は、前景と背景とが分離される前において、アンカバードバックグラウンド領域に属し、背景の成分が０とされ、前景の成分がそのまま残されている。左から１１番目乃至１３番目の画素は、前景と背景とが分離される前において、カバードバックグラウンド領域に属し、背景の成分が０とされ、前景の成分がそのまま残されている。左から５番目乃至１０番目の画素は、前景の成分のみから成るので、そのまま残される。
【０４８６】
図７５Ｂは、図７１のフレーム#nに対応する、背景成分画像の例を示す。最も左の画素、および左から１４番目の画素は、前景と背景とが分離される前において、背景の成分のみから成っていたので、そのまま残される。
【０４８７】
左から２番目乃至４番目の画素は、前景と背景とが分離される前において、アンカバードバックグラウンド領域に属し、前景の成分が０とされ、背景の成分がそのまま残されている。左から１１番目乃至１３番目の画素は、前景と背景とが分離される前において、カバードバックグラウンド領域に属し、前景の成分が０とされ、背景の成分がそのまま残されている。左から５番目乃至１０番目の画素は、前景と背景とが分離される前において、前景の成分のみから成っていたので、画素値が０とされる。
【０４８８】
次に、図７６に示すフローチャートを参照して、前景背景分離部１０５による前景と背景との分離の処理を説明する。ステップＳ６０１において、分離部６０１のフレームメモリ６２１は、入力画像を取得し、前景と背景との分離の対象となるフレーム#nを、その前のフレーム#n-1およびその後のフレーム#n+1と共に記憶する。
【０４８９】
ステップＳ６０２において、分離部６０１の分離処理ブロック６２２は、混合比算出部１０４から供給された領域情報を取得する。ステップＳ６０３において、分離部６０１の分離処理ブロック６２２は、混合比算出部１０４から供給された混合比αを取得する。
【０４９０】
ステップＳ６０４において、アンカバード領域処理部６３１は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、アンカバードバックグラウンド領域に属する画素の画素値から、背景の成分を抽出する。
【０４９１】
ステップＳ６０５において、アンカバード領域処理部６３１は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、アンカバードバックグラウンド領域に属する画素の画素値から、前景の成分を抽出する。
【０４９２】
ステップＳ６０６において、カバード領域処理部６３２は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、カバードバックグラウンド領域に属する画素の画素値から、背景の成分を抽出する。
【０４９３】
ステップＳ６０７において、カバード領域処理部６３２は、領域情報および混合比αを基に、フレームメモリ６２１から供給された、カバードバックグラウンド領域に属する画素の画素値から、前景の成分を抽出する。
【０４９４】
ステップＳ６０８において、合成部６３３は、ステップＳ６０５の処理で抽出されたアンカバードバックグラウンド領域に属する画素の前景の成分と、ステップＳ６０７の処理で抽出されたカバードバックグラウンド領域に属する画素の前景の成分とを合成する。合成された前景の成分は、合成部６０３に供給される。更に、合成部６０３は、スイッチ６０２を介して供給された前景領域に属する画素と、分離部６０１から供給された前景の成分とを合成して、前景成分画像を生成する。
【０４９５】
ステップＳ６０９において、合成部６３４は、ステップＳ６０４の処理で抽出されたアンカバードバックグラウンド領域に属する画素の背景の成分と、ステップＳ６０６の処理で抽出されたカバードバックグラウンド領域に属する画素の背景の成分とを合成する。合成された背景の成分は、合成部６０５に供給される。更に、合成部６０５は、スイッチ６０４を介して供給された背景領域に属する画素と、分離部６０１から供給された背景の成分とを合成して、背景成分画像を生成する。
【０４９６】
ステップＳ６１０において、合成部６０３は、前景成分画像を出力する。ステップＳ６１１において、合成部６０５は、背景成分画像を出力し、処理は終了する。
【０４９７】
このように、前景背景分離部１０５は、領域情報および混合比αを基に、入力画像から前景の成分と、背景の成分とを分離し、前景の成分のみから成る前景成分画像、および背景の成分のみから成る背景成分画像を出力することができる。
【０４９８】
次に、前景成分画像の動きボケの除去について説明する。
【０４９９】
図７７は、動きボケ除去部１０６の構成の一例を示すブロック図である。動き検出部１０２から供給された動きベクトルとその位置情報は、処理単位決定部８０１、モデル化部８０２、および演算部８０５に供給される。領域特定部１０３から供給された領域情報は、処理単位決定部８０１に供給される。前景背景分離部１０５から供給された前景成分画像は、足し込み部８０４に供給される。
【０５００】
処理単位決定部８０１は、動きベクトルとその位置情報、および領域情報を基に、処理単位を生成し、生成した処理単位をモデル化部８０２および足し込み部８０４に供給する。
【０５０１】
処理単位決定部８０１が生成する処理単位は、図７８に例を示すように、前景成分画像のカバードバックグラウンド領域に対応する画素から始まり、アンカバードバックグラウンド領域に対応する画素までの動き方向に並ぶ連続する画素、またはアンカバードバックグラウンド領域に対応する画素から始まり、カバードバックグラウンド領域に対応する画素までの動き方向に並ぶ連続する画素を示す。処理単位は、例えば、左上点（処理単位で指定される画素であって、画像上で最も左または最も上に位置する画素の位置）および右下点の２つのデータから成る。
【０５０２】
モデル化部８０２は、動きベクトルおよび入力された処理単位を基に、モデル化を実行する。より具体的には、例えば、モデル化部８０２は、処理単位に含まれる画素の数、画素値の時間方向の仮想分割数、および画素毎の前景の成分の数に対応する複数のモデルを予め記憶しておき、処理単位、および画素値の時間方向の仮想分割数を基に、図７９に示すような、画素値と前景の成分との対応を指定するモデルを選択する。
【０５０３】
例えば、処理単位に対応する画素の数が１２でありシャッタ時間内の動き量vが５であるときにおいては、モデル化部８０２は、仮想分割数を５とし、最も左に位置する画素が１つの前景の成分を含み、左から２番目の画素が２つの前景の成分を含み、左から３番目の画素が３つの前景の成分を含み、左から４番目の画素が４つの前景の成分を含み、左から５番目の画素が５つの前景の成分を含み、左から６番目の画素が５つの前景の成分を含み、左から７番目の画素が５つの前景の成分を含み、左から８番目の画素が５つの前景の成分を含み、左から９番目の画素が４つの前景の成分を含み、左から１０番目の画素が３つの前景の成分を含み、左から１１番目の画素が２つの前景の成分を含み、左から１２番目の画素が１つの前景の成分を含み、全体として８つの前景の成分から成るモデルを選択する。
【０５０４】
なお、モデル化部８０２は、予め記憶してあるモデルから選択するのではなく、動きベクトル、および処理単位が供給されたとき、動きベクトル、および処理単位を基に、モデルを生成するようにしてもよい。
【０５０５】
モデル化部８０２は、選択したモデルを方程式生成部８０３に供給する。
【０５０６】
方程式生成部８０３は、モデル化部８０２から供給されたモデルを基に、方程式を生成する。図７９に示す前景成分画像のモデルを参照して、前景の成分の数が８であり、処理単位に対応する画素の数が１２であり、動き量vが５であり、仮想分割数が５であるときの、方程式生成部８０３が生成する方程式について説明する。
【０５０７】
前景成分画像に含まれるシャッタ時間/vに対応する前景成分がF01/v乃至F08/vであるとき、F01/v乃至F08/vと画素値C01乃至C12との関係は、式（７８）乃至式（８９）で表される。
【０５０８】
C01=F01/v ・・・（７８）
C02=F02/v+F01/v ・・・（７９）
C03=F03/v+F02/v+F01/v ・・・（８０）
C04=F04/v+F03/v+F02/v+F01/v ・・・（８１）
C05=F05/v+F04/v+F03/v+F02/v+F01/v ・・・（８２）
C06=F06/v+F05/v+F04/v+F03/v+F02/v ・・・（８３）
C07=F07/v+F06/v+F05/v+F04/v+F03/v ・・・（８４）
C08=F08/v+F07/v+F06/v+F05/v+F04/v ・・・（８５）
C09=F08/v+F07/v+F06/v+F05/v ・・・（８６）
C10=F08/v+F07/v+F06/v ・・・（８７）
C11=F08/v+F07/v ・・・（８８）
C12=F08/v ・・・（８９）
【０５０９】
方程式生成部８０３は、生成した方程式を変形して方程式を生成する。方程式生成部８０３が生成する方程式を、式（９０）乃至式（１０１）に示す。
【０５１０】
C01=1・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v
+0・F06/v+0・F07/v+0・F08/v ・・・（９０）
C02=1・F01/v+1・F02/v+0・F03/v+0・F04/v+0・F05/v
+0・F06/v+0・F07/v+0・F08/v ・・・（９１）
C03=1・F01/v+1・F02/v+1・F03/v+0・F04/v+0・F05/v
+0・F06/v+0・F07/v+0・F08/v ・・・（９２）
C04=1・F01/v+1・F02/v+1・F03/v+1・F04/v+0・F05/v
+0・F06/v+0・F07/v+0・F08/v ・・・（９３）
C05=1・F01/v+1・F02/v+1・F03/v+1・F04/v+1・F05/v
+0・F06/v+0・F07/v+0・F08/v ・・・（９４）
C06=0・F01/v+1・F02/v+1・F03/v+1・F04/v+1・F05/v
+1・F06/v+0・F07/v+0・F08/v ・・・（９５）
C07=0・F01/v+0・F02/v+1・F03/v+1・F04/v+1・F05/v
+1・F06/v+1・F07/v+0・F08/v ・・・（９６）
C08=0・F01/v+0・F02/v+0・F03/v+1・F04/v+1・F05/v
+1・F06/v+1・F07/v+1・F08/v ・・・（９７）
C09=0・F01/v+0・F02/v+0・F03/v+0・F04/v+1・F05/v
+1・F06/v+1・F07/v+1・F08/v ・・・（９８）
C10=0・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v
+1・F06/v+1・F07/v+1・F08/v ・・・（９９）
C11=0・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v
+0・F06/v+1・F07/v+1・F08/v ・・・（１００）
C12=0・F01/v+0・F02/v+0・F03/v+0・F04/v+0・F05/v
+0・F06/v+0・F07/v+1・F08/v ・・・（１０１）
【０５１１】
式（９０）乃至式（１０１）は、式（１０２）として表すこともできる。
【０５１２】
【数１７】

・・・（１０２）
【０５１３】
式（１０２）において、jは、画素の位置を示す。この例において、jは、１乃至１２のいずれか１つの値を有する。また、iは、前景値の位置を示す。この例において、iは、１乃至８のいずれか１つの値を有する。aijは、iおよびjの値に対応して、０または１の値を有する。
【０５１４】
誤差を考慮して表現すると、式（１０２）は、式（１０３）のように表すことができる。
【０５１５】
【数１８】

・・・（１０３）
【０５１６】
式（１０３）において、ejは、注目画素Cjに含まれる誤差である。
【０５１７】
式（１０３）は、式（１０４）に書き換えることができる。
【０５１８】
【数１９】

・・・（１０４）
【０５１９】
ここで、最小自乗法を適用するため、誤差の自乗和Eを式（１０５）に示すように定義する。
【０５２０】
【数２０】

・・・（１０５）
【０５２１】
誤差が最小になるためには、誤差の自乗和Eに対する、変数Fkによる偏微分の値が０になればよい。式（１０６）を満たすようにFkを求める。
【０５２２】
【数２１】

・・・（１０６）
【０５２３】
式（１０６）において、動き量vは固定値であるから、式（１０７）を導くことができる。
【０５２４】
【数２２】

・・・（１０７）
【０５２５】
式（１０７）を展開して、移項すると、式（１０８）を得る。
【０５２６】
【数２３】

・・・（１０８）
【０５２７】
式（１０８）のkに１乃至８の整数のいずれか１つを代入して得られる８つの式に展開する。得られた８つの式を、行列により１つの式により表すことができる。この式を正規方程式と呼ぶ。
【０５２８】
このような最小自乗法に基づく、方程式生成部８０３が生成する正規方程式の例を式（１０９）に示す。
【０５２９】
【数２４】

・・・（１０９）
【０５３０】
式（１０９）をA・F=v・Cと表すと、C,A,vが既知であり、Fは未知である。また、A,vは、モデル化の時点で既知だが、Cは、足し込み動作において画素値を入力することで既知となる。
【０５３１】
最小自乗法に基づく正規方程式により前景成分を算出することにより、画素Cに含まれている誤差を分散させることができる。
【０５３２】
方程式生成部８０３は、このように生成された正規方程式を足し込み部８０４に供給する。
【０５３３】
足し込み部８０４は、処理単位決定部８０１から供給された処理単位を基に、前景成分画像に含まれる画素値Cを、方程式生成部８０３から供給された行列の式に設定する。足し込み部８０４は、画素値Cを設定した行列を演算部８０５に供給する。
【０５３４】
演算部８０５は、掃き出し法（Gauss-Jordanの消去法）などの解法に基づく処理により、動きボケが除去された前景成分Fi/vを算出して、動きボケが除去された前景の画素値である、０乃至８の整数のいずれかのiに対応するFiを算出して、図８０に例を示す、動きボケが除去された画素値であるFiから成る、動きボケが除去された前景成分画像を出力する。
【０５３５】
なお、図８０に示す動きボケが除去された前景成分画像において、C03乃至C10のそれぞれにF01乃至F08のそれぞれが設定されているのは、画面に対する前景成分画像の位置を変化させないためであり、任意の位置に対応させることができる。
【０５３６】
このように、動きボケ除去部１０６は、選択信号および動きボケ調整量v'を基に、動きボケの量を除去することができる。
【０５３７】
次に、図８１のフローチャートを参照して、動きボケ除去部１０６による前景成分画像に含まれる動きボケの量の調整の処理を説明する。
【０５３８】
ステップＳ８０１において、動きボケ除去部１０６の処理単位決定部８０１は、動きベクトルおよび領域情報を基に、処理単位を生成し、生成した処理単位をモデル化部８０２に供給する。
【０５３９】
ステップＳ８０２において、動きボケ除去部１０６のモデル化部８０２は、動き量vおよび処理単位に対応して、モデルの選択や生成を行う。ステップＳ８０３において、方程式生成部８０３は、選択されたモデルを基に、正規方程式を作成する。
【０５４０】
ステップＳ８０４において、足し込み部８０４は、作成された正規方程式に前景成分画像の画素値を設定する。ステップＳ８０５において、足し込み部８０４は、処理単位に対応する全ての画素の画素値の設定を行ったか否かを判定し、処理単位に対応する全ての画素の画素値の設定を行っていないと判定された場合、ステップＳ８０４に戻り、正規方程式への画素値の設定の処理を繰り返す。
【０５４１】
ステップＳ８０５において、処理単位の全ての画素の画素値の設定を行ったと判定された場合、ステップＳ８０６に進み、演算部８０５は、足し込み部８０４から供給された画素値が設定された正規方程式を基に、動きボケの量を除去した前景の画素値を算出して、処理は終了する。
【０５４２】
このように、動きボケ除去部１０６は、動きベクトルおよび領域情報を基に、動きボケを含む前景画像から動きボケの量を除去することができる。
【０５４３】
すなわち、サンプルデータである画素値に含まれる動きボケの量を調整することができる。
【０５４４】
図８２は、動きボケ除去部１０６の構成の他の一例を示すブロック図である。動き検出部１０２から供給された動きベクトルとその位置情報は、処理単位決定部９０１および補正部９０５に供給され、領域特定部１０３から供給された領域情報は、処理単位決定部９０１に供給される。前景背景分離部１０５から供給された前景成分画像は、演算部９０４に供給される。
【０５４５】
処理単位決定部９０１は、動きベクトルとその位置情報、および領域情報を基に、処理単位を生成し、動きベクトルと共に、生成した処理単位をモデル化部９０２に供給する。
【０５４６】
モデル化部９０２は、動きベクトルおよび入力された処理単位を基に、モデル化を実行する。より具体的には、例えば、モデル化部９０２は、処理単位に含まれる画素の数、画素値の時間方向の仮想分割数、および画素毎の前景の成分の数に対応する複数のモデルを予め記憶しておき、処理単位、および画素値の時間方向の仮想分割数を基に、図８３に示すような、画素値と前景の成分との対応を指定するモデルを選択する。
【０５４７】
例えば、処理単位に対応する画素の数が１２であり動き量vが５であるときにおいては、モデル化部９０２は、仮想分割数を５とし、最も左に位置する画素が１つの前景の成分を含み、左から２番目の画素が２つの前景の成分を含み、左から３番目の画素が３つの前景の成分を含み、左から４番目の画素が４つの前景の成分を含み、左から５番目の画素が５つの前景の成分を含み、左から６番目の画素が５つの前景の成分を含み、左から７番目の画素が５つの前景の成分を含み、左から８番目の画素が５つの前景の成分を含み、左から９番目の画素が４つの前景の成分を含み、左から１０番目の画素が３つの前景の成分を含み、左から１１番目の画素が２つの前景の成分を含み、左から１２番目の画素が１つの前景の成分を含み、全体として８つの前景の成分から成るモデルを選択する。
【０５４８】
なお、モデル化部９０２は、予め記憶してあるモデルから選択するのではなく、動きベクトル、および処理単位が供給されたとき、動きベクトル、および処理単位を基に、モデルを生成するようにしてもよい。
【０５４９】
方程式生成部９０３は、モデル化部９０２から供給されたモデルを基に、方程式を生成する。
【０５５０】
図８３乃至図８５に示す前景成分画像のモデルを参照して、前景の成分の数が８であり、処理単位に対応する画素の数が１２であり、動き量vが５であるときの、方程式生成部９０３が生成する方程式の例について説明する。
【０５５１】
前景成分画像に含まれるシャッタ時間/vに対応する前景成分がF01/v乃至F08/vであるとき、F01/v乃至F08/vと画素値C01乃至C12との関係は、上述したように、式（７８）乃至式（８９）で表される。
【０５５２】
画素値C12およびC11に注目すると、画素値C12は、式（１１０）に示すように、前景の成分F08/vのみを含み、画素値C11は、前景の成分F08/vおよび前景の成分F07/vの積和から成る。従って、前景の成分F07/vは、式（１１１）で求めることができる。
【０５５３】
F08/v=C12 ・・・（１１０）
F07/v=C11-C12 ・・・（１１１）
【０５５４】
同様に、画素値C10乃至C01に含まれる前景の成分を考慮すると、前景の成分F06/v乃至F01/vは、式（１１２）乃至式（１１７）により求めることができる。
【０５５５】
F06/v=C10-C11 ・・・（１１２）
F05/v=C09-C10 ・・・（１１３）
F04/v=C08-C09 ・・・（１１４）
F03/v=C07-C08+C12 ・・・（１１５）
F02/v=C06-C07+C11-C12 ・・・（１１６）
F01/v=C05-C06+C10-C11 ・・・（１１７）
【０５５６】
方程式生成部９０３は、式（１１０）乃至式（１１７）に例を示す、画素値の差により前景の成分を算出するための方程式を生成する。方程式生成部９０３は、生成した方程式を演算部９０４に供給する。
【０５５７】
演算部９０４は、方程式生成部９０３から供給された方程式に前景成分画像の画素値を設定して、画素値を設定した方程式を基に、前景の成分を算出する。演算部９０４は、例えば、式（１１０）乃至式（１１７）が方程式生成部９０３から供給されたとき、式（１１０）乃至式（１１７）に画素値C05乃至C12を設定する。
【０５５８】
演算部９０４は、画素値が設定された式に基づき、前景の成分を算出する。例えば、演算部９０４は、画素値C05乃至C12が設定された式（１１０）乃至式（１１７）に基づく演算により、図８４に示すように、前景の成分F01/v乃至F08/vを算出する。演算部９０４は、前景の成分F01/v乃至F08/vを補正部９０５に供給する。
【０５５９】
補正部９０５は、演算部９０４から供給された前景の成分に、処理単位決定部９０１から供給された動きベクトルに含まれる動き量vを乗じて、動きボケを除去した前景の画素値を算出する。例えば、補正部９０５は、演算部９０４から供給された前景の成分F01/v乃至F08/vが供給されたとき、前景の成分F01/v乃至F08/vのそれぞれに、５である動き量vを乗じることにより、図８５に示すように、動きボケを除去した前景の画素値F01乃至F08を算出する。
【０５６０】
補正部９０５は、以上のように算出された、動きボケを除去した前景の画素値から成る前景成分画像を出力する。
【０５６１】
このように、動きボケ除去部１０６は、選択信号および動きボケ調整量v'を基に、動きボケの量を除去することができる。
【０５６２】
次に、図８２に構成を示す動きボケ除去部１０６による前景の動きボケの除去の処理を図８６のフローチャートを参照して説明する。
【０５６３】
ステップＳ９０１において、動きボケ除去部１０６の処理単位決定部９０１は、動きベクトルおよび領域情報を基に、処理単位を生成し、生成した処理単位をモデル化部９０２および補正部９０５に供給する。
【０５６４】
ステップＳ９０２において、動きボケ除去部１０６のモデル化部９０２は、動き量vおよび処理単位に対応して、モデルの選択や生成を行う。ステップＳ９０３において、方程式生成部９０３は、選択または生成されたモデルを基に、前景成分画像の画素値の差により前景の成分を算出するための方程式を生成する。
【０５６５】
ステップＳ９０４において、演算部９０４は、作成された方程式に前景成分画像の画素値を設定し、画素値が設定された方程式を基に、画素値の差分から前景の成分を抽出する。ステップＳ９０５において、演算部９０４は、処理単位に対応する全ての前景の成分を抽出したか否かを判定し、処理単位に対応する全ての前景の成分を抽出していないと判定された場合、ステップＳ９０４に戻り、前景の成分を抽出の処理を繰り返す。
【０５６６】
ステップＳ９０５において、処理単位に対応する全ての前景の成分を抽出したと判定された場合、ステップＳ９０６に進み、補正部９０５は、動き量vを基に、演算部９０４から供給された前景の成分F01/v乃至F08/vのそれぞれを補正して、動きボケを除去した前景の画素値F01乃至F08を算出する。
【０５６７】
ステップＳ９０７において、補正部９０５は、動きボケを除去した前景の画素値を算出して、出力し、処理は終了する。
【０５６８】
このように、図８２に構成を示す動きボケ除去部１０６は、より簡単な演算で、より迅速に、動きボケを含む前景画像から動きボケを除去することができる。
【０５６９】
ウィナー・フィルタなど従来の動きボケを部分的に除去する手法が、理想状態では効果が認められるが、量子化され、ノイズを含んだ実際の画像に対して十分な効果が得られないのに対し、図８２に構成を示す動きボケ除去部１０６においては、量子化され、ノイズを含んだ実際の画像に対しても十分な効果が認められ、精度の良い動きボケの除去が可能となる。
【０５７０】
次に、図８７を参照して、合成部１０７の構成について説明する。
【０５７１】
輪郭抽出部１００１は、動きボケ除去部１０６より入力されてくる動きボケが除去された前景成分画像の輪郭となる画素の情報を抽出し輪郭平滑化部１００２に出力する。
【０５７２】
輪郭平滑化部１００２は、輪郭抽出部１００１より供給されてくる動きボケが除去された前景成分画像の輪郭部の画素（輪郭を形成する画素）の情報に基づいて、輪郭部の画素の画素値を平滑化して空間混合比生成部１００３に供給する。
【０５７３】
すなわち、動きボケが除去された前景成分画像は、混合比が１の画像である。この混合比が１である前景成分画像の輪郭部分は、図８８の実線（太線）で示されるような、画素単位の階段状の形状となる。
【０５７４】
そこで、輪郭平滑化部１００２は、この階段状の輪郭部分の各画素について、端部となる１辺の中央部の各点（例えば、図中の点Ｂ１乃至Ｂ６）毎に、その点を通る直線を回帰法により求めて、輪郭部分を直線近似することにより、例えば、図８８の点線で示されるような線分により、輪郭を平滑化する。
【０５７５】
すなわち、図８８においては、前景成分画像の輪郭を形成する各画素の（輪郭の）端部の１辺の中央の各点Ｂ１乃至Ｂ６のそれぞれを通る直線を設定し、その直線の傾きを回帰法（最小自乗法）により求め、前景成分画像の輪郭を平滑化する。図８８においては、各点Ｂ１乃至Ｂ６に対応する線分OP，PQ，QR，RS，ST，TUが点線で示されている。結果として、前景成分画像の輪郭は、階段状の断面から、滑らかな断面（９０度以上の鈍角からなる面）へと変化している。
【０５７６】
尚、輪郭平滑化部１００２による平滑化は、上述したように回帰法により求められる直線を用いて求めるのみならず、例えば、スプライン近似や多項式近似により輪郭を平滑化するようにしてもよい。
【０５７７】
空間混合比生成部１００３は、輪郭平滑化部１００２より供給された、平滑化されている前景成分画像の輪郭に基づいて、輪郭部周辺の画素の空間混合比を算出し、前景成分画像スムージング部１００４、および、領域情報調整部１００６に出力する。すなわち、空間混合比生成部１００３は、図８９で示されるように、点線で示される平滑化された前景成分画像の輪郭形状から、対応する画素の空間的な混合比を求める。空間混合比とは、画素内で生じる混合比のことであり、これまで扱ってきた混合比は、これに対して、時間的に生じる混合であるので以降においては、時間混合比とも称する。
【０５７８】
例えば、図８９で示されるように、太線の実線が元々の輪郭であって、細線で囲まれる正方形が輪郭付近の画素であるものとすると、元々の輪郭の図中左部分が前景成分画像であるものとすれば、従来の前景成分画像は、空間的には、図中の太線で示された輪郭の周辺の画素の空間混合比は前景成分画像の輪郭となる画素が１、今の場合、その前景成分画像の輪郭となる画素の右隣に存在する画素の空間混合比は０となっているはずである。尚、図８９においては、前景成分画像の領域については、斜線部で示されており、それ以外の部分、すなわち、背景成分画像の領域は、無地で示されている。
【０５７９】
空間混合比生成部１００３は、この空間混合比を平滑化された輪郭形状に合わせて求める。すなわち、元々の前景成分画像の輪郭は、図８９で示される太線であり、太線の図中左側の斜線で示される領域である。そして、その斜線部の領域の混合比が１であり、太線より右側の領域が前景成分画像ではない領域（背景成分画像が付加される背景領域）、すなわち、画像を合成するときの混合比が０となる領域である。
【０５８０】
しかしながら、前景成分画像の領域は、平滑化されることにより、図８９（図８８も同じ）で示される点線のように平滑化されている。そこで、空間混合比生成部１００３は、この平滑化された輪郭の情報に基づいて、輪郭を形成する線が跨いでいる領域に存在する各画素について、前景成分画像の占める各画素の面積の割合から空間混合比を求める。
【０５８１】
すなわち、各画素の面積が１であるとすると、図９０で示されるように、図中最上部左側の画素（輪郭となる端部の１辺の中央の点Ｂ１を有する画素）は、点線により示される輪郭により、斜線部で示される前傾成分画像となる領域の面積が、略０．７となるので、空間混合比は、０．７となる。同様に、図中の最上部右側の画素（輪郭となる端部の１辺の中央の点Ｂ１を共有する画素）は、点線により示される輪郭により、斜線部で示される前傾成分画像となる領域の面積が、略０．３となるので空間混合比は、０．３となる。同様にして、図中上から２段目の点Ｂ２を共有する２画素の空間混合比は、それぞれの斜線部の面積に応じて、左から０．９と０．１であり、図中上から３段目の点Ｂ３を共有する２画素の空間混合比は、左から０．８と０．２であり、図中上から４段目の点Ｂ４を共有する２画素の空間混合比は、左から０．９と０．１であり、図中上から５段目の点Ｂ５を共有する２画素の空間混合比は、左から０．７と０．３であり、さらに、図中上から６段目の点Ｂ６を共有する２画素、および、さらに左側の画素からなる3画素の空間混合比は、左から０．９５、０．６５、および０．３である。
【０５８２】
前景成分画像スムージング部１００４は、空間混合比生成部１００３より供給された空間混合比に基づいて、前景成分画像をスムージングして動きボケ付加部１００５に出力する。すなわち、前景成分画像スムージング部１００４は、平滑化された前景成分画像の輪郭に対応した空間混合比に基づいて前景成分画像の輪郭部をスムージングする。より詳細には、前景成分画像スムージング部１００４は、入力された前景成分画像の各画素に対して、対応する空間混合比を乗じて、画素値を調整し、全体として前景成分画像の輪郭をスムージングする。
【０５８３】
このように、平滑化された輪郭を用いることにより各画素の混合比は、これまで１および０の２値による表現から平滑化された輪郭に合わせて、空間混合比が考慮された値となり、前景成分画像の占める面積の割合に応じた値が設定される。結果として、前景成分画像が、この空間混合比に応じて、スムージングされることにより、これまで画素単位でしか表現することのできなかった前景成分画像の輪郭を画素以下のサイズまで考慮した処理を実行することが可能となる。
【０５８４】
尚、前景成分画像の輪郭がスムージングされることにより、前景成分画像の領域（前景成分画像の空間混合比が０以外となる画素の領域）は、水平方向、または、垂直方向に数画素分増えることになる。また、同様にして、背景成分画像を含む画素の領域も増えるようにしてもよい。
【０５８５】
動きボケ付加部１００５は、動き検出部１０２より供給される動きベクトルの情報に基づいて、前景成分画像スムージング部１００４により、その輪郭がスムージングされた前景成分画像に動きボケを付加し、時間混合比算出部１００７、混合領域画像合成部１００９、および、画像合成部１０１０に出力する。
【０５８６】
すなわち、動きボケ付加部１００５は、動き量vとは異なる値の動きボケ調整量v'、例えば、動き量vの半分の値の動きボケ調整量v'や、動き量vと無関係の値の動きボケ調整量v'を与えることで、動きボケの量を調整することができる。例えば、図９１に示すように、動きボケ付加部１００５は、動きボケが除去された前景の画素値Fiを動きボケ調整量v'で除すことにより、前景成分Fi/v'を算出して、前景成分Fi/v'の和を算出して、動きボケの量が調整された画素値を生成する。例えば、動きボケ調整量v'が３のとき、画素値C02は、（F01）/v'とされ、画素値C03は、（F01+F02）/v'とされ、画素値C04は、（F01+F02+F03）/v'とされ、画素値C05は、（F02+F03+F04）/v'とされる。
【０５８７】
また、例えば、図９２に示すように、処理単位に対応する画素の数が８であり、動き量vが４であるとき、動きボケ付加部１００５は、式（１１８）に示す行列の式を生成する。
【０５８８】
【数２５】

・・・（１１８）
【０５８９】
動きボケ付加部１００５は、このように処理単位の長さに対応した数の式を立てて、動きボケの量が調整された画素値であるFiを算出する。同様に、例えば、処理単位に含まれる画素の数が１００あるとき、１００個の画素に対応する式を生成して、Fiを算出する。
【０５９０】
領域情報調整部１００６は、領域特定部１０３より供給された領域情報を、空間混合比生成部１００３より供給される空間混合比の情報に基づいて調整し、調整した領域情報を時間混合比算出部１００７、および、画像合成部１０１０に出力する。すなわち、上述したように前景成分画像の輪郭をスムージングすることにより新たに生成された空間混合比により前景成分画像の領域は、広がることになるため、領域情報調整部１００６は、前景成分画像の輪郭部分の領域が対応して広がるように領域情報を調整する。
【０５９１】
時間混合比算出部１００７は、図６の混合比算出部１０４と同様のものであり、動きボケが付加された前景成分画像と、領域情報調整部１００６より供給される領域情報に基づいて、スムージングされた前景成分画像に対応した時間混合比の情報を背景成分生成部１００８に供給する。
【０５９２】
背景成分生成部１００８は、時間混合比および任意の背景画像を基に、背景成分画像を生成して、混合領域画像合成部１００９に供給する。
【０５９３】
混合領域画像合成部１００９は、背景成分生成部１００７から供給された背景成分画像と前景成分画像との混合領域の画像を合成することにより、混合領域合成画像を生成して、生成した混合領域合成画像を画像合成部１０１０に供給する。
【０５９４】
画像合成部１０１０は、領域情報調整部１００６より供給される領域情報を基に、動きボケ付加部１００５により動きボケが付加された前景成分画像、混合領域画像合成部１００９から供給された混合領域合成画像、および任意の背景画像を合成して、合成画像を生成して出力する。
【０５９５】
次に、図９３のフローチャートを参照して、図２３のステップＳ１６の処理に対応する、図８７の合成部１０７による画像の合成の処理について説明する。
【０５９６】
ステップＳ９２１において、輪郭抽出部１００１は、動きボケ除去部１０６より入力されてくる動きボケが除去された前景成分画像の輪郭となる画素の情報を抽出し、輪郭平滑化部１００２に出力する。すなわち、輪郭抽出部１００１は、例えば、図８９で示される点Ｂ１乃至Ｂ６を有する画素、および、その近傍の画素の情報を抽出し、輪郭平滑化部１００２に出力する。
【０５９７】
ステップＳ９２２において、輪郭平滑下部１００２は、輪郭抽出部１００１より供給されてくる動きボケが除去された前景成分画像の輪郭部分の画素の情報に基づいて、輪郭部を平滑化して空間混合比生成部１００３に供給する。すなわち、輪郭部分の画素の情報に基づいて、輪郭平滑下部１００２は、この階段状の輪郭部分の各画素について、端部となる１辺の中央部の各点（例えば、図中の点Ｂ１乃至Ｂ６）毎に回帰直線を求めて、輪郭部分を直線近似することにより、例えば、図８８の点線で示されるような線分により、輪郭を平滑化する。すなわち、図８８においては、前景成分画像の輪郭を形成する各画素の（輪郭の）端部の１辺の中央の各点Ｂ１乃至Ｂ６のそれぞれを通る直線を設定し、その直線の傾きを回帰法（最小自乗法）により求め、各点Ｂ１乃至Ｂ６に対応する線分OP，PQ，QR，RS，ST，TUを点線で示している。
【０５９８】
より詳細には、輪郭平滑下部１００２は、例えば、各点Ｂ１乃至Ｂ６の各点に対応する直線を以下の式（１１９）で示されるような関数として定義し、その傾きを回帰法（最小自乗法）により求める。
【０５９９】
y=Ax
・・・（１１９）
【０６００】
尚、ここで、(x,y)は、各点Ｂ１乃至Ｂ６の座標であり、Aは、その傾きである。また、上述の式（１１９）の原点は、求めようとする直線が通る上述の点Ｂ１乃至Ｂ６の位置であるものとする。従って、例えば、求めようとする直線が点Ｂ１を通る線分OPに対応するものである場合、原点は点Ｂ１の位置であり、同様にして、直線が点Ｂ２を通る線分PQに対応するものである場合、原点は点Ｂ２であるものとする。従って、点Ｂ１乃至Ｂ６の座標は、求める直線の通る点の位置により変化する。
【０６０１】
最小自乗法は、以下の式（１２０）により定義される、設定された傾きAにより得られるy'と、実際のyとの差分自乗和Qが最小となる傾きAを求める方法である。
【０６０２】
【数２６】

・・・（１２０）
【０６０３】
式（１２０）で示される差分自乗和Qは、２次関数であるので、変数A（傾きA）について下に凸の曲線となるため、傾きAが最小となるAminが最小自乗法の解となる。
【０６０４】
式（１２０）で示される差分自乗和Qは、変数Aで微分されると、以下に示す式（１２１）で示される（Roud dQ）／（Roud dA）となる。
【０６０５】
【数２７】

・・・（１２１）
【０６０６】
式（１２１）が、０となるAが差分自乗和Qの最小値をとるAminとなるので、式（１２１）が０となるときの式を展開することにより、以下の式（１２２）で傾きAが求められることになる。
【０６０７】
【数２８】

・・・（１２２）
【０６０８】
上述の式（１２２）は、いわゆる、１変数（傾きA）の正規方程式となる。
【０６０９】
このようにして、得られた傾きAにより各点の線分となる直線（図８９中の点線により示される各線分）が求められ、各点に対応する線分が前景成分画像の輪郭を平滑化する。尚、平滑化の手法は、回帰法（最小自乗法）により各点に対応する線分を求めるのみならず、各点を用いてスプライン近似により得られる曲線を用いてもよいし、多項式近似により輪郭となる曲線を求めるようにしてもよい。
【０６１０】
ステップＳ９２３において、空間混合比生成部１００３は、輪郭平滑化部１００２より供給された平滑化された輪郭情報に基づいて、空間混合比を求める。すなわち、上述したように、図８８の太線で示された階段状の輪郭が図８９の点線で示されるように平滑化された輪郭に基づいて、輪郭近傍の各画素上の前景成分画像の面積に対応して、図９０で示されるように、輪郭近傍の画素の空間混合比を求めて前景成分画像スムージング部１００７、および、領域情報調整部１００６に出力する。
【０６１１】
ステップＳ９２４において、前景成分画像スムージング部１００４は、動きボケが除去された前景成分画像を、空間混合比生成部１００３より供給された、前景成分画像の輪郭部分の画素の空間混合比の情報に基づいて、スムージング処理を施し、スムージングした前景成分画像を動きボケ付加部１００５に出力する。すなわち、前景成分画像スムージング部１００４は、空間混合比を前景成分画像の各画素（実際には、輪郭を形成する画素のみ）の画素値に空間混合比を乗じて、前景成分画像の輪郭をスムージングする。
【０６１２】
ステップＳ９２５において、動きボケ付加部１００５は、図９１，図９２で示して、上述したようにスムージングされた前景成分画像に、動き検出部１０２より供給される動きベクトルに基づいて動きボケを付加することにより、入力画像に元々含まれていた動きボケを付加して、時間混合比算出部１００７、混合領域画像合成部１００９、および、画像合成部１０１０に出力する。
【０６１３】
ステップＳ９２６において、領域情報生成部１００６は、領域特定部１０３より供給された領域情報を、空間混合比生成部１００３より供給される空間混合比の情報に基づいて調整し、調整した領域情報を時間混合比算出部１００７、および、画像合成部１０１０に出力する。すなわち、上述したように前景成分画像の輪郭をスムージングすることにより新たに生成された空間混合比により前景成分画像の領域は、平滑化された輪郭となる線分が通る画素にまで広がることになるため、領域情報調整部１００６は、前景成分画像の輪郭部分の領域が広がるように調整する。
【０６１４】
ステップＳ９２７において、時間混合比算出部１００７は、図６の混合比算出部１０４と同様に、動きボケが付加された前景成分画像と、領域情報調整部１００６より供給される領域情報に基づいて、入力される任意の背景画像から背景成分画像を生成し、混合領域画像合成部１００９に供給する。
【０６１５】
ステップＳ９２８において、背景成分生成部１００８は、時間混合比および任意の背景画像を基に、背景成分画像を生成して、混合領域画像合成部１００９に供給する。
【０６１６】
ステップＳ９２９において、混合領域画像合成部１００９は、背景成分生成部１００７から供給された背景成分画像と動きボケが付加された前景成分画像とを合成することにより、混合領域合成画像を生成して、生成した混合領域合成画像を画像合成部１０１０に供給する。
【０６１７】
画像合成部１０１０は、領域情報調整部１００６より供給される領域情報を基に、動きボケ付加部１００５により動きボケが付加された前景成分画像、混合領域画像合成部１０２２から供給された混合領域合成画像、および任意の背景画像を合成して、合成画像を生成して出力する。
【０６１８】
以上の処理により、前景成分画像の輪郭部分が画素単位で形成された輪郭ではなく、空間混合比を考慮して、スムージングした前景成分画像が合成されるため、図９４で示される混合比のように、階段状の前景成分画像の空間の連続性が考慮された、違和感のない合成画像を生成することが可能となる。尚、図９４では、白色が前景成分画像の混合比１を示し、黒色が前景成分画像の混合比０、すなわち、前景成分画像以外の領域（例えば、背景成分画像の領域）を示している。従って、白、および、黒の中間色は、混合領域であることを示しており、黒色に近い部分ほど混合比が高いことが示されている。
【０６１９】
また、図９５は、図９４上の前景成分画像の画素と背景成分画像の画素の混合比に基づいて、合成された画像の画素値の分布を示したものであり、水平方向と垂直方向の座標位置上の画素値の分布が示されている。高いところほど画素値が高いことを示している。
【０６２０】
図９５で示されるように、以上の処理により、前景成分画像と背景成分画像の境界付近は、滑らかに画素値が変化している。このため、合成された画像には、混合領域においても破綻が生じることなく、見た目にも自然な合成画像が生成されていることが示されている。
【０６２１】
結果として、図９６で示されているような前景成分画像と背景成分画像が、より自然な状態で合成された合成画像を生成することが可能となる。尚、図９６においては、図中下部で示されるように、略左部が前景成分画像であり、略右部が背景成分画像となっている。
【０６２２】
以上においては、前景成分画像の輪郭を形成する画素の空間混合比を、前景成分画像の輪郭を平滑化して、輪郭の画素上の面積の割合に応じた空間混合比を求めることによりスムージングし、前景成分画像と背景成分画像を見た目に自然に合成させる例について説明してきたが、前景成分画像の輪郭となる端部の画素は、上述したように空間的に前景成分画像と背景成分画像が混合していることが多い。そこで、前景成分画像の輪郭を形成する画素については、空間的に前景成分画像と背景成分画像が混合しているものとし、端部以外の画素より外挿補間により空間混合比を求めるようにしてもよい。
【０６２３】
図９７は、前景成分画像の輪郭を形成する画素については、空間的に前景成分画像と背景成分画像が混合しているものとし、端部以外の画素より外挿補間により空間混合比を求めるようにした合成部１０７の構成を示している。
【０６２４】
尚、図８７で示した構成と同一のものについては、同一の番号を付して適宜説明は省略するものとする。
【０６２５】
図９７の合成部１０７は、図８７の合成部１０７と基本的に同様の機能を果たすものであるが、輪郭抽出部１００１、輪郭平滑化部１００２、および空間混合比生成部１００３に代えて、輪郭近傍画素抽出部１１０１、外挿補間部１１０２および、空間混合比生成部１１０３が設けられており、さらに、領域情報調整部１００６が削除されていることが異なる。
【０６２６】
輪郭近傍画素抽出部１１０１は、入力された動きボケが除去された前景成分画像（動きボケのない前景画像）の輪郭となる端部とその付近の画素の情報を抽出し、外挿補間部１１０２に供給する。
【０６２７】
外挿補間部１１０２は、輪郭近傍画素抽出部１１０１より供給された前景成分画像の輪郭とその近傍の画素に基づいて、前景成分画像の輪郭を形成する画素の前景成分画像のみからなる画素値を外挿補間により求めて、求められた前景成分画像の輪郭を形成する画素の情報と、輪郭の近傍の画素を空間混合比生成部１１０３に供給する。
【０６２８】
すなわち、前景成分画像の画素値と、背景成分画像の画素値が図９８の図中左部で示されるような場合、対応する空間混合比は、図９８の右部で示されるように、前景成分画像の画素が存在する領域は１となり背景成分画像の領域は０となる。これは、前景成分画像の輪郭を形成する画素に空間的な混合が生じていないことが前提となったものであり、図９８の右部で示される前景成分画像の輪郭を形成する画素には、背景成分画像の成分を含まない（混合が生じていないこと）が前提となっている。尚、図９８の左部においては、左から３番目までの縦棒が前景成分画像の画素の画素値のレベルを示しており、右から２番目までの縦棒が背景成分画像の画素の画素値のレベルを示している。従って、左から３番目の縦棒が前景成分画像の輪郭を形成する端部の画素である。
【０６２９】
しかしながら、前景成分画像の輪郭を形成する画素は、図８８で示されるように、画素内で輪郭（エッジ）が生じることが多く、実際には、前景成分画像と背景成分画像が混合しているものと考えられる。
【０６３０】
外挿補間部１１０２は、前景成分画像の輪郭を形成する端部の画素の画素値を、その端部近傍の画素を用いた外挿補間により前景成分画像の輪郭を形成する端部の、前景成分画像の成分のみからなる画素値を求める（推定する）。例えば、図９９の左部で示されるように、前景成分画像の画素が左から３画素連続して、水平方向、または、垂直方向に並んでいるものとする。
【０６３１】
このとき、図９９の左部の左から３番目の画素に対応する縦棒で示される画素のレベルは、前景成分画像と背景成分画像が空間的に混合している画素値であり、左から２番目までの画素のレベルは、前景成分画像のみのレベルであるので、前景成分画像の輪郭を形成する画素（図９９右部の左から３番目の画素）の本来の画素値（前景成分画像だけの画素値）は、図９９中の右上部の左から２番目までの画素値に基づいて外挿補間された値（外挿補間により推定された値）である。
【０６３２】
より詳細には、外挿補間部１１０２は、前景成分画像の輪郭を形成する複数の画素（例えば、図９９の右上部左から２番目までの画素）の画素値から図９９中の右上部の点線で示された直線に相当する関数を求めると共に、その関数を用いて前景成分画像の輪郭を形成する画素（図９９中の右上部の左から３番目の画素）の前景成分画像のみからなる画素値を求める。尚、図９９の右上部においては、黒色で塗られた縦棒が、入力された前景成分画像の画素値（前景成分画像と背景成分画像が空間的に混合している画素の画素値）を示し、黒色と灰色の領域を合計した縦棒が外挿補間により求められた前景成分画像のみの画素値（推定画素値）を示している。
【０６３３】
尚、外挿補間に用いられる前景成分画像の輪郭を形成する画素の近傍画素は、輪郭を形成する画素から見て、後述する動きボケを付加する処理において、付加される動きの方向に隣接する画素が用いられる。このようにすることで、前景成分画像を構成する画素間の連続性が保たれるので、前景成分画像により自然な動きボケを付加することが可能となり、さらに、任意の背景画像と合成することで、より自然な合成画像を生成することが可能となる。また、外挿補間するために求められる関数を求める際、外挿補間部１１０２は、前景成分画像の輪郭を形成する画素の近傍の２画素を用いて求めるように説明してきているが、２画素以上の画素を用いるようにしてもよい。
【０６３４】
空間混合比生成部１１０３は、入力された前景成分画像の輪郭を形成する画素の外挿補間により求められた画素値を用いて、輪郭を形成する画素の空間混合比を生成し前景成分画像スムージング部１００４に出力する。
【０６３５】
すなわち、空間混合比生成部１１０３は、外挿補間により求められた前景成分画像の輪郭を形成する画素に対する入力された前景成分画像の輪郭を形成する画素の画素値の割合を求め、これを空間混合比として前景成分画像スムージング部１００４に出力する。
【０６３６】
より詳細には、図９９の右上部の左から３番目の画素の画素値の黒色と灰色で塗られた部分で示されている画素値（外挿補間により求められた画素値）をPpとし、黒色のみが塗られた部分で示される画素値（入力された前景成分画像の画素値）をPrとするとき、空間混合比生成部１１０３は、空間混合比としてPr／Ppを求める。
【０６３７】
このような処理により、空間混合が考慮された空間混合比は、図９９の右下部で示されるような分布となる。すなわち、左から2番目までの画素の空間混合比は１であり、左から3番目の画素の混合比は、Pr／Ppとなり、さらに、左から４，５番目の画素の空間混合比は０となる。
【０６３８】
次に、図１００のフローチャートを参照して、図９７の合成部１０７による画像の合成の処理について説明する。尚、図１００のステップＳ９４４乃至Ｓ９４９の処理は、図９３のフローチャートを参照して説明したステップＳ９２４，Ｓ９２５，９２７乃至９３０の処理と同様であるので、その説明は省略する。
【０６３９】
ステップＳ９４１において、輪郭近傍画素抽出部１１０１は、入力された前景成分画像より、その輪郭を形成している画素とその近傍の画素を抽出して、外挿補間部１１０２に出力する。
【０６４０】
すなわち、輪郭近傍画素抽出部１１０１は、例えば、図９９を参照して説明したように、図９９の左部の左から３番目の画素は、前景成分画像の輪郭を形成する画素となるので、その画素そのものと、左から２番目までの画素を、前景成分画像の輪郭を形成する画素の近傍の画素として抽出する。輪郭近傍画素抽出部１１０１は、このような処理を、前景成分画像の輪郭を形成している全ての画素について実行し、抽出した輪郭を形成する画素と、その近傍の画素の情報を外挿補間部１１０２に出力する。
【０６４１】
ステップＳ９４２において、外挿補間部１１０２は、輪郭近傍画素抽出部１１０１より供給される前景成分画像の輪郭を形成する画素と、その近傍の画素の画素値の情報に基づいて、輪郭を形成する画素を外挿補間により生成する。
【０６４２】
すなわち、外挿補間部１１０２は、例えば、図９９の右上部の左から１，２番目の画素を前景成分画像の輪郭を形成する画素の近傍の画素として用い、例えば、最小自乗法により、図９９の右上部の点線で示されるような近傍画素の画素値の変化を直線近似し、さらに、近似により求められた直線を用いて外挿補間することにより、図９９の右上部の黒色と灰色に塗りつぶされた領域からなる縦棒からなる画素値を、前景成分画像の輪郭を形成する画素の画素値として求めて（推定して）、輪郭近傍画素抽出部１１０１より供給された入力された、前景成分画像の輪郭を形成する画素の画素値と共に空間混合比生成部１１０３に供給する。
【０６４３】
ステップＳ９４３において、空間混合比生成部１１０３は、外挿補間により求められた、前景成分画像の輪郭を形成する画素の画素値に対する、入力された前景成分画像の輪郭を形成する画素の画素値の割合を空間混合比として、全ての前景成分画像の輪郭を形成する画素毎に求めて、前景成分画像スムージング部１００４に出力する。
【０６４４】
すなわち、空間混合比生成部１１０３は、上述したように、図９９の右上部の左から3番目の画素の画素値の黒色と灰色で塗られた部分で示されている画素値（外挿補間により求められた画素値）をPpとし、黒色のみが塗られた部分で示される画素値（入力された前景成分画像の画素値）をPrとするとき、空間混合比生成部１１０３は、Pr／Ppを空間混合比として求めて出力する。
【０６４５】
このような処理により、空間混合が考慮された空間混合比は、図９９の右下部で示されるように、左から2番目までの画素の空間混合比は１となり、左から３番目の画素の混合比は、Pr／Ppとなり、さらに、左から４，５番目の画素の空間混合比は０となる。
【０６４６】
以上によれば、画素単位で前景成分画像の輪郭を表現することにより生じていた階段状の破綻現象は、空間混合比が考慮されるので、輪郭を形成する画素の画素値を正確に表現することができ、前景成分画像と任意の背景成分画像を合成しても、見た目に自然な合成画像を生成することが可能となる。
【０６４７】
なお、センサは、CCDに限らす、固体撮像素子である、例えば、BBD（Bucket Brigade Device）、CID（Charge Injection Device）、またはCPD（Charge Priming Device）などのセンサでもよく、また、検出素子がマトリックス状に配置されているセンサに限らず、検出素子が１列に並んでいるセンサでもよい。
【０６４８】
上述した一連の処理は、ハードウェアにより実行させることもできるが、ソフトウェアにより実行させることもできる。一連の処理をソフトウェアにより実行させる場合には、そのソフトウェアを構成するプログラムが、専用のハードウェアに組み込まれているコンピュータ、または、各種のプログラムをインストールすることで、各種の機能を実行させることが可能な、例えば汎用のパーソナルコンピュータなどに記録媒体からインストールされる。
【０６４９】
図１０１は、図６の画像処理部の電気的な内部構成をソフトウェアにより実現する場合のパーソナルコンピュータの一実施の形態の構成を示している。パーソナルコンピュータのCPU２００１は、パーソナルコンピュータの全体の動作を制御する。また、CPU２００１は、バス２００４および入出力インタフェース２００５を介してユーザからキーボードやマウスなどからなる入力部２００６から指令が入力されると、それに対応してROM(Read Only Memory)２００２に格納されているプログラムを実行する。あるいはまた、CPU２００１は、ドライブ２０１０に接続された磁気ディスク２０２１、光ディスク２０２２、光磁気ディスク２０２３、または半導体メモリ２０２４から読み出され、記憶部２００８にインストールされたプログラムを、RAM(Random Access Memory)２００３にロードして実行する。これにより、上述した図６の画像処理部の機能が、ソフトウェアにより実現されている。さらに、CPU２００１は、通信部２００９を制御して、外部と通信し、データの授受を実行する。
【０６５０】
プログラムが記録されている記録媒体は、図１０１に示すように、コンピュータとは別に、ユーザにプログラムを提供するために配布される、プログラムが記録されている磁気ディスク２０２１（フレキシブルディスクを含む）、光ディスク２０２２（CD-ROM(Compact Disc-Read Only Memory)，DVD（Digital Versatile Disk）を含む）、光磁気ディスク２０２３（MD（Mini-Disc）を含む）、もしくは半導体メモリ２０２４などよりなるパッケージメディアにより構成されるだけでなく、コンピュータに予め組み込まれた状態でユーザに提供される、プログラムが記録されているROM２００２や、記憶部２００８に含まれるハードディスクなどで構成される。
【０６５１】
尚、本明細書において、記録媒体に記録されるプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理は、もちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理を含むものである。
【０６５２】
【発明の効果】
本発明によれば、前景成分画像と任意の背景成分画像を見た目に自然な画像として合成することが可能となる。
【図面の簡単な説明】
【図１】従来の合成画像を示す図である。
【図２】従来の合成画像の画素値の分布を示す図である。
【図３】従来の合成画像の混合比を示す図である。
【図４】従来の合成画像の前景成分画像と背景成分画像の画素単位の関係を示す図である。
【図５】本来の合成画像の前景成分画像と背景成分画像の画素単位の関係を示す図である。
【図６】画像処理部の構成例を示すブロック図である。
【図７】センサによる撮像を説明する図である。
【図８】画素の配置を説明する図である。
【図９】検出素子の動作を説明する図である。
【図１０】動いている前景に対応するオブジェクトと、静止している背景に対応するオブジェクトとを撮像して得られる画像を説明する図である。
【図１１】背景領域、前景領域、混合領域、カバードバックグラウンド領域、およびアンカバードバックグラウンド領域を説明する図である。
【図１２】静止している前景に対応するオブジェクトおよび静止している背景に対応するオブジェクトを撮像した画像における、隣接して１列に並んでいる画素の画素値を時間方向に展開したモデル図である。
【図１３】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図１４】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図１５】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図１６】前景領域、背景領域、および混合領域の画素を抽出した例を示す図である。
【図１７】画素と画素値を時間方向に展開したモデルとの対応を示す図である。
【図１８】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図１９】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２０】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２１】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２２】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２３】合成画像生成の処理を説明するフローチャートである。
【図２４】領域特定部１０３の構成を示すブロック図である。
【図２５】前景に対応するオブジェクトが移動しているときの画像を説明する図である。
【図２６】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２７】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２８】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図２９】領域判定の条件を説明する図である。
【図３０】領域特定部１０３の領域の特定の結果の例を示す図である。
【図３１】領域特定部１０３の領域の特定の結果の例を示す図である。
【図３２】領域特定の処理を説明するフローチャートである。
【図３３】領域特定部１０３の他の構成を示すブロック図である。
【図３４】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図３５】背景画像の例を示す図である。
【図３６】２値オブジェクト画像抽出部３０２の構成を示すブロック図である。
【図３７】相関値の算出を説明する図である。
【図３８】相関値の算出を説明する図である。
【図３９】２値オブジェクト画像の例を示す図である。
【図４０】時間変化検出部３０３の構成を示すブロック図である。
【図４１】領域判定部３４２の判定を説明する図である。
【図４２】時間変化検出部３０３の判定の例を示す図である。
【図４３】領域判定部１０３の領域特定の処理を説明するフローチャートである。
【図４４】領域判定の処理の詳細を説明するフローチャートである。
【図４５】領域特定部１０３のさらに他の構成を示すブロック図である。
【図４６】ロバスト化部３６１の構成を説明するブロック図である。
【図４７】動き補償部３８１の動き補償を説明する図である。
【図４８】動き補償部３８１の動き補償を説明する図である。
【図４９】領域特定の処理を説明するフローチャートである。
【図５０】ロバスト化の処理の詳細を説明するフローチャートである。
【図５１】混合比算出部１０４の構成を示すブロック図である。
【図５２】推定混合比処理部４０１の構成を示すブロック図である。
【図５３】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図５４】重み付け差分を説明するモデル図である。
【図５５】重み付け差分を説明するモデル図である。
【図５６】重み付け差分と動きベクトルとの関係を説明するモデル図である。
【図５７】相関値の演算を説明するモデル図である。
【図５８】相関値の演算の対象となる画素のブロックの例を示す図である。
【図５９】混合比算出部１０４の他の構成を示すブロック図である。
【図６０】混合比の算出の処理を説明するフローチャートである。
【図６１】カバードバックグラウンド領域に対応する混合比推定の処理を説明するフローチャートである。
【図６２】混合比αを近似する直線を説明する図である。
【図６３】混合比αを近似する平面を説明する図である。
【図６４】混合比αを算出するときの複数のフレームの画素の対応を説明する図である。
【図６５】混合比推定処理部４０１の他の構成を示すブロック図である。
【図６６】推定混合比の例を示す図である。
【図６７】カバードバックグラウンド領域に対応するモデルによる混合比推定の処理を説明するフローチャートである。
【図６８】前景背景分離部１０５の構成の一例を示すブロック図である。
【図６９】入力画像、前景成分画像、および背景成分画像を示す図である。
【図７０】入力画像、前景成分画像、および背景成分画像を示す図である。
【図７１】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図７２】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図７３】画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図７４】分離部６０１の構成の一例を示すブロック図である。
【図７５】分離された前景成分画像、および背景成分画像の例を示す図である。
【図７６】前景と背景との分離の処理を説明するフローチャートである。
【図７７】動きボケ除去部１０６の構成の一例を示すブロック図である。
【図７８】処理単位を説明する図である。
【図７９】前景成分画像の画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図８０】前景成分画像の画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図８１】動きボケ除去部１０６による前景成分画像に含まれる動きボケの除去の処理を説明するフローチャートである。
【図８２】動きボケ除去部１０６の他の構成を示す図である。
【図８３】画素値と前景の成分のとの対応を指定するモデルの例を示す図である。
【図８４】前景の成分の算出を説明する図である。
【図８５】前景の成分の算出を説明する図である。
【図８６】前景の動きボケの除去の処理を説明するフローチャートである。
【図８７】合成部１０７の構成の一例を示すブロック図である。
【図８８】前景成分画像の輪郭部の平滑化処理を説明する図である。
【図８９】前景成分画像の輪郭部のスムージング処理を説明する図である。
【図９０】前景成分画像の輪郭部のスムージング処理を説明する図である。
【図９１】前景成分画像の画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図９２】前景成分画像の画素値を時間方向に展開し、シャッタ時間に対応する期間を分割したモデル図である。
【図９３】画像の合成の処理を説明するフローチャートである。
【図９４】合成された画像の混合比を示す図である。
【図９５】合成された画像の画素値の分布を示す図である。
【図９６】合成された画像を示す図である。
【図９７】合成部１０７の構成のその他の一例を示すブロック図である。
【図９８】前景成分画像の画素値レベルと混合比の対応を示す図である。
【図９９】前景成分画像の輪郭を形成する画素の画素値レベルを外挿補間して求め、外挿補間により求められた画素値と、入力された画像の画素値に基づいて混合比を求める手法を説明する図である。
【図１００】画像の合成の処理を説明するフローチャートである。
【図１０１】記録媒体を説明する図である。
【符号の説明】
２１ CPU，２２ ROM，２３ RAM，２６入力部，２７出力部，２８記憶部，２９通信部，５１磁気ディスク，５２光ディスク，５３光磁気ディスク，５４半導体メモリ，１０１オブジェクト抽出部，１０２動き検出部，１０３領域特定部，１０４混合比算出部，１０５前景背景分離部，１０６動きボケ除去部，１０７合成部，２０１フレームメモリ，２０２−１乃至２０２−４静動判定部，２０３−１乃至２０３−３領域判定部，２０４判定フラグ格納フレームメモリ，２０５合成部，２０６判定フラグ格納フレームメモリ，３０１背景画像生成部，３０２２値オブジェクト画像抽出部，３０３時間変化検出部，３２１相関値演算部，３２２しきい値処理部，３４１フレームメモリ，３４２領域判定部，３６１ロバスト化部，３８１動き補償部，３８２スイッチ，３８３−１乃至３８３−Ｎフレームメモリ、３８４−１乃至３８４−Ｎ重み付け部，３８５積算部，４０１推定混合比処理部，４０２推定混合比処理部，４０３混合比決定部，４２１フレームメモリ，４２２重み生成部，４２３重み付けフレーム差分演算部，４２４動き補償部，４２５フレームメモリ，４２６相関値演算部，４２７最大値判定部，４４１選択部，４４２選択部，６０１分離部，６０２スイッチ，６０３合成部，６０４スイッチ，６０５合成部，６２１フレームメモリ，６２２分離処理ブロック，６２３フレームメモリ，６３１アンカバード領域処理部，６３２カバード領域処理部，６３３合成部，６３４合成部，８０１処理単位決定部，８０２モデル化部，８０３方程式生成部，８０４足し込み部，８０５演算部，９０１処理単位決定部，９０２モデル化部，９０３方程式生成部，９０４演算部，９０５補正部，１００１輪郭抽出部，１００２輪郭平滑化部，１００３空間混合比生成部，１００４前景成分画像スムージング部，１００５動きボケ付加部，１００６領域情報調整部，１００７時間混合比算出部，１００８背景成分生成部，１００９混合領域画像合成部，１０１０画像合成部，１１０１輪郭近傍画素抽出部，１１０２外挿補間部，１１０３空間混合比生成部[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an image processing apparatus and method, a recording medium, and a program, and more particularly to an image processing apparatus and method, a recording medium, and a program that can synthesize an image without failure.
[0002]
[Prior art]
An image synthesis technique for synthesizing a foreground component image and a background component image into one image is generally popular. That is, the image composition technique is to generate one composite image by compositing the foreground component image included in the first image and the background component image included in the second image. It is.
[0003]
In addition, there is one that adjusts the motion blur of the foreground component image and combines it with the background component image. (For example, refer to Patent Document 1).
[0004]
[Patent Document 1]
JP 2003-18564 A (Japanese Patent Application 2001-195605)
[0005]
[Problems to be solved by the invention]
By the way, when the composite image as shown in FIG. 1A is generated using the above-described conventional method, the mixing ratio of the foreground component image and the background component image has a distribution as shown in FIG. 1B. FIG. 1B shows the distribution of the mixture ratio of the foreground component and the background component for each pixel in color. In FIG. 1B, white indicates that the foreground component ratio is 1 and the background component is 0, and black indicates that the foreground component ratio is 0 and the background component is 1. Therefore, when the ratio of the foreground component and the background component is ½, gray (an intermediate color between white and black) is displayed.
[0006]
In FIGS. 1A and 1B, since the image that is the foreground component is an image that moves in the horizontal direction, motion blur occurs with respect to the background component. Therefore, the mixing between the foreground component and the background component In the region, there is a mixed region where the respective components are mixed.
[0007]
FIG. 2 shows the distribution of the pixel values of the foreground component image and the background component image on FIG. 1A, and the distribution of the pixel values at the coordinate positions in the horizontal and vertical directions. .
[0008]
As shown in FIGS. 1A, 1B, or 2, the pixel values and the mixing ratio change stepwise near the boundary between the foreground component image and the background component image. For this reason, the synthesized image has a problem in that the breakdown occurs in the mixed region and the image becomes unnatural.
[0009]
This is because the foreground component image and the background component image are obtained in units of pixels, so that the boundary portion combines the foreground component image from which motion blur is removed and the background component image as shown in FIG. As you can see, the continuity of the shape is not maintained, and the shape becomes stepped and jagged.
[0010]
That is, as shown in FIG. 4, light incident on a CCD (Charge Coupled Device) or the like is expressed for each pixel. In FIG. 4, the light-emitting area of the background component image is shown in black, and the light-emitting portion of the foreground component image is shown by a hatched portion, and the cup-like container is filled with the emitted light as if each pixel of the CCD was used. This indicates that each pixel is expressed. In the case of FIG. 4, the pixels from the left to the fourth have pixel values indicating the foreground component image, and the pixels from the right to the sixth background component image. That is, in FIG. 4, the foreground component image and the background component image are expressed in units of pixels. However, the actual boundary between the foreground component image and the background component image does not exist for each pixel of the CCD, and the boundary may be on the pixel.
[0011]
For example, as shown in FIG. 5, there may actually be a boundary between the foreground component image and the background component image on the CCD pixel. In FIG. 5, the center position of the fifth pixel from the left is the foreground component image, and the center position of the sixth pixel from the right represents the background component image. The fifth pixel from the left (from the right) The center position of the sixth pixel) is the boundary. In this case, in fact, the fifth pixel from the left should have a pixel value obtained by mixing the foreground component image and the background component image.
[0012]
However, since the foreground component image and the background component image described above are divided for each pixel, even if they are combined and motion blur is added, the boundary portion becomes a stepped shape, and the image is uncomfortable. There was a problem that it would become.
[0013]
The present invention has been made in view of such a situation, and enables a composite image to be accurately generated by expressing a mixed region that becomes a boundary between a foreground component image and a background component image in more detail. is there.
[0014]
[Means for Solving the Problems]
The image processing apparatus according to the present invention includes a pixel detection unit that detects a pixel at an end of a contour portion in a direction of a motion to be added, and a pixel detected at the end detected by the pixel detection unit. Located around the pixel value and the edge pixel In the foreground image From surrounding pixels By extrapolating A mixture ratio detecting means for detecting a ratio with the estimated pixel value to obtain a mixture ratio at an end, a motion blur adding means for generating a motion blur added foreground image obtained by adding a predetermined motion blur to the foreground image, a motion And a synthesizing unit that synthesizes the foreground image with the background image based on the blending ratio of the blur added foreground image and the edge.
[0015]
The image processing method according to the present invention includes a pixel detection step for detecting a pixel at an end portion of a contour portion in a direction of a motion to be added in a foreground image having no motion blur, and an end portion detected by the processing of the pixel detection step. Located around the pixel value of the pixel and the edge pixel In the foreground image From surrounding pixels By extrapolating A mixing ratio detection step for detecting a ratio with the estimated pixel value to be estimated and setting the mixing ratio at the end, and a motion blur adding step for generating a motion blur added foreground image in which a predetermined motion blur is added to the foreground image And a synthesis step of synthesizing the foreground image with the background image based on the mixing ratio of the motion blur added foreground image and the end portion.
[0016]
The recording medium program of the present invention includes a pixel detection step for detecting a pixel at an end portion of a contour portion in a direction of motion to be added from a foreground image without motion blur, and an end portion detected by processing of the pixel detection step. Is located around the pixel value of the pixel and the edge pixel In the foreground image From surrounding pixels By extrapolating A mixing ratio detection step for detecting a ratio with the estimated pixel value to be estimated and setting the mixing ratio at the end, and a motion blur adding step for generating a motion blur added foreground image in which a predetermined motion blur is added to the foreground image And a synthesis step of synthesizing the foreground image with the background image based on the mixing ratio of the motion blur added foreground image and the end portion.
[0017]
The program according to the present invention includes a pixel detection step for detecting a pixel at an end portion of a contour portion in a direction of a motion to be added and a pixel at an end portion detected by the processing of the pixel detection step in a foreground image without motion blur. Located around the pixel value and the edge pixel In the foreground image From surrounding pixels By extrapolating A mixing ratio detection step for detecting a ratio with the estimated pixel value to be estimated and setting the mixing ratio at the end, and a motion blur adding step for generating a motion blur added foreground image in which a predetermined motion blur is added to the foreground image And a synthesis step of synthesizing the foreground image with the background image based on the mixture ratio of the motion blur added foreground image and the edges.
[0018]
In the image processing apparatus, method, and program according to the present invention, the pixel at the edge of the contour in the direction of motion to be added is detected from the foreground image without motion blur, and the pixel value of the detected pixel at the edge And located around the edge pixels In the foreground image From surrounding pixels By extrapolating A ratio with the estimated pixel value to be estimated is detected and used as a mixture ratio of the end portion, and a motion blur added foreground image in which a predetermined motion blur is added to the foreground image is generated, and a motion blur added foreground image and an end portion are generated. Based on the mixing ratio, the foreground image is synthesized with the background image.
[0019]
The image processing apparatus of the present invention may be an independent apparatus or a block that performs image processing.
[0020]
DETAILED DESCRIPTION OF THE INVENTION
Embodiments of the present invention will be described below. Correspondences between constituent elements described in the claims and specific examples in the embodiments of the present invention are exemplified as follows. This description is intended to confirm that specific examples supporting the invention described in the claims are described in the embodiments of the invention. Therefore, even though there are specific examples that are described in the embodiment of the invention but are not described here as corresponding to the configuration requirements, the specific examples are not included in the configuration. It does not mean that it does not correspond to a requirement. On the contrary, even if a specific example is described here as corresponding to a configuration requirement, this means that the specific example does not correspond to a configuration requirement other than the configuration requirement. not.
[0021]
Further, this description does not mean that all the inventions corresponding to the specific examples described in the embodiments of the invention are described in the claims. In other words, this description is an invention corresponding to the specific example described in the embodiment of the invention, and the existence of an invention not described in the claims of this application, that is, a future divisional application. Or the existence of an invention added by amendment is not denied.
[0022]
That is, the image processing apparatus of the present invention detects pixel detection means (for example, the contour neighboring pixel extraction unit 1101 in FIG. 97) that detects pixels at the edge of the contour in the direction of motion to be added from the foreground image without motion blur. ), The pixel value of the edge pixel detected by the pixel detection means, and the periphery of the edge pixel In the foreground image From surrounding pixels By extrapolating Mixing ratio detection means (for example, the spatial mixing ratio generation unit 1101 in FIG. 97) that detects the ratio with the estimated pixel value to be used and sets the mixing ratio at the end, and adds a predetermined motion blur to the foreground image Based on the motion blur adding means (for example, the motion blur adding unit 1005 in FIG. 97) for generating the motion blur added foreground image and the mixing ratio of the motion blur added foreground image and the end portion, the foreground image is synthesized with the background image. And a synthesizing unit (for example, an image synthesizing unit 1010 in FIG. 97).
[0023]
Further, the image processing method of the present invention includes a pixel detection step (the process of step S941 in the flowchart of FIG. 100) for detecting pixels at the edge of the contour portion in the direction of motion to be added from the foreground image without motion blur. , The pixel value of the end pixel detected by the processing of the pixel detection step and the periphery of the end pixel In the foreground image From surrounding pixels By extrapolating A mixture ratio detection step (processing in step S943 in the flowchart of FIG. 100) that detects a ratio with the estimated pixel value to be an end portion, and a motion in which a predetermined motion blur is added to the foreground image A motion blur adding step for generating a blur-added foreground image (processing in step S945 in the flowchart of FIG. 100) and a combining step for combining the foreground image with the background image based on the mixture ratio of the motion blur-added foreground image and the end portion ( Step S949 in the flowchart of FIG. 100).
[0024]
FIG. 6 is a block diagram showing a configuration of an embodiment of an image processing unit to which the present invention is applied.
[0025]
It does not matter whether each function of the image processing unit is realized by hardware or software. That is, each block diagram in this specification may be considered as a hardware block diagram or a software functional block diagram.
[0026]
Here, the motion blur refers to a distortion included in an image corresponding to a moving object, which is caused by the movement of an object in the real world to be imaged and the imaging characteristics of the sensor.
[0027]
In this specification, an image corresponding to an object in the real world to be imaged is referred to as an image object.
[0028]
The input image supplied to the image processing unit is supplied to the object extracting unit 101, the region specifying unit 103, the mixture ratio calculating unit 104, and the foreground / background separating unit 105.
[0029]
The object extraction unit 101 roughly extracts an image object corresponding to a foreground object included in the input image, and supplies the extracted image object to the motion detection unit 102. For example, the object extraction unit 101 detects the outline of the image object corresponding to the foreground object included in the input image, thereby roughly extracting the image object corresponding to the foreground object.
[0030]
The object extraction unit 101 roughly extracts an image object corresponding to a background object included in the input image, and supplies the extracted image object to the motion detection unit 102. For example, the object extraction unit 101 roughly extracts an image object corresponding to the background object from the difference between the input image and the image object corresponding to the extracted foreground object.
[0031]
Further, for example, the object extraction unit 101 corresponds to the image object corresponding to the foreground object and the background object from the difference between the background image stored in the background memory provided therein and the input image. You may make it extract the image object to perform roughly.
[0032]
The motion detection unit 102 calculates the motion vector of the image object corresponding to the coarsely extracted foreground object by a method such as a block matching method, a gradient method, a phase correlation method, and a per-recursive method. The motion vector and the position information of the motion vector (information specifying the position of the pixel corresponding to the motion vector) are supplied to the region specifying unit 103, the mixture ratio calculating unit 104, and the motion blur extracting unit 106.
[0033]
The motion vector output from the motion detection unit 102 includes information corresponding to the motion amount v.
[0034]
Further, for example, the motion detection unit 102 may output a motion vector for each image object to the motion blur removal unit 106 together with pixel position information for specifying a pixel in the image object.
[0035]
The motion amount v is a value that represents a change in the position of the image corresponding to the moving object in units of pixel intervals. For example, when the image of the object corresponding to the foreground is moved so as to be displayed at a position separated by four pixels in the next frame with reference to a certain frame, the motion amount v of the image of the object corresponding to the foreground is 4.
[0036]
Note that the object extraction unit 101 and the motion detection unit 102 are necessary when adjusting the amount of motion blur corresponding to a moving object.
[0037]
The area specifying unit 103 specifies each pixel of the input image as one of the foreground area, the background area, or the mixed area, and whether each pixel belongs to one of the foreground area, the background area, or the mixed area (Hereinafter referred to as area information) is supplied to the mixture ratio calculation unit 104, the foreground / background separation unit 105, and the motion blur removal unit 106.
[0038]
Based on the input image and the region information supplied from the region specifying unit 103, the mixture ratio calculation unit 104 calculates a mixture ratio (hereinafter referred to as a mixture ratio α) corresponding to the pixels included in the mixture region 63. The calculated mixture ratio is supplied to the foreground / background separator 105.
[0039]
The mixing ratio α is a value indicating a ratio of an image component (hereinafter also referred to as a background component) corresponding to a background object in a pixel value, as shown in an equation (3) described later.
[0040]
Based on the region information supplied from the region specifying unit 103 and the mixture ratio α supplied from the mixture ratio calculation unit 104, the foreground / background separation unit 105 performs image component corresponding to the foreground object (hereinafter referred to as foreground component). The input image is separated into a foreground component image consisting of only the background component and a background component image consisting only of the background component, and the foreground component image is supplied to the motion blur removal unit 106 and the synthesis unit 107. Only the foreground and the background can be specified without considering the conventional mixed region, and an accurate foreground and background can be obtained as compared with the separated method.
[0041]
The motion blur removal unit 106 determines a processing unit indicating one or more pixels included in the foreground component image based on the motion amount v and the region information that can be known from the motion vector. The processing unit is data that designates a group of pixels to be subjected to a process for adjusting the amount of motion blur.
[0042]
The motion blur removal unit 106 is based on the foreground component image supplied from the foreground / background separation unit 105, the motion vector and the position information supplied from the motion detection unit 102, and the processing unit, and the motion blur included in the foreground component image. And the foreground component image from which the motion blur is removed is output to the synthesis unit 107. The motion vector and its position information may not be used.
[0043]
The synthesizing unit 107 detects the foreground component from which the motion blur is removed by the motion blur removing unit 106 based on the motion vector and the position information supplied from the motion detecting unit 102 and the region information input from the region specifying unit 103. The image is combined with an input arbitrary background image, and a combined image is generated and output.
[0044]
Next, an input image supplied to the image processing unit will be described with reference to FIGS.
[0045]
FIG. 7 is a diagram for explaining imaging by a sensor. The sensor is composed of, for example, a CCD video camera equipped with a CCD (Charge-Coupled Device) area sensor which is a solid-state image sensor. The object corresponding to the foreground in the real world moves horizontally between the object corresponding to the background and the sensor in the real world, for example, from the left side to the right side in the drawing.
[0046]
The sensor images an object corresponding to the foreground together with an object corresponding to the background. The sensor outputs the captured image in units of one frame. For example, the sensor outputs an image composed of 30 frames per second. The exposure time of the sensor can be 1/30 second. The exposure time is a period from the start of the conversion of the input light into the electric charge until the end of the conversion of the input light into the electric charge. Hereinafter, the exposure time is also referred to as shutter time.
[0047]
FIG. 8 is a diagram for explaining the arrangement of pixels. In FIG. 8, A to I indicate individual pixels. The pixels are arranged on a plane corresponding to the image. One detection element corresponding to one pixel is arranged on the sensor. When the sensor captures an image, one detection element outputs a pixel value corresponding to one pixel constituting the image. For example, the position of the detection element in the X direction corresponds to the horizontal position on the image, and the position of the detection element in the Y direction corresponds to the vertical position on the image.
[0048]
As shown in FIG. 9, for example, a detection element that is a CCD converts input light into electric charges for a period corresponding to a shutter time, and accumulates the converted electric charges. The amount of charge is approximately proportional to the intensity of input light and the time during which light is input. The detection element adds the electric charge converted from the input light to the already accumulated electric charge in a period corresponding to the shutter time. That is, the detection element integrates the input light for a period corresponding to the shutter time, and accumulates an amount of charge corresponding to the integrated light. It can be said that the detection element has an integration effect with respect to time.
[0049]
The electric charge accumulated in the detection element is converted into a voltage value by a circuit (not shown), and the voltage value is further converted into a pixel value such as digital data and output. Therefore, each pixel value output from the sensor is a value projected onto a one-dimensional space, which is the result of integrating a part of the object corresponding to the foreground or background having a spatial extent with respect to the shutter time. Have.
[0050]
The image processing unit extracts significant information buried in the output signal, for example, the mixing ratio α, by the accumulation operation of the sensor. The image processing unit adjusts the amount of distortion caused by the mixture of the foreground image objects themselves, for example, the amount of motion blur. In addition, the image processing unit adjusts the amount of distortion caused by the mixture of the foreground image object and the background image object.
[0051]
FIG. 10 is a diagram for explaining an image obtained by imaging an object corresponding to a moving foreground and an object corresponding to a stationary background. FIG. 10A shows an image obtained by imaging an object corresponding to a foreground with movement and an object corresponding to a stationary background. In the example shown in FIG. 10A, the object corresponding to the foreground is moving horizontally from the left to the right with respect to the screen.
[0052]
FIG. 10B is a model diagram in which pixel values corresponding to one line of the image shown in FIG. 10A are expanded in the time direction. The horizontal direction in FIG. 10B corresponds to the spatial direction X in FIG. 10A.
[0053]
The pixel value of the background region pixel is composed of only the background component, that is, the image component corresponding to the background object. The pixel value of the foreground region pixel is composed of only the foreground component, that is, the image component corresponding to the foreground object.
[0054]
The pixel value of the pixel in the mixed area is composed of a background component and a foreground component. Since the pixel value is composed of the background component and the foreground component, the mixed region can be said to be a distortion region. The mixed area is further classified into a covered background area and an uncovered background area.
[0055]
The covered background area is a mixed area at a position corresponding to the front end of the foreground object in the advancing direction with respect to the foreground area, and is an area where the background component is covered with the foreground as time passes.
[0056]
On the other hand, the uncovered background area is a mixed area at a position corresponding to the rear end portion of the foreground object in the advancing direction with respect to the foreground area, and an area where a background component appears as time passes. Say.
[0057]
In this way, an image including a foreground area, a background area, or a covered background area or an uncovered background area is input as an input image to the area specifying unit 103, the mixture ratio calculation unit 104, and the foreground / background separation unit 105. .
[0058]
FIG. 11 is a diagram illustrating the background area, the foreground area, the mixed area, the covered background area, and the uncovered background area as described above. In the case of corresponding to the image shown in FIG. 11, the background area is a static part, the foreground area is a moving part, and the covered background area of the mixed area is a part that changes from the background to the foreground. The uncovered background area is a portion that changes from the foreground to the background.
[0059]
FIG. 12 is a model diagram in which pixel values of pixels arranged in a row adjacent to each other in an image obtained by capturing an object corresponding to a stationary foreground and an object corresponding to a stationary background are expanded in the time direction. It is. For example, pixels arranged on one line of the screen can be selected as the pixels arranged adjacent to each other in one column.
[0060]
The pixel values F01 to F04 shown in FIG. 12 are pixel values corresponding to the still foreground object. The pixel values B01 to B04 shown in FIG. 12 are pixel values corresponding to the stationary background object.
[0061]
In the vertical direction in FIG. 12, time elapses from the top to the bottom in the figure. The position of the upper side of the rectangle in FIG. 12 corresponds to the time when the sensor starts to convert the input light into electric charge, and the position of the lower side of the rectangle in FIG. 12 indicates the electric charge of the light input by the sensor. Corresponds to the time to finish conversion of. That is, the distance from the upper side to the lower side of the rectangle in FIG. 12 corresponds to the shutter time.
[0062]
Hereinafter, a case where the shutter time and the frame interval are the same will be described as an example.
[0063]
The horizontal direction in FIG. 12 corresponds to the spatial direction X described in FIG. More specifically, in the example shown in FIG. 12, the distance from the left side of the rectangle described as “F01” in FIG. 12 to the right side of the rectangle described as “B04” is eight times the pixel pitch, That is, it corresponds to the interval between eight consecutive pixels.
[0064]
When the foreground object and the background object are stationary, the light input to the sensor does not change during the period corresponding to the shutter time.
[0065]
Here, the period corresponding to the shutter time is divided into two or more periods having the same length. For example, if the number of virtual divisions is 4, the model diagram shown in FIG. 12 can be represented as the model shown in FIG. The virtual division number is set corresponding to the amount of movement v of the object corresponding to the foreground within the shutter time. For example, the number of virtual divisions is 4 corresponding to the motion amount v being 4, and the period corresponding to the shutter time is divided into 4.
[0066]
The top row in the figure corresponds to the first divided period after the shutter opens. The second row from the top in the figure corresponds to the second divided period from when the shutter has opened. The third line from the top in the figure corresponds to the third divided period from when the shutter has opened. The fourth row from the top in the figure corresponds to the fourth divided period from when the shutter has opened.
[0067]
Hereinafter, the shutter time divided in accordance with the motion amount v is also referred to as shutter time / v.
[0068]
Since the light input to the sensor does not change when the object corresponding to the foreground is stationary, the foreground component F01 / v is equal to a value obtained by dividing the pixel value F01 by the virtual division number. Similarly, when the object corresponding to the foreground is stationary, the foreground component F02 / v is equal to the value obtained by dividing the pixel value F02 by the number of virtual divisions, and the foreground component F03 / v is the virtual value of the pixel value F03. The foreground component F04 / v is equal to the value obtained by dividing the pixel value F04 by the virtual division number.
[0069]
Since the light input to the sensor does not change when the object corresponding to the background is stationary, the background component B01 / v is equal to the value obtained by dividing the pixel value B01 by the virtual division number. Similarly, when the object corresponding to the background is stationary, the background component B02 / v is equal to the value obtained by dividing the pixel value B02 by the virtual division number, and B03 / v is obtained by dividing the pixel value B03 by the virtual division number. B04 / v is equal to a value obtained by dividing the pixel value B04 by the number of virtual divisions.
[0070]
That is, when the object corresponding to the foreground is stationary, the light corresponding to the foreground object input to the sensor does not change during the period corresponding to the shutter time. The foreground component F01 / v corresponding to, the foreground component F01 / v corresponding to the second shutter time / v after the shutter opens, and the third foreground corresponding to the shutter time / v corresponding to the shutter time / v. And the foreground component F01 / v corresponding to the fourth shutter time / v after the shutter is opened have the same value. F02 / v to F04 / v have the same relationship as F01 / v.
[0071]
When the object corresponding to the background is stationary, the light corresponding to the background object input to the sensor does not change during the period corresponding to the shutter time, so it corresponds to the first shutter time / v after the shutter opens. Background component B01 / v, the second background component B01 / v corresponding to the shutter time / v after the shutter opens, and the third background component corresponding to the shutter time / v corresponding to the shutter time / v B01 / v and the fourth background component B01 / v corresponding to the shutter time / v after the shutter is opened have the same value. B02 / v to B04 / v have the same relationship.
[0072]
Next, a case where the object corresponding to the foreground moves and the object corresponding to the background is stationary will be described.
[0073]
FIG. 14 is a model diagram in which pixel values of pixels on one line including the covered background area are expanded in the time direction when the object corresponding to the foreground moves toward the right side in the drawing. In FIG. 14, the foreground motion amount v is 4. Since one frame is a short time, it can be assumed that the object corresponding to the foreground is a rigid body and is moving at a constant speed. In FIG. 14, the image of the object corresponding to the foreground moves so as to be displayed on the right side by four pixels in the next frame with reference to a certain frame.
[0074]
In FIG. 14, the leftmost pixel through the fourth pixel from the left belong to the foreground area. In FIG. 14, the fifth through seventh pixels from the left belong to the mixed area, which is a covered background area. In FIG. 14, the rightmost pixel belongs to the background area.
[0075]
Since the object corresponding to the foreground is moving so as to cover the object corresponding to the background with the passage of time, the component included in the pixel value of the pixel belonging to the covered background area has a period corresponding to the shutter time. At this point, the background component is replaced by the foreground component.
[0076]
For example, the pixel value M with a thick line frame in FIG. 14 is expressed by Expression (1).
[0077]
M = B02 / v + B02 / v + F07 / v + F06 / v (1)
[0078]
For example, since the fifth pixel from the left includes a background component corresponding to one shutter time / v and includes a foreground component corresponding to three shutter times / v, the mixture ratio of the fifth pixel from the left α is 1/4. The sixth pixel from the left includes a background component corresponding to two shutter times / v and includes a foreground component corresponding to two shutter times / v. Therefore, the mixture ratio α of the sixth pixel from the left is 1/2. The seventh pixel from the left includes a background component corresponding to three shutter times / v, and includes a foreground component corresponding to one shutter time / v. Therefore, the mixture ratio α of the seventh pixel from the left is 3/4.
[0079]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed on the right side of four pixels in the next frame, for example, the fourth pixel from the left in FIG. The foreground component F07 / v of the first shutter time / v after the shutter opens is the foreground component corresponding to the second shutter time / v of the fifth pixel from the left in FIG. be equivalent to. Similarly, the foreground component F07 / v corresponds to the foreground component of the sixth pixel from the left in FIG. 14 corresponding to the third shutter time / v from when the shutter has opened, and the seventh pixel from the left in FIG. And the foreground component corresponding to the fourth shutter time / v after the shutter is opened.
[0080]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed on the right side of four pixels in the next frame, for example, the third pixel from the left in FIG. The foreground component F06 / v of the first shutter time / v after the shutter is opened is the foreground component corresponding to the second shutter time / v of the fourth pixel from the left in FIG. equal. Similarly, the foreground component F06 / v is the sixth pixel from the left in FIG. 14 and the foreground component of the fifth pixel from the left in FIG. 14 corresponding to the third shutter time / v from when the shutter has opened. And the foreground component corresponding to the fourth shutter time / v after the shutter is opened.
[0081]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed on the right side of four pixels in the next frame, for example, the second pixel from the left in FIG. The foreground component F05 / v of the first shutter time / v after the shutter is opened is the foreground component of the third pixel from the left in FIG. 14 corresponding to the second shutter time / v after the shutter is opened. equal. Similarly, the foreground component F05 / v is the fourth pixel from the left in FIG. 14 and the foreground component corresponding to the third shutter time / v after the shutter is opened, and the fifth pixel from the left in FIG. And the foreground component corresponding to the fourth shutter time / v after the shutter is opened.
[0082]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed so that the foreground image is displayed on the right side of four pixels in the next frame, for example, the shutter of the leftmost pixel in FIG. The foreground component F04 / v of the first shutter time / v after opening is equal to the foreground component of the second pixel from the left in FIG. 14 corresponding to the second shutter time / v after the shutter is opened. Similarly, the foreground component F04 / v corresponds to the foreground component of the third pixel from the left in FIG. 14 corresponding to the third shutter time / v from when the shutter has opened, and the fourth pixel from the left in FIG. And the foreground component corresponding to the fourth shutter time / v after the shutter is opened.
[0083]
Since the foreground area corresponding to the moving object includes motion blur as described above, it can be said to be a distortion area.
[0084]
FIG. 15 is a model diagram in which pixel values of pixels on one line including the uncovered background area are expanded in the time direction when the foreground moves toward the right side in the drawing. In FIG. 15, the foreground motion amount v is 4. Since one frame is a short time, it can be assumed that the object corresponding to the foreground is a rigid body and is moving at a constant speed. In FIG. 15, the image of the object corresponding to the foreground moves to the right by four pixels in the next frame with reference to a certain frame.
[0085]
In FIG. 15, the leftmost pixel through the fourth pixel from the left belong to the background area. In FIG. 15, the fifth through seventh pixels from the left belong to the mixed area that is an uncovered background. In FIG. 15, the rightmost pixel belongs to the foreground area.
[0086]
Since the object corresponding to the foreground that covered the object corresponding to the background is moved so as to be removed from the front of the object corresponding to the background over time, it is included in the pixel value of the pixel belonging to the uncovered background area The component to be changed from the foreground component to the background component at a certain point in time corresponding to the shutter time.
[0087]
For example, the pixel value M ′ with a thick frame in FIG. 15 is expressed by Expression (2).
[0088]
M '= F02 / v + F01 / v + B26 / v + B26 / v (2)
[0089]
For example, the fifth pixel from the left includes a background component corresponding to three shutter times / v, and includes a foreground component corresponding to one shutter time / v, so the mixing ratio of the fifth pixel from the left α is 3/4. The sixth pixel from the left includes a background component corresponding to two shutter times / v and includes a foreground component corresponding to two shutter times / v. Therefore, the mixture ratio α of the sixth pixel from the left is 1/2. Since the seventh pixel from the left includes a background component corresponding to one shutter time / v and includes a foreground component corresponding to three shutter times / v, the mixture ratio α of the seventh pixel from the left is 1/4.
[0090]
When the expressions (1) and (2) are generalized, the pixel value M is expressed by the expression (3).
[0091]
[Expression 1]

... (3)
Here, α is a mixing ratio. B is a background pixel value, and Fi / v is a foreground component.
[0092]
Since the object corresponding to the foreground is a rigid body and can be assumed to move at a constant speed, and the amount of movement v is 4, for example, the first pixel from the left in FIG. The foreground component F01 / v of the shutter time / v is equal to the foreground component of the sixth pixel from the left in FIG. 15 corresponding to the second shutter time / v after the shutter is opened. Similarly, F01 / v represents the foreground component of the seventh pixel from the left in FIG. 15 corresponding to the third shutter time / v from when the shutter has opened, and the eighth pixel from the left in FIG. , And the foreground component corresponding to the fourth shutter time / v after the shutter is opened.
[0093]
Since it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the number of virtual divisions is 4, for example, the first pixel from the left in FIG. The foreground component F02 / v of the shutter time / v is equal to the foreground component of the seventh pixel from the left in FIG. 15 corresponding to the second shutter time / v after the shutter is opened. Similarly, the foreground component F02 / v is equal to the foreground component of the eighth pixel from the left in FIG. 15 corresponding to the third shutter time / v from when the shutter has opened.
[0094]
Since the object corresponding to the foreground is a rigid body and can be assumed to move at a constant speed and the amount of movement v is 4, for example, the seventh pixel from the left in FIG. The foreground component F03 / v of the shutter time / v is equal to the foreground component of the eighth pixel from the left in FIG. 15 corresponding to the second shutter time / v after the shutter is opened.
[0095]
In the description of FIGS. 13 to 15, it has been described that the virtual division number is 4, but the virtual division number corresponds to the motion amount v. The amount of movement v generally corresponds to the moving speed of the object corresponding to the foreground. For example, when the object corresponding to the foreground is moving so as to be displayed to the right by four pixels in the next frame with reference to a certain frame, the amount of movement v is 4. Corresponding to the motion amount v, the number of virtual divisions is 4. Similarly, for example, when the object corresponding to the foreground is moving so that it is displayed on the left by 6 pixels in the next frame with reference to a certain frame, the motion amount v is set to 6, and the number of virtual divisions is , 6.
[0096]
FIG. 16 and FIG. 17 show the above-described mixed area composed of the foreground area, the background area, the covered background area, or the uncovered background area, and the foreground components and the background components corresponding to the divided shutter times. The relationship is shown.
[0097]
FIG. 16 shows an example in which pixels in the foreground area, background area, and mixed area are extracted from an image including a foreground corresponding to an object moving in front of a stationary background. In the example shown in FIG. 16, the object corresponding to the foreground is moving horizontally with respect to the screen.
[0098]
Frame # n + 1 is the next frame after frame #n, and frame # n + 2 is the next frame after frame # n + 1.
[0099]
Extract the pixels in the foreground area, background area, and mixed area extracted from any of frame #n to frame # n + 2, set the amount of motion v to 4, and set the pixel values of the extracted pixels in the time direction The developed model is shown in FIG.
[0100]
Since the object corresponding to the foreground moves, the pixel value in the foreground area is composed of four different foreground components corresponding to the shutter time / v period. For example, the leftmost pixel among the pixels in the foreground area shown in FIG. 17 is composed of F01 / v, F02 / v, F03 / v, and F04 / v. That is, the pixels in the foreground area include motion blur.
[0101]
Since the object corresponding to the background is stationary, the light corresponding to the background input to the sensor does not change during the period corresponding to the shutter time. In this case, the pixel value in the background area does not include motion blur.
[0102]
The pixel value of the pixel belonging to the mixed area composed of the covered background area or the uncovered background area is composed of a foreground component and a background component.
[0103]
Next, when the image corresponding to the object is moving, the pixel values of the pixels at the same position on the frame that are adjacent to each other in a plurality of frames are developed in the time direction. The model will be described. For example, when the image corresponding to the object moves horizontally with respect to the screen, the pixels arranged on one line of the screen can be selected as the pixels arranged in a row adjacent to each other.
[0104]
FIG. 18 shows pixels arranged in a row adjacent to three frames of an image obtained by capturing an object corresponding to a stationary background, and the pixel values of the pixels at the same position on the frame are represented by time. It is the model figure developed in the direction. Frame #n is the next frame after frame # n-1, and frame # n + 1 is the next frame after frame #n. Other frames are also referred to in the same manner.
[0105]
The pixel values B01 to B12 shown in FIG. 18 are pixel values corresponding to the stationary background object. Since the object corresponding to the background is stationary, the pixel value of the corresponding pixel does not change in frame # n−1 to frame n + 1. For example, the pixel in frame #n and the pixel in frame # n + 1 corresponding to the position of the pixel having a pixel value of B05 in frame # n−1 each have a pixel value of B05.
[0106]
FIG. 19 shows pixels arranged in a row adjacent to each other in three frames of an image obtained by imaging an object corresponding to the foreground moving to the right side in the drawing together with an object corresponding to the stationary background, FIG. 5 is a model diagram in which pixel values of pixels at the same position on a frame are developed in the time direction. The model shown in FIG. 19 includes a covered background area.
[0107]
In FIG. 19, it can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image is moved so that it is displayed on the right side by four pixels in the next frame. 4 and the number of virtual divisions is 4.
[0108]
For example, the foreground component of the leftmost pixel of frame # n-1 in FIG. 19 for the first shutter time / v after the shutter opens is F12 / v, and the second pixel from the left in FIG. The foreground component of the second shutter time / v after the shutter is opened is also F12 / v. The foreground component of the third pixel from the left in FIG. 19 corresponding to the third shutter time / v from when the shutter has opened, and the fourth shutter time from the shutter opening of the fourth pixel from the left in FIG. The foreground component of / v is F12 / v.
[0109]
The foreground component of the leftmost pixel in frame # n−1 in FIG. 19 corresponding to the second shutter time / v from when the shutter has opened is F11 / v, and the second pixel from the left in FIG. The foreground component of the third shutter time / v after the shutter is opened is also F11 / v. The foreground component of the third pixel from the left in FIG. 19 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is F11 / v.
[0110]
The foreground component of the leftmost pixel of frame # n-1 in FIG. 19 corresponding to the third shutter time / v from when the shutter has opened is F10 / v, and the second pixel from the left in FIG. The foreground component of the fourth shutter time / v after the shutter is opened is also F10 / v. The foreground component of the leftmost pixel of frame # n−1 in FIG. 19 corresponding to the fourth shutter time / v from when the shutter has opened is F09 / v.
[0111]
Since the object corresponding to the background is stationary, the background component of the second pixel from the left of frame # n-1 in FIG. 19 corresponding to the first shutter time / v after the shutter is opened is B01 / v Become. The background component of the third pixel from the left of frame # n−1 in FIG. 19 corresponding to the first and second shutter time / v from when the shutter has opened is B02 / v. The background component of the fourth pixel from the left of frame # n−1 in FIG. 19 corresponding to the first through third shutter time / v from when the shutter has opened is B03 / v.
[0112]
In frame # n−1 in FIG. 19, the leftmost pixel belongs to the foreground area, and the second through fourth pixels from the left belong to the mixed area, which is a covered background area.
[0113]
The fifth through twelfth pixels from the left of frame # n−1 in FIG. 19 belong to the background area, and the pixel values thereof are B04 through B11, respectively.
[0114]
The first through fifth pixels from the left of frame #n in FIG. 19 belong to the foreground area. The foreground component of the shutter time / v in the foreground area of frame #n is any one of F05 / v to F12 / v.
[0115]
It can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image moves so as to be displayed on the right side of four pixels in the next frame, so from the left of frame #n in FIG. The foreground component of the fifth pixel of the first shutter time / v after the shutter opens is F12 / v, and the sixth pixel from the left in FIG. The foreground component is also F12 / v. The foreground component of the seventh pixel from the left in FIG. 19 corresponding to the third shutter time / v from when the shutter has opened, and the fourth shutter time from the shutter opening of the eighth pixel from the left in FIG. The foreground component of / v is F12 / v.
[0116]
The foreground component of the fifth pixel from the left in frame #n in FIG. 19 corresponding to the second shutter time / v from when the shutter has opened is F11 / v, and the sixth pixel from the left in FIG. The foreground component of the third shutter time / v after the shutter is opened is also F11 / v. The foreground component of the seventh pixel from the left in FIG. 19 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is F11 / v.
[0117]
The foreground component of the fifth pixel from the left in frame #n in FIG. 19 corresponding to the third shutter time / v from when the shutter has opened is F10 / v, and the sixth pixel from the left in FIG. The foreground component of the fourth shutter time / v after the shutter is opened is also F10 / v. The foreground component of the fifth pixel from the left of frame #n in FIG. 19 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is F09 / v.
[0118]
Since the object corresponding to the background is stationary, the background component of the sixth pixel from the left in frame #n in FIG. 19 corresponding to the first shutter time / v after the shutter is opened is B05 / v. The background component of the seventh pixel from the left of frame #n in FIG. 19 corresponding to the first and second shutter time / v from when the shutter has opened is B06 / v. The background component of the eighth pixel from the left of frame #n in FIG. 19 corresponding to the first through third shutter time / v from when the shutter has opened is B07 / v.
[0119]
In frame #n in FIG. 19, the sixth through eighth pixels from the left belong to the mixed area, which is a covered background area.
[0120]
The ninth through twelfth pixels from the left of frame #n in FIG. 19 belong to the background area, and the pixel values thereof are B08 through B11, respectively.
[0121]
The first through ninth pixels from the left of frame # n + 1 in FIG. 19 belong to the foreground area. The foreground component of the shutter time / v in the foreground area of frame # n + 1 is any one of F01 / v to F12 / v.
[0122]
It can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and moves so that the foreground image is displayed on the right side by four pixels in the next frame. The foreground component of the ninth pixel from the left when the shutter opens is the first shutter time / v is F12 / v, and the tenth pixel from the left in FIG. 19 is the second shutter time after the shutter is opened. The foreground component of / v is also F12 / v. The foreground component of the eleventh pixel from the left in FIG. 19 corresponding to the third shutter time / v from when the shutter has opened, and the fourth shutter time from the shutter opening of the twelfth pixel from the left in FIG. The foreground component of / v is F12 / v.
[0123]
The foreground component of the ninth pixel from the left in frame # n + 1 in FIG. 19 corresponding to the second shutter time / v from when the shutter has opened is F11 / v, which is the tenth from the left in FIG. The foreground component of the third shutter time / v after the shutter opens is also F11 / v. The foreground component of the eleventh pixel from the left in FIG. 19 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is F11 / v.
[0124]
The foreground component of the ninth pixel from the left of frame # n + 1 in FIG. 19 corresponding to the third shutter time / v from when the shutter has opened is F10 / v, which is the tenth pixel from the left in FIG. The foreground component of the fourth shutter time / v after the shutter is opened is also F10 / v. The foreground component of the ninth pixel from the left of frame # n + 1 in FIG. 19 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is F09 / v.
[0125]
Since the object corresponding to the background is stationary, the background component of the tenth pixel from the left of frame # n + 1 in FIG. 19 corresponding to the first shutter time / v after the shutter is opened is B09 / v Become. The background component of the eleventh pixel from the left of frame # n + 1 in FIG. 19 corresponding to the first and second shutter time / v from when the shutter has opened is B10 / v. The background component of the twelfth pixel from the left of frame # n + 1 in FIG. 19 corresponding to the first through third shutter time / v from when the shutter has opened is B11 / v.
[0126]
In frame # n + 1 in FIG. 19, the tenth through twelfth pixels from the left correspond to the mixed region, which is a covered background region.
[0127]
FIG. 20 is a model diagram of an image obtained by extracting foreground components from the pixel values shown in FIG.
[0128]
FIG. 21 shows pixels arranged in a row adjacent to each other in three frames of an image obtained by capturing a foreground corresponding to an object moving to the right side in the figure together with a stationary background. It is the model figure which expand | deployed the pixel value of the pixel of the position of the time direction. In FIG. 21, an uncovered background area is included.
[0129]
In FIG. 21, it can be assumed that the object corresponding to the foreground is a rigid body and is moving at a constant speed. Since the object corresponding to the foreground is moved so as to be displayed on the right side by four pixels in the next frame, the motion amount v is 4.
[0130]
For example, the foreground component of the leftmost pixel of frame # n-1 in FIG. 21 that is the first shutter time / v after the shutter is opened is F13 / v, and is the second pixel from the left in FIG. The foreground component of the second shutter time / v after the shutter is opened is also F13 / v. The foreground component of the third pixel from the left in FIG. 21 corresponding to the third shutter time / v from when the shutter has opened, and the fourth shutter time from the shutter opening of the fourth pixel from the left in FIG. The foreground component of / v is F13 / v.
[0131]
The foreground component of the second pixel from the left of frame # n-1 in FIG. 21 corresponding to the first shutter time / v after the shutter opens is F14 / v, and the third pixel from the left in FIG. The foreground component of the second shutter time / v after the shutter is opened is also F14 / v. The foreground component of the third pixel from the left in FIG. 21 corresponding to the first shutter time / v from when the shutter has opened is F15 / v.
[0132]
Since the object corresponding to the background is stationary, the background component of the leftmost pixel of frame # n−1 in FIG. 21 corresponding to the second to fourth shutter time / v from the shutter opening is B25. / v. The background components of the second pixel from the left of frame # n−1 in FIG. 21 corresponding to the third and fourth shutter time / v from when the shutter has opened are B26 / v. The background component of the third pixel from the left of frame # n−1 in FIG. 21 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is B27 / v.
[0133]
In frame # n−1 in FIG. 21, the leftmost pixel through the third pixel belong to the mixed area, which is an uncovered background area.
[0134]
The fourth through twelfth pixels from the left of frame # n−1 in FIG. 21 belong to the foreground area. The foreground component of the frame is any one of F13 / v to F24 / v.
[0135]
The leftmost pixel through the fourth pixel from the left in frame #n in FIG. 21 belong to the background area, and the pixel values thereof are B25 through B28, respectively.
[0136]
It can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and moves so that the foreground image is displayed on the right side of four pixels in the next frame. The foreground component of the fifth pixel at the first shutter time / v after the shutter opens is F13 / v, and the sixth shutter pixel from the left in FIG. The foreground component is also F13 / v. The foreground component of the seventh pixel from the left in FIG. 21 corresponding to the third shutter time / v from when the shutter has opened, and the fourth shutter time from the shutter opening of the eighth pixel from the left in FIG. The foreground component of / v is F13 / v.
[0137]
The foreground component of the sixth pixel from the left of frame #n in FIG. 21 corresponding to the first shutter time / v from when the shutter has opened is F14 / v, and the seventh pixel from the left in FIG. The foreground component of the second shutter time / v after opening is also F14 / v. The foreground component of the eighth pixel from the left in FIG. 21 corresponding to the first portion of the shutter time / v from when the shutter has opened is F15 / v.
[0138]
Since the object corresponding to the background is stationary, the background component of the fifth pixel from the left of frame #n in FIG. 21 corresponding to the second through fourth shutter time / v from when the shutter has opened is B29 / v. The background component of the sixth pixel from the left of frame #n in FIG. 21 corresponding to the third and fourth shutter time / v from when the shutter has opened is B30 / v. The background component of the seventh pixel from the left of frame #n in FIG. 21 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is B31 / v.
[0139]
In frame #n in FIG. 21, the fifth through seventh pixels from the left belong to the mixed area, which is an uncovered background area.
[0140]
The eighth through twelfth pixels from the left of frame #n in FIG. 21 belong to the foreground area. The value corresponding to the period of the shutter time / v in the foreground area of frame #n is any one of F13 / v to F20 / v.
[0141]
The leftmost pixel through the eighth pixel from the left of frame # n + 1 in FIG. 21 belong to the background area, and the pixel values thereof are B25 through B32, respectively.
[0142]
It can be assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed, and the foreground image moves so as to be displayed on the right side by four pixels in the next frame, so that the frame # n + 1 in FIG. The foreground component of the ninth pixel from the left when the shutter opens is the first shutter time / v is F13 / v, and the tenth pixel from the left in FIG. The foreground component of / v is also F13 / v. The foreground component of the eleventh pixel from the left in FIG. 21 corresponding to the third shutter time / v from when the shutter has opened, and the fourth shutter time from the shutter in the twelfth pixel from the left in FIG. The foreground component of / v is F13 / v.
[0143]
The foreground component of the tenth pixel from the left of frame # n + 1 in FIG. 21 corresponding to the first shutter time / v from when the shutter has opened is F14 / v, and the eleventh pixel from the left in FIG. The foreground component of the second shutter time / v after the shutter is opened is also F14 / v. The foreground component of the twelfth pixel from the left in FIG. 21 corresponding to the first shutter time / v from when the shutter has opened is F15 / v.
[0144]
Since the object corresponding to the background is stationary, the background component of the ninth pixel from the left of frame # n + 1 in FIG. , B33 / v. The background component of the tenth pixel from the left of frame # n + 1 in FIG. 21 corresponding to the third and fourth shutter time / v from when the shutter has opened is B34 / v. The background component of the eleventh pixel from the left of frame # n + 1 in FIG. 21 corresponding to the fourth portion of the shutter time / v from when the shutter has opened is B35 / v.
[0145]
In frame # n + 1 in FIG. 21, the ninth through eleventh pixels from the left belong to the mixed area, which is an uncovered background area.
[0146]
The twelfth pixel from the left of frame # n + 1 in FIG. 21 belongs to the foreground area. The foreground component of the shutter time / v in the foreground area of frame # n + 1 is any one of F13 / v to F16 / v.
[0147]
FIG. 22 is a model diagram of an image obtained by extracting foreground components from the pixel values shown in FIG.
[0148]
Returning to FIG. 6, the region specifying unit 103 associates a flag indicating that the pixel belongs to the foreground region, the background region, the covered background region, or the uncovered background region for each pixel by using the pixel values of a plurality of frames. Thus, the region information is supplied to the mixture ratio calculation unit 104 and the motion blur removal unit 106.
[0149]
The mixture ratio calculation unit 104 calculates the mixture ratio α for each pixel for the pixels included in the mixed region based on the pixel values of a plurality of frames and the region information, and supplies the calculated mixture ratio α to the foreground / background separation unit 105. Supply.
[0150]
The foreground / background separation unit 105 extracts a foreground component image including only foreground components based on the pixel values of a plurality of frames, region information, and the mixture ratio α, and supplies the foreground component image to the motion blur removal unit 106.
[0151]
The motion blur removal unit 106 converts the foreground component image based on the foreground component image supplied from the foreground / background separation unit 105, the motion vector supplied from the motion detection unit 102, and the region information supplied from the region specifying unit 103. The motion blur included is removed, and a foreground component image from which motion blur is removed is output.
[0152]
With reference to the flowchart of FIG. 23, the composite image generation processing by the image processing unit will be described. In step S11, the area specifying unit 103 obtains area information indicating whether each pixel of the input image belongs to the foreground area, the background area, the covered background area, or the uncovered background area based on the input image. A process for specifying the area to be generated is executed. Details of the area specifying process will be described later. The area specifying unit 103 supplies the generated area information to the mixture ratio calculating unit 104.
[0153]
In step S11, the area specifying unit 103, based on the input image, foreground area, background area, or mixed area for each pixel of the input image (does not distinguish between a covered background area or an uncovered background area). It may be possible to generate region information indicating which one of the items belongs to. In this case, the foreground / background separation unit 105 and the motion blur removal unit 106 determine whether the mixed region is a covered background region or an uncovered background region based on the direction of the motion vector. For example, when the foreground region, the mixed region, and the background region are arranged in order corresponding to the direction of the motion vector, the mixed region is determined as the covered background region, and the background is determined corresponding to the direction of the motion vector. When the region, the mixed region, and the foreground region are arranged in this order, the mixed region is determined as an uncovered background region.
[0154]
In step S12, the mixture ratio calculation unit 104 calculates the mixture ratio α for each pixel included in the mixed area based on the input image and the area information. Details of the mixing ratio calculation process will be described later. The mixture ratio calculation unit 104 supplies the calculated mixture ratio α to the foreground / background separation unit 105.
[0155]
In step S13, the foreground / background separator 105 extracts a foreground component from the input image based on the region information and the mixture ratio α, and supplies the foreground component image to the motion blur removal unit 106 as a foreground component image.
[0156]
In step S14, the motion blur removal unit 106 is a continuous pixel lined up in the motion direction based on the motion vector and the region information, and belongs to any one of the uncovered background region, the foreground region, and the covered background region. A processing unit indicating the position of the object on the image is generated, and the amount of motion blur included in the foreground component corresponding to the processing unit is removed. Details of the motion blur removal process will be described later.
[0157]
In step S15, the image processing unit determines whether or not the process has been completed for the entire screen. If it is determined that the process has not been completed for the entire screen, the process proceeds to step S14, and the foreground corresponding to the processing unit is determined. The motion blur removal process for the component is repeated.
[0158]
If it is determined in step S15 that the processing has been completed for the entire screen, in step S16, the combining unit 107 combines the foreground component image and an arbitrary background image, generates a combined image, and outputs the combined image. Details of the composite image generation processing will be described later.
[0159]
In this manner, the image processing unit can separate the foreground and the background, remove the motion blur included in the foreground, and synthesize it with an arbitrary background image.
[0160]
Hereinafter, the configurations of the area specifying unit 103, the mixture ratio calculation unit 104, the foreground / background separation unit 105, and the motion blur removal unit 106 will be described.
[0161]
FIG. 24 is a block diagram illustrating a configuration of the area specifying unit 103. The area specifying unit 103 shown in FIG. 24 does not use a motion vector. The frame memory 201 stores the input image in units of frames. When the processing target is the frame #n, the frame memory 201 is a frame # n-2 that is a frame immediately before the frame #n, a frame # n-1 that is a frame immediately before the frame #n, A frame #n, a frame # n + 1 that is a frame subsequent to the frame #n, and a frame # n + 2 that is a frame subsequent to the frame #n are stored.
[0162]
The static motion determination unit 202-1 determines the pixel value of the pixel of frame # n + 2 at the same position on the image of the pixel that is the target of region specification of frame #n, and the region specification of frame #n. The pixel value of the pixel of frame # n + 1 at the same position as the position of the target pixel on the image is read from the frame memory 201, and the absolute value of the difference between the read pixel values is calculated. The static motion determination unit 202-1 determines whether or not the absolute value of the difference between the pixel value of frame # n + 2 and the pixel value of frame # n + 1 is greater than a preset threshold Th, When it is determined that the absolute value of the difference is greater than the threshold value Th, a static motion determination indicating motion is supplied to the region determination unit 203-1. When it is determined that the absolute value of the difference between the pixel value of the pixel of frame # n + 2 and the pixel value of the pixel of frame # n + 1 is equal to or less than the threshold value Th, the static motion determination unit 202-1 The static motion determination shown is supplied to the region determination unit 203-1.
[0163]
The static motion determination unit 202-2 is the target of the pixel value of the frame # n + 1 at the same position on the image of the pixel that is the target of region identification of the frame #n, and the target of the frame #n. The pixel value of the pixel is read from the frame memory 201, and the absolute value of the difference between the pixel values is calculated. The static motion determination unit 202-2 determines whether or not the absolute value of the difference between the pixel value of the frame # n + 1 and the pixel value of the frame #n is greater than a preset threshold value Th. When it is determined that the absolute value of the difference between the two is greater than the threshold value Th, a static motion determination indicating motion is supplied to the region determination unit 203-1 and the region determination unit 203-2. When it is determined that the absolute value of the difference between the pixel value of the pixel of frame # n + 1 and the pixel value of the pixel of frame #n is equal to or less than the threshold value Th, the static motion determination unit 202-2 indicates stillness. The static motion determination is supplied to the region determination unit 203-1 and the region determination unit 203-2.
[0164]
The static motion determination unit 202-3 determines the frame #n at the same position as the pixel value of the pixel that is the region specification target of the frame #n and the position of the pixel that is the region specification target of the frame #n. The pixel value of −1 pixel is read from the frame memory 201, and the absolute value of the difference between the pixel values is calculated. The static motion determination unit 202-3 determines whether or not the absolute value of the difference between the pixel value of the frame #n and the pixel value of the frame # n-1 is larger than a preset threshold value Th. When it is determined that the absolute value of the difference between the two is greater than the threshold value Th, a static motion determination indicating motion is supplied to the region determination unit 203-2 and the region determination unit 203-3. When it is determined that the absolute value of the difference between the pixel value of the pixel of frame #n and the pixel value of the pixel of frame # n-1 is equal to or less than the threshold value Th, the static motion determination unit 202-3 indicates the still state The static motion determination is supplied to the region determination unit 203-2 and the region determination unit 203-3.
[0165]
The static motion determination unit 202-4 determines the pixel value of the pixel of frame # n-1 at the same position on the image of the pixel that is the target of region specification of frame #n, and the region specification of frame #n. The pixel value of the pixel of frame # n-2 located at the same position on the image of the target pixel is read from the frame memory 201, and the absolute value of the difference between the pixel values is calculated. The static motion determination unit 202-4 determines whether or not the absolute value of the difference between the pixel value of the frame # n-1 and the pixel value of the frame # n-2 is greater than a preset threshold Th, When it is determined that the absolute value of the difference between the pixel values is greater than the threshold value Th, a static motion determination indicating motion is supplied to the region determination unit 203-3. When it is determined that the absolute value of the difference between the pixel value of the pixel of frame # n-1 and the pixel value of the pixel of frame # n-2 is equal to or less than the threshold value Th, the static motion determination unit 202-4 Is supplied to the region determination unit 203-3.
[0166]
The region determination unit 203-1 is configured such that the static motion determination supplied from the static motion determination unit 202-1 indicates stillness and the static motion determination supplied from the static motion determination unit 202-2 indicates movement. The pixel that is the target of region identification in frame #n is determined to belong to the uncovered background region, and the uncovered background region determination flag corresponding to the pixel that is determined to belong to the region belongs to the uncovered background region. “1” indicating “” is set.
[0167]
The area determination unit 203-1 indicates that the static motion determination supplied from the static motion determination unit 202-1 indicates movement, or the static motion determination supplied from the static motion determination unit 202-2 indicates stillness. When determining that the pixel that is the target of region identification in frame #n does not belong to the uncovered background region, the uncovered background region determination flag corresponding to the pixel to be determined for the region is set to the uncovered background region. “0” is set to indicate that it does not belong.
[0168]
The area determination unit 203-1 supplies the uncovered background area determination flag in which “1” or “0” is set as described above to the determination flag storage frame memory 204.
[0169]
The region determination unit 203-2 is configured such that the static motion determination supplied from the static motion determination unit 202-2 indicates static and the static motion determination supplied from the static motion determination unit 202-3 indicates static. Then, it is determined that the pixel that is the target of region identification in frame #n belongs to the still region, and “1” indicating that it belongs to the still region is set in the still region determination flag corresponding to the pixel to be determined for the region.
[0170]
In the area determination unit 203-2, the static motion determination supplied from the static motion determination unit 202-2 indicates a motion, or the static motion determination supplied from the static motion determination unit 202-3 indicates a motion. At this time, it is determined that the pixel that is the region identification target in frame #n does not belong to the still region, and “0” indicating that it does not belong to the still region is set in the still region determination flag corresponding to the pixel to be determined for the region Set.
[0171]
The region determination unit 203-2 supplies the still region determination flag in which “1” or “0” is set as described above to the determination flag storage frame memory 204.
[0172]
The region determination unit 203-2 is configured such that the static motion determination supplied from the static motion determination unit 202-2 indicates movement and the static motion determination supplied from the static motion determination unit 202-3 indicates movement. Then, it is determined that the pixel that is the target of region identification in frame #n belongs to the motion region, and “1” indicating that it belongs to the motion region is set in the motion region determination flag corresponding to the pixel determined for the region.
[0173]
The region determination unit 203-2 indicates that the static motion determination supplied from the static motion determination unit 202-2 indicates static or the static motion determination supplied from the static motion determination unit 202-3 indicates static. At this time, it is determined that the pixel that is the region identification target in frame #n does not belong to the motion region, and “0” indicating that it does not belong to the motion region is set in the motion region determination flag corresponding to the pixel that is determined to be the region. Set.
[0174]
The region determination unit 203-2 supplies the motion region determination flag set to “1” or “0” to the determination flag storage frame memory 204 in this way.
[0175]
The region determination unit 203-3 is configured such that the static motion determination supplied from the static motion determination unit 202-3 indicates movement and the static motion determination supplied from the static motion determination unit 202-4 indicates stillness. , It is determined that the pixel that is the target of region identification in frame #n belongs to the covered background region, and the covered background region determination flag corresponding to the pixel to be determined of the region indicates that it belongs to the covered background region. 1 ”is set.
[0176]
In the area determination unit 203-3, the static motion determination supplied from the static motion determination unit 202-3 indicates stillness, or the static motion determination supplied from the static motion determination unit 202-4 indicates movement. When determining that the pixel that is the target of region identification in frame #n does not belong to the covered background region, the covered background region determination flag corresponding to the pixel to be determined for the region does not belong to the covered background region. “0” is set to indicate.
[0177]
The area determination unit 203-3 supplies the covered background area determination flag set to “1” or “0” to the determination flag storage frame memory 204 in this way.
[0178]
The determination flag storage frame memory 204 is supplied from the uncovered background region determination flag supplied from the region determination unit 203-1, the still region determination flag supplied from the region determination unit 203-2, and the region determination unit 203-2. The movement area determination flag and the covered background area determination flag supplied from the area determination unit 203-3 are stored.
[0179]
The determination flag storage frame memory 204 supplies the stored uncovered background area determination flag, still area determination flag, motion area determination flag, and covered background area determination flag to the synthesis unit 205. Based on the uncovered background area determination flag, the still area determination flag, the motion area determination flag, and the covered background area determination flag supplied from the determination flag storage frame memory 204, the combining unit 205 Area information indicating that it belongs to any one of the covered background area, the stationary area, the motion area, and the covered background area is generated and supplied to the determination flag storage frame memory 206.
[0180]
The determination flag storage frame memory 206 stores the area information supplied from the synthesis unit 205 and outputs the stored area information.
[0181]
Next, an example of processing of the area specifying unit 103 will be described with reference to FIGS.
[0182]
When the object corresponding to the foreground is moving, the position on the screen of the image corresponding to the object changes for each frame. As shown in FIG. 25, in frame #n, an image corresponding to an object located at a position indicated by Yn (x, y) is Yn + 1 (x, y in frame # n + 1 which is the next frame. Located in y).
[0183]
FIG. 26 shows a model diagram in which pixel values of pixels arranged in a row adjacent to the moving direction of the image corresponding to the foreground object are developed in the time direction. For example, when the moving direction of the image corresponding to the foreground object is horizontal with respect to the screen, the model diagram in FIG. 26 shows a model in which pixel values of adjacent pixels on one line are expanded in the time direction.
[0184]
In FIG. 26, the line in frame #n is the same as the line in frame # n + 1.
[0185]
Foreground components corresponding to the objects included in the second through thirteenth pixels from the left in frame #n are included in the sixth through seventeenth pixels from the left in frame # n + 1.
[0186]
In frame #n, the pixels belonging to the covered background area are the 11th to 13th pixels from the left, and the pixels belonging to the uncovered background area are the 2nd to 4th pixels from the left. In frame # n + 1, the pixels belonging to the covered background area are the 15th to 17th pixels from the left, and the pixels belonging to the uncovered background area are the 6th to 8th pixels from the left.
[0187]
In the example shown in FIG. 26, the foreground component included in frame #n has moved four pixels in frame # n + 1, so the amount of motion v is four. The virtual division number corresponds to the motion amount v and is 4.
[0188]
Next, changes in pixel values of pixels belonging to the mixed region before and after the frame of interest will be described.
[0189]
In frame #n shown in FIG. 27 where the background is stationary and the foreground motion amount v is 4, the pixels belonging to the covered background area are the fifteenth through seventeenth pixels from the left. Since the motion amount v is 4, in the previous frame # n−1, the fifteenth through seventeenth pixels from the left include only background components and belong to the background area. In frame # n-2, the fifteenth through seventeenth pixels from the left include only background components and belong to the background area.
[0190]
Here, since the object corresponding to the background is stationary, the pixel value of the fifteenth pixel from the left in frame # n-1 does not change from the pixel value of the fifteenth pixel from the left in frame # n-2. . Similarly, the pixel value of the 16th pixel from the left of frame # n-1 does not change from the pixel value of the 16th pixel from the left of frame # n-2, and the 17th pixel from the left of frame # n-1 The pixel value of this pixel does not change from the pixel value of the 17th pixel from the left in frame # n-2.
[0191]
That is, the pixels of frame # n-1 and frame # n-2 corresponding to the pixels belonging to the covered background area in frame #n are composed of only background components, and the pixel value does not change. The value is almost zero. Therefore, the static motion determination for the pixels in frame # n-1 and frame # n-2 corresponding to the pixels belonging to the mixed region in frame #n is determined as static by the static motion determination unit 202-4.
[0192]
Since the pixels belonging to the covered background area in frame #n include the foreground components, the pixel values are different from the case of only the background components in frame # n-1. Therefore, the static motion determination for the pixels belonging to the mixed region in frame #n and the corresponding pixels in frame # n-1 is determined as motion by the static motion determination unit 202-3.
[0193]
As described above, the region determination unit 203-3 is supplied with the result of the static motion determination indicating the motion from the static motion determination unit 202-3, and is supplied with the result of the static motion determination indicating the static motion from the static motion determination unit 202-4. When it is done, it is determined that the corresponding pixel belongs to the covered background area.
[0194]
In frame #n shown in FIG. 28 where the background is stationary and the foreground motion amount v is 4, the pixels included in the uncovered background area are the second through fourth pixels from the left. Since the motion amount v is 4, in the next frame # n + 1, the second through fourth pixels from the left include only background components and belong to the background area. Further, in the next frame # n + 2, the second through fourth pixels from the left include only background components and belong to the background area.
[0195]
Here, since the object corresponding to the background is stationary, the pixel value of the second pixel from the left of frame # n + 2 does not change from the pixel value of the second pixel from the left of frame # n + 1. . Similarly, the pixel value of the third pixel from the left of frame # n + 2 does not change from the pixel value of the third pixel from the left of frame # n + 1, and is the fourth from the left of frame # n + 2. The pixel value of this pixel does not change from the pixel value of the fourth pixel from the left in frame # n + 1.
[0196]
That is, the pixels of frame # n + 1 and frame # n + 2, which correspond to the pixels belonging to the uncovered background area in frame #n, consist only of background components, and the pixel value does not change. The absolute value is almost zero. Therefore, the static motion determination for the pixels in frame # n + 1 and frame # n + 2 corresponding to the pixels belonging to the mixed region in frame #n is determined as static by the static motion determination unit 202-1.
[0197]
Since the pixels belonging to the uncovered background area in frame #n include the foreground components, the pixel values are different from the case of only the background components in frame # n + 1. Therefore, the static motion determination for the pixels belonging to the mixed region in frame #n and the corresponding pixels in frame # n + 1 is determined as motion by the static motion determination unit 202-2.
[0198]
As described above, the region determination unit 203-1 is supplied with the result of the static motion determination indicating the motion from the static motion determination unit 202-2, and is supplied with the result of the static motion determination indicating the static motion from the static motion determination unit 202-1. Is determined to belong to the uncovered background area.
[0199]
FIG. 29 is a diagram illustrating determination conditions of the area specifying unit 103 in frame #n. The pixel in frame # n-2 at the same position on the image of the pixel to be judged in frame #n and the same position on the image of the pixel to be judged in frame #n A pixel in frame # n-1 is determined to be stationary, and a pixel in frame # n-1 and a pixel in frame #n at the same position on the image of the pixel to be determined in frame #n Are determined to be movements, the area specifying unit 103 determines that the pixel to be determined for frame #n belongs to the covered background area.
[0200]
The pixel in frame # n-1 and the pixel in frame #n at the same position on the image of the pixel to be determined in frame #n are determined to be stationary, and the pixel in frame #n When it is determined that the pixel of frame # n + 1 at the same position on the image of the pixel to be determined as #n is still, the area specifying unit 103 determines that the determination target of frame #n is Is determined to belong to the still region.
[0201]
The pixel in frame # n-1 and the pixel in frame #n at the same position on the image of the pixel to be determined in frame #n are determined to move, and the pixel in frame #n When it is determined that a pixel in frame # n + 1 located at the same position on the image of a pixel to be determined as #n is a motion, the region specifying unit 103 determines that the determination is performed in frame #n. Is determined to belong to the motion region.
[0202]
The pixel in frame #n and the pixel in frame # n + 1 at the same position on the image of the pixel to be determined in frame #n are determined as motion, and the determination target in frame #n The pixel of frame # n + 1 at the same position as the position of the pixel on the image and the pixel of frame # n + 2 at the same position as the position of the pixel to be determined at frame #n on the image Are determined to be stationary, the area specifying unit 103 determines that the pixel to be determined for frame #n belongs to the uncovered background area.
[0203]
FIG. 26 is a diagram illustrating an example of the result of specifying the area of the area specifying unit 103. In FIG. 30A, pixels determined to belong to the covered background area are displayed in white. In FIG. 30B, the pixels determined to belong to the uncovered background area are displayed in white.
[0204]
In FIG. 30C, pixels that are determined to belong to the motion region are displayed in white. In FIG. 30D, the pixels determined to belong to the still region are displayed in white.
[0205]
FIG. 31 is a diagram showing, as an image, region information indicating a mixed region among region information output from the determination flag storage frame memory 206. In FIG. 31, pixels that are determined to belong to the covered background area or the uncovered background area, that is, pixels that are determined to belong to the mixed area are displayed in white. The area information indicating the mixed area output from the determination flag storage frame memory 206 indicates a mixed area and a portion having a texture surrounded by a portion having no texture in the foreground area.
[0206]
Next, the area specifying process of the area specifying unit 103 will be described with reference to the flowchart of FIG. In step S201, the frame memory 201 acquires images of frames # n-2 to # n + 2 including the frame #n to be determined.
[0207]
In step S202, the static motion determination unit 202-3 determines whether or not the pixel in frame # n-1 and the pixel at the same position in frame #n are stationary. Then, the static motion determination unit 202-2 determines whether or not the frame #n and the pixel at the same position in the frame # n + 1 are still.
[0208]
In step S203, if it is determined that the pixel in frame #n and the pixel in the same position in frame # n + 1 are determined to be stationary, the process proceeds to step S204, and the region determination unit 203-2 determines that the region is determined. A corresponding still area determination flag is set to “1” indicating that it belongs to a still area. The region determination unit 203-2 supplies the still region determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S205.
[0209]
When it is determined in step S202 that the pixel in frame # n-1 and the pixel at the same position in frame #n are in motion, or in step S203, the pixel in frame #n and the same position in frame # n + 1 If the pixel is determined to be moving, the pixel in frame #n does not belong to the still region, so the process of step S204 is skipped, and the procedure proceeds to step S205.
[0210]
In step S205, the static motion determination unit 202-3 determines whether or not the pixel in frame # n-1 and the pixel at the same position in frame #n are in motion, and if it is determined as motion, the process proceeds to step S206. Then, the static motion determination unit 202-2 determines whether or not there is motion between the pixel of frame #n and the pixel at the same position of frame # n + 1.
[0211]
If it is determined in step S206 that the pixel in frame #n and the pixel in the same position in frame # n + 1 are in motion, the process proceeds to step S207, and the region determination unit 203-2 determines that the region is determined. “1” indicating that it belongs to a motion region is set in the corresponding motion region determination flag. The region determination unit 203-2 supplies the motion region determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S208.
[0212]
If it is determined in step S205 that the pixel in frame # n-1 and the pixel in the same position in frame #n are still, or in step S206, the pixel in frame #n and the same position in frame # n + 1 If the current pixel is determined to be still, the pixel of frame #n does not belong to the motion region, so the process of step S207 is skipped, and the procedure proceeds to step S208.
[0213]
In step S208, the static motion determination unit 202-4 determines whether or not the pixel in frame # n-2 and the pixel in the same position in frame # n-1 are stationary. In step S209, the static motion determination unit 202-3 determines whether or not there is motion between the pixel in frame # n-1 and the pixel at the same position in frame #n.
[0214]
If it is determined in step S209 that the motion of the pixel in frame # n-1 and the pixel at the same position in frame #n is determined as moving, the process proceeds to step S210, and the region determination unit 203-3 determines that the region is to be determined. The corresponding covered background area determination flag is set to “1” indicating that it belongs to the covered background area. The area determination unit 203-3 supplies the covered background area determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S211.
[0215]
If it is determined in step S208 that the pixel in frame # n-2 and the pixel in the same position in frame # n-1 are in motion, or in step S209, the pixel in frame # n-1 and the pixel in frame #n If it is determined that the pixel at the same position is still, the pixel of frame #n does not belong to the covered background area, so the process of step S210 is skipped, and the procedure proceeds to step S211.
[0216]
In step S211, the static motion determination unit 202-2 determines whether or not the pixel in the frame #n and the pixel in the same position in the frame # n + 1 are in motion, and if it is determined to be in motion, the process proceeds to step S212. Then, the static motion determination unit 202-1 determines whether or not the pixel of frame # n + 1 and the pixel at the same position of frame # n + 2 are still.
[0217]
If it is determined in step S212 that the pixel in frame # n + 1 and the pixel in the same position in frame # n + 2 are stationary, the process proceeds to step S213, and the region determination unit 203-1 determines the region. In the uncovered background area determination flag corresponding to the pixel, “1” indicating that the pixel belongs to the uncovered background area is set. The area determination unit 203-1 supplies the uncovered background area determination flag to the determination flag storage frame memory 204, and the procedure proceeds to step S214.
[0218]
If it is determined in step S211 that the pixel in frame #n and the pixel in the same position in frame # n + 1 are stationary, or in step S212, the pixel in frame # n + 1 and the frame # n + 2 If it is determined that the motion is the same pixel, the pixel of frame #n does not belong to the uncovered background area, so the process of step S213 is skipped, and the procedure proceeds to step S214.
[0219]
In step S214, the area specifying unit 103 determines whether or not an area has been specified for all the pixels of frame #n. If it is determined that no area has been specified for all the pixels of frame #n, Returns to step S202 and repeats the area specifying process for other pixels.
[0220]
If it is determined in step S214 that the area has been specified for all the pixels of frame #n, the process proceeds to step S215, where the synthesis unit 205 stores the uncovered background area determination flag stored in the determination flag storage frame memory 204. And a covered background area determination flag, area information indicating a mixed area is generated, and each pixel belongs to one of an uncovered background area, a stationary area, a motion area, and a covered background area. The region information indicating this is generated, the generated region information is set in the determination flag storage frame memory 206, and the process ends.
[0221]
As described above, the region specifying unit 103 may generate region information indicating that each pixel included in the frame belongs to the motion region, the still region, the uncovered background region, or the covered background region. it can.
[0222]
The area specifying unit 103 generates area information corresponding to the mixed area by applying a logical sum to the area information corresponding to the uncovered background area and the covered background area, and is included in the frame. For each pixel, region information including a flag indicating that the pixel belongs to a motion region, a still region, or a mixed region may be generated.
[0223]
When the object corresponding to the foreground has a texture, the area specifying unit 103 can specify the movement area more accurately.
[0224]
The area specifying unit 103 can output area information indicating a motion area as area information indicating a foreground area, and area information indicating a still area as area information indicating a background area.
[0225]
In addition, although the object corresponding to the background has been described as stationary, the above-described processing for specifying the region can be applied even if the image corresponding to the background region includes a motion. For example, when the image corresponding to the background area is moving uniformly, the area specifying unit 103 shifts the entire image corresponding to this movement, and performs the same processing as when the object corresponding to the background is stationary. To do. In addition, when the image corresponding to the background region includes a different motion for each local area, the region specifying unit 103 selects a pixel corresponding to the motion and executes the above-described processing.
[0226]
FIG. 33 is a block diagram illustrating a configuration of the area specifying unit 103. The area specifying unit 103 illustrated in FIG. 33 does not use a motion vector. The background image generation unit 301 generates a background image corresponding to the input image, and supplies the generated background image to the binary object image extraction unit 302. For example, the background image generation unit 301 extracts an image object corresponding to a background object included in the input image, and generates a background image.
[0227]
FIG. 34 shows an example of a model diagram in which pixel values of pixels arranged in a line adjacent to the moving direction of the image corresponding to the foreground object are developed in the time direction. For example, when the motion direction of the image corresponding to the foreground object is horizontal with respect to the screen, the model diagram in FIG. 34 shows a model in which pixel values of adjacent pixels on one line are expanded in the time direction.
[0228]
In FIG. 34, the line in frame #n is the same as the line in frame # n−1 and frame # n + 1.
[0229]
In frame #n, the foreground components corresponding to the objects included in the sixth through seventeenth pixels from the left are included in the second through thirteenth pixels from the left in frame # n-1. In frame # n + 1, they are included in the 10th to 21st pixels from the left.
[0230]
In frame # n−1, the pixels belonging to the covered background area are the 11th to 13th pixels from the left, and the pixels belonging to the uncovered background area are the 2nd to 4th pixels from the left. In frame #n, the pixels belonging to the covered background area are the 15th to 17th pixels from the left, and the pixels belonging to the uncovered background area are the 6th to 8th pixels from the left. In frame # n + 1, the pixels belonging to the covered background area are the 19th to 21st pixels from the left, and the pixels belonging to the uncovered background area are the 10th to 12th pixels from the left.
[0231]
In frame # n−1, the pixels belonging to the background area are the first pixel from the left and the fourteenth through twenty-first pixels from the left. In frame #n, the pixels belonging to the background area are the first through fifth pixels from the left, and the eighteenth through twenty-first pixels from the left. In frame # n + 1, the pixels belonging to the background area are the first through ninth pixels from the left.
[0232]
An example of a background image corresponding to the example of FIG. 34 generated by the background image generation unit 301 is shown in FIG. The background image is composed of pixels corresponding to the background object, and does not include image components corresponding to the foreground object.
[0233]
The binary object image extraction unit 302 generates a binary object image based on the correlation between the background image and the input image, and supplies the generated binary object image to the time change detection unit 303.
[0234]
FIG. 36 is a block diagram illustrating a configuration of the binary object image extraction unit 302. The correlation value calculation unit 321 calculates the correlation between the background image and the input image supplied from the background image generation unit 301, generates a correlation value, and supplies the generated correlation value to the threshold processing unit 322.
[0235]
The correlation value calculation unit 321 may, for example, generate Y4 corresponding to blocks in a 3 × 3 background image centered on X4 as shown in FIG. 37A and blocks in the background image as shown in FIG. 37B. The correlation value corresponding to Y4 is calculated by applying the equation (4) to the block in the 3 × 3 input image centered at.
[0236]
[Expression 2]

... (4)
[0237]
[Equation 3]

... (5)
[0238]
[Expression 4]

... (6)
[0239]
The correlation value calculation unit 321 supplies the correlation value calculated for each pixel in this way to the threshold processing unit 322.
[0240]
Further, for example, the correlation value calculation unit 321 corresponds to a block in a 3 × 3 background image centered on X4 as shown in FIG. 38A and a block in the background image as shown in FIG. 38B. The sum of absolute differences corresponding to Y4 may be calculated by applying Equation (7) to a block in the 3 × 3 input image centered on Y4.
[0241]
[Equation 5]

... (7)
[0242]
The correlation value calculation unit 321 supplies the sum of absolute differences calculated as described above to the threshold processing unit 322 as a correlation value.
[0243]
The threshold value processing unit 322 compares the pixel value of the correlation image with the threshold value th0. When the correlation value is equal to or less than the threshold value th0, the threshold value processing unit 322 sets the pixel value of the binary object image to 1 and sets the correlation value. Is greater than the threshold th0, the pixel value of the binary object image is set to 0, and a binary object image with 0 or 1 set to the pixel value is output. The threshold processing unit 322 may store the threshold th0 in advance, or may use the threshold th0 input from the outside.
[0244]
FIG. 39 is a diagram showing an example of a binary object image corresponding to the model of the input image shown in FIG. In the binary object image, the pixel value is set to 0 for a pixel having a high correlation with the background image.
[0245]
FIG. 40 is a block diagram illustrating a configuration of the time change detection unit 303. The frame memory 341 determines the area for the pixel of frame #n, and the binary object image of frame # n−1, frame #n, and frame # n + 1 supplied from the binary object image extraction unit 302 Remember.
[0246]
The area determination unit 342 determines an area for each pixel of the frame #n based on the binary object images of the frame # n−1, the frame #n, and the frame # n + 1 stored in the frame memory 341. Region information is generated, and the generated region information is output.
[0247]
FIG. 41 is a diagram for explaining the determination by the region determination unit 342. When the pixel of interest of the binary object image of frame #n is 0, the region determination unit 342 determines that the pixel of interest of frame #n belongs to the background region.
[0248]
The pixel of interest of the binary object image of frame #n is 1, the corresponding pixel of the binary object image of frame # n-1 is 1, and the correspondence of the binary object image of frame # n + 1 When the pixel to be processed is 1, the region determination unit 342 determines that the pixel of interest in frame #n belongs to the foreground region.
[0249]
When the pixel of interest of the binary object image of frame #n is 1 and the corresponding pixel of the binary object image of frame # n-1 is 0, the region determination unit 342 It is determined that the pixel in question belongs to the covered background area.
[0250]
When the pixel of interest of the binary object image of frame #n is 1 and the corresponding pixel of the binary object image of frame # n + 1 is 0, the region determination unit 342 It is determined that the current pixel belongs to the uncovered background area.
[0251]
FIG. 42 is a diagram illustrating an example in which the time change detection unit 303 determines the binary object image corresponding to the input image model illustrated in FIG. Since the corresponding pixel of frame #n of the binary object image is 0, the time change detection unit 303 determines that the first to fifth pixels from the left of the frame #n belong to the background area.
[0252]
The temporal change detection unit 303 has the uncovered background region as the sixth to ninth pixels from the left because the pixel of frame #n of the binary object image is 1 and the corresponding pixel of frame # n + 1 is 0. It is determined that it belongs to.
[0253]
The temporal change detection unit 303 has a pixel of frame #n of 1 in the binary object image, a corresponding pixel of frame # n−1 is 1, and a corresponding pixel of frame # n + 1 is 1. The tenth through thirteenth pixels are determined to belong to the foreground area.
[0254]
Since the pixel of frame #n of the binary object image is 1 and the corresponding pixel of frame # n−1 is 0, the time change detection unit 303 sets the 14th to 17th pixels from the left as the covered background area. Judge as belonging.
[0255]
The time change detection unit 303 determines that the 18th to 21st pixels from the left belong to the background area because the corresponding pixel of frame #n of the binary object image is 0.
[0256]
Next, the area specifying process of the area determining unit 103 will be described with reference to the flowchart of FIG. In step S301, the background image generation unit 301 of the region determination unit 103 generates a background image by extracting, for example, an image object corresponding to a background object included in the input image based on the input image, and generates the generated background. The image is supplied to the binary object image extraction unit 302.
[0257]
In step S302, the binary object image extraction unit 302 calculates a correlation value between the input image and the background image supplied from the background image generation unit 301, for example, by the calculation described with reference to FIG. In step S303, the binary object image extraction unit 302 calculates a binary object image from the correlation value and the threshold value th0, for example, by comparing the correlation value with the threshold value th0.
[0258]
In step S304, the time change detection unit 303 executes region determination processing, and the processing ends.
[0259]
Details of the area determination process corresponding to step S304 will be described with reference to the flowchart of FIG. In step S321, the region determination unit 342 of the time change detection unit 303 determines whether or not the pixel of interest is 0 in the frame #n stored in the frame memory 341, and pays attention in the frame #n. If it is determined that the pixel is 0, the process proceeds to step S322, the pixel of interest in frame #n is set as belonging to the background area, and the process ends.
[0260]
If it is determined in step S321 that the pixel of interest is 1 in frame #n, the process proceeds to step S323, where the area determination unit 342 of the time change detection unit 303 stores the frame #n stored in the frame memory 341. In frame # n-1, it is determined whether or not the corresponding pixel is 0, and in frame #n, the target pixel is 1 and frame #n If it is determined at n−1 that the corresponding pixel is 0, the process proceeds to step S324, the pixel of interest in frame #n is set as belonging to the covered background area, and the process ends.
[0261]
If it is determined in step S323 that the pixel of interest is 0 in frame #n or the corresponding pixel is 1 in frame # n-1, the process proceeds to step S325, and the time change detection unit 303 The area determination unit 342 determines whether the pixel of interest is 1 in frame #n stored in the frame memory 341 and whether the corresponding pixel is 0 in frame # n + 1. If it is determined that the pixel of interest is 1 in frame #n and the corresponding pixel is 0 in frame # n + 1, the process proceeds to step S326, and the pixel of interest of frame #n is undefined. The process ends with setting to belong to the covered background area.
[0262]
If it is determined in step S325 that the pixel of interest is 0 in frame #n or the corresponding pixel is 1 in frame # n + 1, the process proceeds to step S327, and the time change detection unit 303 The area determination unit 342 sets the pixel of interest in frame #n as the foreground area, and the process ends.
[0263]
As described above, the area specifying unit 103 determines whether the pixels of the input image are the foreground area, the background area, the covered background area, and the uncovered background area based on the correlation value between the input image and the corresponding background image. It is possible to specify which one belongs, and generate region information corresponding to the specified result.
[0264]
FIG. 45 is a block diagram showing another configuration of the area specifying unit 103. 45 uses the motion vector supplied from the motion detection unit 102 and its position information. The same parts as those shown in FIG. 33 are denoted by the same reference numerals, and the description thereof is omitted.
[0265]
The robust unit 361 generates a robust binary object image based on the binary object images of N frames supplied from the binary object image extraction unit 302, and sends it to the time change detection unit 303. Output.
[0266]
FIG. 46 is a block diagram illustrating the configuration of the robust unit 361. The motion compensation unit 381 compensates for the motion of the binary object image of N frames based on the motion vector supplied from the motion detection unit 102 and its position information, and obtains a binary object image with motion compensated. Output to the switch 382.
[0267]
With reference to the example of FIG. 47 and FIG. 48, the motion compensation of the motion compensation unit 381 will be described. For example, when determining the region of frame #n, when binary object images of frame # n-1, frame #n, and frame # n + 1 shown in FIG. 47 are input, the motion compensation unit 381 Based on the motion vector supplied from the motion detection unit 102, motion compensation is performed on the binary object image of frame # n-1 and the binary object image of frame # n + 1, as shown in FIG. Then, the binary object image subjected to motion compensation is supplied to the switch 382.
[0268]
The switch 382 outputs the motion-compensated binary object image of the first frame to the frame memory 383-1, and outputs the motion-compensated binary object image of the second frame to the frame memory 383-2. Similarly, the switch 382 outputs each of the motion compensated binary object images of the third to N−1th frames to any of the frame memory 383-3 to the frame memory 383- (N−1), The motion-compensated binary object image of the Nth frame is output to the frame memory 383-N.
[0269]
The frame memory 383-1 stores the binary object image for which motion compensation has been performed for the first frame, and outputs the stored binary object image to the weighting unit 384-1. The frame memory 383-2 stores the binary object image with motion compensation of the second frame, and outputs the stored binary object image to the weighting unit 384-2.
[0270]
Similarly, each of the frame memories 383-3 to 383- (N-1) stores and stores any of the motion compensated binary object images of the third frame to the (N-1) th frame. The binary object image thus output is output to any one of the weighting unit 384-3 to the weighting unit 384- (N-1). The frame memory 383-N stores the binary object image with motion compensation of the Nth frame, and outputs the stored binary object image to the weighting unit 384-N.
[0271]
The weighting unit 384-1 multiplies the pixel value of the motion-compensated binary object image of the first frame supplied from the frame memory 383-1 by a predetermined weight w1 and supplies the result to the integrating unit 385. The weighting unit 384-2 multiplies the pixel value of the motion compensated binary object image of the second frame supplied from the frame memory 383-2 by a predetermined weight w2, and supplies the result to the integrating unit 385.
[0272]
Similarly, each of the weighting units 384-3 to 384- (N-1) is the third to N-1 supplied from any one of the frame memories 383-3 to 383- (N-1). The pixel value of the motion-compensated binary object image of any one of the frames is multiplied by one of the predetermined weights w3 to w (N−1) and supplied to the accumulating unit 385. The weighting unit 384-N multiplies the pixel value of the motion-compensated binary object image of the Nth frame supplied from the frame memory 383-N by a predetermined weight wN and supplies the result to the integrating unit 385.
[0273]
The accumulating unit 385 accumulates the corresponding pixel values of the binary object image that have been subjected to motion compensation of the 1st to Nth frames and multiplied by one of the weights w1 to wN, respectively, and the accumulated pixel values are obtained in advance. A binary object image is generated by comparing with a predetermined threshold value th0.
[0274]
In this way, the robust unit 361 generates a robust binary object image from the N binary object images and supplies the generated binary object image to the time change detection unit 303. Therefore, the region specifying unit shown in FIG. The area 103 can specify the area more accurately than the case shown in FIG. 33 even if the input image includes noise.
[0275]
Next, the area specifying process of the area specifying unit 103 having the configuration shown in FIG. 45 will be described with reference to the flowchart of FIG. The processing in steps S341 through S343 is the same as that in steps S301 through S303 described with reference to the flowchart in FIG.
[0276]
In step S344, the robust unit 361 executes a robust process.
[0277]
In step S345, the time change detection unit 303 executes region determination processing, and the processing ends. Details of the process in step S345 are the same as those described with reference to the flowchart of FIG.
[0278]
Next, details of the robust processing corresponding to the processing in step S344 in FIG. 49 will be described with reference to the flowchart in FIG. In step S361, the motion compensation unit 381 performs motion compensation processing on the input binary object image based on the motion vector supplied from the motion detection unit 102 and its position information. In step S362, any of the frame memories 383-1 to 383-N stores the motion compensated binary object image supplied via the switch 382.
[0279]
In step S363, the robust unit 361 determines whether or not N binary object images are stored. If it is determined that N binary object images are not stored, the process returns to step S361. The motion compensation processing of the binary object image and the storage processing of the binary object image are repeated.
[0280]
If it is determined in step S363 that N binary object images have been stored, the process proceeds to step S364, and each of the weighting units 384-1 to 384-N adds w1 to w in each of the N binary object images. Multiply by one of the weights of wN.
[0281]
In step S365, the integration unit 385 integrates the weighted N binary object images.
[0282]
In step S366, the integrating unit 385 generates a binary object image from the integrated image, for example, by comparison with a predetermined threshold value th1, and the process ends.
[0283]
As described above, the region specifying unit 103 having the configuration shown in FIG. 45 can generate region information based on the robust binary object image.
[0284]
As described above, the area specifying unit 103 generates area information indicating that each of the pixels included in the frame belongs to the motion area, the still area, the uncovered background area, or the covered background area. Can do.
[0285]
FIG. 51 is a block diagram illustrating an example of the configuration of the mixture ratio calculation unit 104. Based on the input image, the estimated mixture ratio processing unit 401 calculates an estimated mixture ratio for each pixel by an operation corresponding to the model of the covered background area, and supplies the calculated estimated mixture ratio to the mixture ratio determining unit 403. To do.
[0286]
Based on the input image, the estimated mixture ratio processing unit 402 calculates an estimated mixture ratio for each pixel by an operation corresponding to the model of the uncovered background region, and the calculated estimated mixture ratio is sent to the mixture ratio determining unit 403. Supply.
[0287]
Since it can be assumed that the object corresponding to the foreground is moving at a constant speed within the shutter time, the mixture ratio α of the pixels belonging to the mixed area has the following properties. That is, the mixture ratio α changes linearly in response to changes in the pixel position. If the change in the pixel position is one-dimensional, the change in the mixture ratio α can be expressed by a straight line. If the change in the pixel position is two-dimensional, the change in the mixture ratio α is expressed by a plane. be able to.
[0288]
Since the period of one frame is short, it is assumed that the object corresponding to the foreground is a rigid body and moves at a constant speed.
[0289]
In this case, the gradient of the mixture ratio α is the inverse ratio of the motion amount v within the foreground shutter time.
[0290]
An example of an ideal mixing ratio α is shown in FIG. The gradient l in the mixing region of the ideal mixing ratio α can be expressed as the reciprocal of the motion amount v.
[0291]
As shown in FIG. 52, the ideal mixture ratio α has a value of 1 in the background region, a value of 0 in the foreground region, and a value greater than 0 and less than 1 in the mixed region. .
[0292]
In the example of FIG. 53, the pixel value C06 of the seventh pixel from the left of frame #n can be expressed by Expression (8) using the pixel value P06 of the seventh pixel from the left of frame # n-1. it can.
[0293]
[Formula 6]

... (8)
[0294]
In Expression (8), the pixel value C06 is expressed as the pixel value M of the pixel in the mixed region, and the pixel value P06 is expressed as the pixel value B of the pixel in the background region. That is, the pixel value M of the pixel in the mixed region and the pixel value B of the pixel in the background region can be expressed as Equation (9) and Equation (10), respectively.
[0295]
M = C06 (9)
B = P06 (10)
[0296]
2 / v in equation (8) corresponds to the mixing ratio α. Since the motion amount v is 4, the mixture ratio α of the seventh pixel from the left of the frame #n is 0.5.
[0297]
As described above, the pixel value C of the focused frame #n is regarded as the pixel value of the mixed region, and the pixel value P of the frame # n-1 before the frame #n is regarded as the pixel value of the background region. Equation (3) indicating the mixing ratio α can be rewritten as Equation (11).
[0298]
C = α · P + f (11)
F in Expression (11) is the sum ΣiFi / v of the foreground components included in the pixel of interest. There are two variables included in equation (11): the mixture ratio α and the sum f of the foreground components.
[0299]
Similarly, FIG. 54 shows a model in which pixel values are expanded in the time direction, in which the amount of motion v is 4 and the number of virtual divisions in the time direction is 4, in the uncovered background area.
[0300]
In the uncovered background area, similarly to the above-described expression in the covered background area, the pixel value C of the frame #n of interest is regarded as the pixel value of the mixed area, and the frame # n + 1 after the frame #n Eq. (3) indicating the mixture ratio α can be expressed as Eq. (12) by regarding the pixel value N of と as the pixel value of the background region.
[0301]
C = α ・ N + f (12)
[0302]
Although it has been described that the background object is stationary, even when the background object is moving, by using the pixel value of the pixel at the position corresponding to the background motion amount v, the expression (8 ) To (12) can be applied. For example, in FIG. 53, when the motion amount v of the object corresponding to the background is 2 and the number of virtual divisions is 2, when the object corresponding to the background is moving to the right side in the figure, The pixel value B of the pixel in the background area is set to a pixel value P04.
[0303]
Since Expression (11) and Expression (12) each include two variables, the mixture ratio α cannot be obtained as it is. Here, since an image generally has a strong spatial correlation, adjacent pixels have almost the same pixel value.
[0304]
Therefore, since the foreground components have a strong spatial correlation, the formula is modified so that the sum f of the foreground components can be derived from the previous or subsequent frame to obtain the mixture ratio α.
[0305]
The pixel value Mc of the seventh pixel from the left in frame #n in FIG. 55 can be expressed by Expression (13).
[0306]
[Expression 7]

(13)
2 / v in the first term on the right side of Equation (13) corresponds to the mixing ratio α. The second term on the right side of Expression (13) is expressed as Expression (14) using the pixel value of the subsequent frame # n + 1.
[0307]
[Equation 8]

(14)
[0308]
Here, Equation (15) is established using the spatial correlation of the foreground components.
[0309]
F = F05 = F06 = F07 = F08 = F09 = F10 = F11 = F12 (15)
Expression (14) can be replaced with Expression (16) using Expression (15).
[0310]
[Equation 9]

... (16)
[0311]
As a result, β can be expressed by equation (17).
[0312]
β = 2/4 (17)
[0313]
In general, assuming that the foreground components related to the mixed region are equal as shown in Equation (15), Equation (18) is established from the relationship of the internal ratio for all the pixels in the mixed region.
[0314]
β = 1-α (18)
[0315]
If Expression (18) is established, Expression (11) can be expanded as shown in Expression (19).
[0316]
[Expression 10]

... (19)
[0317]
Similarly, if equation (18) holds, equation (12) can be expanded as shown in equation (20).
[0318]
[Expression 11]

... (20)
[0319]
In Expression (19) and Expression (20), C, N, and P are known pixel values, and therefore the variable included in Expression (19) and Expression (20) is only the mixture ratio α. FIG. 56 shows the relationship among C, N, and P in the equations (19) and (20). C is the pixel value of the pixel of interest in frame #n for calculating the mixture ratio α. N is a pixel value of a pixel in frame # n + 1 corresponding to a pixel of interest corresponding to a position in the spatial direction. P is a pixel value of a pixel in frame # n−1 in which the pixel of interest corresponds to the position in the spatial direction.
[0320]
Accordingly, since one variable is included in each of the equations (19) and (20), the mixture ratio α can be calculated using the pixel values of the pixels of the three frames. The condition for calculating the correct mixture ratio α by solving the equations (19) and (20) is that the foreground components related to the mixed region are equal, that is, the imaging is performed when the foreground object is stationary. In the foreground image object thus obtained, the pixel values of the pixels located at the boundary of the image object corresponding to the direction of the motion of the foreground object, which are twice as many as the movement amount v, are continuous. It is constant.
[0321]
As described above, the mixing ratio α of the pixels belonging to the covered background area is calculated by Expression (21), and the mixing ratio α of the pixels belonging to the uncovered background area is calculated by Expression (22).
[0322]
α = (CN) / (PN) (21)
α = (CP) / (NP) (22)
[0323]
FIG. 57 is a block diagram illustrating a configuration of the estimated mixture ratio processing unit 401. The frame memory 421 stores the input image in units of frames, and supplies the frame immediately after the frame input as the input image to the frame memory 422 and the mixture ratio calculation unit 423.
[0324]
The frame memory 422 stores the input image in units of frames, and supplies the frame immediately after the frame supplied from the frame memory 421 to the mixture ratio calculation unit 423.
[0325]
Therefore, when the frame # n + 1 is input to the mixing ratio calculation unit 423 as an input image, the frame memory 421 supplies the frame #n to the mixing ratio calculation unit 423, and the frame memory 422 stores the frame # n− 1 is supplied to the mixture ratio calculation unit 423.
[0326]
The mixture ratio calculation unit 423 calculates the pixel value C of the pixel of interest in frame #n and the pixel of frame # n + 1 corresponding to the spatial position of the pixel of interest by the calculation shown in Expression (21). And the estimated mixture ratio of the pixel of interest was calculated based on the pixel value N of the pixel and the pixel value P of the pixel of frame # n-1 whose spatial position corresponds to the pixel of interest. Output the estimated mixture ratio. For example, when the background is stationary, the mixture ratio calculation unit 423 determines that the pixel value C of the pixel of interest in frame #n is the same as the pixel of interest in the frame # n + 1. Calculate the estimated mixture ratio of the pixel of interest based on the pixel value N of the pixel and the pixel value P of the pixel of frame # n-1, which has the same position in the frame as the pixel of interest. The estimated mixture ratio is output.
[0327]
As described above, the estimated mixture ratio processing unit 401 can calculate the estimated mixture ratio based on the input image and supply the estimated mixture ratio to the mixture ratio determining unit 403.
[0328]
The estimated mixture ratio processing unit 402 calculates the estimated mixture ratio of the pixel of interest by the calculation shown in the equation (21) by the estimated mixture ratio processing unit 401, whereas the calculation shown in the equation (22). Thus, the estimated mixture ratio processing unit 401 is the same as the estimated mixture ratio processing unit 401 except that a part for calculating the estimated mixture ratio of the pixel of interest is different.
[0329]
FIG. 58 is a diagram illustrating an example of the estimated mixture ratio calculated by the estimated mixture ratio processing unit 401. The estimated mixture ratio shown in FIG. 58 shows the result when the foreground motion amount v corresponding to an object moving at a constant speed is 11, for one line.
[0330]
It can be seen that the estimated mixture ratio changes almost linearly in the mixed region as shown in FIG.
[0331]
Returning to FIG. 51, the mixture ratio determining unit 403 determines whether the pixel for which the mixture ratio α supplied from the region specifying unit 103 is to be calculated is a foreground area, a background area, a covered background area, or an uncovered background area. The mixing ratio α is set on the basis of the area information indicating which of the two. The mixture ratio determining unit 403 sets 0 as the mixture ratio α when the target pixel belongs to the foreground area, and sets 1 as the mixture ratio α when the target pixel belongs to the background area. When the pixel belongs to the covered background area, the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401 is set to the mixture ratio α, and when the target pixel belongs to the uncovered background area, the estimated mixture ratio processing unit The estimated mixing ratio supplied from 402 is set to the mixing ratio α. The mixture ratio determination unit 403 outputs a mixture ratio α set based on the region information.
[0332]
FIG. 59 is a block diagram showing another configuration of the mixture ratio calculation unit 104. Based on the region information supplied from the region specifying unit 103, the selection unit 441 supplies the pixels belonging to the covered background region and the corresponding pixels of the previous and subsequent frames to the estimated mixture ratio processing unit 442. Based on the region information supplied from the region specifying unit 103, the selection unit 441 supplies the pixels belonging to the uncovered background region and the corresponding pixels in the previous and subsequent frames to the estimated mixture ratio processing unit 443. .
[0333]
Based on the pixel value input from the selection unit 441, the estimated mixture ratio processing unit 442 calculates the estimated mixture ratio of the pixel of interest belonging to the covered background region by the calculation shown in Expression (21). The calculated estimated mixture ratio is supplied to the selection unit 444.
[0334]
Based on the pixel value input from the selection unit 441, the estimated mixture ratio processing unit 443 calculates an estimated mixture ratio of the pixel of interest belonging to the uncovered background region by the calculation shown in Expression (22). Then, the calculated estimated mixture ratio is supplied to the selection unit 444.
[0335]
When the target pixel belongs to the foreground area based on the area information supplied from the area specifying unit 103, the selection unit 444 selects an estimated mixture ratio that is 0, sets the mixture ratio α, If the pixel belongs to the background region, an estimated mixture ratio of 1 is selected and set to the mixture ratio α. When the target pixel belongs to the covered background area, the selection unit 444 selects the estimated mixture ratio supplied from the estimated mixture ratio processing unit 442 and sets it to the mixture ratio α, and the target pixel is uncovered back. When belonging to the ground region, the estimated mixture ratio supplied from the estimated mixture ratio processing unit 443 is selected and set to the mixture ratio α. The selection unit 444 outputs the mixture ratio α selected and set based on the region information.
[0336]
As described above, the mixture ratio calculation unit 104 having another configuration shown in FIG. 59 can calculate the mixture ratio α for each pixel included in the image and output the calculated mixture ratio α.
[0337]
With reference to the flowchart of FIG. 60, the process of calculating the mixture ratio α of the mixture ratio calculator 104 shown in FIG. In step S 401, the mixture ratio calculation unit 104 acquires the region information supplied from the region specifying unit 103. In step S 402, the estimated mixture ratio processing unit 401 performs an estimated mixture ratio calculation process using a model corresponding to the covered background region, and supplies the calculated estimated mixture ratio to the mixture ratio determining unit 403. Details of the calculation process of the mixture ratio estimation will be described later with reference to the flowchart of FIG.
[0338]
In step S 403, the estimated mixture ratio processing unit 402 performs an estimated mixture ratio calculation process using a model corresponding to the uncovered background region, and supplies the calculated estimated mixture ratio to the mixture ratio determining unit 403.
[0339]
In step S404, the mixture ratio calculation unit 104 determines whether or not the mixture ratio α is estimated for the entire frame. If it is determined that the mixture ratio α is not estimated for the entire frame, the process returns to step S402. Then, the process of estimating the mixture ratio α for the next pixel is executed.
[0340]
If it is determined in step S404 that the mixture ratio α has been estimated for the entire frame, the process proceeds to step S405, where the mixture ratio determination unit 403 determines that the pixel is a foreground area, a background area, a covered background area, or an uncovered back. The mixture ratio α is set based on the area information supplied from the area specifying unit 103 that indicates which of the ground areas belongs. The mixture ratio determining unit 403 sets 0 as the mixture ratio α when the target pixel belongs to the foreground area, and sets 1 as the mixture ratio α when the target pixel belongs to the background area. When the pixel belongs to the covered background area, the estimated mixture ratio supplied from the estimated mixture ratio processing unit 401 is set to the mixture ratio α, and when the target pixel belongs to the uncovered background area, the estimated mixture ratio processing unit The estimated mixture ratio supplied from 402 is set to the mixture ratio α, and the process ends.
[0341]
As described above, the mixture ratio calculation unit 104 can calculate the mixture ratio α, which is a feature amount corresponding to each pixel, based on the region information supplied from the region specifying unit 103 and the input image.
[0342]
59 is the same as the processing described with reference to the flowchart of FIG. 60, and thus the description thereof is omitted.
[0343]
Next, the mixing ratio estimation process using the model corresponding to the covered background area corresponding to step S402 in FIG. 60 will be described with reference to the flowchart in FIG.
[0344]
In step S421, the mixture ratio calculation unit 423 acquires the pixel value C of the target pixel of frame #n from the frame memory 421.
[0345]
In step S422, the mixture ratio calculation unit 423 acquires the pixel value P of the pixel of frame # n−1 corresponding to the target pixel from the frame memory 422.
[0346]
In step S423, the mixture ratio calculation unit 423 acquires the pixel value N of the pixel of frame # n + 1 corresponding to the target pixel included in the input image.
[0347]
In step S424, the mixture ratio calculation unit 423, based on the pixel value C of the pixel of interest in frame #n, the pixel value P of the pixel of frame # n-1, and the pixel value N of the pixel of frame # n + 1, Calculate the estimated mixture ratio.
[0348]
In step S425, the mixture ratio calculation unit 423 determines whether or not the process of calculating the estimated mixture ratio has been completed for the entire frame, and determines that the process of calculating the estimated mixture ratio has not been completed for the entire frame. If so, the process returns to step S421, and the process of calculating the estimated mixture ratio for the next pixel is repeated.
[0349]
If it is determined in step S425 that the process of calculating the estimated mixture ratio has been completed for the entire frame, the process ends.
[0350]
Thus, the estimated mixture ratio processing unit 401 can calculate the estimated mixture ratio based on the input image.
[0351]
The processing of the mixture ratio estimation by the model corresponding to the uncovered background area in step S403 of FIG. 60 is the same as the process shown in the flowchart of FIG. 61 using the formula corresponding to the model of the uncovered background area. Description is omitted.
[0352]
Note that the estimated mixture ratio processing unit 442 and the estimated mixture ratio processing unit 443 illustrated in FIG. 59 perform the same processing as the flowchart illustrated in FIG. 61 to calculate the estimated mixture ratio, and thus description thereof is omitted.
[0353]
In addition, although it has been described that the object corresponding to the background is stationary, the above-described processing for obtaining the mixture ratio α can be applied even if the image corresponding to the background region includes a motion. For example, when the image corresponding to the background region is moving uniformly, the estimated mixture ratio processing unit 401 shifts the entire image corresponding to the movement of the background, and the object corresponding to the background is stationary. Process in the same way. In addition, when the image corresponding to the background region includes a different background motion for each local area, the estimated mixture ratio processing unit 401 selects a pixel corresponding to the background motion as a pixel corresponding to the pixel belonging to the mixed region. Then, the above-described processing is executed.
[0354]
Further, the mixture ratio calculation unit 104 performs only the process of estimating the mixture ratio using the model corresponding to the covered background region for all the pixels, and outputs the calculated estimated mixture ratio as the mixture ratio α. Also good. In this case, the mixing ratio α indicates the ratio of the background components for the pixels belonging to the covered background area, and indicates the ratio of the foreground components for the pixels belonging to the uncovered background area. If the absolute value of the difference between the mixture ratio α and 1 calculated in this way is calculated for the pixels belonging to the uncovered background area and the calculated absolute value is set to the mixture ratio α, the separation processing server 11 For the pixels belonging to the uncovered background area, the mixing ratio α indicating the ratio of the background components can be obtained.
[0355]
Similarly, the mixture ratio calculation unit 104 executes only the mixture ratio estimation process using the model corresponding to the uncovered background area for all pixels, and outputs the calculated estimated mixture ratio as the mixture ratio α. You may make it do.
[0356]
Next, the mixing ratio calculation unit 104 that calculates the mixing ratio α using the property that the mixing ratio α changes linearly will be described.
[0357]
As described above, since the equations (11) and (12) each include two variables, the mixture ratio α cannot be obtained as it is.
[0358]
Therefore, using the property that the mixing ratio α changes linearly in response to changes in the pixel position due to the object corresponding to the foreground moving at a constant speed within the shutter time, mixing is performed in the spatial direction. An expression that approximates the ratio α and the sum f of the foreground components is established. By using a plurality of sets of pixel values of pixels belonging to the mixed area and pixel values belonging to the background area, an equation that approximates the mixing ratio α and the sum f of the foreground components is solved.
[0359]
When the change in the mixing ratio α is approximated as a straight line, the mixing ratio α is expressed by Expression (23).
[0360]
α = il + p (23)
In Expression (23), i is an index in the spatial direction where the position of the pixel of interest is 0. l is the slope of the straight line of the mixing ratio α. p is a straight line intercept of the mixing ratio α and is the mixing ratio α of the pixel of interest. In equation (23), the index i is known, but the slope l and the intercept p are unknown.
[0361]
FIG. 62 shows the relationship between the index i, the slope l, and the intercept p.
[0362]
By approximating the mixture ratio α as shown in Expression (23), a plurality of different mixture ratios α for a plurality of pixels can be expressed by two variables. In the example shown in FIG. 62, the five mixing ratios for the five pixels are expressed by two variables, the gradient l and the intercept p.
[0363]
Approximating the mixture ratio α in the plane shown in FIG. 63, when considering the motion v corresponding to two directions of the horizontal direction and the vertical direction of the image, the formula (23) is expanded to a plane, and the mixture ratio α is It is represented by Formula (24).
[0364]
α = jm + kq + p (24)
In Expression (24), j is a horizontal index with the position of the pixel of interest being 0, and k is a vertical index. m is the horizontal inclination of the surface of the mixing ratio α, and q is the vertical inclination of the surface of the mixing ratio α. p is an intercept of the surface of the mixing ratio α.
[0365]
For example, in frame #n shown in FIG. 53, equations (25) to (27) are established for C05 to C07, respectively.
[0366]
C05 = α05 ・ B05 / v + f05 (25)
C06 = α06 ・ B06 / v + f06 (26)
C07 = α07 ・ B07 / v + f07 (27)
[0367]
When the foreground components match in the vicinity, that is, F01 to F03 are equal, and F01 to F03 are replaced with Fc, Expression (28) is established.
[0368]
f (x) = (1-α (x)) · Fc (28)
In Expression (28), x represents a position in the spatial direction.
[0369]
When α (x) is replaced with Expression (24), Expression (28) can be expressed as Expression (29).
[0370]
f (x) = (1- (jm + kq + p)) ・ Fc
= j ・ (-m ・ Fc) + k ・ (-q ・ Fc) + ((1-p) ・ Fc)
= js + kt + u (29)
[0371]
In the equation (29), (−m · Fc), (−q · Fc), and (1-p) · Fc are replaced as shown in the equations (30) to (32).
[0372]
s = -m · Fc (30)
t = -q · Fc (31)
u = (1-p) · Fc (32)
[0373]
In Expression (29), j is an index in the horizontal direction with the position of the pixel of interest set to 0, and k is an index in the vertical direction.
[0374]
In this way, since it is assumed that the object corresponding to the foreground moves at a constant speed within the shutter time and the component corresponding to the foreground is constant in the vicinity, the sum of the foreground components is expressed by Equation (29). Approximated.
[0375]
When the mixture ratio α is approximated by a straight line, the sum of the foreground components can be expressed by Expression (33).
[0376]
f (x) = is + u (33)
[0377]
When the sum of the mixture ratio α and the foreground component in Expression (13) is replaced using Expression (24) and Expression (29), the pixel value M is expressed by Expression (34).
[0378]
M = (jm + kq + p) ・ B + js + kt + u
= jB ・ m + kB ・ q + B ・ p + j ・ s + k ・ t + u (34)
[0379]
In equation (34), the unknown variables are the horizontal gradient m of the surface of the mixing ratio α, the vertical inclination q of the surface of the mixing ratio α, the intercepts p, s, t, and u of the surface of the mixing ratio α. These are six.
[0380]
A plurality of normal equations in which the pixel value M or the pixel value B is set in the normal equation shown in the equation (34) in correspondence with the pixel in the vicinity of the pixel of interest, and the pixel value M or the pixel value B is set. Is calculated by the method of least squares to calculate the mixture ratio α.
[0381]
For example, the horizontal index j of the pixel of interest is set to 0, the index k of the vertical direction is set to 0, and a 3 × 3 pixel in the vicinity of the pixel of interest is expressed by the normal equation shown in Expression (34). When the pixel value M or the pixel value B is set, Expressions (35) to (43) are obtained.
[0382]
M-1, -1 = (-1) ・ B-1, -1 ・ m + (-1) ・ B-1, -1 ・ q + B-1, -1 ・ p + (-1) ・ s + (- 1) ・ t + u ・・・ (35)
M0, -1 = (0) ・ B0, -1 ・ m + (-1) ・ B0, -1 ・ q + B0, -1 ・ p + (0) ・ s + (-1) ・ t + u ・・・（ 36)
M + 1, -1 = (+ 1) ・ B + 1, -1 ・ m + (-1) ・ B + 1, -1 ・ q + B + 1, -1 ・ p + (+ 1) ・ s + (- 1) ・ t + u ・・・ (37)
M-1,0 = (-1) ・ B-1,0 ・ m + (0) ・ B-1,0 ・ q + B-1,0 ・ p + (-1) ・ s + (0) ・ t + u ... (38)
M0,0 = (0) ・ B0,0 ・ m + (0) ・ B0,0 ・ q + B0,0 ・ p + (0) ・ s + (0) ・ t + u (39)
M + 1,0 = (+ 1) ・ B + 1,0 ・ m + (0) ・ B + 1,0 ・ q + B + 1,0 ・ p + (+ 1) ・ s + (0) ・ t + u ... (40)
M-1, + 1 = (-1) ・ B-1, + 1 ・ m + (+ 1) ・ B-1, + 1 ・ q + B-1, + 1 ・ p + (-1) ・ s + (+ 1) ・ t + u ・・・（41）
M0, + 1 = (0) ・ B0, + 1 ・ m + (+ 1) ・ B0, + 1 ・ q + B0, + 1 ・ p + (0) ・ s + (+ 1) ・ t + u ・・・ ( 42)
M + 1, + 1 = (+ 1) ・ B + 1, + 1 ・ m + (+ 1) ・ B + 1, + 1 ・ q + B + 1, + 1 ・ p + (+ 1) ・ s + (+ 1) ・ t + u ・・・ (43)
[0383]
Since the index j in the horizontal direction of the pixel of interest is 0 and the index k in the vertical direction is 0, the mixture ratio α of the pixel of interest is expressed by j = 0 and k = It is equal to the value at 0, that is, the intercept p.
[0384]
Therefore, based on the nine equations (35) to (43), the values of the horizontal gradient m, the vertical gradient q, the intercepts p, s, t, and u are calculated by the method of least squares. The intercept p may be output as the mixing ratio α.
[0385]
Next, a more specific procedure for calculating the mixture ratio α by applying the least square method will be described.
[0386]
When the index i and the index k are expressed by one index x, the relationship between the index i, the index k, and the index x is expressed by Expression (44).
[0387]
x = (j + 1) ・ 3+ (k + 1) (44)
[0388]
Express horizontal slope m, vertical slope q, intercepts p, s, t, and u as variables w0, w1, w2, w3, w4, and W5, respectively, jB, kB, B, j, k, And 1 are expressed as a0, a1, a2, a3, a4, and a5, respectively. In consideration of the error ex, Expressions (35) to (43) can be rewritten into Expression (45).
[0389]
[Expression 12]

... (45)
In the formula (45), x is any value in the range of 0 to 8.
[0390]
From equation (45), equation (46) can be derived.
[0390]
[Formula 13]

... (46)
[0392]
Here, in order to apply the method of least squares, an error sum of squares E is defined as shown in equation (47).
[0393]
[Expression 14]

... (47)
[0394]
In order to minimize the error, it is only necessary that the partial differentiation of the variable Wv with respect to the square sum E of the error becomes zero. Here, v is one of integers from 0 to 5. Therefore, wy is obtained so as to satisfy the equation (48).
[0395]
[Expression 15]

... (48)
[0396]
Substituting equation (46) into equation (48) yields equation (49).
[0397]
[Expression 16]

... (49)
[0398]
For example, a sweep method (Gauss-Jordan elimination method) is applied to six formulas obtained by substituting any one of integers 0 to 5 for v in formula (49) to calculate wy. . As described above, w0 is the horizontal gradient m, w1 is the vertical gradient q, w2 is the intercept p, w3 is s, w4 is t, and w5 is u.
[0399]
As described above, horizontal slope m, vertical slope q, intercepts p, s, t, and u are obtained by applying the method of least squares to the equation in which pixel value M and pixel value B are set. be able to.
[0400]
In the description corresponding to the expressions (35) to (43), the pixel value of the pixel included in the mixed area has been described as M, and the pixel value of the pixel included in the background area has been described as B. Therefore, it is necessary to establish a normal equation for each of the cases where they are included in the covered background region or the uncovered background region.
[0401]
For example, when obtaining the mixture ratio α of pixels included in the covered background area of frame #n shown in FIG. 53, the pixel values P04 to C08 of the pixels of frame #n and the pixel values P04 to P08 of the pixels of frame # n−1 P08 is set as a normal equation.
[0402]
When obtaining the mixture ratio α of pixels included in the uncovered background area of frame #n shown in FIG. 53, pixel values N28 to N32 of pixels C28 to C32 of frame #n and pixels of frame # n + 1 Is set to a normal equation.
[0403]
Further, for example, when calculating the mixture ratio α of the pixels included in the covered background region shown in FIG. 64, the following equations (50) to (58) are established. The pixel value of the pixel for calculating the mixture ratio α is Mc5.
[0404]
Mc1 = (-1) ・ Bc1 ・ m + (-1) ・ Bc1 ・ q + Bc1 ・ p + (-1) ・ s + (-1) ・ t + u (50)
Mc2 = (0) ・ Bc2 ・ m + (-1) ・ Bc2 ・ q + Bc2 ・ p + (0) ・ s + (-1) ・ t + u (51)
Mc3 = (+ 1) ・ Bc3 ・ m + (-1) ・ Bc3 ・ q + Bc3 ・ p + (+ 1) ・ s + (-1) ・ t + u (52)
Mc4 = (-1) ・ Bc4 ・ m + (0) ・ Bc4 ・ q + Bc4 ・ p + (-1) ・ s + (0) ・ t + u (53)
Mc5 = (0) ・ Bc5 ・ m + (0) ・ Bc5 ・ q + Bc5 ・ p + (0) ・ s + (0) ・ t + u (54)
Mc6 = (+ 1) ・ Bc6 ・ m + (0) ・ Bc6 ・ q + Bc6 ・ p + (+ 1) ・ s + (0) ・ t + u (55)
Mc7 = (-1) ・ Bc7 ・ m + (+ 1) ・ Bc7 ・ q + Bc7 ・ p + (-1) ・ s + (+ 1) ・ t + u (56)
Mc8 = (0) ・ Bc8 ・ m + (+ 1) ・ Bc8 ・ q + Bc8 ・ p + (0) ・ s + (+ 1) ・ t + u (57)
Mc9 = (+ 1) ・ Bc9 ・ m + (+ 1) ・ Bc9 ・ q + Bc9 ・ p + (+ 1) ・ s + (+ 1) ・ t + u (58)
[0405]
When calculating the mixture ratio α of the pixels included in the covered background area of frame #n, the background of the pixel of frame # n−1 corresponding to the pixel of frame #n in equations (50) to (58) Pixel values Bc1 to Bc9 of the pixels in the area are used.
[0406]
When calculating the mixture ratio α of the pixels included in the uncovered background area shown in FIG. 64, the following equations (59) to (67) are established. The pixel value of the pixel for calculating the mixture ratio α is Mu5.
[0407]
Mu1 = (-1) ・ Bu1 ・ m + (-1) ・ Bu1 ・ q + Bu1 ・ p + (-1) ・ s + (-1) ・ t + u (59)
Mu2 = (0) ・ Bu2 ・ m + (-1) ・ Bu2 ・ q + Bu2 ・ p + (0) ・ s + (-1) ・ t + u (60)
Mu3 = (+ 1) ・ Bu3 ・ m + (-1) ・ Bu3 ・ q + Bu3 ・ p + (+ 1) ・ s + (-1) ・ t + u (61)
Mu4 = (-1) ・ Bu4 ・ m + (0) ・ Bu4 ・ q + Bu4 ・ p + (-1) ・ s + (0) ・ t + u (62)
Mu5 = (0) ・ Bu5 ・ m + (0) ・ Bu5 ・ q + Bu5 ・ p + (0) ・ s + (0) ・ t + u (63)
Mu6 = (+ 1) ・ Bu6 ・ m + (0) ・ Bu6 ・ q + Bu6 ・ p + (+ 1) ・ s + (0) ・ t + u (64)
Mu7 = (-1) ・ Bu7 ・ m + (+ 1) ・ Bu7 ・ q + Bu7 ・ p + (-1) ・ s + (+ 1) ・ t + u ・・・ (65)
Mu8 = (0) ・ Bu8 ・ m + (+ 1) ・ Bu8 ・ q + Bu8 ・ p + (0) ・ s + (+ 1) ・ t + u (66)
Mu9 = (+ 1) ・ Bu9 ・ m + (+ 1) ・ Bu9 ・ q + Bu9 ・ p + (+ 1) ・ s + (+ 1) ・ t + u (67)
[0408]
When calculating the mixture ratio α of the pixels included in the uncovered background area of frame #n, in the equations (59) to (67), the pixels of the frame # n + 1 corresponding to the pixels of the frame #n are calculated. The pixel values Bu1 to Bu9 of the pixels in the background area are used.
[0409]
FIG. 65 is a block diagram illustrating a configuration of the estimated mixture ratio processing unit 401. The image input to the estimated mixture ratio processing unit 401 is supplied to the delay unit 501 and the adding unit 502.
[0410]
The delay circuit 221 delays the input image by one frame and supplies it to the adding unit 502. When the frame #n is input as an input image to the adding unit 502, the delay circuit 221 supplies the frame # n-1 to the adding unit 502.
[0411]
The adding unit 502 sets the pixel value of the pixel near the pixel for calculating the mixture ratio α and the pixel value of the frame # n−1 in a normal equation. For example, the adding unit 502 sets the pixel values Mc1 to Mc9 and the pixel values Bc1 to Bc9 in the normal equation based on the equations (50) to (58). The adding unit 502 supplies the normal equation in which the pixel value is set to the calculation unit 503.
[0412]
The computing unit 503 solves the normal equation supplied from the adding unit 502 by a sweeping method or the like to obtain an estimated mixture ratio, and outputs the obtained estimated mixture ratio.
[0413]
As described above, the estimated mixture ratio processing unit 401 can calculate the estimated mixture ratio based on the input image and supply the estimated mixture ratio to the mixture ratio determining unit 403.
[0414]
Note that the estimated mixture ratio processing unit 402 has the same configuration as the estimated mixture ratio processing unit 401, and thus description thereof is omitted.
[0415]
FIG. 66 is a diagram illustrating an example of the estimated mixture ratio calculated by the estimated mixture ratio processing unit 401. In the estimated mixture ratio shown in FIG. 66, the foreground motion v corresponding to an object moving at a constant speed is 11, and a result obtained by generating an equation with a block of 7 × 7 pixels as a unit is 1 line. Is shown.
[0416]
It can be seen that the estimated mixture ratio changes substantially linearly in the mixed region as shown in FIG.
[0417]
Next, the mixing ratio estimation processing by the model corresponding to the covered background area by the estimated mixing ratio processing unit 401 having the configuration shown in FIG. 65 will be described with reference to the flowchart of FIG.
[0418]
In step S521, the adding unit 502 sets the pixel value included in the input image and the pixel value included in the image supplied from the delay circuit 221 to a normal equation corresponding to the model of the covered background area. .
[0419]
In step S522, the estimated mixture ratio processing unit 401 determines whether or not the setting for the target pixel has been completed. If it is determined that the setting for the target pixel has not been completed, the process proceeds to step S521. Returning, the process of setting the pixel value to the normal equation is repeated.
[0420]
If it is determined in step S522 that the pixel value setting for the target pixel has been completed, the process proceeds to step S523, and the calculation unit 173 calculates the estimated mixture ratio based on the normal equation in which the pixel value is set. Then, the obtained estimated mixture ratio is output.
[0421]
As described above, the estimated mixture ratio processing unit 401 having the configuration shown in FIG. 65 can calculate the estimated mixture ratio based on the input image.
[0422]
The process of estimating the mixture ratio using the model corresponding to the uncovered background area is the same as the process shown in the flowchart of FIG. 67 using the normal equation corresponding to the model of the uncovered background area.
[0423]
Note that although the object corresponding to the background has been described as stationary, the above-described processing for obtaining the mixture ratio can be applied even if the image corresponding to the background area includes movement. For example, when the image corresponding to the background region is moving uniformly, the estimated mixture ratio processing unit 401 shifts the entire image corresponding to this movement, and is the same as when the object corresponding to the background is stationary. To process. Further, when the image corresponding to the background region includes a different motion for each local area, the estimated mixture ratio processing unit 401 selects a pixel corresponding to the motion as a pixel corresponding to the pixel belonging to the mixed region, and Execute the process.
[0424]
As described above, the mixture ratio calculation unit 102 can calculate the mixture ratio α, which is a feature amount corresponding to each pixel, based on the region information supplied from the region specifying unit 101 and the input image.
[0425]
By using the mixing ratio α, it becomes possible to separate the foreground component and the background component included in the pixel value while leaving the motion blur information included in the image corresponding to the moving object. .
[0426]
Also, by compositing images based on the mixture ratio α, it is possible to create an image including correct motion blur that matches the speed of a moving object as if the real world was actually recaptured.
[0427]
Next, the foreground / background separation unit 105 will be described. FIG. 68 is a block diagram illustrating an example of the configuration of the foreground / background separator 105. The input image supplied to the foreground / background separator 105 is supplied to the separator 601, the switch 602, and the switch 604. The information indicating the covered background area and the area information supplied from the area specifying unit 103 indicating the uncovered background area are supplied to the separation unit 601. Area information indicating the foreground area is supplied to the switch 602. Area information indicating the background area is supplied to the switch 604.
[0428]
The mixing ratio α supplied from the mixing ratio calculation unit 104 is supplied to the separation unit 601.
[0429]
The separation unit 601 separates the foreground components from the input image based on the region information indicating the covered background region, the region information indicating the uncovered background region, and the mixing ratio α, and synthesizes the separated foreground components. The background component is separated from the input image, and the separated background component is supplied to the synthesis unit 605.
[0430]
The switch 602 is closed when a pixel corresponding to the foreground is input based on the region information indicating the foreground region, and supplies only the pixel corresponding to the foreground included in the input image to the combining unit 603.
[0431]
The switch 604 is closed when a pixel corresponding to the background is input based on the region information indicating the background region, and supplies only the pixel corresponding to the background included in the input image to the combining unit 605.
[0432]
The combining unit 603 combines the foreground component image based on the component corresponding to the foreground supplied from the separation unit 601 and the pixel corresponding to the foreground supplied from the switch 602, and outputs the combined foreground component image. Since the foreground area and the mixed area do not overlap, the synthesis unit 603 synthesizes the foreground component image by applying a logical sum operation to the component corresponding to the foreground and the pixel corresponding to the foreground, for example.
[0433]
In the initialization process executed at the beginning of the foreground component image synthesis process, the synthesis unit 603 stores an image in which all pixel values are 0 in the built-in frame memory, and performs synthesis of the foreground component image. In the process, the foreground component image is stored (overwritten). Accordingly, 0 is stored as the pixel value in the pixel corresponding to the background area in the foreground component image output by the synthesis unit 603.
[0434]
The combining unit 605 combines the background component images based on the components corresponding to the background supplied from the separation unit 601 and the pixels corresponding to the background supplied from the switch 604, and outputs the combined background component image. Since the background area and the mixed area do not overlap, the synthesis unit 605 synthesizes the background component image by applying a logical sum operation to the component corresponding to the background and the pixel corresponding to the background, for example.
[0435]
In the initialization process executed at the beginning of the background component image synthesis process, the synthesis unit 605 stores an image in which all pixel values are 0 in the built-in frame memory, and performs synthesis of the background component image. In the processing, the background component image is stored (overwritten). Accordingly, 0 is stored as the pixel value in the pixel corresponding to the foreground area in the background component image output from the synthesis unit 605.
[0436]
FIG. 69 is a diagram illustrating an input image input to the foreground / background separator 105 and a foreground component image and a background component image output from the foreground / background separator 105.
[0437]
69 is a schematic diagram of an image to be displayed. FIG. 70 illustrates a time direction of pixels in one line including pixels belonging to the foreground area, pixels belonging to the background area, and pixels belonging to the mixed area corresponding to FIG. Figure 1 shows the developed model diagram.
[0438]
As shown in FIGS. 69 and 70, the background component image output from the foreground / background separation unit 105 is composed of pixels belonging to the background area and background components included in the pixels of the mixed area.
[0439]
As shown in FIGS. 69 and 70, the foreground component image output from the foreground / background separation unit 105 includes pixels belonging to the foreground area and foreground components included in the pixels of the mixed area.
[0440]
The pixel values of the pixels in the mixed region are separated into a background component and a foreground component by the foreground / background separation unit 105. The separated background components together with the pixels belonging to the background area constitute a background component image. The separated foreground components together with the pixels belonging to the foreground area constitute a foreground component image.
[0441]
Thus, in the foreground component image, the pixel value of the pixel corresponding to the background area is set to 0, and a meaningful pixel value is set to the pixel corresponding to the foreground area and the pixel corresponding to the mixed area. Similarly, in the background component image, the pixel value of the pixel corresponding to the foreground area is set to 0, and a meaningful pixel value is set to the pixel corresponding to the background area and the pixel corresponding to the mixed area.
[0442]
Next, a process performed by the separation unit 601 to separate the foreground components and the background components from the pixels belonging to the mixed area will be described.
[0443]
FIG. 71 is an image model showing foreground components and background components of two frames including a foreground corresponding to an object moving from left to right in the drawing. In the image model shown in FIG. 71, the foreground motion amount v is 4, and the number of virtual divisions is 4.
[0444]
In frame #n, the leftmost pixel and the fourteenth through eighteenth pixels from the left consist only of background components and belong to the background area. In frame #n, the second through fourth pixels from the left include a background component and a foreground component, and belong to the uncovered background area. In frame #n, the eleventh through thirteenth pixels from the left include a background component and a foreground component, and belong to the covered background area. In frame #n, the fifth through tenth pixels from the left consist of only the foreground components and belong to the foreground area.
[0445]
In frame # n + 1, the first through fifth pixels from the left and the eighteenth pixel from the left consist of only the background components, and belong to the background area. In frame # n + 1, the sixth through eighth pixels from the left include a background component and a foreground component, and belong to the uncovered background area. In frame # n + 1, the fifteenth through seventeenth pixels from the left include a background component and a foreground component, and belong to the covered background area. In frame # n + 1, the ninth through fourteenth pixels from the left consist of only the foreground components, and belong to the foreground area.
[0446]
FIG. 72 is a diagram illustrating processing for separating foreground components from pixels belonging to the covered background area. In FIG. 72, α1 to α18 are mixing ratios corresponding to the respective pixels in frame #n. In FIG. 72, the fifteenth through seventeenth pixels from the left belong to the covered background area.
[0447]
The pixel value C15 of the fifteenth pixel from the left in frame #n is expressed by equation (68).
[0448]
C15 = B15 / v + F09 / v + F08 / v + F07 / v
= α15 ・ B15 + F09 / v + F08 / v + F07 / v
= α15 ・ P15 + F09 / v + F08 / v + F07 / v (68)
[0449]
Here, α15 is the mixture ratio of the fifteenth pixel from the left in frame #n. P15 is the pixel value of the fifteenth pixel from the left in frame # n-1.
[0450]
Based on Expression (68), the sum f15 of the foreground components of the fifteenth pixel from the left in frame #n is expressed by Expression (69).
[0451]
f15 = F09 / v + F08 / v + F07 / v
= C15-α15 ・ P15 (69)
[0452]
Similarly, the foreground component sum f16 of the 16th pixel from the left in frame #n is expressed by Equation (70), and the foreground component sum f17 of the 17th pixel from the left in frame #n is expressed by Equation (70). (71)
[0453]
f16 = C16-α16 · P16 (70)
f17 = C17-α17 · P17 (71)
[0454]
In this way, the foreground component fc included in the pixel value C of the pixel belonging to the covered background area is calculated by Expression (72).
[0455]
fc = C-α · P (72)
[0456]
P is the pixel value of the corresponding pixel in the previous frame.
[0457]
FIG. 73 is a diagram illustrating processing for separating foreground components from pixels belonging to the uncovered background area. In FIG. 73, α1 to α18 are mixing ratios corresponding to the respective pixels in frame #n. In FIG. 73, the second through fourth pixels from the left belong to the uncovered background area.
[0458]
The pixel value C02 of the second pixel from the left in frame #n is expressed by Expression (73).
[0459]
C02 = B02 / v + B02 / v + B02 / v + F01 / v
= α2 ・ B02 + F01 / v
= α2 ・ N02 + F01 / v (73)
[0460]
Here, α2 is the mixture ratio of the second pixel from the left in frame #n. N02 is the pixel value of the second pixel from the left in frame # n + 1.
[0461]
Based on Expression (73), the sum f02 of the foreground components of the second pixel from the left in frame #n is expressed by Expression (74).
[0462]
f02 = F01 / v
= C02-α2 ・ N02 (74)
[0463]
Similarly, the sum f03 of the foreground components of the third pixel from the left in frame #n is expressed by Expression (75), and the sum f04 of the foreground components of the fourth pixel from the left of frame #n is expressed by Expression (75). (76)
[0464]
f03 = C03-α3 ・ N03 (75)
f04 = C04-α4 ・ N04 (76)
[0465]
In this way, the foreground component fu included in the pixel value C of the pixel belonging to the uncovered background area is calculated by Expression (77).
[0466]
fu = C-α ・ N (77)
[0467]
N is the pixel value of the corresponding pixel in the next frame.
[0468]
As described above, the separation unit 601 determines from the pixels belonging to the mixed region based on the information indicating the covered background region, the information indicating the uncovered background region, and the mixing ratio α for each pixel included in the region information. Foreground and background components can be separated.
[0469]
FIG. 74 is a block diagram illustrating an example of the configuration of the separation unit 601 that executes the processing described above. The image input to the separation unit 601 is supplied to the frame memory 621, and the region information indicating the covered background region and the uncovered background region supplied from the mixture ratio calculation unit 104, and the mixture ratio α are the separation processing block. It is input to 622.
[0470]
The frame memory 621 stores the input image in units of frames. When the object of processing is frame #n, the frame memory 621 is a frame that is the frame immediately after frame # n-1, frame #n, and frame #n. Remember # n + 1.
[0471]
The frame memory 621 supplies the pixels corresponding to the frame # n−1, the frame #n, and the frame # n + 1 to the separation processing block 622.
[0472]
The separation processing block 622 includes the frame # n−1, the frame #n, and the frame #n supplied from the frame memory 621 based on the area information indicating the covered background area and the uncovered background area, and the mixing ratio α. The calculation described with reference to FIGS. 72 and 73 is applied to the pixel value of the corresponding pixel of +1 to separate the foreground component and the background component from the pixels belonging to the mixed region of frame #n, and This is supplied to the memory 623.
[0473]
The separation processing block 622 includes an uncovered area processing unit 631, a covered area processing unit 632, a combining unit 633, and a combining unit 634.
[0474]
The multiplier 641 of the uncovered area processing unit 631 multiplies the mixing ratio α by the pixel value of the pixel of frame # n + 1 supplied from the frame memory 621 and outputs the result to the switch 642. The switch 642 is closed when the pixel of frame #n (corresponding to the pixel of frame # n + 1) supplied from the frame memory 621 is an uncovered background area, and the mixture ratio supplied from the multiplier 641 The pixel value multiplied by α is supplied to the calculator 643 and the synthesis unit 634. The value obtained by multiplying the pixel value of the pixel of frame # n + 1 output from the switch 642 by the mixing ratio α is equal to the background component of the pixel value of the corresponding pixel of frame #n.
[0475]
The computing unit 643 subtracts the background component supplied from the switch 642 from the pixel value of the pixel of frame #n supplied from the frame memory 621 to obtain the foreground component. The computing unit 643 supplies the foreground component of the pixel of frame #n belonging to the uncovered background area to the synthesis unit 633.
[0476]
The multiplier 651 of the covered area processing unit 632 multiplies the mixture ratio α by the pixel value of the pixel of frame # n−1 supplied from the frame memory 621 and outputs the result to the switch 652. The switch 652 is closed when the pixel of the frame #n supplied from the frame memory 621 (corresponding to the pixel of the frame # n−1) is the covered background region, and the mixture ratio α supplied from the multiplier 651 is The pixel value multiplied by is supplied to the calculator 653 and the combining unit 634. A value obtained by multiplying the pixel value of the pixel of frame # n−1 output from the switch 652 by the mixing ratio α is equal to the background component of the pixel value of the corresponding pixel of frame #n.
[0477]
The arithmetic unit 653 subtracts the background component supplied from the switch 652 from the pixel value of the pixel of frame #n supplied from the frame memory 621 to obtain the foreground component. The calculator 653 supplies the foreground components of the pixels of the frame #n belonging to the covered background area to the synthesis unit 633.
[0478]
The synthesizer 633 outputs the foreground components of the pixel belonging to the uncovered background area supplied from the calculator 643 and the foreground of the pixel belonging to the covered background area supplied from the calculator 653 of the frame #n. The components are combined and supplied to the frame memory 623.
[0479]
The combining unit 634 receives the background component of the pixel belonging to the uncovered background area supplied from the switch 642 and the background component of the pixel belonging to the covered background area supplied from the switch 652 of the frame #n. Combined and supplied to the frame memory 623.
[0480]
The frame memory 623 stores the foreground components and the background components of the pixels in the mixed area of the frame #n supplied from the separation processing block 622, respectively.
[0481]
The frame memory 623 outputs the stored foreground components of the pixels in the mixed area of frame #n and the stored background components of the pixels of the mixed area in frame #n.
[0482]
By using the mixture ratio α, which is a feature amount, it is possible to completely separate the foreground component and the background component included in the pixel value.
[0483]
FIG. 75 is a diagram illustrating an example of a foreground component image and an example of a background component image corresponding to frame #n in FIG.
[0484]
FIG. 75A shows an example of a foreground component image corresponding to frame #n in FIG. Since the leftmost pixel and the fourteenth pixel from the left consist of only background components before the foreground and the background are separated, the pixel value is set to zero.
[0485]
The second through fourth pixels from the left belong to the uncovered background area before the foreground and the background are separated, the background component is 0, and the foreground component is left as it is. The eleventh to thirteenth pixels from the left belong to the covered background area before the foreground and the background are separated, the background component is 0, and the foreground component is left as it is. The fifth through tenth pixels from the left are composed of only the foreground components and are left as they are.
[0486]
FIG. 75B shows an example of the background component image corresponding to frame #n in FIG. The leftmost pixel and the fourteenth pixel from the left are left as they are because they consisted only of the background components before the foreground and the background were separated.
[0487]
The second through fourth pixels from the left belong to the uncovered background area before the foreground and the background are separated, the foreground components are set to 0, and the background components are left as they are. The eleventh to thirteenth pixels from the left belong to the covered background area before the foreground and the background are separated, and the foreground components are set to 0 and the background components are left as they are. Since the fifth through tenth pixels from the left consist of only the foreground components before the foreground and the background are separated, the pixel value is set to zero.
[0488]
Next, foreground / background separation processing by the foreground / background separation unit 105 will be described with reference to the flowchart shown in FIG. In step S601, the frame memory 621 of the separation unit 601 obtains an input image, and determines the frame #n to be separated from the foreground and the background as the previous frame # n-1 and the subsequent frame # n + 1. Remember with.
[0489]
In step S 602, the separation processing block 622 of the separation unit 601 acquires the region information supplied from the mixture ratio calculation unit 104. In step S 603, the separation processing block 622 of the separation unit 601 acquires the mixture ratio α supplied from the mixture ratio calculation unit 104.
[0490]
In step S604, the uncovered area processing unit 631 extracts a background component from the pixel values of the pixels belonging to the uncovered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0491]
In step S605, the uncovered area processing unit 631 extracts the foreground components from the pixel values of the pixels belonging to the uncovered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0492]
In step S606, the covered area processing unit 632 extracts a background component from the pixel values of the pixels belonging to the covered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0493]
In step S607, the covered area processing unit 632 extracts the foreground components from the pixel values of the pixels belonging to the covered background area supplied from the frame memory 621 based on the area information and the mixture ratio α.
[0494]
In step S608, the synthesis unit 633 extracts the foreground components of the pixels belonging to the uncovered background area extracted in step S605 and the foreground components of the pixels belonging to the covered background area extracted in step S607. And synthesize. The synthesized foreground components are supplied to the synthesis unit 603. Further, the synthesizing unit 603 combines the pixels belonging to the foreground area supplied via the switch 602 with the foreground components supplied from the separating unit 601 to generate a foreground component image.
[0495]
In step S609, the synthesizer 634 extracts the background component of the pixel belonging to the uncovered background area extracted in step S604 and the background component of the pixel belonging to the covered background area extracted in step S606. And synthesize. The synthesized background component is supplied to the synthesis unit 605. Furthermore, the synthesis unit 605 synthesizes the pixels belonging to the background area supplied via the switch 604 and the background components supplied from the separation unit 601 to generate a background component image.
[0496]
In step S610, the synthesis unit 603 outputs the foreground component image. In step S611, the synthesis unit 605 outputs a background component image, and the process ends.
[0497]
As described above, the foreground / background separation unit 105 separates the foreground component and the background component from the input image based on the region information and the mixture ratio α, and the foreground component image including only the foreground component and the background A background component image consisting only of components can be output.
[0498]
Next, the removal of motion blur from the foreground component image will be described.
[0499]
FIG. 77 is a block diagram illustrating an example of the configuration of the motion blur removal unit 106. The motion vector and its position information supplied from the motion detection unit 102 are supplied to the processing unit determination unit 801, the modeling unit 802, and the calculation unit 805. The area information supplied from the area specifying unit 103 is supplied to the processing unit determination unit 801. The foreground component image supplied from the foreground / background separation unit 105 is supplied to the adding unit 804.
[0500]
The processing unit determination unit 801 generates a processing unit based on the motion vector, its position information, and region information, and supplies the generated processing unit to the modeling unit 802 and the adding unit 804.
[0501]
As shown in FIG. 78, the processing unit generated by the processing unit determination unit 801 starts from a pixel corresponding to the covered background area of the foreground component image, and moves in the direction of movement to the pixel corresponding to the uncovered background area. A continuous pixel lined up in a moving direction starting from a pixel lined up or a pixel corresponding to an uncovered background area to a pixel corresponding to a covered background area is shown. The processing unit is composed of, for example, two pieces of data: an upper left point (a pixel specified by the processing unit and located at the leftmost or uppermost pixel on the image) and a lower right point.
[0502]
The modeling unit 802 executes modeling based on the motion vector and the input processing unit. More specifically, for example, the modeling unit 802 previously stores a plurality of models corresponding to the number of pixels included in the processing unit, the number of virtual divisions of the pixel values in the time direction, and the number of foreground components for each pixel. A model that specifies the correspondence between the pixel value and the foreground components as shown in FIG. 79 is selected based on the processing unit and the number of virtual divisions of the pixel value in the time direction.
[0503]
For example, when the number of pixels corresponding to the processing unit is 12 and the amount of motion v within the shutter time is 5, the modeling unit 802 sets the virtual division number to 5 and sets the leftmost pixel to 1 The foreground component, the second pixel from the left contains the two foreground components, the third pixel from the left contains the three foreground components, and the fourth pixel from the left contains the four foreground components The fifth pixel from the left contains five foreground components, the sixth pixel from the left contains five foreground components, the seventh pixel from the left contains five foreground components, and eight from the left. The tenth pixel includes five foreground components, the ninth pixel from the left includes four foreground components, the tenth pixel from the left includes three foreground components, and the eleventh pixel from the left is 2 The foreground component, and the twelfth pixel from the left contains one foreground component. As a whole, a model consisting of eight foreground components is selected.
[0504]
Note that the modeling unit 802 generates a model based on the motion vector and the processing unit when the motion vector and the processing unit are supplied, instead of selecting from the models stored in advance. Also good.
[0505]
The modeling unit 802 supplies the selected model to the equation generation unit 803.
[0506]
The equation generation unit 803 generates an equation based on the model supplied from the modeling unit 802. Referring to the foreground component image model shown in FIG. 79, the number of foreground components is 8, the number of pixels corresponding to the processing unit is 12, the amount of motion v is 5, and the number of virtual divisions is 5 The equation generated by the equation generation unit 803 will be described.
[0507]
When the foreground components corresponding to the shutter time / v included in the foreground component image are F01 / v to F08 / v, the relationship between F01 / v to F08 / v and the pixel values C01 to C12 is expressed by Equations (78) to (78). It is represented by Formula (89).
[0508]
C01 = F01 / v (78)
C02 = F02 / v + F01 / v (79)
C03 = F03 / v + F02 / v + F01 / v (80)
C04 = F04 / v + F03 / v + F02 / v + F01 / v (81)
C05 = F05 / v + F04 / v + F03 / v + F02 / v + F01 / v (82)
C06 = F06 / v + F05 / v + F04 / v + F03 / v + F02 / v (83)
C07 = F07 / v + F06 / v + F05 / v + F04 / v + F03 / v (84)
C08 = F08 / v + F07 / v + F06 / v + F05 / v + F04 / v (85)
C09 = F08 / v + F07 / v + F06 / v + F05 / v (86)
C10 = F08 / v + F07 / v + F06 / v (87)
C11 = F08 / v + F07 / v (88)
C12 = F08 / v ・・・ (89)
[0509]
The equation generation unit 803 generates an equation by modifying the generated equation. Equations generated by the equation generation unit 803 are shown in equations (90) to (101).
[0510]
C01 = 1 ・ F01 / v + 0 ・ F02 / v + 0 ・ F03 / v + 0 ・ F04 / v + 0 ・ F05 / v
+0 ・ F06 / v + 0 ・ F07 / v + 0 ・ F08 / v ・・・ (90)
C02 = 1 ・ F01 / v + 1 ・ F02 / v + 0 ・ F03 / v + 0 ・ F04 / v + 0 ・ F05 / v
+0 ・ F06 / v + 0 ・ F07 / v + 0 ・ F08 / v ・・・ (91)
C03 = 1 ・ F01 / v + 1 ・ F02 / v + 1 ・ F03 / v + 0 ・ F04 / v + 0 ・ F05 / v
+0 ・ F06 / v + 0 ・ F07 / v + 0 ・ F08 / v ・・・ (92)
C04 = 1 ・ F01 / v + 1 ・ F02 / v + 1 ・ F03 / v + 1 ・ F04 / v + 0 ・ F05 / v
+0 ・ F06 / v + 0 ・ F07 / v + 0 ・ F08 / v (93)
C05 = 1 ・ F01 / v + 1 ・ F02 / v + 1 ・ F03 / v + 1 ・ F04 / v + 1 ・ F05 / v
+0 ・ F06 / v + 0 ・ F07 / v + 0 ・ F08 / v ・・・ (94)
C06 = 0 ・ F01 / v + 1 ・ F02 / v + 1 ・ F03 / v + 1 ・ F04 / v + 1 ・ F05 / v
+1 ・ F06 / v + 0 ・ F07 / v + 0 ・ F08 / v ・・・ (95)
C07 = 0 ・ F01 / v + 0 ・ F02 / v + 1 ・ F03 / v + 1 ・ F04 / v + 1 ・ F05 / v
+1 ・ F06 / v + 1 ・ F07 / v + 0 ・ F08 / v (96)
C08 = 0 ・ F01 / v + 0 ・ F02 / v + 0 ・ F03 / v + 1 ・ F04 / v + 1 ・ F05 / v
+1 ・ F06 / v + 1 ・ F07 / v + 1 ・ F08 / v (97)
C09 = 0 ・ F01 / v + 0 ・ F02 / v + 0 ・ F03 / v + 0 ・ F04 / v + 1 ・ F05 / v
+1 ・ F06 / v + 1 ・ F07 / v + 1 ・ F08 / v ・・・ (98)
C10 = 0 ・ F01 / v + 0 ・ F02 / v + 0 ・ F03 / v + 0 ・ F04 / v + 0 ・ F05 / v
+1 ・ F06 / v + 1 ・ F07 / v + 1 ・ F08 / v (99)
C11 = 0 ・ F01 / v + 0 ・ F02 / v + 0 ・ F03 / v + 0 ・ F04 / v + 0 ・ F05 / v
+0 ・ F06 / v + 1 ・ F07 / v + 1 ・ F08 / v ・・・ (100)
C12 = 0 ・ F01 / v + 0 ・ F02 / v + 0 ・ F03 / v + 0 ・ F04 / v + 0 ・ F05 / v
+0 ・ F06 / v + 0 ・ F07 / v + 1 ・ F08 / v (101)
[0511]
Expressions (90) to (101) can also be expressed as Expression (102).
[0512]
[Expression 17]

... (102)
[0513]
In Expression (102), j indicates the position of the pixel. In this example, j has any one value of 1 to 12. I indicates the position of the foreground value. In this example, i has any one value of 1 to 8. aij has a value of 0 or 1 corresponding to the values of i and j.
[0514]
When expressed in consideration of the error, Expression (102) can be expressed as Expression (103).
[0515]
[Formula 18]

... (103)
[0516]
In Expression (103), ej is an error included in the target pixel Cj.
[0517]
Expression (103) can be rewritten as Expression (104).
[0518]
[Equation 19]

... (104)
[0519]
Here, in order to apply the method of least squares, an error sum of squares E is defined as shown in Expression (105).
[0520]
[Expression 20]

... (105)
[0521]
In order to minimize the error, the partial differential value of the variable Fk with respect to the square sum E of the error may be zero. Fk is obtained so as to satisfy Expression (106).
[0522]
[Expression 21]

... (106)
[0523]
In Expression (106), since the motion amount v is a fixed value, Expression (107) can be derived.
[0524]
[Expression 22]

... (107)
[0525]
When equation (107) is expanded and transferred, equation (108) is obtained.
[0526]
[Expression 23]

... (108)
[0527]
This is expanded into eight equations obtained by substituting any one of integers 1 to 8 for k in equation (108). The obtained eight expressions can be expressed by one expression by a matrix. This equation is called a normal equation.
[0528]
An example of a normal equation generated by the equation generation unit 803 based on such a method of least squares is shown in Equation (109).
[0529]
[Expression 24]

... (109)
[0530]
When Expression (109) is expressed as A · F = v · C, C, A, v are known, and F is unknown. A and v are known at the time of modeling, but C is known by inputting a pixel value in the adding operation.
[0531]
By calculating the foreground component using a normal equation based on the method of least squares, the error included in the pixel C can be dispersed.
[0532]
The equation generation unit 803 supplies the normal equation generated in this way to the addition unit 804.
[0533]
The addition unit 804 sets the pixel value C included in the foreground component image to the matrix expression supplied from the equation generation unit 803 based on the processing unit supplied from the processing unit determination unit 801. The adding unit 804 supplies a matrix in which the pixel value C is set to the calculation unit 805.
[0534]
The calculation unit 805 calculates a foreground component Fi / v from which motion blur has been removed by processing based on a solution method such as a sweep-out method (Gauss-Jordan elimination method), and uses the foreground pixel values from which motion blur has been removed. A foreground component from which motion blur is removed, which is made up of Fi, which is a pixel value from which motion blur has been removed, as shown in the example in FIG. Output an image.
[0535]
In the foreground component image from which the motion blur shown in FIG. 80 is removed, each of F03 to F08 is set to each of C03 to C10, because the position of the foreground component image with respect to the screen is not changed. It can correspond to an arbitrary position.
[0536]
Thus, the motion blur removal unit 106 can remove the amount of motion blur based on the selection signal and the motion blur adjustment amount v ′.
[0537]
Next, a process for adjusting the amount of motion blur included in the foreground component image by the motion blur removal unit 106 will be described with reference to the flowchart in FIG.
[0538]
In step S801, the processing unit determination unit 801 of the motion blur removal unit 106 generates a processing unit based on the motion vector and the region information, and supplies the generated processing unit to the modeling unit 802.
[0539]
In step S802, the modeling unit 802 of the motion blur removal unit 106 selects and generates a model corresponding to the motion amount v and the processing unit. In step S803, the equation generation unit 803 creates a normal equation based on the selected model.
[0540]
In step S804, the adding unit 804 sets the pixel value of the foreground component image in the created normal equation. In step S805, the adding unit 804 determines whether or not the pixel values of all the pixels corresponding to the processing unit have been set, and if the pixel values of all the pixels corresponding to the processing unit have not been set. If it is determined, the process returns to step S804, and the process of setting the pixel value in the normal equation is repeated.
[0541]
If it is determined in step S805 that the pixel values of all the pixels in the processing unit have been set, the process advances to step S806, and the calculation unit 805 calculates a normal equation in which the pixel values supplied from the addition unit 804 are set. Based on this, the foreground pixel value from which the amount of motion blur is removed is calculated, and the process ends.
[0542]
As described above, the motion blur removal unit 106 can remove the amount of motion blur from the foreground image including the motion blur based on the motion vector and the region information.
[0543]
That is, it is possible to adjust the amount of motion blur included in the pixel value that is the sample data.
[0544]
FIG. 82 is a block diagram illustrating another example of the configuration of the motion blur removal unit 106. In FIG. The motion vector and its position information supplied from the motion detection unit 102 are supplied to the processing unit determination unit 901 and the correction unit 905, and the region information supplied from the region specifying unit 103 is supplied to the processing unit determination unit 901. . The foreground component image supplied from the foreground / background separation unit 105 is supplied to the calculation unit 904.
[0545]
The processing unit determination unit 901 generates a processing unit based on the motion vector, its position information, and region information, and supplies the generated processing unit to the modeling unit 902 together with the motion vector.
[0546]
The modeling unit 902 performs modeling based on the motion vector and the input processing unit. More specifically, for example, the modeling unit 902 previously stores a plurality of models corresponding to the number of pixels included in a processing unit, the number of virtual divisions of pixel values in the time direction, and the number of foreground components for each pixel. A model that specifies the correspondence between the pixel value and the foreground components as shown in FIG. 83 is selected based on the processing unit and the number of virtual divisions of the pixel value in the time direction.
[0547]
For example, when the number of pixels corresponding to the processing unit is 12 and the amount of motion v is 5, the modeling unit 902 sets the virtual division number to 5 and the leftmost pixel is one foreground component. The second pixel from the left contains two foreground components, the third pixel from the left contains three foreground components, the fourth pixel from the left contains four foreground components, and The fifth pixel contains five foreground components, the sixth pixel from the left contains five foreground components, the seventh pixel from the left contains five foreground components, and the eighth pixel from the left Contains 5 foreground components, the 9th pixel from the left contains 4 foreground components, the 10th pixel from the left contains 3 foreground components, and the 11th pixel from the left contains 2 foreground components And the twelfth pixel from the left contains one foreground component, Select a model consisting of two foreground components.
[0548]
Note that the modeling unit 902 generates a model based on the motion vector and the processing unit when the motion vector and the processing unit are supplied, instead of selecting from the models stored in advance. Also good.
[0549]
The equation generation unit 903 generates an equation based on the model supplied from the modeling unit 902.
[0550]
Referring to the foreground component image model shown in FIGS. 83 to 85, when the number of foreground components is 8, the number of pixels corresponding to the processing unit is 12, and the amount of motion v is 5, An example of an equation generated by the equation generation unit 903 will be described.
[0551]
When the foreground components corresponding to the shutter time / v included in the foreground component image are F01 / v to F08 / v, the relationship between F01 / v to F08 / v and the pixel values C01 to C12 is as described above. It represents with Formula (78) thru | or Formula (89).
[0552]
Paying attention to the pixel values C12 and C11, the pixel value C12 includes only the foreground component F08 / v as shown in the equation (110), and the pixel value C11 includes the foreground component F08 / v and the foreground component F07 / v. Consists of product sums of v. Therefore, the foreground component F07 / v can be obtained by Expression (111).
[0553]
F08 / v = C12 ・・・ (110)
F07 / v = C11-C12 ・・・ (111)
[0554]
Similarly, in consideration of the foreground components included in the pixel values C10 to C01, the foreground components F06 / v to F01 / v can be obtained by Expressions (112) to (117).
[0555]
F06 / v = C10-C11 (112)
F05 / v = C09-C10 (113)
F04 / v = C08-C09 ・・・ (114)
F03 / v = C07-C08 + C12 ・・・ (115)
F02 / v = C06-C07 + C11-C12 (116)
F01 / v = C05-C06 + C10-C11 (117)
[0556]
The equation generation unit 903 generates an equation for calculating a foreground component based on a difference in pixel values, examples of which are shown in equations (110) to (117). The equation generation unit 903 supplies the generated equation to the calculation unit 904.
[0557]
The calculation unit 904 sets the pixel value of the foreground component image in the equation supplied from the equation generation unit 903, and calculates the foreground component based on the equation in which the pixel value is set. For example, when the equations (110) to (117) are supplied from the equation generator 903, the arithmetic unit 904 sets the pixel values C05 to C12 in the equations (110) to (117).
[0558]
The calculation unit 904 calculates the foreground components based on the formula in which the pixel values are set. For example, the calculation unit 904 calculates the foreground components F01 / v to F08 / v as shown in FIG. 84 by calculations based on the equations (110) to (117) in which the pixel values C05 to C12 are set. . The calculation unit 904 supplies the foreground components F01 / v to F08 / v to the correction unit 905.
[0559]
The correction unit 905 multiplies the foreground component supplied from the calculation unit 904 by the motion amount v included in the motion vector supplied from the processing unit determination unit 901 to calculate a foreground pixel value from which motion blur has been removed. . For example, when the foreground components F01 / v to F08 / v supplied from the calculation unit 904 are supplied, the correction unit 905 has a motion amount v that is 5 for each of the foreground components F01 / v to F08 / v. , The foreground pixel values F01 to F08 from which motion blur is removed are calculated as shown in FIG.
[0560]
The correction unit 905 outputs the foreground component image that is calculated as described above and includes the foreground pixel values from which motion blur is removed.
[0561]
Thus, the motion blur removal unit 106 can remove the amount of motion blur based on the selection signal and the motion blur adjustment amount v ′.
[0562]
Next, foreground motion blur removal processing by the motion blur removal unit 106 having the configuration shown in FIG. 82 will be described with reference to the flowchart of FIG.
[0563]
In step S901, the processing unit determination unit 901 of the motion blur removal unit 106 generates a processing unit based on the motion vector and the region information, and supplies the generated processing unit to the modeling unit 902 and the correction unit 905.
[0564]
In step S902, the modeling unit 902 of the motion blur removal unit 106 selects and generates a model corresponding to the motion amount v and the processing unit. In step S903, the equation generation unit 903 generates an equation for calculating a foreground component based on a difference in pixel values of the foreground component image based on the selected or generated model.
[0565]
In step S904, the calculation unit 904 sets the pixel value of the foreground component image in the created equation, and extracts the foreground component from the difference in pixel value based on the equation in which the pixel value is set. In step S905, the calculation unit 904 determines whether all foreground components corresponding to the processing unit have been extracted, and if it is determined that all foreground components corresponding to the processing unit have not been extracted, Returning to step S904, the process of extracting the foreground components is repeated.
[0566]
If it is determined in step S905 that all foreground components corresponding to the processing unit have been extracted, the process advances to step S906, and the correction unit 905 supplies the foreground components supplied from the calculation unit 904 based on the motion amount v. Each of F01 / v through F08 / v is corrected to calculate foreground pixel values F01 through F08 from which motion blur has been removed.
[0567]
In step S907, the correction unit 905 calculates and outputs a foreground pixel value from which motion blur has been removed, and the process ends.
[0568]
As described above, the motion blur removal unit 106 having the configuration illustrated in FIG. 82 can remove motion blur from a foreground image including motion blur more quickly with simpler calculation.
[0569]
Conventional methods such as the Wiener filter that partially eliminates motion blur are effective in the ideal state, but they are quantized and not effective enough for actual images containing noise. In the motion blur removal unit 106 having the configuration shown in FIG. 82, a sufficient effect is recognized even for an actual image that is quantized and includes noise, and motion blur can be accurately removed.
[0570]
Next, the configuration of the synthesis unit 107 will be described with reference to FIG.
[0571]
The contour extraction unit 1001 extracts pixel information that is the contour of the foreground component image from which the motion blur input from the motion blur removal unit 106 has been removed, and outputs the pixel information to the contour smoothing unit 1002.
[0572]
The contour smoothing unit 1002 supplies the pixel value of the pixel in the contour portion based on the information on the pixel in the contour portion (pixel forming the contour) of the foreground component image from which the motion blur supplied from the contour extraction unit 1001 has been removed. Is supplied to the spatial mixing ratio generation unit 1003.
[0573]
That is, the foreground component image from which motion blur has been removed is an image with a mixing ratio of 1. The contour portion of the foreground component image having a mixing ratio of 1 has a stepped shape in units of pixels as shown by the solid line (thick line) in FIG.
[0574]
Therefore, the contour smoothing unit 1002 passes through each point of the central portion of one side as an end (for example, points B1 to B6 in the drawing) for each pixel of the stepped contour portion. A straight line is obtained by a regression method, and the contour portion is linearly approximated. For example, the contour is smoothed by a line segment shown by a dotted line in FIG.
[0575]
In other words, in FIG. 88, a straight line passing through each of the points B1 to B6 at the center of one side of the (contour) end of each pixel forming the contour of the foreground component image is set, and the slope of the straight line is regressed. The contour of the foreground component image is smoothed by the method (least square method). In FIG. 88, line segments OP, PQ, QR, RS, ST, and TU corresponding to the points B1 to B6 are indicated by dotted lines. As a result, the contour of the foreground component image changes from a step-like cross section to a smooth cross section (a surface having an obtuse angle of 90 degrees or more).
[0576]
Note that the smoothing by the contour smoothing unit 1002 is not only obtained using the straight line obtained by the regression method as described above, but the contour may be smoothed by, for example, spline approximation or polynomial approximation.
[0577]
The spatial mixing ratio generation unit 1003 calculates a spatial mixing ratio of pixels around the contour portion based on the contour of the smoothed foreground component image supplied from the contour smoothing unit 1002, and generates a foreground component image smoothing unit. 1004 and the region information adjustment unit 1006. That is, as shown in FIG. 89, the spatial mixing ratio generation unit 1003 obtains the spatial mixing ratio of the corresponding pixels from the contour shape of the smoothed foreground component image indicated by the dotted line. The spatial mixing ratio is a mixing ratio that occurs in a pixel. Since the mixing ratio that has been treated so far is temporally mixed, it is hereinafter also referred to as a temporal mixing ratio.
[0578]
For example, as shown in FIG. 89, assuming that the bold solid line is the original outline and the square surrounded by the thin line is a pixel near the outline, the left part of the original outline in the figure is the foreground component image. If there is, the conventional foreground component image spatially has a spatial mixing ratio of pixels around the contour indicated by the thick line in the figure, the pixel that becomes the contour of the foreground component image is 1, in this case The spatial mixing ratio of the pixel existing on the right side of the pixel serving as the contour of the foreground component image should be zero. In FIG. 89, the foreground component image region is indicated by a hatched portion, and other portions, that is, the background component image region is indicated by a solid color.
[0579]
The spatial mixing ratio generation unit 1003 obtains this spatial mixing ratio according to the smoothed contour shape. That is, the outline of the original foreground component image is a thick line shown in FIG. 89, and is an area indicated by a diagonal line on the left side of the thick line. The mixing ratio of the shaded area is 1, and the area to the right of the thick line is not the foreground component image (background area to which the background component image is added), that is, the mixing ratio when combining the images is This is a region that becomes zero.
[0580]
However, the region of the foreground component image is smoothed as indicated by the dotted line shown in FIG. 89 (FIG. 88 is the same). Therefore, the spatial mixing ratio generation unit 1003, based on the smoothed contour information, the ratio of the area of each pixel that the foreground component image occupies for each pixel that exists in the region where the line forming the contour straddles. To obtain the spatial mixing ratio.
[0581]
That is, assuming that the area of each pixel is 1, as shown in FIG. 90, the pixel on the upper left side in the drawing (the pixel having the center point B1 on one side of the edge portion to be the outline) is represented by a dotted line. With the contour shown, the area of the forward tilt component image indicated by the hatched portion is approximately 0.7, so the spatial mixing ratio is 0.7. Similarly, the pixel on the upper right side in the drawing (the pixel sharing the center point B1 of one side of the edge serving as the contour) becomes a forward tilt component image indicated by the hatched portion by the contour indicated by the dotted line. Since the area of the region is approximately 0.3, the spatial mixing ratio is 0.3. Similarly, the spatial mixing ratio of the two pixels sharing the second point B2 from the top in the figure is 0.9 and 0.1 from the left, depending on the area of each shaded portion. The spatial mixing ratio of the two pixels sharing the third point B3 from the left is 0.8 and 0.2 from the left, and the spatial mixing ratio of the two pixels sharing the fourth stage point B4 from the top is , 0.9 and 0.1 from the left, and the spatial mixing ratio of the two pixels sharing the fifth point B5 from the top is 0.7 and 0.3 from the left. The spatial mixing ratios of the two pixels sharing the sixth point B6 from the top and the three pixels including the left pixel are 0.95, 0.65, and 0.3 from the left.
[0582]
The foreground component image smoothing unit 1004 smooths the foreground component image based on the spatial mixing ratio supplied from the spatial mixing ratio generation unit 1003 and outputs the image to the motion blur adding unit 1005. That is, the foreground component image smoothing unit 1004 smooths the contour portion of the foreground component image based on the spatial mixing ratio corresponding to the contour of the smoothed foreground component image. More specifically, the foreground component image smoothing unit 1004 adjusts the pixel value by multiplying each pixel of the input foreground component image by the corresponding spatial mixing ratio, and smoothes the outline of the foreground component image as a whole. To do.
[0583]
Thus, by using the smoothed contour, the mixing ratio of each pixel becomes a value in which the spatial mixing ratio is considered in accordance with the smoothed contour from the binary expression of 1 and 0 so far. A value corresponding to the ratio of the area occupied by the foreground component image is set. As a result, the foreground component image is smoothed according to this spatial mixing ratio, so that the processing of the foreground component image that has been able to be expressed only in units of pixels up to the size of the pixel or smaller is performed. It becomes possible to execute.
[0584]
Note that, by smoothing the outline of the foreground component image, the area of the foreground component image (the area of the pixel where the spatial mixing ratio of the foreground component image is other than 0) is increased by several pixels in the horizontal direction or the vertical direction. It will be. Similarly, the pixel area including the background component image may be increased.
[0585]
The motion blur adding unit 1005 adds motion blur to the foreground component image whose contour is smoothed by the foreground component image smoothing unit 1004 based on the information on the motion vector supplied from the motion detection unit 102, and the time mixing ratio. The image is output to the calculation unit 1007, the mixed region image composition unit 1009, and the image composition unit 1010.
[0586]
That is, the motion blur adding unit 1005 has a motion blur adjustment amount v ′ having a value different from the motion amount v, for example, a motion blur adjustment amount v ′ having a value half the motion amount v, or a value unrelated to the motion amount v. By providing the motion blur adjustment amount v ′, the motion blur amount can be adjusted. For example, as shown in FIG. 91, the motion blur adding unit 1005 calculates the foreground component Fi / v ′ by dividing the foreground pixel value Fi from which motion blur is removed by the motion blur adjustment amount v ′. Then, the sum of the foreground components Fi / v ′ is calculated to generate a pixel value in which the amount of motion blur is adjusted. For example, when the motion blur adjustment amount v ′ is 3, the pixel value C02 is (F01) / v ′, the pixel value C03 is (F01 + F02) / v ′, and the pixel value C04 is (F01 + F02 + F03) / v ′, and the pixel value C05 is (F02 + F03 + F04) / v ′.
[0587]
For example, as shown in FIG. 92, when the number of pixels corresponding to the processing unit is 8 and the motion amount v is 4, the motion blur adding unit 1005 uses the matrix equation shown in Equation (118). Generate.
[0588]
[Expression 25]

... (118)
[0589]
The motion blur adding unit 1005 thus calculates an expression corresponding to the length of the processing unit in this way, and calculates Fi, which is a pixel value in which the amount of motion blur is adjusted. Similarly, for example, when the number of pixels included in the processing unit is 100, an expression corresponding to 100 pixels is generated and Fi is calculated.
[0590]
The area information adjustment unit 1006 adjusts the area information supplied from the area specifying unit 103 based on the information of the spatial mixing ratio supplied from the spatial mixing ratio generation unit 1003, and the adjusted area information is a time mixing ratio calculation unit. 1007 and the image composition unit 1010. That is, as described above, the area of the foreground component image is expanded by the spatial mixing ratio newly generated by smoothing the outline of the foreground component image as described above. The area information is adjusted so that the area of the part expands correspondingly.
[0591]
The time mixture ratio calculation unit 1007 is the same as the mixture ratio calculation unit 104 in FIG. 6, and smoothing is performed based on the foreground component image to which motion blur is added and the region information supplied from the region information adjustment unit 1006. Information on the time mixture ratio corresponding to the foreground component image thus obtained is supplied to the background component generation unit 1008.
[0592]
The background component generation unit 1008 generates a background component image based on the time mixing ratio and an arbitrary background image, and supplies the background component image to the mixed region image synthesis unit 1009.
[0593]
The mixed region image synthesis unit 1009 generates a mixed region composite image by synthesizing the mixed region image of the background component image and the foreground component image supplied from the background component generation unit 1007, and generates the generated mixed region composition The image is supplied to the image composition unit 1010.
[0594]
Based on the region information supplied from the region information adjustment unit 1006, the image composition unit 1010 combines the foreground component image to which motion blur is added by the motion blur addition unit 1005 and the mixed region composition supplied from the mixed region image composition unit 1009. An image and an arbitrary background image are combined to generate and output a combined image.
[0595]
Next, with reference to the flowchart in FIG. 93, the image composition processing by the composition unit 107 in FIG. 87 corresponding to the processing in step S16 in FIG. 23 will be described.
[0596]
In step S 921, the contour extraction unit 1001 extracts pixel information that is the contour of the foreground component image from which the motion blur input from the motion blur removal unit 106 has been removed, and outputs the pixel information to the contour smoothing unit 1002. That is, the contour extraction unit 1001 extracts, for example, information on pixels having points B1 to B6 shown in FIG. 89 and pixels in the vicinity thereof, and outputs the information to the contour smoothing unit 1002.
[0597]
In step S922, the contour smoothing lower part 1002 smoothes the contour part based on the pixel information of the contour part of the foreground component image from which the motion blur supplied from the contour extraction unit 1001 is removed, and generates a spatial mixing ratio generation unit. 1003. That is, based on the pixel information of the contour portion, the contour smoothing lower portion 1002 determines each point of the central portion of one side as an end (for example, points B1 to B1 in the figure) for each pixel of the staircase-shaped contour portion. For each B6), a regression line is obtained, and the contour portion is linearly approximated. For example, the contour is smoothed by a line segment as shown by a dotted line in FIG. In other words, in FIG. 88, a straight line passing through each of the points B1 to B6 at the center of one side of the (contour) end of each pixel forming the contour of the foreground component image is set, and the slope of the straight line is regressed. The line segments OP, PQ, QR, RS, ST, and TU corresponding to the points B1 to B6 obtained by the method (least square method) are indicated by dotted lines.
[0598]
More specifically, the contour smoothing lower part 1002 defines, for example, a straight line corresponding to each of the points B1 to B6 as a function as represented by the following equation (119), and the slope is set to the regression method (minimum self Multiplicative).
[0599]
y = Ax
... (119)
[0600]
Here, (x, y) is the coordinates of the points B1 to B6, and A is the inclination. In addition, the origin of the above equation (119) is the position of the above points B1 to B6 along which the straight line to be obtained passes. Therefore, for example, when the straight line to be obtained corresponds to the line segment OP passing through the point B1, the origin is the position of the point B1, and similarly, the straight line corresponds to the line segment PQ passing through the point B2. If it is, the origin is point B2. Accordingly, the coordinates of the points B1 to B6 vary depending on the position of the point through which the desired straight line passes.
[0601]
The least square method is a method for obtaining a slope A that minimizes the sum of squared differences Q between y ′ obtained by a set slope A and an actual y, which is defined by the following equation (120).
[0602]
[Equation 26]

... (120)
[0603]
Since the sum of squared differences Q expressed by the equation (120) is a quadratic function, the variable A (slope A) has a downwardly convex curve, and therefore, Amin with the minimum slope A is the solution of the least squares method. Become.
[0604]
The differential sum of squares Q expressed by the equation (120) is differentiated by the variable A to be (Roud dQ) / (Roud dA) expressed by the following equation (121).
[0605]
[Expression 27]

... (121)
[0606]
In Formula (121), since A that becomes 0 is Amin that takes the minimum value of the sum of squared differences Q, by expanding the formula when Formula (121) becomes 0, the slope of the following Formula (122) is obtained. A will be required.
[0607]
[Expression 28]

... (122)
[0608]
The above equation (122) is a so-called one-variable (gradient A) normal equation.
[0609]
In this way, a straight line (each line segment indicated by a dotted line in FIG. 89) that is a line segment of each point is obtained by the obtained slope A, and the line segment corresponding to each point smoothes the contour of the foreground component image. Turn into. As a smoothing method, not only a line segment corresponding to each point is obtained by a regression method (least square method), but also a curve obtained by spline approximation using each point may be used, or a polynomial approximation may be used. You may make it obtain | require the curve used as an outline.
[0610]
In step S923, the spatial mixing ratio generation unit 1003 obtains the spatial mixing ratio based on the smoothed contour information supplied from the contour smoothing unit 1002. That is, as described above, the area of the foreground component image on each pixel in the vicinity of the contour is based on the contour obtained by smoothing the staircase contour indicated by the thick line in FIG. 88 as indicated by the dotted line in FIG. 90, the spatial mixing ratio of pixels in the vicinity of the contour is obtained and output to the foreground component image smoothing unit 1007 and the region information adjustment unit 1006, as shown in FIG.
[0611]
In step S924, the foreground component image smoothing unit 1004 applies the foreground component image from which motion blur has been removed based on the spatial mixing ratio information of the pixels in the contour portion of the foreground component image supplied from the spatial mixing ratio generation unit 1003. Then, the smoothing process is performed, and the smoothed foreground component image is output to the motion blur adding unit 1005. That is, the foreground component image smoothing unit 1004 smoothes the contour of the foreground component image by multiplying the spatial mixture ratio by the pixel value of each pixel of the foreground component image (actually, only the pixels forming the contour). To do.
[0612]
In step S925, the motion blur adding unit 1005 adds motion blur to the foreground component image smoothed as described above with reference to FIGS. 91 and 92 based on the motion vector supplied from the motion detection unit 102. As a result, the motion blur originally included in the input image is added and output to the time mixture ratio calculation unit 1007, the mixed region image composition unit 1009, and the image composition unit 1010.
[0613]
In step S926, the region information generation unit 1006 adjusts the region information supplied from the region specifying unit 103 based on the information of the spatial mixing ratio supplied from the spatial mixing ratio generation unit 1003, and the adjusted region information is converted to time. It outputs to the mixture ratio calculation unit 1007 and the image composition unit 1010. That is, as described above, the area of the foreground component image is expanded to pixels through which the line segment that becomes the smoothed outline passes due to the spatial mixing ratio newly generated by smoothing the outline of the foreground component image. Therefore, the area information adjustment unit 1006 adjusts so that the area of the outline portion of the foreground component image is expanded.
[0614]
In step S927, the time mixture ratio calculation unit 1007 is based on the foreground component image to which motion blur is added and the region information supplied from the region information adjustment unit 1006, similarly to the mixture ratio calculation unit 104 in FIG. A background component image is generated from an input arbitrary background image and supplied to the mixed region image composition unit 1009.
[0615]
In step S928, the background component generation unit 1008 generates a background component image based on the time mixture ratio and an arbitrary background image, and supplies the background component image to the mixed region image composition unit 1009.
[0616]
In step S929, the mixed region image combining unit 1009 generates a mixed region combined image by combining the background component image supplied from the background component generating unit 1007 and the foreground component image with motion blur added thereto, The generated mixed area composite image is supplied to the image composition unit 1010.
[0617]
Based on the region information supplied from the region information adjustment unit 1006, the image composition unit 1010 combines the foreground component image to which motion blur is added by the motion blur addition unit 1005 and the mixed region composition supplied from the mixed region image composition unit 1022. An image and an arbitrary background image are combined to generate and output a combined image.
[0618]
With the above processing, the contour portion of the foreground component image is not the contour formed in units of pixels, but the smoothed foreground component image is synthesized in consideration of the spatial mixture ratio, so that the mixture ratio shown in FIG. In addition, it is possible to generate a composite image with no sense of incongruity in consideration of the continuity of the space of the stepped foreground component image. In FIG. 94, white indicates a foreground component image mixing ratio of 1, and black indicates a foreground component image mixing ratio of 0, that is, an area other than the foreground component image (for example, a background component image area). Accordingly, the intermediate colors of white and black indicate that the region is a mixed region, and the portion closer to black indicates that the mixing ratio is higher.
[0619]
FIG. 95 shows the distribution of pixel values of the synthesized image based on the mixing ratio of the pixels of the foreground component image and the background component image in FIG. The distribution of pixel values on the coordinate position is shown. The higher the value, the higher the pixel value.
[0620]
As shown in FIG. 95, the pixel values smoothly change near the boundary between the foreground component image and the background component image by the above processing. For this reason, it is shown that a synthesized image that is natural in appearance is generated in the synthesized image without causing breakdown in the mixed region.
[0621]
As a result, it is possible to generate a composite image in which the foreground component image and the background component image as shown in FIG. 96 are combined in a more natural state. In FIG. 96, as shown in the lower part of the figure, the substantially left part is the foreground component image and the substantially right part is the background component image.
[0622]
In the above, the spatial mixing ratio of the pixels forming the outline of the foreground component image is smoothed by obtaining the spatial mixing ratio according to the ratio of the area on the pixels of the outline by smoothing the outline of the foreground component image, Although an example in which the foreground component image and the background component image are naturally synthesized has been described, the foreground component image and the background component image are spatially mixed as described above for the edge pixels that form the outline of the foreground component image. Often doing. Therefore, for the pixels forming the contour of the foreground component image, it is assumed that the foreground component image and the background component image are spatially mixed, and the spatial mixing ratio is obtained by extrapolation from pixels other than the end portions. Also good.
[0623]
In FIG. 97, for the pixels forming the contour of the foreground component image, the foreground component image and the background component image are spatially mixed, and the spatial mixing ratio is obtained by extrapolation from pixels other than the end portions. The configuration of the combining unit 107 is shown.
[0624]
Note that the same components as those shown in FIG. 87 are denoted by the same reference numerals, and description thereof will be omitted as appropriate.
[0625]
The synthesizing unit 107 in FIG. 97 performs basically the same function as the synthesizing unit 107 in FIG. 87, but instead of the contour extracting unit 1001, the contour smoothing unit 1002, and the spatial mixing ratio generating unit 1003, A contour neighboring pixel extraction unit 1101, an extrapolation interpolation unit 1102, and a spatial mixing ratio generation unit 1103 are provided, and the region information adjustment unit 1006 is further deleted.
[0626]
The contour neighboring pixel extraction unit 1101 extracts information about an edge portion that is a contour of the input foreground component image (foreground image without motion blur) from which motion blur has been removed and pixels in the vicinity thereof, and an extrapolation interpolation unit 1102. To supply.
[0627]
The extrapolation interpolation unit 1102 obtains a pixel value consisting only of the foreground component image of the pixels forming the contour of the foreground component image based on the contour of the foreground component image supplied from the contour neighboring pixel extraction unit 1101 and its neighboring pixels. Information obtained by extrapolation and information on the pixels forming the contour of the foreground component image and pixels in the vicinity of the contour are supplied to the spatial mixing ratio generation unit 1103.
[0628]
That is, when the pixel value of the foreground component image and the pixel value of the background component image are shown in the left part of FIG. 98, the corresponding spatial mixing ratio is as shown in the right part of FIG. The area where the component image pixels exist is 1, and the area of the background component image is 0. This is based on the premise that there is no spatial mixing in the pixels that form the contour of the foreground component image, and the pixels that form the contour of the foreground component image shown in the right part of FIG. It is assumed that the background component image component is not included (no mixing occurs). In the left part of FIG. 98, the third vertical bar from the left indicates the pixel value level of the foreground component image pixel, and the second vertical bar from the right indicates the pixel of the background component image pixel. Indicates the level of the value. Therefore, the third vertical bar from the left is an end pixel that forms the outline of the foreground component image.
[0629]
However, as shown in FIG. 88, the pixels that form the contour of the foreground component image often have a contour (edge) in the pixel, and the foreground component image and the background component image are actually mixed. It is considered a thing.
[0630]
The extrapolation interpolation unit 1102 uses the pixel values of the end pixels that form the contour of the foreground component image, and the foreground of the end portion that forms the contour of the foreground component image by extrapolation using pixels near the end. Obtain (estimate) a pixel value consisting only of the component image components. For example, as shown in the left part of FIG. 99, it is assumed that the pixels of the foreground component image are continuously arranged in the horizontal direction or the vertical direction by three pixels from the left.
[0631]
At this time, the level of the pixel indicated by the vertical bar corresponding to the third pixel from the left in the left part of FIG. 99 is a pixel value in which the foreground component image and the background component image are spatially mixed. Since the level of the second pixel is only the level of the foreground component image, the original pixel value (the foreground component image) of the pixel forming the contour of the foreground component image (third pixel from the left in the right part of FIG. 99) Only pixel values) are values extrapolated based on the pixel values from the left in the upper right part in FIG. 99 (values estimated by extrapolation).
[0632]
More specifically, the extrapolation interpolation unit 1102 determines the pixel values in the upper right part in FIG. 99 from the pixel values of a plurality of pixels (for example, the pixels from the upper left part in the upper right part in FIG. 99) forming the contour of the foreground component image. A function corresponding to the straight line indicated by the dotted line is obtained, and only the foreground component image of the pixel (the third pixel from the left in the upper right part in FIG. 99) that forms the contour of the foreground component image using the function is obtained. Obtain the pixel value. In the upper right part of FIG. 99, a vertical bar painted in black indicates the pixel value of the input foreground component image (pixel value of the pixel in which the foreground component image and the background component image are spatially mixed). A vertical bar obtained by summing up the black and gray areas indicates the pixel value (estimated pixel value) of only the foreground component image obtained by extrapolation.
[0633]
Note that the neighboring pixels of the pixels forming the contour of the foreground component image used for extrapolation are adjacent to the direction of the motion to be added in the process of adding motion blur described later, as viewed from the pixels forming the contour. Pixels are used. In this way, continuity between the pixels constituting the foreground component image is maintained, so that natural motion blur can be added to the foreground component image, and further, it can be combined with an arbitrary background image. Thus, a more natural composite image can be generated. In addition, when obtaining a function required for extrapolation, the extrapolation interpolation unit 1102 has been described as using two pixels in the vicinity of the pixels forming the contour of the foreground component image. The above pixels may be used.
[0634]
The spatial mixing ratio generation unit 1103 generates a spatial mixing ratio of the pixels forming the contour by using the pixel values obtained by extrapolation of the pixels forming the contour of the input foreground component image, and smoothes the foreground component image. Output to the unit 1004.
[0635]
That is, the spatial mixing ratio generation unit 1103 obtains the ratio of the pixel value of the pixel that forms the contour of the input foreground component image to the pixel that forms the contour of the foreground component image obtained by extrapolation, and uses this as the space. The mixture ratio is output to the foreground component image smoothing unit 1004.
[0636]
More specifically, the pixel value (pixel value obtained by extrapolation) indicated by the black and gray portions of the pixel value of the third pixel from the left in the upper right part of FIG. 99 is Pp. When the pixel value (pixel value of the input foreground component image) indicated by the portion painted only in black is Pr, the spatial mixture ratio generation unit 1103 obtains Pr / Pp as the spatial mixture ratio.
[0637]
By such processing, the spatial mixing ratio in which spatial mixing is considered becomes a distribution as shown in the lower right part of FIG. That is, the spatial mixing ratio of the pixels from the left to the second is 1, the mixing ratio of the third pixel from the left is Pr / Pp, and the spatial mixing ratio of the fourth and fifth pixels from the left is 0. It becomes.
[0638]
Next, image combining processing by the combining unit 107 in FIG. 97 will be described with reference to the flowchart in FIG. Note that the processing in steps S944 to S949 in FIG. 100 is the same as the processing in steps S924, S925, 927 to 930 described with reference to the flowchart in FIG.
[0639]
In step S 941, the contour neighboring pixel extraction unit 1101 extracts the pixels forming the contour and the neighboring pixels from the input foreground component image, and outputs them to the extrapolation interpolation unit 1102.
[0640]
That is, the contour neighboring pixel extraction unit 1101, for example, as described with reference to FIG. 99, the third pixel from the left in the left part of FIG. 99 is a pixel that forms the contour of the foreground component image. The pixel itself and the second pixel from the left are extracted as pixels in the vicinity of the pixel forming the contour of the foreground component image. The contour neighboring pixel extraction unit 1101 executes such processing for all the pixels forming the contour of the foreground component image, and extrapolates the information of the pixels forming the extracted contour and the neighboring pixels. Output to the unit 1102.
[0641]
In step S 942, the extrapolation interpolation unit 1102 forms a contour based on information about the pixels forming the contour of the foreground component image supplied from the contour neighboring pixel extraction unit 1101 and the pixel values of the neighboring pixels. Are generated by extrapolation.
[0642]
That is, the extrapolation interpolation unit 1102 uses, for example, the first and second pixels from the left in the upper right part of FIG. 99 as the pixels in the vicinity of the pixels that form the contour of the foreground component image. 99, by linearly approximating changes in the pixel values of neighboring pixels as indicated by the dotted line in the upper right part of 99, and by extrapolating using the straight line obtained by the approximation, black and gray in the upper right part of FIG. The pixel value consisting of a vertical bar consisting of the area filled in is obtained (estimated) as the pixel value of the pixel forming the contour of the foreground component image, and the pixel value input from the contour neighboring pixel extraction unit 1101 is input. This is supplied to the spatial mixing ratio generation unit 1103 together with the pixel values of the pixels forming the contour of the foreground component image.
[0643]
In step S943, the spatial mixture ratio generation unit 1103 calculates the pixel value of the pixel that forms the contour of the input foreground component image with respect to the pixel value of the pixel that forms the contour of the foreground component image obtained by extrapolation. The ratio is obtained as a spatial mixing ratio for each pixel forming the outline of all foreground component images, and is output to the foreground component image smoothing unit 1004.
[0644]
That is, as described above, the spatial mixing ratio generation unit 1103 performs pixel values (extrapolated interpolation) indicated by the black and gray portions of the pixel values of the third pixel from the left in the upper right part of FIG. (Pixel value obtained by the above) is Pp, and the pixel value (pixel value of the input foreground component image) indicated by the portion painted only in black is Pr. Find and output Pp as the spatial mixing ratio.
[0645]
As a result of such processing, the spatial mixing ratio in which spatial mixing is considered, as shown in the lower right part of FIG. 99, the spatial mixing ratio of the second pixel from the left becomes 1, and the third pixel from the left The mixing ratio is Pr / Pp, and the spatial mixing ratio of the fourth and fifth pixels from the left is zero.
[0646]
According to the above, the staircase-like failure phenomenon that has occurred by expressing the contour of the foreground component image in units of pixels takes into account the spatial mixing ratio, and thus accurately represents the pixel values of the pixels forming the contour. Even if the foreground component image and an arbitrary background component image are combined, a visually natural combined image can be generated.
[0647]
The sensor is not limited to the CCD, and may be a solid-state image sensor, for example, a sensor such as BBD (Bucket Brigade Device), CID (Charge Injection Device), or CPD (Charge Priming Device). The sensor is not limited to a sensor arranged in a matrix, and may be a sensor in which detection elements are arranged in a line.
[0648]
The series of processes described above can be executed by hardware, but can also be executed by software. When a series of processes is executed by software, a program constituting the software may execute various functions by installing a computer incorporated in dedicated hardware or various programs. For example, it is installed from a recording medium in a general-purpose personal computer or the like.
[0649]
FIG. 101 shows a configuration of an embodiment of a personal computer when the electrical internal configuration of the image processing unit in FIG. 6 is realized by software. A CPU 2001 of the personal computer controls the overall operation of the personal computer. Further, when a command is input from the input unit 2006 such as a keyboard or a mouse from the user via the bus 2004 and the input / output interface 2005, the CPU 2001 is stored in a ROM (Read Only Memory) 2002 correspondingly. Run the program. Alternatively, the CPU 2001 reads a program read from the magnetic disk 2021, the optical disk 2022, the magneto-optical disk 2023, or the semiconductor memory 2024 connected to the drive 2010 and installed in the storage unit 2008 into a RAM (Random Access Memory) 2003. To load and execute. Thereby, the function of the image processing unit in FIG. 6 described above is realized by software. Further, the CPU 2001 controls the communication unit 2009 to communicate with the outside, and exchange data.
[0650]
As shown in FIG. 101, the recording medium on which the program is recorded is distributed to provide the program to the user separately from the computer. The magnetic disk 2021 (including the flexible disk) on which the program is recorded is distributed. By a package medium composed of an optical disk 2022 (including compact disc-read only memory (CD-ROM), DVD (digital versatile disk)), a magneto-optical disk 2023 (including MD (mini-disc)), or a semiconductor memory 2024 In addition to being configured, it is configured by a ROM 2002 on which a program is recorded, a hard disk included in the storage unit 2008, and the like provided to the user in a state of being incorporated in a computer in advance.
[0651]
In this specification, the step of describing the program recorded on the recording medium is not limited to the processing performed in time series in the order described, but of course, it is not necessarily performed in time series. Or the process performed separately is included.
[0652]
【The invention's effect】
According to the present invention, it is possible to synthesize a foreground component image and an arbitrary background component image as a natural image.
[Brief description of the drawings]
FIG. 1 is a diagram showing a conventional composite image.
FIG. 2 is a diagram showing a distribution of pixel values of a conventional composite image.
FIG. 3 is a diagram illustrating a mixing ratio of a conventional composite image.
FIG. 4 is a diagram illustrating a pixel unit relationship between a foreground component image and a background component image of a conventional composite image.
FIG. 5 is a diagram illustrating a relationship between a foreground component image and a background component image of an original composite image in units of pixels.
FIG. 6 is a block diagram illustrating a configuration example of an image processing unit.
FIG. 7 is a diagram illustrating imaging by a sensor.
FIG. 8 is a diagram illustrating an arrangement of pixels.
FIG. 9 is a diagram illustrating the operation of a detection element.
FIG. 10 is a diagram illustrating an image obtained by imaging an object corresponding to a moving foreground and an object corresponding to a stationary background.
FIG. 11 is a diagram illustrating a background area, a foreground area, a mixed area, a covered background area, and an uncovered background area.
FIG. 12 is a model diagram in which pixel values of pixels arranged in a row adjacent to each other in an image obtained by capturing an object corresponding to a stationary foreground and an object corresponding to a stationary background are developed in the time direction; It is.
FIG. 13 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 14 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 15 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 16 is a diagram illustrating an example in which pixels in a foreground area, a background area, and a mixed area are extracted.
FIG. 17 is a diagram illustrating a correspondence between a pixel and a model in which pixel values are expanded in the time direction.
FIG. 18 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 19 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 20 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 21 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 22 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 23 is a flowchart illustrating a composite image generation process.
24 is a block diagram showing a configuration of a region specifying unit 103. FIG.
FIG. 25 is a diagram illustrating an image when an object corresponding to the foreground is moving.
FIG. 26 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 27 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 28 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 29 is a diagram illustrating region determination conditions.
30 is a diagram showing an example of the result of specifying a region by the region specifying unit 103. FIG.
FIG. 31 is a diagram illustrating an example of a result of specifying a region by the region specifying unit 103;
FIG. 32 is a flowchart illustrating an area specifying process.
33 is a block diagram illustrating another configuration of the area specifying unit 103. FIG.
FIG. 34 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 35 is a diagram illustrating an example of a background image.
36 is a block diagram showing a configuration of a binary object image extraction unit 302. FIG.
FIG. 37 is a diagram illustrating calculation of a correlation value.
FIG. 38 is a diagram illustrating calculation of a correlation value.
FIG. 39 is a diagram illustrating an example of a binary object image.
40 is a block diagram showing a configuration of a time change detection unit 303. FIG.
41 is a diagram for describing determination by an area determination unit 342. FIG.
FIG. 42 is a diagram illustrating an example of determination by the time change detection unit 303;
FIG. 43 is a flowchart for describing region specifying processing by the region determining unit 103;
FIG. 44 is a flowchart illustrating details of a region determination process.
45 is a block diagram showing still another configuration of the area specifying unit 103. FIG.
46 is a block diagram illustrating a configuration of a robust unit 361. FIG.
47 is a diagram for explaining motion compensation by a motion compensation unit 381. FIG.
48 is a diagram for explaining motion compensation of a motion compensation unit 381. FIG.
FIG. 49 is a flowchart illustrating an area specifying process.
FIG. 50 is a flowchart illustrating details of robust processing.
51 is a block diagram showing a configuration of a mixture ratio calculation unit 104. FIG.
52 is a block diagram showing a configuration of an estimated mixture ratio processing unit 401. FIG.
FIG. 53 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 54 is a model diagram illustrating weighted differences.
FIG. 55 is a model diagram illustrating weighted differences.
FIG. 56 is a model diagram illustrating the relationship between a weighted difference and a motion vector.
FIG. 57 is a model diagram illustrating calculation of a correlation value.
FIG. 58 is a diagram illustrating an example of a block of pixels that are targets of correlation value calculation;
59 is a block diagram showing another configuration of the mixture ratio calculation unit 104. FIG.
FIG. 60 is a flowchart illustrating processing for calculating a mixture ratio.
[Fig. 61] Fig. 61 is a flowchart for describing mixing ratio estimation processing corresponding to a covered background region.
FIG. 62 is a diagram illustrating a straight line approximating the mixture ratio α.
FIG. 63 is a diagram illustrating a plane that approximates the mixture ratio α.
FIG. 64 is a diagram for explaining the correspondence between pixels of a plurality of frames when calculating the mixture ratio α.
65 is a block diagram showing another configuration of the mixture ratio estimation processing unit 401. FIG.
FIG. 66 is a diagram illustrating an example of an estimated mixture ratio.
[Fig. 67] Fig. 67 is a flowchart for describing processing of mixture ratio estimation using a model corresponding to a covered background region.
68 is a block diagram illustrating an example of a configuration of a foreground / background separation unit 105. FIG.
FIG. 69 is a diagram illustrating an input image, a foreground component image, and a background component image.
FIG. 70 is a diagram illustrating an input image, a foreground component image, and a background component image.
FIG. 71 is a model diagram in which pixel values are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 72 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 73 is a model diagram in which pixel values are developed in the time direction and a period corresponding to a shutter time is divided.
74 is a block diagram illustrating an example of a configuration of a separation unit 601. FIG.
FIG. 75 is a diagram illustrating an example of a separated foreground component image and background component image.
FIG. 76 is a flowchart for describing foreground and background separation processing;
77 is a block diagram illustrating an example of a configuration of a motion blur removal unit 106. FIG.
FIG. 78 is a diagram illustrating a processing unit.
FIG. 79 is a model diagram in which pixel values of a foreground component image are developed in a time direction and a period corresponding to a shutter time is divided.
FIG. 80 is a model diagram in which pixel values of a foreground component image are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 81 is a flowchart illustrating processing for removing motion blur included in the foreground component image by the motion blur removal unit 106;
82 is a diagram showing another configuration of the motion blur removing unit 106. FIG.
FIG. 83 is a diagram illustrating an example of a model that designates correspondence between pixel values and foreground components.
FIG. 84 is a diagram illustrating calculation of foreground components.
FIG. 85 is a diagram illustrating calculation of foreground components.
FIG. 86 is a flowchart for describing foreground motion blur removal processing;
87 is a block diagram illustrating an exemplary configuration of a combining unit 107. FIG.
[Fig. 88] Fig. 88 is a diagram for describing a smoothing process for a contour portion of a foreground component image.
[Fig. 89] Fig. 89 is a diagram for describing smoothing processing for a contour portion of a foreground component image.
FIG. 90 is a diagram for describing smoothing processing for a contour portion of a foreground component image.
FIG. 91 is a model diagram in which pixel values of a foreground component image are expanded in a time direction and a period corresponding to a shutter time is divided.
FIG. 92 is a model diagram in which pixel values of a foreground component image are developed in the time direction and a period corresponding to a shutter time is divided.
FIG. 93 is a flowchart for describing image composition processing;
FIG. 94 is a diagram illustrating a mixing ratio of synthesized images.
FIG. 95 is a diagram illustrating a distribution of pixel values of a combined image.
FIG. 96 is a diagram showing a synthesized image.
97 is a block diagram illustrating another example of the configuration of the combining unit 107. FIG.
FIG. 98 is a diagram illustrating a correspondence between a pixel value level of a foreground component image and a mixing ratio.
FIG. 99 is obtained by extrapolating the pixel value level of the pixels forming the contour of the foreground component image, and obtaining the mixture ratio based on the pixel value obtained by extrapolation and the pixel value of the input image. It is a figure explaining a method.
FIG. 100 is a flowchart for describing image composition processing;
Fig. 101 is a diagram illustrating a recording medium.
[Explanation of symbols]
21 CPU, 22 ROM, 23 RAM, 26 input section, 27 output section, 28 storage section, 29 communication section, 51 magnetic disk, 52 optical disk, 53 magneto-optical disk, 54 semiconductor memory, 101 object extraction section, 102 motion detection section , 103 area specifying unit, 104 mixing ratio calculation unit, 105 foreground / background separation unit, 106 motion blur removal unit, 107 synthesis unit, 201 frame memory, 202-1 to 202-4 static motion determination unit, 203-1 to 203- 3 area determination unit, 204 determination flag storage frame memory, 205 composition unit, 206 determination flag storage frame memory, 301 background image generation unit, 302 binary object image extraction unit, 303 time change detection unit, 321 correlation value calculation unit, 322 Threshold processing unit, 341 frame memory, 342 region determination unit, 3 1 Robust Unit, 381 Motion Compensator, 382 Switch, 383-1 to 383-N Frame Memory, 384-1 to 384-N Weight Unit, 385 Accumulator, 401 Estimated Mixture Ratio Processor, 402 Estimated Mixture Ratio Processor , 403 mixing ratio determination unit, 421 frame memory, 422 weight generation unit, 423 weighted frame difference calculation unit, 424 motion compensation unit, 425 frame memory, 426 correlation value calculation unit, 427 maximum value determination unit, 441 selection unit, 442 selection , 601 separating unit, 602 switch, 603 combining unit, 604 switch, 605 combining unit, 621 frame memory, 622 separation processing block, 623 frame memory, 631 uncovered region processing unit, 632 covered region processing unit, 633 combining unit, 634 Synthesizer, 801 Unit determination unit, 802 modeling unit, 803 equation generation unit, 804 addition unit, 805 calculation unit, 901 processing unit determination unit, 902 modeling unit, 903 equation generation unit, 904 calculation unit, 905 correction unit, 1001 contour extraction , 1002 contour smoothing unit, 1003 spatial mixing ratio generation unit, 1004 foreground component image smoothing unit, 1005 motion blur addition unit, 1006 region information adjustment unit, 1007 time mixing ratio calculation unit, 1008 background component generation unit, 1009 mixing region Image composition unit, 1010 Image composition unit, 1101 Contour neighborhood pixel extraction unit, 1102 Extrapolation interpolation unit, 1103 Spatial mixing ratio generation unit

Claims

Pixel detection means for detecting pixels at the edge of the contour in the direction of motion to be added from the foreground image without motion blur;
The pixel value of the edge pixel detected by the pixel detection means and the estimated pixel value estimated by extrapolating from the surrounding pixels in the foreground image located around the edge pixel A mixing ratio detecting means for detecting the ratio and setting the mixing ratio at the end;
Motion blur adding means for generating a motion blur added foreground image obtained by adding a predetermined motion blur to the foreground image;
An image processing apparatus comprising: a combining unit configured to combine the foreground image with a background image based on the mixture ratio of the motion blur added foreground image and the end portion.

A pixel detection step for detecting a pixel at an end of a contour portion in a direction of a motion to be added from a foreground image without motion blur;
The pixel value of the edge pixel detected by the processing of the pixel detection step and the estimated pixel value estimated by extrapolating from the surrounding pixels in the foreground image located around the edge pixel And a mixing ratio detection step for detecting the ratio to the mixing ratio at the end,
A motion blur adding step for generating a motion blur added foreground image by adding a predetermined motion blur to the foreground image;
And a synthesis step of synthesizing the foreground image with a background image based on the mixture ratio of the motion blur added foreground image and the edge.

A pixel detection step for detecting a pixel at an end of a contour portion in a direction of a motion to be added from a foreground image without motion blur;
The pixel value of the edge pixel detected by the processing of the pixel detection step and the estimated pixel value estimated by extrapolating from the surrounding pixels in the foreground image located around the edge pixel And a mixing ratio detection step for detecting the ratio to the mixing ratio at the end,
A motion blur adding step for generating a motion blur added foreground image by adding a predetermined motion blur to the foreground image;
And a synthesis step of synthesizing the foreground image with a background image based on the mixture ratio of the motion blur added foreground image and the end portion.

A pixel detection step for detecting a pixel at an end of a contour portion in a direction of a motion to be added from a foreground image without motion blur;
The pixel value of the edge pixel detected by the processing of the pixel detection step and the estimated pixel value estimated by extrapolating from the surrounding pixels in the foreground image located around the edge pixel And a mixing ratio detection step for detecting the ratio to the mixing ratio at the end,
A motion blur adding step for generating a motion blur added foreground image by adding a predetermined motion blur to the foreground image;
A program causing a computer to execute a synthesis step of synthesizing the foreground image with a background image based on the mixture ratio of the motion blur added foreground image and the end portion.