JP2004104331A

JP2004104331A - Image processor, image processing method, recording medium, and program

Info

Publication number: JP2004104331A
Application number: JP2002261542A
Authority: JP
Inventors: Akishi Sato; 佐藤　晶司; Hidehiko Sekizawa; 關沢　英彦
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-09-06
Filing date: 2002-09-06
Publication date: 2004-04-02
Anticipated expiration: 2022-09-06
Also published as: JP4129786B2

Abstract

<P>PROBLEM TO BE SOLVED: To write arbitrary characters or the like which can be stereoscopically viewed into an image which can be stereoscopically viewed. <P>SOLUTION: In a step S51, a text input area is displayed so as to be overlapped on a stereoscopic image displayed on an main area. When the user inputs arbitrary characters or the like into the text input area, the characters are overlapped on the stereoscopic image so that the inputted characters can be stereoscopically viewed in a predetermined scenographic manner. When it is decided that a "Near" button was depressed in a step S52, the characters are overlapped on the stereoscopic image so that the inputted characters can be stereoscopically viewed nearer in a scenographic manner in a step S53. When it is decided that a "Far" button was depressed in a step S54, the characters are overlapped on the stereoscopic image so that the inputted characters can be stereoscopically viewed farther in a scenographic manner in a step S55. This method may be applicable to an image processing program. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は、画像処理装置および方法、記録媒体、並びにプログラムに関し、例えば、左眼用の画像と右眼用の画像を元にして立体視画像を生成する場合に用いて好適な画像処理装置および方法、記録媒体、並びにプログラムに関する。
【０００２】
【従来の技術】
従来、人の左右の眼がそれぞれ取得する網膜像の空間的ずれ（両眼視差）を利用して、２次元の画像を立体的に視認させる方法（以下、立体視の方法と記述する）が数多く知られている。
【０００３】
立体視の方法としては、特殊なメガネを利用するアナグリフ方式、カラーアナグリフ方式、偏光フィルタ方式、時分割立体テレビジョン方式等と、特殊なメガネを利用しないレンチキュラ方式等が知られている（いずれについても、例えば、非特許文献１参照）。
【０００４】
アナグリフ方式は、例えば、左眼用の画像（以下、Ｌ画像と記述する）を赤色モノトーン画像に変換し、右眼用の画像（以下、Ｒ画像と記述する）を青色モノトーン画像に変換して、赤色モノトーン画像と青色モノトーン画像を重ね合わせる。そして、その画像を、左側に赤色フィルタ、右側に青色フィルタが配置されたメガネ（以下、赤青メガネと記述する）を用いて見る方法である。アナグリフ方式は、比較的容易で安価に実施できることが、画像全体がモノトーンとなってしまう。
【０００５】
カラーアナグリフ方式は、アナグリフ方式の短所を補うものであり、Ｌ画像とＲ画像を重ね合わせたとき、それぞれの画像において対応する箇所がずれる部分（すなわち、視差が大きい部分）については、アナグリフ方式と同様に、例えば、Ｌ画像を赤色モノトーン画像に変換し、Ｒ画像を青色モノトーン画像に変換して重ね合わせる。それぞれの画像の対応する箇所がずれない部分（すなわち、視差が小さい部分）については、本来の色の状態で重ね合わせる。そして、その画像を、赤青メガネを用いて見る方法である。
【０００６】
カラーアナグリフ方式では、画像全体のうち、視差が少ない部分については、本来の色を再現することができる。なお、カラーアナグリフ方式には、視差が少ない部分に用いる色の違いにより、複数のバリエーションが存在する。以下、視差が小さい部分に本来の色を用いる方式を、第１のカラーアナグリフ方式と記述する。また、視差が小さい部分に本来の色を用いない方式を、第２のカラーアナグリフ方式と記述する。
【０００７】
偏光フィルタ方式は、例えば、垂直方向の直線偏光によって投影されたＬ画像と、水平方向の直線偏光によって投影されたＲ画像を重ね合わせる。そして、その画像を、左側に垂直方向の直線偏光フィルタ、右側に水平方向の直線偏光フィルタが配置された偏光フィルタメガネを用いて見る方法である。偏光フィルタ方式は、色の再現性がよく、解像度が高いことが長所であるが、偏光フィルタを利用することにより画像が暗くなってしまう短所がある。
【０００８】
時分割立体テレビジョン方式は、テレビジョン受像機に、Ｌ画像とＲ画像を、フィールド周期毎に交互に表示するようにし、その映像を、テレビジョン受像機のフィールド周期に同期して左眼側と右眼側を交互に開閉する液晶シャッタメガネを用いて見る方法である。時分割立体テレビジョン方式は、液晶シャッタメガネの開閉動作を高い精度で制御することが重要となる。
【０００９】
レンチキュラ方式は、画面を縦方向のストライプ状の領域に区分し、各ストライプ上の領域に交互に、Ｌ画像とＲ画像を表示し、その映像を、レンチキュラスクリーンと称されるレンズで覆う方法である。
【００１０】
ところで、上述したさまざまな立体視の方法を実現するためには、Ｌ画像とＲ画像を取得する必要がある。Ｌ画像とＲ画像を取得するためには、同一の被写体を、カメラの位置を人の両眼の間隔だけ移動して２回撮影する方法が最も容易である。
【００１１】
また、１回の撮影でＬ画像とＲ画像を取得する方法として、例えば、図１に示すように、ミラーなどから構成される光学アダプタ１１を、カメラ１の撮影レンズ３の外側に取り付ける方法が知られている（例えば、特許文献１参照）。
【００１２】
図２は、光学アダプタ１１の構造を模式的に表している。単一の採光窓から入射される右眼用の光学像は、ミラー２１によってミラー２２に向けて反射され、ミラー２２によって撮影レンズ３に向けて反射された後、撮影レンズ３によって集光される。単一の採光窓から入射される左眼用の光学像は、ミラー２１，２２によって反射されることなく、撮影レンズ３によって集光される。
【００１３】
光学アダプタ１１を介して入射された光学像は、図３に示すように、左眼用の領域および右眼用の領域からなる画像（以下、視差画像と記述する）として撮影される。この左眼用の領域がＬ画像として利用され、右眼用の領域がＲ画像として利用される。
【００１４】
【非特許文献１】
泉武博監修、ＮＨＫ放送技術研究所編「３次元映像の基礎」オーム出版、平成７年６月５日発行
【特許文献１】
特開平１１−４６３７３号公報
【００１５】
【発明が解決しようとする課題】
ところで、立体的に視覚できる画像に任意の文字や記号等を書き込み、書き込んだ文字なども立体的に視覚できるようにすればユーザにとって遊戯性が増すが、従来、そのような技術は存在していない課題があった。
【００１６】
本発明はこのような状況に鑑みてなされたものであり、立体的に視覚できる画像に任意の文字や記号等を書き込み、書き込んだ文字なども立体的に視覚できるようにすることを目的とする。
【００１７】
【課題を解決するための手段】
本発明の画像処理装置は、左眼用の画像および右眼用の画像に基づいて立体視画像を表示するための画像データを生成する生成手段と、受付手段によって受け付けられた書き込み情報を受け付ける受付手段と、書き込み情報に対応する画像を、奥行きを付与して、立体視画像に合成する合成手段とを含むことを特徴とする。
【００１８】
前記書き込み情報は、文字、記号、線画のうちの少なくとも１つを含むようにすることができる。
【００１９】
本発明の画像処理装置は、書き込み情報に対応する画像の奥行きを変更する変更手段をさらに含むことができる。
【００２０】
本発明の画像処理装置は、生成手段によって生成された画像データ、受付手段によって受け付けられた書き込み情報、および書き込み情報に対応する画像に付与された奥行きを対応付けて記憶する記憶手段をさらに含むことができる。
【００２１】
本発明の画像処理方法は、左眼用の画像および右眼用の画像に基づいて立体視画像を表示するための画像データを生成する生成ステップと、受付ステップの処理で受け付けられた書き込み情報を受け付ける受付ステップと、書き込み情報に対応する画像を、奥行きを付与して立体視画像に合成する合成ステップとを含むことを特徴とする。
【００２２】
本発明の記録媒体のプログラムは、左眼用の画像および右眼用の画像に基づいて立体視画像を表示するための画像データを生成する生成ステップと、受付ステップの処理で受け付けられた書き込み情報を受け付ける受付ステップと、書き込み情報に対応する画像を、奥行きを付与して、立体視画像に合成する合成ステップとを含むことを特徴とする。
【００２３】
本発明のプログラムは、左眼用の画像および右眼用の画像に基づいて立体視画像を表示するための画像データを生成する生成ステップと、受付ステップの処理で受け付けられた書き込み情報を受け付ける受付ステップと、書き込み情報に対応する画像を、奥行きを付与して、立体視画像に合成する合成ステップとを含む処理をコンピュータに実行させることを特徴とする。
【００２４】
本発明の画像処理装置および方法、並びにプログラムにおいて、左眼用の画像および右眼用の画像に基づいて立体視画像を表示するための画像データが生成され、受け付けられた書き込み情報に対応する画像が、奥行きが付与されて、立体視画像に合成される。
【００２５】
【発明の実施の形態】
図４は、本発明を適用した立体視システムの構成例を示している。この立体視システムは、主に、立体視画像を生成するパーソナルコンピュータ（ＰＣ）３１、表示された立体視画像を見るときにユーザが使用するフィルタメガネ４１、およびパーソナルコンピュータ３１の表示部５７の表示面外側に配置するライン偏光４３から構成される。
【００２６】
パーソナルコンピュータ３１は、図１に示されたように、光学アダプタ１１が装着された状態のカメラ（ディジタルスチルカメラ）１によって撮影された視差画像、光学アダプタ１１が装着されていない状態で撮影された画像などを取り込み、視差画像または連続的に撮影された２枚の画像からなる画像対を元にして立体視画像を生成して表示する。なお、ディジタルスチルカメラ１からパーソナルコンピュータ３１に取り込まれる視差画像等の画像データには、撮影された日時、撮影された順序を示すシリアルなファイル番号、連写モードで撮影されたか否かを示す連写モードフラグなどの属性情報が付与されている。
【００２７】
フィルタメガネ４１は、パーソナルコンピュータ３１に接続された支持棒４２により、パーソナルコンピュータ３１のキーボード付近の上方空間に位置するように支持されている。フィルタメガネ４１の左側の枠には、垂直方向の直線偏光フィルタが配置されている。また、右側の枠には、水平方向の直線偏光フィルタが配置されている。
【００２８】
パーソナルコンピュータ３１の表示部５７の表示面外側に配置するライン偏光板４３は、水平方向の偶数ラインに垂直方向の直線偏光フィルタが、水平方向の奇数ラインに水平方向の直線偏光フィルタが配置されている。
【００２９】
フィルタメガネ４１、支持棒４２、およびライン偏光板４３は、例えば、図１の光学アダプタ１１、並びに画像処理プログラム６５（図１０を参照して後述する）などとセットにされて販売される。
【００３０】
次に、パーソナルコンピュータ３１によって立体視画像が生成される過程の概要について、図５乃至図８を参照して説明する。
【００３１】
図５は、光学アダプタ１１が装着されたディジタルスチルカメラ１によって撮影された視差画像を示している。同図において、視差画像の右眼用の領域は、ミラーによって反射されているものであり、左眼用の領域は、ミラーによって反射されていないものであるとする。そこで、以下においては、視差画像の右眼用の領域をミラー画像、左眼用の領域をスルー画像と記述する。
【００３２】
上述したように、ミラー画像は、スルー画像に比較して、画質（輝度、彩度、解像度等）が劣化している。特に、ミラー画像の周辺部（上辺部、下辺部、および左辺部）は、中心部に比較して輝度が低下している。また、ミラー画像は、本来、矩形である画像が台形に歪んだものとなる。そこで、始めにミラー画像の画質が補正され（詳細については、図１４を参照して後述する）、画像の形状の歪みが補正される。次に、図６に示すように、ミラー画像およびスルー画像がそれぞれトリミングされて、Ｒ画像およびＬ画像が生成される。
【００３３】
次に、図７に示すように、次式（１）に従って、Ｌ画像とＲ画像が合成されて、図８に示すような立体視画像が生成される。

【００３４】
以上で、立体視画像が生成される過程の概要についての説明を終了する。生成された立体視画像は、表示部５７に表示される。図９に示すように、ユーザは、表示部５７に表示された立体視画像を、フィルタメガネ４１およびライン偏光板４３を介して見ることになる。したがって、ユーザの左眼は、立体視画像の偶数ライン、すなわち、１ライン置きのＬ画像を見ることになり、ユーザの右眼は、立体視画像の奇数ライン、すなわち、１ライン置きのＲ画像を見ることになる。よって、ユーザは、立体視画像を立体的に視認することが可能となる。
【００３５】
次に、図１０は、画像処理プログラム６５を実行することにより、立体視画像を生成する処理等を実行するパーソナルコンピュータ３１の構成例を示している。
【００３６】
パーソナルコンピュータ３１は、ＣＰＵ（Ｃｅｎｔｒａｌ　Ｐｒｏｃｅｓｓｉｎｇ　Ｕｎｉｔ）５１を内蔵している。ＣＰＵ５１には、バス５４を介して、入出力インタフェース５５が接続されている。バス５４には、ＲＯＭ（Ｒｅａｄ　Ｏｎｌｙ　Ｍｅｍｏｒｙ）５２およびＲＡＭ（Ｒａｎｄｏｍ　Ａｃｃｅｓｓ
Ｍｅｍｏｒｙ）５３が接続されている。
【００３７】
入出力インタフェース５５には、ユーザが操作コマンドを入力するキーボード、マウスなどの入力デバイスよりなる操作入力部５６、ＧＵＩ（Ｇｒａｐｈｉｃａｌ　Ｕｓｅｒ　Ｉｎｔｅｒｆａｃｅ）や生成される立体視画像等を表示するＬＣＤ（Ｌｉｑｕｉｄ　Ｃｒｙｓｔａｌ　Ｄｉｓｐｌａｙ）等よりなる表示部５７、各種のプログラムやデータを格納するハードディスクドライブなどよりなる記憶部５８、およびインタネット等のネットワークを介してデータを通信する通信部５９が接続されている。また、入出力インタフェース５５には、磁気ディスク６１、光ディスク６２、光磁気ディスク６３、および半導体メモリ６４などの記録媒体に対してデータを読み書きするドライブ６０が接続されている。
【００３８】
ＣＰＵ５１は、ＲＯＭ５２に記憶されているプログラム、または磁気ディスク６１乃至半導体メモリ６４から読み出されて記憶部６５に記憶され、記憶部６５からＲＡＭ５３にロードされたプログラムに従って各種の処理を実行する。ＲＡＭ５３にはまた、ＣＰＵ５１が各種の処理を実行する上において必要なデータなども適宜記憶される。
【００３９】
このパーソナルコンピュータに立体視画像を生成する処理等を実行させる画像処理プログラム６５は、磁気ディスク６１（フレキシブルディスクを含む）、光ディスク６２（ＣＤ−ＲＯＭ（Ｃｏｍｐａｃｔ　Ｄｉｓｃ−Ｒｅａｄ　Ｏｎｌｙ　Ｍｅｍｏｒｙ）、ＤＶＤ（Ｄｉｇｉｔａｌ　Ｖｅｒｓａｔｉｌｅ　Ｄｉｓｃ）を含む）、光磁気ディスク６３（ＭＤ（Ｍｉｎｉ　Ｄｉｓｃ）を含む）、もしくは半導体メモリ６４に格納された状態でパーソナルコンピュータ３１に供給され、ドライブ６０によって読み出されて記憶部５８に内蔵されるハードディスクドライブにインストールされている。記憶部５８にインストールされている画像処理プログラム６５は、操作入力部５６に入力されるユーザからのコマンドに対応するＣＰＵ５１の指令によって、記憶部５８からＲＡＭ５３にロードされて実行される。
【００４０】
図１１は、ＣＰＵ５１が画像処理プログラムを実行することによって実現される機能ブロックの構成例示している。
【００４１】
ＧＵＩブロック７１は、ＧＵＩに対するユーザの操作に対応して、画像管理部７２乃至表示制御部７６を制御する。画像管理ブロック７２は、ＧＵＩブロック７１からの制御に従い、ディジタルカメラ１から取り込まれて記憶部５８等に記憶されている視差画像などの画像データを、画像処理プログラム６５が取り扱うデータとして管理する。また、画像管理ブロック７２は、視差画像に付加されている属性情報に、サムネイル画像（縮小画像）の画像データ、トリミングされる領域の位置を示す情報、画質が補正されるときの設定値を示す情報、上下方向の位置が調整されるときの調整値を示す情報等を追加する。
【００４２】
画像取得ブロック７３は、ＧＵＩブロック７１からの制御に従い、画像管理ブロック７２によって管理されている視差画像などの画像データを取得し、ベース画像選択ブロック７４に出力する。ベース画像選択ブロック７４は、ユーザの操作に対応するＧＵＩブロック７１からの制御に従い、画像処理ブロック７３から入力される画像のうち、ユーザによって選択された視差画像または画像対を立体視画像生成ブロック７５に出力する。また、ベース画像選択ブロック７４は、ユーザの操作に対応するＧＵＩブロック７１からの制御に従い、画像処理ブロック７３から入力される画像のうち、立体視画像の元とすることができる画像対を選択して、立体視画像生成ブロック７５に出力する。
【００４３】
立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、ベース画像選択ブロック７４から入力される視差画像または画像対を元に、立体視画像を生成して表示制御ブロック７６に出力する。表示制御ブロック７６は、ＧＵＩブロック７１からの制御に従い、ＧＵＩおよび生成される立体視画像の表示を制御する。
【００４４】
次に、視差画像を元にして立体視画像を生成する立体視画像生成処理について説明するが、その前に、画像処理プログラム６５に対応するウィンドウと当該ウィンドウ上のＧＵＩについて、図１２および図１３を参照して説明する。
【００４５】
図１２は、画像処理プログラム６５が起動されたときに表示されるウィンドウ１０１の表示例を示している。ウィンドウ１０１には、処理の対象とする画像のサムネイルなどが表示されるメインエリア１０２、および「立体視」ボタン１０４乃至「エンド」ボタン１０７が設けられている。
【００４６】
「画像取得」ボタン１０３は、立体視画像の元とする視差画像および画像対を選択するためのサムネイルをメインエリア１０２に表示させるときに押下される。「立体視画像」ボタン１０４は、選択された立体視画像（または画像対）を元にして立体視画像の生成を開始させるときに押下される。「ＬＲ置換」ボタン１０５は、立体視画像の元となるＬ画像とＲ画像を置換させるときに押下される。「印刷」ボタン１０６は、生成された立体視画像などをプリントアウトするときに押下される。「エンド」ボタン１０７は、画像処理プログラムを終了させるときに押下される。
【００４７】
図１３は、「画像取得」ボタン１０３が押下されたときのウィンドウ１０１の表示例を示している。メインエリア１０２には、ディジタルカメラ１から取り込まれた視差画像、通常の画像（連続的に撮影された画像対含む）などのサムネイル画像が表示される。メインエリア１０２の上側には、「視差画像選択」ボタン１１１、「画像対選択」ボタン１１２、および「画像対自動選択」ボタン１１３が設けられる。
【００４８】
「視差画像選択」ボタン１１１は、メインエリア１０２に表示されているサムネイル画像のうち、視差画像に対応するものを選択するときに押下される。すなわち、ユーザは、「視差画像選択」ボタン１１１を押下した後、視差画像に対応するサムネイルを１つだけ選択することができる。
【００４９】
「画像対選択」ボタン１１２は、メインエリア１０２に表示されているサムネイル画像のうち、立体視画像の元とする画像対に対応するものを選択するときに押下される。すなわち、ユーザは、「画像対選択」ボタン１１２を押下した後、画像対に対応するサムネイルを２つ選択することができる。
【００５０】
「画像対自動選択」ボタン１１３は、メインエリア１０２に表示されているサムネイル画像のうち、立体視画像の元とする画像対を自動的に選択させるときに押下される。
【００５１】
次に、視差画像を元にして立体視画像を生成する立体視画像生成処理について、図１４のフローチャートを参照して説明する。この立体視画像生成処理は、図１３に示されたメインエリア１０２のサムネイルのうち、視差画像に対応するものがユーザによって選択された後、「立体視画像」ボタン１０４が押下されたときに開始される。
【００５２】
ステップＳ１において、ベース画像選択ブロック７４は、画像処理ブロック７３から入力される画像データのうち、ＧＵＩブロック７１からの制御に従い、ユーザが選択する視差画像の画像データを立体視画像生成ブロック７５に出力する。
【００５３】
ステップＳ２において、立体視画像生成ブロック７５は、ミラー画像の周辺部の輝度を、実験結果に基づくルックアップテーブル（または関数）を用いて補正する。この補正に関する情報は、画像管理ブロック７２に出力され、画像管理ブロック７２により、視差画像の属性情報に追加される。
【００５４】
ここで、実験とは、画像処理プログラム６５の開発者側によって実施されたものである。具体的には、光学アダプタ１１を装着した状態のディジタルカメラ１で画角の全体を占める白壁等を撮影し、得られた視差画像のうちのミラー画像について、中心部と周辺部の画素の輝度を比較する。そして、その比較結果に基づき、周辺部の画素の輝度が、中心部の画素の輝度と一致するように、例えば、周辺部の座標を入力とし、当該座標に対する補正値を出力するようなルックアップテーブル（関数でもよい）を生成する。または、周辺部の輝度を入力とし、当該輝度に対する補正値を出力するようなルックアップテーブルを生成する。そして、生成したルックアップテーブルを画像処理プログラム６５に組み込むようにすればよい。画像処理プログラム６５には、ディジタルカメラ１の機種に対応して複数のルックアップテーブルを組み込むようにしてもよい。
【００５５】
なお、ステップＳ２において、スルー画像の輝度も修正するようにしてもよい。
【００５６】
また、ステップＳ２において、立体視画像生成ブロック７５は、台形に歪んでいるミラー画像の形状を補正する。なお、このミラー画像の形状の補正については、本出願人が特開２００２−３４０５４号として既に提案済であるので、その説明は省略する。
【００５７】
ステップＳ３において、立体視画像生成ブロック７５は、補正済のミラー画像とスルー画像との全体的な輝度を比較して、その比較結果に基づいて、ミラー画像の輝度を補正する。具体的には、ミラー画像とスルー画像のそれぞれにおいて、所定の複数（例えば、４点）のサンプリング点の輝度を加算し、ミラー画像における輝度の加算値と、スルー画像の輝度の加算値を比較して、その差がなくなるように、ミラー画像を補正する。この補正に関する情報も、画像管理ブロック７２に出力され、画像管理ブロック７２により、視差画像の属性情報に追加される。
【００５８】
例えば、ミラー画像における４サンプリング点の輝度の加算値が３５０であり、スルー画像の４サンプリング点の輝度の加算値が５００である場合、その差１５０をサンプリング点の数で除算した値（１５０／４）を、ミラー画像の全ての画素の輝度に加算する。
【００５９】
また、ステップＳ３において、立体視画像生成ブロック７５は、ミラー画像とスルー画像の全体的な彩度が一致するように、ミラー画像の色差を補正する。この補正に関する情報も、画像管理ブロック７２に出力され、画像管理ブロック７２により、視差画像の属性情報に追加される。
【００６０】
ステップＳ４において、立体視画像生成ブロック７５は、ミラー画像に対し、所定のエッジ強調処理を施して、画像の全体的なぼけを補正する。ステップＳ５において、立体視画像生成ブロック７５は、ユーザの操作に対応するＧＵＩブロック７１からの制御に従い、ミラー画像とスルー画像をそれぞれトリミングして、それぞれを、Ｌ画像とＲ画像に設定する。このトリミング位置に関する情報は、画像管理ブロック７２に出力され、画像管理ブロック７２により、視差画像の属性情報に追加される。
【００６１】
ステップＳ６において、立体視画像生成ブロック７５は、ユーザの操作に対応するＧＵＩブロック７１からの制御に従い、Ｌ画像とＲ画像の上下方向の位置を調整する。なお、Ｌ画像とＲ画像の上下方向の位置を調整する過程の情報は、表示制御ブロック７６に出力されて、その調整の様子がメインエリア１０２に表示される（詳細は図１６のフローチャートを参照して後述する）。
【００６２】
ステップＳ７において、立体視画像生成ブロック７５は、ステップＳ６の処理で上下方向の位置が調整されたＬ画像およびＲ画像を、式（１）に従って合成し、立体視画像を生成する。生成された立体視画像は、メインエリア１０２に表示される。また、生成された立体視画像の画像データは、画像管理ブロック７２により、元となった視差画像に対応付けて記憶される。
【００６３】
なお、ユーザからの所定の操作により、生成された立体視画像を表示部５７の全体を占めるように表示させることも可能である。以上で、立体視画像生成処理の説明を終了する。
【００６４】
ところで、画像処理プログラム６５によれば、２枚の画像からなる画像対を元にしても、立体視画像を生成することができる。この画像対は、図１３に示された「画像対選択」ボタン１１２を押下して、メインエリア１０２に表示されているサムネイル画像のうちの２つを選択することにより、ユーザが任意に選択することができる。
【００６５】
画像対が選択された後、「立体視画像」ボタン１０４が押下された場合、画像対の一方がＬ画像とされ、他方がＲ画像とされて、上述したステップＳ６以降の処理が開始されて、立体視画像が生成される。生成された立体視画像の画像データは、画像管理ブロック７２により、元となった画像対に対応付けて記憶される。
【００６６】
なお、立体視画像の元とする画像対を自動的に選択させることもできる。画像対を自動的に選択する処理について、図１５のフローチャートを参照して説明する。この画像対自動選択処理は、図１３に示された「画像対自動選択」１１３が押下されたときに開始される。また、この画像対自動選択処理は、画像処理ブロック７３からベース画像選択ブロック７４に入力される画像が１つずつ処理対象とされて実行される。
【００６７】
ステップＳ１１において、ベース画像選択ブロック７４は、処理対象の画像が連写モードで撮影されたものであるか否かを、属性情報に含まれる連写フラグを参照して判定する。処理対象の画像が連写モードで撮影されたものではないと判定された場合、処理はステップＳ１２に進む。
【００６８】
ステップＳ１２において、ベース画像選択ブロック７４は、処理対象の画像と、１枚後に撮影された画像との撮影日時の差が所定の閾値（例えば、数秒）以内であるか否かを判定する。撮影日時の差が所定の閾値以内であると判定された場合、処理はステップＳ１３に進む。
【００６９】
ステップＳ１３において、ベース画像選択ブロック７４は、処理対象の画像と、１枚後に撮影された画像との類似度を算出し、類似度が所定の閾値以内であるか否かを判定する。ここで、２枚の画像の類似度としては、例えば、それぞれから所定の１ライン分の画素を抽出し、対応する画素同士の差分の総和を算出するようにする。
【００７０】
処理対象の画像と１枚後に撮影された画像との類似度が所定の閾値以内であると判定された場合、処理対象の画像と１枚後に撮影された画像とは、同一の被写体を連続して撮影したものであると判断できるので、処理対象の画像と１枚後に撮影された画像とを画像対に設定する。
【００７１】
画像対に設定された２枚の画像のサムネイルは、画像対に設定されたことをユーザに通知するために、メインエリア１０２において、例えば、単一の太枠で囲まれる。また、画像対の情報は、画像管理ブロック７２に出力され、画像対のそれぞれの画像データの付加情報に、画像対の相手を示す情報（シリアル番号等）が追加される。あるいは、画像対をなす２つの画像データとそれぞれの属性情報を新たなフォルダを生成して記憶するようにしてもよい。
【００７２】
なお、ステップＳ１１において、処理対象の画像が連写モードで撮影されたものであると判定された場合、ステップＳ１２はスキップされ、処理はステップＳ１３に進む。
【００７３】
ステップＳ１２において、処理対象の画像と、１枚後に撮影された画像との撮影日時の差が所定の閾値以内ではないと判定された場合、この２枚の画像は、立体視画像の元とする画像対に適していないので、画像対自動選択処理は終了される。
【００７４】
ステップＳ１３において、処理対象の画像と、１枚後に撮影された画像との類似度が所定の閾値以内ではないと判定された場合にも、この２枚の画像は、立体視画像の元とする画像対に適していないので、画像対自動選択処理は終了される。以上で、画像対自動選択処理の説明を終了する。
【００７５】
この画像対自動選択処理が、画像処理ブロック７３からベース画像選択ブロック７４に入力される全ての画像を処理対象として実行された後に、複数の画像対が設定されている場合、ユーザは設定された複数の画像対のうち、１つの画像対を選択することができる。画像対がユーザによって選択された後、「立体視画像」ボタン１０４が押下された場合、上述したステップＳ６以降の処理が開始されて、立体視画像が生成される。生成された立体視画像の画像データは、画像管理ブロック７２により、元となった画像対に対応付けて記憶される。
【００７６】
ただし、画像対を元にして立体視画像を生成した場合、画像対の一方をＬ画像とし、他方をＲ画像とする設定が適切ではなかった場合（逆であった場合）、立体的に視認することはできない。このとき、ユーザが「ＬＲ置換」ボタン１０５を押下すれば、Ｌ画像とＲ画像が置換されて、立体的に視認できる立体視画像が再生成される。
【００７７】
また、「ＬＲ置換」ボタン１０５を押下して立体視画像を再生成させることは、表示された立体視画像に対して、ライン偏光板４３が１ライン分だけ上下方向にずれて設置されている場合にも有効に作用する。
【００７８】
次に、図１４のステップＳ６の処理、すなわち、Ｌ画像とＲ画像の上下方向の位置を調整する処理の詳細について、図１６のフローチャートおよび図１７を参照して説明する。
【００７９】
ステップＳ２１において、立体視画像生成ブロック７５は、ステップＳ６の処理で設定したＬ画像およびＲ画像のサムネイル画像を、画像管理ブロック７２から取得する。ステップＳ２２において、立体視画像生成ブロック７５は、Ｌ画像のサムネイル画像とＲ画像のサムネイル画像の同じ座標の画素を５０％ずつ加算して合成画像を生成する。
【００８０】
このとき、ウィンドウ１０１の表示は、例えば、図１７に示すようなものとなる。すなわち、メインエリア１０２には、Ｌ画像が表示されたＬ画像表示エリア１２１、Ｒ画像が表示されたＲ画像表示エリア１２２、および合成画像が表示された合成画像表示エリア１２３が設けられる。
【００８１】
メインエリア１０２の上方には、Ｌ画像に対するＲ画像の相対的な位置を上方に移動させて合成画像を再生成させるときに操作させる「Ｒ画像上方移動」ボタン１２４、Ｌ画像に対するＲ画像の相対的な位置を下方に移動させ合成画像を再生成させるときに操作される「Ｒ画像下方移動」ボタン１２５、およびＬ画像に対するＲ画像の相対的な位置の調整が終了したとき操作される「位置調整終了」ボタン１２６が設けられる。
【００８２】
図１４に戻る。ステップＳ２３において、ＧＵＩブロック７１は、「Ｒ画像上方移動」ボタン１２４または「Ｒ画像下方移動」ボタン１２５が押下されたか否かを判定する。「Ｒ画像上方移動」ボタン１２４または「Ｒ画像下方移動」ボタン１２５が押下されたと判定された場合、処理はステップＳ２４に進む。ステップＳ２４において、立体視画像生成ブロック７５は、ユーザの操作に従うＧＵＩブロック７１からの制御に基づき、Ｌ画像に対するＲ画像の相対的な位置を上方または下方に移動させて、合成画像を再生成する。
【００８３】
このとき、合成画像表示エリア１２３の表示は、再生成された合成画像に更新される。なお、ステップＳ２３において、「Ｒ画像上方移動」ボタン１２４および「Ｒ画像下方移動」ボタン１２５が押下されていない判定された場合、ステップＳ２４はスキップされ、処理はステップＳ２５に進む。
【００８４】
ステップＳ２５において、ＧＵＩブロック７１は、「位置調整終了」ボタン１２６が押下されたか否かを判定する。「位置調整終了」ボタン１２６が押下されていないと判定された場合、ステップＳ２３に戻り、それ以降の処理が繰り返される。
【００８５】
従って、ユーザは、合成画像表示エリア１２３に表示される合成画像を見ながら、「Ｒ画像上方移動」ボタン１２４または「Ｒ画像下方移動」ボタン１２５を押下することにより、Ｌ画像とＲ画像の上下方向の位置を調整することができる。なお、再生成されて表示される合成画像は、データ量の少ない縮小された画像であるので、この再生成の処理は速やかに実行される。
【００８６】
ステップＳ２５において、「位置調整終了」ボタン１２６が押下されたと判定された場合、処理はステップＳ２６に進む。ステップＳ２６において、立体視画像生成ブロック７５は、Ｌ画像とＲ画像の上下方向の位置の調整値を画像管理ブロック７２に出力する。画像管理ブロック７２は、この上下方向の位置の調整値を、視差画像の属性情報に追加する。なお、画像対がＬ画像とＲ画像とされている場合、この上下方向の位置の調整値は、画像対の両方の属性情報に追加される。この後、処理は、上述した図１４のステップＳ７に戻ることとなる。
【００８７】
以上で、Ｌ画像とＲ画像の上下方向の位置を調整する処理の説明を終了する。
【００８８】
なお、上述した説明においては、「Ｒ画像上方移動」ボタン１２４または「Ｒ画像下方移動」ボタン１２５を押下することにより、Ｌ画像に対するＲ画像の相対的な位置を上方または下方に移動させるようにしたが、例えば、合成画像表示エリア１２３に表示されている合成画像を、マウス（操作入力部５６）を用いて、上方または下方にドラッグアンドドロップすることにより、Ｌ画像に対するＲ画像の相対的な位置を上方または下方に移動させるようにしてもよい。
【００８９】
次に、図１８は、立体視画像が生成されたときのウィンドウ１０１の表示例を示している。メインエリア１０２には、生成された立体視画像が表示される。メインエリア１０２の上側には、メインエリア１０２の表示を、立体視画像から、立体視画像の元となったＬ画像およびＲ画像を用いて従来の立体視の方法によって立体的に視認することができる画像に変換させるときに押下される「アナグリフ」ボタン１３１乃至「液晶シャッタメガネ」１３５が設けられている。また、メインエリア１０２の左側には、生成された立体視画像に任意の文字、記号などを書き込むときに押下される「テキスト入力」ボタン１３６が追加して設けられている。
【００９０】
「アナグリフ」ボタン１３１は、立体視画像の元となったＬ画像およびＲ画像を用いてアナグリフ方式の画像（以下、アナグリフ画像と記述する）を生成させるときに押下される。「カラーアナグリフ１」ボタン１３２は、立体視画像の元となったＬ画像およびＲ画像を用いて第１のカラーアナグリフ方式の画像（以下、第１のカラーアナグリフ画像と記述する）を生成させるときに押下される。「カラーアナグリフ２」ボタン１３３は、立体視画像の元となったＬ画像およびＲ画像を用いて第２のカラーアナグリフ方式の画像（以下、第２のカラーアナグリフ画像と記述する）を生成させるときに押下される。これらの場合、ユーザは、赤青メガネを用いて画像を見る必要がある。
【００９１】
「レンチキュラ」ボタン１３４は、立体視画像の元となったＬ画像およびＲ画像を用いてレンチキュラ方式の画像（以下、第１のレンチキュラ画像と記述する）を生成させるときに押下される。この場合、ユーザは、レンチキュラスクリーンを介して画像を見る必要がある。
【００９２】
「液晶シャッタメガネ」ボタン１３５は、立体視画像の元となったＬ画像およびＲ画像を時分割立体テレビジョン方式で表示させるときに押下される。この場合、ユーザは、表示部５７のフィールド周期に同期して左眼側と右眼側を交互に開閉する液晶シャッタメガネを用いて画像を見る必要がある。
【００９３】
次に、メインエリア１０２に表示されている立体視画像を、ユーザの操作に対応して変換する処理について、図１９のフローチャートを参照して説明する。
【００９４】
この立体視画像変換処理は、立体視画像が生成され、図１８に示されたようなウィンドウ１０１が表示されたときに開始される。
【００９５】
ステップＳ３１において、ＧＵＩブロック７１は、「ＬＲ置換」ボタン１０５が押下されたか否かを判定する。「ＬＲ置換」ボタン１０５が押下されたと判定された場合、処理はステップＳ３２に進む。ステップＳ３２において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、現在生成されている立体視画像の元となったＬ画像とＲ画像を置換して、立体視画像を再生成する。表示制御ブロック７６は、再生成された立体視画像を、メインエリア１０２に表示させる。この後、処理はステップＳ４３に進む。
【００９６】
ステップＳ３１において、「ＬＲ置換」ボタン１０５が押下されていないと判定された場合、処理はステップＳ３３に進む。ステップＳ３３において、ＧＵＩブロック７１は、「アナグリフ」ボタン１３１が押下されたか否かを判定する。「アナグリフ」ボタン１３１が押下されたと判定された場合、処理はステップＳ３４に進む。ステップＳ３４において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、現在生成されている立体視画像の元となったＬ画像とＲ画像を用い、アナグリフ画像を生成する。表示制御ブロック７６は、生成されたアナグリフ画像をメインエリア１０２に表示させる。この後、処理はステップＳ４５に進む。
【００９７】
ステップＳ３３において、「アナグリフ」ボタン１３１が押下されていないと判定された場合、処理はステップＳ３５に進む。ステップＳ３５において、ＧＵＩブロック７１は、「カラーアナグリフ１」ボタン１３２が押下されたか否かを判定する。「カラーアナグリフ１」ボタン１３２が押下されたと判定された場合、処理はステップＳ３６に進む。ステップＳ３６において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、現在生成されている立体視画像の元となったＬ画像とＲ画像を用い、第１のカラーアナグリフ画像を生成する。表示制御ブロック７６は、生成された第１のカラーアナグリフ画像をメインエリア１０２に表示させる。この後、処理はステップＳ４５に進む。
【００９８】
ステップＳ３５において、「カラーアナグリフ１」ボタン１３２が押下されていないと判定された場合、処理はステップＳ３７に進む。ステップＳ３７において、ＧＵＩブロック７１は、「カラーアナグリフ２」ボタン１３３が押下されたか否かを判定する。「カラーアナグリフ２」ボタン１３３が押下されたと判定された場合、処理はステップＳ３８に進む。ステップＳ３８において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、現在生成されている立体視画像の元となったＬ画像とＲ画像を用い、第２のカラーアナグリフ画像を生成する。表示制御ブロック７６は、生成された第２のカラーアナグリフ画像をメインエリア１０２に表示させる。この後、処理はステップＳ４５に進む。
【００９９】
ステップＳ３７において、「カラーアナグリフ２」ボタン１３３が押下されていないと判定された場合、処理はステップＳ３９に進む。ステップＳ３９において、ＧＵＩブロック７１は、「レンチキュラ」ボタン１３４が押下されたか否かを判定する。「レンチキュラ」ボタン１３４が押下されたと判定された場合、処理はステップＳ４０に進む。ステップＳ４０において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、現在生成されている立体視画像の元となったＬ画像とＲ画像を用い、レンチキュラ画像を生成する。表示制御ブロック７６は、生成されたレンチキュラ画像をメインエリア１０２に表示させる。この後、処理はステップＳ４５に進む。
【０１００】
ステップＳ３９において、「レンチキュラ」ボタン１３４が押下されていないと判定された場合、処理はステップＳ４１に進む。ステップＳ４１において、ＧＵＩブロック７１は、「液晶シャッタメガネ」ボタン１３５が押下されたか否かを判定する。「液晶シャッタメガネ」ボタン１３５が押下されたと判定された場合、処理はステップＳ４２に進む。ステップＳ４２において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、現在生成されている立体視画像の元となったＬ画像とＲ画像を表示制御ブロック７６に供給する。表示制御ブロック７６は、ＧＵＩブロック７１からの制御に従い、Ｌ画像とＲ画像を、表示部５７のフィールド周期に同期して交互にメインエリア１０２に表示させる。この後、処理はステップＳ４３に進む。
【０１０１】
ステップＳ４３において、ＧＵＩブロック７１は、何らかのボタンが押下されるまで待機する。何らかのボタンが押下されたと判定された場合、処理はステップＳ４４に進む。ステップＳ４４において、ＧＵＩブロック７１は、「エンド」ボタン１０７が押下されたか否かを判定する。「エンド」ボタン１０７が押下されたと判定された場合、立体視画像変換処理は終了され、さらに、実行されている画像処理プログラム６５も終了される。
【０１０２】
ステップＳ４４において、「エンド」ボタン１０７が押下されてないと判定された場合、ステップＳ３１に戻り、それ以降の処理が繰り返される。
【０１０３】
ステップＳ３４，Ｓ３６，Ｓ３８、またはＳ４０の処理の後に実行されるステップＳ４５において、ＧＵＩブロック７１は、何らかのボタンが押下されるまで待機する。何らかのボタンが押下されたと判定された場合、処理はステップＳ４６に進む。ステップＳ４６において、ＧＵＩブロック７１は、「印刷」ボタン１０６が押下されたか否かを判定する。「印刷」ボタン１０６が押下されたと判定された場合、処理はステップＳ４７に進む。ステップＳ４７において、表示制御ブロック７６は、ＧＵＩブロック７１からの制御に従い、メインエリア１０２に表示されているアナグリフ画像、第１のカラーアナグリフ画像、第２のアナグリフ画像、またはレンチキュラ画像の画像データをプリンタ（不図示）に出力して印刷させる。
【０１０４】
印刷されたアナグリフ画像、第１のカラーアナグリフ画像、または第２のアナグリフ画像は、赤青メガネを用いることにより、立体的に視認することができる。印刷されたレンチキュラ画像は、レンチキュラパネルを介することにより、立体的に視認することができる。以上で、立体視画像変換処理の説明を終了する。
【０１０５】
次に、図２０は、「テキスト入力」ボタン１３６が押下されたときのウィンドウ１０１の表示例を示している。メインエリア１０２の上側には、メインエリア１０２に表示されている立体視画像に重畳して、任意の文字、記号等を表示させるためのテキスト入力エリア１４１が設けられている。テキスト入力エリア１４１に入力される文字等は、立体的に視認されるように立体視画像に重畳される。ユーザは、マウス（操作入力部５６）を用いてドラッグアンドドロップすることにより、テキスト入力エリア１４１を任意の位置に移動することができる。
【０１０６】
メインエリア１０２の上側には、テキスト入力エリア１４１に入力された文字等の立体視したときの遠近感（奥行き感）を近づけるときに押下される「近く」ボタン１４２、およびテキスト入力エリア１４１に入力される文字等の立体視したときの遠近感を遠ざけるときに押下される「遠く」ボタン１４３が設けられている。
【０１０７】
ここで、テキスト入力処理について、図２１のフローチャートを参照して説明する。このテキスト入力処理は、「テキスト入力」ボタン１３６が押下されたときに開始される。
【０１０８】
ステップＳ５１において、表示制御ブロック７６は、メインエリア１０２に表示されている立体視画像に重畳して、テキスト入力エリア１４１を表示する。ユーザがテキスト入力エリア１４１に任意の文字等を入力すると、立体視画像生成ブロック７５は、入力された文字等が所定の遠近感で立体的に視認されるように、立体視画像に文字等を重畳する。
【０１０９】
ステップＳ５２において、ＧＵＩブロック７１は、「近く」ボタン１４２が押下されたか否かを判定する。「近く」ボタン１４２が押下されたと判定された場合、処理はステップＳ５３に進む。ステップＳ５３において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、入力された文字等の遠近感がより近くで立体的に視認されるように、立体視画像に文字等を重畳する。なお、ステップＳ５２において、「近く」ボタン１４２が押下されていないと判定された場合、ステップＳ５３の処理はスキップされる。
【０１１０】
ステップＳ５４において、ＧＵＩブロック７１は、「遠く」ボタン１４３が押下されたか否かを判定する。「遠く」ボタン１４３が押下されたと判定された場合、処理はステップＳ５５に進む。ステップＳ５５において、立体視画像生成ブロック７５は、ＧＵＩブロック７１からの制御に従い、入力された文字等の遠近感がより遠くで立体的に視認されるように、立体視画像等に文字等を重畳する。なお、ステップＳ５４において、「遠く」ボタン１４３が押下されていないと判定された場合、ステップＳ５５の処理はスキップされる。
【０１１１】
ステップＳ５６において、ＧＵＩブロック７１は、「テキスト入力」ボタン１３６４２が再び押下されたか否かを判定する。「テキスト入力」ボタン１３６が再び押下されていないと判定された場合、ステップＳ５２に戻り、それ以降の処理が繰り返される。
【０１１２】
ステップＳ５６において、「テキスト入力」ボタン１３６４２が再び押下されたと判定された場合、処理はステップＳ５７に進む。ステップＳ５７において、立体視画像生成ブロック７５は、入力された文字等のテキスト情報、文字等の立体視画像上の座標情報、文字等の立体視における遠近の情報を画像管理ブロック７２に出力する。画像管理ブロック７２は、立体視画像の画像データに対応付けて記録する。以上で、テキスト入力処理の説明を終了する。
【０１１３】
なお、本明細書において、記録媒体に記録されるプログラムを記述するステップは、記載された順序に従って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。
【０１１４】
また、本明細書において、システムとは、複数の装置により構成される装置全体を表すものである。
【０１１５】
【発明の効果】
以上のように、本発明によれば、立体的に視覚できる画像に任意の文字や記号等を書き込むことができ、さらに、書き込んだ文字なども立体的に視覚することが可能となる。
【図面の簡単な説明】
【図１】光学アダプタをカメラに取り付けた状態を示す図である。
【図２】図１の光学アダプタの構成例を示す図である。
【図３】光学アダプタ取り付けた状態のカメラによって撮影される視差画像を示す図である。
【図４】本発明を適用した立体視システムの構成例を示す図である。
【図５】ミラー画像とスルー画像から構成される視差画像を示す図である。
【図６】ミラー画像とスルー画像からトリミングされるＬ画像とＲ画像を示す図である。
【図７】Ｌ画像とＲ画像を合成して立体視画像生成する処理を説明するための図である。
【図８】立体視画像を示す図である。
【図９】立体視画像を立体的に視認する概要を説明するための図である。
【図１０】図４のパーソナルコンピュータの構成例を示すブロック図である。
【図１１】図１０のＣＰＵが画像処理プログラムを実行すること実現する機能ブロックの構成例を示す図である。
【図１２】画像処理プログラムに対応するウィンドウの表示例を示す図である。
【図１３】図１２の「画像取得」ボタンが押下されたときのウィンドウの表示例を示す図である。
【図１４】立体視画像生成処理を説明するフローチャートである。
【図１５】画像対自動選択処理を説明するフローチャートである。
【図１６】Ｌ画像とＲ画像の上下方向の位置調整処理を説明するフローチャートである。
【図１７】Ｌ画像とＲ画像の上下方向の位置調整処理におけるウィンドウの表示例を示す図である。
【図１８】立体視画像が生成されたときのウィンドウの表示例を示す図である。
【図１９】立体視画像変換処理を説明するフローチャートである。
【図２０】図１８の「テキスト入力」ボタンが押下されたときのウィンドウの表示例を示す図である。
【図２１】テキスト入力処理を説明するフローチャートである。
【符号の説明】
１　ディジタルスチルカメラ，　１１　光学アダプタ，　３１　パーソナルコンピュータ，　４１　フィルタメガネ，　４３　ライン偏光板，　５１　ＣＰＵ，６１　磁気ディスク，　６２　光ディスク，　６３　光磁気ディスク，　６４半導体メモリ，　６５　画像処理プログラム，　７１　ＧＵＩブロック，　７２画像管理ブロック，　７３　画像取得ブロック，　７４　ベース画像選択ブロック，　７５　立体視画像生成ブロック，　７６　表示制御ブロック[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to an image processing apparatus and method, a recording medium, and a program, for example, an image processing apparatus and a suitable image processing apparatus suitable for use in generating a stereoscopic image based on an image for the left eye and an image for the right eye The present invention relates to a method, a recording medium, and a program.
[0002]
[Prior art]
Conventionally, there is a method of stereoscopically viewing a two-dimensional image using spatial displacement (binocular disparity) of retinal images obtained by the left and right eyes of a person (hereinafter, referred to as a stereoscopic method). Many are known.
[0003]
Known stereoscopic methods include an anaglyph method using special glasses, a color anaglyph method, a polarizing filter method, a time-division stereoscopic television method, and a lenticular method using no special glasses. For example, see Non-Patent Document 1).
[0004]
In the anaglyph method, for example, an image for the left eye (hereinafter, described as an L image) is converted into a red monotone image, and an image for the right eye (hereinafter, described as an R image) is converted into a blue monotone image. And superimpose the red monotone image and the blue monotone image. Then, the image is viewed using glasses with red filters on the left and blue filters on the right (hereinafter referred to as red-blue glasses). The anaglyph method is relatively easy and can be implemented at low cost, but the whole image becomes monotone.
[0005]
The color anaglyph method compensates for the disadvantages of the anaglyph method, and when the L image and the R image are superimposed, the corresponding portion of each image is shifted (that is, the portion where the parallax is large). Similarly, for example, the L image is converted to a red monotone image, and the R image is converted to a blue monotone image and superimposed. A portion where the corresponding portion of each image does not shift (that is, a portion where parallax is small) is superimposed in an original color state. Then, the image is viewed using red-blue glasses.
[0006]
In the color anaglyph method, an original color can be reproduced for a portion having a small parallax in the entire image. Note that the color anaglyph method has a plurality of variations depending on the color used for a portion having a small parallax. Hereinafter, a method of using an original color for a portion having a small parallax is referred to as a first color anaglyph method. In addition, a method in which an original color is not used in a portion where parallax is small is referred to as a second color anaglyph method.
[0007]
In the polarization filter method, for example, an L image projected by linearly polarized light in the vertical direction and an R image projected by linearly polarized light in the horizontal direction are superimposed. Then, the image is viewed using polarizing filter glasses in which a vertical linear polarization filter is arranged on the left side and a horizontal linear polarization filter is arranged on the right side. The polarizing filter method has advantages in that color reproducibility is good and resolution is high, but there is a disadvantage that an image is darkened by using a polarizing filter.
[0008]
In the time-division stereoscopic television system, an L image and an R image are alternately displayed on a television receiver every field cycle, and the video is synchronized with the field cycle of the television receiver on the left eye side. And the liquid crystal shutter glasses that alternately open and close the right eye side and the right eye side. In the time-division stereoscopic television system, it is important to control the opening and closing operation of the liquid crystal shutter glasses with high accuracy.
[0009]
The lenticular method is a method in which a screen is divided into vertical stripe-shaped regions, an L image and an R image are alternately displayed on the regions on each stripe, and the image is covered with a lens called a lenticular screen. is there.
[0010]
By the way, in order to realize the various stereoscopic viewing methods described above, it is necessary to acquire an L image and an R image. In order to acquire the L image and the R image, it is easiest to take the same subject twice by moving the camera position by the distance between both eyes of a person.
[0011]
As a method of acquiring the L image and the R image in one photographing, for example, as shown in FIG. 1, a method of attaching an optical adapter 11 composed of a mirror or the like to the outside of the photographing lens 3 of the camera 1 is known. It is known (for example, see Patent Document 1).
[0012]
FIG. 2 schematically illustrates the structure of the optical adapter 11. The optical image for the right eye that enters from a single lighting window is reflected by the mirror 21 toward the mirror 22, reflected by the mirror 22 toward the photographing lens 3, and then condensed by the photographing lens 3. . The optical image for the left eye incident from a single lighting window is condensed by the photographing lens 3 without being reflected by the mirrors 21 and 22.
[0013]
The optical image incident via the optical adapter 11 is captured as an image (hereinafter, referred to as a parallax image) including a left-eye area and a right-eye area, as shown in FIG. The area for the left eye is used as an L image, and the area for the right eye is used as an R image.
[0014]
[Non-patent document 1]
Supervised by Takehiro Izumi, edited by NHK Science and Technical Research Laboratories, "Basics of 3D Video" Ohm Publishing, published June 5, 1995 [Patent Document 1]
JP-A-11-46373
[Problems to be solved by the invention]
By the way, if arbitrary characters and symbols are written on an image which can be viewed three-dimensionally, and if the written characters and the like can be viewed three-dimensionally, the user's playability increases, but such a technique has been conventionally available. There were no challenges.
[0016]
The present invention has been made in view of such circumstances, and has as its object to write arbitrary characters, symbols, and the like on a stereoscopically visible image so that the written characters and the like can be stereoscopically viewed. .
[0017]
[Means for Solving the Problems]
An image processing apparatus according to the present invention includes a generation unit configured to generate image data for displaying a stereoscopic image based on an image for a left eye and an image for a right eye, and a reception unit that receives the write information received by the reception unit. And a synthesizing unit for adding an image corresponding to the writing information to the stereoscopic image by adding depth.
[0018]
The writing information may include at least one of a character, a symbol, and a line drawing.
[0019]
The image processing apparatus according to the present invention can further include a change unit that changes the depth of the image corresponding to the write information.
[0020]
The image processing apparatus of the present invention further includes a storage unit that stores the image data generated by the generation unit, the write information received by the reception unit, and the depth given to the image corresponding to the write information in association with each other. Can be.
[0021]
The image processing method according to the present invention includes a generation step of generating image data for displaying a stereoscopic image based on an image for the left eye and an image for the right eye, and writing information received in the processing of the reception step. It is characterized by including a receiving step of receiving and a synthesizing step of adding an image corresponding to the writing information to a stereoscopic image by adding depth.
[0022]
The recording medium program according to the present invention includes a generation step of generating image data for displaying a stereoscopic image based on a left-eye image and a right-eye image, and writing information received in the processing of the reception step. A receiving step of receiving an image corresponding to the writing information and a combining step of adding a depth to the image and combining it with a stereoscopic image.
[0023]
A program of the present invention includes a generation step of generating image data for displaying a stereoscopic image based on a left-eye image and a right-eye image, and a reception step of receiving the write information received in the processing of the reception step. The present invention is characterized by causing a computer to execute a process including a step and a synthesizing step of synthesizing an image corresponding to the writing information with a stereoscopic image by adding depth.
[0024]
In the image processing apparatus and method and the program according to the present invention, image data for displaying a stereoscopic image based on a left-eye image and a right-eye image is generated, and an image corresponding to the received writing information is generated. Is added to the depth and is synthesized with the stereoscopic image.
[0025]
BEST MODE FOR CARRYING OUT THE INVENTION
FIG. 4 shows a configuration example of a stereoscopic vision system to which the present invention is applied. The stereoscopic system mainly includes a personal computer (PC) 31 for generating a stereoscopic image, filter glasses 41 used by a user when viewing the displayed stereoscopic image, and display on a display unit 57 of the personal computer 31. It is composed of a linearly polarized light 43 disposed outside the plane.
[0026]
As shown in FIG. 1, the personal computer 31 has a parallax image captured by the camera (digital still camera) 1 with the optical adapter 11 mounted, and a parallax image captured with the optical adapter 11 not mounted. An image or the like is captured, and a stereoscopic image is generated and displayed based on a parallax image or an image pair including two images captured continuously. Note that image data such as a parallax image taken into the personal computer 31 from the digital still camera 1 includes a date and time of shooting, a serial file number indicating the order of shooting, and a serial number indicating whether or not shooting was performed in the continuous shooting mode. Attribute information such as a shooting mode flag is provided.
[0027]
The filter glasses 41 are supported by a support rod 42 connected to the personal computer 31 so as to be located in an upper space near a keyboard of the personal computer 31. In the left frame of the filter glasses 41, a vertical linear polarization filter is arranged. In the right frame, a horizontal linear polarization filter is arranged.
[0028]
The line polarizer 43 disposed outside the display surface of the display unit 57 of the personal computer 31 has a vertical linear polarization filter disposed on even horizontal lines, and a horizontal linear polarization filter disposed on odd horizontal lines. I have.
[0029]
The filter glasses 41, the support bar 42, and the line polarizer 43 are sold as a set, for example, with the optical adapter 11 of FIG. 1, an image processing program 65 (described later with reference to FIG. 10), and the like.
[0030]
Next, an outline of a process of generating a stereoscopic image by the personal computer 31 will be described with reference to FIGS.
[0031]
FIG. 5 shows a parallax image captured by the digital still camera 1 to which the optical adapter 11 is attached. In the figure, it is assumed that the right eye area of the parallax image is reflected by the mirror, and the left eye area is not reflected by the mirror. Therefore, in the following, the region for the right eye of the parallax image is described as a mirror image, and the region for the left eye is described as a through image.
[0032]
As described above, the mirror image has deteriorated image quality (luminance, saturation, resolution, etc.) as compared with the through image. In particular, the peripheral portion (upper side, lower side, and left side) of the mirror image has lower luminance than the central portion. Further, the mirror image is an image which is originally rectangular and is distorted into a trapezoid. Therefore, first, the image quality of the mirror image is corrected (the details will be described later with reference to FIG. 14), and the distortion of the image shape is corrected. Next, as shown in FIG. 6, the mirror image and the through image are trimmed, respectively, to generate an R image and an L image.
[0033]
Next, as shown in FIG. 7, the L image and the R image are combined according to the following equation (1) to generate a stereoscopic image as shown in FIG.

[0034]
This is the end of the description of the outline of the process of generating a stereoscopic image. The generated stereoscopic image is displayed on the display unit 57. As shown in FIG. 9, the user views the stereoscopic image displayed on the display unit 57 via the filter glasses 41 and the line polarizer 43. Therefore, the left eye of the user sees the even-numbered line of the stereoscopic image, that is, the L image of every other line, and the right eye of the user sees the odd-numbered line of the stereoscopic image, that is, the R image of every other line. You will see. Therefore, the user can stereoscopically view the stereoscopic image.
[0035]
Next, FIG. 10 illustrates a configuration example of a personal computer 31 that executes a process of generating a stereoscopic image by executing the image processing program 65.
[0036]
The personal computer 31 has a built-in CPU (Central Processing Unit) 51. An input / output interface 55 is connected to the CPU 51 via a bus 54. The bus 54 includes a ROM (Read Only Memory) 52 and a RAM (Random Access).
Memory 53 is connected.
[0037]
The input / output interface 55 includes an operation input unit 56 including an input device such as a keyboard and a mouse for inputting operation commands by a user, an LCD (Liquid Crystal Display) for displaying a GUI (Graphical User Interface), a generated stereoscopic image, and the like. ), A storage unit 58 such as a hard disk drive for storing various programs and data, and a communication unit 59 for communicating data via a network such as the Internet. The input / output interface 55 is connected to a drive 60 that reads and writes data from and to a recording medium such as a magnetic disk 61, an optical disk 62, a magneto-optical disk 63, and a semiconductor memory 64.
[0038]
The CPU 51 executes various processes according to a program stored in the ROM 52 or a program read from the magnetic disk 61 to the semiconductor memory 64 and stored in the storage unit 65 and loaded from the storage unit 65 into the RAM 53. The RAM 53 also appropriately stores data necessary for the CPU 51 to execute various processes.
[0039]
An image processing program 65 for causing this personal computer to execute processing for generating a stereoscopic image includes a magnetic disk 61 (including a flexible disk), an optical disk 62 (Compact Disc-Read Only Memory), and a DVD (Digital Versatile). Disc), a magneto-optical disk 63 (including an MD (Mini Disc)), or a semiconductor memory 64 and supplied to the personal computer 31, read by the drive 60, and built into the storage unit 58. Installed on the hard disk drive. The image processing program 65 installed in the storage unit 58 is loaded from the storage unit 58 to the RAM 53 and executed by a command from the CPU 51 corresponding to a command from the user input to the operation input unit 56.
[0040]
FIG. 11 illustrates a configuration example of a functional block realized by the CPU 51 executing the image processing program.
[0041]
The GUI block 71 controls the image management unit 72 to the display control unit 76 in response to a user operation on the GUI. The image management block 72 manages image data such as a parallax image captured from the digital camera 1 and stored in the storage unit 58 or the like as data handled by the image processing program 65 under the control of the GUI block 71. Further, the image management block 72 shows, in the attribute information added to the parallax image, image data of a thumbnail image (reduced image), information indicating a position of a region to be trimmed, and a set value when image quality is corrected. Information, information indicating an adjustment value when the vertical position is adjusted, and the like are added.
[0042]
The image acquisition block 73 acquires image data such as a parallax image managed by the image management block 72 under the control of the GUI block 71, and outputs the image data to the base image selection block 74. The base image selection block 74 converts the parallax image or the image pair selected by the user from the images input from the image processing block 73 according to the control from the GUI block 71 corresponding to the operation of the user, by the stereoscopic image generation block 75. Output to Further, the base image selection block 74 selects an image pair that can be a source of a stereoscopic image from among the images input from the image processing block 73 in accordance with the control from the GUI block 71 corresponding to the operation of the user. Then, the image is output to the stereoscopic image generation block 75.
[0043]
The stereoscopic image generation block 75 generates a stereoscopic image based on the parallax image or image pair input from the base image selection block 74 and outputs the stereoscopic image to the display control block 76 under the control of the GUI block 71. The display control block 76 controls the display of the GUI and the generated stereoscopic image according to the control from the GUI block 71.
[0044]
Next, a stereoscopic image generation process for generating a stereoscopic image based on a parallax image will be described. Before that, a window corresponding to the image processing program 65 and a GUI on the window will be described with reference to FIGS. This will be described with reference to FIG.
[0045]
FIG. 12 shows a display example of a window 101 displayed when the image processing program 65 is started. The window 101 is provided with a main area 102 on which a thumbnail of an image to be processed is displayed, and a “stereoscopic” button 104 to an “end” button 107.
[0046]
An “acquire image” button 103 is pressed when a thumbnail for selecting a parallax image and an image pair as a source of a stereoscopic image is displayed in the main area 102. The “stereoscopic image” button 104 is pressed to start generation of a stereoscopic image based on the selected stereoscopic image (or image pair). The “LR replacement” button 105 is pressed when replacing the L image and the R image that are the basis of the stereoscopic image. A “print” button 106 is pressed to print out the generated stereoscopic image or the like. The “end” button 107 is pressed to end the image processing program.
[0047]
FIG. 13 shows a display example of the window 101 when the “acquire image” button 103 is pressed. In the main area 102, thumbnail images such as a parallax image captured from the digital camera 1 and a normal image (including a pair of continuously captured images) are displayed. Above the main area 102, a “parallax image selection” button 111, an “image pair selection” button 112, and an “image pair automatic selection” button 113 are provided.
[0048]
The “select parallax image” button 111 is pressed to select a thumbnail image displayed in the main area 102 that corresponds to the parallax image. That is, after pressing the “parallax image selection” button 111, the user can select only one thumbnail corresponding to the parallax image.
[0049]
The “select image pair” button 112 is pressed to select an image pair corresponding to an image pair that is a source of a stereoscopic image from among the thumbnail images displayed in the main area 102. That is, after the user presses the “select image pair” button 112, the user can select two thumbnails corresponding to the image pair.
[0050]
The “automatic image pair selection” button 113 is pressed to automatically select an image pair as a source of a stereoscopic image from the thumbnail images displayed in the main area 102.
[0051]
Next, a stereoscopic image generation process for generating a stereoscopic image based on a parallax image will be described with reference to the flowchart in FIG. This stereoscopic image generation processing is started when the “stereoscopic image” button 104 is pressed after the user selects the thumbnail corresponding to the parallax image from the thumbnails in the main area 102 shown in FIG. Is done.
[0052]
In step S1, the base image selection block 74 outputs the image data of the parallax image selected by the user to the stereoscopic image generation block 75 according to the control of the GUI block 71 among the image data input from the image processing block 73. I do.
[0053]
In step S2, the stereoscopic image generation block 75 corrects the luminance of the peripheral portion of the mirror image using a lookup table (or function) based on an experiment result. Information on this correction is output to the image management block 72, and is added to the attribute information of the parallax image by the image management block 72.
[0054]
Here, the experiment was performed by the developer of the image processing program 65. More specifically, the digital camera 1 with the optical adapter 11 mounted thereon photographs a white wall or the like occupying the entire angle of view, and the brightness of the pixels at the center and the periphery of the mirror image in the obtained parallax image. To compare. Then, on the basis of the comparison result, for example, a lookup is performed such that the coordinates of the peripheral portion are input and a correction value for the coordinates is output so that the luminance of the peripheral portion coincides with the luminance of the central portion. Create a table (may be a function). Alternatively, a look-up table is generated in which the luminance of the peripheral portion is input and a correction value for the luminance is output. Then, the generated lookup table may be incorporated in the image processing program 65. A plurality of look-up tables may be incorporated in the image processing program 65 according to the model of the digital camera 1.
[0055]
In step S2, the luminance of the through image may be corrected.
[0056]
In step S2, the stereoscopic image generation block 75 corrects the shape of the mirror image distorted into a trapezoid. The correction of the shape of the mirror image has already been proposed by the present applicant as Japanese Patent Application Laid-Open No. 2002-34054, and the description thereof will be omitted.
[0057]
In step S3, the stereoscopic image generation block 75 compares the overall brightness of the corrected mirror image and the through image, and corrects the brightness of the mirror image based on the comparison result. Specifically, in each of the mirror image and the through-the-lens image, the luminances of a plurality of (for example, four) sampling points are added, and the sum of the luminance of the mirror image and the luminance of the through-the-lens image are compared. Then, the mirror image is corrected so that the difference disappears. Information on this correction is also output to the image management block 72, and is added to the attribute information of the parallax image by the image management block 72.
[0058]
For example, if the sum of the luminances of the four sampling points in the mirror image is 350 and the sum of the luminances of the four sampling points of the through image is 500, a value obtained by dividing the difference 150 by the number of sampling points (150/150) 4) is added to the luminance of all pixels of the mirror image.
[0059]
In step S3, the stereoscopic image generation block 75 corrects the color difference between the mirror image and the mirror image so that the overall saturation of the mirror image and the through image match. Information on this correction is also output to the image management block 72, and is added to the attribute information of the parallax image by the image management block 72.
[0060]
In step S4, the stereoscopic image generation block 75 performs a predetermined edge enhancement process on the mirror image to correct the overall blur of the image. In step S5, the stereoscopic image generation block 75 trims the mirror image and the through image according to the control from the GUI block 71 corresponding to the user's operation, and sets each of them as an L image and an R image. The information on the trimming position is output to the image management block 72, and is added to the attribute information of the parallax image by the image management block 72.
[0061]
In step S6, the stereoscopic image generation block 75 adjusts the vertical position of the L image and the R image in accordance with the control from the GUI block 71 corresponding to the operation of the user. Note that information on the process of adjusting the vertical position of the L image and the R image is output to the display control block 76, and the state of the adjustment is displayed in the main area 102 (for details, see the flowchart in FIG. 16). And will be described later).
[0062]
In step S7, the stereoscopic image generation block 75 combines the L image and the R image whose positions in the vertical direction have been adjusted in the process of step S6 in accordance with Expression (1), and generates a stereoscopic image. The generated stereoscopic image is displayed in the main area 102. The image data of the generated stereoscopic image is stored by the image management block 72 in association with the original parallax image.
[0063]
In addition, it is also possible to display the generated stereoscopic image so as to occupy the entire display unit 57 by a predetermined operation from the user. This is the end of the description of the stereoscopic image generation processing.
[0064]
By the way, according to the image processing program 65, it is possible to generate a stereoscopic image based on an image pair composed of two images. This image pair is arbitrarily selected by the user by pressing the “select image pair” button 112 shown in FIG. 13 and selecting two of the thumbnail images displayed in the main area 102. be able to.
[0065]
When the “stereoscopic image” button 104 is pressed after the image pair is selected, one of the image pairs is set to the L image and the other is set to the R image, and the above-described processing after step S6 is started. , A stereoscopic image is generated. The image data of the generated stereoscopic image is stored by the image management block 72 in association with the original image pair.
[0066]
It should be noted that an image pair serving as a source of a stereoscopic image can be automatically selected. Processing for automatically selecting an image pair will be described with reference to the flowchart in FIG. This image pair automatic selection process is started when the “image pair automatic selection” 113 shown in FIG. 13 is pressed. In addition, this image pair automatic selection processing is executed by processing the images input from the image processing block 73 to the base image selection block 74 one by one.
[0067]
In step S11, the base image selection block 74 determines whether or not the processing target image has been captured in the continuous shooting mode with reference to the continuous shooting flag included in the attribute information. If it is determined that the image to be processed has not been captured in the continuous shooting mode, the process proceeds to step S12.
[0068]
In step S12, the base image selection block 74 determines whether or not the difference between the shooting date and time of the image to be processed and the image shot one image after is within a predetermined threshold (for example, several seconds). If it is determined that the difference between the shooting dates and times is within the predetermined threshold, the process proceeds to step S13.
[0069]
In step S13, the base image selection block 74 calculates the similarity between the image to be processed and the image captured one image after, and determines whether or not the similarity is within a predetermined threshold. Here, as the similarity between two images, for example, a predetermined one-line pixel is extracted from each image, and the sum of the differences between the corresponding pixels is calculated.
[0070]
If it is determined that the similarity between the image to be processed and the image captured one image later is within a predetermined threshold, the image to be processed and the image captured one image later are the same subject continuously. Therefore, the image to be processed and the image captured one image after are set as an image pair.
[0071]
The thumbnails of the two images set in the image pair are surrounded by, for example, a single thick frame in the main area 102 to notify the user that the image pair has been set. The information of the image pair is output to the image management block 72, and information (serial number and the like) indicating the partner of the image pair is added to the additional information of each image data of the image pair. Alternatively, a new folder may be created and stored for two image data forming an image pair and their respective attribute information.
[0072]
If it is determined in step S11 that the image to be processed has been shot in the continuous shooting mode, step S12 is skipped and the process proceeds to step S13.
[0073]
If it is determined in step S12 that the difference between the shooting date and time of the image to be processed and the image taken one image after is not within a predetermined threshold, the two images are used as the source of the stereoscopic image. Since the image is not suitable for the image pair, the image pair automatic selection process ends.
[0074]
In step S13, even when it is determined that the similarity between the image to be processed and the image captured one image later is not within the predetermined threshold, the two images are used as the source of the stereoscopic image. Since the image is not suitable for the image pair, the image pair automatic selection process ends. This concludes the description of the image pair automatic selection process.
[0075]
When this image pair automatic selection process is performed on all images input from the image processing block 73 to the base image selection block 74 as a processing target, and when a plurality of image pairs are set, the user is set. One image pair can be selected from a plurality of image pairs. When the “stereoscopic image” button 104 is pressed after the image pair is selected by the user, the above-described processing in step S6 and thereafter is started, and a stereoscopic image is generated. The image data of the generated stereoscopic image is stored by the image management block 72 in association with the original image pair.
[0076]
However, when a stereoscopic image is generated based on an image pair, if one of the image pairs is set to be an L image and the other is set to be an R image (if the setting is not reversed), the image is viewed stereoscopically. I can't. At this time, if the user presses the “LR replacement” button 105, the L image and the R image are replaced, and a stereoscopic image that can be viewed stereoscopically is regenerated.
[0077]
Depressing the “LR replacement” button 105 to regenerate a stereoscopic image means that the line polarizer 43 is vertically shifted by one line with respect to the displayed stereoscopic image. It also works effectively.
[0078]
Next, the details of the process of step S6 in FIG. 14, that is, the process of adjusting the vertical position of the L image and the R image will be described with reference to the flowchart of FIG. 16 and FIG.
[0079]
In step S21, the stereoscopic image generation block 75 acquires from the image management block 72 the thumbnail images of the L image and the R image set in the process of step S6. In step S22, the stereoscopic image generation block 75 adds the pixels at the same coordinates of the thumbnail image of the L image and the thumbnail image of the R image by 50% to generate a composite image.
[0080]
At this time, the display of the window 101 is, for example, as shown in FIG. That is, the main area 102 is provided with an L image display area 121 on which an L image is displayed, an R image display area 122 on which an R image is displayed, and a composite image display area 123 on which a composite image is displayed.
[0081]
Above the main area 102, an “upward R image” button 124 for operating the relative position of the R image with respect to the L image upward to regenerate the composite image is displayed. Move down R image button 125 operated when moving the current position downward to regenerate the synthesized image, and the "position operated when the adjustment of the relative position of the R image with respect to the L image is completed. An "end adjustment" button 126 is provided.
[0082]
It returns to FIG. In step S23, the GUI block 71 determines whether the “R image upward move” button 124 or the “R image downward move” button 125 has been pressed. If it is determined that the “R image upward move” button 124 or the “R image downward move” button 125 has been pressed, the process proceeds to step S24. In step S24, the stereoscopic image generation block 75 regenerates the composite image by moving the relative position of the R image with respect to the L image upward or downward based on the control from the GUI block 71 according to the operation of the user. .
[0083]
At this time, the display of the composite image display area 123 is updated to the regenerated composite image. If it is determined in step S23 that the “R image upward move” button 124 and the “R image downward move” button 125 have not been pressed, step S24 is skipped and the process proceeds to step S25.
[0084]
In step S25, the GUI block 71 determines whether or not the “position adjustment end” button 126 has been pressed. If it is determined that the “end position adjustment” button 126 has not been pressed, the process returns to step S23, and the subsequent processing is repeated.
[0085]
Accordingly, the user presses the “move R image upward” button 124 or the “move R image downward” button 125 while looking at the composite image displayed in the composite image display area 123, so that the L image and the R image move up and down. The position in the direction can be adjusted. Since the regenerated and displayed composite image is a reduced image having a small data amount, the regenerating process is quickly executed.
[0086]
If it is determined in step S25 that the “end position adjustment” button 126 has been pressed, the process proceeds to step S26. In step S26, the stereoscopic image generation block 75 outputs the adjustment value of the vertical position of the L image and the R image to the image management block 72. The image management block 72 adds the adjustment value of the vertical position to the attribute information of the parallax image. When the image pair is an L image and an R image, the adjustment value of the vertical position is added to both pieces of attribute information of the image pair. Thereafter, the process returns to step S7 in FIG. 14 described above.
[0087]
This concludes the description of the processing for adjusting the vertical position of the L image and the R image.
[0088]
In the above description, the relative position of the R image with respect to the L image is moved upward or downward by pressing the “R image upward move” button 124 or the “R image downward move” button 125. However, for example, by dragging and dropping the combined image displayed in the combined image display area 123 upward or downward using a mouse (operation input unit 56), the relative position of the R image to the L image is reduced. The position may be moved upward or downward.
[0089]
Next, FIG. 18 illustrates a display example of the window 101 when a stereoscopic image is generated. In the main area 102, the generated stereoscopic image is displayed. On the upper side of the main area 102, the display of the main area 102 can be stereoscopically viewed from the stereoscopic image by the conventional stereoscopic method using the L image and the R image that are the basis of the stereoscopic image. An “anaglyph” button 131 to “liquid crystal shutter glasses” 135 which are pressed when converting the image into a possible image are provided. Further, on the left side of the main area 102, an “text input” button 136 which is pressed when writing an arbitrary character, symbol, or the like on the generated stereoscopic image is additionally provided.
[0090]
The “anaglyph” button 131 is pressed to generate an anaglyph image (hereinafter, referred to as an anaglyph image) using the L image and the R image that are the basis of the stereoscopic image. The “color anaglyph 1” button 132 is used to generate an image of the first color anaglyph method (hereinafter, referred to as a first color anaglyph image) using the L image and the R image that are the basis of the stereoscopic image. Is pressed. The “color anaglyph 2” button 133 is used to generate an image of the second color anaglyph method (hereinafter, referred to as a second color anaglyph image) using the L image and the R image that are the basis of the stereoscopic image. Is pressed. In these cases, the user needs to view the image using red and blue glasses.
[0091]
The “Lenticular” button 134 is pressed to generate a lenticular image (hereinafter, referred to as a first lenticular image) using the L image and the R image that are the basis of the stereoscopic image. In this case, the user needs to view the image through the lenticular screen.
[0092]
The “liquid crystal shutter glasses” button 135 is pressed to display the L image and the R image that are the basis of the stereoscopic image in a time-division stereoscopic television system. In this case, the user needs to view the image using liquid crystal shutter glasses that alternately open and close the left eye side and the right eye side in synchronization with the field cycle of the display unit 57.
[0093]
Next, a process of converting a stereoscopic image displayed in the main area 102 in response to a user operation will be described with reference to a flowchart of FIG.
[0094]
This stereoscopic image conversion processing is started when a stereoscopic image is generated and a window 101 as shown in FIG. 18 is displayed.
[0095]
In step S31, the GUI block 71 determines whether the “LR replacement” button 105 has been pressed. If it is determined that the “LR replacement” button 105 has been pressed, the process proceeds to step S32. In step S32, the stereoscopic image generation block 75 replaces the L image and the R image that are the source of the currently generated stereoscopic image and regenerates the stereoscopic image according to the control from the GUI block 71. . The display control block 76 displays the regenerated stereoscopic image in the main area 102. Thereafter, the process proceeds to step S43.
[0096]
If it is determined in step S31 that the “LR replacement” button 105 has not been pressed, the process proceeds to step S33. In step S33, the GUI block 71 determines whether the “anaglyph” button 131 has been pressed. If it is determined that the “anaglyph” button 131 has been pressed, the process proceeds to step S34. In step S34, the stereoscopic image generation block 75 generates an anaglyph image using the L image and the R image that are the source of the currently generated stereoscopic image, under the control of the GUI block 71. The display control block 76 displays the generated anaglyph image in the main area 102. Thereafter, the processing proceeds to step S45.
[0097]
If it is determined in step S33 that the “anaglyph” button 131 has not been pressed, the process proceeds to step S35. In step S35, the GUI block 71 determines whether the “color anaglyph 1” button 132 has been pressed. If it is determined that the “color anaglyph 1” button 132 has been pressed, the process proceeds to step S36. In step S36, the stereoscopic image generation block 75 generates a first color anaglyph image using the L image and the R image that are the basis of the currently generated stereoscopic image, under the control of the GUI block 71. . The display control block 76 displays the generated first color anaglyph image in the main area 102. Thereafter, the processing proceeds to step S45.
[0098]
If it is determined in step S35 that the “color anaglyph 1” button 132 has not been pressed, the process proceeds to step S37. In step S37, the GUI block 71 determines whether the “color anaglyph 2” button 133 has been pressed. If it is determined that the “color anaglyph 2” button 133 has been pressed, the process proceeds to step S38. In step S38, the stereoscopic image generation block 75 generates a second color anaglyph image using the L image and the R image that are the basis of the currently generated stereoscopic image, under the control of the GUI block 71. . The display control block 76 displays the generated second color anaglyph image in the main area 102. Thereafter, the processing proceeds to step S45.
[0099]
If it is determined in step S37 that the “color anaglyph 2” button 133 has not been pressed, the process proceeds to step S39. In step S39, the GUI block 71 determines whether the “Lenticular” button 134 has been pressed. If it is determined that the “Lenticular” button 134 has been pressed, the process proceeds to step S40. In step S40, the stereoscopic image generation block 75 generates a lenticular image using the L image and the R image that are the basis of the currently generated stereoscopic image, under the control of the GUI block 71. The display control block 76 displays the generated lenticular image in the main area 102. Thereafter, the processing proceeds to step S45.
[0100]
If it is determined in step S39 that the “lenticular” button 134 has not been pressed, the process proceeds to step S41. In step S41, the GUI block 71 determines whether the “liquid crystal shutter glasses” button 135 has been pressed. If it is determined that the “liquid crystal shutter glasses” button 135 has been pressed, the process proceeds to step S42. In step S42, the stereoscopic image generation block 75 supplies the L image and the R image that are the basis of the currently generated stereoscopic image to the display control block 76 according to the control from the GUI block 71. The display control block 76 displays the L image and the R image on the main area 102 alternately in synchronization with the field cycle of the display unit 57 according to the control from the GUI block 71. Thereafter, the process proceeds to step S43.
[0101]
In step S43, the GUI block 71 waits until any button is pressed. If it is determined that any button has been pressed, the process proceeds to step S44. In step S44, the GUI block 71 determines whether the “end” button 107 has been pressed. If it is determined that the “end” button 107 has been pressed, the stereoscopic image conversion processing ends, and the image processing program 65 being executed also ends.
[0102]
If it is determined in step S44 that the "end" button 107 has not been pressed, the process returns to step S31, and the subsequent processing is repeated.
[0103]
In step S45 executed after the processing of step S34, S36, S38 or S40, the GUI block 71 waits until any button is pressed. If it is determined that any button has been pressed, the process proceeds to step S46. In step S46, the GUI block 71 determines whether the “print” button 106 has been pressed. If it is determined that the “print” button 106 has been pressed, the process proceeds to step S47. In step S47, the display control block 76 prints the image data of the anaglyph image, the first color anaglyph image, the second anaglyph image, or the lenticular image displayed on the main area 102 according to the control from the GUI block 71. (Not shown) for printing.
[0104]
The printed anaglyph image, the first color anaglyph image, or the second anaglyph image can be viewed three-dimensionally by using red and blue glasses. The printed lenticular image can be visually recognized three-dimensionally via the lenticular panel. This is the end of the description of the stereoscopic image conversion processing.
[0105]
Next, FIG. 20 shows a display example of the window 101 when the “text input” button 136 is pressed. Above the main area 102, a text input area 141 for displaying an arbitrary character, symbol, or the like is provided so as to be superimposed on the stereoscopic image displayed in the main area 102. Characters and the like input to the text input area 141 are superimposed on a stereoscopic image so that the characters are visually recognized in a stereoscopic manner. The user can move the text input area 141 to an arbitrary position by dragging and dropping using the mouse (operation input unit 56).
[0106]
Above the main area 102, a “near” button 142 that is pressed when approaching the perspective (depth) of a character or the like input to the text input area 141 when viewed stereoscopically, and an input to the text input area 141. A “far” button 143 is provided to be pressed when the perspective of a character or the like to be stereoscopically viewed is kept away.
[0107]
Here, the text input processing will be described with reference to the flowchart in FIG. This text input processing is started when the “text input” button 136 is pressed.
[0108]
In step S51, the display control block 76 displays the text input area 141 so as to be superimposed on the stereoscopic image displayed in the main area 102. When the user inputs an arbitrary character or the like in the text input area 141, the stereoscopic image generation block 75 adds the character or the like to the stereoscopic image so that the input character or the like is stereoscopically viewed with a predetermined perspective. Superimpose.
[0109]
In step S52, the GUI block 71 determines whether the “near” button 142 has been pressed. If it is determined that the “near” button 142 has been pressed, the process proceeds to step S53. In step S53, the stereoscopic image generation block 75 superimposes a character or the like on the stereoscopic image so that the perspective of the input character or the like is closer and stereoscopically visible according to the control from the GUI block 71. . If it is determined in step S52 that the “near” button 142 has not been pressed, the process of step S53 is skipped.
[0110]
In step S54, the GUI block 71 determines whether the “far” button 143 has been pressed. If it is determined that the “far” button 143 has been pressed, the process proceeds to step S55. In step S55, the stereoscopic image generation block 75 superimposes a character or the like on the stereoscopic image or the like so that the perspective of the input character or the like is more stereoscopically viewed under the control of the GUI block 71. I do. If it is determined in step S54 that the “far” button 143 has not been pressed, the process of step S55 is skipped.
[0111]
In step S56, the GUI block 71 determines whether the “text input” button 13642 has been pressed again. If it is determined that the “text input” button 136 has not been pressed again, the process returns to step S52, and the subsequent processing is repeated.
[0112]
If it is determined in step S56 that the “text input” button 13642 has been pressed again, the process proceeds to step S57. In step S57, the stereoscopic image generation block 75 outputs to the image management block 72 the input text information such as characters, the coordinate information on the stereoscopic image such as characters, and the perspective information such as characters in stereoscopic vision. The image management block 72 records the image data in association with the image data of the stereoscopic image. This is the end of the description of the text input process.
[0113]
In this specification, the steps of describing a program recorded on a recording medium include, in addition to processing performed in chronological order according to the described order, not only chronological processing but also parallel or individual processing. This includes the processing to be executed.
[0114]
Also, in this specification, a system refers to an entire device including a plurality of devices.
[0115]
【The invention's effect】
As described above, according to the present invention, it is possible to write arbitrary characters, symbols, and the like on an image that can be viewed three-dimensionally, and further, it is possible to view the written characters and the like three-dimensionally.
[Brief description of the drawings]
FIG. 1 is a diagram showing a state where an optical adapter is attached to a camera.
FIG. 2 is a diagram illustrating a configuration example of an optical adapter in FIG. 1;
FIG. 3 is a diagram illustrating a parallax image captured by a camera with an optical adapter attached.
FIG. 4 is a diagram showing a configuration example of a stereoscopic vision system to which the present invention is applied.
FIG. 5 is a diagram illustrating a parallax image including a mirror image and a through image.
FIG. 6 is a diagram showing an L image and an R image trimmed from a mirror image and a through image.
FIG. 7 is a diagram illustrating a process of generating a stereoscopic image by combining an L image and an R image.
FIG. 8 is a diagram showing a stereoscopic image.
FIG. 9 is a diagram for describing an overview of stereoscopically viewing a stereoscopic image.
10 is a block diagram illustrating a configuration example of the personal computer in FIG.
11 is a diagram illustrating a configuration example of a functional block that is implemented when the CPU of FIG. 10 executes an image processing program.
FIG. 12 is a diagram illustrating a display example of a window corresponding to an image processing program.
FIG. 13 is a diagram showing a display example of a window when an “image acquisition” button in FIG. 12 is pressed.
FIG. 14 is a flowchart illustrating a stereoscopic image generation process.
FIG. 15 is a flowchart illustrating an image pair automatic selection process.
FIG. 16 is a flowchart illustrating a vertical position adjustment process of an L image and an R image.
FIG. 17 is a diagram illustrating a display example of a window in a vertical position adjustment process of an L image and an R image.
FIG. 18 is a diagram illustrating a display example of a window when a stereoscopic image is generated.
FIG. 19 is a flowchart illustrating a stereoscopic image conversion process.
FIG. 20 is a diagram illustrating a display example of a window when the “text input” button in FIG. 18 is pressed.
FIG. 21 is a flowchart illustrating a text input process.
[Explanation of symbols]
Reference Signs List 1 digital still camera, 11 optical adapter, 31 personal computer, 41 filter glasses, 43 line polarizing plate, 51 CPU, 61 magnetic disk, 62 optical disk, 63 magneto-optical disk, 64 semiconductor memory, 65 image processing program, 71 GUI block, 72 image management block, 73 image acquisition block, 74 base image selection block, 75 stereoscopic image generation block, 76 display control block

Claims

An image processing apparatus that generates image data for displaying a stereoscopic image based on an image for the left eye and an image for the right eye,
Generating means for generating image data for displaying the stereoscopic image based on the image for the left eye and the image for the right eye,
Receiving means for receiving the write information;
An image processing apparatus comprising: a synthesizing unit that adds depth to an image corresponding to the write information received by the receiving unit and synthesizes the image with the stereoscopic image.

The image processing apparatus according to claim 1, wherein the write information includes at least one of a character, a symbol, and a line drawing.

The image processing apparatus according to claim 1, further comprising a changing unit configured to change a depth of an image corresponding to the writing information.

A storage unit that stores the image data generated by the generation unit, the write information received by the reception unit, and a depth given to an image corresponding to the write information in association with each other. The image processing device according to claim 1.

In an image processing method of an image processing apparatus that generates image data for displaying a stereoscopic image based on an image for a left eye and an image for a right eye,
A generation step of generating image data for displaying the stereoscopic image based on the image for the left eye and the image for the right eye,
A receiving step of receiving writing information;
A combining step of adding depth to an image corresponding to the write information received in the processing of the receiving step and combining the image with the stereoscopic image.

A program for generating image data for displaying a stereoscopic image based on a left-eye image and a right-eye image,
A generation step of generating image data for displaying the stereoscopic image based on the image for the left eye and the image for the right eye,
A receiving step of receiving the write information received in the processing of the receiving step;
A combining step of adding an image corresponding to the write information to the stereoscopic image by adding depth to the stereoscopic image.

A computer that generates image data for displaying a stereoscopic image based on the image for the left eye and the image for the right eye,
A generation step of generating image data for displaying the stereoscopic image based on the image for the left eye and the image for the right eye,
A receiving step of receiving the write information received in the processing of the receiving step;
A program that executes a process including a step of adding an image corresponding to the write information to the stereoscopic image by adding depth to the stereoscopic image.