JP4904638B2

JP4904638B2 - Method and apparatus for generating three-dimensional shape data and computer program

Info

Publication number: JP4904638B2
Application number: JP2001177582A
Authority: JP
Inventors: 浩一藤原; 浩次藤原
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2001-06-12
Filing date: 2001-06-12
Publication date: 2012-03-28
Anticipated expiration: 2021-06-12
Also published as: JP2002366934A

Description

【０００１】
【発明の属する技術分野】
本発明は、３次元形状データの生成方法および装置並びにコンピュータプログラムに関する。
【０００２】
【従来の技術】
従来より、物体の３次元形状データを生成する方法の１つとして、シルエット法（Shape From Silhouette ）が知られている。
【０００３】
シルエット法は、物体を異なる視点から撮影（または撮像）して得られた複数の画像から、当該物体の３次元形状を復元する。つまり、シルエット法では、物体を複数の視線方法から撮影した画像に基づいて、それぞれの視線方向から見た物体の形状を反映する物体存在領域を３次元画像空間内のボクセルの集合として記述する。そして、すべての方向から見た物体存在領域内のボクセルの集合をその３次元空間内の物体として求める。
【０００４】
【発明が解決しようとする課題】
しかし、シルエット法では、物体の窪んだ部分または凹んだ部分がシルエット画像として表れないため、そのような凹部領域の形状を復元することができない。
【０００５】
他方、物体の３次元形状を復元する他の方法として、光切断法などにより非接触で物体を計測するレンジセンサ（３次元計測装置）を用いる方法がある。しかし、レンジセンサを用いた場合には、物体の表面の反射率が極めて低い場合にデータが欠落してしまい、その部分の３次元形状を復元できないという問題点がある。
【０００６】
本発明は、上述の問題に鑑みてなされたもので、凹部領域や低反射率の部分があった場合でも、できるだけ少ないメモリ容量で且つ少ない演算時間で正確に３次元形状を復元することを目的とする。
【０００７】
本発明の１つの形態による方法では、物体についての３次元形状データを生成する方法であって、前記物体についての視点の異なる複数の画像からシルエット法を用いてボリュームデータに変換することによって第１のボリュームデータを生成する第１のステップ、前記物体について３次元計測によって得られた３次元データをボリュームデータに変換することによって第２のボリュームデータを生成する第２のステップ、前記第１のボリュームデータと第２のボリュームデータとを統合して１つのボリュームデータとする第３のステップを有し、前記ボリュームデータは、ボリュームを表現する座標系内の離散的な各座標位置を格子点とする複数のボクセルによって構成され、前記各ボクセルの格子点において、物体の表面からの距離に応じた値が属性値として保持されており、前記属性値について、当該格子点が物体の外部にあることを示す外部（遠）、当該格子点が物体の外部にあり前記外部（遠）よりも物体の表面に近いことを示す外部（近）、当該格子点が物体の内部にあることを示す内部（遠）、および、当該格子点が物体の内部にあり前記内部（遠）よりも物体の表面に近いことを示す外部（近）の４つに分類し、前記第３のステップにおいて、前記各ボクセルの格子点について、前記第１のボリュームデータによる前記属性値と前記第２のボリュームデータによる前記属性値とに基づいて統合された属性値である統合属性値を求める。
【０００８】
好ましくは、第１のステップにおいて、シルエット法を用いてボリュームデータに変換するに際し、
（ａ）ボリュームを表現する座標系内の離散的な各座標位置において、前記物体の画像の輪郭と当該画像の視点とにより定まる視体積の境界までの距離に応じた値を求めるステップ、
（ｂ）各座標位置について、前記物体の複数の画像について求めた複数の値に基づいて物体表面からの距離に応じた値を決定し、各座標位置に対応付けて保持するステップを実行する。
【０００９】
また、第２のステップにおいて、３次元データをボリュームデータに変換するに際し、
（ａ）ボリュームを表現する座標系内の離散的な各座標位置において、前記物体の３次元データで表される物体表面までの距離に応じた値を求めるステップ、
（ｂ）求めた値を各座標位置に対応付けて保持するステップを実行する。
【００１０】
生成されたボリュームデータを逆変換することにより、物体の３次元形状が復元される。
本発明によると、３次元形状データの生成方法の相違によるそれぞれの欠点を補って欠落のない精度のよい３次元モデルが生成される。
【００１１】
【発明の実施の形態】
図１は本実施形態に係る３次元データ生成装置１のブロック図である。
図１において、３次元データ生成装置１は、装置本体１０、磁気ディスク装置１１、媒体ドライブ装置１２、ディスプレイ装置１３、キーボード１４、およびマウス１５などからなる。
【００１２】
装置本体１０は、ＣＰＵ、ＲＡＭ、ＲＯＭ、ビデオＲＡＭ、入出力ポート、および各種コントローラなどからなる。ＲＡＭおよびＲＯＭなどに記憶されたプログラムをＣＰＵが実行することにより、以下に説明する種々の機能が実現される。
【００１３】
磁気ディスク装置１１には、ＯＳ（Operating System) 、３次元モデルＭＬを生成するためのモデリングプログラムＰＲ、その他のプログラム、入力された３次元データ（３次元形状データ）ＤＴ、画像（２次元画像データ）ＦＴ、ボリュームデータＤＶ、生成された３次元モデルＭＬ、その他のデータなどが格納されている。これらのプログラムおよびデータは、適時、装置本体１０のＲＡＭにローディングされる。
【００１４】
なお、モデリングプログラムＰＲには、初期化処理、属性値設定処理、属性値決定処理、境界判定処理、分割処理、統合処理、逆変換処理、マッピング処理、およびその他の処理のためのプログラムが含まれる。
【００１５】
媒体ドライブ装置１２は、ＣＤ−ＲＯＭ（ＣＤ）、フロッピィディスクＦＤ、光磁気ディスク、コンパクトフラッシュなどの半導体メモリＨＭ、その他の記録媒体にアクセスし、データまたはプログラムの読み書きを行う。記録媒体の種類に応じて適切なドライブ装置が用いられる。上に述べたモデリングプログラムＰＲは、これら記録媒体からインストールすることも可能である。３次元データＤＴおよび画像ＦＴなども記録媒体を介して入力することが可能である。
【００１６】
ディスプレイ装置１３の表示面ＨＧには、上に述べた種々のデータ、３次元データＤＴ、画像ＦＴ、モデリングプログラムＰＲによる処理過程の画像、生成された３次元モデルＭＬ、その他のデータまたは画像が表示される。
【００１７】
キーボード１４およびマウス１５は、ディスプレイ装置１３に表示された画像ＦＴおよび３次元データＤＴに対して、ユーザが種々の指定を行うために用いられる他、装置本体１０に種々のデータを入力しまたは指令を与えるために用いられる。
【００１８】
図示は省略したが、装置本体１０には、被写体である物体を種々の視線で撮影しまたは種々の視点から撮影して画像ＦＴを入力するためのデジタルカメラを接続することが可能である。画像ＦＴから被写体である物体の輪郭を切り出すことにより、シルエット画像ＦＳが生成される。シルエット画像ＦＳに基づいて、シルエット法によって３次元データＤＴを生成することが可能である。
【００１９】
また、視差のある２枚の画像に基づいて３次元再構成を行い、３次元データＤＴを生成することも可能である。
また、装置本体１０には、物体を撮影してその３次元データＤＴを入力するための３次元入力装置（３次元計測装置）を接続することも可能である。そのような３次元入力装置は、例えば光切断法によって物体の３次元データＤＴを非接触で計測する。また、３次元入力装置から、３次元データＤＴではなく、３次元データＤＴを生成するための元となるデータを出力し、装置本体１０によって３次元データＤＴを演算によって求めてもよい。
【００２０】
このようにして得られた３次元データＤＴ、または３次元データＤＴを生成する途中のデータは、物体の３次元形状を境界表現形式で表現するものである。
本実施形態においては、境界表現形式で表現された物体の３次元形状を、各ボクセル（Voxel)に多値の属性値ｄを持たせたボリュームデータ（Volume Data ）ＤＶに変換する。ボクセルとは、３次元空間を小さな単位の格子に分解し、単位格子によって構成される小さな立方体である。
【００２１】
また、生成方法の異なる複数の３次元形状から変換された複数のボリュームデータＤＶを、１つのボリュームデータＤＶに統合することも行われる。
最終的なボリュームデータＤＶは、再度、境界表現形式の形状表現に逆変換される。その際に、例えば、公知の零等値面抽出法などを用いることができる。つまり、隣接する頂点（格子点）を結ぶ辺上の零等値面に対応する点を算出し、それらの点を連結した三角ポリゴンを生成することにより、ポリゴンメッシュに変換することができる。
【００２２】
逆変換によって得られた３次元形状に、テクスチャ画像を貼り付けることも行われる。このようにして３次元モデルＭＬが生成される。
３次元データ生成装置１は、パーソナルコンピュータまたはワークステーションなどを用いて構成することが可能である。上に述べたプログラムおよびデータは、ネットワークＮＷを介して受信して取得することも可能である。
【００２３】
図２は３次元データ生成装置１による３次元モデルＭＬの生成処理の流れを示すフローチャートである。
図２において、適当なサイズのボリューム（ボリュームデータＤＶ）が準備され、初期化が行われる（＃１）。ボリュームとして、例えば、５０×５０×５０ボクセル、または１００×１００×１００ボクセル程度のサイズのものが用いられる。ボリュームは立方体でなくてもよい。また、座標系におけるボクセルの中心の座標および姿勢なども設定される。
【００２４】
ボリュームは、ボクセルの各頂点に、２種類の多値のデータを属性値ｄとして格納することができる。つまり、各頂点に対して、２つの属性値ｄｓ，ｄｒを格納する領域が設けられる。これは、ボクセルの各頂点に多値のデータを属性値として格納することが可能なボリュームが２つある、とすることもできる。この場合に、一方を第１のボリューム、他方を第２のボリュームと記載することができる。
【００２５】
なお、本明細書において、「ボリューム」のことを「ボリュームデータ」と記載することがある。その場合には、「ボリュームデータ」は、ボクセルによって所定の形状となるように構成され、ボクセルの各頂点にデータを格納することが可能な３次元領域を意味する。
【００２６】
また、ボクセルの頂点は、隣合うボクセルについて共通である。したがって、周縁部にはない頂点については、８つのボクセルの頂点となる。頂点はボリュームデータの格子点と一致する。したがって、「頂点」を「格子点」と記載することがある。
【００２７】
さて、図２に戻って、シルエット画像を用いて変換処理が行われ、第１のボリュームデータＤＶが生成される（＃２）。レンジデータを用いて変換処理が行われ、第２のボリュームデータＤＶが生成される（＃３）。なお、レンジデータとは、レンジファインダなどと呼称される３次元計測装置を用いて、物体を計測して得られた３次元形状データである。また、視差のある複数の画像からステレオ法によって再構成された３次元形状データも、ここにいうレンジデータに含まれる。
【００２８】
これら、第１および第２のボリュームデータＤＶが統合される（＃４）。統合されたボリュームデータＤＶが、境界表現形式の形状表現に逆変換される（＃５）。必要に応じてテクスチャマッピングが行われる（＃６）。
【００２９】
以下において、ボリュームデータＤＶへの変換処理、および複数のボリュームデータＤＶの統合処理について、詳しく説明する。まず、変換処理について説明する。
〔第１の実施形態による変換処理〕
図３は３次元データ生成装置１による変換処理を示すフローチャート、図４は変換処理の変形例を示すフローチャートである。これらのフローチャートで示す変換処理は、レンジデータ、その他の３次元形状データ、およびシルエット画像を用いた変換処理に適用される。
【００３０】
図３において、まず、物体の３次元形状を表す３次元データＤＴが準備される。または、３次元データＤＴに代えて、３次元データＤＴの生成の元となる複数の画像ＦＴが準備される。そして、３次元データＤＴとボリュームデータＤＶとの位置合わせが行われる（＃１１）。通常、ボリュームデータＤＶの中に３次元データＤＴが丁度納まるように、それらのサイズおよび位置が決定される。
【００３１】
ボリュームを構成するボクセルの各頂点について、それぞれの頂点から３次元形状を表現する境界まで、つまり物体の表面までの距離に応じた値が求められる（＃１２）。求められた値は、各頂点の属性値として保持される（＃１３）。
【００３２】
このようにして、各ボクセルの頂点、つまり格子点には、多値の属性値が格納され、多値の属性値を持った格子点群により構成されるボリュームデータＤＶが生成される。
【００３３】
図４において、物体の表面と交差するボクセルが抽出される（＃２２）。このようなボクセルを「境界ボクセル」と記載することがある。境界ボクセルについて、その頂点から物体の表面までの距離に応じた値が求められる（＃２３）。求められた値は、上と同様に各頂点の属性値として保持される（＃２４）。
【００３４】
なお、ある頂点が境界ボクセルと他のボクセルとの共通の頂点である場合に、その頂点は境界ボクセルの頂点として扱う。
〔第２の実施形態による変換処理〕
第２の実施形態に基づいて、変換処理についてさらに詳しく説明する。なお、第２および第３の実施形態は、主としてシルエット画像を用いた変換処理に適用される。
【００３５】
図５は第２の実施形態の変換処理を示すフローチャート、図６は図５のステップ＃３４の属性値の設定処理のサブルーチンを示すフローチャート、図７はシルエット画像ＦＳをボリュームデータＤＶに投影した状態を説明するための図、図８は図７に示すシルエット画像ＦＳの一部を拡大して示す図、図９はボリュームデータＤＶを水平面で切断した状態を示す図、図１０は境界ボクセルを拡大して示す図、図１１はボリュームデータＤＶを格納したボリュームテーブルＴＬｓの例を示す図である。
【００３６】
図５において、まず、設定されたボリュームデータＤＶについて、すべての格子点ＴＰに初期値をセットする。これによってボリュームデータＤＶを初期化する（＃３１）。格子点ＴＰの初期値として、例えば、物体内部を意味する「プラス無限大」をセットする。この場合、「プラス無限大」は本発明における第１の特定値である。初期化が完了した時点において、図１０に示すボリュームデータＤＶはすべて初期値となる。
【００３７】
次に、１つの画像ＦＴを入力する（＃３２）。その際に、画像ＦＴを撮影したときのカメラパラメータも入力する。カメラパラメータとして、焦点距離などのカメラ内部行列、視点位置などの外部行列が挙げられる。これらを含んだ射影行列を入力してもよい。カメラパラメータに基づいて、画像ＦＴをボリュームデータＤＶに投影する際の視点位置および投影方向が決定される。
【００３８】
なお、ステップ＃３２で画像ＦＴを入力すると、画像ＦＴは磁気ディスク装置１１に格納される。磁気ディスク装置１１に格納された画像ＦＴは、プログラムによって自動的にＲＡＭ上に読み込まれ、以降の処理が加えられる。しかし、ステップ＃３２での画像の入力を、磁気ディスク装置１１に格納された画像ＦＴをＲＡＭ上に読み込む、という意味に用いてもよい。その場合には、予め多数の画像ＦＴを磁気ディスク装置１１に格納しておき、ステップ＃３２で指定された１つの画像ＦＴをＲＡＭ上に読み込むようにすればよい。また、画像の入力を、磁気ディスク装置１１に格納された多数の画像ＦＴの中から、処理すべき１つの画像ＦＴを指定する、という意味に用いることも可能である。
【００３９】
入力された画像ＦＴから、被写体である物体の輪郭が切り出されることにより、シルエット画像ＦＳが生成される（＃３３）。シルエット画像ＦＳは、輪郭が分かればよいので、モノクロの画像で充分である。シルエット画像の生成は、公知の方法によって自動的にまたは手動により行うことができる。
【００４０】
ボリュームデータＤＶ、つまりボクセルＶＸの各頂点（格子点）ＴＰの属性値が求められ、設定される（＃３４）。属性値は、各頂点ＴＰと物体の表面との間の符号付き距離として求められる。つまり、例えば、図７および図８に示されるように、シルエット画像ＦＳによる輪郭（遮蔽輪郭）と視点ＶＰとで定まる視体積ＶＶの境界ＳＦから頂点ＴＰまでの距離Ｌｓを、視体積ＶＶの内側に向かう方向を＋（プラス）として求めたものである。詳しくは後述する。
【００４１】
処理の必要な画像ＦＴが残っている場合に、次の１つの画像を入力する（＃３５でイエス、＃３２）。これが、必要なすべての画像ＦＴについての処理が終了するまで繰り返される（＃３５）。予定したすべての画像ＦＴについての処理を終えた場合でも、必要に応じて画像ＦＴを追加してステップ＃３２〜３４の処理を行うことが可能である。
【００４２】
なお、ステップ＃３２で画像ＦＴを入力する代わりに、シルエット画像ＦＳを入力することも可能である。その場合には、ステップ＃３３は不要である。
図６において、属性値の設定に際し、まず、ボリュームを構成する１つの格子点ＴＰに注目する（＃４１）。注目している格子点ＴＰの属性値をチェックする（＃４２）。属性値が「マイナス無限大」であれば、ステップ＃４５に進む。つまり、属性値が「マイナス無限大」であるということは、その格子点ＴＰは物体の外部にあることを意味する。物体の外部にある格子点ＴＰは切り取られるので、それ以上に属性値を求める必要はない。この場合、「マイナス無限大」は、本発明における第２の特定値である。
【００４３】
なお、最初の画像を処理する時点では、全格子点に初期値として「プラス無限大」がセットされているので、すべてステップ＃４３に進む。２枚目以降の画像を処理する時点では、以前の画像の処理により、「プラス無限大」、「マイナス無限大」、または後述の「符号付き距離ｄｓ」のいずれかが属性値としてセットされており、「マイナス無限大」以外、つまり「プラス無限大」または「符号付き距離ｄｓ」の場合に、ステップ＃４３に進む。
【００４４】
属性値が「マイナス無限大」でなければ、ステップ＃４３において、符号付き距離ｄｓの算出を行う。符号付き距離の算出には種々の方法がある。次に第１から第３までの３つの方法を説明する。
（第１の方法）
注目している格子点ＴＰと、その格子点ＴＰに隣接している格子点（つまり同じボクセルＶＸについての頂点）とを、それぞれ視点ＶＰに向かって画像ＦＴ上に投影する。投影された点同士を画像ＦＴ上でつないだときに、それによってできる辺が輪郭（遮蔽輪郭）と交差する場合に、符号付き距離を計算する。
【００４５】
つまり、図７に示すように、あるボクセルＶＸが、視体積ＶＶの境界ＳＦ上にある場合に、そのボクセルＶＸは境界ボクセルＶＸｓである。境界ボクセルＶＸｓの頂点ＴＰについて、符号付き距離を計算する。
【００４６】
図９において、視体積ＶＶの境界ＳＦと交差するボクセルＶＸが、境界ボクセルＶＸｓとして中間濃度で示されている。図１０において、境界ボクセルＶＸｓの各頂点ＴＰから視体積ＶＶの境界ＳＦまでの距離を求める。求めた距離、またはそれに対応する数値が、それぞれの頂点ＴＰの属性値として示されている。
【００４７】
距離の単位またはスケールは、適当に設定してよい。距離の最大値は、ボクセルＶＸの対角線の長さである。したがって、対角線の長さを基準として、距離を正規化してもよい。また、視体積ＶＶの内部にある頂点ＴＰについては、距離の符号をそのまま正の値とし、外部にある頂点ＴＰについては、距離にマイナスを付けて負の値とする。
【００４８】
例えば、属性値として８ビットのデータを用いる場合には、最上位ビットを符号ビットとし、下位７ビットで距離を示す。これによって、−１２７〜＋１２８の値を表現することができるので、−１２７を、外部を示す「マイナス無限大」に対応させ、＋１２８を、内部を示す「プラス無限大」に対応させ、−１２６〜＋１２７を符号付き距離に対応させる。属性値として、１２ビット、１６ビット、その他のビット数のデータを用いることができる。
【００４９】
したがって、格子点ＴＰが視体積ＶＶの境界ＳＦ上にある場合には、その属性値は零となる。格子点ＴＰが視体積ＶＶの内部にいくにしたがって属性値は大きくなり、外部にいくにしたがって属性値は小さくなる。
【００５０】
辺が遮蔽輪郭と交差しない場合には、つまり境界ボクセルＶＸｓでない場合には、その格子点ＴＰは視体積ＶＶの内部または外部に存在する。外部にある場合には、その格子点ＴＰは切り取るべき点であるから、属性値を「マイナス無限大」とする。内部にある場合には属性値を「プラス無限大」のままとする。
（第２の方法）
注目している格子点ＴＰを視点ＶＰに向かって画像ＦＴ上に投影する。投影された点を中心とし、中心から一定の半径の円の内部の領域に遮蔽輪郭があるか否かをチェックする。遮蔽輪郭が存在する場合に、符号付き距離を計算する。領域内に遮蔽輪郭が存在しない場合で、格子点ＴＰが視体積ＶＶの外部にある場合には、属性値を「マイナス無限大」とする。
。
（第３の方法）
遮蔽輪郭を一定間隔でサンプリングし、サンプリング点と視点ＶＰとを結ぶ直線を得る。これらの直線は、遮蔽輪郭を通る視線である。視体積ＶＶの境界ＳＦに代えて視線を用い、３次元的に符号付き距離を算出する。つまり、境界ボクセルＶＸｓの頂点ＴＰについて、頂点ＴＰから視線までの距離を算出する。注目する格子点ＴＰが視体積ＶＶの外部にある場合には、属性値を「マイナス無限大」とする。
【００５１】
そして、ステップ＃４４において、格子点ＴＰに属性値をセットする。属性値をセットする際には、既にセットされている属性値よりも小さい値のみをセットする。既にセットされている属性値よりも大きい値が新たに得られている場合には、その新しい属性値は無視する。
【００５２】
つまり、属性値として、新たに「マイナス無限大」が得られたとすると、先にセットされている属性値が何であれ、属性値は「マイナス無限大」となる。このようにして物体の外部が切り取られていく。
【００５３】
なお、属性値が新旧とも符号付き距離であった場合には、それらの平均値を求めてそれを新たな属性値ｄとしてもよい。
すべての格子点ＴＰに対して処理を行ったかどうかをチェックする（＃４５）。まだ処理を行っていない格子点ＴＰがある場合は、ステップ＃４１以降を繰り返す。すべての格子点ＴＰについての処理を終えた場合にはリターンする。これによって、図１１に示すようなボリュームテーブルＴＬｓが完成する。
【００５４】
第２の実施形態によると、１つの画像ＦＴごとに処理が行われるので、画像の処理が終わればその画像を削除することができ、それだけ使用するメモリ容量が少なくて済む。
〔第３の実施形態による変換処理〕
第２の実施形態の変換処理では、画像ＦＴを追加していくことにより変換処理を進行したが、最初に複数枚の画像ＦＴをまとめて入力し（読み込み）、処理を行うことも可能である。次にその例を第３の実施形態として示す。
【００５５】
図１２は第３の実施形態の変換処理を示すフローチャート、図１３は図１２のステップ＃５４の属性値の設定処理のサブルーチンを示すフローチャートである。
【００５６】
図１２において、ステップ＃５１は図５のステップ＃３１と同様である。ステップ＃５２では、物体について撮影したすべての画像ＦＴを一時に入力する。その際に、それぞれの画像ＦＴを撮影したときのカメラパラメータを入力する。各画像ＦＴについて、ステップ＃３３と同様にシルエット画像ＦＳが生成される（＃５３）。そして、属性値の設定が行われる（＃５４）。
【００５７】
図１３において、属性値の設定に際し、まず、ステップ＃４１と同様に、１つの格子点ＴＰに注目する（＃６１）。注目している格子点ＴＰが、視体積ＶＶに対してどのような位置に存在するかをチェックする（＃６２）。
【００５８】
すなわち、格子点ＴＰが視体積ＶＶの境界ＳＦの付近に存在する場合には属性値を仮に「BORDER」とし、視体積ＶＶの内部に存在する場合には属性値を「INSIDE」とし、被写体の外部に存在する場合には属性値を「OUTSIDE 」とする。属性値「INSIDE」「OUTSIDE 」は、本発明における第１の特定値および第２の特定値である。次に２つのチェック方法について説明する。
（第１の方法）
注目している格子点ＴＰと隣接している格子点とを、すべての画像ＦＴ上に投影する。投影されたそれぞれの画像ＦＴ上において、投影された２つの点を各画像ＦＴ上でつないだときに、それによってできる辺が輪郭（遮蔽輪郭）と交差するか否かを判断する。１つでも交差する画像ＦＴが存在すれば、格子点ＴＰは視体積ＶＶの境界ＳＦ付近に存在すると判断し、属性値を仮に「BORDER」とする。交差する画像ＦＴがない場合で、格子点ＴＰの投影点が遮蔽輪郭の外部に存在している画像が１つでもあれば、格子点ＴＰは被写体の外部に存在すると判断し、属性値を仮に「OUTSIDE 」とする。それ以外の場合は、格子点ＴＰは被写体の内部に存在すると判断し、属性値を仮に「INSIDE」とする。
（第２の方法）
注目している格子点ＴＰをすべての画像ＦＴ上に投影する。投影された各点を中心とし、中心から一定の半径の円の内部の領域に遮蔽輪郭があるか否かをチェックする。遮蔽輪郭を含むと判断された画像が１つでも存在する場合は、視体積ＶＶの境界ＳＦ付近に存在すると判断する。遮蔽輪郭を含むと判断された画像がない場合で、格子点ＴＰの投影点が遮蔽輪郭外に存在している画像が１つでもあれば、被写体の外部に存在すると判断する。それ以外の場合は、被写体の内部に存在すると判断する。
【００５９】
このようにして、１つの格子点ＴＰについて、全ての画像ＦＴを考慮した上で、仮の属性値を決定する。
仮の属性値が「OUTSIDE 」または「INSIDE」である場合には、属性値を、例えば、それぞれ「マイナス無限大」または「プラス無限大」とする。仮の属性値が「BORDER」である場合には、符号付き距離を計算する（＃６３）。
【００６０】
符号付き距離の計算方法は、基本的にはステップ＃４３の説明で述べたと同様である。しかし、ここでは、１つの格子点ＴＰについて、すべての画像ＦＴを同時に考慮して符号付き距離を決定する。
【００６１】
すなわち、例えば、次のようにして符号付き距離を計算する。
（第１の方法）
注目している格子点ＴＰを、すべての画像ＦＴ上に投影する。投影したすべての画像ＦＴの中から、投影点から遮蔽輪郭までの距離が一番近いものを選択する。選択された画像ＦＴについて、その遮蔽輪郭上の点を通る視線を求める。格子点ＴＰからその視線までの距離を求める。求めた距離に正負の符号を付けて属性値とする。
（第２の方法）
すべての画像ＦＴについて、遮蔽輪郭を一定間隔でサンプリングし、ステップ＃３４での第３の方法と同様に、遮蔽輪郭を通る視線を用いて３次元的に符号付き距離を算出する。
【００６２】
次に、ステップ＃６４では、ステップ＃４４と同様に格子点ＴＰに属性値をセットする。但し、こここでは、各格子点ＴＰに対して、最終的な属性値がセットされる。
【００６３】
ステップ＃６５では、ステップ＃４５と同様に、すべての格子点ＴＰに対して処理を行ったかどうかがチェックされる。
第３の実施形態によると、仮の属性値が「BORDER」である格子点ＴＰのみについて符号付き距離を計算するので、符号付き距離の計算量が大幅に低減する。したがって、処理速度が速い。
〔第４の実施形態による変換処理〕
次に、第４の実施形態として、８分木（Octree) 表現を用いた変換処理について説明する。
【００６４】
図１４は第４の実施形態の変換処理を示すフローチャート、図１５は図１４のステップ＃７５の交差判定処理のサブルーチンを示すフローチャート、図１６および図１７は８分木表現の原理を説明するための図である。
【００６５】
図１６および図１７に示すように、８分木表現では、対象とする物体よりも大きい立方体を定義し、これをルートキューブ（Root-Cube)ＲＣとする。ルートキューブを、ｘ，ｙ，ｚの各方向に沿ってそれぞれ２等分すると、体積が８分の１の立方体が８つ生成される。このような分割を任意のレベルまで再起的に繰り返すことにより、８分木のデータが生成される。８分木表現それ自体は公知である。
【００６６】
図１４において、まず、ルートキューブを設定する（＃７１）。ルートキューブの設定に当たっては、その中心の座標およびサイズを入力する。また、ルートキューブのすべての頂点に対して、物体内部を意味する「ＣＯ」を初期値としてセットする。また、ルートキューブの属性を、遮蔽輪郭と交差していることを表す「GRAY」とし、レベルを「０」とする。属性が「GRAY」であるキューブは、本発明における境界キューブに相当する。
【００６７】
次に、被写体である物体を撮影した画像ＦＴについて、必要なすべての画像を入力する（＃７２）。その際に、それぞれの画像ＦＴを撮影したときのカメラパラメータを入力する。
【００６８】
ステップ＃７３において、ステップ＃３３と同様にシルエット画像ＦＳが生成される。
ステップ＃７４において、キューブ（ルートキューブ）の分割が行われる。ここでの分割は、属性が「GRAY」であるキューブのみがついて行われる。分割は、８分割とされる。そして、レベルを１つ上げる。
【００６９】
ステップ＃７５において、キューブの交差判定が行われる。ここでは、分割されたそれぞれのキューブを画像ＦＴ上に投影し、遮蔽輪郭との交差の有無を判断する。その判断結果に基づいて、キューブの属性を決定する。
【００７０】
そして、所定のレベルに達するまで、ステップ＃７４および７５の処理を繰り返す（＃７６）。または、属性が「GRAY」であるキューブがなくなった場合には、そこで終了する。その場合には、以降の処理において、境界上にあるキューブ、または頂点が境界上にあるキューブを、属性が「GRAY」であるキューブとして用いる。
【００７１】
処理が終わると、属性が「GRAY」であるキューブについて、各頂点の属性値を求めて設定する（＃７７）。各頂点について属性値を求める方法は、第２の実施形態において説明した方法を用いることができる。
【００７２】
図１５において、分割されたキューブのうちの１つに注目する（＃８１）。キューブをすべての画像ＦＴ上に投影し、それぞれにおいて遮蔽輪郭との交差の有無を判断する（＃８２）。
【００７３】
その判断の結果、遮蔽輪郭に対して交差する画像ＦＴが１つでも存在すれば、そのキューブの属性を「GRAY」に設定する。すべての画像において、投影したキューブが遮蔽輪郭内に存在する場合は、キューブの属性を、物体内部を表す「WHITE 」に設定する。すべての画像において、投影したキューブが遮蔽輪郭外に存在する場合は、物体外部を表す「BLACK 」に設定する（＃８３）。
【００７４】
ここでの「GRAY」「WHITE 」「BLACK 」は、それぞれ、上に述べた「BORDER」「INSIDE」「OUTSIDE 」に対応する。
分割された８つのキューブのすべてについての処理が行われた場合には（＃８４でイエス）、リターンする。
【００７５】
第４の実施形態においても、属性が「GRAY」であるキューブの格子点ＴＰのみについて符号付き距離を計算するので、符号付き距離の計算量が大幅に低減し、処理速度が速い。
【００７６】
このように、第１〜第４の実施形態の変換処理によると、格子点ＴＰに多値のデータを持たせることができ、少ない個数のボクセルＶＸによって高精度の３次元形状を表現することができる。したがって、少ないメモリ容量で物体の３次元形状を高精度に表現することができる。
【００７７】
上に述べた第２〜第４の実施形態の変換処理は、主としてシルエット画像に適用されるものである。次に、主としてレンジデータに対して適用される変換処理について詳しく説明する。
〔第５の実施形態による変換処理〕
図１８は第５の実施形態の変換処理を示すフローチャート、図１９はスペースカービングを説明するための図、図２０はボリュームデータＤＶを水平面で切断した状態を示す図、図２１は境界ボクセルを拡大して示す図、図２２〜２５は最近点ＭＤを求める方法を説明するための図である。
【００７８】
ここでは、複数のレンジデータＤＲに対して変換処理を行う。このような複数のレンジデータは、例えば、物体の周囲の異なる位置から物体を複数回に分けて計測することにより得られる。なお、ボリュームデータＤＶに対するレンジデータの位置合わせは済んでいるものとする。
【００７９】
図１８において、スペースカービング（space carving ) を行う（＃９１）。スペースカービングは、ボリュームデータＤＶから、物体ではない不要な部分を切り取る処理である。
【００８０】
すなわち、図１９に示すように、ボリュームデータＤＶのボクセルＶＸのうち、レンジデータＤＲの外部にあるボクセルＶＸについて、その頂点ＴＰの属性値として「マイナス無限大」をセットする。その際に、視点ＶＰからレンジデータＤＲに対して、レンジデータＤＲを含むような視線を仮想し、その視線の内側にあって且つレンジデータＤＲの外側にあるボクセルＶＸを、切り取るべきボクセルＶＸとする。但し、上の実施形態の場合と同様に、レンジデータＤＲの近傍にある頂点ＴＰについては除外する。図１９において、白いボクセルが外部のボクセルＶＸとして表示されている。
【００８１】
１つの格子点ＴＰｘに注目する（＃９２）。注目している格子点ＴＰｘの属性値をチェックする（＃９３）。属性値が「マイナス無限大」でなければ、ステップ＃９４に進む。
【００８２】
その格子点ＴＰｘについて、最近点ＭＤを求める（＃９４）。ｉ番目のレンジデータＤＲにおける最近点ＭＤを最近点ｒｉとする。最近点ＭＤの求め方は次のとおりである。
【００８３】
図２２に示すように、複数のレンジデータＤＲのそれぞれに対して、格子点ＴＰｘから垂線を下ろす。１つのレンジデータＤＲに対して複数の垂線が引ける場合には、そのうちの最も短い垂線を選択する。垂足点が最近点ＭＤ１，２である。但し、垂線の長さＬＭ１，２が所定の長さαよりも短いことを条件とする。つまり、垂線の長さＬＭ１，２がいずれも所定の長さαよりも大きい場合には、最近点ＭＤは存在しないものとする。
【００８４】
最近点ＭＤが存在しない場合には（＃９５でノー）、その格子点ＴＰｘの属性値を「プラス無限大」とする（＃９６）。１つまたは複数の最近点ＭＤが存在する場合には（＃９５でイエス）、最近点ＭＤのうち、格子点ＴＰｘに最も近い最近点ＭＤを最近点ｒmin とする（＃９７）。
【００８５】
なお、ここでは、最近点ＭＤは、レンジデータＤＲを構成する３次元のポリゴンデータＤＰの中から選ばれる。
つまり、図２３および図２５において、格子点ＴＰｘに対するレンジデータＤＲ上の最近点ＭＤ１は、レンジデータＤＲのポリゴンデータＤＰとして存在する３次元点と一致する。したがって、最近点ＭＤ１の座標はポリゴンデータＤＰの座標と一致する。ポリゴンデータＤＰの座標は既知であるので、最近点ＭＤ１の座標は極めて容易に求められる。しかし、最近点ＭＤ１は、レンジデータＤＲに対して真に最も近い点とはいえない。したがって、後で符号付き距離を求める際に、法線となす角の余弦をかけて補正する。
【００８６】
これに対し、図２４に示すように、ポリゴンメッシュＰＭ上の任意の点ＰＭＴを最近点ＭＤとした場合には、レンジデータＤＲに対する真の最近点を得ることができる。しかし、この場合に、最近点ＭＤ１の座標は、周辺のポリゴンデータＤＰの座標から演算により求める必要があるので、座標を求めるのに時間がかかる。
【００８７】
さて、図１８に戻り、ステップ＃９８において、格子点ＴＰｘからレンジデータＤＲまでの符号付き距離を、格子点ＴＰｘから１つまたは複数の最近点ＭＤまでの距離の加重平均によって求める。符号付き距離の求め方は次のとおりである。
【００８８】
すなわち、最近点ＭＤの中で、最近点ｒmin から一定の距離内にある最近点ＭＤのみを抽出する。その中には最近点ｒmin も含める。つまり、最近点ｒmin から一定の距離よりも遠い最近点ＭＤを除外する。抽出された最近点ＭＤについて、格子点ＴＰｘからの距離を求める。求めたすべての距離を加重平均することにより得られる。
【００８９】
すなわち、格子点ＴＰｘの符号付き距離はｄr(x)は、
ｄr(x)＝Σｗi ・〔ｎi ・（ｒi −ＴＰｘ）〕／Σｗi ……（１）
但し、ｒi はｉ番目のレンジデータにおける最近点
ｎi は最近点ｒi の法線ベクトル
ｗi は最近点ｒi の重み
【００９０】
【数１】

【００９１】
として求められる。
すなわち、格子点ＴＰｘと最近点ｒi との間の距離は、格子点ＴＰｘと最近点ｒi とを結ぶ線の長さ（つまりそれらの間の距離）に、その線と最近点ｒi における法線とのなす角の余弦をかけた値として求められる。つまり、格子点ＴＰｘまでの最近点ＭＤにおける法線方向の距離として求められる。距離の符号は、法線方向がマイナスである。求めた距離について、各最近点ｒi の信頼性に応じた重み（荷重）を与えて平均する。これによって符号付き距離が求まる。
【００９２】
重みｗi は、最近点ｒi における法線ベクトルｎi と、格子点ＴＰｘを始点とし最近点ｒi を終点とするベクトルとがなす角の余弦である。つまり、その角が大きいほど、信頼性は小さくなると考えられるので、重みｗi を小さくする。
【００９３】
求めた符号付き距離を、その格子点ＴＰｘの属性値として格納する。
すべての格子点ＴＰに対して処理を行ったかどうかがチェックされる（＃９９）。まだ処理を行っていない格子点ＴＰがある場合は、ステップ＃９２以降を繰り返す。すべての格子点ＴＰについての処理を終えた場合には終了する。これによって、レンジデータＤＲに対応した、図１１のボリュームテーブルＴＬｓと同様なボリュームテーブルＴＬｒが完成する。
〔ボリュームデータの統合〕
さて、ステップ＃２および３において、シルエット画像ＦＳを用いたボリュームデータＤＶと、レンジデータＤＲを用いたボリュームデータＤＶとが生成された。ステップ＃４では、これらのボリュームデータＤＶを統合する。ボリュームデータＤＶの統合に際しては、各ボリュームデータＤＶの互いに対応するボクセルＶＸの２つの属性値に基づいて、当該ボクセルＶＸの統合された属性値を求める。なお、シルエット画像ＦＳによるボリュームデータＤＶの属性値を、シルエット属性値ｄｓ、レンジデータＤＲによるボリュームデータＤＶの属性値を、レンジ属性値ｄｒ、統合されたボリュームデータＤＶの属性値を統合属性値ｄｔと記載することがある。
【００９４】
図２６は格子点ＴＰの属性値と物体の表面ＨＭとの位置関係を示す図、図２７は２つの属性値の統合方法を示す図である。
図２６に示すように、物体の表面ＨＭに対し、属性値の値に応じて、外部（遠）、外部（近）、内部（近）、内部（遠）の４つに分類する。
【００９５】
すなわち、属性値が「マイナス無限大」の場合に外部（遠）、属性値が「プラス無限大」の場合に内部（遠）、属性値が負であって「マイナス無限大」でない場合に外部（近）、属性値が正であって「プラス無限大」でない場合に内部（近）となる。これは、シルエット画像ＦＳおよびレンジデータＤＲのいずれによるボリュームデータＤＶに対しても適用される。
【００９６】
図２７に示すように、シルエット属性値ｄｓが外部（遠）である場合に、レンジ属性値ｄｒの内容に係わらず、統合属性値ｄｔは外部（遠）を示す「マイナス無限大」である。シルエット属性値ｄｓおよびレンジ属性値ｄｒが共に内部（遠）である場合に、統合属性値ｄｔは内部（遠）を示す「プラス無限大」である。シルエット属性値ｄｓが内部（遠）で且つレンジ属性値ｄｒが外部（遠）である場合には、これは実際にはあり得ないが、統合属性値ｄｔは外部（遠）を示す「マイナス無限大」である。
【００９７】
それ以外で、一方が外部（遠）または内部（遠）で他方が外部（近）または内部（近）であった場合には、外部（近）または内部（近）であった方の属性値が用いられる。両方が外部（近）または内部（近）であった場合には、両方の属性値が混合される。混合によって、統合属性値ｄｔは次の（３）式により計算される。
【００９８】
ｄｔ＝ｗｘｄｒ＋（１−ｗｘ）ｄｓ ……（３）
但し、ｗｘは格子点ＴＰｘにおけるレンジ属性値ｄｒの重みを表す。つまり、レンジ属性値ｄｒの重みｗｘに応じた比で、レンジ属性値ｄｒとシルエット属性値ｄｓとが混合される。荷重ｗｘの値は、物体の形状などに応じて決定される。このように、重みに応じた適当な比で加算されることにより、シルエット画像ＦＳによるボリュームデータＤＶとレンジデータＤＲによるボリュームデータＤＶとの境界部分が滑らかに接続される。得られた統合属性値ｄｔは、その格子点ＴＰの属性値として格納される。このようにして、すべての格子点ＴＰについて属性値が求められる。これによって統合処理が完了する。
【００９９】
このように、シルエット画像ＦＳによるボリュームデータＤＶとレンジデータＤＲによるボリュームデータＤＶとを統合することにより、それぞれの欠点を補って欠落のない精度のよい３次元モデルＭＬを生成することができる。
【０１００】
したがって、物体に凹部領域や低反射率の部分があった場合でも、できるだけ少ないメモリ容量で且つ少ない演算時間で、正確に３次元形状を復元することができる。
【０１０１】
上に述べた実施形態においては、属性値をボクセルの頂点にセットしたが、属性値をセットするのは必ずしもボクセルの頂点である必要はない。例えば、ボクセルの重心であってもよい。
【０１０２】
また、境界と交差するボクセルについてのみ、頂点と境界との距離値をセットするようにしているが、全ボクセルについて行ってもよい。但し、実施例のように交差するボクセルについてのみ演算する方が、演算時間が短縮される。物体の内部を正、外部を負として表現したが、逆であってもよい。
【０１０３】
第４の実施形態では、キューブを８分割する例について説明したが、８分割に限らず、ｘｙｚそれぞれの方向に３分割し、つまりキューブを２７分割してもよいし、それ以上に分割してもよい。
【０１０４】
上に述べた実施形態において、３次元データ生成装置１の全体または各部の構成、処理の内容および順序、ボクセルＶＸの個数、属性値の桁数などは、本発明の趣旨に沿って適宜変更することができる。
【０１０５】
【発明の効果】
本発明によると、凹部領域や低反射率の部分があった場合でも、できるだけ少ないメモリ容量で且つ少ない演算時間で正確に３次元形状を復元することができる。
【図面の簡単な説明】
【図１】本実施形態に係る３次元データ生成装置のブロック図である。
【図２】３次元データ生成装置による３次元モデルの生成処理の流れを示すフローチャートである。
【図３】３次元データ生成装置による変換処理を示すフローチャートである。
【図４】変換処理の変形例を示すフローチャートである。
【図５】第２の実施形態の変換処理を示すフローチャートである。
【図６】図５のステップ＃３４の属性値の設定処理のサブルーチンを示すフローチャートである。
【図７】シルエット画像をボリュームデータに投影した状態を説明するための図である。
【図８】図７に示すシルエット画像の一部を拡大して示す図である。
【図９】ボリュームデータを水平面で切断した状態を示す図である。
【図１０】境界ボクセルを拡大して示す図である。
【図１１】ボリュームデータを格納したボリュームテーブルの例を示す図である。
【図１２】第３の実施形態の変換処理を示すフローチャートである。
【図１３】図１２のステップ＃５４の属性値の設定処理のサブルーチンを示すフローチャートである。
【図１４】第４の実施形態の変換処理を示すフローチャートである。
【図１５】図１４のステップ＃７５の交差判定処理のサブルーチンを示すフローチャートである。
【図１６】８分木表現の原理を説明するための図である。
【図１７】８分木表現の原理を説明するための図である。
【図１８】第５の実施形態の変換処理を示すフローチャートである。
【図１９】スペースカービングを説明するための図である。
【図２０】ボリュームデータを水平面で切断した状態を示す図である。
【図２１】境界ボクセルを拡大して示す図である。
【図２２】最近点を求める方法を説明するための図である。
【図２３】最近点を求める方法を説明するための図である。
【図２４】最近点を求める方法を説明するための図である。
【図２５】最近点を求める方法を説明するための図である。
【図２６】格子点の属性値と物体の表面との位置関係を示す図である。
【図２７】２つの属性値の統合方法を示す図である。
【符号の説明】
１３次元データ生成装置
１０装置本体（第１の変換手段、第２の変換手段、統合手段）
１１磁気ディスク装置
ＤＶボリュームデータ
ＶＸボクセル
ＦＴ画像
ＦＳシルエット画像
ＣＤＣＤ−ＲＯＭ（記録媒体）
ＦＤフロッピィディスク（記録媒体）
ＴＬボリュームテーブル（ボリュームデータ）
ＰＲモデリングプログラム（コンピュータプログラム）[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a method and apparatus for generating three-dimensional shape data, and a computer program.
[0002]
[Prior art]
Conventionally, a silhouette method (Shape From Silhouette) is known as one of methods for generating three-dimensional shape data of an object.
[0003]
In the silhouette method, the three-dimensional shape of an object is restored from a plurality of images obtained by photographing (or imaging) the object from different viewpoints. That is, in the silhouette method, based on images obtained by photographing an object from a plurality of line-of-sight methods, an object existence region that reflects the shape of the object viewed from each line-of-sight direction is described as a set of voxels in the three-dimensional image space. Then, a set of voxels in the object existence area viewed from all directions is obtained as an object in the three-dimensional space.
[0004]
[Problems to be solved by the invention]
However, in the silhouette method, a depressed portion or a depressed portion of the object does not appear as a silhouette image, and thus the shape of such a recessed region cannot be restored.
[0005]
On the other hand, as another method for restoring the three-dimensional shape of the object, there is a method using a range sensor (three-dimensional measurement apparatus) that measures the object in a non-contact manner by a light cutting method or the like. However, when the range sensor is used, there is a problem that data is lost when the reflectance of the surface of the object is extremely low, and the three-dimensional shape of the portion cannot be restored.
[0006]
The present invention has been made in view of the above-described problems, and it is an object of the present invention to accurately restore a three-dimensional shape with as little memory capacity as possible and with a small calculation time even when there is a recessed region or a low reflectance portion. And
[0007]
According to one aspect of the present invention, there is provided a method for generating three-dimensional shape data for an object, wherein a first method is performed by converting a plurality of images of the object with different viewpoints into volume data using a silhouette method. A first step of generating volume data of the second, a second step of generating second volume data by converting the three-dimensional data obtained by three-dimensional measurement of the object into volume data, the first volume A third step of integrating the data and the second volume data into one volume data;The volume data is composed of a plurality of voxels having lattice points at discrete coordinate positions in the coordinate system representing the volume, and values corresponding to the distance from the surface of the object at the lattice points of the voxels. Is stored as an attribute value, and for the attribute value, the outside (far) indicating that the grid point is outside the object, and the surface of the object is more external than the outside (far) when the grid point is outside the object. Outside (near) indicating that the grid point is inside the object (far), and that the grid point is inside the object and closer to the surface of the object than the inside (far) In the third step, for the lattice point of each voxel, the attribute value by the first volume data and the second volume data are used for the lattice points of the voxels. Obtaining an integrated attribute value is an integrated attribute value based on the sexual value.
[0008]
Preferably, in the first step, when converting to volume data using the silhouette method,
(A) obtaining a value corresponding to the distance to the boundary of the visual volume determined by the contour of the image of the object and the viewpoint of the image at each discrete coordinate position in the coordinate system representing the volume;
(B) For each coordinate position, a value corresponding to the distance from the object surface is determined based on a plurality of values obtained for a plurality of images of the object, and a step of holding the value in association with each coordinate position is executed.
[0009]
In the second step, when converting the three-dimensional data into volume data,
(A) obtaining a value corresponding to the distance to the object surface represented by the three-dimensional data of the object at each discrete coordinate position in the coordinate system representing the volume;
(B) A step of holding the obtained value in association with each coordinate position is executed.
[0010]
By inversely transforming the generated volume data, the three-dimensional shape of the object is restored.
According to the present invention, an accurate three-dimensional model without missing is generated by making up for each defect due to a difference in the method of generating three-dimensional shape data.
[0011]
DETAILED DESCRIPTION OF THE INVENTION
FIG. 1 is a block diagram of a three-dimensional data generation apparatus 1 according to this embodiment.
In FIG. 1, the three-dimensional data generation device 1 includes a device main body 10, a magnetic disk device 11, a medium drive device 12, a display device 13, a keyboard 14, a mouse 15, and the like.
[0012]
The apparatus main body 10 includes a CPU, a RAM, a ROM, a video RAM, an input / output port, various controllers, and the like. Various functions described below are realized by the CPU executing programs stored in the RAM and ROM.
[0013]
The magnetic disk device 11 includes an OS (Operating System), a modeling program PR for generating a three-dimensional model ML, other programs, input three-dimensional data (three-dimensional shape data) DT, and images (two-dimensional image data). ) FT, volume data DV, generated three-dimensional model ML, and other data are stored. These programs and data are loaded into the RAM of the apparatus main body 10 at appropriate times.
[0014]
The modeling program PR includes programs for initialization processing, attribute value setting processing, attribute value determination processing, boundary determination processing, division processing, integration processing, inverse transformation processing, mapping processing, and other processing. .
[0015]
The medium drive device 12 accesses a semiconductor memory HM such as a CD-ROM (CD), a floppy disk FD, a magneto-optical disk, a compact flash, and other recording media, and reads / writes data or programs. An appropriate drive device is used according to the type of the recording medium. The modeling program PR described above can also be installed from these recording media. The three-dimensional data DT and the image FT can also be input via a recording medium.
[0016]
On the display surface HG of the display device 13, the various data described above, the three-dimensional data DT, the image FT, the image of the processing process by the modeling program PR, the generated three-dimensional model ML, and other data or images are displayed. Is done.
[0017]
The keyboard 14 and the mouse 15 are used for the user to make various designations for the image FT and the three-dimensional data DT displayed on the display device 13, and input various data or commands to the apparatus main body 10. Used to give
[0018]
Although not shown, the apparatus main body 10 can be connected to a digital camera for photographing an object as a subject with various lines of sight or photographing from various viewpoints and inputting an image FT. The silhouette image FS is generated by cutting out the outline of the object that is the subject from the image FT. Based on the silhouette image FS, the three-dimensional data DT can be generated by the silhouette method.
[0019]
It is also possible to generate 3D data DT by performing 3D reconstruction based on two images with parallax.
The apparatus main body 10 can be connected to a three-dimensional input device (three-dimensional measurement device) for photographing an object and inputting the three-dimensional data DT. Such a three-dimensional input device measures the three-dimensional data DT of an object in a non-contact manner by, for example, a light cutting method. Further, instead of the three-dimensional data DT, data that is the basis for generating the three-dimensional data DT may be output from the three-dimensional input device, and the three-dimensional data DT may be obtained by calculation by the apparatus body 10.
[0020]
The three-dimensional data DT obtained in this way, or data in the process of generating the three-dimensional data DT, represents the three-dimensional shape of the object in the boundary representation format.
In the present embodiment, the three-dimensional shape of the object expressed in the boundary expression format is converted into volume data (Volume Data) DV in which each voxel has a multivalued attribute value d. A voxel is a small cube composed of unit lattices by dividing a three-dimensional space into small unit lattices.
[0021]
Further, a plurality of volume data DV converted from a plurality of three-dimensional shapes having different generation methods are also integrated into one volume data DV.
The final volume data DV is again converted back to a shape representation in the boundary representation format. At that time, for example, a known zero isosurface extraction method or the like can be used. That is, a point corresponding to a zero isosurface on an edge connecting adjacent vertices (lattice points) is calculated, and a triangular polygon obtained by connecting these points can be converted into a polygon mesh.
[0022]
A texture image is also pasted on the three-dimensional shape obtained by the inverse transformation. In this way, the three-dimensional model ML is generated.
The three-dimensional data generation apparatus 1 can be configured using a personal computer or a workstation. The programs and data described above can be received and acquired via the network NW.
[0023]
FIG. 2 is a flowchart showing a flow of processing for generating the three-dimensional model ML by the three-dimensional data generating apparatus 1.
In FIG. 2, an appropriately sized volume (volume data DV) is prepared and initialized (# 1). As the volume, for example, a volume of about 50 × 50 × 50 voxels or 100 × 100 × 100 voxels is used. The volume does not have to be a cube. Also, the coordinates and posture of the center of the voxel in the coordinate system are set.
[0024]
The volume can store two types of multivalued data as attribute values d at each vertex of the voxel. That is, an area for storing two attribute values ds and dr is provided for each vertex. This may be that there are two volumes capable of storing multivalued data as attribute values at each vertex of the voxel. In this case, one can be described as a first volume and the other as a second volume.
[0025]
In this specification, “volume” may be referred to as “volume data”. In this case, “volume data” means a three-dimensional area that is configured to have a predetermined shape by voxels and that can store data at each vertex of the voxel.
[0026]
Also, the voxel vertices are common to neighboring voxels. Therefore, the vertices that are not in the peripheral edge are the vertices of eight voxels. The vertex coincides with the grid point of the volume data. Therefore, “vertex” may be described as “grid point”.
[0027]
Now, returning to FIG. 2, the conversion process is performed using the silhouette image, and the first volume data DV is generated (# 2). Conversion processing is performed using the range data, and second volume data DV is generated (# 3). Note that the range data is three-dimensional shape data obtained by measuring an object using a three-dimensional measuring device called a range finder or the like. The three-dimensional shape data reconstructed from a plurality of images with parallax by the stereo method is also included in the range data referred to here.
[0028]
These first and second volume data DV are integrated (# 4). The integrated volume data DV is inversely transformed into a shape representation in the boundary representation format (# 5). Texture mapping is performed as necessary (# 6).
[0029]
Hereinafter, the conversion processing to volume data DV and the integration processing of a plurality of volume data DV will be described in detail. First, the conversion process will be described.
[Conversion processing according to the first embodiment]
FIG. 3 is a flowchart showing conversion processing by the three-dimensional data generation apparatus 1, and FIG. 4 is a flowchart showing a modification of the conversion processing. The conversion processing shown in these flowcharts is applied to conversion processing using range data, other three-dimensional shape data, and a silhouette image.
[0030]
In FIG. 3, first, three-dimensional data DT representing the three-dimensional shape of an object is prepared. Alternatively, instead of the three-dimensional data DT, a plurality of images FT that is a source for generating the three-dimensional data DT is prepared. Then, the alignment of the three-dimensional data DT and the volume data DV is performed (# 11). Normally, the size and position of the three-dimensional data DT are determined so that the three-dimensional data DT is exactly contained in the volume data DV.
[0031]
For each vertex of the voxel constituting the volume, a value corresponding to the distance from each vertex to the boundary expressing the three-dimensional shape, that is, the surface of the object is obtained (# 12). The obtained value is held as the attribute value of each vertex (# 13).
[0032]
In this way, multi-value attribute values are stored at the vertices of each voxel, that is, lattice points, and volume data DV composed of lattice point groups having multi-value attribute values is generated.
[0033]
In FIG. 4, voxels that intersect the surface of the object are extracted (# 22). Such a voxel may be referred to as a “boundary voxel”. A value corresponding to the distance from the vertex to the surface of the object is obtained for the boundary voxel (# 23). The obtained value is held as the attribute value of each vertex in the same manner as above (# 24).
[0034]
When a certain vertex is a common vertex between the boundary voxel and other voxels, the vertex is treated as a vertex of the boundary voxel.
[Conversion processing according to the second embodiment]
Based on the second embodiment, the conversion processing will be described in more detail. Note that the second and third embodiments are mainly applied to conversion processing using a silhouette image.
[0035]
FIG. 5 is a flowchart showing the conversion processing of the second embodiment, FIG. 6 is a flowchart showing the attribute value setting processing subroutine of step # 34 in FIG. 5, and FIG. 7 is a state in which the silhouette image FS is projected onto the volume data DV. FIG. 8 is an enlarged view showing a part of the silhouette image FS shown in FIG. 7, FIG. 9 is a view showing a state in which the volume data DV is cut along a horizontal plane, and FIG. 10 is an enlarged view of boundary voxels. FIG. 11 is a diagram showing an example of a volume table TLs storing volume data DV.
[0036]
In FIG. 5, first, initial values are set at all grid points TP for the set volume data DV. As a result, the volume data DV is initialized (# 31). As the initial value of the lattice point TP, for example, “plus infinity” meaning the inside of the object is set. In this case, “plus infinity” is the first specific value in the present invention. When the initialization is completed, all of the volume data DV shown in FIG. 10 have initial values.
[0037]
Next, one image FT is input (# 32). At that time, the camera parameters when the image FT is taken are also input. Camera parameters include camera internal matrix such as focal length and external matrix such as viewpoint position. A projection matrix including these may be input. Based on the camera parameters, the viewpoint position and the projection direction when the image FT is projected onto the volume data DV are determined.
[0038]
When the image FT is input in step # 32, the image FT is stored in the magnetic disk device 11. The image FT stored in the magnetic disk device 11 is automatically read into the RAM by a program, and the subsequent processing is added. However, the image input in step # 32 may be used to mean that the image FT stored in the magnetic disk device 11 is read into the RAM. In that case, a large number of images FT may be stored in advance in the magnetic disk device 11, and one image FT designated in step # 32 may be read onto the RAM. Further, the input of an image can also be used to designate one image FT to be processed from among a large number of images FT stored in the magnetic disk device 11.
[0039]
The silhouette image FS is generated by cutting out the contour of the object that is the subject from the input image FT (# 33). The silhouette image FS only needs to be known in outline, so a monochrome image is sufficient. The silhouette image can be generated automatically or manually by a known method.
[0040]
The volume data DV, that is, the attribute value of each vertex (grid point) TP of the voxel VX is obtained and set (# 34). The attribute value is obtained as a signed distance between each vertex TP and the surface of the object. That is, for example, as shown in FIGS. 7 and 8, the distance Ls from the boundary SF of the visual volume VV determined by the contour (shielding contour) by the silhouette image FS and the viewpoint VP to the vertex TP is set to the inner side of the visual volume VV. The direction toward is taken as + (plus). Details will be described later.
[0041]
When an image FT that needs to be processed remains, the next one image is input (Yes in # 35, # 32). This is repeated until the processing for all necessary images FT is completed (# 35). Even when the processing has been completed for all scheduled images FT, it is possible to perform steps # 32 to # 34 by adding an image FT as necessary.
[0042]
Instead of inputting the image FT at step # 32, it is also possible to input a silhouette image FS. In that case, step # 33 is unnecessary.
In FIG. 6, when setting the attribute value, first, attention is paid to one lattice point TP constituting the volume (# 41). The attribute value of the focused lattice point TP is checked (# 42). If the attribute value is “minus infinity”, the process proceeds to step # 45. That is, that the attribute value is “minus infinity” means that the lattice point TP is outside the object. Since the grid point TP outside the object is cut off, there is no need to obtain an attribute value beyond that. In this case, “minus infinity” is the second specific value in the present invention.
[0043]
At the time of processing the first image, since “plus infinity” is set as the initial value for all grid points, the process proceeds to step # 43. At the time of processing the second and subsequent images, either “plus infinity”, “minus infinity”, or “signed distance ds” described later is set as an attribute value by the processing of the previous image. If it is other than “minus infinity”, ie, “plus infinity” or “signed distance ds”, the process proceeds to step # 43.
[0044]
If the attribute value is not “minus infinity”, the signed distance ds is calculated in step # 43. There are various methods for calculating the signed distance. Next, three methods from the first to the third will be described.
(First method)
The focused lattice point TP and the lattice point adjacent to the lattice point TP (that is, the vertex for the same voxel VX) are each projected onto the image FT toward the viewpoint VP. When projected points are connected to each other on the image FT, a signed distance is calculated in a case where a side formed thereby intersects with a contour (shielding contour).
[0045]
That is, as shown in FIG. 7, when a certain voxel VX is on the boundary SF of the visual volume VV, the voxel VX is a boundary voxel VXs. A signed distance is calculated for the vertex TP of the boundary voxel VXs.
[0046]
In FIG. 9, voxels VX that intersect the boundary SF of the visual volume VV are shown as intermediate voxels as boundary voxels VXs. In FIG. 10, the distance from each vertex TP of the boundary voxel VXs to the boundary SF of the visual volume VV is obtained. The obtained distance or a numerical value corresponding to the distance is shown as an attribute value of each vertex TP.
[0047]
The unit or scale of the distance may be set appropriately. The maximum value of the distance is the length of the diagonal line of the voxel VX. Therefore, the distance may be normalized based on the length of the diagonal line. For the vertex TP inside the visual volume VV, the sign of the distance is set to a positive value as it is, and for the vertex TP located outside, a negative value is added to the distance.
[0048]
For example, when 8-bit data is used as the attribute value, the most significant bit is a sign bit and the lower 7 bits indicate a distance. As a result, values of −127 to +128 can be expressed, so that −127 corresponds to “minus infinity” indicating the outside, +128 corresponds to “plus infinity” indicating the interior, and −126 ˜ + 127 corresponds to the signed distance. As the attribute value, data of 12 bits, 16 bits, or other number of bits can be used.
[0049]
Therefore, when the lattice point TP is on the boundary SF of the visual volume VV, the attribute value is zero. The attribute value increases as the lattice point TP moves into the visual volume VV, and the attribute value decreases as it moves outward.
[0050]
If the side does not intersect the occlusion outline, that is, if it is not the boundary voxel VXs, the lattice point TP exists inside or outside the view volume VV. If it is outside, the grid point TP is a point to be cut out, so the attribute value is set to “minus infinity”. If it is inside, the attribute value remains “plus infinity”.
(Second method)
The target lattice point TP is projected on the image FT toward the viewpoint VP. It is checked whether or not there is a shielding outline in a region inside a circle having a certain radius from the center with the projected point as the center. Calculate signed distance if occluded contour exists. When there is no shielding contour in the region and the lattice point TP is outside the visual volume VV, the attribute value is set to “minus infinity”.
.
(Third method)
The shielding contour is sampled at regular intervals, and a straight line connecting the sampling point and the viewpoint VP is obtained. These straight lines are lines of sight passing through the shielding contour. Using a line of sight instead of the boundary SF of the visual volume VV, a signed distance is calculated three-dimensionally. That is, for the vertex TP of the boundary voxel VXs, the distance from the vertex TP to the line of sight is calculated. When the focused lattice point TP is outside the visual volume VV, the attribute value is set to “minus infinity”.
[0051]
In step # 44, an attribute value is set at the grid point TP. When setting an attribute value, only a value smaller than an already set attribute value is set. If a new value larger than the already set attribute value is obtained, the new attribute value is ignored.
[0052]
In other words, if “minus infinity” is newly obtained as the attribute value, the attribute value becomes “minus infinity” whatever the previously set attribute value. In this way, the outside of the object is cut off.
[0053]
In addition, when the attribute value is a signed distance for both old and new, an average value thereof may be obtained and used as a new attribute value d.
It is checked whether or not processing has been performed for all grid points TP (# 45). If there is a grid point TP that has not yet been processed, step # 41 and subsequent steps are repeated. When the processing for all the lattice points TP is completed, the process returns. As a result, the volume table TLs as shown in FIG. 11 is completed.
[0054]
According to the second embodiment, since processing is performed for each image FT, the image can be deleted when the processing of the image is completed, and the memory capacity to be used can be reduced accordingly.
[Conversion processing according to the third embodiment]
In the conversion process of the second embodiment, the conversion process has progressed by adding the image FT. However, it is also possible to input (read) a plurality of images FT at a time and perform the process. . Next, an example is shown as a third embodiment.
[0055]
FIG. 12 is a flowchart showing the conversion process of the third embodiment, and FIG. 13 is a flowchart showing a subroutine of the attribute value setting process in step # 54 of FIG.
[0056]
In FIG. 12, step # 51 is the same as step # 31 of FIG. In step # 52, all the images FT taken of the object are input at a time. At that time, the camera parameters when each image FT is photographed are input. For each image FT, a silhouette image FS is generated as in step # 33 (# 53). Then, an attribute value is set (# 54).
[0057]
In FIG. 13, when setting an attribute value, first, attention is paid to one grid point TP as in step # 41 (# 61). It is checked at what position the focused lattice point TP is present with respect to the visual volume VV (# 62).
[0058]
That is, when the lattice point TP exists near the boundary SF of the visual volume VV, the attribute value is “BORDER”, and when the lattice point TP exists inside the visual volume VV, the attribute value is “INSIDE”. If it exists outside, the attribute value is “OUTSIDE”. The attribute values “INSIDE” and “OUTSIDE” are the first specific value and the second specific value in the present invention. Next, two checking methods will be described.
(First method)
The target grid point TP and the adjacent grid point are projected on all the images FT. In each projected image FT, when the two projected points are connected on each image FT, it is determined whether or not a side formed thereby intersects the contour (shielding contour). If there is at least one intersecting image FT, it is determined that the lattice point TP exists in the vicinity of the boundary SF of the visual volume VV, and the attribute value is assumed to be “BORDER”. If there is no intersecting image FT and there is at least one image where the projection point of the grid point TP exists outside the shielding contour, it is determined that the grid point TP exists outside the subject, and the attribute value is temporarily set. “OUTSIDE”. In other cases, it is determined that the grid point TP exists inside the subject, and the attribute value is temporarily set to “INSIDE”.
(Second method)
The focused lattice point TP is projected on all the images FT. It is checked whether or not there is a shielding outline in a region inside a circle having a certain radius from the center with each projected point as the center. If there is even one image that is determined to include the occlusion contour, it is determined that the image exists near the boundary SF of the visual volume VV. If there is no image determined to include the occlusion outline and there is one image in which the projection point of the lattice point TP exists outside the occlusion outline, it is determined that the image exists outside the subject. In other cases, it is determined that the object exists inside the subject.
[0059]
In this way, a temporary attribute value is determined for one lattice point TP in consideration of all images FT.
When the temporary attribute value is “OUTSIDE” or “INSIDE”, the attribute value is, for example, “minus infinity” or “plus infinity”, respectively. If the provisional attribute value is “BORDER”, the signed distance is calculated (# 63).
[0060]
The calculation method of the signed distance is basically the same as described in the explanation of Step # 43. However, here, for one grid point TP, the signed distance is determined in consideration of all the images FT at the same time.
[0061]
That is, for example, the signed distance is calculated as follows.
(First method)
The target lattice point TP is projected on all the images FT. From all the projected images FT, the one having the shortest distance from the projection point to the shielding contour is selected. For the selected image FT, a line of sight passing through a point on the shielding contour is obtained. The distance from the grid point TP to the line of sight is obtained. An attribute value is added to the obtained distance with a positive or negative sign.
(Second method)
For all images FT, the occlusion contour is sampled at regular intervals, and the signed distance is calculated three-dimensionally using the line of sight that passes through the occlusion contour, as in the third method in step # 34.
[0062]
Next, in step # 64, an attribute value is set at the grid point TP as in step # 44. However, here, the final attribute value is set for each lattice point TP.
[0063]
In step # 65, as in step # 45, it is checked whether or not processing has been performed for all grid points TP.
According to the third embodiment, since the signed distance is calculated only for the lattice point TP whose provisional attribute value is “BORDER”, the calculation amount of the signed distance is greatly reduced. Therefore, the processing speed is fast.
[Conversion processing according to the fourth embodiment]
Next, a conversion process using octree representation will be described as a fourth embodiment.
[0064]
FIG. 14 is a flowchart showing the conversion process of the fourth embodiment, FIG. 15 is a flowchart showing a subroutine of the intersection determination process in step # 75 of FIG. 14, and FIGS. 16 and 17 are diagrams for explaining the principle of the octree representation. FIG.
[0065]
As shown in FIGS. 16 and 17, in the octree representation, a cube larger than the target object is defined, and this is defined as a root cube (Root-Cube) RC. When the root cube is divided into two equal parts along the x, y, and z directions, eight cubes having a volume of 1/8 are generated. By repeating such division recursively to an arbitrary level, data of an octree is generated. The octree representation itself is known.
[0066]
In FIG. 14, first, a root cube is set (# 71). When setting the root cube, enter the coordinates and size of its center. Also, “CO” meaning the inside of the object is set as an initial value for all the vertices of the root cube. Further, the attribute of the root cube is set to “GRAY” indicating that it intersects the occlusion outline, and the level is set to “0”. A cube whose attribute is “GRAY” corresponds to a boundary cube in the present invention.
[0067]
Next, all necessary images are input for the image FT obtained by photographing the object that is the subject (# 72). At that time, the camera parameters when each image FT is photographed are input.
[0068]
In step # 73, a silhouette image FS is generated as in step # 33.
In step # 74, the cube (root cube) is divided. The division here is performed only for cubes having the attribute “GRAY”. The division is 8 divisions. Then raise the level by one.
[0069]
In step # 75, a cube intersection determination is performed. Here, each divided cube is projected onto the image FT, and the presence or absence of an intersection with the shielding contour is determined. Based on the determination result, the attributes of the cube are determined.
[0070]
Then, steps # 74 and 75 are repeated until a predetermined level is reached (# 76). Or, when there are no more cubes with the attribute “GRAY”, the process ends there. In that case, in the subsequent processing, a cube on the boundary or a cube whose vertex is on the boundary is used as a cube whose attribute is “GRAY”.
[0071]
When the processing is completed, the attribute value of each vertex is obtained and set for the cube having the attribute “GRAY” (# 77). As a method for obtaining the attribute value for each vertex, the method described in the second embodiment can be used.
[0072]
In FIG. 15, attention is paid to one of the divided cubes (# 81). The cube is projected on all the images FT, and it is determined whether or not there is an intersection with the occlusion contour in each of them (# 82).
[0073]
As a result of the determination, if there is even one image FT that intersects the occlusion outline, the attribute of the cube is set to “GRAY”. In all the images, when the projected cube exists within the occlusion outline, the cube attribute is set to “WHITE” representing the inside of the object. In all the images, when the projected cube exists outside the occlusion outline, it is set to “BLACK” representing the outside of the object (# 83).
[0074]
“GRAY”, “WHITE”, and “BLACK” here correspond to “BORDER”, “INSIDE”, and “OUTSIDE” described above, respectively.
If processing has been performed for all of the eight divided cubes (Yes in # 84), the process returns.
[0075]
Also in the fourth embodiment, since the signed distance is calculated only for the lattice point TP of the cube whose attribute is “GRAY”, the calculation amount of the signed distance is greatly reduced, and the processing speed is high.
[0076]
As described above, according to the conversion processes of the first to fourth embodiments, multivalued data can be given to the lattice points TP, and a highly accurate three-dimensional shape can be expressed by a small number of voxels VX. it can. Therefore, the three-dimensional shape of the object can be expressed with high accuracy with a small memory capacity.
[0077]
The conversion processes of the second to fourth embodiments described above are mainly applied to silhouette images. Next, the conversion process mainly applied to the range data will be described in detail.
[Conversion processing according to the fifth embodiment]
18 is a flowchart showing the conversion process of the fifth embodiment, FIG. 19 is a diagram for explaining space carving, FIG. 20 is a diagram showing a state in which the volume data DV is cut along a horizontal plane, and FIG. 21 is an enlarged view of boundary voxels. FIGS. 22 to 25 are diagrams for explaining a method of obtaining the nearest point MD.
[0078]
Here, conversion processing is performed on a plurality of range data DR. Such a plurality of range data can be obtained, for example, by measuring an object in multiple times from different positions around the object. It is assumed that the range data has been aligned with the volume data DV.
[0079]
In FIG. 18, space carving is performed (# 91). Space carving is a process of cutting out unnecessary parts that are not objects from the volume data DV.
[0080]
That is, as shown in FIG. 19, among the voxels VX of the volume data DV, “minus infinity” is set as the attribute value of the vertex TP for the voxel VX outside the range data DR. At that time, a line of sight including the range data DR is hypothesized from the viewpoint VP to the range data DR, and the voxel VX inside the line of sight and outside the range data DR is the voxel VX to be cut off. To do. However, as in the case of the above embodiment, the vertex TP in the vicinity of the range data DR is excluded. In FIG. 19, white voxels are displayed as external voxels VX.
[0081]
Attention is paid to one lattice point TPx (# 92). The attribute value of the focused lattice point TPx is checked (# 93). If the attribute value is not “minus infinity”, the process proceeds to step # 94.
[0082]
For the lattice point TPx, the nearest point MD is obtained (# 94). The closest point MD in the i-th range data DR is set as the closest point ri. The method for obtaining the nearest point MD is as follows.
[0083]
As shown in FIG. 22, a perpendicular is drawn from the lattice point TPx for each of the plurality of range data DR. When a plurality of perpendicular lines can be drawn for one range data DR, the shortest perpendicular line is selected. The droop point is the latest points MD1 and MD2. However, it is a condition that the lengths LM1 and LM2 of the perpendicular are shorter than the predetermined length α. That is, when the perpendicular lengths LM1 and LM2 are both greater than the predetermined length α, the nearest point MD does not exist.
[0084]
If the nearest point MD does not exist (No in # 95), the attribute value of the lattice point TPx is set to “plus infinity” (# 96). If one or more nearest points MD exist (Yes in # 95), the nearest point MD closest to the grid point TPx is set as the nearest point rmin (# 97).
[0085]
Here, the nearest point MD is selected from the three-dimensional polygon data DP constituting the range data DR.
That is, in FIGS. 23 and 25, the nearest point MD1 on the range data DR with respect to the lattice point TPx coincides with the three-dimensional point existing as the polygon data DP of the range data DR. Therefore, the coordinates of the nearest point MD1 coincide with the coordinates of the polygon data DP. Since the coordinates of the polygon data DP are known, the coordinates of the nearest point MD1 can be obtained very easily. However, the nearest point MD1 is not truly the closest point to the range data DR. Therefore, when calculating the signed distance later, the correction is performed by applying the cosine of the angle formed with the normal.
[0086]
On the other hand, as shown in FIG. 24, when an arbitrary point PMT on the polygon mesh PM is set as the nearest point MD, a true nearest point with respect to the range data DR can be obtained. However, in this case, it is necessary to obtain the coordinates of the nearest point MD1 by calculation from the coordinates of the surrounding polygon data DP.
[0087]
Returning to FIG. 18, in step # 98, a signed distance from the grid point TPx to the range data DR is obtained by a weighted average of the distances from the grid point TPx to one or more nearest points MD. The method for obtaining the signed distance is as follows.
[0088]
That is, only the nearest point MD within a certain distance from the nearest point rmin is extracted from the nearest points MD. The nearest point rmin is also included in this. That is, the nearest point MD far from the nearest point rmin by a certain distance is excluded. For the extracted nearest point MD, the distance from the grid point TPx is obtained. It is obtained by weighted average of all obtained distances.
[0089]
That is, the signed distance of the grid point TPx is dr (x)
dr (x) = Σwi · [ni · (ri−TPx)] / Σwi (1)
Where ri is the closest point in the i-th range data
ni is the normal vector of the nearest point ri
wi is the weight of the nearest point ri
[0090]
[Expression 1]

[0091]
As required.
That is, the distance between the lattice point TPx and the nearest point ri is the length of the line connecting the lattice point TPx and the nearest point ri (that is, the distance between them), and the normal line at that point and the nearest point ri. Is obtained by multiplying the cosine of the angle formed by That is, the distance in the normal direction at the nearest point MD to the lattice point TPx is obtained. The sign of the distance is negative in the normal direction. The obtained distance is averaged by giving a weight (load) according to the reliability of each nearest point ri. Thereby, the signed distance is obtained.
[0092]
The weight w i is the cosine of the angle formed by the normal vector ni at the nearest point ri and the vector starting from the lattice point TPx and ending at the nearest point ri. That is, since the reliability is considered to decrease as the angle increases, the weight wi is decreased.
[0093]
The obtained signed distance is stored as an attribute value of the lattice point TPx.
It is checked whether processing has been performed for all grid points TP (# 99). If there is a grid point TP that has not been processed yet, step # 92 and subsequent steps are repeated. When the processes for all the lattice points TP are completed, the process ends. As a result, a volume table TLr corresponding to the range data DR and similar to the volume table TLs of FIG. 11 is completed.
[Integration of volume data]
In steps # 2 and # 3, volume data DV using the silhouette image FS and volume data DV using the range data DR are generated. In step # 4, these volume data DV are integrated. When integrating the volume data DV, the integrated attribute value of the voxel VX is obtained based on the two attribute values of the voxel VX corresponding to each other of the volume data DV. Note that the attribute value of the volume data DV by the silhouette image FS is the silhouette attribute value ds, the attribute value of the volume data DV by the range data DR is the range attribute value dr, and the attribute value of the integrated volume data DV is the integrated attribute value dt. May be described.
[0094]
FIG. 26 is a diagram showing a positional relationship between the attribute value of the lattice point TP and the surface HM of the object, and FIG.
As shown in FIG. 26, the surface HM of the object is classified into four types: outside (far), outside (near), inside (near), and inside (far) according to the value of the attribute value.
[0095]
That is, external (far) when the attribute value is “minus infinity”, internal (far) when the attribute value is “plus infinity”, and external when the attribute value is negative and not “minus infinity” (Near) If the attribute value is positive and not “plus infinity”, it is internal (near). This is applied to the volume data DV by either the silhouette image FS or the range data DR.
[0096]
As shown in FIG. 27, when the silhouette attribute value ds is outside (far), the integrated attribute value dt is “minus infinity” indicating outside (far) regardless of the contents of the range attribute value dr. When the silhouette attribute value ds and the range attribute value dr are both inside (far), the integrated attribute value dt is “plus infinity” indicating the inside (far). If the silhouette attribute value ds is inside (far) and the range attribute value dr is outside (far), this is not possible, but the integrated attribute value dt is “minus infinity” indicating the outside (far). "Large".
[0097]
Otherwise, if one is outside (far) or inside (far) and the other is outside (near) or inside (near), the attribute value that was outside (near) or inside (near) Is used. If both are external (near) or internal (near), both attribute values are mixed. By mixing, the integrated attribute value dt is calculated by the following equation (3).
[0098]
dt = wxdr + (1-wx) ds (3)
However, wx represents the weight of the range attribute value dr at the lattice point TPx. That is, the range attribute value dr and the silhouette attribute value ds are mixed at a ratio corresponding to the weight wx of the range attribute value dr. The value of the load wx is determined according to the shape of the object. In this way, by adding at an appropriate ratio according to the weight, the boundary portion between the volume data DV by the silhouette image FS and the volume data DV by the range data DR is smoothly connected. The obtained integrated attribute value dt is stored as the attribute value of the lattice point TP. In this way, attribute values are obtained for all grid points TP. This completes the integration process.
[0099]
In this way, by integrating the volume data DV based on the silhouette image FS and the volume data DV based on the range data DR, it is possible to generate a highly accurate three-dimensional model ML that compensates for the respective defects.
[0100]
Therefore, even when the object has a concave region or a low reflectance portion, the three-dimensional shape can be accurately restored with as little memory capacity as possible and with a short calculation time.
[0101]
In the embodiment described above, the attribute value is set at the vertex of the voxel. However, it is not always necessary to set the attribute value at the vertex of the voxel. For example, it may be the center of gravity of the voxel.
[0102]
In addition, the distance value between the vertex and the boundary is set only for the voxel that intersects the boundary, but may be performed for all the voxels. However, the calculation time is shortened by calculating only the intersecting voxels as in the embodiment. Although the inside of the object is expressed as positive and the outside is expressed as negative, it may be reversed.
[0103]
In the fourth embodiment, an example in which a cube is divided into eight parts has been described. However, the present invention is not limited to eight parts, but is divided into three parts in each of xyz directions, that is, the cube may be divided into 27 parts or more. Also good.
[0104]
In the embodiment described above, the configuration of the whole or each part of the three-dimensional data generation device 1, the contents and order of processing, the number of voxels VX, the number of digits of attribute values, and the like are changed as appropriate in accordance with the spirit of the present invention. be able to.
[0105]
【The invention's effect】
According to the present invention, a three-dimensional shape can be accurately restored with as little memory capacity as possible and with a short calculation time even when there is a recessed area or a portion with low reflectance.
[Brief description of the drawings]
FIG. 1 is a block diagram of a three-dimensional data generation apparatus according to an embodiment.
FIG. 2 is a flowchart showing a flow of processing for generating a three-dimensional model by the three-dimensional data generating apparatus.
FIG. 3 is a flowchart showing conversion processing by the three-dimensional data generation apparatus.
FIG. 4 is a flowchart showing a modification of the conversion process.
FIG. 5 is a flowchart illustrating a conversion process according to the second embodiment.
FIG. 6 is a flowchart showing a subroutine of attribute value setting processing in step # 34 of FIG. 5;
FIG. 7 is a diagram for explaining a state in which a silhouette image is projected onto volume data.
8 is an enlarged view showing a part of the silhouette image shown in FIG.
FIG. 9 is a diagram illustrating a state in which volume data is cut along a horizontal plane.
FIG. 10 is an enlarged view showing boundary voxels.
FIG. 11 is a diagram showing an example of a volume table storing volume data.
FIG. 12 is a flowchart illustrating conversion processing according to the third embodiment.
FIG. 13 is a flowchart showing a subroutine of attribute value setting processing in step # 54 of FIG. 12;
FIG. 14 is a flowchart illustrating a conversion process according to the fourth embodiment.
FIG. 15 is a flowchart showing a subroutine of intersection determination processing in step # 75 of FIG.
FIG. 16 is a diagram for explaining the principle of octree representation;
FIG. 17 is a diagram for explaining the principle of octree representation;
FIG. 18 is a flowchart illustrating conversion processing according to the fifth embodiment.
FIG. 19 is a diagram for explaining space carving.
FIG. 20 is a diagram illustrating a state in which volume data is cut along a horizontal plane.
FIG. 21 is an enlarged view showing boundary voxels.
FIG. 22 is a diagram for explaining a method for obtaining a closest point;
FIG. 23 is a diagram for explaining a method for obtaining a closest point;
FIG. 24 is a diagram for explaining a method for obtaining a closest point;
FIG. 25 is a diagram for explaining a method for obtaining a closest point;
FIG. 26 is a diagram illustrating a positional relationship between an attribute value of a lattice point and the surface of an object.
FIG. 27 is a diagram illustrating a method for integrating two attribute values.
[Explanation of symbols]
1 3D data generator
10 Device body (first conversion means, second conversion means, integration means)
11 Magnetic disk unit
DV volume data
VX Voxel
FT image
FS silhouette image
CD CD-ROM (recording medium)
FD floppy disk (recording medium)
TL volume table (volume data)
PR modeling program (computer program)

Claims

A method of generating three-dimensional shape data about an object,
A first step of generating first volume data by converting a plurality of images of the object from different viewpoints into volume data using a silhouette method;
A second step of generating second volume data by converting three-dimensional data obtained by three-dimensional measurement of the object into volume data;
A third step of integrating the first volume data and the second volume data into one volume data;
The volume data is composed of a plurality of voxels having lattice points at discrete coordinate positions in a coordinate system representing the volume, and a value corresponding to the distance from the surface of the object is obtained at the lattice point of each voxel. As an attribute value,
For the attribute value, the outside (far) indicating that the grid point is outside the object, and the outside (near) indicating that the grid point is outside the object and closer to the surface of the object than the outside (far). Inside (far) indicating that the grid point is inside the object, and outside (near) indicating that the grid point is inside the object and closer to the surface of the object than the inside (far). Classified into
In the third step, an integrated attribute value that is an attribute value integrated based on the attribute value based on the first volume data and the attribute value based on the second volume data on the lattice point of each voxel. Seeking
A method for generating three-dimensional shape data.

When the attribute value by the first volume data is the external (far), the integrated attribute value is external (far) regardless of the attribute value by the second volume data,
When the attribute value based on the first volume data and the attribute value based on the second volume data are both internal (far), the integrated attribute value is internal (far),
When the attribute value based on the first volume data is the inside (far) and the attribute value based on the second volume data is the outside (far), the integrated attribute value is set to the outside (far). ,
Otherwise, if the attribute value according to the first volume data is different from the attribute value according to the second volume data, the integrated attribute value is external (near) or internal (near),
When both the attribute value based on the first volume data and the attribute value based on the second volume data are the same on the outside (near) or inside (near), both the attribute values are mixed. To obtain the integrated attribute value,
The method for generating three-dimensional shape data according to claim 1.

When both the attribute value based on the first volume data and the attribute value based on the second volume data are the same outside (near) or inside (near), the integrated attribute value dt is formula,
dt = wx · dr + (1−wx) ds
However, wx: Weight of attribute value by second volume data at grid point
ds: attribute value based on the first volume data
dr: attribute value based on second volume data
Calculate with
The method for generating three-dimensional shape data according to claim 2.

In the first step, when converting to volume data using the silhouette method,
(A) obtaining a value corresponding to the distance to the boundary of the visual volume determined by the contour of the image of the object and the viewpoint of the image at each discrete coordinate position in the coordinate system representing the volume;
(B) For each coordinate position, determining a value according to the distance from the object surface based on a plurality of values obtained for a plurality of images of the object, and holding the value in association with each coordinate position;
The method for generating three-dimensional shape data according to claim 1, wherein:

In the second step, when converting the three-dimensional data into volume data,
(A) obtaining a value corresponding to the distance to the object surface represented by the three-dimensional data of the object at each discrete coordinate position in the coordinate system representing the volume;
(B) a step of holding the obtained value in association with each coordinate position;
The method for generating three-dimensional shape data according to claim 1, wherein:

An apparatus for generating three-dimensional shape data about an object,
First conversion means for generating first volume data by converting a plurality of images with different viewpoints of the object into volume data using a silhouette method;
Second conversion means for generating second volume data by converting three-dimensional data obtained by three-dimensional measurement of the object into volume data; and
Integration means for integrating the first volume data and the second volume data into one volume data ;
The volume data is composed of a plurality of voxels having lattice points at discrete coordinate positions in a coordinate system representing the volume, and a value corresponding to the distance from the surface of the object is obtained at the lattice point of each voxel. As an attribute value,
In the integration means,
For the attribute value, the outside (far) indicating that the grid point is outside the object, and the outside (near) indicating that the grid point is outside the object and closer to the surface of the object than the outside (far). Inside (far) indicating that the grid point is inside the object, and outside (near) indicating that the grid point is inside the object and closer to the surface of the object than the inside (far). Classified into
For the lattice points of each voxel, an integrated attribute value that is an attribute value integrated based on the attribute value by the first volume data and the attribute value by the second volume data is obtained.
An apparatus for generating three-dimensional shape data.

A computer program for generating three-dimensional shape data about an object,
A first step of generating first volume data by converting a plurality of images of the object from different viewpoints into volume data using a silhouette method;
A second step of generating second volume data by converting three-dimensional data obtained by three-dimensional measurement of the object into volume data;
A third step of integrating the first volume data and the second volume data into one volume data;
The above processing is executed by a computer, and the computer further includes:
The volume data is composed of a plurality of voxels having lattice points at discrete coordinate positions in a coordinate system representing the volume, and a value corresponding to the distance from the surface of the object is obtained at the lattice point of each voxel. Execute the process so that it is retained as an attribute value,
For the attribute value, the outside (far) indicating that the grid point is outside the object, and the outside (near) indicating that the grid point is outside the object and closer to the surface of the object than the outside (far). Inside (far) indicating that the grid point is inside the object, and outside (near) indicating that the grid point is inside the object and closer to the surface of the object than the inside (far). Process to classify
In the third step, an integrated attribute value that is an attribute value integrated based on the attribute value based on the first volume data and the attribute value based on the second volume data on the lattice point of each voxel. To execute the process for
Computer program for.

A computer-readable recording medium on which the computer program according to claim 7 is recorded.