JP4023330B2

JP4023330B2 - Image identification device

Info

Publication number: JP4023330B2
Application number: JP2003037153A
Authority: JP
Inventors: 勝行首藤
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2003-02-14
Filing date: 2003-02-14
Publication date: 2007-12-19
Anticipated expiration: 2023-02-14
Also published as: JP2004246717A

Description

【０００１】
【発明の属する技術分野】
本発明は、画像識別装置に関する。
【０００２】
【従来の技術】
従来、例えば特開平４−３６９０８７号公報に記載されているような、色により物体を認識する画像識別装置が知られている。
【０００３】
【特許文献１】
特開平４−３６９０８７号公報
【０００４】
【発明が解決しようとする課題】
従来の画像識別装置は、識別精度が高くない問題点と、対象物が暗い場合に誤検出しやすい問題点があった。
【０００５】
本発明は、このような従来の技術的課題を解決するためになされたもので、色識別精度が高く、また対象物が暗い場合にも正確に色識別できる画像識別装置を提供することを目的とする。
【０００６】
【課題を解決するための手段】
請求項１の発明は、画像入力部と演算処理部とを備え、前記演算処理部は、前記画像入力部に入力された原画像から所望の対象画像を予め設定した設定色に基づいて識別する画像識別装置であって、前記演算処理部は、前記設定色のＲＧＢ値であるＲ０、Ｇ０、Ｂ０を予め記憶し、該ＲＧＢ値から設定ＲＧＢ比率Ｃ＝Ｒ０／Ｇ０、Ｄ＝Ｇ０／Ｂ０、Ｅ＝Ｂ０／Ｒ０を求め、前記原画像の画素毎に、該画素のＲＧＢ値であるＲ１、Ｇ１、Ｂ１から原ＲＧＢ比率Ｈ＝Ｒ１／Ｇ１、Ｉ＝Ｇ１／Ｂ１、Ｊ＝Ｂ１／Ｒ１を求め、前記設定ＲＧＢ比率と前記原ＲＧＢ比率との比である比率Ｋ１＝Ｈ／Ｃ、Ｌ１＝Ｉ／Ｄ、Ｍ１＝Ｊ／Ｅを求め、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１以下の場合は、Ｋ２＝Ｋ１、Ｌ２＝Ｌ１、Ｍ２＝Ｍ１とし、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１を超える場合は、Ｋ２＝１／Ｋ１、Ｌ２＝１／Ｌ１、Ｍ２＝１／Ｍ１とし、前記比率Ｋ２、Ｌ２、Ｍ２の積であるＫ２×Ｌ２×Ｍ２を求めてその値を評価値Ｎとし、前記評価値Ｎが所定のしきい値以上となるＲＧＢ値Ｒ１、Ｇ１、Ｂ１を有する前記画素を対象画素として抽出し、前記対象画素の分布に応じて前記対象画像を識別する識別手段を有することを特徴とするものである。
【０００７】
請求項２の発明は、画像入力部と演算処理部とを備え、前記演算処理部は、前記画像入力部に入力された原画像から所望の対象画像を予め設定した設定色に基づいて識別する画像識別装置であって、前記演算処理部は、前記設定色のＲＧＢ値であるＲ０、Ｇ０、Ｂ０を予め記憶し、該ＲＧＢ値から設定ＲＧＢ比率Ｃ＝Ｒ０／Ｇ０、Ｄ＝Ｇ０／Ｂ０、Ｅ＝Ｂ０／Ｒ０を求め、前記原画像の画素毎に、該画素のＲＧＢ値であるＲ１、Ｇ１、Ｂ１から原ＲＧＢ比率Ｈ＝Ｒ１／Ｇ１、Ｉ＝Ｇ１／Ｂ１、Ｊ＝Ｂ１／Ｒ１を求める一方、該画素の明るさ情報Ｔを求め、前記設定ＲＧＢ比率と前記原ＲＧＢ比率との比である比率Ｋ１＝Ｈ／Ｃ、Ｌ１＝Ｉ／Ｄ、Ｍ１＝Ｊ／Ｅを求め、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１以下の場合は、Ｋ２＝Ｋ１、Ｌ２＝Ｌ１、Ｍ２＝Ｍ１とし、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１を超える場合は、Ｋ２＝１／Ｋ１、Ｌ２＝１／Ｌ１、Ｍ２＝１／Ｍ１とし、前記比率Ｋ２、Ｌ２、Ｍ２の積であるＫ２×Ｌ２×Ｍ２を求めてその値を積Ｓとし、前記積Ｓと前記明るさ情報Ｔとの積Ｓ×Ｔを評価値Ｕとし、評価値Ｕが所定のしきい値以上となるＲＧＢ値Ｒ１、Ｇ１、Ｂ１及び前記明るさ情報Ｔを有する前記画素を対象画素として抽出し、前記対象画素の分布に応じて前記対象画像を識別する識別手段を有することを特徴とするものである。
【０００８】
ここで、前記明るさ情報Ｔには、前記画素のＲＧＢ値の和であるＦ＝Ｒ１＋Ｇ１＋Ｂ１を利用することができる。
【０００９】
【発明の実施の形態】
以下、本発明の実施の形態を図に基づいて詳説する。図１に本発明の第１の実施の形態の画像識別装置を組み込んだロボット１の概略図を示す。ロボット１の頭部２の鼻先にはビデオカメラ３が取り付けてある。ロボット１の目の部分は赤外線受光部４になっている。ロボット１の耳にはマイクロホン５が取り付けてあり、音や声を入力することが可能である。
【００１０】
頭部２はロボット１の胴体６に取り付けてある。この頭部２は、２自由度のアクチュエータを介して取り付けてあり、左右・上下に動かすことが可能である。
【００１１】
ロボット１の胴体６には４本の脚７が取り付けてある。この脚７もアクチュエータを介して取り付けてあり、歩行が可能であり、自身の姿勢も制御できる。図１において、８はスピーカ、９はしつけボタンである。なお、このロボット１の駆動機構は一般的なものであり、詳細については図示していない。
【００１２】
ロボット１の鼻先のビデオカメラ３から出力された映像情報は、原画像として後述する内部のＣＰＵ１１に送られ、このＣＰＵ１１により画像認識処理など行い、これをもとに動作などに反映させる。例えば、好きな色のボールが転がっていればこれを識別して追いかけたり、嫌いなものがあればこれから退避したりする行動をとるように設定するのである。
【００１３】
図２にロボット１の制御系のブロック図を示す。ロボット１の動作制御はＣＰＵ１１により予め組み込んであるソフトウェアプログラムを実行することによって行うものであるが、ここではＣＰＵ１１の演算処理を機能別に分けて説明する。
【００１４】
ビデオカメラ３からＣＰＵ１１の画像入力ポート１２に映像信号Ｙ（輝度）、Ｕ（色差）、Ｖ（色差）と同期信号Ｖｓ、Ｈｓが出力される。ＣＰＵ１１では、ビデオカメラ３からの同期信号Ｖｓ、Ｈｓに同期して画像入力ポート１２よりデータを取り込み、画像データ処理部１３において画像データ処理を行う。
【００１５】
この画像データ処理部１３では、入力したＹＵＶデータをＲＧＢ値に変換する等の処理を行う。このＹＵＶデータのＲＧＢ値への変換は、下記のとおりである。
【００１６】
【数１】
Ｒ＝Ｙ＋１．４０２Ｖ
Ｇ＝Ｙ−０．７１４Ｖ−０．３４４Ｕ
Ｂ＝Ｙ＋１．７７２Ｕ
画像データ処理部１３ではビデオカメラ３からの各画素のＹＵＶデータに対して画素毎にＲＧＢ値を求める。
【００１７】
次に、この画素毎のＲＧＢ値を基にして、この画素が識別対象物の色を有しているか否かを全体制御部１４で判定する。
【００１８】
そして、画像データ中に、識別対象物の色を有する画素を所定値以上の数検出した場合（この方法についての詳細は後述する）、識別対象物がビデオカメラ３の前に存在すると判断し、これに対応してロボット１の行動パターンを決定し、動作制御部１５に指示する。動作制御部１５は指示された行動パターンを実現するために各部駆動モータ２１と発音部２２を動作制御し、ロボット１を頭部２を動かしたり、脚７で歩かせたり、またほえる声をスピーカ８から出させる等の動作をさせる。
【００１９】
全体制御部１４による画像識別処理は、図３のフローチャートの手順による。予め、所望の多数の対象物それぞれについて、識別すべき色のＲＧＢ値とこのＲＧＢ値を組み合わせた比率とを求める。これらの値はＣＰＵ１１に記憶させておく（ステップＳ１）。
【００２０】
例１として、対象物が黄色のテニスボールの場合、Ｒ０＝１７６、Ｇ０＝１８９、Ｂ０＝８０とであり、各色の比率Ｃ、Ｄ、Ｅは、次の通りである。
【００２１】
【数２】
Ｃ＝Ｒ０／Ｇ０＝１７６／１８９≒０．９３１
Ｄ＝Ｇ０／Ｂ０＝１８９／８０≒２．３６
Ｅ＝Ｂ０／Ｒ０＝８０／１７６≒０．４５５
ある対象物の画像信号がビデオカメラ３からＣＰＵ１１に入力されると、ＣＰＵ１１では、入力した各画素のＲＧＢの組合せの比率を計算する。ここでは、ＲＧＢ信号は、Ｒ１＝１３０、Ｇ１＝１４３、Ｂ１＝６１であったとする。すると、各色の比率Ｈ、Ｉ、Ｊは次の演算で得る（ステップＳ２）。
【００２２】
【数３】
Ｈ＝Ｒ１／Ｇ１＝１３０／１４３≒０．９０９
Ｉ＝Ｇ１／Ｂ１＝１４３／６１≒２．３４
Ｊ＝Ｂ１／Ｒ１＝６１／１３０≒０．４６９
そして各組合せに対応したＣ−Ｈ間、Ｄ−Ｉ間、Ｅ−Ｊ間の第１の比率Ｋ１＝Ｈ／Ｃ、Ｌ１＝Ｉ／Ｄ、Ｍ１＝Ｊ／Ｅを求める。そして、この第１の比率Ｋ１、Ｌ１、Ｍ１が１を超えないように除数と被除数を設定して第２の比率Ｋ２、Ｌ２、Ｍ２を求める（ステップＳ３）。例えば、Ｋ１に関して、Ｈ＞ＣであればＫ２＝１／Ｋ１とし、Ｈ≦ＣであればＫ２＝Ｋ１として第２の比率Ｋ２を求める。Ｄ−Ｉ間、Ｅ−Ｊ間の第２の比率Ｌ２、Ｍ２についても同様である。
【００２３】
これにより、上記の例１（黄色のテニスボール）においては、次のようにしてＣ−Ｈ間、Ｄ−Ｉ間、Ｅ−Ｊ間の第２の比率Ｋ２、Ｌ２、Ｍ２を求める。
【００２４】
【数４】
Ｋ２＝Ｋ１＝Ｈ／Ｃ＝０．９０９／０．９３１≒０．９７６
Ｌ２＝Ｌ１＝Ｉ／Ｄ＝２．３４／２．３６≒０．９９１
Ｍ２＝１／Ｍ１＝Ｅ／Ｊ＝０．４５５／０．４６９≒０．９７０
次に、これらを乗算して評価値Ｎを求める（ステップＳ４）。
【００２５】
【数５】
Ｎ＝Ｋ２×Ｌ２×Ｍ２
＝０．９７６×０．９９１×０．９７０≒０．９３８
次に、例２について示す。当例では、画素のＲＧＢ値がＲ１＝７０、Ｇ１＝１７２、Ｂ１＝１２５であり、この場合に、Ｋ２、Ｌ２、Ｍ２、Ｎの各値を求める方法を以下に説明する。
【００２６】
まず、各色の比率Ｈ、Ｉ、Ｊは次のようになる。
【００２７】
【数６】
Ｈ＝Ｒ１／Ｇ１＝７０／１７２≒０．４０７
Ｉ＝Ｇ１／Ｂ１＝１７２／１２５≒１．３８
Ｊ＝Ｂ１／Ｒ１＝１２５／７０≒１．７９
そして、各組合せに対応したＣ−Ｈ、Ｄ−Ｉ、Ｅ−Ｊの比率を求める。このときにも、比率は１を超えないように除数と被除数を決定する。
【００２８】
【数７】
Ｋ２＝Ｋ１＝Ｈ／Ｃ＝０．４０７／０．９３１≒０．４３７
Ｌ２＝Ｌ１＝Ｉ／Ｄ＝１．３８／２．３６≒０．５８５
Ｍ２＝１／Ｍ１＝Ｅ／Ｊ＝０．４５５／１．７９≒０．２５４
これらを乗算して評価値Ｎを求める。
【００２９】
【数８】
Ｎ＝Ｋ２×Ｌ２×Ｍ２
＝０．４３７×０．５８５×０．２５４≒０．０６４９
ここで、予めしきい値Ｓ１を０．７に設定していた場合、例１の画素はその評価値Ｎがしきい値よりも大きいので、その画素を所望の対象物の画像に含まれる画素として認識する（ステップＳ４，Ｓ６，Ｓ７）。この場合の判定を、「ＯＫ判定」と称する。
【００３０】
一方、例２の画素はその評価値Ｎがしきい値よりも小さいので、その画素を所望の対象物の画像に含まれない画素として認識する（ステップＳ４，Ｓ６，Ｓ８）。この場合の判定を、「ＮＧ判定」と称する。
【００３１】
以上説明した方法は、ＲＧＢの各比率をもとに計算しているので、輝度による影響を受けない。このため、画像が明るい場合でも暗い場合でも、良好な画像識別が可能である。また、各要素の比率を乗算して評価値Ｎを求めているので、色の差を大きな数値差として得られ、精度の良い識別が可能である。
【００３２】
次に、本発明の第２の実施の形態について説明する。第２の実施の形態の画像識別装置の機能構成は第１の実施の形態と共通である。そして全体制御部１４における処理機能において、第１の実施の形態における評価値Ｎを求めるステップまでの処理は共通であるので、それ以降の処理機能について以下に詳述する。
【００３３】
当実施の形態における全体制御部１４は、例えばペットロボットなど安価なカメラを画像入力手段として使用する場合のように、得られる画像信号のＳ／Ｎが良好でないときに、明るさ成分を含めて識別させることが可能になり、具体的には、評価値Ｎに明るさ情報Ｔを乗じた第２の評価値で評価する機能を備えている。
【００３４】
この明るさ情報Ｔとしては、輝度信号Ｙを使用することができる。具体的には、入力した輝度信号Ｙ値の正規化値を評価値Ｎに乗算する。つまり、輝度信号Ｙの最大値がＹMAX＝２５５である場合、Ｙ／２５５を乗算して第２の評価値Ｐを、
【数９】
Ｐ＝Ｎ×（Ｙ／２５５）
として求め（ステップＳ５）、この値を予め設定したしきい値Ｓ２と比較する（ステップＳ６）。
【００３５】
この場合、第２の評価値Ｐが１に近いほど識別したい色に近いことになる。つまり、明るく、かつＳ／Ｎ良く撮影された状態の信号でＯＫ判定となったことになる（ステップＳ６，Ｓ７）。そして暗くて、色信号がノイズに埋もれた状態で得られた値は評価が下がることになり、曖昧な値はキャンセルされる（ステップＳ６，Ｓ８）。
【００３６】
従って、良好なＳ／Ｎの色信号のみを対象にＯＫ判定するので暗い画像であっても高精度の識別が可能となる。
【００３７】
なお、この輝度信号Ｙは上述のように入力のＹ値を用いてもよいし、ビデオカメラ３からの入力がＲＧＢ信号の場合にはＹを演算により求めてもよい。また、明るさ情報ＴとしてＲ１＋Ｇ１＋Ｂ１の値を使用してもよい。
【００３８】
ＣＰＵ１１は、この方法で得られた識別結果をもとに全体の制御プログラムの流れを決め、動作制御を行う。
【００３９】
まず、上述した処理で得られた画素毎の色識別結果から、原画像に所望の対象物が含まれているか否かを判定する。この判定処理の一例を説明する。
【００４０】
所望の対象物の色であるとの判定（上述のＯＫ判定）を縦横に隣接する４つの画素において得られた場合に、この所望の対象物がカメラ前方に存在すると判定する。
【００４１】
また、所望の対象物の色であるとの判定を複数の画素領域でそれぞれ独立に得られた場合は、縦横に連続する画素数が多い方の領域を所望の対象物であると判定する。
【００４２】
この判定処理は、本画像識別装置に搭載するＣＰＵ１１の性能により、またカメラ３の受像素子の画素数により適宜に設定すべきものである。いずれの処理方法においても、ＯＫ判定により抽出した対象画素の画像上の分布に応じて対象物とする領域を識別するように設定すればよい。
【００４３】
ＣＰＵ１１は、上述のように原画像中に所望の対象物を識別すると、予め記憶した、その対象物に応じた仕草を実行するようにロボットの動作を制御する。例えば、各関節のモータ２１を制御することによってロボットを歩行させたり、首を回させたりする。また発音部２２に指示を出し、スピーカ８から鳴き声を出させたりする。この制御により、ロボットに好みのものを追いかけさせたり、危険なものから退避させたりすることが可能となる。
【００４４】
なお、本発明の技術的思想には次の画像識別処理方法、画像識別処理プログラムも含むものである。
【００４５】
（１）コンピュータに原画像を入力し、入力された原画像から所望の対象画像を予め設定した設定色に基づいて識別する画像識別方法であって、前記設定色のＲＧＢ値であるＲ０、Ｇ０、Ｂ０を予め記憶するステップと、該ＲＧＢ値から設定ＲＧＢ比率Ｃ＝Ｒ０／Ｇ０、Ｄ＝Ｇ０／Ｂ０、Ｅ＝Ｂ０／Ｒ０を求めるステップと、前記原画像の画素毎に、該画素のＲＧＢ値であるＲ１、Ｇ１、Ｂ１から原ＲＧＢ比率Ｈ＝Ｒ１／Ｇ１、Ｉ＝Ｇ１／Ｂ１、Ｊ＝Ｂ１／Ｒ１を求め、前記設定ＲＧＢ比率と前記原ＲＧＢ比率との比である比率Ｋ１＝Ｈ／Ｃ、Ｌ１＝Ｉ／Ｄ、Ｍ１＝Ｊ／Ｅを求め、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１以下の場合は、Ｋ２＝Ｋ１、Ｌ２＝Ｌ１、Ｍ２＝Ｍ１とし、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１を超える場合は、Ｋ２＝１／Ｋ１、Ｌ２＝１／Ｌ１、Ｍ２＝１／Ｍ１とし、前記比率Ｋ２、Ｌ２、Ｍ２の積であるＫ２×Ｌ２×Ｍ２を求めてその値を評価値Ｎとするステップと、前記評価値Ｎが所定のしきい値以上となるＲＧＢ値Ｒ１、Ｇ１、Ｂ１を有する前記画素を対象画素として抽出するステップと、前記対象画素の分布に応じて前記対象画像を識別するステップとを有する画像識別方法。
【００４６】
（２）コンピュータに原画像を入力し、入力された原画像から所望の対象画像を予め設定した設定色に基づいて識別する画像識別方法であって、前記設定色のＲＧＢ値であるＲ０、Ｇ０、Ｂ０を予め記憶するステップと、該ＲＧＢ値から設定ＲＧＢ比率Ｃ＝Ｒ０／Ｇ０、Ｄ＝Ｇ０／Ｂ０、Ｅ＝Ｂ０／Ｒ０を求めるステップと、前記原画像の画素毎に、該画素のＲＧＢ値であるＲ１、Ｇ１、Ｂ１から原ＲＧＢ比率Ｈ＝Ｒ１／Ｇ１、Ｉ＝Ｇ１／Ｂ１、Ｊ＝Ｂ１／Ｒ１を求める一方、該画素の明るさ情報Ｔを求め、前記設定ＲＧＢ比率と前記原ＲＧＢ比率との比である比率Ｋ１＝Ｈ／Ｃ、Ｌ１＝Ｉ／Ｄ、Ｍ１＝Ｊ／Ｅを求め、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１以下の場合は、Ｋ２＝Ｋ１、Ｌ２＝Ｌ１、Ｍ２＝Ｍ１とし、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１を超える場合は、Ｋ２＝１／Ｋ１、Ｌ２＝１／Ｌ１、Ｍ２＝１／Ｍ１とし、前記比率Ｋ２、Ｌ２、Ｍ２の積であるＫ２×Ｌ２×Ｍ２を求めてその値を積Ｓとし、前記積Ｓと前記明るさ情報Ｔとの積Ｓ×Ｔを評価値Ｕとするステップと、評価値Ｕが所定のしきい値以上となるＲＧＢ値Ｒ１、Ｇ１、Ｂ１及び前記明るさ情報Ｔを有する前記画素を対象画素として抽出するステップと、前記対象画素の分布に応じて前記対象画像として識別するステップとを有することを特徴とする画像識別方法。
【００４７】
（３）設定色のＲＧＢ値であるＲ０、Ｇ０、Ｂ０を予め記憶装置に記憶するステップと、該ＲＧＢ値から設定ＲＧＢ比率Ｃ＝Ｒ０／Ｇ０、Ｄ＝Ｇ０／Ｂ０、Ｅ＝Ｂ０／Ｒ０を求めるステップと、前記原画像の画素毎に、該画素のＲＧＢ値であるＲ１、Ｇ１、Ｂ１から原ＲＧＢ比率Ｈ＝Ｒ１／Ｇ１、Ｉ＝Ｇ１／Ｂ１、Ｊ＝Ｂ１／Ｒ１を求め、前記設定ＲＧＢ比率と前記原ＲＧＢ比率との比である比率Ｋ１＝Ｈ／Ｃ、Ｌ１＝Ｉ／Ｄ、Ｍ１＝Ｊ／Ｅを求め、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１以下の場合は、Ｋ２＝Ｋ１、Ｌ２＝Ｌ１、Ｍ２＝Ｍ１とし、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１を超える場合は、Ｋ２＝１／Ｋ１、Ｌ２＝１／Ｌ１、Ｍ２＝１／Ｍ１とし、前記比率Ｋ２、Ｌ２、Ｍ２の積であるＫ２×Ｌ２×Ｍ２を求めてその値を評価値Ｎとするステップと、前記評価値Ｎが所定のしきい値以上となるＲＧＢ値Ｒ１、Ｇ１、Ｂ１を有する前記画素を対象画素として抽出するステップと、前記対象画素の分布に応じて前記対象画像を識別するステップとをコンピュータに実行させる画像識別プログラム。
【００４８】
（４）設定色のＲＧＢ値であるＲ０、Ｇ０、Ｂ０を予め記憶装置に記憶するステップと、該ＲＧＢ値から設定ＲＧＢ比率Ｃ＝Ｒ０／Ｇ０、Ｄ＝Ｇ０／Ｂ０、Ｅ＝Ｂ０／Ｒ０を求めるステップと、前記原画像の画素毎に、該画素のＲＧＢ値であるＲ１、Ｇ１、Ｂ１から原ＲＧＢ比率Ｈ＝Ｒ１／Ｇ１、Ｉ＝Ｇ１／Ｂ１、Ｊ＝Ｂ１／Ｒ１を求める一方、該画素の明るさ情報Ｔを求め、前記設定ＲＧＢ比率と前記原ＲＧＢ比率との比である比率Ｋ１＝Ｈ／Ｃ、Ｌ１＝Ｉ／Ｄ、Ｍ１＝Ｊ／Ｅを求め、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１以下の場合は、Ｋ２＝Ｋ１、Ｌ２＝Ｌ１、Ｍ２＝Ｍ１とし、前記比率Ｋ１、Ｌ１、Ｍ１の各値が１を超える場合は、Ｋ２＝１／Ｋ１、Ｌ２＝１／Ｌ１、Ｍ２＝１／Ｍ１とし、前記比率Ｋ２、Ｌ２、Ｍ２の積であるＫ２×Ｌ２×Ｍ２を求めてその値を積Ｓとし、前記積Ｓと前記明るさ情報Ｔとの積Ｓ×Ｔを評価値Ｕとするステップと、評価値Ｕが所定のしきい値以上となるＲＧＢ値Ｒ１、Ｇ１、Ｂ１及び前記明るさ情報Ｔを有する前記画素を対象画素として抽出するステップと、前記対象画素の分布に応じて前記対象画像として識別するステップとをコンピュータに実行させる画像識別プログラム。
【００４９】
【発明の効果】
以上のように本発明によれば、識別のしきい値をＲＧＢ信号の各比率をもとに求めているので、輝度による影響を受けずに画像識別でき、この結果、明るい場合でも暗い場合でも画像識別を良好に行うことができる。また、各要素の比率を乗算しているので、色の差が大きな数値差として得られ、精度良い識別が可能である。
【００５０】
また本発明によれば、画像中の暗い部分におけるＳ／Ｎが悪い曖昧な画像情報による評価をキャンセルすることができ、画像識別結果の信頼度を上げることができる。
【図面の簡単な説明】
【図１】本発明の１つの実施の形態の画像識別装置を搭載したロボットの斜視図。
【図２】上記実施の形態の画像識別装置の機能ブロック図。
【図３】上記実施の形態による画像識別処理のフローチャート。
【符号の説明】
１ロボット
３ビデオカメラ
１１ＣＰＵ
１２画像入力ポート
１３画像データ処理部
１４全体制御部
１５動作制御部

[0001]
[Technical Field of the Invention]
The present invention relates to an image identification device.
[0002]
[Prior Art]
Image identification devices that recognize an object by its color are conventionally known, for example as described in Japanese Patent Laid-Open No. 4-369087.
[0003]
[Patent Document 1]
Japanese Patent Laid-Open No. 4-369087
[0004]
[Problems to Be Solved by the Invention]
Conventional image identification devices have two problems: their identification accuracy is not high, and they are prone to false detection when the object is dark.
[0005]
The present invention has been made to solve these technical problems of the conventional art, and its object is to provide an image identification device that identifies colors with high accuracy and can identify colors correctly even when the object is dark.
[0006]
[Means for Solving the Problems]
The invention according to claim 1 is an image identification device comprising an image input unit and an arithmetic processing unit, in which the arithmetic processing unit identifies a desired target image in an original image input to the image input unit on the basis of a preset set color. The arithmetic processing unit has identification means that: stores in advance R0, G0, and B0, which are the RGB values of the set color, and obtains from these RGB values the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0; obtains, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel; obtains the ratios K1 = H / C, L1 = I / D, M1 = J / E between the set RGB ratios and the original RGB ratios; sets K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less, and K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values exceed 1; obtains K2 × L2 × M2, the product of the ratios K2, L2, and M2, and uses its value as an evaluation value N; extracts, as target pixels, the pixels whose RGB values R1, G1, and B1 give an evaluation value N equal to or greater than a predetermined threshold; and identifies the target image according to the distribution of the target pixels.
[0007]
The invention according to claim 2 is an image identification device comprising an image input unit and an arithmetic processing unit, in which the arithmetic processing unit identifies a desired target image in an original image input to the image input unit on the basis of a preset set color. The arithmetic processing unit has identification means that: stores in advance R0, G0, and B0, which are the RGB values of the set color, and obtains from these RGB values the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0; obtains, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel, and also obtains brightness information T of the pixel; obtains the ratios K1 = H / C, L1 = I / D, M1 = J / E between the set RGB ratios and the original RGB ratios; sets K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less, and K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values exceed 1; obtains K2 × L2 × M2, the product of the ratios K2, L2, and M2, and uses its value as a product S; uses S × T, the product of the product S and the brightness information T, as an evaluation value U; extracts, as target pixels, the pixels whose RGB values R1, G1, and B1 and brightness information T give an evaluation value U equal to or greater than a predetermined threshold; and identifies the target image according to the distribution of the target pixels.
[0008]
Here, as the brightness information T, F = R1 + G1 + B1, which is the sum of the RGB values of the pixels, can be used.
[0009]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. FIG. 1 is a schematic view of a robot 1 incorporating the image identification device according to the first embodiment of the present invention. A video camera 3 is attached to the nose of the head 2 of the robot 1. The eye part of the robot 1 is an infrared light receiving unit 4. A microphone 5 is attached to the ear of the robot 1 so that sounds and voices can be input.
[0010]
The head 2 is attached to the body 6 of the robot 1. The head 2 is attached via an actuator with two degrees of freedom and can be moved left and right and up and down.
[0011]
Four legs 7 are attached to the body 6 of the robot 1. The legs 7 are also attached via actuators, so the robot can walk and control its own posture. In FIG. 1, 8 is a speaker and 9 is a training button. The drive mechanism of the robot 1 is a conventional one, and its details are not shown.
[0012]
Video information output from the video camera 3 at the tip of the nose of the robot 1 is sent as an original image to an internal CPU 11 (described later). The CPU 11 performs image recognition processing and the like, and the results are reflected in the robot's behavior. For example, the robot can be set to identify and chase a rolling ball of a color it likes, or to retreat from something it dislikes.
[0013]
FIG. 2 shows a block diagram of the control system of the robot 1. The operation of the robot 1 is controlled by the CPU 11 executing a pre-installed software program; here, the arithmetic processing of the CPU 11 is described separately for each function.
[0014]
Video signals Y (luminance), U (color difference), V (color difference) and synchronization signals Vs, Hs are output from the video camera 3 to the image input port 12 of the CPU 11. The CPU 11 captures data from the image input port 12 in synchronization with the synchronization signals Vs and Hs from the video camera 3 and performs image data processing in the image data processing unit 13.
[0015]
The image data processing unit 13 performs processing such as converting input YUV data into RGB values. The conversion of this YUV data into RGB values is as follows.
[0016]
[Equation 1]
R = Y + 1.402V
G = Y − 0.714V − 0.344U
B = Y + 1.772U
The image data processing unit 13 obtains an RGB value for each pixel with respect to the YUV data of each pixel from the video camera 3.
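The conversion in Equation 1 can be sketched per pixel as follows. This is only an illustrative sketch: the function name `yuv_to_rgb` is hypothetical, U and V are assumed to be signed color-difference values centered on zero, and the clamping to the 0 to 255 range is an added assumption that the text does not mention.

```python
def yuv_to_rgb(y, u, v):
    """Convert one pixel from YUV to RGB using the coefficients of Equation 1.

    Assumes u and v are signed color differences centered on zero.
    """
    r = y + 1.402 * v
    g = y - 0.714 * v - 0.344 * u
    b = y + 1.772 * u

    # Clamp to the 8-bit range (an assumption, not stated in the source).
    def clamp(x):
        return max(0.0, min(255.0, x))

    return clamp(r), clamp(g), clamp(b)


# A pixel with no color difference is gray: R = G = B = Y.
gray = yuv_to_rgb(100, 0, 0)
```

With zero color difference the three channels equal Y, which is a quick sanity check on the coefficients.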
[0017]
Next, based on the RGB values for each pixel, the overall control unit 14 determines whether this pixel has the color of the identification target.
[0018]
When at least a predetermined number of pixels having the color of the identification target are detected in the image data (this method is described in detail later), the overall control unit 14 judges that the identification target is present in front of the video camera 3, determines a corresponding behavior pattern for the robot 1, and instructs the operation control unit 15. To realize the instructed behavior pattern, the operation control unit 15 controls the drive motors 21 of the respective parts and the sound generation unit 22, causing the robot 1 to move its head 2, walk on its legs 7, or emit a barking sound from the speaker 8.
[0019]
The image identification processing by the overall control unit 14 follows the flowchart of FIG. 3. In advance, for each of the many desired objects, the RGB values of the color to be identified and the ratios formed by combining these RGB values are obtained. These values are stored in the CPU 11 (step S1).
[0020]
As Example 1, when the object is a yellow tennis ball, R0 = 176, G0 = 189, and B0 = 80, and the ratios C, D, and E of the colors are as follows.
[0021]
[Equation 2]
C = R0 / G0 = 176 / 189≈0.931
D = G0 / B0 = 189 / 80≈2.36
E = B0 / R0 = 80 / 176≈0.455
When an image signal of some object is input from the video camera 3 to the CPU 11, the CPU 11 calculates the ratios of the RGB combinations for each input pixel. Here, assume that the RGB values are R1 = 130, G1 = 143, and B1 = 61. The ratios H, I, and J of the colors are then obtained by the following calculation (step S2).
[0022]
[Equation 3]
H = R1 / G1 = 130 / 143≈0.909
I = G1 / B1 = 143 / 61≈2.34
J = B1 / R1 = 61 / 130≈0.469
Then, the first ratios K1 = H / C, L1 = I / D, and M1 = J / E are obtained for the corresponding pairs C-H, D-I, and E-J. The divisor and dividend are then chosen so that the result does not exceed 1, giving the second ratios K2, L2, and M2 (step S3). For example, for K1, if H > C then K2 = 1 / K1, and if H ≦ C then K2 = K1. The same applies to the second ratios L2 and M2 for the pairs D-I and E-J.
[0023]
Thus, in the above-described example 1 (yellow tennis ball), the second ratios K2, L2, and M2 between C-H, D-I, and E-J are obtained as follows.
[0024]
[Equation 4]
K2 = K1 = H / C = 0.909 / 0.931≈0.976
L2 = L1 = I / D = 2.34 / 2.36≈0.991
M2 = 1 / M1 = E / J = 0.455 / 0.469≈0.970
Next, these values are multiplied to obtain an evaluation value N (step S4).
[0025]
[Equation 5]
N = K2 × L2 × M2
= 0.976 × 0.991 × 0.970 ≈ 0.938
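Steps S2 to S4 for Example 1 can be reproduced with a short sketch. The function names `folded_ratio` and `evaluation_value` are hypothetical, and the sketch assumes that no channel value is zero, since the text does not say how a zero channel is handled.

```python
def folded_ratio(observed, reference):
    """First ratio observed/reference, inverted when it exceeds 1 (step S3)."""
    first = observed / reference
    return first if first <= 1.0 else 1.0 / first


def evaluation_value(pixel, target):
    """Evaluation value N = K2 * L2 * M2 for one pixel (steps S2 to S4).

    Assumes all channel values are non-zero.
    """
    r1, g1, b1 = pixel
    r0, g0, b0 = target
    k2 = folded_ratio(r1 / g1, r0 / g0)  # H against C
    l2 = folded_ratio(g1 / b1, g0 / b0)  # I against D
    m2 = folded_ratio(b1 / r1, b0 / r0)  # J against E
    return k2 * l2 * m2


# Example 1: yellow tennis ball as the set color.
n_example1 = evaluation_value((130, 143, 61), (176, 189, 80))
```

Computed without rounding the intermediate ratios, `n_example1` agrees with the value 0.938 above to three decimal places.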
Next, Example 2 will be described. In this example, the RGB values of the pixels are R1 = 70, G1 = 172, and B1 = 125. In this case, a method for obtaining each value of K2, L2, M2, and N will be described below.
[0026]
First, the ratios H, I, and J of each color are as follows.
[0027]
[Equation 6]
H = R1 / G1 = 70 / 172≈0.407
I = G1 / B1 = 172 / 125≈1.38
J = B1 / R1 = 125 / 70≈1.79
Then, the ratios for the corresponding pairs C-H, D-I, and E-J are obtained. Here too, the divisor and dividend are chosen so that each ratio does not exceed 1.
[0028]
[Equation 7]
K2 = K1 = H / C = 0.407 / 0.931≈0.437
L2 = L1 = I / D = 1.38 / 2.36≈0.585
M2 = 1 / M1 = E / J = 0.455 / 1.79≈0.254
These values are multiplied to obtain an evaluation value N.
[0029]
[Equation 8]
N = K2 × L2 × M2
= 0.437 × 0.585 × 0.254 ≈ 0.0649
Here, when the threshold value S1 has been set to 0.7 in advance, the evaluation value N of the pixel of Example 1 is larger than the threshold value, so the pixel is recognized as a pixel included in the image of the desired object (steps S4, S6, and S7). This determination is referred to as an “OK determination”.
[0030]
On the other hand, since the evaluation value N of the pixel of Example 2 is smaller than the threshold value, the pixel is recognized as a pixel that is not included in the image of the desired object (steps S4, S6, and S8). The determination in this case is referred to as “NG determination”.
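The OK/NG decision for the two example pixels can be sketched as below. The names `judge`, `TARGET`, and `THRESHOLD_S1` are hypothetical; the comparison uses "equal to or greater than", following the wording of the claims, and non-zero channel values are assumed.

```python
TARGET = (176, 189, 80)  # set color of Example 1 (yellow tennis ball)
THRESHOLD_S1 = 0.7       # threshold value S1 used in the text


def evaluation_value(pixel, target=TARGET):
    """Product of the three folded ratios (evaluation value N)."""
    (r1, g1, b1), (r0, g0, b0) = pixel, target
    pairs = ((r1 / g1, r0 / g0), (g1 / b1, g0 / b0), (b1 / r1, b0 / r0))
    n = 1.0
    for observed, reference in pairs:
        first = observed / reference
        n *= first if first <= 1.0 else 1.0 / first
    return n


def judge(pixel):
    """Return 'OK' when N reaches the threshold, otherwise 'NG'."""
    return "OK" if evaluation_value(pixel) >= THRESHOLD_S1 else "NG"


results = {
    "example 1": judge((130, 143, 61)),
    "example 2": judge((70, 172, 125)),
}
```

The two evaluation values (about 0.94 and 0.065) lie far from the threshold on either side, so the outcome does not depend on whether the boundary case counts as OK.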
[0031]
Since the method described above is calculated based on each ratio of RGB, it is not affected by luminance. Therefore, good image identification is possible regardless of whether the image is bright or dark. In addition, since the evaluation value N is obtained by multiplying the ratio of each element, the color difference can be obtained as a large numerical difference, and identification with high accuracy is possible.
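The claim that the evaluation value is unaffected by luminance can be checked numerically: scaling all three channels by the same factor leaves every ratio unchanged. The sketch below uses hypothetical names, assumes non-zero channels, and also shows a pixel rounded to integers, for which the invariance holds only approximately.

```python
def evaluation_value(pixel, target):
    """Product of the three folded ratios (evaluation value N)."""
    (r1, g1, b1), (r0, g0, b0) = pixel, target
    pairs = ((r1 / g1, r0 / g0), (g1 / b1, g0 / b0), (b1 / r1, b0 / r0))
    n = 1.0
    for observed, reference in pairs:
        first = observed / reference
        n *= first if first <= 1.0 else 1.0 / first
    return n


target = (176, 189, 80)

# Same color at half brightness: every channel scaled by 0.5.
dim = tuple(0.5 * channel for channel in target)
n_dim = evaluation_value(dim, target)

# After rounding to integer channel values, the ratios shift slightly.
dim_quantized = (88, 94, 40)
n_dim_quantized = evaluation_value(dim_quantized, target)
```

Here `n_dim` stays at 1 while `n_dim_quantized` falls slightly below 1, which illustrates why very dark, coarsely quantized or noisy pixels become less reliable.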
[0032]
Next, a second embodiment of the present invention will be described. The functional configuration of the image identification apparatus of the second embodiment is the same as that of the first embodiment. Since the processing up to the step of obtaining the evaluation value N in the first embodiment is common in the processing functions in the overall control unit 14, the subsequent processing functions will be described in detail below.
[0033]
The overall control unit 14 in this embodiment makes it possible to include a brightness component in the identification when the S/N of the obtained image signal is poor, as when an inexpensive camera is used as the image input means, for example in a pet robot. Specifically, it has a function of evaluating with a second evaluation value obtained by multiplying the evaluation value N by brightness information T.
[0034]
As the brightness information T, the luminance signal Y can be used. Specifically, the evaluation value N is multiplied by the normalized value of the input luminance signal Y. That is, when the maximum value of the luminance signal Y is YMAX = 255, N is multiplied by Y / 255 to obtain the second evaluation value P as
[Equation 9]
P = N × (Y / 255)
(step S5), and this value is compared with a preset threshold value S2 (step S6).
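Steps S5 and S6 can be sketched as follows. The text gives no value for the threshold S2, so `THRESHOLD_S2 = 0.5` is purely a hypothetical figure chosen for illustration, as are the function names and the sample luminance values.

```python
Y_MAX = 255
THRESHOLD_S2 = 0.5  # hypothetical value; the source does not specify S2


def second_evaluation(n, y, y_max=Y_MAX):
    """Second evaluation value P = N * (Y / Y_MAX) (step S5)."""
    return n * (y / y_max)


def judge_with_brightness(n, y):
    """Compare P with the threshold S2 (step S6)."""
    return "OK" if second_evaluation(n, y) >= THRESHOLD_S2 else "NG"


# The same color match (N = 0.938) seen in a bright and in a dark pixel.
bright_result = judge_with_brightness(0.938, 200)  # P is about 0.74
dark_result = judge_with_brightness(0.938, 40)     # P is about 0.15
```

With these assumed numbers, an equally good color match is accepted when the pixel is bright and rejected when it is dark, which is the canceling behavior the embodiment describes.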
[0035]
In this case, the closer the second evaluation value P is to 1, the closer the pixel's color is to the color to be identified. In other words, an OK determination is obtained for a signal captured brightly and with good S/N (steps S6 and S7). A value obtained in a dark state where the color signal is buried in noise receives a lower evaluation, so ambiguous values are canceled (steps S6 and S8).
[0036]
Therefore, because only color signals with good S/N receive an OK determination, highly accurate identification is possible even in a dark image.
[0037]
As the luminance signal Y, the input Y value may be used as described above. When the input from the video camera 3 is an RGB signal, Y may be obtained by calculation. Further, the value of R1 + G1 + B1 may be used as the brightness information T.
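The text does not give the formula for computing Y from RGB. One plausible choice, shown here as an assumption, is the ITU-R BT.601 luma weighting, which is algebraically consistent with the YUV-to-RGB coefficients of Equation 1. The alternative brightness measure R1 + G1 + B1 is also shown; the function names are hypothetical.

```python
def luma_from_rgb(r, g, b):
    """Luminance Y from RGB using BT.601 weights (an assumption; the source
    only says that Y may be obtained by calculation)."""
    return 0.299 * r + 0.587 * g + 0.114 * b


def rgb_sum(r, g, b):
    """Alternative brightness information T = R1 + G1 + B1."""
    return r + g + b


# Pixel of Example 1.
y_example = luma_from_rgb(130, 143, 61)
t_example = rgb_sum(130, 143, 61)
```

If the sum is used in place of Y, its maximum is 765 for 8-bit channels, so a preset threshold would need to be scaled accordingly; the source does not say whether the sum is normalized.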
[0038]
The CPU 11 determines the flow of the entire control program based on the identification result obtained by this method, and performs operation control.
[0039]
First, it is determined from the color identification result for each pixel obtained by the above-described processing whether or not a desired object is included in the original image. An example of this determination process will be described.
[0040]
When the OK determination described above (the determination that a pixel has the color of the desired object) is obtained for four vertically and horizontally adjacent pixels, it is determined that the desired object is present in front of the camera.
[0041]
When such determinations are obtained independently in a plurality of pixel regions, the region with the larger number of vertically and horizontally contiguous pixels is determined to be the desired object.
[0042]
This determination process should be set as appropriate for the performance of the CPU 11 mounted in the image identification device and for the number of pixels of the image sensor of the camera 3. Whichever processing method is used, it suffices to identify the region regarded as the target object according to the distribution, on the image, of the target pixels extracted by the OK determination.
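One way to realize the region-based determination is a breadth-first search over vertically and horizontally connected OK pixels. This is a sketch under assumptions: "four adjacent pixels" is read here as a connected region of at least four pixels, and the names `connected_regions`, `find_target`, and `min_pixels` are hypothetical.

```python
from collections import deque


def connected_regions(mask):
    """Return a list of regions, each a list of (row, col) cells that are
    truthy in mask and connected vertically or horizontally."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    regions = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            region = []
            while queue:
                cr, cc = queue.popleft()
                region.append((cr, cc))
                for nr, nc in ((cr - 1, cc), (cr + 1, cc), (cr, cc - 1), (cr, cc + 1)):
                    if 0 <= nr < rows and 0 <= nc < cols and mask[nr][nc] and not seen[nr][nc]:
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            regions.append(region)
    return regions


def find_target(mask, min_pixels=4):
    """Return the largest region with at least min_pixels cells, or None."""
    candidates = [reg for reg in connected_regions(mask) if len(reg) >= min_pixels]
    return max(candidates, key=len) if candidates else None


# 1 marks a pixel that received an OK determination.
ok_mask = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0],
]
target_region = find_target(ok_mask)
```

In this mask the 2 × 2 block is selected and the isolated two-pixel region is ignored. On a low-powered CPU, `min_pixels` or the image resolution could be adjusted, in line with the remark that the process should be tuned to the hardware.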
[0043]
When the CPU 11 identifies a desired object in the original image as described above, it controls the operation of the robot so that the robot performs a pre-stored gesture corresponding to that object. For example, it makes the robot walk or turn its neck by controlling the motors 21 of the joints. It also instructs the sound generation unit 22 to emit a cry from the speaker 8. With this control, the robot can be made to chase a favorite object or retreat from a dangerous one.
[0044]
The technical idea of the present invention includes the following image identification processing method and image identification processing program.
[0045]
(1) An image identification method in which an original image is input to a computer and a desired target image is identified in the input original image on the basis of a preset set color, the method comprising: a step of storing in advance R0, G0, and B0, which are the RGB values of the set color; a step of obtaining the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0 from these RGB values; a step of obtaining, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel, obtaining the ratios K1 = H / C, L1 = I / D, M1 = J / E between the set RGB ratios and the original RGB ratios, setting K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less and K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values exceed 1, and obtaining K2 × L2 × M2, the product of the ratios K2, L2, and M2, as an evaluation value N; a step of extracting, as target pixels, the pixels whose RGB values R1, G1, and B1 give an evaluation value N equal to or greater than a predetermined threshold; and a step of identifying the target image according to the distribution of the target pixels.
[0046]
(2) An image identification method in which an original image is input to a computer and a desired target image is identified in the input original image on the basis of a preset set color, the method comprising: a step of storing in advance R0, G0, and B0, which are the RGB values of the set color; a step of obtaining the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0 from these RGB values; a step of obtaining, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel, while also obtaining brightness information T of the pixel, obtaining the ratios K1 = H / C, L1 = I / D, M1 = J / E between the set RGB ratios and the original RGB ratios, setting K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less and K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values exceed 1, obtaining K2 × L2 × M2, the product of the ratios K2, L2, and M2, as a product S, and using S × T, the product of the product S and the brightness information T, as an evaluation value U; a step of extracting, as target pixels, the pixels whose RGB values R1, G1, and B1 and brightness information T give an evaluation value U equal to or greater than a predetermined threshold; and a step of identifying the target image according to the distribution of the target pixels.
[0047]
(3) An image identification program that causes a computer to execute: a step of storing in advance, in a storage device, R0, G0, and B0, which are the RGB values of a set color; a step of obtaining the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0 from these RGB values; a step of obtaining, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel, obtaining the ratios K1 = H / C, L1 = I / D, M1 = J / E between the set RGB ratios and the original RGB ratios, setting K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less and K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values exceed 1, and obtaining K2 × L2 × M2, the product of the ratios K2, L2, and M2, as an evaluation value N; a step of extracting, as target pixels, the pixels whose RGB values R1, G1, and B1 give an evaluation value N equal to or greater than a predetermined threshold; and a step of identifying the target image according to the distribution of the target pixels.
[0048]
(4) An image identification program that causes a computer to execute: a step of storing in advance, in a storage device, R0, G0, and B0, which are the RGB values of a set color; a step of obtaining the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0 from these RGB values; a step of obtaining, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel, while also obtaining brightness information T of the pixel, obtaining the ratios K1 = H / C, L1 = I / D, M1 = J / E between the set RGB ratios and the original RGB ratios, setting K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less and K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values exceed 1, obtaining K2 × L2 × M2, the product of the ratios K2, L2, and M2, as a product S, and using S × T, the product of the product S and the brightness information T, as an evaluation value U; a step of extracting, as target pixels, the pixels whose RGB values R1, G1, and B1 and brightness information T give an evaluation value U equal to or greater than a predetermined threshold; and a step of identifying the target image according to the distribution of the target pixels.
[0049]
[Effects of the Invention]
As described above, according to the present invention, the threshold for identification is obtained on the basis of the ratios of the RGB signals, so images can be identified without being affected by luminance; as a result, image identification works well whether the image is bright or dark. Furthermore, because the ratios of the elements are multiplied together, a color difference is obtained as a large numerical difference, enabling highly accurate identification.
[0050]
Further, according to the present invention, it is possible to cancel evaluation based on ambiguous image information having a poor S / N in a dark part of an image, and to improve the reliability of an image identification result.
[Brief description of the drawings]
FIG. 1 is a perspective view of a robot equipped with an image identification device according to an embodiment of the present invention.
FIG. 2 is a functional block diagram of the image identification apparatus of the embodiment.
FIG. 3 is a flowchart of image identification processing according to the embodiment.
[Explanation of symbols]
1 Robot
3 Video camera
11 CPU
12 Image input port
13 Image data processing unit
14 Overall control unit
15 Operation control unit

Claims

An image identification device comprising an image input unit and an arithmetic processing unit,
the arithmetic processing unit identifying a desired target image in an original image input to the image input unit on the basis of a preset set color,
wherein the arithmetic processing unit has identification means that:
stores in advance R0, G0, and B0, which are the RGB values of the set color, and obtains the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0 from these RGB values;
obtains, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel;
obtains the ratios K1 = H / C, L1 = I / D, and M1 = J / E between the set RGB ratios and the original RGB ratios;
sets K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less;
sets K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values of the ratios K1, L1, and M1 exceed 1;
obtains K2 × L2 × M2, the product of the ratios K2, L2, and M2, and uses its value as an evaluation value N;
extracts, as target pixels, the pixels whose RGB values R1, G1, and B1 give an evaluation value N equal to or greater than a predetermined threshold; and
identifies the target image according to the distribution of the target pixels.

An image identification device comprising an image input unit and an arithmetic processing unit,
the arithmetic processing unit identifying a desired target image in an original image input to the image input unit on the basis of a preset set color,
wherein the arithmetic processing unit has identification means that:
stores in advance R0, G0, and B0, which are the RGB values of the set color, and obtains the set RGB ratios C = R0 / G0, D = G0 / B0, E = B0 / R0 from these RGB values;
obtains, for each pixel of the original image, the original RGB ratios H = R1 / G1, I = G1 / B1, J = B1 / R1 from R1, G1, and B1, which are the RGB values of the pixel, and also obtains brightness information T of the pixel;
obtains the ratios K1 = H / C, L1 = I / D, and M1 = J / E between the set RGB ratios and the original RGB ratios;
sets K2 = K1, L2 = L1, M2 = M1 when the respective values of the ratios K1, L1, and M1 are 1 or less;
sets K2 = 1 / K1, L2 = 1 / L1, M2 = 1 / M1 when the respective values of the ratios K1, L1, and M1 exceed 1;
obtains K2 × L2 × M2, the product of the ratios K2, L2, and M2, and uses its value as a product S;
uses S × T, the product of the product S and the brightness information T, as an evaluation value U;
extracts, as target pixels, the pixels whose RGB values R1, G1, and B1 and brightness information T give an evaluation value U equal to or greater than a predetermined threshold; and
identifies the target image according to the distribution of the target pixels.