JP2004066351A

JP2004066351A - Apparatus and method for controlling robot, and program for the same

Info

Publication number: JP2004066351A
Application number: JP2002224758A
Authority: JP
Inventors: Takatomo Nakajima; 中島　隆智; Naoki Takasaki; 高▲崎▼　直樹
Original assignee: Open Interface Inc
Current assignee: Open Interface Inc
Priority date: 2002-08-01
Filing date: 2002-08-01
Publication date: 2004-03-04
Anticipated expiration: 2022-08-01
Also published as: JP3702297B2

Abstract

PROBLEM TO BE SOLVED: To provide a robot control apparatus that can make a robot move in a way that suits the sensibility of the music being played.

SOLUTION: A spectrum analysis unit 1 analyzes the distribution of sound pressure across frequency bands based on sound data read from an MP3 data storage unit 4. A sensibility data calculation unit 12 calculates, from the analysis results, sensibility data consisting of emotion types, an emotion level for each emotion type, and the like. The calculated sensibility data is output from a sensibility data output unit 3 to a robot control unit 7, which controls the movement of the robot based on that data.

COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
この発明は、情報処理装置に係り、特に、ロボット制御装置及びロボット制御方法並びにそのプログラムに関するものである。
【０００２】
【従来の技術】
従来、展示場などに配置された人間やその他の動物の形をしたロボットに動作を加える制御を行う場合には、そのロボットの一定の動作をプログラムして制御していた。
【０００３】
【発明が解決しようとする課題】
しかしながら、上述のプログラムによってロボットを制御する方法においては、再生される音楽から感情を表現する感情パラメータを抽出し、その感情パラメータに応じてロボットを制御するようなロボットの制御方法がなかった。
そこでこの発明は、再生される音楽から感情を表現する感情パラメータを抽出し、その感情パラメータに応じてロボットを制御することで、再生される音楽の感性に合った動きでロボットの動作を制御することができるロボット制御装置及びロボット制御方法並びにそのプログラムを提供することを目的としている。
【０００４】
【課題を解決するための手段】
本発明は、上述の課題を解決すべくなされたもので、請求項１に記載の発明は、分割された周波数帯域毎の音圧値として音を表現したデジタルデータを記憶するデータ記憶部と、前記データ記憶部から読み出された前記デジタルデータを基に前記周波数帯域毎の音圧値の分布を解析し、この解析結果に基づいて感情種類と、前記感情種類毎の感情レベルなどからなる感性データを算出する感性データ算出部と、前記感性データ算出部によって算出された前記感性データを出力する感性データ出力部と、前記感性データ出力部から受信した前記感性データに基づいてロボットの動作の制御を行うロボット制御部とを備える事を特徴とするロボット制御装置である。
【０００５】
上述の構成によれば、データ記憶部から読み出された前記デジタルデータを基に前記周波数帯域毎の音圧値の分布を解析し、この解析結果に基づいて感情種類と、前記感情種類毎の感情レベルなどからなる感性データを算出する。そして、算出した前記感性データを出力し、その感性データに基づいて、ロボット制御部がロボットの動作を制御する。これにより、再生される音楽の感性に合った動きでロボットの動作を制御することができる。
【０００６】
また、請求項２に記載の発明は、分割された周波数帯域毎の音圧値として音を表現したデジタルデータを記憶するデータ記憶部を備えたロボット制御装置におけるロボット制御方法であって、前記データ記憶部から読み出された前記デジタルデータを基に前記周波数帯域毎の音圧値の分布を解析し、この解析結果に基づいて感情種類と、前記感情種類毎の感情レベルなどからなる感性データを算出する感性データ算出過程と、前記感性データ算出過程によって算出された前記感性データを出力する感性データ出力過程と、前記感性データ出力過程で出力した前記感性データに基づいて、ロボットの動作の制御を行うロボット制御過程とを備える事を特徴とするロボット制御方法である。
【０００７】
また、請求項３に記載の発明は、分割された周波数帯域毎の音圧値として音を表現したデジタルデータを記憶するデータ記憶部を備えたロボット制御装置のコンピュータにおけるロボット制御プログラムであって、前記コンピュータに、前記データ記憶部から読み出された前記デジタルデータを基に前記周波数帯域毎の音圧値の分布を解析し、この解析結果に基づいて感情種類と、前記感情種類毎の感情レベルなどからなる感性データを算出する感性データ算出処理と、前記感性データ算出処理によって算出された前記感性データを出力する感性データ出力処理と、前記感性データ出力処理で出力した前記感性データに基づいて、ロボットの動作の制御を行うロボット制御処理とを実行させるためのプログラムである。
【０００８】
【発明の実施の形態】
以下、本発明の一実施形態によるロボット制御装置を図面を参照して説明する。
図１は、同実施形態による感性データ算出方法を応用したロボット制御装置の構成を示すブロック図である。このロボット制御装置は、ＭＰ３（ＭＰＥＧ１　Ａｕｄｉｏ　Ｌａｙｅｒ３）の形式で記録された音楽（聴覚データ）を再生するとともに、この音楽を基に感性データを算出するものである。
【０００９】
図１において、符号４は、ＭＰ３形式の音楽データを記憶するＭＰ３データ記憶部である。１は、ＭＰ３データ記憶部４から読み出された音楽データを基に感性データを算出して出力するスペクトラム解析部である。２は、スペクトラム解析部１によって算出された感性データを蓄積する感性データ蓄積部である。３は、感性データ蓄積部２に蓄積された感性データを順次読み出し、ロボット制御部へ出力する感性データ出力部である。
【００１０】
また、５は、ＭＰ３データ記憶部４から読み出された音楽データをデコードして時系列的な音圧レベルのデジタル信号（デジタルオーディオ信号）を出力するＭＰ３デコーダ部である。６は、ＭＰ３デコーダ部５から出力されたデジタル信号を変換してアナログオーディオ信号を出力するＤ／Ａ（デジタル／アナログ）変換部である。７は感性データ出力部３が出力した制御信号に基づいたロボットの制御を行うロボット制御部である。
【００１１】
ＭＰ３データ記憶部４に記憶されている音楽データは、所定のサンプリング周波数（例えば、４４１００Ｈｚ（ヘルツ））でサンプリングされ量子化された音圧レベルが、所定のフレーム長（例えば、約０．０２６１秒）を１フレームとするフレーム単位で、所定数（例えば、５７６本）に分割された周波数帯域ごとの音圧値に変換され、さらに圧縮されたものである。
【００１２】
スペクトラム解析部１は、このような音楽データを時系列的に順次読み出し、読み出したデータをまず伸長してから、後述する所定の手順により解析を行い、その結果を感性データとして順次出力していく。スペクトラム解析部１によって出力される感性データも時系列的なデータであり、順次、感性データ蓄積部２に蓄積されていく。
【００１３】
感性データ出力部３は、感性データ蓄積部２に蓄積した感性データをロボット制御部７に出力するものである。そして、ロボット制御部７は感性データ出力部３が感性データ蓄積部２から読み取って送信した感性データに基づいて、ロボットの腕や口などの動きを制御する。
【００１４】
そして、再生される音楽の進行と感性データ出力部３が出力する感性データのタイミングが合うように、スペクトラム解析部１と感性データ出力部３とＭＰ３デコーダ５との間で互いに同期を取るようにする。
また、スペクトラム解析部１による感性データ算出の演算に時間がかかっても音楽と感性データ出力部３が出力する感性データとのタイミングがずれないように、音楽の再生が指示された後にスペクトラム解析部１による感性データ算出を先行して行い、数秒から数十秒程度遅れて後追いの形で、ＭＰ３デコータ５による音楽の再生と感性データ出力部３からの制御信号の出力とを行うようする。但し、スペクトラム解析部１による感性データ算出の演算が充分に速く行える場合には、上記の遅延を設けずにリアルタイムで再生するようにしても良い。
【００１５】
ＭＰ３データ記憶部４は、磁気ディスクやＣＤ−ＲＯＭ（コンパクトディスクを用いた読み出し専用メモリ）あるいはＣＤ−Ｒ（ＣＤ　Ｒｅｃｏｒｄａｂｌｅ　）やＤＶＤ（Ｄｉｇｉｔａｌ　Ｖｅｒｓａｔｉｌｅ　Ｄｉｓｋ）や光磁気ディスクや半導体メモリなどといった記録媒体とそれを読み取る読取装置によって実現する。
【００１６】
スペクトラム解析部１は、コンピュータを用いて実現する。つまり、後述する感性データ算出等の処理の手順をコンピュータプログラムの形式でコンピュータ読み取り可能な記録媒体に記録しておき、このプログラムをコンピュータの中央処理装置が読み取ってその命令を実行することにより感性データ算出等の機能を実現する。ここで、コンピュータ読み取り可能な記録媒体とは、例えば、磁気ディスクや、ＣＤ−ＲＯＭあるいはＣＤ−Ｒや、半導体メモリなどである。あるいは、専用ロジックを搭載した集積回路としてスペクトラム解析部１を実現するようにしても良い。あるいは、コンピュータプログラムをコンピュータ読み取り可能な記録媒体に記録しておく代わりに、通信を用いて配信するようにして、配信を受けたコンピュータの中央処理装置がこのプログラムを実行するようにしても良い。
感性データ蓄積部２は、半導体メモリや磁気ディスクなど、高速に書換え可能な記録媒体を用いて実現する。
【００１７】
次に、スペクトラム解析部１の内部生成、およびその処理の詳細について説明する。図１に示すように、スペクトラム解析部１は、伸長部１１と感性データ算出部１２とを備えている。伸長部１１は、ＭＰ３データ記憶部から読み取られた音楽データを伸長する。つまり、図１のＡの部分では、圧縮された状態のデータが伝達される。また、図１のＢの部分では、前述の周波数帯域（音域）ごとの音圧値の情報が伸長済みの状態で伝達される。感性データ算出部１２は、さらに、感情解析部１３とリズム解析部１４とを備えている。
【００１８】
次に、感情解析部１３とリズム解析部１４の詳細な処理手順について説明する。
【００１９】
図２は、感情解析部１３による処理の手順を示すフローチャートである。図２に示すように、感情解析部１３は、まずステップＳ１において入力されるデータを基に５つの音域への分割の処理を行い、ステップＳ２においてこれら５つの音域の音圧値を基に感情パラメータを算出する処理を行い、ステップＳ３において算出された感情パラメータを基に判定を行う。判定結果として、インデックス、感情種類、感情レベル、継続時間、補間フラグを１組とした時系列データが出力される。
【００２０】
上記インデックスは、０から始まるシーケンシャルな値である。
上記感情種類は、「無表情（ｄｅｆａｕｌｔ　）」、「快感（ｐｌｅａｓｕｒｅ）」、「驚き（ｓｕｒｐｒｉｓｅ）」、「怯え（ｆｅａｒ）」、「嬉しい（ｈａｐｐｙ　）」、「哀しい（ｓａｄ　）」のいずれかである。
感情種類が「快感」、「驚き」、「怯え」、「嬉しい」、「哀しい」のいずれかであるとき、上記感情レベルは１以上５以下のいずれかの整数の値を取る。また、感情種類が「無表情」のとき、感情レベルの値は「なし」である。
上記継続時間は、秒単位の数値であり、１以上の値を取る。
上記補間フラグは、０（「ＯＦＦ」を表わす）あるいは１（「ＯＮ」を表わす）のいずれかの値を取る。
【００２１】
時系列の音楽データを処理する際の初期値は、インデックス＝０、感情種類＝「無表情」、感情レベル＝「なし」、継続時間＝「１」、補間フラグ＝「１」とする。
【００２２】
以下に、処理をさらに詳細に説明する。
図２の符号Ｄ１は、感情解析部１３に入力される周波数帯域ごとの音圧値情報である。この段階では、５７６本の周波数帯域それぞれの音圧値情報が保持されている。また、元のＭＰ３データのサンプリング周波数は４４１００Ｈｚである。つまり、分割された周波数帯域ごとの音圧値として音を表現したデジタルデータを入力として、周波数帯域ごとの音圧値の分布を以下の方法で解析することにより、前記の音に関連する感性データを算出する。
【００２３】
ステップＳ１においては、音圧値情報（Ｄ１）を基に、次の５段階の音域ごとの平均音圧値を算出し、音圧値情報（Ｄ２）として出力する。その５段階の音域とは、低音部（０Ｈｚ〜７６．５６２５Ｈｚ）、中低音部（２２９．６８７５Ｈｚ〜１９９０．６２５Ｈｚ）、中高音部（７００５．４６９Ｈｚ〜１００２９．６９Ｈｚ）、高音部（１００２９．６９Ｈｚ〜１４９６７．９７Ｈｚ）、最高音部（１５００６．２５Ｈｚ〜１７９９２．１９Ｈｚ）の５つである。
つまり、ここでは、周波数帯域全体を、５個の周波数帯域グループに分割し、この周波数帯域グループごとの音圧値を用いた解析を行う。
【００２４】
また、ステップＳ１においては、音階分割により、長音要素と短音要素の抽出を行う。この抽出のために、まず、０Ｈｚ〜４９７．６５６３Ｈｚの帯域を１３の領域に均等分割し、４９７．６５６３Ｈｚ〜２２０５０Ｈｚの帯域を６３の領域に音階分割する。そして、そのうちの４９７．６５６３Ｈｚ〜２０２８．９０６Ｈｚの２オクターブ分の２４個の音階領域の音圧値が所定の閾値より大きいかどうかを判断する。
【００２５】
上記２４個の音階領域のうち、１番目、３番目、５番目、８番目、１０番目、１２番目、１３番目、１５番目、１７番目、２０番目、２２番目、２４番目の領域が長音要素である。これらの長音要素のうち、１番目と１３番目とは１オクターブ離れた領域であるため、この２つの領域の音圧値が共に閾値より大きければ、長音要素を＋１としてカウントする。また同様に、３番目と１５番目の領域、５番目と１７番目の領域、８番目と２０番目の領域、１０番目と２２番目の領域、１２番目と２４番目の領域がそれぞれ互いに１オクターブ離れた領域であり、２つの領域の音圧値が共に閾値より大きい場合に、それぞれ長音要素を＋１としてカウントする。
また、上記２４個の音階領域のうち、２番目と１４番目、４番目と１６番目、６番目と１８番目、７番目と１９番目、９番目と２１番目、１１番目と２３番目がそれぞれ互いに１オクターブ離れた領域のペアであり、ペアごとに、２つの領域の音圧値が共に閾値より大きい場合に、それぞれ短音要素を＋１としてカウントする。
この抽出の処理の結果、長音要素および短音要素は、それぞれ０以上６以下のいずれかの整数の値を取る。
【００２６】
次に、ステップＳ２では、音圧値情報Ｄ２を基に感情パラメータを算出する処理を行う。感情パラメータには優先順位が設定されており、「快感」の優先度が１、「驚き」の優先度が２、「怯え」の優先度が３、「嬉しい」および「哀しい」の優先度がともに４となっている。
なお、上記５種類の感情パラメータ値がすべて「０」のときは、「無表情」に該当する。
【００２７】
また、ステップＳ３では、算出された感情パラメータに基づく判定を行い、感性データを求める処理を行う。また、この判定においては、図１に示したリズム解析部１４によるリズム解析の結果も一部で用いられる。リズム解析の結果とは、例えば、ビート間の時間間隔がどの程度の長さかといったことである。
なお、感情パラメータ値算出の際には、音圧値がＬ１以下の音を無視する。
【００２８】
「快感（Ｐｌｅａｓｕｒｅ）」に関する処理は、次の通りである。
［条件１］　ビート間の時間間隔がＴ３以上で、かつ、中低音部から高音部までのいずれかの音圧のピークが高音方向に時間的にＴ４以上移動した場合は、「快感」の感情パラメータのカウントを＋１する。この条件に合致するとき、当該感情は、対象の音が鳴り始めてから時間Ｔ４経過時点から、対象の音が鳴りやんでから時間Ｔ２経過時点まで継続するものとする。つまり、本実施形態においては、この継続時間の間は、「快感」データに基づく制御信号が出力される。
［条件２］　低音域の音圧値がＬ７以上で、かつ、高音部の平均音圧値がＬ４以上である場合で、平均音圧値がＬ６以上の時、前回までのビート間の平均時間間隔から今回のビート間の時間間隔を差し引いた値がＴ１以上である、または、前回の判定結果が「驚き」の場合は「快感」の感情パラメータのカウントを＋２する。この条件に合致するとき、当該感情は、対象の音が鳴り始めてから時間Ｔ４が経過した時点から始まるものとする。
【００２９】
つまり、上記条件２が適用される場合には、分割された周波数帯域グループごとの平均音圧値に基づいて感性データが算出される。
また、上記条件１が適用される場合には、周波数帯域グループ内において、音圧値のピークとなる周波数帯域が時間的にどのように推移するかに基づいて感性データが算出される。
また、上記条件１が適用される場合には、元のデジタルデータに基づき音に含まれるリズムの単位時間あたりの拍数が求められ、この単位時間あたり拍数に基づいて感性データが算出される。上記の「ビート間の時間間隔」は単位時間あたり拍数の逆数から求められる。
なお、「快感」の感情の優先順位は最も高い「１」であるため、上記の条件１あるいは条件２のいずれかにあてはまる場合は、他の感情を無視する。
【００３０】
「驚き（Ｓｕｒｐｒｉｓｅ）」に関する処理は、次の通りである。
上述した「快感」の条件に該当しない場合は、下記の条件により「驚き」に該当するかどうかをチェックする。
【００３１】
［条件１］　全音域の平均音圧値がＬ３以下の音が無い状態から、低音部のピークの音圧値がＬ７以上の音を最初に取得した場合は、「驚き」の感情パラメータのカウントを＋４し、その音が鳴りつづけた時間を継続時間とする。ただし、下記の条件２を満たす場合は無視をする。
［条件２］　全音域の平均音圧値がＬ２以下の音が無い状態から、低音部のピークの音圧値がＬ７以上の音を最初に取得した場合は、「驚き」の感情パラメータのカウントを＋５し、その音が鳴りつづけた時間を継続時間とする。
【００３２】
［条件３］　全音域の平均音圧値がＬ３以下の音が無い状態から、低音部以外のピークの音圧値がＬ７以上の音を最初に取得した場合は、「驚き」の感情パラメータのカウントを＋１し、その音が鳴りつづけた時間を継続時間とする。ただし、下記の条件４を満たす場合は無視をする。
［条件４］　全音域の平均音圧値がＬ２以下の音が無い状態から、低音部以外のピークの音圧値がＬ７以上の音を最初に取得した場合は、「驚き」の感情パラメータのカウントを＋２し、その音が鳴りつづけた時間を継続時間とする。
［条件５］　最高音部の音が時間Ｔ４以上続いた場合、または最高音部の音が存在し、かつ中高音部の平均音圧値がＬ４以下の場合は、「驚き」の感情パラメータのカウントを＋３し、その音が鳴りつづけた時間を継続時間とする。
なお、「驚き」の感情の優先順位は「快感」のそれに次ぐ「２」であるため、上記の条件１から５までのいずれかにあてはまる場合は、他の優先順位の低い感情を無視する。
【００３３】
「怯え（Ｆｅａｒ）」に関する処理は、次の通りである。
上述した「快感」あるいは「驚き」のいずれの条件にも該当しない場合は、下記の条件により「怯え」に該当するかどうかをチェックする。
【００３４】
［条件１］　中低音部から高音部までのいずれかの音圧値のピークが低音方向に時間的にＴ４以上移動した場合は、「怯え」の感情パラメータのカウントを＋１する。
［条件２］　中低音部から高音部までのいずれかの音圧値のピークが低音方向に時間的にＴ４以上移動し、続けて高音方向に時間的にＴ４以上移動した場合は、「怯え」の感情パラメータのカウントを＋４する。
［条件３］　中低音部から高音部までのいずれかの音圧値のピークが低音方向に移動中に高音方向に揺れた回数Ｎが４２以上の場合、「怯え」の感情パラメータのカウントを＋（Ｎ／１６）する。
【００３５】
なお、「怯え」データに基づく制御信号の出力の始点は対象の音が鳴り始めてから時間Ｔ４経過後とし、同じく制御信号の終点は対象の音が鳴りやんでから時間Ｔ２経過後とする。
なお、「怯え」の感情の優先順位は「驚き」のそれに次ぐ「３」であるため、上記の条件１から３までのいずれかにあてはまる場合は、他の優先順位の低い感情を無視する。
【００３６】
上述した「快感」、「驚き」、「怯え」のいずれの条件にも該当しない場合は、下記の条件により「嬉しい」または「哀しい」に該当するかどうかをチェックする。
【００３７】
「嬉しい（Ｈａｐｐｙ）」に関する処理は、次の通りである。
［条件１］　ビートがある場合は、「嬉しい」の感情パラメータのカウントを＋１する。
［条件２］　ビート間の時間間隔がＴ７以下の場合は、「嬉しい」の感情パラメータのカウントを＋１する。
［条件３］　高音部の平均音圧値がＬ４以上の場合は、「嬉しい」の感情パラメータのカウントを＋１する。
［条件４］　上記の条件３を満たし、かつ、中低音部の音圧値のピークが５つ以上あった場合は、「嬉しい」の感情パラメータのカウントを＋２する。
［条件５］　上記の条件３を満たし、かつ、上記の条件４をみたし、かつ、低音部の平均音圧値がＬ５以下の場合は、「嬉しい」の感情パラメータのカウントを＋２をする。
［条件６］　抽出された長調要素−短調要素の数値が２以上の場合は、「嬉しい」の感情パラメータのカウントを＋１する。
【００３８】
なお、「嬉しい」データに基づく制御信号出力の始点の時間的な誤差は±Ｔ２とする。また、同じく制御信号出力の終点の時間的な誤差も±Ｔ２とする。
【００３９】
「哀しい（Ｓａｄ）」に関する処理は、次の通りである。
［条件１］　ビート間の時間間隔がＴ５以上である場合は、「哀しい」の感情パラメータのカウントを＋１する。
［条件２］　ビートがない場合は、「哀しい」の感情パラメータのカウントを＋２する。
［条件３］　中低音部に時間Ｔ４以上続く音圧値のピークがあった場合は、「哀しい」の感情パラメータを＋１し、音が鳴り続けている時間を継続時間とする。ただし、下記の条件４を満たす場合は無視をする。
［条件４］　中低音部に時間Ｔ６以上続く音圧値のピークがあった場合は、「哀しい」の感情パラメータを＋２し、音が鳴り続けている時間を継続時間とする。
【００４０】
［条件５］　高音部に音圧値のピークが３つ以上あった場合は、「哀しい」の感情パラメータを＋１する。
［条件６］　全領域の平均音圧値がＬ３以上の音が無い状態の場合は、「哀しい」の感情パラメータを＋１する。
［条件７］　全領域の平均音圧値がＬ３以上の音が時間Ｔ２以上無い場合は、「哀しい」の感情パラメータを＋１する。
［条件８］　中高音部と高音部の平均音圧値がＬ３以下であり、中低音部の音のみを取得した場合は、「哀しい」の感情パラメータを＋２する。
［条件９］　短調要素−長調要素の数値が２以上の場合は、「哀しい」の感情パラメータを＋１する。
【００４１】
なお、「哀しい」データに基づく制御信号出力の始点の時間的な誤差は±Ｔ２とする。また、同じく制御信号出力の終点の時間的な誤差も±Ｔ２とする。
【００４２】
以上述べたように、「快感」、「驚き」、「怯え」、「嬉しい」、「哀しい」の感情について、それぞれ定義された条件でのチェックが行われる。
そして、優先順位の高い感情から順に、「快感」、「驚き」、「怯え」のいずれかのカウント結果が１以上である場合に、その感情が感情種類として判定される。また、そのときのカウント値が感情レベルとされるので、感情レベルはレベル１〜レベル５（Ｌｖ（レベル）＝１〜５）となる。但し、カウントが５を超える場合は、感情レベルを５とする。
【００４３】
なお、感情種類が「怯え」で、かつ同一の感情レベルである状態が時間Ｔ５以上継続した場合には、時間Ｔ５ごとに再チェックを行う。
また、感情種類が「快感」のまま、感情レベルが２から１へ移行した場合は、以後の感情レベルも２とみなし、感情レベル２を継続させるものとする。
【００４４】
「快感」、「驚き」、「怯え」のカウント値がいずれも０である場合で、「嬉しい」あるいは「哀しい」のカウント値の少なくとも一方が１以上である場合には、次に述べる方法で「嬉しい」および「哀しい」のカウント値を比較する。まず、前回の「嬉しい」のカウント値と現在の「嬉しい」のカウント値とから、これらの平均値を求める。次に、前回の「哀しい」のカウント値と現在の「哀しい」のカウント値とから、これらの平均値を求める。そして、「嬉しい」の平均値と「哀しい」の平均値とを比較する。
【００４５】
上記の「嬉しい」の平均カウント値のほうが大きい場合には、感情種類を「嬉しい」とするとともに、「嬉しい」の平均カウント値から「哀しい」の平均カウント値を引いた値を感情レベルとする。逆に、「哀しい」の平均カウント値のほうが大きい場合には、感情種類を「哀しい」とするとともに、「哀しい」の平均カウント値から「嬉しい」の平均カウント値を引いた値を感情レベルとする。
「嬉しい」の平均カウント値と「哀しい」の平均カウント値とが等しい場合には、前回のカウント値同士を比較し、大きい方のカウント値を持つほうを感情種類として選択するとともに、この場合の感情レベルを１とする。
【００４６】
但し、「嬉しい」と「哀しい」のカウント値を用いた判定に関して、上記の規則に関わらず、次の２つの例外パターンに該当する場合には、これを適用するものとする。
第１の例外パターンは、「嬉しい」のカウント値が５で、かつ、「哀しい」のカウント値が５である場合であり、このときは、感情種類を「快感」とし、感情レベルを２とする。
第２の例外パターンは、「怯え」のカウント値が３以上で、かつ、「哀しい」のカウント値が４以上の場合であり、このときは、感情種類を「哀しい」とし、感情レベルを５とする。
【００４７】
なお、上記５種類のいずれの感情についても、カウント値の結果がすべて０である場合には、感情種類は「無表情」であると判定される。
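The determination rules in paragraphs [0042] to [0047] can be sketched in Python as follows. This is only a minimal sketch under stated assumptions, not the patented implementation: the function name `decide`, the dictionary keys, and the use of plain integer counts are hypothetical. The source also leaves several points open, so the sketch assumes that (a) the second exception pattern is checked before the priority rule, (b) the happy/sad level is rounded up and capped at 5, and (c) a tie in the previous counts resolves to "happy".

```python
import math

# Emotions that win outright, in priority order (1 = highest).
PRIORITY = ("pleasure", "surprise", "fear")


def decide(current, previous):
    """Return (emotion_type, emotion_level) from the current and previous counts.

    `current` and `previous` map each of the five emotion names to its count.
    """
    # Exception pattern 2 (assumption: checked before the priority rule).
    if current["fear"] >= 3 and current["sad"] >= 4:
        return "sad", 5

    # Highest-priority emotion with a count of 1 or more wins; level capped at 5.
    for kind in PRIORITY:
        if current[kind] >= 1:
            return kind, min(current[kind], 5)

    happy, sad = current["happy"], current["sad"]
    if happy == 0 and sad == 0:
        return "default", None  # expressionless, level "none"

    # Exception pattern 1.
    if happy == 5 and sad == 5:
        return "pleasure", 2

    # Compare the averages of the previous and current counts.
    happy_avg = (previous["happy"] + happy) / 2
    sad_avg = (previous["sad"] + sad) / 2
    if happy_avg != sad_avg:
        kind = "happy" if happy_avg > sad_avg else "sad"
        # Assumption: round the difference up and cap it at 5.
        return kind, min(5, math.ceil(abs(happy_avg - sad_avg)))

    # Equal averages: compare the previous counts; the level is fixed at 1.
    # Assumption: an exact tie falls back to "happy".
    kind = "happy" if previous["happy"] >= previous["sad"] else "sad"
    return kind, 1
```

For example, with all previous counts at zero, `decide({"pleasure": 0, "surprise": 4, "fear": 0, "happy": 3, "sad": 0}, previous)` selects "surprise" at level 4, because "surprise" outranks "happy".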
【００４８】
次に、補間フラグに関する判定方法を説明する。補間フラグのデフォルト値は１（ＯＮ）であるが、次の２通りのいずれかに該当する場合に限り、補間フラグを０（ＯＦＦ）とする。第１に、同じ感情種類が時間Ｔ６以上継続した場合には補間フラグを０とする。第２に、前回の感情種類が「嬉しい」または「哀しい」であり、そこから感情種類「快感」に遷移する場合には補間フラグを０とする。
【００４９】
上述した感情パラメータの算出および感情の判定等の処理において、時間Ｔ１〜Ｔ６については、Ｔ１＜Ｔ２＜Ｔ３＜Ｔ４＜Ｔ５＜Ｔ６の関係を満たす適切な値を用いることとする。なお、Ｔ１はほぼ数百ミリ秒程度、Ｔ６はほぼ数千ミリ秒程度である。また、音圧値レベルＬ１〜Ｌ７については、Ｌ１＜Ｌ２＜Ｌ３＜Ｌ４＜Ｌ５＜Ｌ６＜Ｌ７の関係を満たす適切な値を用いることとする。一例としては、Ｌ１は−５０ｄＢ（デシベル）程度、Ｌ７は−２０ｄＢ程度の値を用いる。
【００５０】
次に、図１に示したリズム解析部１４における処理について説明する。
リズム解析部１４には、伸長部によって伸長されたデータが入力される。この入力データは、前述のように、周波数領域ごとの音圧値情報を時系列的に持つものである。このような入力データを基に、リズム解析部１４は音楽のリズムを解析し、その音楽のｂｐｍ値（ｂｅａｔｓ　ｐｅｒ　ｍｉｎｕｔｅ，１分あたりビート数，単位時間あたり拍数）を算出して出力する。
【００５１】
リズム解析の処理においては、次の事項を前提とする。第１に、少なくとも一定時間以上は曲のリズムは一定のｂｐｍ値で正確に刻まれることとする。第２に、１拍あたり２回、ノイズ系の音が含まれることとする。例えば、曲が４分の４拍子である場合には、４拍の間に８回ノイズ系の音が含まれる。ここで、ノイズ系の音とは、例えばシンバル等の音である。
ノイズ系の音は、ほぼ全周波数帯域に渡って音圧変化があることが特徴である。従って、周波数帯域ごとにフレーム間の音圧変化量を求め、全周波数にわたって連続的に音圧変化量が所定の閾値以上となる場合にこれをノイズ系の音として検出できる。
【００５２】
そして、ノイズ系の音はリズムに応じて所定のタイミングの箇所に多く集中するという傾向があることから、このノイズ系の音を検出し、この検出間隔をフレーム（１フレームは約０．０２６１秒）単位で求める。この段階では、検出される間隔は、一定ではなく、フレーム数ごとの度数の分布として得られる。得られた分布を基に、補正を加えて、拍の間隔を決定することによってｂｐｍ値を求めることとする。
つまり、前記第２の前提によると１拍あたり２回のノイズ系の音が含まれるため、求められたノイズ間隔Ｆ（フレーム単位）を用いると、ｂｐｍ値は、次の式で得られる。すなわち、
ｂｐｍ値＝６０　［秒／分］　／　（２＊Ｆ　［フレーム］　＊０．０２６１　［秒／フレーム］）
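The bpm formula above can be checked with a short Python sketch. The function names are hypothetical; the frame length of 0.0261 seconds and the factor of 2 (two noise-type sounds per beat) come from the description.

```python
FRAME_SECONDS = 0.0261  # approximate frame length stated in the description


def bpm_from_noise_interval(interval_frames: float) -> float:
    """bpm = 60 [s/min] / (2 * F [frames] * 0.0261 [s/frame])."""
    if interval_frames <= 0:
        raise ValueError("noise interval must be positive")
    return 60.0 / (2 * interval_frames * FRAME_SECONDS)


def beat_interval_seconds(bpm: float) -> float:
    """Time between beats, i.e. the reciprocal of beats per unit time."""
    return 60.0 / bpm
```

For a noise interval of F = 10 frames this gives roughly 115 bpm, which corresponds to a beat interval of about 0.52 seconds.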
【００５３】
図３は、上述したロボット制御装置におけるデータの流れを示す概略図である。図示するように、音声データ２１を基に、これを各周波数帯域に分解する処理（３１）を行うことによって、分解された音声のデータ２２が得られる。そしてこのデータを基に、感性データを算出する処理（３２）を行うことによって感性データ２３が得られる。そして、この感性データ２３をロボット制御部７に出力（３３）すると、感性データ２３に基づいて、ロボット制御部７がロボットを動作（３４）させる。例えば、ロボット制御部７は、ロボットの腕や口などの動きを制御する。
【００５４】
次に、本実施形態のロボット制御装置が人間の形をしたロボットの制御に適用される例について説明する。
図４はロボット制御装置が制御するロボットの概略を示す図である。この図において、４１及び４２はそれぞれロボットの右目まぶた及び左目まぶたであり、ロボット制御装置のロボット制御部７が右目まぶた４１、左目まぶた４２を上下させて目の大きさを制御する。尚、通常は両目のまぶたは半分だけ開いている。
また４３及び４４はそれぞれ右眼球及び左眼球であり、ロボット制御部７が右眼球４３、左眼球４４を左右に動かすことで眼球の向きを制御する。
また、４５は下唇であり、ロボット制御部７が下唇を上下に動かす事により、口の開閉を制御する。
また、４６及び４７はそれぞれ右肩関節及び左肩関節であり、ロボット制御部７は右肩関節４６、左肩関節４７を中心とした回転運動をさせることによって、それぞれ右腕、左腕を前後方向に振る。また右肩関節４６、左肩関節４７は胴体の左右の上部に位置しているが、その位置から下方に動くように設計されている。これにより、ロボット制御部７が肩の位置を下方に下げることで肩を落とす制御を行う。また右肩関節４６は右腕を右方向へ、左肩関節４７は左腕を左方向へ動かし、頭部の脇まで腕を持ち上げることができる。
また、４８は首部であり、ロボット制御部７が首部４８を回転させることができる。また、４９は腰部であり、ロボット制御部７は腰部４９を制御してロボットの上半身を前に倒す制御を行う。
尚、図１のロボット制御装置は、図４のロボットの内部に全て格納されても良いし、また、ロボット制御部７のみを図４のロボット内部に格納し、図１のロボット制御部７以外の各処理部を図４のロボットの外部に設置して、感性データ出力部３とロボット制御部７との感性データの送受信を、無線送受信装置を用いて行うようにしても良い。
【００５５】
図５は感性データに基づくロボット制御部７の処理を示す表である。
図５より、ロボット制御部７が感性データ出力部３から受信した感性データが「嬉しい」の「Ｌｖ＝１」であった場合、ロボット制御部７は、首部４８を左右に振る制御を行う。
また、感性データが「嬉しい」の「Ｌｖ＝２」であった場合、ロボット制御部７は下唇４５少し下げる制御をおこなうことでロボットの口を小さく開ける。また、首部４８を左右に振る制御を行う。
また、感性データが「嬉しい」の「Ｌｖ＝３」であった場合、ロボット制御部７は下唇４５少し下げる制御をおこなうことでロボットの口を小さく開ける。また、右肩関節４６、左肩関節４７を制御して右腕、左腕を肩の位置まで持ち上げ、両手を横に広げる。
また、感性データが「嬉しい」の「Ｌｖ＝４」であった場合、ロボット制御部７は下唇４５を大きく下げる制御を行うことでロボットの口を大きく開ける。また、首部４８を回転させる制御を行う。
また、感性データが「嬉しい」の「Ｌｖ＝５」であった場合、ロボット制御部７は下唇４５を大きく下げる制御を行うことでロボットの口を大きく開ける。また、右肩関節４６、左肩関節４７を制御して右腕、左腕を頭部の脇まで持ち上げ、首部４８を回転させる制御を行う。
【００５６】
また、ロボット制御部７が感性データ出力部３から受信した感性データが「哀しい」の「Ｌｖ＝１」であった場合、ロボット制御部７は、腰部４９を制御して胴体を前に少し倒し、ロボットが下を向くようにする。
また、感性データが「哀しい」の「Ｌｖ＝２」であった場合、ロボット制御部７は右まぶた４１、左まぶた４２を下げて、ロボットの目を細める制御を行い、また、ロボット制御部７は、腰部４９を制御して胴体を前に少し倒し、ロボットが下を向くようにする。
また、感性データが「哀しい」の「Ｌｖ＝３」であった場合、ロボット制御部７は右まぶた４１、左まぶた４２を少し下げてロボットが目を細める制御を行い、また、ロボット制御部７は、腰部４９を制御して胴体を前に少し倒し、ロボットが下を向くようにする。さらにロボット制御部７は、右肩関節４６、左肩関節４７の位置を下方に動かして、ロボットが肩を落とす制御を行う。
また、感性データが「哀しい」の「Ｌｖ＝４」であった場合、ロボット制御部７は右まぶた４１、左まぶた４２をいっぱいに下げて、ロボットが目を閉じる制御を行う。そして、ロボット制御部７は腰部４９を制御して胴体を前に少し倒し、ロボットが下を向くようにする。また、ロボット制御部７は首部４８を左右に振って、さらに、右肩関節４６、左肩関節４７の位置を下方に動かして、ロボットが肩を落とす制御を行う。
また、感性データが「哀しい」の「Ｌｖ＝５」であった場合、ロボット制御部７は右まぶた４１、左まぶた４２をいっぱいに下げて、ロボットが目を閉じる制御を行い、また、腰部４９を制御して胴体を前に少し倒し、ロボットが下を向ようにする。さらに、ロボット制御部７は首部４８を回転させ、そして、右肩関節４６、左肩関節４７の位置を下方に動かして、ロボットが肩を落とす制御を行う。
【００５７】
また、ロボット制御部７が感性データ出力部３から受信した感性データが「驚き」の「Ｌｖ＝１」であった場合、ロボット制御部７は、下唇４５を制御して口を小さく開けっ放しにする。
また、感性データが「驚き」の「Ｌｖ＝２」であった場合、ロボット制御部７は、下唇４５を制御して口を大きく開けっ放しにする。
また、感性データが「驚き」の「Ｌｖ＝３」であった場合、ロボット制御部７は、右まぶた４１、左まぶた４２を限界まで開き、口を小さく開けっ放しにする制御を行う。
また、感性データが「驚き」の「Ｌｖ＝４」であった場合、ロボット制御部７は、右まぶた４１、左まぶた４２を限界まで開き、口を大きく開けっ放しにする制御を行う。
また、感性データが「驚き」の「Ｌｖ＝５」であった場合、ロボット制御部７は、右まぶた４１、左まぶた４２を限界まで開き、口を大きく開けっ放しにする制御を行う。さらに、右肩関節４６、左肩関節４７を制御して右腕、左腕を頭部の脇まで持ち上げる。
【００５８】
また、ロボット制御部７が感性データ出力部３から受信した感性データが「怯え」の「Ｌｖ＝１」であった場合、ロボット制御部７は、下唇４５を小さく開閉する制御を行い、ロボットの口をパクパクさせる。
また、感性データが「怯え」の「Ｌｖ＝２」であった場合、ロボット制御部７は、下唇４５を大きく開閉する制御を行い、ロボットの口をパクパクさせる。
また、感性データが「怯え」の「Ｌｖ＝３」であった場合、ロボット制御部７は、下唇４５を小さく開閉する制御を行い、ロボットの口をパクパクさせる。また、ロボット制御部７は首部４８を左右に振る制御を行う。
また、感性データが「怯え」の「Ｌｖ＝４」であった場合、ロボット制御部７は、下唇４５を大きく開閉する制御を行い、ロボットの口をパクパクさせる。また、ロボット制御部７は首部４８を左右に振る制御を行う。
また、感性データが「怯え」の「Ｌｖ＝５」であった場合、ロボット制御部７は、下唇４５を大きく開閉する制御を行い、ロボットの口をパクパクさせる。また、ロボット制御部７は首部４８を左右に振り、そして、右肩関節４６、左肩関節４７を中心とした回転運動をさせることによって、ロボットの腕を前後に大きく振る制御を行う。
【００５９】
また、ロボット制御部７が感性データ出力部３から受信した感性データが「快感」の「Ｌｖ＝１」であった場合、ロボット制御部７は、右まぶた４１、左まぶた４２を限界まで開く制御を行い、また、右肩関節４６、左肩関節４７を制御して右腕、左腕を頭部の脇まで持ち上げる。そして、腰部４９を制御して胴体を前に倒したり、起こしたりする。
また、ロボット制御部７が感性データ出力部３から受信した感性データが「快感」の「Ｌｖ＝２」であった場合、ロボット制御部７は、右まぶた４１、左まぶた４２を限界まで開く制御を行い、また、右肩関節４６、左肩関節４７を中心とした回転運動をさせることによって、ロボットの右腕と左腕をぐるぐると回転させる。
【００６０】
以上、本発明の実施の形態について説明したが、上述のロボット制御装置は内部に、コンピュータシステムを有している。そして、図１に示す各処理部の機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより、図１に示す各処理部に必要な処理を行ってもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。
【００６１】
また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可般媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。また、上記プログラムを通信回線によってコンピュータに配信し、この配信を受けたコンピュータが当該プログラムを実行するようにしても良い。
また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組合せで実現できるもの、いわゆる差分ファイル（差分プログラム）であっても良い。
【００６２】
【発明の効果】
以上説明したように、この発明によれば、データ記憶部から読み出された前記デジタルデータを基に前記周波数帯域毎の音圧値の分布を解析し、この解析結果に基づいて感情種類と、前記感情種類毎の感情レベルなどからなる感性データを算出する。そして、算出した前記感性データを出力し、その感性データに基づいて、ロボット制御部がロボットの動作を制御する。これにより、再生される音楽の感性に合った動きでロボットの動作を制御することができる。
【図面の簡単な説明】
【図１】本実施形態による感性データ算出方法を応用したロボット制御装置の構成を示すブロック図である。
【図２】本実施形態の感情解析部１３による処理の手順を示すフローチャートである。
【図３】本実施形態のロボット制御装置におけるデータの流れを示す概略図である。
【図４】本実施形態のロボット制御装置が制御するロボットの概略を示す図である。
【図５】本実施形態の感性データに基づくロボット制御部７の処理を示す表である。
【符号の説明】
１　スペクトラム解析部
２　感性データ蓄積部
３　感性データ出力部
４　ＭＰ３データ記憶部
５　ＭＰ３デコーダ部
６　Ｄ／Ａ変換部
７　ロボット制御部
１１　伸長部
１２　感性データ算出部
１３　感情解析部
１４　リズム解析部
[0001]
[Technical Field of the Invention]
The present invention relates to an information processing device, and more particularly to a robot control device, a robot control method, and a program therefor.
[0002]
[Prior Art]
Conventionally, robots shaped like humans or other animals and placed in exhibition halls and similar venues have been made to move by programming a fixed set of motions into them.
[0003]
[Problems to Be Solved by the Invention]
However, this program-based approach offered no way to extract emotion parameters expressing emotions from the music being played and to control the robot according to those parameters.
Accordingly, an object of the present invention is to provide a robot control device, a robot control method, and a program therefor that extract emotion parameters expressing emotions from the music being played and control the robot according to those parameters, so that the robot moves in a way that matches the sensibility of the music.
[0004]
[Means for Solving the Problems]
The present invention has been made to solve the above problem. The invention according to claim 1 is a robot control device comprising: a data storage unit that stores digital data expressing sound as sound pressure values for each of a plurality of divided frequency bands; a sensibility data calculation unit that analyzes the distribution of sound pressure values across the frequency bands based on the digital data read from the data storage unit and, based on the analysis result, calculates sensibility data consisting of an emotion type, an emotion level for each emotion type, and the like; a sensibility data output unit that outputs the sensibility data calculated by the sensibility data calculation unit; and a robot control unit that controls the operation of the robot based on the sensibility data received from the sensibility data output unit.
[0005]
With this configuration, the distribution of sound pressure values across the frequency bands is analyzed based on the digital data read from the data storage unit, and sensibility data consisting of an emotion type, an emotion level for each emotion type, and the like is calculated from the analysis result. The calculated sensibility data is then output, and the robot control unit controls the operation of the robot based on it. As a result, the robot can be made to move in a way that matches the sensibility of the music being played.
[0006]
The invention according to claim 2 is a robot control method for a robot control device provided with a data storage unit that stores digital data expressing sound as sound pressure values for each divided frequency band, the method comprising: a sensibility data calculation step of analyzing the distribution of sound pressure values across the frequency bands based on the digital data read from the data storage unit and, based on the analysis result, calculating sensibility data consisting of an emotion type, an emotion level for each emotion type, and the like; a sensibility data output step of outputting the sensibility data calculated in the sensibility data calculation step; and a robot control step of controlling the operation of the robot based on the sensibility data output in the sensibility data output step.
[0007]
The invention according to claim 3 is a robot control program for the computer of a robot control device provided with a data storage unit that stores digital data expressing sound as sound pressure values for each divided frequency band, the program causing the computer to execute: a sensibility data calculation process of analyzing the distribution of sound pressure values across the frequency bands based on the digital data read from the data storage unit and, based on the analysis result, calculating sensibility data consisting of an emotion type, an emotion level for each emotion type, and the like; a sensibility data output process of outputting the sensibility data calculated by the sensibility data calculation process; and a robot control process of controlling the operation of the robot based on the sensibility data output by the sensibility data output process.
[0008]
[Embodiments of the Invention]
Hereinafter, a robot control device according to an embodiment of the present invention will be described with reference to the drawings.
FIG. 1 is a block diagram showing the configuration of a robot control device to which the sensibility data calculation method according to the embodiment is applied. This robot control device plays back music (auditory data) recorded in MP3 (MPEG1 Audio Layer 3) format and calculates sensibility data based on that music.
[0009]
In FIG. 1, reference numeral 4 denotes an MP3 data storage unit that stores music data in MP3 format. Reference numeral 1 denotes a spectrum analysis unit that calculates and outputs sensibility data based on the music data read from the MP3 data storage unit 4. Reference numeral 2 denotes a sensibility data storage unit that accumulates the sensibility data calculated by the spectrum analysis unit 1. Reference numeral 3 denotes a sensibility data output unit that sequentially reads the sensibility data accumulated in the sensibility data storage unit 2 and outputs it to the robot control unit 7.
[0010]
Reference numeral 5 denotes an MP3 decoder unit that decodes the music data read from the MP3 data storage unit 4 and outputs a digital signal of time-series sound pressure levels (a digital audio signal). Reference numeral 6 denotes a D/A (digital/analog) conversion unit that converts the digital signal output from the MP3 decoder unit 5 into an analog audio signal. Reference numeral 7 denotes a robot control unit that controls the robot based on the control signal output by the sensibility data output unit 3.
[0011]
The music data stored in the MP3 data storage unit 4 is produced as follows: sound pressure levels sampled and quantized at a predetermined sampling frequency (for example, 44100 Hz) are converted, frame by frame with a predetermined frame length (for example, about 0.0261 seconds) as one frame, into sound pressure values for each of a predetermined number (for example, 576) of divided frequency bands, and the result is then compressed.
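The example figures in this paragraph can be checked with a few lines of Python. This is a sketch under two assumptions that the description does not state explicitly: that one frame holds 1152 samples (the frame size of MPEG-1 Audio Layer III), and that the 576 bands evenly divide the range from 0 Hz to the Nyquist frequency of 22050 Hz.

```python
SAMPLING_RATE_HZ = 44_100   # from the description
BANDS_PER_FRAME = 576       # from the description
SAMPLES_PER_FRAME = 1_152   # assumption: MPEG-1 Layer III frame size

# Duration of one frame in seconds (the description says "about 0.0261").
frame_seconds = SAMPLES_PER_FRAME / SAMPLING_RATE_HZ

# Width of one frequency band, assuming the bands evenly span 0 Hz to Nyquist.
band_width_hz = (SAMPLING_RATE_HZ / 2) / BANDS_PER_FRAME

print(f"frame length: {frame_seconds:.4f} s")
print(f"band width:   {band_width_hz:.5f} Hz")
```

Under these assumptions one band is 38.28125 Hz wide, which is consistent with band edges such as 76.5625 Hz (2 bands) and 497.6563 Hz (13 bands) that appear later in the description.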
[0012]
The spectrum analysis unit 1 reads such music data sequentially in time-series order, first decompresses the data it has read, then analyzes it according to a predetermined procedure described later, and sequentially outputs the results as sensibility data. The sensibility data output by the spectrum analysis unit 1 is also time-series data and is accumulated in order in the sensibility data storage unit 2.
[0013]
The sensibility data output unit 3 outputs the sensibility data accumulated in the sensibility data storage unit 2 to the robot control unit 7. The robot control unit 7 then controls the movement of the robot's arms, mouth, and other parts based on the sensibility data that the sensibility data output unit 3 has read from the sensibility data storage unit 2 and transmitted.
[0014]
The spectrum analysis unit 1, the sensibility data output unit 3, and the MP3 decoder unit 5 are synchronized with one another so that the timing of the sensibility data output by the sensibility data output unit 3 matches the progress of the music being played.
In addition, so that the music and the sensibility data output by the sensibility data output unit 3 do not drift apart even if the calculation of the sensibility data by the spectrum analysis unit 1 takes time, the spectrum analysis unit 1 starts calculating the sensibility data ahead of playback once playback of the music has been instructed, and the playback of the music by the MP3 decoder unit 5 and the output of the control signal from the sensibility data output unit 3 follow with a delay of several seconds to several tens of seconds. However, if the spectrum analysis unit 1 can calculate the sensibility data quickly enough, playback may be performed in real time without this delay.
[0015]
The MP3 data storage unit 4 is realized by a recording medium, such as a magnetic disk, a CD-ROM (read-only memory using a compact disc), a CD-R (CD Recordable), a DVD (Digital Versatile Disk), a magneto-optical disk, or a semiconductor memory, together with a reading device that reads it.
[0016]
The spectrum analysis unit 1 is realized using a computer. That is, the procedures for the sensibility data calculation and related processing described later are recorded on a computer-readable recording medium in the form of a computer program, and the functions such as sensibility data calculation are realized when the central processing unit of the computer reads this program and executes its instructions. Here, the computer-readable recording medium is, for example, a magnetic disk, a CD-ROM or CD-R, or a semiconductor memory. Alternatively, the spectrum analysis unit 1 may be realized as an integrated circuit equipped with dedicated logic. Alternatively, instead of being recorded on a computer-readable recording medium, the computer program may be distributed over a communication link and executed by the central processing unit of the computer that receives it.
The sensibility data storage unit 2 is realized using a recording medium that can be rewritten at high speed, such as a semiconductor memory or a magnetic disk.
[0017]
Next, the internal configuration of the spectrum analysis unit 1 and the details of its processing will be described. As shown in FIG. 1, the spectrum analysis unit 1 includes a decompression unit 11 and a sensibility data calculation unit 12. The decompression unit 11 decompresses the music data read from the MP3 data storage unit 4. That is, at the part marked A in FIG. 1 the data is transmitted in compressed form, and at the part marked B in FIG. 1 the sound pressure value information for each of the frequency bands (sound ranges) described above is transmitted in decompressed form. The sensibility data calculation unit 12 further includes an emotion analysis unit 13 and a rhythm analysis unit 14.
[0018]
Next, the processing procedures of the emotion analysis unit 13 and the rhythm analysis unit 14 will be described in detail.
[0019]
FIG. 2 is a flowchart showing the processing procedure of the emotion analysis unit 13. As shown in FIG. 2, the emotion analysis unit 13 first divides the input data into five sound ranges in step S1, then calculates emotion parameters based on the sound pressure values of these five ranges in step S2, and finally makes a determination based on the calculated emotion parameters in step S3. As the determination result, it outputs time-series data in which each record is a set consisting of an index, an emotion type, an emotion level, a duration, and an interpolation flag.
[0020]
The index is a sequential value starting from 0.
The emotion type is one of “expressionless (default)”, “pleasure”, “surprise”, “fear”, “happy”, and “sad”.
When the emotion type is “pleasure”, “surprise”, “fear”, “happy”, or “sad”, the emotion level takes an integer value from 1 to 5 inclusive. When the emotion type is “expressionless”, the value of the emotion level is “none”.
The duration is a numerical value in seconds and takes a value of 1 or more.
The interpolation flag takes a value of either 0 (representing “OFF”) or 1 (representing “ON”).
[0021]
The initial values used when processing time-series music data are index = 0, emotion type = “expressionless”, emotion level = “none”, duration = “1”, and interpolation flag = “1”.
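One way to picture a single record of this time-series output is the following Python sketch. The class name, field names, and the choice of `None` to stand for the level “none” are hypothetical; only the value ranges and the initial values are taken from the description.

```python
from dataclasses import dataclass
from typing import Optional

# The six emotion types; "default" stands for "expressionless".
EMOTION_TYPES = ("default", "pleasure", "surprise", "fear", "happy", "sad")


@dataclass
class SensibilityRecord:
    index: int = 0                       # sequential, starts at 0
    emotion_type: str = "default"        # one of EMOTION_TYPES
    emotion_level: Optional[int] = None  # 1..5, or None ("none") when expressionless
    duration_s: float = 1                # seconds, 1 or more
    interpolation: int = 1               # 0 = OFF, 1 = ON

    def __post_init__(self):
        if self.emotion_type not in EMOTION_TYPES:
            raise ValueError(f"unknown emotion type: {self.emotion_type}")
        if self.emotion_type == "default":
            if self.emotion_level is not None:
                raise ValueError("expressionless records carry no level")
        elif self.emotion_level not in range(1, 6):
            raise ValueError("emotion level must be an integer from 1 to 5")
        if self.duration_s < 1:
            raise ValueError("duration must be 1 second or more")
        if self.interpolation not in (0, 1):
            raise ValueError("interpolation flag must be 0 or 1")
```

Calling `SensibilityRecord()` with no arguments yields the initial values listed above.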
[0022]
The processing is described in more detail below.
Reference symbol D1 in FIG. 2 denotes the sound pressure value information for each frequency band that is input to the emotion analysis unit 13. At this stage, sound pressure value information is held for each of the 576 frequency bands. The sampling frequency of the original MP3 data is 44100 Hz. In other words, digital data expressing sound as sound pressure values for each divided frequency band is taken as input, and sensibility data related to that sound is calculated by analyzing the distribution of sound pressure values across the frequency bands using the method below.
[0023]
In step S1, the average sound pressure value for each of the following five sound ranges is calculated based on the sound pressure value information (D1) and output as sound pressure value information (D2). The five ranges are the low range (0 Hz to 76.5625 Hz), the mid-low range (229.6875 Hz to 1990.625 Hz), the mid-high range (7005.469 Hz to 10029.69 Hz), the high range (10029.69 Hz to 14967.97 Hz), and the highest range (15006.25 Hz to 17992.19 Hz).
That is, the entire frequency band is divided into five frequency band groups, and the analysis uses the sound pressure value of each group.
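The averaging in step S1 might look like the Python sketch below, which uses the band edges given in the Japanese description. It assumes that the 576 bands evenly divide 0 Hz to 22050 Hz (so one band is 38.28125 Hz wide) and that each range is half-open at its upper edge; neither point is stated in the description, and the names are hypothetical.

```python
BAND_WIDTH_HZ = 22_050 / 576  # assumption: bands evenly span 0 Hz to Nyquist

# Band edges in Hz for the five sound ranges.
RANGES_HZ = {
    "low": (0.0, 76.5625),
    "mid_low": (229.6875, 1990.625),
    "mid_high": (7005.469, 10029.69),
    "high": (10029.69, 14967.97),
    "highest": (15006.25, 17992.19),
}


def band_slice(lo_hz, hi_hz):
    """Convert a frequency range to a slice of band indices (upper edge excluded)."""
    return slice(round(lo_hz / BAND_WIDTH_HZ), round(hi_hz / BAND_WIDTH_HZ))


def range_averages(frame):
    """Average the per-band sound pressure values of one frame over each range."""
    if len(frame) != 576:
        raise ValueError("expected 576 band values per frame")
    averages = {}
    for name, (lo_hz, hi_hz) in RANGES_HZ.items():
        bands = frame[band_slice(lo_hz, hi_hz)]
        averages[name] = sum(bands) / len(bands)
    return averages
```

With this band width the five ranges map to band indices 0 to 1, 6 to 51, 183 to 261, 262 to 390, and 392 to 469, so the listed edges fall on band boundaries.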
[0024]
Also in step S1, long-tone elements and short-tone elements are extracted by dividing the spectrum along the musical scale. For this extraction, the band from 0 Hz to 497.6563 Hz is first divided evenly into 13 regions, and the band from 497.6563 Hz to 22050 Hz is divided by musical scale into 63 regions. It is then determined whether the sound pressure value of each of the 24 scale regions covering the two octaves from 497.6563 Hz to 2028.906 Hz is greater than a predetermined threshold.
[0025]
Of the above-mentioned 24 scale regions, the first, third, fifth, eighth, tenth, twelfth, thirteenth, fifteenth, seventeenth, twentieth, twenty-second, and twenty-fourth regions are the major elements. Among these, the first and thirteenth regions are one octave apart, and if the sound pressure values of these two regions are both larger than the threshold value, the major element is counted as +1. Similarly, the third and fifteenth regions, the fifth and seventeenth regions, the eighth and twentieth regions, the tenth and twenty-second regions, and the twelfth and twenty-fourth regions are each one octave apart, and for each pair, when the sound pressure values of both regions are larger than the threshold value, the major element is counted as +1.
Likewise, the second and fourteenth, the fourth and sixteenth, the sixth and eighteenth, the seventh and nineteenth, the ninth and twenty-first, and the eleventh and twenty-third regions of the above-mentioned 24 scale regions are each one octave apart, and for each pair, when the sound pressure values of both regions are larger than the threshold value, the minor element is counted as +1.
As a result of the extraction processing, the major element and the minor element each take an integer value from 0 to 6.
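The octave-pair counting described above can be sketched as follows. The two counts it returns are the ones used later as the major and minor elements in the “happy” and “sad” conditions. The function and variable names are hypothetical, and the input is assumed to be a list of 24 sound pressure values, one per scale region, in ascending order.

```python
# 1-based region numbers of the lower octave; the partner is 12 regions higher.
MAJOR_STARTS = (1, 3, 5, 8, 10, 12)
MINOR_STARTS = (2, 4, 6, 7, 9, 11)


def count_elements(region_levels, threshold):
    """Return (major, minor) counts, each an integer from 0 to 6."""
    if len(region_levels) != 24:
        raise ValueError("expected 24 scale regions (two octaves)")
    above = [level > threshold for level in region_levels]

    def count_pairs(starts):
        # Region s has list index s - 1; its octave partner s + 12 has index s + 11.
        return sum(1 for s in starts if above[s - 1] and above[s + 11])

    return count_pairs(MAJOR_STARTS), count_pairs(MINOR_STARTS)
```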
[0026]
Next, in step S2, emotion parameters are calculated based on the sound pressure value information D2. Priorities are assigned to the emotion parameters: the priority of “pleasure” is 1, the priority of “surprise” is 2, the priority of “fear” is 3, and the priorities of “happy” and “sad” are both 4.
Note that when all of the above five types of emotion parameter values are “0”, this corresponds to “no expression”.
[0027]
In step S3, a determination based on the calculated emotion parameters is performed to obtain the sensitivity data. This determination partially uses the result of the rhythm analysis by the rhythm analysis unit 14 shown in FIG. 1. The result of the rhythm analysis is, for example, the length of the time interval between beats.
In calculating the emotion parameter value, sounds having a sound pressure value of L1 or less are ignored.
[0028]
The processing relating to “pleasure (Pleasure)” is as follows.
[Condition 1] If the time interval between beats is T3 or more and any of the sound pressure peaks from the middle-low tone range to the high tone range moves in the treble direction over a time of T4 or more, the count of the emotion parameter of “pleasure” is incremented by 1. When this condition is met, the emotion is assumed to continue from the point at which time T4 has elapsed after the target sound starts sounding until the point at which time T2 has elapsed after the target sound stops. That is, in the present embodiment, a control signal based on the “pleasure” data is output during this duration.
[Condition 2] When the sound pressure value of the low tone range is L7 or more, the average sound pressure value of the high tone range is L4 or more, and the average sound pressure value is L6 or more, then if the value obtained by subtracting the current time interval between beats from the average time interval between beats up to the previous time is T1 or more, or if the previous determination result is “surprise”, the count of the emotion parameter of “pleasure” is incremented by 2. When this condition is met, the emotion is assumed to start from the point at which time T4 has elapsed after the target sound starts sounding.
[0029]
That is, when the above condition 2 is applied, the sensitivity data is calculated based on the average sound pressure value for each of the divided frequency band groups.
When the above condition 1 is applied, the sensitivity data is calculated based on how the frequency band having the peak sound pressure value changes with time in the frequency band group.
When the above condition 1 is applied, the number of beats per unit time of the rhythm included in the sound is obtained based on the original digital data, and the sensitivity data is calculated based on the number of beats per unit time. The “time interval between beats” is obtained as the reciprocal of the number of beats per unit time.
Since the priority of the emotion of “pleasure” is “1”, which is the highest, if either of the above conditions 1 and 2 is satisfied, the other emotions are ignored.
[0030]
The processing relating to “surprise” is as follows.
If the above-mentioned condition of "pleasure" is not satisfied, it is checked whether or not the condition of "surprise" is satisfied under the following conditions.
[0031]
[Condition 1] When a sound whose peak sound pressure value in the low tone range is L7 or more is first acquired from a silent state in which the average sound pressure value over the entire range is L3 or less, the count of the emotion parameter of “surprise” is incremented by 4, and the time during which the sound continues to sound is taken as the duration. However, this condition is ignored when the following condition 2 is satisfied.
[Condition 2] When a sound whose peak sound pressure value is L7 or more is first acquired from a silent state in which the average sound pressure value over the entire range is L2 or less, the count of the emotion parameter of “surprise” is incremented by 5, and the time during which the sound continues to sound is taken as the duration.
[0032]
[Condition 3] When a sound whose peak sound pressure value outside the low tone range is L7 or more is first acquired from a silent state in which the average sound pressure value over the entire range is L3 or less, the count of the emotion parameter of “surprise” is incremented by 1, and the time during which the sound continues to sound is taken as the duration. However, this condition is ignored when the following condition 4 is satisfied.
[Condition 4] When a sound whose peak sound pressure value outside the low tone range is L7 or more is first acquired from a silent state in which the average sound pressure value over the entire range is L2 or less, the count of the emotion parameter of “surprise” is incremented by 2, and the time during which the sound continues to sound is taken as the duration.
[Condition 5] When a sound in the highest tone range lasts for time T4 or more, or when a sound in the highest tone range exists and the average sound pressure value of the middle-high tone range is L4 or less, the count of the emotion parameter of “surprise” is incremented by 3, and the time during which the sound continues to sound is taken as the duration.
Note that the priority of the emotion of “surprise” is “2” next to that of “pleasure”, so if any of the above conditions 1 to 5 is satisfied, the other emotions with lower priority are ignored.
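As a rough sketch, the nested thresholds of “surprise” conditions 1 to 4 might be coded as below. Several points are assumptions made only for illustration: the “state with no sound” is modelled as the average level just before the onset, condition 2 is taken to refer to the low tone range like condition 1, and the low-range and non-low-range contributions are simply added. Condition 5 and the duration handling are omitted, and all names are hypothetical.

```python
def surprise_onset_count(prev_avg, low_peak, other_peak, L2, L3, L7):
    """Count contribution of surprise conditions 1-4 at the onset of a sound.

    prev_avg   -- average level over the entire range just before the onset
    low_peak   -- peak level of the new sound in the low tone range
    other_peak -- peak level of the new sound outside the low tone range
    """
    count = 0
    if low_peak >= L7:
        if prev_avg <= L2:
            count += 5  # condition 2 takes precedence over condition 1
        elif prev_avg <= L3:
            count += 4  # condition 1
    if other_peak >= L7:
        if prev_avg <= L2:
            count += 2  # condition 4 takes precedence over condition 3
        elif prev_avg <= L3:
            count += 1  # condition 3
    return count
```

Because L2 < L3, testing the stricter L2 threshold first implements the "ignored when the following condition is satisfied" rule without a separate flag.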
[0033]
The processing relating to “Fear” is as follows.
If none of the above “pleasure” or “surprise” conditions are met, it is checked whether or not “fear” is met under the following conditions.
[0034]
[Condition 1] When the peak of any of the sound pressure values from the middle and low pitches to the high pitch moves temporally by T4 or more in the low pitch direction, the count of the emotion parameter of “fear” is incremented by one.
[Condition 2] If any of the sound pressure peaks from the middle-low tone range to the high tone range moves in the bass direction over a time of T4 or more and then moves in the treble direction over a time of T4 or more, the count of the emotion parameter of “fear” is incremented by 4.
[Condition 3] When the number of times N that any of the sound pressure peaks from the middle-low tone range to the high tone range fluctuates in the treble direction while moving in the bass direction is 42 or more, the count of the emotion parameter of “fear” is incremented by N / 16.
[0035]
The start point of the output of the control signal based on the “fear” data is set after a lapse of time T4 from the start of the sound of the target, and the end point of the control signal is set after a lapse of the time T2 after the stop of the sound of the target.
Since the priority of the emotion of “fear” is “3” next to that of “surprise”, if any of the above conditions 1 to 3 is satisfied, the other emotions with lower priority are ignored.
[0036]
If none of the above-mentioned "pleasure", "surprise" and "fear" conditions are met, it is checked whether the condition is "happy" or "sad" according to the following conditions.
[0037]
The processing relating to “happy” is as follows.
[Condition 1] When there is a beat, the count of the emotion parameter of “happy” is incremented by one.
[Condition 2] When the time interval between beats is T7 or less, the count of the emotion parameter of “happy” is incremented by one.
[Condition 3] When the average sound pressure value of the treble portion is L4 or more, the count of the emotion parameter of “happy” is incremented by one.
[Condition 4] When Condition 3 described above is satisfied and there are five or more peaks in the sound pressure value of the middle / low sound part, the count of the emotion parameter of “happy” is incremented by +2.
[Condition 5] When the above condition 3 is satisfied, the above condition 4 is satisfied, and the average sound pressure value of the low-pitched sound portion is L5 or less, the count of the emotion parameter of “happy” is incremented by +2.
[Condition 6] When the value of the extracted major element minus minor element is 2 or more, the count of the emotion parameter of “happy” is incremented by one.
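The six “happy” conditions amount to a simple additive score, which might be sketched as follows. The feature names, the `Features` container, and the units are hypothetical. The sketch assumes that the “treble portion” corresponds to the high tone range and that condition 2 only applies when a beat exists. The threshold T7 is passed in as a parameter because its value is not given in the text.

```python
from dataclasses import dataclass


@dataclass
class Features:
    has_beat: bool
    beat_interval: float     # time between beats, same unit as T7
    high_avg: float          # average sound pressure of the high tone range
    mid_low_peak_count: int  # number of peaks in the middle-low tone range
    low_avg: float           # average sound pressure of the low tone range
    major: int               # extracted major element (0 to 6)
    minor: int               # extracted minor element (0 to 6)


def happy_count(f, T7, L4, L5):
    """Sum the increments of happy conditions 1-6."""
    count = 0
    if f.has_beat:
        count += 1                               # condition 1
    if f.has_beat and f.beat_interval <= T7:
        count += 1                               # condition 2
    cond3 = f.high_avg >= L4
    if cond3:
        count += 1                               # condition 3
    cond4 = cond3 and f.mid_low_peak_count >= 5
    if cond4:
        count += 2                               # condition 4
    if cond4 and f.low_avg <= L5:
        count += 2                               # condition 5
    if f.major - f.minor >= 2:
        count += 1                               # condition 6
    return count
```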
[0038]
The time error of the start point of the control signal output based on the “happy” data is ± T2. The time error of the end point of the control signal output is also ± T2.
[0039]
The processing relating to “Sad” is as follows.
[Condition 1] When the time interval between beats is equal to or longer than T5, the count of the emotion parameter of "sad" is incremented by one.
[Condition 2] When there is no beat, the count of the emotion parameter of “sad” is incremented by +2.
[Condition 3] When there is a peak of the sound pressure value that lasts for the time T4 or more in the middle / low-pitched sound part, the emotion parameter of “sad” is incremented by 1, and the time during which the sound continues to be sounded is set as the duration. However, when the following condition 4 is satisfied, it is ignored.
[Condition 4] When there is a peak of the sound pressure value that lasts for the time T6 or more in the middle and low pitch part, the emotion parameter of “sad” is increased by +2, and the time during which the sound continues to be sounded is set as the duration.
[0040]
[Condition 5] When there are three or more peaks of the sound pressure value in the treble part, the emotion parameter of "sad" is incremented by one.
[Condition 6] When there is no sound having an average sound pressure value of L3 or more in all regions, the emotion parameter of “sad” is incremented by one.
[Condition 7] When there is no sound having an average sound pressure value of L3 or more in all regions for a time T2 or more, the emotion parameter of “sad” is incremented by one.
[Condition 8] When the average sound pressure value of the middle and high pitch parts and the high pitch part is L3 or less and only the sound of the middle and low pitch parts is acquired, the emotion parameter of “sad” is increased by +2.
[Condition 9] When the value of the extracted minor element minus the major element is 2 or more, the emotion parameter of “sad” is incremented by one.
[0041]
The time error of the start point of the control signal output based on the "sad" data is ± T2. The time error of the end point of the control signal output is also ± T2.
[0042]
As described above, the emotions “pleasure”, “surprise”, “fear”, “happy”, and “sad” are each checked under their respectively defined conditions.
Then, the counts of “pleasure”, “surprise”, and “fear” are examined in order from the emotion with the highest priority, and the first emotion whose count is 1 or more is determined to be the emotion type. The count value at that time is used as the emotion level, which ranges from level 1 to level 5 (Lv = 1 to 5). When the count exceeds 5, the emotion level is set to 5.
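The priority-ordered selection and the level cap can be sketched as follows. The dictionary-based interface and the names are hypothetical; the sketch returns None when none of the three prioritized emotions has a count of 1 or more, leaving the “happy”/“sad” comparison to a separate step.

```python
PRIORITY_ORDER = ("pleasure", "surprise", "fear")  # priority 1, 2, 3


def pick_priority_emotion(counts):
    """Return (emotion_type, emotion_level), or None if no prioritized emotion applies."""
    for name in PRIORITY_ORDER:
        count = counts.get(name, 0)
        if count >= 1:
            return name, min(count, 5)  # counts above 5 are capped at level 5
    return None
```

For example, counts of 0 for “pleasure”, 7 for “surprise”, and 3 for “fear” select “surprise” at level 5, because “surprise” outranks “fear” and the level is capped.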
[0043]
If the emotion type is “fear” and the same emotion level continues for time T5 or more, the check is performed again every time T5.
If the emotion level shifts from 2 to 1 while the emotion type remains “pleasure”, the subsequent emotion level is regarded as 2, and emotion level 2 is continued.
[0044]
If the count values of “pleasure”, “surprise”, and “fear” are all 0 and at least one of the count values of “happy” and “sad” is 1 or more, the counts of “happy” and “sad” are compared by the following method. First, the average of the previous “happy” count value and the current “happy” count value is obtained. Next, the average of the previous “sad” count value and the current “sad” count value is obtained. Then, the average value of “happy” and the average value of “sad” are compared.
[0045]
If the average count value of “happy” is larger, the emotion type is set to “happy”, and the value obtained by subtracting the average count value of “sad” from the average count value of “happy” is used as the emotion level. Conversely, if the average count value of “sad” is larger, the emotion type is set to “sad”, and the value obtained by subtracting the average count value of “happy” from the average count value of “sad” is used as the emotion level.
If the average count value of “happy” is equal to the average count value of “sad”, the previous count values are compared and the emotion with the larger previous count value is selected as the emotion type. In this case, the emotion level is set to 1.
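The averaging comparison might be sketched as follows. The names are hypothetical. The text does not say what happens when both the averages and the previous counts are equal, so this sketch returns None in that case. It also leaves the emotion level unrounded and does not cover any special-case exception patterns.

```python
def happy_or_sad(prev_happy, cur_happy, prev_sad, cur_sad):
    """Return (emotion_type, emotion_level) from previous and current counts."""
    avg_happy = (prev_happy + cur_happy) / 2
    avg_sad = (prev_sad + cur_sad) / 2
    if avg_happy > avg_sad:
        return "happy", avg_happy - avg_sad
    if avg_sad > avg_happy:
        return "sad", avg_sad - avg_happy
    # Averages are equal: fall back to the previous counts, with level 1.
    if prev_happy > prev_sad:
        return "happy", 1
    if prev_sad > prev_happy:
        return "sad", 1
    return None  # not specified in the text; assumption of this sketch
```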
[0046]
However, regarding the determination using the count values of “happy” and “sad”, the following two exception patterns take precedence over the above rule whenever they apply.
The first exception pattern is the case where the count value of “happy” is 5 and the count value of “sad” is 5. In this case, the emotion type is set to “pleasure” and the emotion level is set to 2.
The second exception pattern is the case where the count value of “fear” is 3 or more and the count value of “sad” is 4 or more. In this case, the emotion type is set to “sad” and the emotion level is set to 5.
[0047]
If the count value is 0 for all five types of emotions, the emotion type is determined to be “expressionless”.
[0048]
Next, a determination method regarding the interpolation flag will be described. Although the default value of the interpolation flag is 1 (ON), the interpolation flag is set to 0 (OFF) only in one of the following two cases. First, when the same emotion type continues for the time T6 or more, the interpolation flag is set to 0. Secondly, when the previous emotion type is "happy" or "sad" and the state transits to the emotion type "pleasure", the interpolation flag is set to 0.
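The two cases that switch the interpolation flag off can be sketched as follows. The function name and arguments are hypothetical, and the sketch assumes that the caller tracks how long the current emotion type has continued.

```python
def interpolation_flag(prev_type, cur_type, same_type_duration, T6):
    """Return 1 (ON, the default) or 0 (OFF) for the interpolation flag."""
    # Case 1: the same emotion type has continued for time T6 or more.
    if cur_type == prev_type and same_type_duration >= T6:
        return 0
    # Case 2: transition from "happy" or "sad" to "pleasure".
    if prev_type in ("happy", "sad") and cur_type == "pleasure":
        return 0
    return 1
```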
[0049]
In the above-described processes such as the calculation of the emotion parameter and the determination of the emotion, for the times T1 to T6, appropriate values satisfying the relationship of T1 <T2 <T3 <T4 <T5 <T6 are used. T1 is about several hundred milliseconds, and T6 is about several thousand milliseconds. For sound pressure value levels L1 to L7, appropriate values that satisfy the relationship of L1 <L2 <L3 <L4 <L5 <L6 <L7 are used. As an example, L1 uses a value of about −50 dB (decibel), and L7 uses a value of about −20 dB.
[0050]
Next, processing in the rhythm analysis unit 14 shown in FIG. 1 will be described.
The data expanded by the expansion unit 11 is input to the rhythm analysis unit 14. As described above, this input data has sound pressure value information for each frequency band in time series. Based on this input data, the rhythm analysis unit 14 analyzes the rhythm of the music, and calculates and outputs the bpm value (beats per minute, that is, the number of beats per unit time) of the music.
[0051]
The rhythm analysis processing makes the following two assumptions. First, it is assumed that the rhythm of a song is kept accurately at a fixed bpm value for at least a certain period of time. Second, it is assumed that a noise-based sound is included twice per beat. For example, if the song is in quadruple time, a noise-based sound is included eight times during four beats. Here, a noise-based sound is, for example, a sound such as a cymbal.
Noise-based sounds are characterized by sound pressure changes over almost the entire frequency band. Accordingly, the amount of change in sound pressure between frames is obtained for each frequency band, and when the amount of change in sound pressure continuously exceeds a predetermined threshold value over all frequencies, this can be detected as noise-based sound.
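A simplified sketch of this detection is shown below. It assumes that each frame is a list of per-band sound pressure values, that the "amount of change" is the absolute difference between consecutive frames, and that a single frame-to-frame comparison is enough to flag a noise-based sound. The names and these simplifications are assumptions, not details of the embodiment.

```python
def is_noise_frame(prev_frame, cur_frame, threshold):
    """True if the change from prev_frame exceeds threshold in every band."""
    if len(prev_frame) != len(cur_frame):
        raise ValueError("frames must have the same number of bands")
    return all(abs(c - p) > threshold for p, c in zip(prev_frame, cur_frame))


def noise_frame_indices(frames, threshold):
    """Indices of frames detected as noise-based sounds."""
    return [
        i for i in range(1, len(frames))
        if is_noise_frame(frames[i - 1], frames[i], threshold)
    ]


def detection_intervals(indices):
    """Intervals, in frames, between consecutive detections."""
    return [b - a for a, b in zip(indices, indices[1:])]
```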
[0052]
Since noise-based sounds tend to concentrate at predetermined timings according to the rhythm, the noise-based sounds are detected and the detection intervals are calculated in units of frames (one frame is about 0.0261 seconds). At this stage, the detected intervals are not constant but are obtained as a frequency distribution over the number of frames. Based on the obtained distribution, the beat is corrected and the interval between beats is determined, whereby the bpm value is obtained.
According to the second assumption, two noise-based sounds are included per beat, so the bpm value can be obtained from the obtained noise interval F (in frames) by the following equation:
bpm value = 60 [second / minute] / (2 * F [frame] * 0.0261 [second / frame])
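The equation can be evaluated directly, as in the sketch below. The helper `dominant_interval`, which takes the most frequent detected interval as F, is only one plausible way to reduce the frequency distribution to a single value and is an assumption of this sketch, as are all the names.

```python
from collections import Counter

FRAME_SECONDS = 0.0261  # approximate duration of one frame


def dominant_interval(intervals):
    """Most frequent interval (in frames) among the detected intervals."""
    if not intervals:
        raise ValueError("no intervals detected")
    return Counter(intervals).most_common(1)[0][0]


def bpm_from_noise_interval(frames_between_noises):
    """bpm = 60 / (2 * F * 0.0261), with two noise-based sounds per beat."""
    if frames_between_noises <= 0:
        raise ValueError("interval must be positive")
    return 60.0 / (2 * frames_between_noises * FRAME_SECONDS)
```

For instance, a noise interval of F = 10 frames corresponds to a beat every 0.522 seconds, or roughly 115 bpm.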
[0053]
FIG. 3 is a schematic diagram showing the data flow in the above-described robot control device. As shown in the figure, decomposed audio data 22 is obtained by performing a process (31) of decomposing the audio data 21 into each frequency band. The sensitivity data 23 is then obtained by performing a process (32) of calculating the sensitivity data based on the decomposed data. When the sensitivity data 23 is output to the robot control unit 7 (33), the robot control unit 7 operates the robot based on the sensitivity data 23 (34). For example, the robot control unit 7 controls the movement of the arms, mouth, and the like of the robot.
[0054]
Next, an example in which the robot control device of the present embodiment is applied to the control of a humanoid robot will be described.
FIG. 4 is a diagram schematically illustrating a robot controlled by the robot control device. In this figure, reference numerals 41 and 42 denote a right eyelid and a left eyelid of the robot, respectively. The robot control unit 7 of the robot control device moves the right eyelid 41 and the left eyelid 42 up and down to control the size of the eyes. Normally, both eyelids are half open.
Reference numerals 43 and 44 denote a right eyeball and a left eyeball, respectively, and the robot control unit 7 controls the direction of the eyes by moving the right eyeball 43 and the left eyeball 44 left and right.
Reference numeral 45 denotes a lower lip, and the robot control unit 7 controls the opening and closing of the mouth by moving the lower lip 45 up and down.
Reference numerals 46 and 47 denote a right shoulder joint and a left shoulder joint, respectively. The robot control unit 7 swings the right arm and the left arm in the front-rear direction by rotating the right shoulder joint 46 and the left shoulder joint 47, respectively. The right shoulder joint 46 and the left shoulder joint 47 are located at the upper left and right sides of the body, but are designed so that they can move downward from these positions. The robot control unit 7 can thereby make the robot drop its shoulders by lowering the positions of the shoulder joints. The right shoulder joint 46 can also move the right arm to the right, and the left shoulder joint 47 can move the left arm to the left, so that the arms can be lifted to the side of the head.
Reference numeral 48 denotes a neck, and the robot control unit 7 can rotate the neck 48. Reference numeral 49 denotes a waist, and the robot control unit 7 controls the waist 49 to tilt the upper body of the robot forward.
The robot control device of FIG. 1 may be housed entirely inside the robot of FIG. 4. Alternatively, only the robot control unit 7 may be housed inside the robot of FIG. 4, with the remaining units installed outside the robot, so that the sensitivity data is transmitted and received between the sensitivity data output unit 3 and the robot control unit 7 using wireless transmission/reception devices.
[0055]
FIG. 5 is a table showing the processing of the robot controller 7 based on the sensitivity data.
As shown in FIG. 5, when the sensitivity data received by the robot control unit 7 from the sensitivity data output unit 3 is “happy” with “Lv = 1”, the robot control unit 7 swings the neck 48 left and right.
When the sensitivity data is “happy” with “Lv = 2”, the robot control unit 7 lowers the lower lip 45 slightly to open the robot's mouth a little, and swings the neck 48 left and right.
When the sensitivity data is “happy” with “Lv = 3”, the robot control unit 7 lowers the lower lip 45 slightly to open the robot's mouth a little. In addition, it controls the right shoulder joint 46 and the left shoulder joint 47 to lift the right arm and the left arm to shoulder height and spread both hands sideways.
When the sensitivity data is “happy” with “Lv = 4”, the robot control unit 7 lowers the lower lip 45 greatly to open the robot's mouth wide, and rotates the neck 48.
When the sensitivity data is “happy” with “Lv = 5”, the robot control unit 7 lowers the lower lip 45 greatly to open the robot's mouth wide. In addition, it controls the right shoulder joint 46 and the left shoulder joint 47 to lift the right arm and the left arm to the side of the head, and rotates the neck 48.
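One way to hold the “happy” rows of FIG. 5 in software is a lookup table keyed by emotion level, as in the sketch below. The action strings are informal summaries of the paragraphs above, and the table layout, names, and error handling are assumptions made for illustration. A real controller would map each entry to actuator commands.

```python
# Informal summary of the "happy" rows; keys are emotion levels (Lv).
HAPPY_ACTIONS = {
    1: ("swing neck 48 left and right",),
    2: ("lower lower lip 45 slightly", "swing neck 48 left and right"),
    3: ("lower lower lip 45 slightly",
        "raise both arms to shoulder height via joints 46 and 47, hands spread"),
    4: ("lower lower lip 45 greatly", "rotate neck 48"),
    5: ("lower lower lip 45 greatly",
        "raise both arms to the side of the head via joints 46 and 47",
        "rotate neck 48"),
}


def actions_for_happy(level):
    """Return the action summaries for a "happy" emotion level."""
    if level < 1:
        raise ValueError("emotion level must be at least 1")
    return HAPPY_ACTIONS[min(level, 5)]  # levels above 5 are treated as 5
```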
[0056]
When the sensitivity data received by the robot control unit 7 from the sensitivity data output unit 3 is “sad” with “Lv = 1”, the robot control unit 7 controls the waist 49 to tilt the torso slightly forward so that the robot faces downward.
When the sensitivity data is “sad” with “Lv = 2”, the robot control unit 7 lowers the right eyelid 41 and the left eyelid 42 to narrow the eyes of the robot, and controls the waist 49 to tilt the torso slightly forward so that the robot faces downward.
When the sensitivity data is “sad” with “Lv = 3”, the robot control unit 7 lowers the right eyelid 41 and the left eyelid 42 slightly to narrow the eyes of the robot, and controls the waist 49 to tilt the torso slightly forward so that the robot faces downward. Further, the robot control unit 7 makes the robot drop its shoulders by moving the positions of the right shoulder joint 46 and the left shoulder joint 47 downward.
When the sensitivity data is “sad” with “Lv = 4”, the robot control unit 7 lowers the right eyelid 41 and the left eyelid 42 fully so that the robot closes its eyes, and controls the waist 49 to tilt the torso slightly forward so that the robot faces downward. In addition, the robot control unit 7 swings the neck 48 left and right and moves the positions of the right shoulder joint 46 and the left shoulder joint 47 downward so that the robot drops its shoulders.
When the sensitivity data is “sad” with “Lv = 5”, the robot control unit 7 lowers the right eyelid 41 and the left eyelid 42 fully so that the robot closes its eyes, and controls the waist 49 to tilt the torso slightly forward so that the robot faces downward. Further, the robot control unit 7 rotates the neck 48 and moves the positions of the right shoulder joint 46 and the left shoulder joint 47 downward so that the robot drops its shoulders.
[0057]
When the sensitivity data received by the robot control unit 7 from the sensitivity data output unit 3 is “surprise” with “Lv = 1”, the robot control unit 7 controls the lower lip 45 to keep the mouth slightly open.
When the sensitivity data is “surprise” with “Lv = 2”, the robot control unit 7 controls the lower lip 45 to keep the mouth wide open.
When the sensitivity data is “surprise” with “Lv = 3”, the robot control unit 7 opens the right eyelid 41 and the left eyelid 42 to the limit and keeps the mouth slightly open.
When the sensitivity data is “surprise” with “Lv = 4”, the robot control unit 7 opens the right eyelid 41 and the left eyelid 42 to the limit and keeps the mouth wide open.
When the sensitivity data is “surprise” with “Lv = 5”, the robot control unit 7 opens the right eyelid 41 and the left eyelid 42 to the limit and keeps the mouth wide open. Further, it controls the right shoulder joint 46 and the left shoulder joint 47 to lift the right arm and the left arm to the side of the head.
[0058]
When the sensitivity data received by the robot control unit 7 from the sensitivity data output unit 3 is “fear” with “Lv = 1”, the robot control unit 7 opens and closes the lower lip 45 slightly so that the robot's mouth flaps open and shut.
When the sensitivity data is “fear” with “Lv = 2”, the robot control unit 7 opens and closes the lower lip 45 widely so that the robot's mouth flaps open and shut.
When the sensitivity data is “fear” with “Lv = 3”, the robot control unit 7 opens and closes the lower lip 45 so that the robot's mouth flaps open and shut. Further, the robot control unit 7 swings the neck 48 left and right.
When the sensitivity data is “fear” with “Lv = 4”, the robot control unit 7 opens and closes the lower lip 45 widely so that the robot's mouth flaps open and shut. Further, the robot control unit 7 swings the neck 48 left and right.
When the sensitivity data is “fear” with “Lv = 5”, the robot control unit 7 opens and closes the lower lip 45 widely so that the robot's mouth flaps open and shut. Further, the robot control unit 7 swings the neck 48 left and right and performs a rotational movement about the right shoulder joint 46 and the left shoulder joint 47, thereby swinging the robot's arms widely back and forth.
[0059]
When the sensitivity data received by the robot control unit 7 from the sensitivity data output unit 3 is “pleasure” with “Lv = 1”, the robot control unit 7 opens the right eyelid 41 and the left eyelid 42 to the limit, and controls the right shoulder joint 46 and the left shoulder joint 47 to lift the right arm and the left arm to the side of the head. It also controls the waist 49 so that the body is tilted forward or raised.
When the sensitivity data is “pleasure” with “Lv = 2”, the robot control unit 7 opens the right eyelid 41 and the left eyelid 42 to the limit, and performs a rotational movement about the right shoulder joint 46 and the left shoulder joint 47 to rotate the right arm and the left arm of the robot.
[0060]
The embodiments of the present invention have been described above. The above-described robot control device has a computer system inside. A program for realizing the function of each processing unit shown in FIG. 1 may be recorded on a computer-readable recording medium, and the necessary processing in each processing unit shown in FIG. 1 may be performed by causing a computer system to read and execute the program recorded on this recording medium. Here, the “computer system” includes an OS and hardware such as peripheral devices.
[0061]
The “computer-readable recording medium” refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, or a CD-ROM, or to a storage device such as a hard disk built into a computer system. Alternatively, the program may be distributed to a computer via a communication line, and the computer that has received the distribution may execute the program.
Further, the above-mentioned program may realize only a part of the above-mentioned functions, or may realize the above-mentioned functions in combination with a program already recorded in the computer system, that is, it may be a so-called differential file (differential program).
[0062]
[Effect of the invention]
As described above, according to the present invention, the distribution of the sound pressure value for each frequency band is analyzed based on the digital data read from the data storage unit, and based on the analysis result, sensitivity data including the emotion type and the emotion level for each emotion type is calculated. The calculated sensitivity data is then output, and the robot control unit controls the operation of the robot based on the sensitivity data. Thus, the operation of the robot can be controlled with a motion that matches the sensitivity of the music being reproduced.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration of a robot control device to which a sensitivity data calculation method according to an embodiment is applied.
FIG. 2 is a flowchart illustrating a procedure of processing by an emotion analysis unit 13 according to the embodiment.
FIG. 3 is a schematic diagram illustrating a data flow in the robot control device of the present embodiment.
FIG. 4 is a diagram schematically illustrating a robot controlled by the robot control device according to the embodiment.
FIG. 5 is a table showing processing of the robot control unit 7 based on the sensitivity data of the embodiment.
[Explanation of symbols]
1 spectrum analysis unit
2 sensitivity data storage unit
3 sensitivity data output unit
4 MP3 data storage unit
5 MP3 decoder unit
6 D/A conversion unit
7 robot control unit
11 expansion unit
12 sensitivity data calculation unit
13 emotion analysis unit
14 rhythm analysis unit

Claims

A robot control device comprising:
a data storage unit that stores digital data representing sound as a sound pressure value for each divided frequency band;
a sensitivity data calculation unit that analyzes the distribution of sound pressure values for each of the frequency bands based on the digital data read from the data storage unit and, based on the analysis result, calculates sensitivity data including an emotion type and an emotion level for each emotion type;
a sensitivity data output unit that outputs the sensitivity data calculated by the sensitivity data calculation unit; and
a robot control unit that controls the operation of a robot based on the sensitivity data received from the sensitivity data output unit.

A robot control method in a robot control device including a data storage unit that stores digital data representing sound as a sound pressure value for each divided frequency band, the method comprising:
a sensitivity data calculation step of analyzing the distribution of sound pressure values for each of the frequency bands based on the digital data read from the data storage unit and, based on the analysis result, calculating sensitivity data including an emotion type and an emotion level for each emotion type;
a sensitivity data output step of outputting the sensitivity data calculated in the sensitivity data calculation step; and
a robot control step of controlling the operation of a robot based on the sensitivity data output in the sensitivity data output step.

A robot control program for a computer of a robot control device including a data storage unit that stores digital data representing sound as a sound pressure value for each divided frequency band, the program causing the computer to execute:
sensitivity data calculation processing of analyzing the distribution of sound pressure values for each of the frequency bands based on the digital data read from the data storage unit and, based on the analysis result, calculating sensitivity data including an emotion type and an emotion level for each emotion type;
sensitivity data output processing of outputting the sensitivity data calculated by the sensitivity data calculation processing; and
robot control processing of controlling the operation of a robot based on the sensitivity data output by the sensitivity data output processing.