JPWO2023286224A5

JPWO2023286224A5 -

Info

Publication number: JPWO2023286224A5
Application number: JP2022507774A
Authority: JP
Filing date: 2021-07-14
Publication date: 2023-06-20
Anticipated expiration: 2041-07-14

Description

かかる課題を解決すべく、第１の発明は、以下のステップをコンピュータに実行させる会話処理プログラムを提供する。第１のステップでは、スピーカより出力された質問に対して、マイクより取得された会話相手の応答を解析する。第２のステップでは、応答がネガティブであるか否かを示す所定の評価基準に従って、それぞれの質問に対する応答を評価して評価値を付与する。第３のステップでは、評価値を時系列的に累積した評価累積値が所定のしきい値に到達した場合、会話途中において、スピーカより歌を再生すべき旨を指示する。第４のステップでは、ある応答に関する評価値の符号に応じて、この応答に対応する質問の提示頻度を調整する。 In order to solve this problem, the first invention provides a conversation processing program that causes a computer to execute the following steps. In the first step, the conversation partner's response obtained from the microphone is analyzed in response to the question output from the speaker. In a second step, the response to each question is evaluated and assigned a rating value according to predetermined criteria that indicate whether the response is negative or not. In the third step, when the accumulated evaluation value obtained by accumulating the evaluation values in time series reaches a predetermined threshold value, an instruction to reproduce the song from the speaker is given during the conversation. In the fourth step, depending on the sign of the evaluation value for a certain response, the presentation frequency of the question corresponding to this response is adjusted.

ここで、第１の発明において、スピーカによる歌の再生時にマイクより音声を取得し、マイクより取得された音声波形と、歌の音声波形との差分を算出することによって、歌の再生時における会話相手の反応を特定する第５のステップを設けてもよい。 Here, in the first invention, voice is acquired from the microphone when the song is played back by the speaker, and by calculating the difference between the voice waveform acquired from the microphone and the voice waveform of the song, conversation during playback of the song is performed. A fifth step of identifying the opponent's reaction may be provided.

第１の発明において、上記第３のステップは、上記評価値に応じて、スピーカより再生すべき歌の長さまたは種類を変えてもよい。また、上記評価値に応じて、人間と会話するキャラクターの動作を指示する第６のステップを設けてもよい。 In the first invention, the third step may change the length or type of song to be reproduced from the speaker according to the evaluation value. Further, a sixth step of instructing the action of the character conversing with the human according to the evaluation value may be provided.

第２の発明は、質問生成部と、応答解析部と、応答評価部と、歌指示部とを有する会話処理システムを提供する。質問生成部は、スピーカより出力すべき質問を生成する。応答解析部は、スピーカより出力された質問に対して、マイクより取得された会話相手の応答を解析する。応答評価部は、応答がネガティブであるか否かを示す所定の評価基準に従って、それぞれの質問に対する応答を評価して評価値を付与する。歌指示部は、評価値を時系列的に累積した評価累積値が所定のしきい値に到達した場合、会話途中において、スピーカより歌を再生すべき旨を指示する。ここで、質問生成部は、ある応答に関する評価値の符号に応じて、この応答に対応する質問の提示頻度を調整する。 A second invention provides a conversation processing system having a question generation section, a response analysis section, a response evaluation section, and a song instruction section. The question generator generates questions to be output from the speaker. The response analysis unit analyzes the conversational partner's response obtained from the microphone in response to the question output from the speaker. The response evaluator evaluates the response to each question according to predetermined evaluation criteria indicating whether the response is negative or not, and assigns an evaluation value. The song instruction unit instructs that a song should be reproduced from the speaker during conversation when an evaluation accumulated value obtained by accumulating evaluation values in time series reaches a predetermined threshold value. Here, the question generation unit adjusts the presentation frequency of the question corresponding to a certain response according to the sign of the evaluation value regarding this response.

第２の発明において、上記歌指示部は、上記評価値に応じて、スピーカより再生すべき歌の長さまたは種類を変えてもよい。また、上記評価値に応じて、人間と会話するキャラクターの動作を指示する動作指示部を設けてもよい。 In the second invention, the song instruction section may change the length or type of song to be reproduced from the speaker according to the evaluation value. Further, an action instructing section may be provided for instructing the action of the character that converses with the human according to the evaluation value.

第３の発明は、スピーカと、マイクと、歌再生部とを有する会話型ロボットを提供する。スピーカは、会話相手に対して質問および歌を出力する。マイクは、スピーカより出力された質問に対する会話相手の応答を取得する。歌再生部は、評価累積値が所定のしきい値に到達したタイミングにおいて、会話途中で歌を挿入してスピーカより再生する。ここで、評価累積値は、評価値を時系列的に累積した値である。また、評価値は、マイクより取得された応答がネガティブであるか否かを示す所定の評価基準に従って、それぞれの質問に対する応答を評価した値である。さらに、ある応答に対応する質問の提示頻度は、この応答に関する評価値の符号に応じて調整される。 A third invention provides a conversational robot having a speaker, a microphone, and a song reproducing section. The speaker outputs questions and songs to the conversation partner. A microphone acquires a conversation partner's response to a question output from a speaker. The song reproducing unit inserts a song in the middle of conversation and reproduces it from the speaker at the timing when the cumulative evaluation value reaches a predetermined threshold value. Here, the cumulative evaluation value is a value obtained by accumulating the evaluation values in time series. Also, the evaluation value is a value obtained by evaluating the response to each question according to a predetermined evaluation criterion indicating whether or not the response obtained from the microphone is negative. Furthermore, the presentation frequency of the question corresponding to a certain response is adjusted according to the sign of the evaluation value for this response.

Claims

In a conversation processing program,
a first step of analyzing a conversational partner's response obtained from a microphone in response to a question output from a speaker;
a second step of evaluating the response to each question and assigning a rating value according to a predetermined criteria indicating whether the response is negative;
a third step of instructing that a song should be reproduced from the speaker during conversation when the accumulated evaluation value obtained by accumulating the evaluation values in time series reaches a predetermined threshold value ;
a fourth step of adjusting the presentation frequency of a question corresponding to a certain response according to the sign of the evaluation value for the response;
A conversation processing program characterized by causing a computer to execute processing having

Acquiring the voice from the microphone when the song is played back by the speaker, and calculating the difference between the voice waveform acquired from the microphone and the voice waveform of the song, thereby identifying the conversation partner's reaction when the song is played back. 2. A dialogue processing program as recited in claim 1, further comprising a fifth step of:

In the second step, when the response is determined to be negative, a first evaluation value having a sign of one of plus and minus is given as the evaluation value, and the response is determined to be non-negative. 3. The conversation processing program according to claim 1 or 2, wherein, when said evaluation value is given, a second evaluation value having a sign opposite to that of said first evaluation value is given.

4. The method according to claim 3, wherein said second step determines whether said response is negative based on whether or not a pre-registered negative word is included in said response. A documented conversation processing program.

4. The conversation processing program according to claim 3, wherein said second step determines whether said response is negative based on the time required from said question to said response.

4. The method according to claim 3, wherein said second step determines whether or not said response is negative based on the volume of the conversation partner's voice acquired from said microphone with reference to the beginning of the conversation. conversation processor.

4. The conversation processing program according to claim 3, wherein said second step determines whether or not said response is negative based on the facial expression of the conversation partner photographed by a camera.

4. The conversation processing program according to claim 3, wherein said second step determines whether said response is negative based on the pulse of the conversation partner obtained by a pulse sensor.

3. The conversation processing program according to claim 1 , wherein said third step changes the length or type of song to be reproduced from said speaker according to said evaluation value.

3. The conversation processing program according to claim 1, further comprising a sixth step of instructing the action of the character conversing with a human according to said evaluation value.

In a conversation processing system,
a question generator that generates a question to be output from a speaker;
a response analysis unit that analyzes a conversation partner's response obtained from a microphone in response to a question output from the speaker;
a response evaluation unit that evaluates the response to each question and gives an evaluation value according to a predetermined evaluation criterion indicating whether the response is negative;
a song instruction unit for instructing that a song should be reproduced from the speaker during conversation when an accumulated evaluation value obtained by accumulating the evaluation values in time series reaches a predetermined threshold ;
The conversation processing system , wherein the question generation unit adjusts the presentation frequency of the question corresponding to a certain response according to the sign of the evaluation value related to the response.

The response analysis unit acquires voice from the microphone when the song is played back by the speaker, and calculates a difference between the voice waveform acquired from the microphone and the voice waveform of the song, thereby determining the 12. A conversation processing system according to claim 11 , wherein the conversation partner's reaction is specified.

When the response is determined to be negative, the response evaluation unit assigns a sign of either plus or minus as the evaluation value, and when the response is determined to be not negative, the evaluation value is 13. The conversation processing system according to claim 11 or 12, wherein a sign opposite to that of said first evaluation value is given.

14. The method according to claim 13 , wherein the response evaluation unit determines whether or not the response is negative based on whether or not a pre-registered negative word is included in the response. speech processing system.

14. The conversation processing system according to claim 13 , wherein said response evaluation unit determines whether said response is negative based on the time required from said question to said response.

14. The method according to claim 13 , wherein the response evaluation unit determines whether or not the response is negative based on the volume of the conversation partner's voice obtained from the microphone with reference to the beginning of the conversation. conversation processing system.

14. The conversation processing system according to claim 13 , wherein the response evaluation unit determines whether the response is negative based on the facial expression of the conversation partner captured by a camera.

14. The conversation processing system according to claim 13 , wherein the response evaluation unit determines whether the response is negative based on the pulse of the conversation partner obtained by a pulse sensor.

13. The dialogue processing system according to claim 11 , wherein said song instruction section changes the length or type of song to be reproduced from said speaker according to said evaluation value.

13. The dialogue processing system according to claim 11, further comprising an action instructing unit that instructs the action of the character that converses with the human according to the evaluation value.

In conversational robots,
a speaker that outputs questions and songs to a conversation partner;
a microphone that acquires a conversation partner's response to a question output from the speaker;
a song reproducing unit that inserts a song during conversation and reproduces it from the speaker at the timing when the accumulated evaluation value reaches a predetermined threshold;
The evaluation cumulative value is a value obtained by accumulating evaluation values in time series,
The evaluation value is a value obtained by evaluating the response to each question according to a predetermined evaluation criterion indicating whether or not the response obtained from the microphone is negative ,
A conversational robot , wherein the presentation frequency of a question corresponding to a certain response is adjusted according to the sign of the evaluation value relating to the response .

3. The song reproducing unit selects one of a plurality of songs registered in advance based on a reproduction instruction from a server connected to the conversational robot via a network, and outputs the selected song from the speaker. 21 , a conversational robot.