JP2022029869A

JP2022029869A - Program, device and method for creating dialog scenario with verbs replaced so as to correspond to character attribute

Info

Publication number: JP2022029869A
Application number: JP2020133426A
Authority: JP
Inventors: 俊一田原; Shunichi Tawara; 元服部; Hajime Hattori
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2020-08-05
Filing date: 2020-08-05
Publication date: 2022-02-18
Anticipated expiration: 2040-08-05
Also published as: JP7297721B2

Abstract

To provide a program, dialog scenario creation device and dialog scenario creation method.for creating a dialog scenario with verbs replaced so as to correspond to a character attribute.SOLUTION: A dialog scenario creation device 1 comprises: dialog scenario accumulation means 101 which accumulates dialog scenarios; contribution sentence accumulation means 102 which accumulates a plurality of contribution sentences of a plurality of users; meaning element determination means 11 which determines a meaning element in a spoken verb described in a spoken sentence of the dialog scenario based on a first character attribute, and associates a plurality of synonym verbs having the same meaning element with each other; feature synonym verb selection means 12 which selects the synonym verb for which an appearance frequency is most lopsided among the plurality of synonym verbs in the same meaning element as a feature synonym verb, by using the user's contribution sentence based on a second character attribute; and verb replacement means 13 which replaces the spoken verb described in the spoken sentence of the dialog scenario based on the first character attribute with the feature synonym verb, to convert the spoken sentence into a spoken sentence of the dialog scenario of the second character attribute.SELECTED DRAWING: Figure 2

Description

本発明は、ユーザと対話するエージェントの対話シナリオを作成する技術に関する。 The present invention relates to a technique for creating a dialogue scenario of an agent interacting with a user.

スマートフォンやタブレット端末、ロボットを用いて、ユーザに対して自然に対話する対話システムが普及している。コンピュータグラフィックスのエージェントが、ディスプレイに表示され、ユーザに対して音声やテキストで対話する。エージェントは、ユーザから見て対話可能な１人の人物として認識でき、ユーザの状況や、趣味趣向、感情に応じた対話を成立させる。
エージェントは、人物のプロフィールと同様に、個性的なキャラクタ属性（年齢、性別、出身地等）を有する。ユーザは、例えば自らが好むキャラクタ属性のエージェントと対話することによって、より親近感を持つことができる。 Dialogue systems that naturally interact with users using smartphones, tablet terminals, and robots have become widespread. A computer graphics agent appears on the display and interacts with the user by voice or text. The agent can be recognized as one person who can interact with the user, and establishes a dialogue according to the user's situation, hobbies, and emotions.
Agents have unique character attributes (age, gender, hometown, etc.) as well as a person's profile. The user can have a more intimate feeling, for example, by interacting with an agent having a character attribute that he / she likes.

図１は、エージェントとの対話によって、ユーザが抱く違和感を表す説明図である。 FIG. 1 is an explanatory diagram showing a sense of incongruity that a user feels through dialogue with an agent.

図１（ａ）によれば、２０代・女性のキャラクタ属性のエージェントが、以下のようにユーザと対話をしている。
ユーザ：「夜、ご飯どうするの？」
エージェント：「炒飯を食べるよ」
ユーザ：「よく聞く音楽って何ですか？」
エージェント：「ＫＡＮＡを聴くよ」 According to FIG. 1 (a), an agent with a character attribute of a woman in her twenties has a dialogue with a user as follows.
User: "What do you do with rice at night?"
Agent: "I'll eat fried rice."
User: "What is the music you listen to often?"
Agent: "Listen to KANA"

これに対し、図１（ｂ）によれば、１０代・男性のキャラクタ属性のエージェントが、図１（ａ）と同じ対話シナリオを用いて、同じようにユーザと対話している。
ユーザ：「夜、ご飯どうするの？」
エージェント：「炒飯を食べるよ」
ユーザ：「よく聞く音楽って何ですか？」
エージェント：「ＫＡＮＡを聴くよ」
この場合、ユーザは、１０代・男性のキャラクタ属性のエージェントと対話しているにも拘わらず、例えば「食べるよ」のように発話をすることに違和感を抱いてしまう。１０代・男性であれば、荒っぽい発話をする場合が多いためである。 On the other hand, according to FIG. 1 (b), a teenage / male character attribute agent interacts with the user in the same manner using the same dialogue scenario as in FIG. 1 (a).
User: "What do you do with rice at night?"
Agent: "I'll eat fried rice."
User: "What is the music you listen to often?"
Agent: "Listen to KANA"
In this case, the user feels uncomfortable to speak, for example, "I'll eat", even though he / she is interacting with an agent having a character attribute of a teenager / male. This is because teenagers and men often make rough utterances.

そのために、エージェントの対話シナリオは、一般的に、キャラクタ属性に合わせて人手で作成する必要がある。人手で作成することによって、キャラクタ属性に応じた個性を感じるエージェントを作り込むことができる。しかしながら、多様なエージェントの対話シナリオを人手で作成するには、コスト的な問題が生じる。 Therefore, the agent dialogue scenario generally needs to be manually created according to the character attribute. By creating it manually, it is possible to create an agent that feels individuality according to the character attributes. However, there is a cost problem in manually creating dialogue scenarios for various agents.

従来、所望のキャラクタ属性のエージェントに合わせて、発話文の各文節を自動的に書き換える技術がある（例えば特許文献１参照）。この技術によれば、年代・性別などのキャラクタ属性に応じて、人手で作成された発話の「書き換え規則集」を参照する。
例えば、５０代・男性のキャラクタ属性の対話シナリオ「ゴルフが好きだな」から、１０代・女性のキャラクタ属性の対話シナリオを作成する際に、「な」を「よ」と書き換える。これによって、１０代・女性のキャラクタ属性の対話シナリオとして「ゴルフが好きだよ」を作成する。
一般に、ゴルフは、５０代・男性に人気のスポーツである。そのために、５０代・男性のエージェントが、「ゴルフが好きだな」と発話することには全く違和感がない。しかしながら、１０代・女性のエージェントが、「ゴルフが好きだよ」と発話することには違和感がある。一般的な１０代・女性が発話する内容ではないためである。 Conventionally, there is a technique for automatically rewriting each phrase of an utterance sentence according to an agent having a desired character attribute (see, for example, Patent Document 1). According to this technique, a "rewriting rule collection" of utterances created by hand is referred to according to character attributes such as age and gender.
For example, when creating a dialogue scenario with a character attribute of a teenager / woman from the dialogue scenario "I like golf" with a character attribute of a man in his 50s, "na" is rewritten as "yo". This creates "I like golf" as a dialogue scenario for teenage and female character attributes.
In general, golf is a popular sport for men in their 50s. Therefore, there is no sense of discomfort when a male agent in his 50s says, "I like golf." However, it feels strange for a teenage female agent to say, "I like golf." This is because it is not the content spoken by general teenagers and women.

これに対して、対話シナリオの発話文に含まれる名詞を、所望のキャラクタ属性に相応しい名詞に置換する技術がある（例えば非特許文献１参照）。この技術によれば、以下のステップを実行することによって、キャラクタ属性に合わせた対話シナリオを作成する。
（Ｓ１）キャラクタ属性（例えば２０代女性、５０代男性など）を指定する。
（Ｓ２）対話シナリオにおける当該キャラクタの発話文に含まれる名詞について、当該名詞の類似候補をWikipedia（登録商標）から抽出する。
（Ｓ３）抽出された類似候補の中から、指定されたキャラクタ属性と同じキャラクタ属性のＳＮＳユーザの投稿文を抽出する。
（Ｓ４）当該ＳＮＳユーザの投稿文の中で、特徴的に出現する名詞を１つ決定する。
（Ｓ５）対話シナリオにおける当該キャラクタの発話文に含まれる当該名詞を、Ｓ４で決定された名詞に置換する。
例えば、２０代女性のキャラクタ属性の対話シナリオの発話文「私はＫＡＮＡをよく聴きます」に対して、５０代男性のキャラクタ属性の対話シナリオの発話文「私はＨＡＭＡＤＡをよく聴きます」と書き換えることができる。 On the other hand, there is a technique of replacing a noun included in an utterance sentence of a dialogue scenario with a noun suitable for a desired character attribute (see, for example, Non-Patent Document 1). According to this technique, a dialogue scenario tailored to a character attribute is created by performing the following steps.
(S1) A character attribute (for example, a woman in her twenties, a man in her fifties, etc.) is specified.
(S2) With respect to the noun included in the utterance sentence of the character in the dialogue scenario, similar candidates of the noun are extracted from Wikipedia (registered trademark).
(S3) From the extracted similar candidates, the posted text of the SNS user having the same character attribute as the specified character attribute is extracted.
(S4) One noun that appears characteristically in the posted sentence of the SNS user is determined.
(S5) The noun included in the utterance sentence of the character in the dialogue scenario is replaced with the noun determined in S4.
For example, the utterance sentence "I often listen to KANA" in the dialogue scenario of the character attribute of a woman in her twenties is rewritten as the utterance sentence "I often listen to HAMADA" in the dialogue scenario of the character attribute of a man in her fifties. be able to.

特開２０１３－１９６０７０号公報Japanese Unexamined Patent Publication No. 2013-196070

田原俊一ら、「対話シナリオにおけるキャラクタ性を強調した名詞への自動置換手法の提案と評価」、DEIM 2019、[online]、［令和２年７月８日検索］、インターネット＜URL:https://db-event.jpn.org/deim2019/post/papers/2.pdf＞Shunichi Tahara et al., "Proposal and evaluation of automatic replacement method for nouns that emphasizes character in dialogue scenarios", DEIM 2019, [online], [Search on July 8, 2nd year of Reiwa], Internet <URL: https: //db-event.jpn.org/deim2019/post/papers/2.pdf ＞ Wikipedia：カテゴリ「動詞」、[online]、［令和２年７月１日検索］、インターネット＜URL:https://ja.wikipedia.org/wiki/Category:%E5%8B%95%E8%A9%9E＞Wikipedia: Category "Verb", [online], [Search on July 1, 2nd year of Reiwa], Internet <URL: https://ja.wikipedia.org/wiki/Category:%E5%8B%95%E8% A9% 9E ＞ Weblio類語辞典、[online]、［令和２年７月８日検索］、インターネット＜URL:https://thesaurus.weblio.jp/＞Weblio Thesaurus, [online], [Search on July 8, 2nd year of Reiwa], Internet <URL: https://thesaurus.weblio.jp/> 連想類語辞典、[online]、［令和２年７月８日検索］、インターネット＜URL:https://renso-ruigo.com/word/＞Associative Thesaurus, [online], [Search on July 8, 2nd year of Reiwa], Internet <URL: https://renso-ruigo.com/word/> 類語辞書、[online]、［令和２年７月８日検索］、インターネット＜URL:https://dictionary.goo.ne.jp/thsrs/＞Synonyms dictionary, [online], [Search on July 8, 2nd year of Reiwa], Internet <URL: https://dictionary.goo.ne.jp/thsrs/>

しかしながら、例えば非特許文献１に記載の技術のように名詞の置換だけでは、ユーザは、その対話シナリオの発話文に違和感を抱く場合がある。
例えば「ＫＡＮＡ」は、２０代・女性に人気のあるアーティストであり、「ＨＡＭＡＤＡ」は５０代・男性に人気のあるアーティストであるとする。
このとき、２０代・女性のキャラクタ属性のエージェントの発話文として、「私のＫＡＮＡのＣＤをパクられた」があるとする。この発話に含まれる名詞を、非特許文献１に記載の技術によって、５０代・男性用のキャラクタの発話文に置換するとする。ここでは、２０代・女性のキャラクタの発話文に含まれる「ＫＡＮＡ」が、５０代・男性のキャラクタに適した「ＨＡＭＡＤＡ」に置換される。そうすると、「私のＨＡＭＡＤＡのＣＤをパクられた」という発話文に置換されることとなる。
しかしながら、書き換えた発話文の動詞「パクる」は若年層によく使われる言葉であり、５０代男性を特徴付ける発話としては違和感がある。即ち、発話文に含まれる動詞が、キャラクタ属性に合った動詞に置換されていないために、キャラクタ表現としては違和感が残る。 However, for example, as in the technique described in Non-Patent Document 1, the user may feel uncomfortable with the spoken sentence of the dialogue scenario only by replacing the noun.
For example, "KANA" is an artist popular with women in their twenties, and "HAMADA" is an artist popular with men in their fifties.
At this time, it is assumed that there is "I was struck by my KANA CD" as an utterance sentence of an agent with a character attribute of a woman in her twenties. It is assumed that the noun included in this utterance is replaced with the utterance sentence of a character for men in their 50s by the technique described in Non-Patent Document 1. Here, "KANA" included in the utterance of a female character in her twenties is replaced with "HAMADA" suitable for a male character in her fifties. Then, it will be replaced with the utterance sentence "My HAMADA CD was packed".
However, the verb "pakuru" in the rewritten utterance sentence is a word often used by young people, and it feels strange as an utterance that characterizes a man in his 50s. That is, since the verb included in the utterance sentence is not replaced with the verb that matches the character attribute, a sense of incongruity remains as a character expression.

これに対し、本願発明者らは、対話シナリオの中で単に名詞や文節を置換しても、そのキャラクタ属性に基づく人物の特徴を表すことにはならない、のではないかと考えた。実は、そのキャラクタ属性に基づく人物が、実際に発話することが想起される「動詞」こそ、そのキャラクタ属性に基づく人物の特徴を最も表すのではないか、と考えた。 On the other hand, the inventors of the present application thought that simply replacing a noun or a phrase in a dialogue scenario would not represent the characteristics of a person based on the character attribute. In fact, I thought that the "verb" that the person based on the character attribute is reminded to actually speak might be the most representative of the characteristics of the person based on the character attribute.

また、例えばWikipedia（登録商標）におけるページは名詞が大多数を占めており、動詞に関連する記事が少ない。そのため、動詞の置換候補が少ないという問題もある。例えばカテゴリ「動詞」を見ると、動詞の記事が４７記事しかない。そのために、単にWikipediaの動詞を参照しても、対話シナリオの発話文に含まれる動詞を、キャラクタ属性に応じた動詞に置換することは極めて難しい。 Also, for example, pages on Wikipedia (registered trademark) are dominated by nouns, and there are few articles related to verbs. Therefore, there is also a problem that there are few verb replacement candidates. For example, looking at the category "verbs", there are only 47 articles about verbs. Therefore, it is extremely difficult to replace the verb contained in the utterance sentence of the dialogue scenario with the verb corresponding to the character attribute by simply referring to the verb of Wikipedia.

そこで、本発明は、キャラクタ属性に応じた動詞に置換した対話シナリオを作成するプログラム、装置及び方法を提供することを目的とする。 Therefore, an object of the present invention is to provide a program, a device, and a method for creating a dialogue scenario in which a verb is replaced according to a character attribute.

本発明のプログラムによれば、
キャラクタ属性毎に、対話シナリオを蓄積した対話シナリオ蓄積手段と、
キャラクタ属性毎に、複数のユーザにおける複数の投稿文を蓄積する投稿文蓄積手段と、
第１キャラクタ属性に基づく対話シナリオの発話文に記述された発話動詞における意義素を決定し、意義素を同一とする複数の類義動詞を対応付ける意義素決定手段と、
第２キャラクタ属性に基づくユーザの投稿文を用いて、同一の意義素における複数の類義動詞の中で出現頻度が最も偏る類義動詞を、特徴類義動詞として選択する特徴類義動詞選択手段と、
第１キャラクタ属性に基づく対話シナリオの発話文に記述された当該発話動詞を、特徴類義動詞に置換して、第２キャラクタ属性の対話シナリオの発話文とする動詞置換手段と
してコンピュータを機能させることを特徴とする。 According to the program of the present invention
Dialogue scenario storage means that accumulates dialogue scenarios for each character attribute,
Post-sentence storage means for accumulating multiple posts by multiple users for each character attribute,
Meaning element determination means for determining the meaning element in the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute, and associating a plurality of synonymous verbs with the same meaning element.
Characteristic synonymous verb selection means for selecting a synonymous verb whose appearance frequency is most biased among multiple synonymous verbs in the same meaning element as a characteristic synonymous verb using a user's post based on the second character attribute. When,
Replacing the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute with a characteristic verb, and making the computer function as a verb replacement means to be the utterance sentence of the dialogue scenario of the second character attribute. It is characterized by.

本発明のプログラムにおける他の実施形態によれば、
意義素決定手段は、
第１キャラクタ属性に基づく対話シナリオの発話文に記述された発話動詞について、複数の意義素が決定された際に、
類義語コーパスを用いて、意義素毎に、当該意義素を同一とする複数の類義動詞を検索し、
第１キャラクタ属性に基づくユーザの投稿文の中で、意義素毎に各類義動詞の出現頻度を計数し、
各類義動詞の出現頻度が最も高い意義素に決定する
ようにコンピュータを機能させることも好ましい。 According to other embodiments in the program of the invention
Meaning factor determination means
When multiple meanings are determined for the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute,
Using the synonym corpus, search for multiple synonymous verbs that have the same meaning for each meaning element.
In the user's post based on the first character attribute, the frequency of appearance of each synonymous verb is counted for each meaning element, and
It is also preferable to make the computer function so as to determine the meaning element having the highest frequency of appearance of each synonymous verb.

本発明のプログラムにおける他の実施形態によれば、
意義素決定手段は、
第１キャラクタ属性に基づく対話シナリオの発話文から、係り受け関係となる名詞と動詞とを抽出する品詞抽出手段と、
類義語コーパスを用いて、意義素毎に、係り受け動詞に類似する類義動詞を検索する類義動詞検索手段と、
第１キャラクタ属性に基づくユーザの投稿文の中から、係り受け名詞と各類義動詞との共起頻度を検索し、各類義動詞の共起頻度が最も高い意義素を決定する共起頻度検索手段と
してコンピュータを機能させることも好ましい。 According to other embodiments in the program of the invention
Meaning factor determination means
Part-of-speech extraction means for extracting nouns and verbs that are dependent on each other from the utterance sentences of the dialogue scenario based on the first character attribute.
A synonym verb search method for searching for synonymous verbs similar to dependent verbs for each meaning element using a synonym corpus,
The co-occurrence frequency of searching the co-occurrence frequency of the dependent noun and each synonymous verb from the user's post based on the first character attribute, and determining the co-occurrence element with the highest co-occurrence frequency of each synonymous verb. It is also preferable to make the computer function as a search means.

本発明のプログラムにおける他の実施形態によれば、
特徴類義動詞選択手段は、
以下のように、赤池情報量規準(ＡＩＣ：Akaike's Information Criterion)を用いて、キャラクタ属性P1に対する類義動詞P2の評価値V(P1,P2)を算出し、当該評価値V(P1,P2)が所定閾値以上となる、又は、当該評価値V(P1,P2)が大きいものから順に所定数となる、類義動詞P2を検出対象とする
p11：キャラクタ属性P1のユーザが、類義動詞P2に関する投稿文を投稿した数
p12：キャラクタ属性P1のユーザが、類義動詞P2以外の投稿文を投稿した数
p21：キャラクタ属性P1以外のユーザが、類義動詞P2に関する投稿文を投稿した数
p22：キャラクタ属性P1以外のユーザが、類義動詞P2以外の投稿文を投稿した数
p11＋p12＋p21＋p22＝全てのユーザが、投稿文を投稿した数
MLL_IM(P1,P2)＝(p11+p12) log(p11+p12)
＋(p11+p21) log(p11+p21)
＋(p21+p22) log(p21+p22)
＋(p12+p22) log(p12+p22)－2N log N
MLL_DM(P1,P2)＝p11 log p11＋p12 log p12＋p21 log p21＋p22 log p22－N log N
但し、N＝p11＋p12＋p21＋p22
AIC_IM(P1,P2)＝-2 × MLL_IM(P1,P2) ＋ 2×2
AIC_IM(P1,P2)：キャラクタ属性P1と類義動詞P2との組の従属AIC
AIC_DM(P1,P2)＝-2 × MLL_DM(P1,P2) ＋ 2×3
AIC_DM(P1,P2)：キャラクタ属性P1と類義動詞P2との組の独立AIC
V(P1,P2)＝AIC_IM(P1,P2) － AIC_DM(P1,P2)
ようにコンピュータを機能させることも好ましい。 According to other embodiments in the program of the invention
Characteristic verb selection means
As shown below, using the Akaike's Information Criterion (AIC), the evaluation value V (P1, P2) of the synonymous verb P2 for the character attribute P1 is calculated, and the evaluation value V (P1, P2). The synonymous verb P2 whose evaluation value V (P1, P2) is greater than or equal to the predetermined threshold or whose evaluation value V (P1, P2) is the largest is the predetermined number.
p11: The number of posts related to the synonymous verb P2 by the user of the character attribute P1.
p12: The number of posted sentences other than the synonymous verb P2 by the user of the character attribute P1.
p21: The number of posts related to the synonymous verb P2 by users other than the character attribute P1.
p22: The number of posted sentences other than the synonymous verb P2 by users other than the character attribute P1.
p11 ＋ p12 ＋ p21 ＋ p22 ＝ Number of posts posted by all users
MLL_IM (P1, P2) = (p11 + p12) log (p11 + p12)
＋ (p11 + p21) log (p11 + p21)
＋ (p21 + p22) log (p21 + p22)
＋ (p12 + p22) log (p12 + p22) －2N log N
MLL_DM (P1, P2) ＝ p11 log p11 ＋ p12 log p12 ＋ p21 log p21 ＋ p22 log p22－N log N
However, N = p11 + p12 + p21 + p22
AIC_IM (P1, P2) = -2 x MLL_IM (P1, P2) + 2 x 2
AIC_IM (P1, P2): Subordinate AIC of a set of character attribute P1 and synonymous verb P2
AIC_DM (P1, P2) = -2 x MLL_DM (P1, P2) + 2 x 3
AIC_DM (P1, P2): Independent AIC of a pair of character attribute P1 and synonymous verb P2
V (P1, P2) = AIC_IM (P1, P2) -AIC_DM (P1, P2)
It is also preferable to make the computer function as such.

本発明のプログラムにおける他の実施形態によれば、
投稿文は、不特定複数のユーザがＳＮＳ(Social Networking Service)又はブログに投稿したテキストであり、
キャラクタ属性は、年代、性別、出身地のいずれか又はそれらの組み合わせに基づく分類属性である
ようにコンピュータを機能させることも好ましい。 According to other embodiments in the program of the invention
The posted text is a text posted on an SNS (Social Networking Service) or a blog by an unspecified number of users.
It is also preferable to make the computer function so that the character attribute is a classification attribute based on any one of age, gender, birthplace, or a combination thereof.

本発明のプログラムにおける他の実施形態によれば、
対話シナリオを用いて、エージェントがユーザと対話する対話手段と、
対話手段によってユーザとの対話中に、対話シナリオの中で特徴類義動詞に置換された発話文がユーザに向けて発話された際に又はその後、ユーザに対して当該発話文に違和感を生じるか否かをテキスト又は音声によって問い合わせ、ユーザが違和感を生じると判定した際に、置換された特徴類義動詞を元の発話動詞に戻すべく再度置換する発話動詞修正手段と
してコンピュータを更に機能させることも好ましい。 According to other embodiments in the program of the invention
A means of dialogue in which an agent interacts with a user using a dialogue scenario,
Whether the utterance sentence replaced with the characteristic synonymous verb in the dialogue scenario causes a sense of discomfort to the user when or after the utterance sentence is spoken to the user during the dialogue with the user by the dialogue means. It is also possible to further function the computer as an utterance verb correction means that replaces the replaced feature synonymous verb with the original utterance verb when the user inquires by text or voice and determines that the user feels uncomfortable. preferable.

本発明の対話シナリオ作成装置によれば、
キャラクタ属性毎に、対話シナリオを蓄積した対話シナリオ蓄積手段と、
キャラクタ属性毎に、複数のユーザにおける複数の投稿文を蓄積する投稿文蓄積手段と、
第１キャラクタ属性に基づく対話シナリオの発話文に記述された発話動詞における意義素を決定し、意義素を同一とする複数の類義動詞を対応付ける意義素決定手段と、
第２キャラクタ属性に基づくユーザの投稿文を用いて、同一の意義素における複数の類義動詞の中で出現頻度が最も偏る類義動詞を、特徴類義動詞として選択する特徴類義動詞選択手段と、
第１キャラクタ属性に基づく対話シナリオの発話文に記述された当該発話動詞を、特徴類義動詞に置換して、第２キャラクタ属性の対話シナリオの発話文とする動詞置換手段と
を有することを特徴とする。 According to the dialogue scenario creating device of the present invention.
Dialogue scenario storage means that accumulates dialogue scenarios for each character attribute,
Post-sentence storage means for accumulating multiple posts by multiple users for each character attribute,
Meaning element determination means for determining the meaning element in the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute, and associating a plurality of synonymous verbs with the same meaning element.
Characteristic synonymous verb selection means for selecting a synonymous verb whose appearance frequency is most biased among multiple synonymous verbs in the same meaning element as a characteristic synonymous verb using a user's post based on the second character attribute. When,
It is characterized by having a verb replacement means for replacing the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute with a characteristic synonymous verb to be the utterance sentence of the dialogue scenario of the second character attribute. And.

本発明の対話シナリオ作成方法によれば、
装置は、
キャラクタ属性毎に、対話シナリオを蓄積した対話シナリオ蓄積手段と、
キャラクタ属性毎に、複数のユーザにおける複数の投稿文を蓄積する投稿文蓄積手段と
を有し、
第１キャラクタ属性に基づく対話シナリオの発話文に記述された発話動詞における意義素を決定し、意義素を同一とする複数の類義動詞を対応付ける第１のステップと、
第２キャラクタ属性に基づくユーザの投稿文を用いて、同一の意義素における複数の類義動詞の中で出現頻度が最も偏る類義動詞を、特徴類義動詞として選択する第２のステップと、
第１キャラクタ属性に基づく対話シナリオの発話文に記述された当該発話動詞を、特徴類義動詞に置換して、第２キャラクタ属性の対話シナリオの発話文とする第３のステップと
を実行することを特徴とする。 According to the dialogue scenario creation method of the present invention.
The device is
Dialogue scenario storage means that accumulates dialogue scenarios for each character attribute,
It has a post sentence storage means for accumulating a plurality of post sentences by a plurality of users for each character attribute.
The first step of determining the meaning elements in the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute, and associating a plurality of synonymous verbs having the same meaning element,
The second step of selecting the synonymous verb whose appearance frequency is most biased among multiple synonymous verbs of the same meaning as the characteristic synonymous verb by using the user's posted sentence based on the second character attribute.
The third step of replacing the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute with a characteristic synonymous verb and making it the utterance sentence of the dialogue scenario of the second character attribute is executed. It is characterized by.

本発明のプログラム、装置及び方法によれば、キャラクタ属性に応じた動詞に置換した対話シナリオを作成することができる。 According to the program, device, and method of the present invention, it is possible to create a dialogue scenario in which verbs are replaced according to character attributes.

エージェントとの対話によって、ユーザが抱く違和感を表す説明図である。It is explanatory drawing which shows the uncomfortable feeling which a user has by the dialogue with an agent. 本発明における対話シナリオ作成装置の機能構成図である。It is a functional block diagram of the dialogue scenario creation apparatus in this invention. 本発明における対話シナリオ蓄積部及び投稿文蓄積部の説明図である。It is explanatory drawing of the dialogue scenario storage part and the post sentence storage part in this invention. 本発明における意義素決定部の機能構成図である。It is a functional block diagram of the meaning element determination part in this invention. 本発明における特徴類義動詞選択部の説明図である。It is explanatory drawing of the characteristic type verb selection part in this invention. 本発明における動詞置換部の説明図である。It is explanatory drawing of the verb substitution part in this invention. 本発明によって置換された対話シナリオを表す説明図である。It is explanatory drawing which shows the dialogue scenario replaced by this invention. 本発明によって置換された動詞を修正するインタフェースを表す説明図である。It is explanatory drawing which shows the interface which modifies the verb replaced by this invention.

以下、本発明の実施の形態について、図面を用いて詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図２は、本発明における対話シナリオ作成装置の機能構成図である。 FIG. 2 is a functional configuration diagram of the dialogue scenario creating device in the present invention.

本発明の対話シナリオ作成装置１は、エージェントのキャラクタ属性に応じた対話シナリオを自動的に作成する。
図２によれば、対話シナリオ作成装置１は、対話部１００と、対話シナリオ蓄積部１０１と、投稿文蓄積部１０２と、意義素決定部１１と、特徴類義動詞選択部１２と、動詞置換部１３と、発話動詞修正部１４とを有する。これら機能構成部は、装置に搭載されたコンピュータを機能させるプログラムを実行することによって実現される。また、これら機能構成部の処理の流れは、対話シナリオ作成方法としても理解できる。 The dialogue scenario creation device 1 of the present invention automatically creates a dialogue scenario according to the character attribute of the agent.
According to FIG. 2, the dialogue scenario creating device 1 includes a dialogue unit 100, a dialogue scenario storage unit 101, a posted sentence storage unit 102, a meaning element determination unit 11, a characteristic synonymous verb selection unit 12, and a verb replacement. It has a unit 13 and a spoken verb correction unit 14. These functional components are realized by executing a program that makes a computer mounted on the device function. In addition, the processing flow of these functional components can be understood as a method for creating a dialogue scenario.

本発明の対話シナリオ作成装置１は、
第１キャラクタ属性のエージェントにおける第１対話シナリオの発話文に含まれる動詞に関係する意義素を決定し（意義素決定部１１）、
意義素の中に複数存在する類義動詞の中から、作成したい第２の対話シナリオの第２キャラクタ属性を特徴付ける類義動詞を選択し（特徴類義動詞選択部１２）、
第１対話シナリオの発話文に含まれる当該動詞を、第２キャラクタ属性に応じた類義動詞に置換する（動詞置換部１３）
ことによって、第２キャラクタ属性のエージェントにおける第２対話シナリオを自動的に作成する。 The dialogue scenario creating device 1 of the present invention is
The meaning element related to the verb included in the utterance sentence of the first dialogue scenario in the agent of the first character attribute is determined (meaning element determination unit 11).
From among the multiple synonymous verbs that exist in the meaning element, select the synonymous verb that characterizes the second character attribute of the second dialogue scenario that you want to create (characteristic synonymous verb selection unit 12).
The verb included in the utterance sentence of the first dialogue scenario is replaced with a synonymous verb corresponding to the second character attribute (verb replacement part 13).
By doing so, the second dialogue scenario in the agent of the second character attribute is automatically created.

［対話部１００］
対話部１００は、ユーザが所持する端末２に対して、ユーザインタフェースを介して、音声やテキストで対話するエージェントが起動する。ユーザは、所望のキャラクタ属性（年齢、性別、出身地等）に応じたエージェントを選択することができる。エージェントは、対話シナリオ蓄積部１０１を参照し、そのキャラクタ属性に応じた対話シナリオを用いてユーザと対話する。エージェントは、ユーザから見て、そのキャラクタ属性に基づく人物として認識できる。 [Dialogue unit 100]
The dialogue unit 100 activates an agent that interacts with the terminal 2 owned by the user by voice or text via the user interface. The user can select an agent according to a desired character attribute (age, gender, hometown, etc.). The agent refers to the dialogue scenario storage unit 101 and interacts with the user using the dialogue scenario according to the character attribute. The agent can be recognized by the user as a person based on the character attribute.

［対話シナリオ蓄積部１０１］
対話シナリオ蓄積部１０１は、キャラクタ属性毎に、対話シナリオを蓄積したものである。
対話シナリオ蓄積部１０１は、対話シナリオを、受話文と発話文とによって構成する。受話文は、ユーザが発話するであろうテキストを想定したものであり、発話文は、その受話文に対してエージェントが発話するテキストである。
また、エージェントのキャラクタ属性は、例えば年代、性別、出身地のいずれか又はそれらの組み合わせに基づく分類属性とする。 [Dialogue scenario storage unit 101]
The dialogue scenario storage unit 101 stores dialogue scenarios for each character attribute.
The dialogue scenario storage unit 101 constitutes a dialogue scenario with an received sentence and an uttered sentence. The utterance sentence assumes a text that the user will utter, and the utterance sentence is a text that the agent utters with respect to the utterance sentence.
Further, the character attribute of the agent is, for example, a classification attribute based on any one of age, gender, hometown, or a combination thereof.

図３は、本発明における対話シナリオ蓄積部及び投稿文蓄積部の説明図である。 FIG. 3 is an explanatory diagram of a dialogue scenario storage unit and a posted text storage unit in the present invention.

図３（ａ）によれば、対話シナリオ蓄積部１０１に蓄積された第１キャラクタ属性の対話シナリオが表されている。ここで、第１キャラクタ属性は、例えば２０代・女性とする。
（受話文）（発話文）
・・・・・・・・・・・・・・・・・・・
「夜、ご飯、どうするの？」「炒飯を食べるよ」
・・・・・・・・・・・・・・・・・・・
エージェントの発話文の動詞は、「食べるよ」となっており、第１キャラクタ属性「２０代・女性」を想定する話し方となっている。キャラクタ属性に応じた微妙な動詞の発話文に調整するために、従来技術によれば人手によって作成されていた。
本発明の実施形態によれば、第１キャラクタ属性「２０代・女性」の第１対話シナリオから、第２キャラクタ属性「１０代・男性」の第２対話シナリオを自動的に作成する。 According to FIG. 3A, the dialogue scenario of the first character attribute stored in the dialogue scenario storage unit 101 is shown. Here, the first character attribute is, for example, a woman in her twenties.
(Received sentence) (Spoken sentence)
・・・・・・・・・・・・・・・・・
"What do you do with rice at night?""I'll eat fried rice."
・・・・・・・・・・・・・・・・・
The verb of the agent's utterance is "I'll eat", which is a way of speaking assuming the first character attribute "female in her twenties". According to the prior art, it was created manually in order to adjust the utterance sentence of a delicate verb according to the character attribute.
According to the embodiment of the present invention, the second dialogue scenario of the second character attribute "teens / male" is automatically created from the first dialogue scenario of the first character attribute "twenties / female".

［投稿文蓄積部１０２］
投稿文蓄積部１０２は、キャラクタ属性（ユーザ属性、デモグラフィック）毎に、複数のユーザにおける複数の投稿文を蓄積したものである。
投稿文は、例えばＳＮＳ(Social Networking Service)又はブログに投稿されたテキストであって、不特定多数の第三者、又は、特定人物によって送信されたものであってもよい。 [Posted text storage unit 102]
The posted text storage unit 102 stores a plurality of posted texts by a plurality of users for each character attribute (user attribute, demographic).
The posted text may be, for example, a text posted on an SNS (Social Networking Service) or a blog, and may be sent by an unspecified number of third parties or a specific person.

図３（ｂ）によれば、キャラクタ属性毎に、例えば以下のような投稿文が列挙されている。
第１キャラクタ属性（２０代・女性）に基づく投稿文
「炒飯でも食べるよ」
・・・・・・
「炒飯食べて元気だそう」
「炒飯を召し上がっちゃおう」
第２キャラクタ属性（１０代・男性）に基づく投稿文
・・・・・・
「飯食うよ」
・・・・・・
「飯、食って腹一杯」
「炒飯でも召し上がろうかね」
投稿文には、男性らしい口調や、女性らしい口調、若者らしい口調、老年者らしい口調、地方の方言のような口調が表れる。これらは、キャラクタ属性に応じて投稿文の動詞に影響を与える。 According to FIG. 3B, for example, the following posted sentences are listed for each character attribute.
Posts based on the first character attribute (female in their 20s)
"I'll eat fried rice"
・・・・・・
"I'm fine eating fried rice."
"Let's eat fried rice"
Posts based on the second character attribute (teens / male)
・・・・・・
"I'll eat"
・・・・・・
"Rice, eat and be full"
"Would you like to eat fried rice?"
Posts have a masculine tone, a feminine tone, a youthful tone, an elderly tone, and a local dialect-like tone. These affect the verbs in the post depending on the character attributes.

不特定多数の投稿文をキャラクタ属性毎に分類するために、例えば特許文献１のように、投稿文毎に、性別・年代や出身地のようなキャラクタ属性を推定する技術を用いることもできる。ＳＮＳサイトサーバから取得可能な投稿者間の交流関係を用いて、投稿者のプロフィール（キャラクタ属性）を推定する。
投稿文のキャラクタ属性は、対話シナリオのキャラクタ属性と同様の分類属性とする。キャラクタ属性は、発話文における特に動詞（例えば語尾）に影響する。 In order to classify an unspecified number of posted texts by character attribute, for example, as in Patent Document 1, a technique of estimating character attributes such as gender / age and birthplace can be used for each posted text. The profile (character attribute) of the poster is estimated using the exchange relationship between the posters that can be obtained from the SNS site server.
The character attribute of the posted text shall be the same classification attribute as the character attribute of the dialogue scenario. Character attributes affect verbs (eg, flexion) in particular in spoken sentences.

第１キャラクタ属性の投稿文は、置換元となる第１対話シナリオと同一又は類似のキャラクタ属性のユーザ（不特定多数又は特定人物）から投稿された投稿文である。
第２キャラクタ属性の投稿文は、置換先となる第２対話シナリオと同一又は類似のキャラクタ属性のユーザ（不特定多数又は特定人物）から投稿された投稿文である。 The posted text of the first character attribute is a posted text posted by a user (unspecified majority or specific person) having the same or similar character attribute as the first dialogue scenario to be replaced.
The post text of the second character attribute is a post text posted by a user (unspecified majority or specific person) with the same or similar character attribute as the second dialogue scenario to be replaced.

［意義素決定部１１］
意義素決定部１１は、第１キャラクタ属性に基づく対話シナリオの発話文に記述された「発話動詞」における「意義素」を決定し、意義素を同一とする複数の「類義動詞」を対応付ける。ここで、第１キャラクタ属性に基づく対話シナリオの発話文に記述された「発話動詞」について、複数の意義素が決定された場合、いずれか１つの意義素に決定する。 [Meaning element determination unit 11]
The meaning element determination unit 11 determines the "meaning element" in the "utterance verb" described in the utterance sentence of the dialogue scenario based on the first character attribute, and associates a plurality of "synonymous verbs" having the same meaning element. .. Here, when a plurality of meaning elements are determined for the "utterance verb" described in the utterance sentence of the dialogue scenario based on the first character attribute, one of the meaning elements is determined.

「意義素(sememe)」とは、単語の意味を構成する概念に基づく単位をいう。言語学では、「文の意味」と「語の意義」とを使い分けている。語の「意義」を構成する要素として、音素(phoneme)や形態素 (morpheme)に準じて意義素がある。 A "sememe" is a unit based on a concept that constitutes the meaning of a word. In linguistics, "meaning of sentence" and "meaning of word" are used properly. As an element that constitutes the "meaning" of a word, there is a meaning element according to a phoneme or a morpheme.

図４は、本発明における意義素決定部の機能構成図である。 FIG. 4 is a functional configuration diagram of the meaning element determination unit in the present invention.

図４によれば、意義素決定部１１は、第１キャラクタ属性の対話シナリオを入力し、決定した意義素及び類義語からなる意義素テーブルを出力する。
例えば入力される第１キャラクタ属性の対話シナリオとして、２０代・女性のエージェントの対話シナリオが入力されるとする。
（発話文）
「炒飯を食べるよ」 According to FIG. 4, the meaning element determination unit 11 inputs the dialogue scenario of the first character attribute, and outputs the meaning element table composed of the determined meaning elements and synonyms.
For example, it is assumed that a dialogue scenario of a female agent in her twenties is input as a dialogue scenario of the first character attribute to be input.
(Utterance)
"I'll eat fried rice"

図４によれば、意義素決定部１１は、品詞抽出部１１１と、類義動詞検索部１１２と、共起頻度検索部１１３とを有する。 According to FIG. 4, the meaning element determination unit 11 has a part of speech extraction unit 111, a synonymous verb search unit 112, and a co-occurrence frequency search unit 113.

＜品詞抽出部１１１＞
品詞抽出部１１１は、第１キャラクタ属性に基づく対話シナリオの発話文から、係り受け関係となる名詞と動詞とを抽出する。
品詞抽出部１１１は、例えば日本語係り受け解析器CaboChaを用いたものであってもよい。このような解析器に、文を入力すると、名詞及び動詞が出力される。
図４によれば、品詞抽出部１１１は、以下のように名詞及び動詞を抽出する。
（発話文）（名詞）（動詞）
「炒飯を食べるよ」「炒飯」「食べる」
品詞抽出部１１１は、動詞「食べる」を類義動詞検索部１１２へ出力し、名詞「炒飯」を出力する。 <Part of speech extraction unit 111>
The part-of-speech extraction unit 111 extracts nouns and verbs that have a dependency relationship from the utterance sentences of the dialogue scenario based on the first character attribute.
The part-speech extraction unit 111 may use, for example, the Japanese dependency analyzer CaboCha. When a sentence is input to such an analyzer, nouns and verbs are output.
According to FIG. 4, the part of speech extraction unit 111 extracts nouns and verbs as follows.
(Utterance) (Noun) (Verb)
"I'll eat fried rice""Friedrice""Eat"
The part-speech extraction unit 111 outputs the verb "eat" to the synonymous verb search unit 112, and outputs the noun "fried rice".

＜類義動詞検索部１１２＞
類義動詞検索部１１２は、「類義語コーパス」を用いて、意義素毎に、係り受け動詞に類似する「類義動詞」を検索する。 <Synonymous verb search unit 112>
The synonym verb search unit 112 searches for a "synonym verb" similar to a dependency verb for each meaning element by using the "synonym corpus".

類義語コーパスは、意義素と類義語とから構成され、例えば以下のようなものがある。
・Weblio類語辞典（非特許文献３参照）
・連想類語辞典（非特許文献４参照）
・類語辞書（非特許文献５参照）
例えば、類義動詞検索部１１２は、動詞「食べる」を入力した場合、動詞「食べる」に類似する１つ以上の類義動詞を検索する。
このとき、動詞「食べる」には、「ご飯を食べる」「魚を食べる」「野菜を食べる」のように「食事をする」という意義素（意味合い）と、「社長業で食っていく」「好きなことをして食っていく」のように「自活をする」という意義素とがある。そして、意義素毎に複数の類義動詞が検索される。
（動詞）（意義素）（類義動詞）
「食べる」「食事をする」食う
召し上がる
食事する
喫する
「自活をする」生活する
生きていく
食べる
複数の類義動詞「食う」「召し上がる」「食事する」「喫する」は、意義素「食事をする」を同一とする。また、複数の類義動詞「生活する」「生きていく」「食べる」は、意義素「自活をする」を同一とする。 The synonym corpus is composed of meaning elements and synonyms, and includes, for example, the following.
・ Weblio Thesaurus (see Non-Patent Document 3)
-Associative thesaurus (see Non-Patent Document 4)
-Synonyms dictionary (see Non-Patent Document 5)
For example, when the verb "eat" is input, the synonymous verb search unit 112 searches for one or more synonymous verbs similar to the verb "eat".
At this time, the verb "eat" has the meaning (meaning) of "eating" such as "eating rice", "eating fish", and "eating vegetables", and "eating in the president's business" and "eating". There is a meaning of "self-supporting" such as "do what you like and eat". Then, a plurality of synonymous verbs are searched for each meaning element.
(Verb) (Meaning element) (Synonymous verb)
"Eat""Eat" Eat
Eat
To eat
To enjoy
"Self-supporting" to live
Live
Eat Multiple synonymous verbs "eat", "eat", "eat", and "eat" have the same meaning "eat". In addition, the multiple synonymous verbs "living", "living", and "eating" have the same meaning "self-supporting".

類義動詞検索部１１２は、入力された動詞について、全ての意義素を導出し、意義素毎に全ての類義動詞を検索する。検索された意義素テーブルは、共起頻度検索部１１３へ出力される。 The synonymous verb search unit 112 derives all meaning elements for the input verb, and searches for all synonymous verbs for each meaning element. The searched meaning element table is output to the co-occurrence frequency search unit 113.

＜共起頻度検索部１１３＞
共起頻度検索部１１３は、第１キャラクタ属性に基づくユーザの投稿文の中から、「係り受け名詞と各類義動詞との共起頻度」を検索し、意義素毎に各類義動詞の出現頻度を計数する。そして、類義動詞の共起頻度が最も高い意義素を決定する。 <Co-occurrence frequency search unit 113>
The co-occurrence frequency search unit 113 searches for "co-occurrence frequency between the dependent noun and each synonymous verb" from the user's posted sentences based on the first character attribute, and searches for the co-occurrence frequency of each synonymous verb for each meaning element. Count the frequency of appearance. Then, the meaning element with the highest co-occurrence frequency of synonymous verbs is determined.

図４によれば、共起頻度検索部１１３は、例えば、類義動詞検索部１１２から動詞「食べる」の意義素テーブルを入力すると共に、品詞抽出部１１１から名詞「炒飯」を入力する。また、共起頻度検索部１１３は、投稿文蓄積部１０２を参照し、第１キャラクタ属性の投稿文を取得する。
ここで、共起頻度検索部１１３は、名詞「炒飯」と、意義素テーブルの各類義動詞とが共起する頻度が高い意義素を決定する。 According to FIG. 4, the co-occurrence frequency search unit 113 inputs, for example, the meaning element table of the verb “eat” from the synonymous verb search unit 112, and inputs the noun “fried rice” from the part of speech extraction unit 111. Further, the co-occurrence frequency search unit 113 refers to the posted sentence storage unit 102 and acquires the posted sentence of the first character attribute.
Here, the co-occurrence frequency search unit 113 determines a meaning element having a high frequency of co-occurrence between the noun "fried rice" and each synonymous verb in the meaning element table.

図４によれば、投稿文蓄積部１０２には、第１キャラクタ属性の投稿文が８件あるとする。その中で、以下の３件の投稿文について、名詞「炒飯」と、意義素「食べる」に類義する類義動詞とが共起している。
（投稿文）（名詞）（類義動詞）
「炒飯でも食べるよ」「炒飯」「食べる」
「炒飯食べて元気だそう」「炒飯」「食べる」
「炒飯を召し上がっちゃおう」「炒飯」「召し上がる」
図４によれば、名詞「炒飯」に対して共起頻度が高い類義動詞として「食べる」「召し上がる」がある。これら類義動詞は、意義素「食事をする」に対応する。そうすると、意義素「食事をする」と「自活をする」とを比較すると、共起頻度が最も高い意義素としては、「食事をする」となる。
即ち、共起頻度検索部１１３は、意義素「食事をする」を決定し、類義動詞「食う」「召し上がる」「食べる」「喫する」が対応付けられた意義素テーブルを出力する。 According to FIG. 4, it is assumed that the posted sentence storage unit 102 has eight posted sentences having the first character attribute. Among them, the noun "fried rice" and the synonymous verb synonymous with the meaning element "eat" co-occur in the following three posts.
(Posted text) (Noun) (Synonymous verb)
"Eat fried rice""Friedrice""Eat"
"Eat fried rice and feel fine""Friedrice""Eat"
"Let's eat fried rice""Friedrice""Eat"
According to FIG. 4, there are “eat” and “eat” as synonymous verbs that frequently co-occur with the noun “fried rice”. These synonymous verbs correspond to the meaning "eat". Then, when comparing the meaning elements "eat" and "self-support", the meaning element with the highest co-occurrence frequency is "eat".
That is, the co-occurrence frequency search unit 113 determines the meaning element "eat" and outputs a meaning element table associated with the synonymous verbs "eat", "eat", "eat", and "eat".

［特徴類義動詞選択部１２］
特徴類義動詞選択部１２は、第２キャラクタ属性に基づくユーザの投稿文を用いて、同一の意義素における複数の類義動詞の中で出現頻度が最も偏る類義動詞を、特徴類義動詞として選択する。 [Characteristic verb selection unit 12]
The feature synonymous verb selection unit 12 uses a user's post based on the second character attribute to select a synonymous verb whose appearance frequency is most biased among a plurality of synonymous verbs in the same meaning element. Select as.

第２キャラクタ属性は、新たに作成する第２対話シナリオのエージェントに基づくものである。そのエージェントが、特定人物（例えば芸能人やタレント）をユーザに想起させるものであってもよい。その場合、第２キャラクタ属性に基づくユーザの投稿文として、その特定人物が過去に投稿した投稿文を用いることもできる。これによって、不特定人物に基づく一般的な第１対話シナリオから、特定人物に基づく第２対話シナリオを自動的に作成することができる。 The second character attribute is based on the agent of the newly created second dialogue scenario. The agent may remind the user of a specific person (eg, entertainer or personality). In that case, as the user's post based on the second character attribute, the post posted by the specific person in the past can also be used. This makes it possible to automatically create a second dialogue scenario based on a specific person from a general first dialogue scenario based on an unspecified person.

図５は、本発明における特徴類義動詞選択部の説明図である。 FIG. 5 is an explanatory diagram of a characteristic synonymous verb selection unit in the present invention.

図５によれば、意義素決定部１１から意義素テーブルを入力すると共に、作成したい対話シナリオに基づく第２のキャラクタ属性の投稿文を参照する。
ここでは、１０代・男性を、第２キャラクタ属性と設定し、そのエージェントの対話シナリオを作成するとする。
特徴類義動詞選択部１２は、第２キャラクタ属性に基づくユーザの投稿文を用いて、同一の意義素における以下の複数の類義動詞について、出現頻度を検索する。
意義素「食事をする」
類義動詞「食う」「召し上がる」「食べる」「喫する」
そして、特徴類義動詞選択部１２は、類義動詞毎に、当該類義動詞が記述された第２キャラクタ属性の投稿文を検索し、出現頻度が最も偏る類義動詞を、特徴類義動詞として選択する。 According to FIG. 5, the meaning element table is input from the meaning element determination unit 11, and the posted text of the second character attribute based on the dialogue scenario to be created is referred to.
Here, it is assumed that a teenager / male is set as the second character attribute and a dialogue scenario of the agent is created.
The feature synonymous verb selection unit 12 searches for the frequency of appearance of the following plurality of synonymous verbs in the same meaning by using the user's posted sentence based on the second character attribute.
Meaning "Eat"
Synonymous verbs "eat""eat""eat""eat"
Then, the characteristic synonymous verb selection unit 12 searches for the post sentence of the second character attribute in which the synonymous verb is described for each synonymous verb, and selects the synonymous verb whose appearance frequency is most biased. Select as.

特徴類義動詞選択部１２は、以下のように、赤池情報量規準(ＡＩＣ：Akaike's Information Criterion)を用いて、キャラクタ属性P1に対する類義動詞P2の評価値V(P1,P2)を算出し、当該評価値V(P1,P2)が所定閾値以上となる、又は、当該評価値V(P1,P2)が大きいものから順に所定数となる、類義動詞P2を検出対象とする
p11：キャラクタ属性P1のユーザが、類義動詞P2に関する投稿文を投稿した数
p12：キャラクタ属性P1のユーザが、類義動詞P2以外の投稿文を投稿した数
p21：キャラクタ属性P1以外のユーザが、類義動詞P2に関する投稿文を投稿した数
p22：キャラクタ属性P1以外のユーザが、類義動詞P2以外の投稿文を投稿した数
p11＋p12＋p21＋p22＝全てのユーザが、投稿文を投稿した数
MLL_IM(P1,P2)＝(p11+p12) log(p11+p12)
＋(p11+p21) log(p11+p21)
＋(p21+p22) log(p21+p22)
＋(p12+p22) log(p12+p22)－2N log N
MLL_DM(P1,P2)＝p11 log p11＋p12 log p12＋p21 log p21＋p22 log p22－N log N
但し、N＝p11＋p12＋p21＋p22
AIC_IM(P1,P2)＝-2 × MLL_IM(P1,P2) ＋ 2×2
AIC_IM(P1,P2)：キャラクタ属性P1と類義動詞P2との組の従属AIC
AIC_DM(P1,P2)＝-2 × MLL_DM(P1,P2) ＋ 2×3
AIC_DM(P1,P2)：キャラクタ属性P1と類義動詞P2との組の独立AIC
V(P1,P2)＝AIC_IM(P1,P2) － AIC_DM(P1,P2) The feature synonymous verb selection unit 12 calculates the evaluation value V (P1, P2) of the synonymous verb P2 for the character attribute P1 using the Akaike's Information Criterion (AIC) as follows. The synonymous verb P2 whose evaluation value V (P1, P2) is equal to or greater than a predetermined threshold, or whose evaluation value V (P1, P2) is a predetermined number in descending order is targeted for detection.
p11: The number of posts related to the synonymous verb P2 by the user of the character attribute P1.
p12: The number of posted sentences other than the synonymous verb P2 by the user of the character attribute P1.
p21: The number of posts related to the synonymous verb P2 by users other than the character attribute P1.
p22: The number of posted sentences other than the synonymous verb P2 by users other than the character attribute P1.
p11 ＋ p12 ＋ p21 ＋ p22 ＝ Number of posts posted by all users
MLL_IM (P1, P2) = (p11 + p12) log (p11 + p12)
＋ (p11 + p21) log (p11 + p21)
＋ (p21 + p22) log (p21 + p22)
＋ (p12 + p22) log (p12 + p22) －2N log N
MLL_DM (P1, P2) ＝ p11 log p11 ＋ p12 log p12 ＋ p21 log p21 ＋ p22 log p22－N log N
However, N = p11 + p12 + p21 + p22
AIC_IM (P1, P2) = -2 x MLL_IM (P1, P2) + 2 x 2
AIC_IM (P1, P2): Subordinate AIC of a set of character attribute P1 and synonymous verb P2
AIC_DM (P1, P2) = -2 x MLL_DM (P1, P2) + 2 x 3
AIC_DM (P1, P2): Independent AIC of a pair of character attribute P1 and synonymous verb P2
V (P1, P2) = AIC_IM (P1, P2) -AIC_DM (P1, P2)

特徴類義動詞選択部１２は、赤池情報量規準に応じて、１件の類義動詞を選択する。ここで、上位数件の類義動詞を選択し、それらの中からランダムに１件の類義動詞を選択するものであってもよい。 The feature synonymous verb selection unit 12 selects one synonymous verb according to the Akaike information criterion. Here, the top several synonymous verbs may be selected, and one synonymous verb may be randomly selected from them.

図５によれば、キャラクタ属性の１０代・男性について、類義動詞毎の赤池情報量規準を算出すると、類義動詞「食う」に偏って出現頻度が多いことが理解できる。
最終的に、特徴類義動詞選択部１２は、動詞「食べる」と、類義動詞「食う」とを、動詞置換部１３へ出力する。 According to FIG. 5, when the Akaike information criterion for each synonymous verb is calculated for teenagers and males with character attributes, it can be understood that the frequency of appearance is biased toward the synonymous verb "eating".
Finally, the feature synonymous verb selection unit 12 outputs the verb “eat” and the synonymous verb “eat” to the verb replacement unit 13.

［動詞置換部１３］
動詞置換部１３は、第１キャラクタ属性に基づく対話シナリオの発話文に記述された当該発話動詞を、特徴類義動詞に置換して、第２キャラクタ属性の対話シナリオの発話文とする。 [Verb substitution part 13]
The verb replacement unit 13 replaces the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute with the characteristic synonymous verb to obtain the utterance sentence of the dialogue scenario of the second character attribute.

図６は、本発明における動詞置換部の説明図である。 FIG. 6 is an explanatory diagram of a verb substitution part in the present invention.

図６によれば、動詞置換部１３は、第１キャラクタ属性（２０代・女性）の対話シナリオの中で、動詞「食べる」を類義動詞「食う」に置換し、第２キャラクタ属性（１０代・男性）の対話シナリオとして作成する。
第１キャラクタ属性の対話テキストの発話文：「炒飯を食べるよ」
第２キャラクタ属性の対話テキストの発話文：「炒飯を食うよ」 According to FIG. 6, the verb replacement unit 13 replaces the verb “eat” with the synonymous verb “eat” in the dialogue scenario of the first character attribute (female in her twenties), and the second character attribute (10). Create as a dialogue scenario for teens and men).
Speech text of the dialogue text of the first character attribute: "I will eat fried rice"
The utterance of the dialogue text of the second character attribute: "I will eat fried rice"

前述したように、意義素決定部１１と、特徴類義動詞選択部１２と、動詞置換部１３とを繰り返すことによって、最終的に、異なるキャラクタ属性の対話シナリオから、所望のキャラクタ属性の対話シナリオを自動的に作成することができる。 As described above, by repeating the meaning element determination unit 11, the feature synonymous verb selection unit 12, and the verb replacement unit 13, finally, from the dialogue scenario of different character attributes, the dialogue scenario of the desired character attribute Can be created automatically.

図７は、本発明によって置換された対話シナリオを表す説明図である。 FIG. 7 is an explanatory diagram showing a dialogue scenario replaced by the present invention.

図７によれば、図１（ａ）のような２０代・女性のキャラクタ属性の対話シナリオから、１０代・男性のキャラクタ属性の対話シナリオを作成したものである。
ユーザ：「夜、ご飯どうするの？」
エージェント：「炒飯を食うよ」
ユーザは、１０代・男性のキャラクタ属性のエージェントと対話しており、「食うよ」と発話されることに違和感を生じない。１０代・男性であれば、荒っぽい発話をする場合が多いためである。 According to FIG. 7, a dialogue scenario with a character attribute of a teenager / male is created from a dialogue scenario with a character attribute of a woman in her twenties as shown in FIG. 1 (a).
User: "What do you do with rice at night?"
Agent: "I'll eat fried rice."
The user is interacting with an agent with a teenage / male character attribute, and does not feel uncomfortable when he / she is told "I will eat". This is because teenagers and men often make rough utterances.

ここで、ユーザから見て、動詞が置換された対話シナリオの発話文に対して、日本語に不自然な違和感を抱く場合もある。例えば、対話シナリオの発話文に以下のようなものがあるとする。
「炒飯を食べたい」-> 動詞：「食べ」、機能語：「たい」
置換される類義動詞「食う」が選択された場合、以下のような発話文が作成される。
「炒飯を食べたい」-> 「炒飯を食うたい」
このように、動詞の活用形を考慮せずに、動詞を置換するために、日本語として違和感が生じてしまう。 Here, from the user's point of view, Japanese may have an unnatural sense of discomfort with respect to the utterance sentence of the dialogue scenario in which the verb is replaced. For example, suppose that the utterance sentence of the dialogue scenario is as follows.
"I want to eat fried rice"-> Verb: "Eat", Function word: "Tai"
When the synonymous verb "eat" to be replaced is selected, the following utterance sentence is created.
"I want to eat fried rice"->"I want to eat fried rice"
In this way, because the verb is replaced without considering the inflected form of the verb, a sense of incongruity arises in Japanese.

図８は、本発明によって置換された動詞を修正するインタフェースを表す説明図である。 FIG. 8 is an explanatory diagram showing an interface for modifying a verb replaced by the present invention.

［発話動詞修正部１４］
発話動詞修正部１４は、対話部１００におけるユーザとの対話中に、対話シナリオの中で特徴類義動詞に置換された発話文がユーザに向けて発話された際に又はその後、ユーザに対して当該発話文に違和感を生じるか否かをテキスト又は音声によって問い合わせる。このとき、発話動詞修正部１４は、対話部１００を介して、端末２のユーザインタフェースに対して問い合わせ情報を送信する。
図８によれば、端末２の対話アプリケーション上に、以下のような、２択の問い合わせボタンが表示される。
「返答に違和感有り」
「返答に違和感無し」 [Utterance verb correction part 14]
The utterance verb correction unit 14 tells the user when or after the utterance sentence replaced with the characteristic synonymous verb in the dialogue scenario is spoken to the user during the dialogue with the user in the dialogue unit 100. Inquire by text or voice whether or not the utterance feels strange. At this time, the utterance verb correction unit 14 transmits inquiry information to the user interface of the terminal 2 via the dialogue unit 100.
According to FIG. 8, the following two-choice inquiry button is displayed on the interactive application of the terminal 2.
"There is something wrong with the reply"
"No discomfort in reply"

ユーザが違和感を生じると判定した際に、「返答に違和感有り」がタップされる。このタップ情報は、端末２から、対話シナリオ作成装置１へ送信される。そのタップ情報は、発話動詞修正部１４によって認識され、置換された特徴類義動詞を元の発話動詞に戻すべく再度置換する。
このように、エージェントがユーザと対話することによって、動詞の置換の正当性を判断していくことができる。 When the user determines that a feeling of strangeness is generated, "there is a feeling of strangeness in the reply" is tapped. This tap information is transmitted from the terminal 2 to the dialogue scenario creating device 1. The tap information is recognized by the utterance verb correction unit 14, and the replaced feature synonymous verb is replaced again to return to the original utterance verb.
In this way, the agent can interact with the user to determine the validity of the verb substitution.

以上、詳細に説明したように、本発明のプログラム、装置及び方法によれば、キャラクタ属性に応じた動詞に置換した対話シナリオを作成することができる。これによって、キャラクタ属性として違和感の無い対話シナリオを自動的に作成することができる As described above in detail, according to the program, device, and method of the present invention, it is possible to create a dialogue scenario in which verbs are replaced according to character attributes. This makes it possible to automatically create a dialogue scenario that does not feel strange as a character attribute.

前述した本発明の種々の実施形態について、本発明の技術思想及び見地の範囲の種々の変更、修正及び省略は、当業者によれば容易に行うことができる。前述の説明はあくまで例であって、何ら制約しようとするものではない。本発明は、特許請求の範囲及びその均等物として限定するものにのみ制約される。 Various modifications, modifications and omissions of the above-mentioned various embodiments of the present invention within the scope of the technical idea and viewpoint of the present invention can be easily carried out by those skilled in the art. The above explanation is just an example and does not attempt to limit anything. The present invention is limited only to the scope of claims and their equivalents.

１対話シナリオ作成装置
１００対話部
１０１対話シナリオ蓄積部
１０２投稿文蓄積部
１１意義素決定部
１１１品詞抽出部
１１２類義動詞検索部
１１３共起頻度検索部
１２特徴類義動詞選択部
１３動詞置換部
１４発話動詞修正部
２端末 1 Dialogue scenario creation device 100 Dialogue section 101 Dialogue scenario storage section 102 Post sentence storage section 11 Significance element determination section 111 Part of speech extraction section 112 Synonymous verb search section 113 Co-occurrence frequency search section 12 Characteristic verb substitution section 14 Speech verb correction part 2 Terminal

Claims

Dialogue scenario storage means that accumulates dialogue scenarios for each character attribute,
Post-sentence storage means for accumulating multiple posts by multiple users for each character attribute,
Meaning element determination means for determining the meaning element in the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute, and associating a plurality of synonymous verbs with the same meaning element.
Characteristic synonymous verb selection means for selecting a synonymous verb whose appearance frequency is most biased among multiple synonymous verbs in the same meaning element as a characteristic synonymous verb using a user's post based on the second character attribute. When,
Replacing the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute with a characteristic verb, and making the computer function as a verb replacement means to be the utterance sentence of the dialogue scenario of the second character attribute. A program featuring.

Meaning factor determination means
When multiple meanings are determined for the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute,
Using the synonym corpus, search for multiple synonymous verbs that have the same meaning for each meaning element.
In the user's post based on the first character attribute, the frequency of appearance of each synonymous verb is counted for each meaning element, and
The program according to claim 1, wherein the computer functions so as to determine the meaning element having the highest frequency of appearance of each synonymous verb.

Meaning factor determination means
Part-of-speech extraction means for extracting nouns and verbs that are dependent on each other from the utterance sentences of the dialogue scenario based on the first character attribute.
A synonym verb search method for searching for synonymous verbs similar to dependent verbs for each meaning element using a synonym corpus,
The co-occurrence frequency of searching the co-occurrence frequency of the dependent noun and each synonymous verb from the user's post based on the first character attribute, and determining the co-occurrence element with the highest co-occurrence frequency of each synonymous verb. The program according to claim 1 or 2, wherein the computer functions as a search means.

Characteristic verb selection means
As shown below, using the Akaike's Information Criterion (AIC), the evaluation value V (P1, P2) of the synonymous verb P2 for the character attribute P1 is calculated, and the evaluation value V (P1, P2). The synonymous verb P2 whose evaluation value V (P1, P2) is greater than or equal to the predetermined threshold or whose evaluation value V (P1, P2) is the largest is the predetermined number.
p11: The number of posts related to the synonymous verb P2 by the user of the character attribute P1.
p12: The number of posted sentences other than the synonymous verb P2 by the user of the character attribute P1.
p21: The number of posts related to the synonymous verb P2 by users other than the character attribute P1.
p22: The number of posted sentences other than the synonymous verb P2 by users other than the character attribute P1.
p11 ＋ p12 ＋ p21 ＋ p22 ＝ Number of posts posted by all users
MLL_IM (P1, P2) = (p11 + p12) log (p11 + p12)
＋ (p11 + p21) log (p11 + p21)
＋ (p21 + p22) log (p21 + p22)
＋ (p12 + p22) log (p12 + p22) －2N log N
MLL_DM (P1, P2) ＝ p11 log p11 ＋ p12 log p12 ＋ p21 log p21 ＋ p22 log p22－N log N
However, N = p11 + p12 + p21 + p22
AIC_IM (P1, P2) = -2 x MLL_IM (P1, P2) + 2 x 2
AIC_IM (P1, P2): Subordinate AIC of a set of character attribute P1 and synonymous verb P2
AIC_DM (P1, P2) = -2 x MLL_DM (P1, P2) + 2 x 3
AIC_DM (P1, P2): Independent AIC of a pair of character attribute P1 and synonymous verb P2
V (P1, P2) = AIC_IM (P1, P2) -AIC_DM (P1, P2)
The program according to any one of claims 1 to 3, wherein the computer functions as described above.

The posted text is a text posted on an SNS (Social Networking Service) or a blog by an unspecified number of users.
The program according to any one of claims 1 to 4, wherein the character attribute causes the computer to function as a classification attribute based on any one of age, gender, and birthplace, or a combination thereof.

A means of dialogue in which an agent interacts with a user using a dialogue scenario,
Whether the utterance sentence replaced with the characteristic synonymous verb in the dialogue scenario is uttered to the user during the dialogue with the user by the dialogue means, or after that, the utterance sentence causes discomfort to the user. When the user inquires by text or voice and determines that the user feels uncomfortable, the computer can be further functioned as an utterance verb correction means for replacing the replaced feature synonymous verb with the original utterance verb. The program according to any one of claims 1 to 5, which is characterized.

Dialogue scenario storage means that accumulates dialogue scenarios for each character attribute,
Post-sentence storage means for accumulating multiple posts by multiple users for each character attribute,
Meaning element determination means for determining the meaning element in the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute, and associating a plurality of synonymous verbs with the same meaning element.
Characteristic synonymous verb selection means for selecting a synonymous verb whose appearance frequency is most biased among multiple synonymous verbs in the same meaning element as a characteristic synonymous verb using a user's post based on the second character attribute. When,
It is characterized by having a verb replacement means for replacing the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute with a characteristic synonymous verb to be the utterance sentence of the dialogue scenario of the second character attribute. Dialogue scenario creation device.

It is a method of creating a dialogue scenario of the device.
The device is
Dialogue scenario storage means that accumulates dialogue scenarios for each character attribute,
It has a post sentence storage means for accumulating a plurality of post sentences by a plurality of users for each character attribute.
The first step of determining the meaning elements in the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute, and associating a plurality of synonymous verbs having the same meaning element,
The second step of selecting the synonymous verb whose appearance frequency is most biased among multiple synonymous verbs of the same meaning as the characteristic synonymous verb by using the user's posted sentence based on the second character attribute.
The third step of replacing the utterance verb described in the utterance sentence of the dialogue scenario based on the first character attribute with a characteristic synonymous verb and making it the utterance sentence of the dialogue scenario of the second character attribute is executed. How to create a dialogue scenario featuring.