JP7259349B2 - Dialogue device, dialogue method, and program

Info

Publication number: JP7259349B2
Application number: JP2019009482A
Authority: JP
Inventors: 淳一郎副島; 博康井手
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2019-01-23
Filing date: 2019-01-23
Publication date: 2023-04-18
Anticipated expiration: 2039-01-23
Also published as: JP2020119221A

Description

The present invention relates to a dialogue device, a dialogue method, and a program that output a response sentence in accordance with an input from a user.

Conventionally, as a dialogue device that outputs a response sentence in accordance with an input from a user, the device disclosed in Patent Document 1, for example, is known. This conventional dialogue device is applied to a vehicle navigation device and includes dialogue scenario data structured as a finite automaton. The conventional dialogue device carries out a dialogue with the user by following the dialogue scenario data in accordance with each input from the user.

Patent Document 1: JP-A-2003-329477

A dialogue device having this kind of finite automaton creates a response sentence for the user based on the response rule of the transition destination obtained from the dialogue scenario data. In casual conversation, however, the topic tends to change in many directions, so depending on how the dialogue scenario data is structured, the response rules of the transition destination alone may be insufficient.

This problem can be avoided by structuring the dialogue scenario data so that a wide range of topics can be obtained. In that case, however, the topic may switch abruptly, and the dialogue with the user may become unnatural.

Accordingly, an object of the present invention is to enable natural dialogue with a user.

A dialogue device according to one aspect comprises: acquisition means for acquiring input utterance information input from a predetermined target; response sentence output means for selecting rule data from a database that stores a plurality of rule data, each associating a set of a plurality of input words, a response sentence, an automaton state, and the automaton state to transition to next, the selection being made in accordance with a predetermined state including the current state of the automaton and with the plurality of input words contained in the input utterance information acquired by the acquisition means, and for outputting the response sentence contained in the selected rule data to the predetermined target; and storage control means for sequentially storing, in storage means, the predetermined state corresponding to the selected rule data. When the acquisition means acquires input utterance information after the storage control by the storage control means, and a plurality of rule data are eligible for selection, the response sentence output means preferentially selects, from among the rule data stored in the database, the rule data corresponding to the same state as a more recently stored one of the plurality of predetermined states stored in the storage means.

According to the present invention, it becomes possible to hold a natural dialogue with a user.

FIG. 1 is a block diagram showing one embodiment of the dialogue device.
FIG. 2 is a diagram showing an example of computer hardware that can implement the dialogue device.
FIG. 3 is a diagram showing examples of the data formats of control data, input word data, and rule data.
FIG. 4 is a main flowchart showing an example of dialogue processing.
FIG. 5 is a flowchart showing a detailed example of preprocessing.
FIG. 6 is a flowchart showing a detailed example of rule search processing.
FIG. 7 is a flowchart showing a detailed example of response sentence output processing.
FIG. 8 is a diagram (part 1) showing one embodiment of an automaton-based dialogue database.
FIG. 9 is a diagram (part 2) showing one embodiment of an automaton-based dialogue database.
FIG. 10 is a diagram (part 3) showing one embodiment of an automaton-based dialogue database.
FIG. 11 is a diagram showing an operation example of the dialogue device.

Embodiments of the present invention are described in detail below with reference to the drawings. FIG. 1 is a block diagram showing one embodiment of the dialogue device. The dialogue device 100 includes a database 101 (database), an acquisition unit 106 (acquisition means), an extraction unit 107 (extraction means), a response sentence output unit 103 (response sentence output means) that contains a data acquisition unit 102 (data acquisition means), and a storage unit 104 (storage means), and is configured to be able to converse with a predetermined target, for example a user. The dialogue device 100 can be used, for example, as a household speaker with a dialogue function or as the dialogue function built into a robot. In the case of a robot, the predetermined target may be, for example, another robot.

The database 101 stores a plurality of rule data 110. Each rule data 110 contains an assumed input word set (assumed input utterance), a response sentence, an automaton state number, and the state number of the automaton's next transition destination (hereinafter, the "next transition destination state number"). The assumed input word set consists of a plurality of input words expected to be input by the user, and the response sentence is the reply to the user's utterance. These four items are associated with one another and stored as the rule data 110. The automaton state number and the next transition destination state number determine which of the plurality of rule data 110 should be selected for a response during the dialogue with the user; the next transition destination state number represents (defines) the automaton state number contained in the rule data 110 to be selected at the next response. The state number and the next transition destination state number of a given rule data 110 may be the same or different. The state numbers, the next transition destination state numbers, and the corresponding response sentences of the rule data 110 are set so that natural responses are made while the topic of the dialogue with the user changes appropriately. A plurality of rule data 110 containing the same automaton state number may be stored in the database 101, for example so that user utterances on various topics can be handled.
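As an illustration of the four associated items, the following minimal Python sketch models one rule data record and groups records by state number. The field names borrow from the data format of FIG. 3(c) described later; the sample words, replies, and state numbers are invented for this example and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List


@dataclass(frozen=True)
class RuleData:
    """One rule: assumed input words, reply, own state, next state."""
    user_words: FrozenSet[str]  # assumed input word set
    bot_reply: str              # response sentence
    state_id: int               # automaton state number of this rule
    next_state_id: int          # next transition destination state number


# Hypothetical sample rules; several rules may share one state number.
rules: List[RuleData] = [
    RuleData(frozenset({"weather", "today"}), "It looks sunny today.", 1, 1),
    RuleData(frozenset({"lunch"}), "How about noodles?", 1, 2),
    RuleData(frozenset({"noodles", "like"}), "I like them too.", 2, 2),
]

# Index the rules by the state they belong to.
by_state: Dict[int, List[RuleData]] = {}
for rule in rules:
    by_state.setdefault(rule.state_id, []).append(rule)
```

Here the second rule belongs to state 1 but moves the automaton to state 2, which corresponds to a change of topic.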

The acquisition unit 106 acquires the user's input utterance information 115, for example via a microphone (not shown).

The extraction unit 107 digitizes the input utterance information 115 via an amplifier, an A/D (analog/digital) converter, and the like, converting it into digital speech. Next, the extraction unit 107 performs speech recognition on the digital speech to obtain the text data of the input sentence, and performs morphological analysis on that text data to extract the text data of an input word set 111 consisting of a plurality of input words, in a form in which the utterance is segmented word by word (for example, into nouns, verbs, adjectives, adverbs, and so on).

The response sentence output unit 103 selects response rule data 113 from the plurality of rule data 110 in accordance with a predetermined state including the current state of the automaton and with the input utterance information 115 acquired by the acquisition unit 106, and outputs the response sentence 114 contained in the selected response rule data 113 to the predetermined target.

Here, the response sentence output unit 103 may have a data acquisition unit 102 as follows. From among the plurality of rule data 110, the data acquisition unit 102 searches a plurality of rule data, fewer than the plurality of rule data corresponding to the predetermined state including the current state of the automaton, in accordance with the acquired input utterance information 115, and acquires response candidate rule data 112, that is, a plurality of rule data that are candidates for the response rule data 113. This search is performed, for example, as follows. First, rule data 110 whose assumed input word set is contained in the acquired input word set 111 are retrieved as provisional candidate data for the response candidate rule data 112. Here, an "assumed input word set contained in the input word set 111" is an assumed input word set whose word count is no greater than that of the input word set and all of whose words match some or all of the words of the input word set. The data acquisition unit 102 then searches the one or more provisional candidate data retrieved in this way for the response candidate rule data 112 by referring to the storage unit 104, as described later.
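The containment test described above amounts to a subset check. The sketch below is a minimal illustration under stated assumptions: rules are represented as plain (assumed_words, reply) tuples and words are compared as exact strings; both the representation and the sample data are hypothetical.

```python
def provisional_candidates(input_words, rules):
    """Return the rules whose assumed input word set is contained in
    the input word set (every assumed word also occurs in the input)."""
    input_set = set(input_words)
    return [rule for rule in rules if set(rule[0]) <= input_set]


# Hypothetical rules as (assumed_words, reply) tuples.
sample_rules = [
    (("weather", "today"), "It looks sunny today."),
    (("weather", "tomorrow"), "Rain is expected tomorrow."),
    (("today",), "Today is a good day."),
]

hits = provisional_candidates(["today", "weather", "nice"], sample_rules)
```

With this input, the first and third rules qualify, while the second is rejected because "tomorrow" does not occur in the utterance.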

The response sentence output unit 103 selects the response rule data 113 from the response candidate rule data 112 retrieved by the data acquisition unit 102, and outputs the response sentence contained in that response rule data 113 (see the response sentence in the rule data 110 of FIG. 1) as the response sentence 114 to the user's utterance. For the data of the response sentence 114 output in this way (for example, text data), speech corresponding to the response sentence 114 is produced by a speech synthesis processing unit via a D/A converter, an amplifier, and a speaker. When the dialogue device 100 is built into a robot as its dialogue function, for example, the robot has the dialogue device 100 produce this speech while a sensor or the like detects that the user is nearby. In addition, the response sentence output unit 103 sequentially stores the automaton's next transition destination state number contained in the response rule data 113 (see FIG. 1) in the storage unit 104 as a stack state number NSA. The storage unit 104 thus functions as a stack of automaton state numbers. The plurality of stack state numbers NSA stored in the storage unit 104 represent the history of the automaton state numbers traversed as the topic of the user's utterances progressed, and the most recently stored stack state number NSA indicates the current state number of the automaton (that is, for example, the current topic).

Here, from among the one or more provisional candidate data (rule data 110) retrieved as described above, the data acquisition unit 102 retrieves as response candidate rule data 112 those whose corresponding automaton state number (see FIG. 1) matches any of the plurality of stack state numbers NSA in the storage unit 104. Each stack state number NSA in the storage unit 104 indicates an automaton state number (that is, for example, a topic) traversed as the topic of the user's utterances progressed. The data acquisition unit 102 therefore selects the response candidate rule data 112 from those rule data 110 in the database 101 that contain the same state number as an automaton state number that has occurred so far (a stack state number NSA), that is, for example, from rule data 110 on topics that have already come up.

Ideally, the search finds rule data 110 that contain the same state number as the stack state number NSA most recently stored in the storage unit 104, that is, the current state number of the automaton. However, the automaton state number contained in the rule data 110 that correspond to the user's input word set 111 does not always match the current state number of the automaton. In such a case, the data acquisition unit 102 retrieves as response candidate rule data 112 those rule data 110 that contain an automaton state number matching one of the previously traversed state numbers indicated by the stack state numbers NSA in the storage unit 104 and that have an assumed input word set corresponding to the input word set 111. With this configuration, the dialogue device 100 can retrieve response candidate rule data 112 that follow the flow of the conversation with the user.
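The role of the stack can be sketched as follows. This is a simplified illustration, not the patented implementation: the stack is a plain Python list, provisional candidates are dictionaries with a hypothetical "state_id" key, and the sample state numbers are invented.

```python
def push_state(state_stack, next_state_id):
    """Record the next transition destination state after a response."""
    state_stack.append(next_state_id)


def filter_by_history(provisional, state_stack):
    """Keep only candidates whose state number appears in the stack,
    i.e. rules that belong to a state visited so far."""
    visited = set(state_stack)
    return [rule for rule in provisional if rule["state_id"] in visited]


state_stack = []
push_state(state_stack, 1)  # first topic
push_state(state_stack, 2)  # topic changed; state 2 is now current

provisional = [
    {"state_id": 1, "reply": "Back to the earlier topic."},
    {"state_id": 3, "reply": "A topic not visited yet."},
    {"state_id": 2, "reply": "Staying on the current topic."},
]
candidates = filter_by_history(provisional, state_stack)
```

The rule for state 3 is dropped because that state has not been visited, and the last element of the list corresponds to the current state of the automaton.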

In this case, when there are a plurality of response candidate rule data 112 (rule data 110) whose corresponding automaton state number matches any of the plurality of stack state numbers NSA in the storage unit 104 as described above, the data acquisition unit 102 further calculates a score for each response candidate rule data 112 such that the response candidate rule data 112 corresponding to a state number matching a more recently stored stack state number NSA are preferentially selected as the response rule data 113.

More specifically, the data acquisition unit 102 sets a weighting coefficient for each input word in the input word set 111 by a predetermined method, for example the TF-IDF method described later. The weighting coefficients of the input words are thus set to the same value or to different values according to the importance of each word. Using these weighting coefficients, the data acquisition unit 102 calculates, for each response candidate rule data 112, the cosine similarity indicating how similar the assumed input word set contained in that rule data 110 (see the assumed input word set in the rule data 110 of FIG. 1) is to the input word set 111. From the calculated cosine similarity, the data acquisition unit 102 calculates, for each response candidate rule data 112, a score that serves as the index for selecting the response rule data 113 (the cosine similarity is used as the score). Furthermore, among the plurality of response candidate rule data 112 (rule data 110), the data acquisition unit 102 makes the score smaller for those whose state number matches a stack state number NSA stored further in the past in the storage unit 104. The response sentence output unit 103 then selects the response candidate rule data 112 with the highest score as the response rule data 113. With this configuration, the response sentence output unit 103 can preferentially select, from the one or more response candidate rule data 112 retrieved by the data acquisition unit 102, those corresponding to a state number closer to the current state number of the automaton (that is, for example, a topic closer to the current topic), which makes it possible to provide a dialogue device 100 that converses naturally and stays on topic with the user.
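The scoring step can be sketched as below. Several details are assumptions made only for illustration, because this passage does not specify them: rule words take the same weight as the matching input word (1.0 if unmatched), the penalty for older stack entries is a multiplicative `decay` factor per stack depth, and candidates are (assumed_words, state_id, reply) tuples.

```python
import math


def cosine_score(input_weights, rule_words):
    """Weighted cosine similarity between the input word set and a
    rule's assumed input word set (assumed weighting scheme)."""
    dot = sum(input_weights[w] ** 2 for w in rule_words if w in input_weights)
    norm_in = math.sqrt(sum(v * v for v in input_weights.values()))
    norm_rule = math.sqrt(sum(input_weights.get(w, 1.0) ** 2 for w in rule_words))
    if norm_in == 0 or norm_rule == 0:
        return 0.0
    return dot / (norm_in * norm_rule)


def select_rule(input_weights, candidates, state_stack, decay=0.5):
    """Pick the highest-scoring candidate; states stored longer ago in
    the stack are penalised (decay is a hypothetical parameter)."""
    newest_first = state_stack[::-1]
    best_reply, best_score = None, -1.0
    for rule_words, state_id, reply in candidates:
        if state_id not in newest_first:
            continue  # only states present in the stack are eligible
        depth = newest_first.index(state_id)  # 0 = current state
        score = cosine_score(input_weights, rule_words) * decay ** depth
        if score > best_score:
            best_reply, best_score = reply, score
    return best_reply, best_score


weights = {"noodles": 2.0, "like": 1.0}       # hypothetical word weights
cands = [({"noodles"}, 1, "old topic"), ({"noodles"}, 2, "current topic")]
reply, score = select_rule(weights, cands, state_stack=[1, 2])
```

Both candidates have the same cosine similarity here, so the one whose state was stored most recently wins because it receives no penalty.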

In addition to the configuration described so far, the device may further include a response sentence storage unit 105 (response sentence storage means) that stores a predetermined number of past response sentences 114 generated from the selected response rule data 113. By referring to the response sentence storage unit 105, the data acquisition unit 102 may change the priority (for example, the score described above) with which each of the one or more response candidate rule data 112 is selected as the response rule data 113, according to how many responses ago the response sentence 114 of that response candidate rule data 112 was generated. This configuration of the dialogue device 100 prevents the response sentence 114 of the same response rule data 113 from being output repeatedly.
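One way to picture this priority adjustment is a bounded history that scales a candidate's score down when its reply was used recently. The class name, the history size, and the linear penalty formula below are all assumptions chosen for illustration; the passage only states that the priority may depend on how many responses ago the reply was generated.

```python
from collections import deque


class ReplyHistory:
    """Remembers the last `size` replies and returns a score multiplier
    that is smaller the more recently a reply was used."""

    def __init__(self, size=3):
        self._recent = deque(maxlen=size)

    def remember(self, reply):
        self._recent.append(reply)

    def penalty(self, reply):
        # age 0 is the most recent reply; unseen replies are not penalised.
        for age, past in enumerate(reversed(self._recent)):
            if past == reply:
                return (age + 1) / (self._recent.maxlen + 1)
        return 1.0


history = ReplyHistory(size=3)
history.remember("How about noodles?")
history.remember("It looks sunny today.")
```

Multiplying each candidate's score by `history.penalty(reply)` before taking the maximum would make an immediate repetition the least attractive choice.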

In addition to the configuration described so far, the database 101 may contain a plurality of predetermined rules for unanticipated input (the wildcard list and super wildcard list described later). When the content of the user's utterance is unanticipated, that is, when the words of the input word set 111 do not include the words of the assumed input word set of any rule data 110, the data acquisition unit 102 may acquire, as the response candidate rule data, the rule for unanticipated input that corresponds to the previously selected response rule data.

FIG. 2 is a diagram showing an example of computer hardware that can implement the dialogue device 100 of FIG. 1. This computer may be an ordinary personal computer or a smartphone, tablet terminal, digital camera, or the like. The computer shown in FIG. 2 has a CPU (Central Processing Unit) 201, a memory 202, an input device 203, an output device 204, an auxiliary information storage device 205, a medium drive device 206 into which a portable recording medium 211 is inserted, a network connection device 207, an audio input device 208, and an audio output device 209. These components are interconnected by a bus 210. The configuration shown in FIG. 2 is one example of a computer that can implement the dialogue device 100 of FIG. 1, and such a computer is not limited to this configuration.

The memory 202 is, for example, a semiconductor memory such as a Read Only Memory (ROM), a Random Access Memory (RAM), or a flash memory, and stores, for example, the programs corresponding to the processing of the flowcharts of FIGS. 4 to 7 described later and the various data corresponding to FIG. 3 described later.

The CPU (processor) 201 operates as each processing block shown in FIG. 1 by, for example, using the memory 202 to execute the programs used in the dialogue device 100 of FIG. 1 that correspond to the processing of the flowcharts of FIGS. 4 to 7 described later.

The input device 203 is, for example, a touch panel input device, and is used by an operator or user to input instructions or information. The output device 204 is, for example, a display device such as a liquid crystal display (LCD) formed integrally with the touch panel input device, and is used to output queries to the operator or user or to output processing results.

The auxiliary information storage device 205 is, for example, a semiconductor storage device, a hard disk storage device, a magnetic disk storage device, an optical disk device, or a magneto-optical disk device, and operates, for example, as the storage device that holds the database 101 described with reference to FIG. 1, or as the storage unit 104 or the response sentence storage unit 105 of FIG. 1. The dialogue device 100 of FIG. 1 may store in the auxiliary information storage device 205 the programs it uses, for example those that execute the processing of the flowcharts of FIGS. 4 to 7, and data such as the various data described later with reference to FIG. 3, and load them into the memory 202 for use.

The medium drive device 206 drives the portable recording medium 211 and accesses its recorded contents. The portable recording medium 211 is a memory device, a flexible disk, an optical disk, a magneto-optical disk, or the like. The portable recording medium 211 may also be a Compact Disk Read Only Memory (CD-ROM), a Digital Versatile Disk (DVD), a Universal Serial Bus (USB) memory, or the like. An operator or user can store the above programs and data on the portable recording medium 211 and load them into the memory 202 for use.

Thus, the computer-readable recording medium that stores the above programs and data is a physical (non-transitory) recording medium such as the memory 202, the auxiliary information storage device 205, or the portable recording medium 211.

The network connection device 207 is a communication interface that is connected to a communication network such as a Local Area Network (LAN) and performs the data conversion that accompanies communication. The dialogue device 100 of FIG. 1 can receive the above programs or data from an external device via the network connection device 207 and load them into the memory 202 for use.

The audio input device 208 comprises a microphone and amplifier that capture the user's speech as an analog input audio signal, an A/D (analog/digital) converter that converts the analog input audio signal into a digital input audio signal, an interface circuit that passes the digital input audio signal to the CPU 201 as the input from the user, and the like.

The audio output device 209 comprises a speech synthesis processing device that synthesizes a digital audio signal corresponding to the response sentence 114 generated by the dialogue device 100 of FIG. 1, a D/A (digital/analog) converter that converts the digital audio signal into an analog audio signal, an amplifier and speaker that emit the analog audio signal toward the user, and the like.

The dialogue device 100 of FIG. 1 need not include all of the components of FIG. 2, and some components may be omitted depending on the application or conditions. For example, if there is no need to input instructions or information from an operator or user, part or all of the input device 203 and part or all of the output device 204 may be omitted. If the portable recording medium 211 or the communication network is not used, the medium drive device 206 or the network connection device 207 may be omitted.

FIG. 3 is a diagram showing examples of the formats of the main data needed to control the dialogue device 100 of FIG. 1 as implemented by the computer of FIG. 2.

FIG. 3(a) shows an example data format of the control data. The following data are stored in the memory 202 of FIG. 2 in order, starting from the address of the head pointer Ctrl of the control data.

transition: Pointer to the dialogue database. This points to the head of the rule data 110 in the database 101 of FIG. 1, which is stored in the auxiliary information storage device 205 of FIG. 2 or the like. As described above with reference to FIG. 1, the rule data 110 manage, for each automaton state number, the set of words the user is expected to utter (assumed input word set), the corresponding response sentence, and the state number the automaton transitions to next (next transition destination state number).

inputWordCount: Input word count. The number of words (morphemes) in the input user utterance.

inputWords[inputWordCount]: Pointer to the input word data of FIG. 3(b). This is the address of the group of words (morphemes) contained in the input user utterance, an array of input word data with inputWordCount elements.

transCandidates: Response candidate rule data list. A list for storing the one or more response candidate rule data 112 (see FIG. 1) that satisfy the conditions of the current input. Each list element contains a pointer to a rule data transition[i] (see FIG. 3(c)) as well as the matching word count and the score of that response candidate rule data 112.

state_id: Stack array of state numbers. A stack array that manages the stack state numbers NSA (see FIG. 1) stored in the storage unit 104.

score_coef: Evaluation coefficient. The input-side denominator component used when calculating the cosine similarity (referred to here as the cosine distance) for evaluating each response candidate rule data 112.

FIG. 3(b) shows an example data format of the input word data representing the input word set 111 of FIG. 1 detected from the user's utterance. The head pointer of each input word data is given by inputWords[i] (i = 0, 1, 2, ...), and for each input word data the following data are stored in the memory 202 of FIG. 2 in order, starting from this address.

ｗｏｒｄ：入力単語。形態素解析処理により設定される、形態素単位のテキストデータ列である。 word: Input word. This is a morpheme-based text data string set by morpheme analysis processing.

weight: Weighting factor. A coefficient indicating the importance of the word within the rule. Its magnitude depends on, for example, the part of speech: nouns and verbs receive large values, and particles and the like receive small values.

ｐｒｅｖ：前ポインタ。ユーザの発話において、現在の入力単語の直前に発音された入力単語へのポインタである。 prev: previous pointer. A pointer to the input word pronounced just before the current input word in the user's utterance.

ｎｅｘｔ：次ポインタ。ユーザの発話において、現在の入力単語の直後に発音された入力単語へのポインタである。 next: next pointer. A pointer to the input word pronounced immediately after the current input word in the user's utterance.

FIG. 3(c) is an example data format of the rule data 110 stored in the database 101 of FIG. 1. The head pointer of each item of rule data 110 is given by transition[i] (i = 0, 1, 2, ...), and for each item the following data are stored, in order from that address, in the database 101 of FIG. 1, which resides, for example, in the auxiliary information storage device 205 of FIG. 2 (or in the memory 202).

userWordCount: Assumed input word count. The number of assumed input words given as the input of the rule data 110.

userWords[userWordCount]: An array of the text data of the assumed input words given as the input of the rule data 110. It corresponds to the assumed input word set in the rule data 110 of FIG. 1.

state_id: Current state number of the automaton. The number of the state to which the rule data 110 belongs. It corresponds to the state number in the rule data 110 of FIG. 1.

bot_reply: Response sentence. The text data of the response sentence that is the output of the rule data 110. It corresponds to the response sentence in the rule data 110 of FIG. 1.

next_state_id: Next transition destination state number. The number of the state to which the automaton transitions after the rule data 110 is selected. It corresponds to the next transition destination state number in the rule data 110 of FIG. 1.

ｐｒｅｖ：前ポインタ。現在のルールデータ１１０の直前に接続されるルールデータ１１０へのポインタである。 prev: previous pointer. A pointer to the rule data 110 connected immediately before the current rule data 110 .

ｎｅｘｔ：次ポインタ。現在のルールデータ１１０の直後に接続されたルールデータ１１０へのポインタである。 next: next pointer. A pointer to the rule data 110 connected immediately after the current rule data 110 .
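The three data formats of FIG. 3 can be pictured as plain records. The following is a minimal Python sketch, assuming lists stand in for the pointer arrays and for the stack; the class names `Control`, `InputWord`, `Rule`, and `Candidate` are hypothetical, while the field names follow the identifiers in the text.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class InputWord:
    """Input word data, FIG. 3(b)."""
    word: str                     # morpheme text
    weight: float = 0.0           # importance of the word
    # prev/next link the entries in utterance order; excluded from repr/compare
    prev: Optional["InputWord"] = field(default=None, repr=False, compare=False)
    next: Optional["InputWord"] = field(default=None, repr=False, compare=False)


@dataclass
class Rule:
    """Rule data 110, FIG. 3(c). The prev/next links are replaced by list order."""
    userWords: List[str]          # assumed input word set
    state_id: int                 # state the rule belongs to
    bot_reply: str                # response sentence
    next_state_id: int            # next transition destination state number

    @property
    def userWordCount(self) -> int:
        return len(self.userWords)


@dataclass
class Candidate:
    """One element of transCandidates (hypothetical record layout)."""
    rule: Rule
    matched: int = 0              # matching word count
    score: float = 0.0


@dataclass
class Control:
    """Control data, FIG. 3(a)."""
    transition: List[Rule] = field(default_factory=list)
    inputWords: List[InputWord] = field(default_factory=list)
    transCandidates: List[Candidate] = field(default_factory=list)
    state_id: List[int] = field(default_factory=list)   # stack, newest last
    score_coef: float = 0.0

    @property
    def inputWordCount(self) -> int:
        return len(self.inputWords)


ctrl = Control()
ctrl.transition.append(Rule(["animal", "talk"], 0, "Yes, let's talk about animals", 1))
ctrl.state_id.append(0)
```

Deriving the two counts from the list lengths is a simplification of this sketch; the text stores them as separate fields.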

FIGS. 4 to 7 are flowcharts showing an example of the dialogue processing executed by the computer of FIG. 2 to realize the operation of the dialogue device 100 of FIG. 2. In this processing, the CPU 201 of FIG. 2 executes a dialogue processing program stored in the memory 202 while using the various data described with reference to FIG. 3, which are stored in the memory 202 or the auxiliary information storage device 205.

FIG. 4 is a main flowchart showing an example of the dialogue processing. When a power switch (not shown) is turned on and the system starts, the CPU 201 initializes the variables in the memory 202 and loads the necessary data from the auxiliary information storage device 205 into the memory 202 (step S401).

次に、ＣＰＵ２０１は、図３（ａ）の制御データ上のオートマトンの状態番号のスタック配列ｓｔａｔｅ＿ｉｄに、初期状態を表す状態番号０をプッシュする（ステップＳ４０２）。 Next, the CPU 201 pushes the state number 0 representing the initial state to the stack array state_id of the automaton state numbers on the control data in FIG. 3A (step S402).

ステップＳ４０２の後、ＣＰＵ２０１は、ユーザ発話の入力（ステップＳ４０３）と、対話装置の終了指示（ステップＳ４０４）の待機状態となる（ステップＳ４０３とステップＳ４０４の繰返し処理）。 After step S402, the CPU 201 waits for input of a user's speech (step S403) and an instruction to end the interactive device (step S404) (repetitive processing of steps S403 and S404).

ユーザが発話を行うと、図２の音声入力装置２０８において、ユーザが喋る音声がマイクロフォン／アンプでアナログ入力音声信号として入力され、Ａ／Ｄ変換器においてそのアナログ入力音声信号がデジタル入力音声信号に変換され、図２のバス２１０を介してＣＰＵ２０１に送られる。そして、ＣＰＵ２０１が、このデジタル入力音声信号の所定閾値以上のパワーを検出したときに、ユーザ発話の入力が検出され、ステップＳ４０４の判定がＹＥＳとなる。ＣＰＵ２０１は、前処理（ステップＳ４０５）、ルール検索処理（ステップＳ４０６）、及び応答文出力処理（ステップＳ４０７）を順次実行し、その後、ステップＳ４０３とＳ４０４の待機処理に戻る。 When the user speaks, in the voice input device 208 of FIG. 2, the voice spoken by the user is input as an analog input voice signal through a microphone/amplifier, and the analog input voice signal is converted into a digital input voice signal in the A/D converter. It is converted and sent to CPU 201 via bus 210 in FIG. Then, when the CPU 201 detects the power of the digital input audio signal equal to or higher than the predetermined threshold, the input of the user's utterance is detected, and the determination in step S404 becomes YES. The CPU 201 sequentially executes preprocessing (step S405), rule search processing (step S406), and response sentence output processing (step S407), and then returns to standby processing in steps S403 and S404.

When the user turns off the power switch (not shown) and an instruction to terminate the dialogue device 100 is thereby issued, the determination in step S403 becomes YES. The CPU 201 then releases the areas it secured in the memory 202, ends the dialogue processing shown in FIG. 4, and shuts down the system.

図５は、図４のステップＳ４０５の前処理の詳細例を示すフローチャートである。この前処理では主に、図３（ｂ）の入力単語データｉｎｐｕｔＷｏｒｄｓ［０］、ｉｎｐｕｔＷｏｒｄｓ［１］、・・・を作成する処理が実行される。 FIG. 5 is a flow chart showing a detailed example of preprocessing in step S405 of FIG. In this preprocessing, processing for creating input word data inputWords[0], inputWords[1], . . . in FIG. 3B is mainly executed.

まず、ＣＰＵ２０１は、メモリ２０２上にある図３（ａ）及び（ｂ）の各データ（変数）を初期化する（ステップＳ５０１）。 First, the CPU 201 initializes each data (variable) in FIGS. 3A and 3B on the memory 202 (step S501).

Next, the CPU 201 first performs speech recognition on the digital voice data based on the user's utterance input in step S404 of FIG. 4 to create text data of the input sentence, and then performs morphological analysis on that text data to extract an input word group consisting of a plurality of words divided into morphemes (step S502). This input word group corresponds to the input word set 111 of FIG. 1.

次に、ＣＰＵ２０１は、初期値１を、メモリ２０２上の変数であるスコア係数にセットする（ステップＳ５０３）。スコア係数については、後述する。 Next, the CPU 201 sets the initial value 1 to the score coefficient, which is a variable on the memory 202 (step S503). The score coefficient will be described later.

Next, starting from the first morpheme extracted by the morphological analysis in step S502 (step S504) and advancing to the next morpheme in step S509, the CPU 201 repeats the series of processes in steps S506 to S508 below for each morpheme until it determines in step S505 that all morphemes have been processed.

First, the CPU 201 increments the input word count inputWordCount in the control data of FIG. 3(a). The CPU 201 also creates a new entry (storage area) of the input word data of FIG. 3(b) (for example, inputWords[i]) in the memory 202 and registers the text of the morpheme acquired in step S504 or S509 as the input word (word) of that entry (step S506). In doing so, the CPU 201 sets the previous pointer prev of the new entry to the head address of the entry created immediately before it, and sets the next pointer next of that preceding entry to the head address of the new entry, thereby building a list of input word data entries linked in the order of the user's utterance.
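The linking performed in step S506 can be sketched as follows. This is a minimal Python illustration, assuming the morphemes have already been produced by some morphological analyzer; the class `InputWord` and the function `build_input_words` are hypothetical names, and object references stand in for the head addresses.

```python
from typing import List, Optional


class InputWord:
    """One entry of the input word data (word, weight, prev, next)."""

    def __init__(self, word: str) -> None:
        self.word = word
        self.weight = 0.0                      # filled in later (step S507)
        self.prev: Optional["InputWord"] = None
        self.next: Optional["InputWord"] = None


def build_input_words(morphemes: List[str]) -> List[InputWord]:
    """Create one entry per morpheme and link neighbours in utterance order."""
    entries: List[InputWord] = []
    for text in morphemes:
        entry = InputWord(text)
        if entries:
            entry.prev = entries[-1]           # new entry points back
            entries[-1].next = entry           # preceding entry points forward
        entries.append(entry)                  # len(entries) plays inputWordCount
    return entries


words = build_input_words(["animal", "of", "talk"])
```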

Next, the CPU 201 determines a weighting factor for the input word corresponding to the morpheme acquired in step S504 or S509 and sets it as the weighting factor weight of the new entry of the input word data of FIG. 3(b) created in the memory 202 in step S506 (step S507). This weighting factor can be regarded as indicating the importance, within the input document, of the input word of the corresponding morpheme. The importance of a word in a document can be determined by the TF-IDF method, which combines two well-known techniques: TF (Term Frequency), which assigns a larger (more important) value to a word the more often it appears in the input document, and IDF (Inverse Document Frequency), which assigns a smaller value to words that are used across many documents. The weighting factor may therefore be set by the TF-IDF method. The importance of a word in a document also depends on the part of speech of the morpheme; for example, nouns and verbs are assigned large values and particles and the like small values. Accordingly, the weighting factor may be set from, for example, a weighting factor table for each part of speech or an IDF table held in the auxiliary information storage device 205 of FIG. 2.
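One way to picture the weight assignment of step S507 is a lookup that combines a part-of-speech table with an IDF value. The sketch below is only an illustration under several assumptions: the table values, the smoothed IDF formula, and the choice to multiply the two factors are not taken from the text, and all names (`POS_WEIGHT`, `idf`, `word_weight`) are hypothetical.

```python
import math
from typing import Dict

# Hypothetical part-of-speech weight table: content words high, particles low.
POS_WEIGHT: Dict[str, float] = {"noun": 1.0, "verb": 1.0, "particle": 0.1}
DEFAULT_POS_WEIGHT = 0.5  # assumed fallback for parts of speech not listed


def idf(word: str, doc_freq: Dict[str, int], num_docs: int) -> float:
    """Smoothed inverse document frequency: rarer words get larger values."""
    return math.log((1 + num_docs) / (1 + doc_freq.get(word, 0))) + 1.0


def word_weight(word: str, pos: str, doc_freq: Dict[str, int], num_docs: int) -> float:
    """Assumed combination: part-of-speech weight times IDF."""
    return POS_WEIGHT.get(pos, DEFAULT_POS_WEIGHT) * idf(word, doc_freq, num_docs)


# Toy statistics: "cat" is rare, the particle "wa" appears in every document.
doc_freq = {"cat": 1, "wa": 100}
w_cat = word_weight("cat", "noun", doc_freq, 100)
w_wa = word_weight("wa", "particle", doc_freq, 100)
```

With these toy numbers the noun receives a much larger weight than the particle, which is the ordering the text describes.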

そして、ＣＰＵ２０１は、ステップＳ５０７で取得した重み係数の２乗値を、ステップＳ５０３で初期設定したメモリ２０２上のスコア係数に加算する（ステップＳ５０８）。スコア係数については後述する。 Then, the CPU 201 adds the square value of the weighting factor acquired in step S507 to the score factor in the memory 202 initialized in step S503 (step S508). Score coefficients will be described later.

The series of processes in steps S506 to S508 is executed, through the loop of steps S505 to S509, for every morpheme obtained from the input, whereby the input word data inputWords[0], inputWords[1], ... of FIG. 3(b) are created. As shown in FIG. 3(b), each item of input word data inputWords[i] (i = 0, 1, ...) consists of the input word word, which is the text data of the morpheme obtained by the morphological analysis, the weighting factor weight acquired for that input word in step S507, and the pointers prev and next to the preceding and following entries.

When all morphemes have been processed and the determination in step S505 becomes YES, the CPU 201 calculates the square root of the score coefficient, a variable in the memory 202 whose final value was obtained in step S508, takes its reciprocal, and sets the result as the evaluation coefficient (the denominator of the evaluation formula) score_coef in the control data of FIG. 3(a) (step S510). Calculating score_coef in this way amounts, in substance, to obtaining a cosine distance, that is, a correlation coefficient.

The covariance (numerator) and the standard deviation on the rule data side are calculated during the rule search processing described later. The variance corresponding to the input word set 111 on the input side, however, is common to all rules and appears as a component of the denominator. Therefore, after the initialization in step S503, the squares of the weighting factors of the input words are accumulated in step S508 to obtain the score coefficient, and in step S510 its square root is inverted in advance and stored as the evaluation coefficient score_coef of FIG. 3(a). This allows the operation that converts the score of each item of response candidate rule data 112 into a cosine similarity (see step S609 of FIG. 6), described later, to be carried out by multiplication rather than by computationally expensive division. The CPU 201 then ends the preprocessing of step S405 of FIG. 4, illustrated by the flowchart of FIG. 5.
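The arithmetic of steps S503, S508, and S510 is small enough to write out. The following Python sketch assumes the weights of the input words are already known; the function name `compute_score_coef` is hypothetical.

```python
import math
from typing import Iterable


def compute_score_coef(weights: Iterable[float]) -> float:
    """Return the evaluation coefficient: 1 / sqrt(1 + sum of squared weights)."""
    score_coefficient = 1.0                 # step S503: initial value 1
    for w in weights:
        score_coefficient += w * w          # step S508: add the squared weight
    return 1.0 / math.sqrt(score_coefficient)   # step S510: reciprocal of the root


# Three input words: two content words and one particle.
score_coef = compute_score_coef([1.0, 1.0, 0.1])
```

Because the reciprocal is taken once per utterance, each candidate rule later needs only one multiplication by `score_coef`.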

FIG. 6 is a flowchart showing a detailed example of the rule search processing in step S406 of FIG. 4. This rule search processing realizes the processing function of the data acquisition unit 102 of FIG. 1. As explained in the description of the data acquisition unit 102 of FIG. 1, the data acquisition unit 102 operates according to a state model called an automaton while referring to the rule data 110 stored in the database 101. Likewise, the rule search processing of FIG. 6 operates according to the state model of the automaton while referring to the rule data 110 of FIG. 3(c) stored in the database 101 in the memory 202 or the auxiliary information storage device 205. Here, an automaton is a model consisting of a combination of states, transitions, and actions. At any given time it is in exactly one state, called the "current state", and it "transitions" from one state to another in response to some event or condition; the rule data 110 in the database 101 defines these states. In this embodiment, as described later, the search of the rule data 110 and the state transitions are also controlled according to the stack state numbers NSA stored in the storage unit 104 described with reference to FIG. 1.

以下、図４のステップＳ４０６のルール検索処理の詳細例である図６のフローチャートの処理について、説明する。 The processing of the flowchart of FIG. 6, which is a detailed example of the rule search processing in step S406 of FIG. 4, will be described below.

In FIG. 6, for the input word set obtained from the user's utterance in the preprocessing of step S405 of FIG. 4 (shown in FIG. 5), the CPU 201 repeats the series of processes in steps S601 to S612 until it determines that the search of all rules has finished (step S613: YES, described later).

In this loop, the CPU 201 first searches the database 101 for rule data 110 whose assumed input word set is contained in the input word set (corresponding to the input word set 111 of FIG. 1), as described above, using the rule data 110 of FIG. 3(c) stored in the memory 202 or the auxiliary information storage device 205 (corresponding to the rule data 110 in the database 101 of FIG. 1) (step S601). As another embodiment of step S601, the CPU 201 may search for rule data whose assumed input word count (see FIG. 3(c)) equals the input word count (see FIG. 3(a)) of the input word set and in which all assumed words in the array of assumed input words (see FIG. 3(c)) match all of the input words (see FIG. 3(b)). When the input sentence is in Japanese, the order of the words does not matter when comparing the assumed input word set with the input word set. This accommodates cases in which, depending on the input sentence, words are inverted or otherwise reordered.

As a result of the search in step S601, the CPU 201 determines whether rule data was found in the database 101 (step S602).
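The two matching conditions described for step S601 can be sketched with set and multiset comparisons, which ignore word order as the text requires. This is a minimal Python illustration; the function names `rule_matches` and `rule_matches_exact` are hypothetical.

```python
from typing import List


def rule_matches(user_words: List[str], input_words: List[str]) -> bool:
    """Main condition: every assumed input word occurs among the input words."""
    return set(user_words) <= set(input_words)


def rule_matches_exact(user_words: List[str], input_words: List[str]) -> bool:
    """Alternative condition: same word count and the same words, in any order."""
    return (len(user_words) == len(input_words)
            and sorted(user_words) == sorted(input_words))


# "talk ... animal" still matches the rule words ["animal", "talk"].
contained = rule_matches(["animal", "talk"], ["talk", "of", "animal"])
exact = rule_matches_exact(["animal", "talk"], ["talk", "of", "animal"])
```

The first call succeeds because both rule words occur in the input; the second fails because the input carries an extra word.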

ルールデータが見つからなかった場合（ステップＳ６０２の判定がＮＯの場合）には、ＣＰＵ２０１は、全てのルールデータの検索が終了したか否かを判定する（ステップＳ６１３）。 If no rule data is found (NO in step S602), the CPU 201 determines whether or not all rule data have been searched (step S613).

全てのルールデータの検索が終了していない場合（ステップＳ６１３の判定がＮＯの場合）には、ＣＰＵ２０１は、ステップＳ６０１の検索処理に戻ってルールデータの検索を繰り返し実行する。 If the search for all rule data has not ended (NO in step S613), the CPU 201 returns to the search processing in step S601 and repeats the search for rule data.

If rule data is found as a result of the search in step S601 (the determination in step S602 is YES), the CPU 201 executes the series of processes in steps S603 to S612 below to determine whether to adopt the rule data found in step S601 as response candidate rule data 112 (see FIG. 1).

具体的には、ＣＰＵ２０１はまず、図２のメモリ２０２上に保持している変数である状態番号係数が示す倍率を、初期値である１．０にセットする（ステップＳ６０３）。状態番号係数は、現在の状態以外の過去に発生した状態に対応するルールデータを検索する場合に、その状態をどの程度重要視するかを決定するための重みデータである。 Specifically, the CPU 201 first sets the magnification indicated by the state number coefficient, which is a variable held in the memory 202 in FIG. 2, to the initial value of 1.0 (step S603). The state number coefficient is weight data for determining how important a state is to be given when retrieving rule data corresponding to a state that occurred in the past other than the current state.

Next, from the stack array of state numbers state_id (see FIG. 3(a)) held as control data in the memory 202 (storage unit 104), the CPU 201 reads the first of the predetermined plurality of stack state numbers NSA in the stack array, that is, the most recently stored state number (step S604).

次に、ＣＰＵ２０１は、ステップＳ６０１によって見つかったルールデータ中の状態番号：ｓｔａｔｅ＿ｉｄ（図３（ｃ）参照）が、上記ステップＳ６０４又は後述するＳ６１１によって選択された現在のスタック状態番号ＮＳＡと一致するか否かを判定する（ステップＳ６０５）。 Next, the CPU 201 determines whether the state number: state_id (see FIG. 3C) in the rule data found in step S601 matches the current stack state number NSA selected in step S604 or S611, which will be described later. It is determined whether or not (step S605).

If the determination in step S605 is YES, the CPU 201 registers, in the response candidate rule data list transCandidates (see FIG. 3(a)), the value of the pointer transition (FIG. 3(c)) to the rule data found in step S601, thereby adding that rule data to the list as new response candidate rule data 112 (step S606).

Next, the CPU 201 accumulates, into the score of the newly added response candidate rule data 112 referenced from the response candidate rule data list transCandidates, the square of the weighting factor weight (see FIG. 3(b)) of each input word that corresponds to an assumed input word in the assumed input word set of the rule data 110 found in step S601 of FIG. 6, accumulating over all input words (step S607). The initial value of the score of the response candidate rule data 112 is set to a predetermined value. Any value may be used as the predetermined value; for example, it may be 0.

続いて、ＣＰＵ２０１は、応答候補ルールデータリストｔｒａｎｓＣａｎｄｉｄａｔｅｓから参照される、今回追加された応答候補ルールデータ１１２のスコアに、メモリ２０２上に保持されている変数である状態番号係数の２乗値を累算する（ステップＳ６０８）。前述したように、状態番号係数は、オートマトンの現在の状態番号以外の状態番号を含むルールデータ１１０を採用する場合に、その状態番号をどの程度重要視するかを決定するための重みデータである。 Subsequently, the CPU 201 accumulates the square value of the state number coefficient, which is a variable held in the memory 202, to the score of the response candidate rule data 112 added this time, which is referred to from the response candidate rule data list transCandidates. (step S608). As described above, the state number coefficient is weight data for determining how important the state number is when adopting the rule data 110 including the state number other than the current state number of the automaton. .

Next, the CPU 201 converts the score of the rule data 110 added in step S606 as response candidate rule data 112, referenced from the response candidate rule data list transCandidates, into a cosine similarity according to equation (1) below (step S609). Here, "sqrt()" denotes the square root operation. "Ctrl.score_coef" is the evaluation coefficient referenced from the head address Ctrl of the control data of FIG. 3(a), whose value is calculated in steps S503, S508, and S510 of FIG. 5. "transCandidates->score" denotes the score variable, in the memory 202, of the newly added response candidate rule data 112.

transCandidates->score = sqrt(transCandidates->score) × Ctrl.score_coef ... (1)

By this point the covariance and the standard deviation corresponding to the rule data 110 serving as the new response candidate rule data 112 have been calculated, so multiplying by the evaluation coefficient, which is the reciprocal of the standard deviation corresponding to the input word set described above, as in equation (1), yields the correlation coefficient.
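Steps S607 to S609 can be followed with concrete numbers. The Python sketch below is one reading of those steps, assuming the initial score is 0 and that the weights passed in are those of the input words matched by the rule; the function name `candidate_score` is hypothetical.

```python
import math
from typing import Iterable


def candidate_score(matched_weights: Iterable[float],
                    state_coef: float,
                    score_coef: float) -> float:
    """Score of one response candidate, following equation (1)."""
    score = 0.0                              # assumed initial value
    for w in matched_weights:
        score += w * w                       # step S607: squared word weights
    score += state_coef * state_coef         # step S608: squared state coefficient
    return math.sqrt(score) * score_coef     # step S609: equation (1)


# Input of two words with weight 1.0 each, so score_coef = 1 / sqrt(1 + 1 + 1).
score_coef = 1.0 / math.sqrt(3.0)
current = candidate_score([1.0, 1.0], 1.0, score_coef)   # rule in the current state
older = candidate_score([1.0, 1.0], 0.9, score_coef)     # rule in an older state
```

A rule that matches every input word in the current state reaches a score of 1, and the same rule reached through an older state scores slightly lower.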

次に、ＣＰＵ２０１は、ステップＳ６１３の処理からステップＳ６０１に戻り、次のルールデータに対する検索処理を繰り返し実行する。 Next, the CPU 201 returns from the process of step S613 to step S601, and repeats the search process for the next rule data.

一方、ステップＳ６０５の判定がＮＯ、すなわち、ステップＳ６０１で見つかったルールデータ１１０の状態番号：ｓｔａｔｅ＿ｉｄが、ステップＳ６０４又はＳ６１１によって選択された現在のスタック状態番号ＮＳＡと一致しなければ、ＣＰＵ２０１は、スタック配列中のスタック状態番号ＮＳＡの検索が終了したか否かを判定する（ステップＳ６１０）。このステップＳ６１０の判定がＮＯのときには、そのときに読みだされているスタック状態番号ＮＳＡよりも一つ前に記憶されたスタック状態番号ＮＳＡを読み出す（ステップＳ６１１）とともに、状態番号係数に所定の減衰係数（１．０＞減衰係数＞０）を乗算し（ステップＳ６１２）、ステップＳ６０５に戻る。 On the other hand, if the determination in step S605 is NO, that is, if the state number: state_id of the rule data 110 found in step S601 does not match the current stack state number NSA selected in step S604 or S611, the CPU 201 It is determined whether or not the search for the stack state number NSA in the array has ended (step S610). When the determination in step S610 is NO, the stack state number NSA stored one before the stack state number NSA being read at that time is read out (step S611), and the state number coefficient is attenuated by a predetermined value. A coefficient (1.0>attenuation coefficient>0) is multiplied (step S612), and the process returns to step S605.

一方、上記ステップＳ６１０の判定がＹＥＳで、状態番号のスタック配列中のスタック状態番号ＮＳＡの検索が終了したときには、前記ステップＳ６１３以降を実行する。 On the other hand, when the determination in step S610 is YES and the search for the stack state number NSA in the stack array of state numbers is completed, steps S613 and subsequent steps are executed.

一方、ステップＳ６１３の判定がＹＥＳで、全てのルールデータ１１０の検索が終了したときには、ステップＳ６１４及びＳ６１５において、ワイルドカードリスト検索処理及びスーパーワイルドカードリスト検索処理をそれぞれ、後述するようにして実行し、本処理を終了する。 On the other hand, when the determination in step S613 is YES and the search for all rule data 110 is completed, wild card list search processing and super wild card list search processing are executed in steps S614 and S615, respectively, as will be described later. , the process ends.

以上により、図６に示すルール検索処理では、ステップ６０１で見つかったルールデータ１１０の状態番号が、ステップＳ６０４で読みだされた最初のスタック状態番号ＮＳＡと、すなわちオートマトンの現在状態と一致する場合（ステップＳ６０５：ＹＥＳ）には、ステップＳ６０３で値１．０に設定された状態番号係数がそのまま用いられ、ステップＳ６０８で累算される。 As described above, in the rule search process shown in FIG. 6, when the state number of the rule data 110 found in step S601 matches the first stack state number NSA read out in step S604, that is, the current state of the automaton ( In step S605: YES), the state number coefficient set to 1.0 in step S603 is used as it is, and is accumulated in step S608.

On the other hand, if the state number of the rule data found in step S601 does not match the first stack state number NSA (step S605: NO), the stack state numbers NSA stored in the stack array are read out in order from the newest (step S611), the state number coefficient is multiplied by the attenuation coefficient (step S612), and step S605 is executed again to determine whether the state number of the automaton contained in the rule data 110 matches the stack state number NSA just read. As long as the search of all stack state numbers NSA has not finished (step S610: NO), steps S611 and S612 are repeated until step S605 becomes YES.

As a result, the older the stack state number NSA read in step S611, the more times the state number coefficient is multiplied by the attenuation coefficient, which is smaller than 1.0, and the smaller the state number coefficient becomes. For example, with an attenuation coefficient of 0.9, the state number coefficient decays from the initial value of 1.0 set in step S603 to 0.9, then 0.81, and so on, each time the attenuation coefficient is applied. When the determination in step S605 becomes YES, the score of the response candidate rule data 112 is calculated using the attenuated state number coefficient (step S608).
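The walk over the stack in steps S603 to S612 can be sketched as a short loop. In this Python illustration the stack is assumed to be a list with the newest state number last, and a return value of `None` stands for the case in which the stack is exhausted without a match; the function name `state_coefficient` is hypothetical.

```python
from typing import List, Optional


def state_coefficient(rule_state_id: int,
                      stack: List[int],
                      decay: float = 0.9) -> Optional[float]:
    """Return the state number coefficient for a rule, or None if no stacked state matches."""
    coef = 1.0                               # step S603: initial value 1.0
    for nsa in reversed(stack):              # steps S604/S611: newest first
        if nsa == rule_state_id:             # step S605: state numbers match
            return coef
        coef *= decay                        # step S612: attenuate and look further back
    return None                              # step S610: YES, stack exhausted


stack = [0, 1, 2]                            # state 2 is the current state
coefs = [state_coefficient(s, stack) for s in (2, 1, 0)]
```

For the stack above, rules in states 2, 1, and 0 receive coefficients of 1.0, 0.9, and about 0.81, matching the decay described in the text.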

FIG. 7 is a flowchart showing a detailed example of the response sentence output processing in step S407 of FIG. 4. First, based on the scores of the rule data 110 serving as the response candidate rule data 112 in the response candidate rule data list transCandidates, the CPU 201 selects the maximum likelihood candidate (the one with the highest score) and determines that rule data 110 as the response rule data 113 of FIG. 1 (step S701).

続いて、ＣＰＵ２０１は、ステップＳ７０１の最尤候補のルールデータ１１０が有する応答文：ｂｏｔ＿ｒｅｐｌｙ（図３（ｃ）参照）を、図２の音声出力装置２０９に出力する（ステップＳ７０２）。音声出力装置２０９は、応答文：ｂｏｔ＿ｒｅｐｌｙに対応するデジタル音声応答信号を合成し、そのデジタル音声応答信号を内蔵のＤ／Ａ変換器にてアナログ音声応答信号に変換し、そのアナログ音声応答信号をアンプ及びスピーカを介してユーザに向かって放音する。なお、上記応答文：ｂｏｔ＿ｒｅｐｌｙの音声信号を放音せずに、上記応答文：ｂｏｔ＿ｒｅｐｌｙのテキストデータが、出力装置２０４のディスプレイに表示されてもよい。 Subsequently, the CPU 201 outputs the response sentence: bot_reply (see FIG. 3C) included in the maximum likelihood candidate rule data 110 in step S701 to the voice output device 209 in FIG. 2 (step S702). The voice output device 209 synthesizes a digital voice response signal corresponding to the response sentence: bot_reply, converts the digital voice response signal to an analog voice response signal with a built-in D/A converter, and converts the analog voice response signal to A sound is emitted toward the user via an amplifier and a speaker. The text data of the response sentence: bot_reply may be displayed on the display of the output device 204 without outputting the voice signal of the response sentence: bot_reply.

続いて、ＣＰＵ２０１は、最尤候補のルールデータ１１０の遷移先状態番号：ｎｅｘｔ＿ｓｔａｔｅ＿ｉｄ（図３（ｃ）参照）を取得する（ステップＳ７０３）。 Subsequently, the CPU 201 acquires the transition destination state number: next_state_id (see FIG. 3C) of the maximum likelihood candidate rule data 110 (step S703).

The CPU 201 then pushes the transition destination state number acquired in step S703 onto the stack array of state numbers state_id (that is, the stack state numbers NSA of FIG. 1) as a stack state number NSA, in such a way that the transition destination state number does not appear consecutively in the stack array (step S704). When this processing is executed for the first time, that is, when no stack state number NSA has yet been stacked in the stack array, the current state number and the transition destination state number are pushed onto the stack array in this order.
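Steps S701 to S704 can be put together in a few lines. The Python sketch below rests on two interpretations that the text does not spell out: "not consecutive" is read as "do not push a state number equal to the one on top of the stack", and candidates are modeled as dictionaries. The function name `respond` is hypothetical, and speech output is replaced by returning the reply text.

```python
from typing import Dict, List


def respond(candidates: List[Dict], stack: List[int]) -> str:
    """Pick the best candidate, return its reply, and update the state stack."""
    best = max(candidates, key=lambda c: c["score"])   # step S701: highest score
    reply = best["bot_reply"]                          # step S702: response sentence
    next_state = best["next_state_id"]                 # step S703
    if not stack:                                      # first run: push current state
        stack.append(best["state_id"])
    if stack[-1] != next_state:                        # step S704: no consecutive repeats
        stack.append(next_state)
    return reply


stack = [0]
candidates = [
    {"score": 0.5, "state_id": 0, "bot_reply": "I wonder, I don't know",
     "next_state_id": 0},
    {"score": 0.9, "state_id": 0, "bot_reply": "Yes, let's talk about animals",
     "next_state_id": 1},
]
reply = respond(candidates, stack)
```

After this call the stack holds states 0 and 1, so a later rule belonging to state 0 can still be found at a reduced state number coefficient.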

FIGS. 8, 9, and 10 show operation examples of the processing described above. FIGS. 8(b), 9, and 10 show an operation example in which the state number of the automaton transitions from state number 0 to state number 4. FIG. 8(a) shows the legend for the operation examples of FIGS. 8(b), 9, and 10. A number enclosed in a thick solid circle (X = 0, 1, 2, 3, 4) indicates a state number of the automaton. A broken-line frame labeled #X-i, with a leading sharp sign, indicates the i-th rule data 110 (see FIG. 1) for the case where the state number of the automaton is X. Within this rule data 110, a dark-colored frame indicates the "condition" for the rule data 110 to be selected. The text in a balloon pointing to the left indicates the assumed input word set (see FIG. 1) that must be matched by the input word set (corresponding to the input word set 111 of FIG. 1) obtained by morphological analysis of the input utterance information acquired from the user (corresponding to the input utterance information 115 of FIG. 1). The text in a balloon pointing to the right indicates the text of the response sentence (see FIG. 1).

The method of acquiring the user's input sentence and the input word set is as described above. The text data of this input word set corresponds to the input word set 111 of FIG. 1, and hereinafter the term "input word set" by itself refers to the input word set 111 of FIG. 1.

For example, state number 0 of the automaton shown in FIG. 8(b) is a state associated with a group of rule data for introducing general topics at the start of a conversation. In state number 0, the rule data 110 [#0-0] outputs a response sentence corresponding to the text "I wonder, I don't know" when, for example, the user speaks and a question containing the word "like" is given as the input word set corresponding to the input sentence. In this operation example, after the response sentence of the rule data 110 [#0-0] is output (step S702 of FIG. 7), the state number of the automaton remains at the current state number 0, as indicated by the solid arrow leaving the broken-line frame of the rule data 110 [#0-0] (in step S703 of FIG. 7 the transition destination state number 0 is acquired from the rule data 110 [#0-0], and in step S704 it is pushed onto the stack). This solid arrow corresponds to the next transition destination state number in the rule data 110 of FIG. 1. The same applies to the rule data 110 [#0-1] and [#0-3] in state number 0 shown in FIG. 8(b).

In contrast, rule data 110 [#0-2] in state number 0 shown in FIG. 8(b) is a rule for outputting a response sentence corresponding to the text "Yes, let's talk about animals" when an input word set containing the two words "animal" and "talk", corresponding to an interrogative input sentence about "talking about animals", is given. In this operation example, after the response sentence of rule data 110 [#0-2] is output (step S702), the state number of the automaton transitions from the current state number 0 to state number 1 shown in FIG. 9, as shown by the solid arrow that leaves the broken-line frame of rule data 110 [#0-2] and points to the "1" enclosed in a thick broken-line circle (in step S703 of FIG. 7, transition destination state number = 1 is acquired from rule data 110 [#0-2], and in step S704 it is pushed onto the stack).

State number 1 of the automaton shown in FIG. 9 is a state associated with a group of rule data for conversing about animals. For example, rule data 110 [#1-0] is a rule for outputting the response sentence "Cats have big eyes, don't they?" as a general topic about cats when the input word set generated from the input sentence contains the word "cat". Rule data 110 [#1-1] is a rule for outputting the response sentence "Cats have big eyes, don't they?" as a general topic about cats' eyes when the input word set generated from the input sentence contains the word "eyes". After the response sentence of rule data 110 [#1-0] or [#1-1] is output (step S702 in FIG. 7), the state number of the automaton transitions from the current state number 1 shown in FIG. 9 to state number 2 shown in FIG. 10, as shown by the solid arrows that leave the broken-line frames of these rule data and point to the "2" enclosed in a thick broken-line circle (in step S703 of FIG. 7, transition destination state number = 2 is acquired from rule data 110 [#1-0] or [#1-1], and in step S704 it is pushed onto the stack).

In contrast, in state 1 of FIG. 9, rule data 110 [#1-2], for example, is a rule for outputting the response sentence "Foxes seem like they're active at night, don't they?" as a general topic about foxes when the input word set generated from the input sentence contains the keyword "fox". Rule data 110 [#1-3] is a rule for outputting the response sentence "A fox's eyes might look like a cat's" as a general topic about foxes' eyes when the input word set generated from the input sentence contains the word "eyes". After the response sentence of rule data 110 [#1-2] or [#1-3] is output (step S702 in FIG. 7), the state number of the automaton transitions from the current state number 1 shown in FIG. 9 to state number 4 shown in FIG. 9, as shown by the solid arrows that leave the broken-line frames of these rule data and point to the "4" enclosed in a thick broken-line circle (in step S703 of FIG. 7, transition destination state number = 4 is acquired from rule data 110 [#1-2] or [#1-3], and in step S704 it is pushed onto the stack).

Note that rule data 110 [#1-1] on the topic of cats and rule data 110 [#1-3] on the topic of foxes can both be selected when the input word set generated from the input sentence contains the word "eyes". Which of the two is selected can vary with the conditions under which the similarity (cosine similarity) between the input word set and the assumed input word set of each rule data 110 is calculated (steps S607, S608, and S609 in FIGS. 5 and 6, and step S701 in FIG. 7). This configuration keeps the dialogue from becoming monotonous.
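The similarity calculation mentioned above can be illustrated with a minimal sketch. It assumes that each word set is turned into a term-count vector, which this passage does not specify; the function name `cosine_similarity` and the English tokens are illustrative and are not taken from the embodiment.

```python
import math
from collections import Counter


def cosine_similarity(input_words, assumed_words):
    """Cosine similarity between two word lists treated as term-count vectors."""
    a, b = Counter(input_words), Counter(assumed_words)
    # Dot product over the words that appear in both vectors.
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty word set is treated as sharing nothing (assumption)
    return dot / (norm_a * norm_b)


# An input containing extra words scores lower against a one-word rule.
print(cosine_similarity(["eyes", "big"], ["eyes"]))
```

Under this assumption, two rules that both expect "eyes" receive the same similarity, so any other factor applied to the score decides between them.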

State number 2 in FIG. 10, which is entered after rule data 110 [#1-0] or rule data 110 [#1-1] in state number 1 of FIG. 9 is selected, and state number 3, which is entered from state number 2, correspond to groups of rule data 110 on deeper topics about cats. In contrast, state number 4 in FIG. 9, which is entered after rule data 110 [#1-2] or rule data 110 [#1-3] in state number 1 of FIG. 9 is selected, corresponds to a group of rule data 110 on the topic of stories about foxes (fairy tales and the like).

Rule data 110 [#1-4] in state number 1 of FIG. 9 is a rule for returning a similarly vague response sentence such as "I think so" when an input word set corresponding to vague content such as "I see" is given as the input sentence. After the response sentence of rule data 110 [#1-4] is output (step S702 in FIG. 7), the state number of the automaton remains at state number 1 shown in FIG. 9, as shown by the solid arrow that leaves the broken-line frame of rule data 110 [#1-4] and returns to the "1" enclosed in a thick solid circle (in step S703 of FIG. 7, transition destination state number = 1 is acquired from rule data 110 [#1-4], and in step S704 the value 1 is left stored at the top of the stack).

FIG. 11 shows a specific operation example of the rule search processing of FIG. 6 for the automaton illustrated in FIGS. 8 to 9. First, suppose that in state number 0 of the automaton the user, for example by speaking, enters "Shall we talk about animals?" as input sentence In[0], and that morphological analysis generates from it an input word set consisting of the two words "animal" and "talk". Among the plurality of rule data 110 of FIG. 3(c) stored in the memory 202 or the auxiliary information storage device 205, the rule data 110 whose assumed input word set userWords[0], userWords[1] in its assumed input word array (see FIG. 3(c)) is included in (here, matches) the input word set "animal", "talk" is searched for (step S601 in FIG. 6). As a result, one rule data 110 [#0-2] is found whose current state number state_id (see FIG. 3(c)) matches state number 0 of the automaton in FIG. 8(b) and whose assumed input word set userWords[0], userWords[1] (see FIG. 3(c)) is included in (or matches) the input word set "animal", "talk". In addition, although not explained for step S601, input sentence In[0] is determined to be an interrogative sentence whose "type" is "proposal" and whose "affirmative/negative" item is "affirmative"; rule data 110 [#0-2] is retrieved in association with input sentence In[0] of FIG. 11 and becomes the response candidate rule data 112 (the first determination in step S605 of FIG. 6 is YES, and processing proceeds to step S606).
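The word-inclusion test of step S601 can be sketched as follows. The field names follow state_id, userWords, bot_reply, and next_state_id of FIG. 3(c) as named in the text; the `Rule` class, the `label` field, the function name, and the English tokens are assumptions made for illustration only. The sketch covers only the inclusion test, not the sentence type check or the state handling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    label: str          # e.g. "#0-2" (illustrative identifier)
    state_id: int       # state the rule belongs to
    user_words: tuple   # assumed input word set (userWords in the text)
    bot_reply: str      # response sentence
    next_state_id: int  # next transition destination state number


RULES = [
    Rule("#0-2", 0, ("animal", "talk"), "Yes, let's talk about animals", 1),
    Rule("#1-0", 1, ("cat",), "Cats have big eyes, don't they?", 2),
    Rule("#1-2", 1, ("fox",), "Foxes seem like they're active at night, don't they?", 4),
]


def find_matching_rules(input_words, rules):
    """Return the rules whose assumed words are all contained in the input word set."""
    words = set(input_words)
    return [r for r in rules if set(r.user_words) <= words]


for rule in find_matching_rules(["animal", "talk"], RULES):
    print(rule.label, rule.bot_reply)
```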

Next, as shown in FIG. 11, this single response candidate rule data 112 [#0-2] is selected as the response rule data 113 corresponding to input sentence In[0] (step S701 in FIG. 7). As a result, as shown in FIG. 11, the response sentence bot_reply (see FIG. 3(c)) of rule data 110 [#0-2] selected as the response rule data 113 is generated and output as response sentence Out[0] = "Yes, let's talk about animals" (see FIG. 11), that is, as the response sentence 114 (see FIG. 1) (step S702). Rule data 110 [#0-2] selected as the response rule data 113 is then referenced, and because it contains the value 1 as the next transition destination state number next_state_id (step S703 in FIG. 7), the state number of the automaton transitions from state number 0 to state number 1 (step S704 in FIG. 7).

At this time, when the response rule data 113 is the first to be selected, as is rule data 110 [#0-2], the value 0 of its current state number state_id (see FIG. 3(c)) is first pushed onto the state number stack array state_id (see FIG. 3(a)) and stored as a stack state number NSA. Then the value 1 of the next transition destination state number next_state_id of rule data 110 [#0-2] is pushed onto the same stack array and stored as the next stack state number NSA (step S704 in FIG. 7). In this case, state number 0 is stored as the immediately preceding stack state number NSA, and the next transition destination state number 1 is stored as the latest stack state number NSA.
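The stack update of step S704 can be sketched as below. The function name `push_transition` is hypothetical, and the sketch assumes that a next state equal to the current top of the stack is left in place rather than pushed again, which is one reading of the description of rule data 110 [#1-4]; the embodiment may handle that case differently.

```python
def push_transition(stack, state_id, next_state_id):
    """Record a selected rule's transition on the state number stack."""
    if not stack:
        # First selected rule: remember the state it was selected in.
        stack.append(state_id)
    if stack[-1] != next_state_id:
        # Assumption: a state already on top of the stack is not duplicated.
        stack.append(next_state_id)
    return stack


# (state_id, next_state_id) pairs of the rules selected in the FIG. 11 walkthrough:
# [#0-2], [#1-0], [#1-3], [#1-2], [#1-4].
stack = []
for state_id, next_state_id in [(0, 1), (1, 2), (1, 2), (1, 4), (1, 1)]:
    push_transition(stack, state_id, next_state_id)
print(stack)
```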

Also at this time, although not shown in the flowchart of FIG. 7, the response sentence Out[0] = "Yes, let's talk about animals" of rule data #0-2 selected as the response rule data 113 is stored in a response sentence storage unit (not shown) in the memory 202 or the auxiliary information storage device 205 (see FIG. 2), which corresponds to the response sentence storage unit 105 in FIG. 1.

Next, suppose that in state number 1 of the automaton after the transition, "I'm thinking of getting a cat" is entered as input sentence In[1], and that morphological analysis of it generates an input word set 111 containing the word "cat". Among the plurality of rule data 110 of FIG. 3(c), the rule data 110 whose assumed input word set userWords[0] in its assumed input word array (see FIG. 3(c)) is included in (or matches) the input word set "cat" is searched for (step S601 in FIG. 6). As a result, one rule data 110 [#1-0] is found whose current state number state_id (see FIG. 3(c)) matches state number 1 of the automaton in FIG. 9 and whose assumed input word set userWords[0] (see FIG. 3(c)) is included in (or matches) the input word set "cat". In addition, input sentence In[1] is determined to be an interrogative sentence whose "type" is "other" and whose "affirmative/negative" item is "affirmative"; rule data 110 [#1-0] is retrieved in association with input sentence In[1] of FIG. 11 as the response candidate rule data 112 (the first determination in step S605 of FIG. 6 is YES, and processing proceeds to step S606).

Next, as shown in FIG. 11, this single response candidate rule data 112 [#1-0] is selected as the response rule data 113 corresponding to input sentence In[1] (step S701 in FIG. 7). As a result, as shown in FIG. 11, the response sentence bot_reply (see FIG. 3(c)) of rule data 110 [#1-0] selected as the response rule data 113 is generated and output as response sentence Out[1] = "Cats have big eyes, don't they?" (see FIG. 11), that is, as the response sentence 114 (see FIG. 1) (step S702). Rule data 110 [#1-0] selected as the response rule data 113 is then referenced, and because it contains the value 2 as the next transition destination state number next_state_id (step S703 in FIG. 7), the state number of the automaton transitions from state number 1 to state number 2 (step S704 in FIG. 7).

At this time, when the response rule data 113 is a second or later selection, as is rule data 110 [#1-0], the value 2 of the next transition destination state number next_state_id contained in rule data 110 [#1-0] is pushed onto the state number stack array state_id (see FIG. 3(a)) and stored as the latest stack state number NSA (step S704).

Also at this time, although not shown in the flowchart of FIG. 6, the response sentence Out[1] = "Cats have big eyes, don't they?" of rule data 110 [#1-0] selected as the response rule data 113 is stored in the response sentence storage unit described above.

Next, suppose that in state number 2 of the automaton after the transition, "Yes, their eyes are big and cute" is entered as input sentence In[2], and an input word set 111 containing the word "eyes" is generated. Among the plurality of rule data 110 of FIG. 3(c), the rule data 110 whose assumed input word set userWords[0] in its assumed input word array (see FIG. 3(c)) is included in (or matches) the input word set "eyes" is searched for (step S601).

As a result, through the loop of steps S601 → S602 → S613 → S601 in FIG. 6, the rule data whose assumed input word set userWords[0] (see FIG. 3(c)) is included in (or matches) the input word set "eyes" are retrieved one after another, and the determination in step S602 becomes YES for each in turn: one rule data 110 [#2-0], whose current state number state_id (see FIG. 3(c)) matches state number 2 of the automaton, and rule data 110 [#1-1] and [#1-3], whose current state number state_id matches state number 1. Then, in steps S603 to S612, state number 2 of rule data 110 [#2-0] matches the latest stack state number NSA (= 2) pushed on top of the state number stack array state_id (see FIG. 3(a)), so the first determination in step S605 is YES and its score is multiplied by a state number coefficient of 1 (step S608). Similarly, the state number of rule data 110 [#1-1] and [#1-3] matches the stack state number NSA (= 1) pushed second from the top of the state number stack array state_id, so the second determination in step S605 is YES; step S612 is therefore executed once for each, and each score is multiplied by a state number coefficient of 1 × 0.9 = 0.9 (steps S612 and S608).

Consequently, rule data 110 [#2-0], whose score is multiplied by 1.0, and rule data 110 [#1-1] and [#1-3], whose scores are each multiplied by 0.9, are each acquired into the response candidate rule data list transCandidates as response candidate rule data 112.
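The state number coefficient described above decays by a factor of 0.9 for each stack entry between the top of the stack and the entry matching the rule's state. A minimal sketch, assuming the search simply walks down the whole stack (the text gives no depth limit) and using the hypothetical name `state_coefficient`:

```python
def state_coefficient(rule_state_id, stack, decay=0.9):
    """Return decay**depth, where depth counts down from the top of the stack.

    Returns None when the rule's state is not on the stack at all
    (how such rules are treated is an assumption of this sketch).
    """
    for depth, state in enumerate(reversed(stack)):
        if state == rule_state_id:
            return decay ** depth
    return None


stack = [0, 1, 2]                   # latest state number is 2
print(state_coefficient(2, stack))  # rule in the current state
print(state_coefficient(1, stack))  # rule in the state one step back
```

With the stack [0, 1, 2, 4], a rule belonging to state 1 would sit two entries below the top and receive 0.9 × 0.9, or about 0.81.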

Next, although not shown in the flowchart of FIG. 6, it is determined for rule data 110 [#2-0], [#1-1], and [#1-3] whether each response sentence 114 is already registered in the response sentence storage unit described above. It is thereby determined that the response sentence "Cats have big eyes, don't they?" of rule data 110 [#2-0] (FIG. 10) and rule data 110 [#1-1] (FIG. 9) has already been output as response sentence Out[1] and is registered in the response sentence storage unit. Consequently, to avoid outputting the same response sentence 114 in succession, rule data 110 [#2-0] (FIG. 10) and rule data 110 [#1-1] (FIG. 9) are not selected, and rule data 110 [#1-3] is selected as the response rule data 113 corresponding to input sentence In[2], as shown in FIG. 11.
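The check against previously output response sentences can be sketched as a filter applied before the highest-scoring candidate is chosen. The dictionary layout, the function name `select_response`, and the behavior when every candidate has already been used (returning None) are assumptions of this sketch, not details given in the text.

```python
def select_response(candidates, used_replies):
    """Pick the best-scoring candidate whose reply has not been output before."""
    fresh = [c for c in candidates if c["bot_reply"] not in used_replies]
    if not fresh:
        return None  # assumption: the text does not say what happens here
    return max(fresh, key=lambda c: c["score"])


candidates = [
    {"label": "#2-0", "score": 1.0, "bot_reply": "Cats have big eyes, don't they?"},
    {"label": "#1-1", "score": 0.9, "bot_reply": "Cats have big eyes, don't they?"},
    {"label": "#1-3", "score": 0.9, "bot_reply": "A fox's eyes might look like a cat's"},
]
used_replies = {"Yes, let's talk about animals", "Cats have big eyes, don't they?"}

chosen = select_response(candidates, used_replies)
print(chosen["label"])
```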

As a result, as shown in FIG. 11, the response sentence bot_reply (see FIG. 3(c)) of rule data 110 [#1-3] selected as the response rule data 113 is generated and output as response sentence Out[2] = "A fox's eyes might look like a cat's" (see FIG. 11), that is, as the response sentence 114 (see FIG. 1) (step S702). Rule data 110 [#1-3] selected as the response rule data 113 is then referenced, and because it contains the value 2 as the next transition destination state number next_state_id (step S703 in FIG. 7), the state number of the automaton, which indicates the rule data 110 to be selected next, remains at state number 2 (step S704 in FIG. 7).

At this time, because rule data 110 [#1-3] selected as the response rule data 113 is a second or later selection, the value 2 of the next transition destination state number next_state_id contained in rule data 110 [#1-3] is pushed onto the state number stack array state_id (see FIG. 3(a)) and stored as the latest stack state number NSA (step S704).

Also at this time, although not shown in the flowchart of FIG. 6, the response sentence Out[2] = "A fox's eyes might look like a cat's" of rule data 110 [#1-3] selected as the response rule data 113 is stored in the response sentence storage unit described above.

Next, suppose that in state number 2 of the automaton after the transition, "So we're suddenly talking about foxes" is entered as input sentence In[3], and an input word set 111 containing the word "fox" is generated. Among the plurality of rule data 110 of FIG. 3(c), the rule data 110 whose assumed input word set userWords[0] (see FIG. 3(c)) is included in (or matches) the input word set "fox" is searched for (step S601).

As a result, in the loop of steps S601 → S602 → S613 → S601 in FIG. 6, no rule data 110 having the current state number 2 of the automaton of FIG. 9 as its current state number state_id (see FIG. 3(c)) contains "fox" in its assumed input word array userWords[] (see FIG. 3(c)). However, as illustrated in FIG. 9, among the rule data 110 having state number 1 as the current state number state_id (see FIG. 3(c)), rule data 110 [#1-2], which contains "fox" in its assumed input word array userWords[], is found, and the determination in step S602 becomes YES. Then, in steps S603 to S612, state number 1 of rule data 110 [#1-2] matches the past stack state number NSA (= 1) that was pushed onto the state number stack array state_id before the latest state number 2, so the second determination in step S605 is YES; step S612 is therefore executed once, and the score is multiplied by a state number coefficient of 1 × 0.9 = 0.9 (steps S612 and S608).

Consequently, only rule data 110 [#1-2] is acquired into the response candidate rule data list transCandidates as response candidate rule data 112.

Next, as shown in FIG. 11, rule data 110 [#1-2] acquired as the response candidate rule data 112 is selected as the response rule data 113 corresponding to input sentence In[3] (step S701). As a result, as shown in FIG. 11, the response sentence bot_reply (see FIG. 3(c)) of rule data 110 [#1-2] selected as the response rule data 113 is generated and output as response sentence Out[3] = "Foxes seem like they're active at night, don't they?" (see FIG. 11), that is, as the response sentence 114 (see FIG. 1) (step S702). Rule data 110 [#1-2] selected as the response rule is then referenced, and because it contains the value 4 as the next transition destination state number next_state_id (step S703 in FIG. 7), the state number of the automaton transitions from state number 2 to state number 4 (step S704 in FIG. 7).

At this time, because rule data 110 [#1-2] selected as the response rule data 113 is a second or later selection, the value 4 of the next transition destination state number next_state_id contained in rule data 110 [#1-2] is pushed onto the state number stack array state_id and stored as the latest stack state number NSA (step S704).

Also at this time, the response sentence Out[3] = "Foxes seem like they're active at night, don't they?" of rule data 110 [#1-2] selected as the response rule data 113 is stored in the response sentence storage unit described above.

Continuing further, suppose that in state number 4 after the transition, "I see, you know a lot" is entered as input sentence In[4], and an input word set 111 containing the word "I see" is generated. Among the plurality of rule data 110 of FIG. 3(c), the rule data 110 whose assumed input word set userWords[0] in its assumed input word array (see FIG. 3(c)) is included in (or matches) the input word set "I see" is searched for (step S601).

As a result, in the loop of steps S601 → S602 → S613 → S601 in FIG. 6, no rule data 110 having the current state number 4 of the automaton of FIG. 9 as its current state number state_id (see FIG. 3(c)) is found that contains "I see" in its assumed input word array userWords[] (see FIG. 3(c)). However, as illustrated in FIG. 9, among the rule data 110 having state number 1 of the automaton as the current state number state_id (see FIG. 3(c)), rule data 110 [#1-4], which contains "I see" in its assumed input word array userWords[], is found, and the determination in step S602 becomes YES. Then, in steps S603 to S612, state number 1 of rule data 110 [#1-4] matches the past stack state number NSA (= 1) that was pushed onto the state number stack array state_id before the latest stack state number NSA (= 4), so the third determination in step S605 is YES; step S612 is therefore executed twice, and the score is multiplied by a state number coefficient of 1 × 0.9 × 0.9 = 0.81 (steps S612 and S608).

Consequently, rule data 110 [#1-4] is acquired into the response candidate rule data list transCandidates as response candidate rule data 112.

Next, as shown in FIG. 11, for rule data 110 [#1-4] acquired as the response candidate rule data 112, it is checked whether its response sentence bot_reply (see FIG. 3(c)), namely response sentence Out[4] = "I think so", is registered in the response sentence storage unit described above. It is determined that response sentence Out[4] = "I think so" is not registered in the response sentence storage unit. Consequently, rule data 110 [#1-4], which contains state number 1 of the automaton, is selected as the response rule data 113 corresponding to input sentence In[4], as shown in FIG. 11 (step S701). Then, as shown in FIG. 11, the response sentence bot_reply (see FIG. 3(c)) of rule data 110 [#1-4] selected as the response rule data 113 is generated and output as response sentence Out[4] = "I think so" (see FIG. 11), that is, as the response sentence 114 (see FIG. 1) (step S702). Rule data 110 [#1-4] selected as the response rule data 113 is then referenced, and because it contains the value 1 as the next transition destination state number next_state_id (step S703 in FIG. 7), the state number of the automaton, which indicates the rule data 110 to be selected next, remains at state number 1 (step S704 in FIG. 7).

At this time, because rule data 110 [#1-4] selected as the response rule data 113 is a second or later selection, the value 1 of the next transition destination state number next_state_id contained in rule data 110 [#1-4] is pushed onto the state number stack array state_id (see FIG. 3(a)) and stored as the latest stack state number NSA (step S704).

Also at this time, although not shown in the flowchart of FIG. 6, the response sentence Out[4] = "I think so" (response sentence 114) of rule data #1-4 selected as the response rule data 113 is stored in the response sentence storage unit described above.

In rule data 110 [#0-3], which contains state number 0 of the automaton shown in FIG. 8(b), the input sentence item is "*". This is a rule with the word matching condition "treat any word (morpheme) as a match". For example, it is a rule for outputting a response sentence such as "Huh, did you say something?" when the input sentence contains a keyword that is not set in any of the other rule data 110 [#0-0] to [#0-2] containing state number 0 (with no condition such as being an interrogative sentence). This "*" is called a wildcard. The solid arrow leaving the broken-line frame of rule data 110 [#0-3] shows that after rule data 110 [#0-3] is selected, the same state number 0 as before the selection is maintained (that is, the next transition destination state number is 0). Setting such wildcard rule data makes it possible to realize vague dialogue.

In the rule data 110[#3-2] containing state number 3 of the automaton shown in FIG. 10, the input sentence item is "#". This is called a super wildcard. A super wildcard has the same word matching condition as a wildcard, "treat any word (morpheme) as a match". However, only rule data 110 whose state number matches the current state number of the automaton is added as response candidate rule data 112, and at the point when this rule data 110 is added to the candidates, rule data 110 of other states is deleted from the response candidate rule data list. Like the wildcard described above, it is a rule for outputting a response sentence such as "I'd like to touch it" when the input sentence contains a keyword that is not set in any of the other rule data 110[#3-0] to [#3-1] containing state number 3 (and no condition such as an interrogative sentence applies). Here, for example, the solid arrow leaving the dashed frame of the rule data 110[#3-3] indicates that, after the rule data 110[#3-3] is selected, the automaton transitions to a state number different from that before the selection (for example, state number 2). In this way, when an appropriate transition destination is described in the super wildcard rule data 110, a transition to another state number can be realized. Setting such super wildcard rule data 110 makes it possible to realize a dialogue in which the topic changes substantially when the dialogue gets stuck. Alternatively, it is also possible to realize behavior such as staying in the same state, that is, repeating the same question, until an answer that is satisfactory to the system is obtained.
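The candidate-list handling described for the super wildcard can be sketched as below. This is an illustrative assumption about one way to realize it: rules are modelled as plain dictionaries, and the function name `add_candidate` and the dictionary keys are hypothetical.

```python
SUPER_WILDCARD = "#"  # input sentence item of a super wildcard rule


def add_candidate(candidates, rule, current_state):
    """Return the candidate list after trying to add `rule`.

    A super wildcard rule is added only when its state equals the current
    state; once it is added, candidates of all other states are removed.
    """
    if rule["keyword"] == SUPER_WILDCARD:
        if rule["state"] != current_state:
            return candidates  # not added for any other state
        kept = [c for c in candidates if c["state"] == current_state]
        return kept + [rule]
    return candidates + [rule]
```

The function returns a new list rather than mutating its argument, which keeps the pruning step easy to test in isolation.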

In the rule search process of FIG. 6, after the search of all rule data for the current input word set has finished and the determination in step S613 becomes YES, the CPU 201 executes, for the wildcard and super wildcard described above as well, the same search processing as in steps S601 to S613 described above. The details are omitted.

As described above, once the rule data 110 corresponding to a super wildcard is selected as the response rule data 113 (see FIG. 1), the state number of the automaton transitions to a state number different from that before the selection. Because of this, for the state number contained in this rule data 110, control may be performed so that a rule that does not match the current state number is not added to the response candidate rule data list transCandidates, regardless of whether the stack array of state numbers, state_id (= the stack state numbers NSA in FIG. 1), contains a stack state number NSA with the same number. In addition, at the point when a super wildcard rule is added to this list, control may be performed so that rules having state numbers other than the current state number are excluded from the list.

As described above, in the present embodiment, among the plurality of rule data 110 stored in the database 101, rule data 110 corresponding to the input utterance information 115, for example rule data 110 in which an expected input word set corresponding to the input word set in the input utterance information 115 is set, is selected as a candidate for the response rule data 113 (response candidate rule data 112). In addition, among the plurality of rule data 110, rule data 110 indicating a predetermined state including the current state of the automaton, for example the current state or a state included in the plurality of states stored in the storage unit 104 that sequentially stores the states of the automaton, is selected as a candidate for the response rule data 113 (response candidate rule data 112). The response sentence 114 included in the response rule data 113 selected in this way is then output. As a result, in the present embodiment, a response sentence 114 based on response rule data 113 that follows the flow of, for example, a conversation with the user can be output. At this time, in the present embodiment, among the plurality of rule data 110, rule data whose corresponding automaton state is the same as a more recently stored state among the plurality of states stored in the storage unit 104 can be preferentially selected as the response rule data 113. As a result, a response sentence 114 based on response rule data 113 that corresponds better to the current topic can be output.

Unlike the present embodiment, in a conventional dialogue device, when the rule data 110 of the current state number is not selected as the response rule data 113 because, for example, no appropriate rule data containing the current state number of the automaton has been set, rule data containing another state number has to be searched for, for example at random, and as a result the topic switches abruptly. In contrast, in the present embodiment, the response rule data 113 can be selected as described above, so abrupt switching of the topic can be suppressed, and a dialogue device 100 capable of natural dialogue with the user can be provided.

Further, in the present embodiment, a score serving as an index for selecting the response rule data 113 is calculated for each rule data 110, and the score is calculated so that, among the plurality of rule data 110, rule data 110 corresponding to an automaton state that is the same as a state stored further in the past among the plurality of states stored in the storage unit 104 is less likely to be selected as the response rule data. For example, for each rule data 110, the state number coefficient of that rule data 110 is multiplied by an attenuation coefficient whose value decays more the further in the past the state indicated by that rule data 110 was stored in the storage unit 104, and the multiplied state number coefficient is accumulated into the score of that rule data 110. Then, the rule data 110 having the maximum score among the plurality of rule data 110 is selected as the response rule data 113. This makes it possible to provide a dialogue device 100 capable of natural dialogue that is based on the topics previously followed with the user and that better follows more recent topics.
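The recency-weighted scoring described above can be sketched as follows. The numeric values are assumptions made for illustration: a base state number coefficient of 1.0, an attenuation factor of 0.5 per step into the past, and a score that simply adds the attenuated coefficient to a precomputed similarity. The function names are hypothetical.

```python
def state_coefficient(rule_state, state_stack, base=1.0, decay=0.5):
    """Attenuated state number coefficient for a rule.

    state_stack[0] is the most recently stored state; older entries follow.
    The coefficient decays geometrically with the age of the matching entry
    and is 0.0 when the rule's state is not in the stack (both assumptions).
    """
    for age, stored in enumerate(state_stack):
        if stored == rule_state:
            return base * decay ** age
    return 0.0


def pick_response_rule(candidates, state_stack):
    """candidates: list of (rule_id, rule_state, similarity) tuples.

    Returns the candidate with the maximum accumulated score.
    """
    def score(candidate):
        _, rule_state, similarity = candidate
        return similarity + state_coefficient(rule_state, state_stack)

    return max(candidates, key=score)
```

Under these assumptions, two candidates with equal similarity are separated purely by how recently their state was stored, which is the preference the paragraph describes.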

Further, in the present embodiment, a variable weight is set for each input word constituting the input sentence (input utterance information 115) according to the context of that input sentence. For each response candidate rule data 112, the weights of the input words corresponding to the expected input words in that response candidate rule data 112 are accumulated to calculate a similarity parameter indicating the similarity of that response candidate rule data 112 to the input sentence, and a score for that response candidate rule data 112 is calculated according to the similarity parameter. Then, the response candidate rule data 112 having the maximum score among the scores of the response candidate rule data 112 is selected as the response rule data 113. This makes it possible to select the correct response rule data 113 according to the context of the input sentence.
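The accumulation of per-word weights into a similarity parameter can be sketched as below. The normalization by the total input weight is an assumption added here so that the result lies between 0 and 1; the text only states that the weights of matching input words are accumulated. The function name `similarity` is hypothetical.

```python
def similarity(input_weights, expected_words):
    """Accumulate the weights of input words that match expected input words.

    input_weights: dict mapping each input word to its context-dependent weight.
    expected_words: set of expected input words of one candidate rule.
    The sum is divided by the total input weight (an assumed normalization).
    """
    total = sum(input_weights.values())
    if total == 0:
        return 0.0
    matched = sum(w for word, w in input_weights.items() if word in expected_words)
    return matched / total
```

A rule whose expected words cover the heavily weighted input words therefore scores higher than one that matches only lightly weighted words.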

Furthermore, in the present embodiment, response candidate rule data 112 is retrieved from the plurality of rule data 110 in the database 101 based on the result of comparing the expected input words included in each of the plurality of rule data 110 with the plurality of input words in the input word set 111 extracted from the input sentence by morphological analysis. Because the response candidate rule data 112 is retrieved by comparing words with each other in this way, the dialogue rule can be determined based on appropriate words contained in the topic.

In addition, in the present embodiment, when the response rule data 113 is selected from the response candidate rule data 112, the response sentence storage unit 105, which stores the response sentences of a predetermined number of past turns, is referred to so that the same response sentence 114 is not output repeatedly. This prevents the dialogue from becoming monotonous.
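The repetition check against the response sentence storage unit 105 can be sketched with a bounded history. This is a minimal sketch under assumptions: the class name `ResponseHistory`, the default history size of 3, and returning `None` when every candidate was used recently are all hypothetical choices.

```python
from collections import deque


class ResponseHistory:
    """Keeps the last `size` response sentences and skips candidates found in it."""

    def __init__(self, size=3):
        self._recent = deque(maxlen=size)  # oldest entries are dropped automatically

    def choose(self, ranked_responses):
        """ranked_responses: response sentences ordered from best to worst score.

        Returns the best response not used recently, or None if all were used.
        """
        for response in ranked_responses:
            if response not in self._recent:
                self._recent.append(response)
                return response
        return None
```

Using `deque(maxlen=...)` means a response becomes eligible again once enough newer responses have been stored.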

In the present embodiment, a weighting coefficient is set for each input word, but the same weighting coefficient may be set uniformly for all input words.

In the present embodiment, cosine similarity is calculated as the score of each response candidate rule data, and the response rule data 113 is selected from the plurality of response candidate rule data 112 according to its magnitude. However, other similarity calculations used for text matching may be applied.
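For reference, cosine similarity between two weighted bag-of-words vectors can be computed as in the sketch below. Representing the input sentence and a rule's expected input words as word-to-weight dictionaries is an assumption made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two sparse vectors given as dicts word -> weight."""
    dot = sum(w * b.get(word, 0.0) for word, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as having no similarity
    return dot / (norm_a * norm_b)
```

Any other text-matching similarity could be substituted for this function without changing the surrounding selection logic.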

In the present embodiment, the state number coefficient is multiplied by an attenuation coefficient whose value decays more the further in the past the stack state number NSA matching the state of the response candidate rule data 112 was stored among the plurality of stack state numbers NSA stored in the storage unit 104, this state number coefficient is accumulated into the score of the response candidate rule data 112, and the response rule data 113 is selected from the plurality of response candidate rule data 112 according to that score. However, response candidate rule data 112 corresponding to a specific past state may instead be preferentially selected as the response rule data 113 from the plurality of state histories based on another algorithm.

In the present embodiment, the response candidate rule data 112 is selected by comparing the expected input word set defined for each rule data 110 with the input word set 111. However, instead of the expected input word set, for example a word set obtained by morphological analysis of the response sentence in the rule data 110 may be compared with the input word set 111. Other variously defined word sets or sentences may also be compared with the input word set.

In the present embodiment, when the content of the user's utterance is unexpected (when the words of the input word set 111 do not include the words of the expected input word set of any rule data 110), the rules for unexpected input (the wildcard list and the super wildcard list) are referred to. However, when the content is unexpected, the dialogue may instead proceed according to a predetermined rule without referring to them, or nothing may be uttered.

In the present embodiment, when the response rule data 113 is selected from the response candidate rule data 112, the response sentence storage unit 105, which stores the response sentences of a predetermined number of past turns, is referred to so that the same response sentence 114 is not output repeatedly. However, the same response sentence may be output repeatedly according to a predetermined algorithm.

In the present embodiment, the algorithm shown in the flowcharts of FIGS. 6 and 7 has been described as the method of selecting the response candidate rule data 112 corresponding to the input word set 111 from the plurality of rule data 110 in the database 101 and finally selecting the response rule data 113 from the plurality of response candidate rule data 112. However, various algorithms can be adopted on the condition that the response candidate rule data 112 corresponding to a more recently stored stack state number NSA among the plurality of stack state numbers NSA stored in the storage unit 104 is preferentially selected as the response rule data 113. For example, from the stage of searching for the response candidate rule data 112 among the rule data 110 in the database 101, the search may be performed while comparing each state in the plurality of stack state numbers NSA stored in the storage unit 104 with each state in the rule data 112.

In addition to the configuration of the present embodiment described above, a method may be adopted in which the search for the response candidate rule data list 112 is terminated as soon as rule data containing the same state number as the current state number is found, without considering the stack state numbers NSA stored in the storage unit 104.

In the present embodiment described above, the process of step S601 in FIG. 6, which searches for rule data 110 containing an expected input word set included in the input word set, is executed repeatedly until the search of all rules is finished. Alternatively, the search according to the input word set may be performed only for rule data 110 containing a state number that matches either the current state number of the automaton or one of the stack state numbers NSA contained in the stack array.
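The alternative of restricting the search to rules of the current state or of a stacked state can be sketched as a simple pre-filter. Rules are modelled here as dictionaries with a `state` key, and the function name `rules_to_search` is hypothetical.

```python
def rules_to_search(rules, current_state, state_stack):
    """Keep only rules whose state is the current state or appears in the stack."""
    allowed = {current_state, *state_stack}
    return [rule for rule in rules if rule["state"] in allowed]
```

The word-based search would then run only over the returned list, which reduces the number of comparisons when most rules belong to unrelated states.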

Furthermore, in the present embodiment described above, a cosine similarity that reflects the attenuation coefficient is calculated as the score. However, processing may be executed in which the cosine similarity is first calculated for each response candidate rule data 112 and the score is then attenuated using the attenuation coefficient. Also, although the attenuation coefficient is used as a multiplication term, the score calculation formula may be set so that the score is calculated using it as a subtraction term.
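The two ways of applying attenuation after the similarity has been computed can be compared in a short sketch. The parameter values (a factor of 0.8 per step and a penalty of 0.1 per step) and the clamping of the subtractive score at zero are assumptions chosen only for illustration.

```python
def score_multiplicative(similarity, age, decay=0.8):
    """Attenuate by multiplying with a factor that shrinks with the state's age."""
    return similarity * decay ** age


def score_subtractive(similarity, age, penalty=0.1):
    """Attenuate by subtracting a penalty proportional to the state's age.

    The result is clamped at 0.0 so that scores never become negative (assumed).
    """
    return max(0.0, similarity - penalty * age)
```

Here `age` counts how many entries back in the state history the rule's state was stored, with 0 meaning the most recent entry.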

In the embodiment described above, a microphone may be further provided, and the acquisition unit 106 may acquire the input utterance information 115 based on the voice of a predetermined target, for example the user, input via the microphone.
Also, in the embodiment described above, a speaker may be further provided, and the response sentence output unit 103 may output a voice corresponding to the response sentence 114 via the speaker to the predetermined target, for example the user.
With these configurations, the dialogue device 100 according to the present embodiment can be realized, for example, as a robot or as a dialogue application for a smartphone.

In the present embodiment, the dialogue device 100 can also be provided as a computer program executed by a computer having the hardware configuration example of FIG. 2.

In the present embodiment, the input sentence from the user is given as voice data, and the text data of the input sentence is obtained by executing speech recognition on it. However, this is not a limitation, and the text data of the input sentence may be given directly from a network or the like via a mail system, various messaging systems, an SNS system, or the like.

The disclosed embodiments and their advantages have been described in detail above, but those skilled in the art can make various changes, additions, and omissions without departing from the scope of the present invention as clearly set forth in the claims.

In addition, the present invention is not limited to the embodiments described above, and can be modified in various ways at the implementation stage without departing from its gist. The functions executed in the embodiments described above may also be combined as appropriate wherever possible. The embodiments described above include various stages, and various inventions can be extracted by appropriately combining the plurality of disclosed constituent elements. For example, even if some constituent elements are deleted from all the constituent elements shown in the embodiments, a configuration from which those constituent elements are deleted can be extracted as an invention as long as the effect is obtained.

With respect to the above embodiments, the following supplementary notes are further disclosed.
(Appendix 1)
A dialogue device comprising:
a database that stores a plurality of rule data, each including a response sentence and associated with mutually different states of an automaton, and in which transition destination states of the states of the automaton are defined;
acquisition means for acquiring input utterance information input from a predetermined target;
response sentence output means for selecting response rule data from the plurality of rule data according to a predetermined state including the current state of the automaton and the acquired input utterance information, and outputting the response sentence included in the selected response rule data to the predetermined target; and
storage means for sequentially storing states of the automaton,
wherein the predetermined state includes a plurality of states stored in the storage means, and
the response sentence output means preferentially selects, as the response rule data, rule data among the plurality of rule data whose corresponding state of the automaton is the same as a more recently stored state among the plurality of states stored in the storage means.
(Appendix 2)
The dialogue device according to Appendix 1, wherein the response sentence output means
includes data acquisition means for searching for and acquiring, according to the acquired input utterance information, a plurality of response candidate rule data serving as candidates for the response rule data, from among a plurality of rule data that correspond to the predetermined state including the current state of the automaton and that are fewer in number than the plurality of rule data, and
preferentially selects, as the response rule data, response candidate rule data among the acquired plurality of response candidate rule data whose corresponding state of the automaton is the same as a more recently stored state among the plurality of states stored in the storage means.
(Appendix 3)
The dialogue device according to Appendix 1 or 2, wherein the response sentence output means
calculates, for each rule data, a score serving as an index for selecting the response rule data, and calculates the score so that, among the plurality of rule data, the rule data corresponding to a state of the automaton that is the same as a state stored further in the past among the plurality of states stored in the storage means is less likely to be selected as the response rule data.
(Appendix 4)
The dialogue device according to Appendix 3, wherein
each of the plurality of rule data includes an expected input utterance sentence that is expected to be input from the predetermined target and that is associated with the state of the automaton and the response sentence,
the dialogue device further comprises:
extraction means for extracting input words included in the utterance sentence of the predetermined target based on the acquired input utterance information;
weight setting means for setting a weight for each of the extracted input words; and
similarity parameter calculation means for calculating, according to the weight set for each input word, a similarity parameter indicating the similarity of the expected input utterance sentence to the utterance of the predetermined target containing the input words, and
the response sentence output means calculates the score according to the similarity parameter.
(Appendix 5)
The dialogue device according to any one of Appendices 1 to 4, further comprising response sentence storage means for storing the response sentences generated based on the selected response rule data for a predetermined number of past turns,
wherein the response sentence output means selects the response rule data from the response candidate rule data by referring to the response sentence storage means so that the same response sentence is not output repeatedly.
(Appendix 6)
The dialogue device according to any one of Appendices 1 to 5, further comprising a microphone,
wherein the acquisition means acquires the input utterance information based on the voice of the predetermined target input via the microphone.
(Appendix 7)
The dialogue device according to any one of Appendices 1 to 6, further comprising a speaker,
wherein the response sentence output means outputs a voice corresponding to the response sentence to the predetermined target via the speaker.
(Appendix 8)
A dialogue method comprising:
a process of acquiring input utterance information input from a predetermined target;
an output process of using a database that stores a plurality of rule data, each including a response sentence and associated with mutually different states of an automaton, and in which transition destination states of the states of the automaton are defined, selecting response rule data from the plurality of rule data according to a predetermined state including the current state of the automaton and the acquired input utterance information, and outputting the response sentence included in the selected response rule data to the predetermined target; and
a process of sequentially storing the states of the automaton in storage means,
wherein the predetermined state includes a plurality of states stored in the storage means, and
the output process includes a process of preferentially selecting, as the response rule data, rule data among the plurality of rule data whose corresponding state of the automaton is the same as a more recently stored state among the plurality of states stored in the storage means.
(Appendix 9)
A program for causing a computer to execute the dialogue method according to Appendix 8.

100 Dialogue device
101 Database
102 Data acquisition unit
103 Response sentence output unit
104 Storage unit
105 Response sentence storage unit
106 Acquisition unit
107 Extraction unit
110 Rule data
111 Input word set
112 Response candidate rule data
113 Response rule data
114 Response sentence
115 Input utterance information
NSA Stack state number
201 CPU
202 Memory
203 Input device
204 Output device
205 Auxiliary information storage device
206 Medium drive device
207 Network connection device
208 Voice input device
209 Voice output device
210 Bus
211 Portable recording medium

Claims

1. A dialogue device comprising:
acquisition means for acquiring input utterance information input from a predetermined target;
response sentence output means for selecting rule data from a database, which stores a plurality of rule data in each of which a set of a plurality of input words, a response sentence, a state of an automaton, and a state of the automaton to be transitioned to next are associated with one another, according to a predetermined state including the current state of the automaton and a plurality of input words included in the input utterance information acquired by the acquisition means, and outputting the response sentence included in the selected rule data to the predetermined target; and
storage control means for sequentially storing, in storage means, the predetermined state corresponding to the selected rule data,
wherein, when the input utterance information is acquired by the acquisition means after the storage control by the storage control means and a plurality of rule data are candidates for selection, the response sentence output means preferentially selects, from among the plurality of rule data stored in the database, rule data corresponding to the same state as a more recently stored predetermined state among the plurality of predetermined states stored in the storage means.

2. The dialogue device according to claim 1, further comprising data acquisition means for searching for and acquiring a plurality of candidate rule data according to the acquired input utterance information,
wherein the response sentence output means preferentially selects, from among the acquired plurality of candidate rule data, rule data whose corresponding state of the automaton is the same as a more recently stored predetermined state among the plurality of predetermined states stored in the storage means.

3. The dialogue device according to claim 1 or 2, wherein the response sentence output means
calculates, for each rule data, a score serving as an index for selecting the rule data, and calculates the score so that, among the plurality of rule data, the rule data corresponding to a predetermined state that is the same as a state stored further in the past among the plurality of states stored in the storage means is less likely to be selected.

4. The dialogue device according to claim 3, wherein
each of the plurality of rule data includes an expected input utterance sentence that is expected to be input from the predetermined target and that is associated with the predetermined state and the response sentence,
the dialogue device further comprises:
extraction means for extracting input words included in the utterance sentence of the predetermined target based on the acquired input utterance information;
weight setting means for setting a weight for each of the extracted input words; and
similarity parameter calculation means for calculating, according to the weight set for each input word, a similarity parameter indicating the similarity of the expected input utterance sentence to the utterance of the predetermined target containing the input words, and
the response sentence output means calculates the score according to the similarity parameter.

5. The dialogue device according to any one of claims 1 to 4, further comprising response sentence storage means for storing the response sentences generated based on the selected rule data for a predetermined number of past turns,
wherein the response sentence output means selects the rule data by referring to the response sentence storage means so that the same response sentence is not output repeatedly.

6. The dialogue device according to any one of claims 1 to 5, further comprising a microphone,
wherein the acquisition means acquires the input utterance information based on the voice of the predetermined target input via the microphone.

7. The dialogue device according to any one of claims 1 to 6, further comprising a speaker,
wherein the response sentence output means outputs a voice corresponding to the response sentence to the predetermined target via the speaker.

8. A dialogue method executed by a dialogue device, the method comprising:
an acquisition step of acquiring input utterance information input from a predetermined target;
a response sentence output step of selecting rule data from a database, which stores a plurality of rule data in each of which a set of a plurality of input words, a response sentence, a state of an automaton, and a state of the automaton to be transitioned to next are associated with one another, according to a predetermined state including the current state of the automaton and a plurality of input words included in the input utterance information acquired in the acquisition step, and outputting the response sentence included in the selected rule data to the predetermined target; and
a storage control step of sequentially storing, in storage means, the predetermined state corresponding to the selected rule data,
wherein, when the input utterance information is acquired in the acquisition step after the storage control by the storage control step and a plurality of rule data are candidates for selection, the response sentence output step preferentially selects, from among the plurality of rule data stored in the database, rule data corresponding to the same state as a more recently stored predetermined state among the plurality of predetermined states stored in the storage means.

9. A program causing a computer of a dialogue device to execute:
an acquisition step of acquiring input utterance information input from a predetermined target;
a response sentence output step of selecting rule data from a database, which stores a plurality of rule data in each of which a set of a plurality of input words, a response sentence, a state of an automaton, and a state of the automaton to be transitioned to next are associated with one another, according to a predetermined state including the current state of the automaton and a plurality of input words included in the input utterance information acquired in the acquisition step, and outputting the response sentence included in the selected rule data to the predetermined target; and
a storage control step of sequentially storing, in storage means, the predetermined state corresponding to the selected rule data,
wherein, when the input utterance information is acquired in the acquisition step after the storage control by the storage control step and a plurality of rule data are candidates for selection, the response sentence output step preferentially selects, from among the plurality of rule data stored in the database, rule data corresponding to the same state as a more recently stored predetermined state among the plurality of predetermined states stored in the storage means.