JPH01243761A

JPH01243761A - Interacting type voice responding device

Info

Publication number: JPH01243761A
Application number: JP6976788A
Authority: JP
Inventors: Kazuhiro Gomi; 五味　和洋; Yutaka Nishino; 豊西野
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1988-03-25
Filing date: 1988-03-25
Publication date: 1989-09-28
Anticipated expiration: 2012-03-12
Also published as: JP2590193B2

Abstract

PURPOSE: To improve the accuracy of determining a user's silence and the end of a user's utterance by determining the user's silent state according to one threshold after a response message K is sent, and determining the end of the utterance according to another threshold once the user has spoken. CONSTITUTION: When the automatic answering operation is finished, a control part 2 selects the response message with message number n=1 from a response message storing part 6 and sends it to trunk lines L1 and L2. At this time, a user message recording part 7 is activated and message recording starts. A silent state deciding threshold (T_NA^1) is then selected from a threshold storing part 10. The control part 2 assigns T_NA^1 to a threshold (T_NA) and performs the silent state determination. If, on the other hand, the user's voice is detected before T_NA^1 elapses, an utterance end deciding threshold (T_ED^1) corresponding to message number n=1 is extracted from the threshold storing part 10 and assigned to a threshold T_ED. The timer count is then reset and the utterance end determination is performed.

Description

DETAILED DESCRIPTION OF THE INVENTION (Technical field to which the invention pertains) The present invention relates to an interactive voice response device that proceeds by sending an appropriate spoken response message for each voice message from a user. More specifically, it relates to a device that determines that the user has not started speaking in response to a response message sent from the device, and that determines the end of the utterance of a user who has started speaking.

(Prior art) A format in which a device responds turn by turn to voice input from a user (the dialogue format) is close to the way humans talk to each other, and is therefore said to be the best form of man-machine interface.

Taking advantage of this characteristic, a device has appeared that applies the dialogue format to answering machines, whose message recording rate had been low because callers find it difficult to speak to them, with the aim of raising that rate (interactive answering machine: Japanese Patent Application No. Sho 62-12052).

In this interactive answering machine, once the user has started speaking, what is required of the machine is to send the next response message after detecting that the user's utterance has ended.

Normally, when humans converse, they understand what the other person is saying and judge that the utterance has ended by recognizing a break in its content. Realizing this method requires the machine to understand the user's speech in real time, which is difficult given the current state of speech recognition and natural language processing. Instead, the presence or absence of the user's voice is monitored, and the user's utterance is judged to have ended when silence has continued for at least a certain time (the utterance end determination threshold T_ED).

On the other hand, if the user does not start speaking in response to a response message from the machine, the machine is required to act, for example by sending a response message with a different wording or by switching the content of the response message to the next topic.

In a dialogue between humans, each party can judge from the other's facial expression and the like whether the other is about to start speaking. The machine instead relies on a silence determination threshold (T_NA): if the user has not started speaking and silence has lasted longer than T_NA after the response message was sent, the machine assumes that the other party will not start speaking.

FIG. 6 shows a flowchart of the conventional interactive answering machine described above. The machine performs response message sending processing (1), sends the response message (2), and at the same time starts recording the message from the user (3). It also resets the timer count (4) and checks whether a message (voice) from the user is detected (5).

To determine that the user has not started speaking, the timer count value T is compared with the silence determination threshold T_NA (6). If T ≥ T_NA, the machine judges that the user will not start speaking (7) and stops recording (8).

If the voice detection check (5) finds that the user has started speaking, the timer count remains reset (4') and the voice detection result is checked (9). While the utterance continues (voice present), the timer count stays reset. Once the utterance stops and silence begins, the timer count value T is compared with the utterance end determination threshold T_ED (10). If T ≥ T_ED, the machine judges that the user's utterance has ended (11) and stops recording (12).
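The conventional procedure of FIG. 6 can be sketched roughly as follows. This is an illustrative sketch only, not code from the patent: the function name `conventional_turn`, the boolean voice samples, the 100 ms tick, and the millisecond units are all assumptions made for the example.

```python
from typing import Iterable


def conventional_turn(voice_samples: Iterable[bool],
                      t_na_ms: int,
                      t_ed_ms: int,
                      tick_ms: int = 100) -> str:
    """Classify one turn using fixed thresholds, loosely following FIG. 6.

    voice_samples yields True when voice is detected in a tick (hypothetical
    stand-in for the voice detection of step (5)/(9)).
    """
    timer_ms = 0              # timer count T, reset at step (4)
    speaking_started = False
    for voiced in voice_samples:
        if voiced:
            speaking_started = True
            timer_ms = 0      # timer stays reset while voice is present (4')
            continue
        timer_ms += tick_ms   # silence: the timer runs
        if not speaking_started and timer_ms >= t_na_ms:
            return "no_utterance"       # steps (6) to (8)
        if speaking_started and timer_ms >= t_ed_ms:
            return "utterance_ended"    # steps (10) to (12)
    return "undecided"        # samples ran out before either threshold


if __name__ == "__main__":
    print(conventional_turn([False] * 30, t_na_ms=3000, t_ed_ms=1000))
```

Integer milliseconds are used instead of floating-point seconds so that the comparison `timer_ms >= t_na_ms` is exact at the threshold.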

As described above, the determination that the user does not start speaking ((5) to (8)) and the determination that a user who has started speaking has finished ((9) to (12)) are made using the time thresholds T_NA and T_ED, respectively.

Conventionally, the values of T_ED and T_NA were fixed for each device, but in practice they should differ depending on the content of the response message. For example, if the response message is hard to answer, the user needs a long time to think about what to say before starting to speak; conversely, if the question is simple, the time needed before starting to speak is short.

Even once the user has started speaking, if the response message is hard to answer the user speaks while thinking, so the utterance is likely to contain relatively long silences.

Furthermore, if the next response message to be sent is an acknowledgment such as "yes" or "uh-huh", it should be sent with good timing during a short silence in the user's speech, such as a pause for breath. If the next response message changes the topic, however, it should be sent only after the user's message has completely ended. A device with fixed thresholds cannot meet both requirements, which was a problem.

(Object of the invention) The present invention solves the problems of the conventional device described above. Its object is to provide an interactive voice response device with a good man-machine interface by accurately determining that the user does not start speaking in response to a sent response message, or that a user who has started speaking has finished.

(Structure of the invention) (Features of the invention and differences from the prior art) To achieve the above object, the present invention stores in advance, in a threshold storage unit, the optimum silent state determination threshold T_NA^n for use after response message n is sent (n is the response message number, 1 ≤ n ≤ m, where m is the total number of response messages) and the optimum utterance end determination threshold T_ED^n for use after response message n is sent. After response message K (1 ≤ K ≤ m) is sent, the threshold T_NA^K is selected from the threshold storage unit and the user's silent state is determined on that basis. When the user starts uttering a message, the threshold T_ED^K is selected from the threshold storage unit and the end of the utterance is determined on that basis.
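One way to picture the threshold storage unit is a table keyed by response message number. The sketch below is illustrative only: the names `Thresholds`, `THRESHOLD_STORE`, and `select_threshold` and every numeric value are assumptions made for the example, since the patent text gives no concrete durations.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Thresholds:
    t_na_ms: int  # silent state determination threshold T_NA^n
    t_ed_ms: int  # utterance end determination threshold T_ED^n


# Hypothetical values in milliseconds for m = 4 response messages.
THRESHOLD_STORE: Mapping[int, Thresholds] = {
    1: Thresholds(t_na_ms=3000, t_ed_ms=500),
    2: Thresholds(t_na_ms=3000, t_ed_ms=1000),
    3: Thresholds(t_na_ms=3000, t_ed_ms=1000),
    4: Thresholds(t_na_ms=8000, t_ed_ms=3000),
}


def select_threshold(store: Mapping[int, Thresholds],
                     k: int,
                     user_has_started_speaking: bool) -> int:
    """Return T_ED^K once the user has started speaking, otherwise T_NA^K."""
    entry = store[k]  # raises KeyError if K is outside 1..m
    return entry.t_ed_ms if user_has_started_speaking else entry.t_na_ms


if __name__ == "__main__":
    print(select_threshold(THRESHOLD_STORE, 4, user_has_started_speaking=False))
```

Keeping both thresholds in one record per message mirrors the text's description that both are looked up by the number K of the message just sent.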

The prior art uses fixed values for the thresholds T_NA and T_ED that serve as the criteria for judging the user's silence and the end of an utterance, and its interactivity is therefore poor. The present invention differs in that it prepares thresholds T_NA^n and T_ED^n corresponding to the content of the actual response messages and selects the most suitable thresholds T_NA^K and T_ED^K, which gives accurate determination and good interactivity.

(Embodiment) FIG. 1 is a block diagram of an embodiment of the present invention. In the figure, 1 is an incoming call detection unit connected to office lines L1 and L2, 2 is a control unit composed of a microcomputer, 3 is a loop opening/closing unit that opens and closes the DC loop with the telephone line, and 4 is a speech circuit unit connected to office lines L1 and L2 via the loop opening/closing unit 3.

5 is a response message sending unit connected to the transmitting terminals T1 and T2 of the speech circuit unit, 6 is a response message storage unit that is connected to the response message sending unit 5 and stores a plurality of response messages in the order in which they are sent, 7 is a user message recording unit connected to the receiving terminals R1 and R2 of the speech circuit unit 4, 8 is a voice detection unit likewise connected to the receiving terminals R1 and R2 of the speech circuit unit 4, 9 is a timer unit for measuring how long the user's silence continues, and 10 is a threshold storage unit that stores the thresholds (T_NA, T_ED) used for silent state determination and utterance end determination.

FIG. 2 shows an example of the internal configuration of the response message storage unit 6 in FIG. 1, and FIG. 3 shows an example of the internal configuration of the threshold storage unit 10 in FIG. 1.

Next, the operation of this embodiment is described with reference to FIG. 1. When a call arrives, the incoming call detection unit 1 detects it and sends an incoming call signal to the control unit 2. On receiving this signal, the control unit 2 waits a predetermined time, then operates the loop opening/closing unit 3 to close the loop, which completes the automatic answering operation.

The operation after automatic answering is described using the flowchart shown in FIG. 4.

When the automatic answering operation is complete, the control unit 2 selects the response message with message number n = 1 (FIG. 4 (1)) from the response message storage unit 6 (according to FIG. 2, its content is "Yes, this is ○○ Shoji") (FIG. 4 (2)) and sends it from the response message sending unit 5 through the speech circuit unit 4 to office lines L1 and L2 (FIG. 4 (3)). At this time, it activates the user message recording unit 7 to start recording the message of the user, that is, the caller (FIG. 4 (4)), and selects the silent state determination threshold T_NA^1 from the threshold storage unit 10 (FIG. 4 (5)). The control unit 2 then assigns T_NA^1 to the threshold T_NA, resets the timer count according to the flow (FIG. 4 (6)), and performs the silent state determination (FIG. 4 (7)).

If the user's voice is not detected even after T_NA^1 has elapsed, that is, if it is determined that the user has not started speaking and has fallen silent (FIG. 4 (8)), it is presumed that the user could not hear the machine's response message. The operation of the user message recording unit 7 is therefore stopped temporarily (FIG. 4 (9)), and the n = 1 response message is sent again (FIG. 4 (10)).

If the user still does not start speaking after the same response message has been sent twice (FIG. 4 (11)), the machine judges that the user cannot be expected to start speaking however many more times the message is sent, and switches the response message to the next topic (FIG. 4 (12)).

That is, if the user does not start speaking after the n = 1 response message has been sent twice, the topic is switched to the n = 3 response message (according to FIG. 2, "Excuse me, may I ask who is calling?"). If the user does not start speaking after the n = 3 response message has been sent twice, the topic is switched to the n = 4 response message (according to FIG. 2, "We are out at the moment. Please leave your message.").

However, there is no point in sending the acknowledgment "Yes" (n = 2) twice while the user is silent. If the user does not start speaking after this response message is sent, the next response message (n = 3) is sent immediately and the topic is switched.
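The rules for what to send after a silent turn can be condensed into a small function. This is a sketch under stated assumptions rather than the patent's own logic: the names `BACKCHANNEL`, `LAST_MESSAGE`, `MAX_SENDS`, and `next_message_on_silence` are hypothetical, and the behaviour after silence following the last message (resend once, then stop) is an assumption, since the text does not spell it out.

```python
from typing import Optional

BACKCHANNEL = {2}   # message numbers that are acknowledgments ("Yes" in FIG. 2)
LAST_MESSAGE = 4    # total number of response messages m in FIG. 2
MAX_SENDS = 2       # a message is sent at most twice before switching topic


def next_message_on_silence(n: int, times_sent: int) -> Optional[int]:
    """Return the message number to send after a silent turn, or None to stop.

    n is the message just sent; times_sent counts how often it has been sent.
    """
    if n not in BACKCHANNEL and times_sent < MAX_SENDS:
        return n  # presume the caller did not hear it: resend the same message
    # Switch topic: move to the next message that is not an acknowledgment.
    candidate = n + 1
    while candidate in BACKCHANNEL:
        candidate += 1
    # Assumption: when no message remains, the dialogue ends (None).
    return candidate if candidate <= LAST_MESSAGE else None


if __name__ == "__main__":
    print(next_message_on_silence(1, 1), next_message_on_silence(1, 2))
```

Skipping acknowledgments when switching topic reproduces the jump from n = 1 to n = 3 described in the text, and the `BACKCHANNEL` check reproduces the immediate move from n = 2 to n = 3.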

On the other hand, if the user's voice is detected before T_NA^1 elapses, the utterance end determination threshold T_ED^1 corresponding to message number n = 1 of the response message that was sent is extracted from the threshold storage unit 10 (FIG. 4 (13)), T_ED^1 is assigned to T_ED, the timer count is reset according to the flow (FIG. 4 (14)), and the utterance end determination is performed (FIG. 4 (15)).

If, in this state, the user's silence continues for T_ED or longer and the user's message is judged to have ended (FIG. 4 (16)), the operation of the user message recording unit 7 is stopped (FIG. 4 (17)). The n = 2 response message (according to FIG. 2, "Yes") is then selected from the response message storage unit 6 and sent from the response message sending unit 5 through the speech circuit unit 4 to office lines L1 and L2, and the silent state determination threshold T_NA^2 is retrieved from the threshold storage unit 10.
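A single turn of the FIG. 4 flow, with the threshold switched from T_NA^n to T_ED^n once voice is detected, might look like the sketch below. It is illustrative only: `THRESHOLDS_MS`, `run_turn`, the 100 ms tick, and all durations are assumptions, and resetting the timer on every voiced tick is an assumed reading of how the timer count behaves while the user is speaking.

```python
from typing import Iterable, Mapping, Tuple

# Hypothetical per-message thresholds in milliseconds: n -> (T_NA^n, T_ED^n).
THRESHOLDS_MS: Mapping[int, Tuple[int, int]] = {
    1: (3000, 500),
    2: (3000, 1000),
    3: (3000, 1000),
    4: (8000, 3000),
}


def run_turn(n: int,
             voice_samples: Iterable[bool],
             thresholds: Mapping[int, Tuple[int, int]] = THRESHOLDS_MS,
             tick_ms: int = 100) -> str:
    """Classify the turn that follows response message n."""
    t_na_ms, t_ed_ms = thresholds[n]
    threshold_ms = t_na_ms   # FIG. 4 (5): start with T_NA^n
    started = False
    timer_ms = 0             # FIG. 4 (6): reset the timer count
    for voiced in voice_samples:
        if voiced:
            if not started:
                started = True
                threshold_ms = t_ed_ms  # FIG. 4 (13): switch to T_ED^n
            timer_ms = 0                # FIG. 4 (14): reset the timer count
            continue
        timer_ms += tick_ms
        if timer_ms >= threshold_ms:
            # FIG. 4 (16) if the user had spoken, otherwise FIG. 4 (8)
            return "utterance_ended" if started else "no_utterance"
    return "undecided"


if __name__ == "__main__":
    pause = [True] * 3 + [False] * 10   # speech followed by a 1 s pause
    print(run_turn(1, pause), run_turn(4, pause))
```

With these assumed values the same one-second pause ends the turn after message 1 but not after message 4, which is the behaviour the per-message thresholds are meant to produce.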

This operation is then continued until no response messages remain (up to n = 4 according to FIG. 2) (FIG. 4 (18)), after which the line is released and the operation ends.

FIG. 5 shows an example in which the above operation is arranged in time series, focusing on the speech exchanged between the user and the machine.

The thresholds stored in the threshold storage unit 10 have the following relationships.

(a) Silent state determination threshold T_NA. In the situations that follow the first to third response messages (n = 1 to 3), the user says "hello", the name of the person the user wishes to speak to, and the user's own name, respectively. These are either decided before the user places the call or can be uttered naturally out of habit, so the user is expected to start speaking without needing particularly long to think.

The fourth response message, by contrast, asks the user to record their business, so the user must organize it concisely in a short time.

Moreover, the user must do so while taking into account a circumstance unknown before placing the call, namely that the person they wanted is out, so organizing the message is expected to take time.

From the above, T_NA^n (n = 1 to 4) must satisfy

T_NA^1 ≈ T_NA^2 ≈ T_NA^3 < T_NA^4 ... (1)

(b) Utterance end determination threshold T_ED. The second response message is an acknowledgment, so it is desirable to send it with good timing during a short silence in the user's speech. T_ED^1 should therefore be set to a short value.

After the fourth response message is sent, on the other hand, the user must speak while organizing their business as noted above, so the utterance is likely to contain silences caused by thinking. That is, after the fourth response message is sent, the end of the user's utterance cannot be determined reliably unless T_ED is made sufficiently long.

From the above, T_ED^n (n = 1 to 4) must satisfy

T_ED^1 < T_ED^2 ≈ T_ED^3 < T_ED^4 ... (2)
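A stored threshold table can be checked against relations (1) and (2), read here as T_NA^1 ≈ T_NA^2 ≈ T_NA^3 < T_NA^4 and T_ED^1 < T_ED^2 ≈ T_ED^3 < T_ED^4. The sketch below is hypothetical throughout: the function name `satisfies_relations`, the tolerance used to interpret "≈", and the sample durations are assumptions, since the patent text gives no numeric values.

```python
from typing import Mapping


def satisfies_relations(t_na: Mapping[int, int],
                        t_ed: Mapping[int, int],
                        tol_ms: int = 200) -> bool:
    """Check relations (1) and (2) for thresholds indexed by message number.

    "≈" is interpreted as "within tol_ms of each other" (an assumption).
    """
    def approx(a: int, b: int) -> bool:
        return abs(a - b) <= tol_ms

    # Relation (1): T_NA^1 ≈ T_NA^2 ≈ T_NA^3 < T_NA^4
    rel1 = (approx(t_na[1], t_na[2]) and approx(t_na[2], t_na[3])
            and t_na[3] < t_na[4])
    # Relation (2): T_ED^1 < T_ED^2 ≈ T_ED^3 < T_ED^4
    rel2 = (t_ed[1] < t_ed[2] and approx(t_ed[2], t_ed[3])
            and t_ed[3] < t_ed[4])
    return rel1 and rel2


if __name__ == "__main__":
    t_na = {1: 3000, 2: 3000, 3: 3000, 4: 8000}  # hypothetical, in ms
    t_ed = {1: 500, 2: 1000, 3: 1000, 4: 3000}   # hypothetical, in ms
    print(satisfies_relations(t_na, t_ed))
```

A table with identical thresholds for every message, as in the conventional device, fails this check because neither strict inequality holds.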

(Effects of the invention) Because the present invention is configured as described above, an interactive voice response device can accurately determine that the user does not start speaking in response to a sent response message, or that a user who has started speaking has finished. This makes it possible to realize an interactive voice response device with a good man-machine interface.

[Brief description of the drawings]

FIG. 1 is a block diagram of an embodiment of the present invention. FIG. 2 shows an example of the internal configuration of the response message storage unit 6 of FIG. 1, FIG. 3 shows an example of the internal configuration of the threshold storage unit 10 of FIG. 1, FIG. 4 is a flowchart of the operation of the device of FIG. 1, FIG. 5 shows an example over time of the dialogue between the machine and the user, and FIG. 6 is a flowchart showing the determination procedure of the conventional interactive answering machine.

1: incoming call detection unit, 2: control unit, 3: loop opening/closing unit, 4: speech circuit unit, 5: response message sending unit, 6: response message storage unit, 7: user message recording unit, 8: voice detection unit, 9: timer unit, 10: threshold storage unit.

Patent applicant: Nippon Telegraph and Telephone Corporation

Claims

An interactive voice response device characterized in that: an optimum silent state determination threshold T_NA^n for use after response message n is sent (n is the response message number, 1 ≤ n ≤ m, where m is the total number of response messages) and an optimum utterance end determination threshold T_ED^n for use after response message n is sent are stored in advance in a threshold storage unit; after response message K (1 ≤ K ≤ m) is sent, the threshold T_NA^K is selected from the threshold storage unit and the user's silent state is determined on that basis; and when the user starts uttering a message, the threshold T_ED^K is selected from the threshold storage unit and the end of the utterance is determined on that basis.