JPH01243761A - Interacting type voice responding device - Google Patents

Interacting type voice responding device

Info

Publication number
JPH01243761A
JPH01243761A JP6976788A JP6976788A JPH01243761A JP H01243761 A JPH01243761 A JP H01243761A JP 6976788 A JP6976788 A JP 6976788A JP 6976788 A JP6976788 A JP 6976788A JP H01243761 A JPH01243761 A JP H01243761A
Authority
JP
Japan
Prior art keywords
user
threshold
response message
message
response
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP6976788A
Other languages
Japanese (ja)
Other versions
JP2590193B2 (en
Inventor
Kazuhiro Gomi
五味 和洋
Yutaka Nishino
豊 西野
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nippon Telegraph and Telephone Corp
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Priority to JP63069767A priority Critical patent/JP2590193B2/en
Publication of JPH01243761A publication Critical patent/JPH01243761A/en
Application granted granted Critical
Publication of JP2590193B2 publication Critical patent/JP2590193B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Abstract

PURPOSE:To improve accuracy to decide the silence and the end of a user by deciding the silent condition of the user according to a threshold after a responding message K is sent and executing the decision of an uttering end according to the other threshold when there is uttering. CONSTITUTION:When automatic incoming operation is finished, a control part 2 selects a message number n=1 from a responding message storing part 6 and sends the number to trunk lines L1 and L2. At such a timer, a user message recording part 7 is activated and message recording is started. Then, a threshold storing part 10 selects a silent condition deciding threshold (TNA<1>). The control part 2 substitutes the TNA<1> to a threshold (TNA) and executes the decision of the silent condition. On the other hand, when a user voice is detected before the TNA<1> passes, a uttering end deciding threshold (TED<1>) corresponding to the message number n=1 is extracted from the threshold storing part 10 and the TED<1> is substituted to a threshold TED. Then, a timer count is reset and the decision of the uttering end is executed.

Description

【発明の詳細な説明】 (発明の属する技術分野) 本発明は、利用者からの音声メツセージに対して、逐一
適切な71声による応答メツセージを送出し処理を進め
る対話形音声応答装置であって、この装置から送出され
た応答メツセージに対して利用者が発声を開始しないこ
とを判定し1発声を開始した利用者の発声の終了を判定
する装置に関するものである。
DETAILED DESCRIPTION OF THE INVENTION (Technical field to which the invention pertains) The present invention is an interactive voice response device that sends and processes appropriate response messages in 71 voices in response to voice messages from a user. The present invention relates to a device that determines that a user does not start speaking in response to a response message sent from the device, and determines that the user who has started speaking has finished speaking.

(従来の技術) 利用者からの音声入力に対して装置が逐一応答する形式
(対話形式)は、人間同士で話をする場合に近いので、
最もよいマンマシンインタフェースの形態であると1゛
われでいる。
(Prior art) The format in which the device responds point by point to voice input from the user (dialogue format) is similar to when humans talk to each other, so
We believe that this is the best form of man-machine interface.

この特性を利用して、従来は話の難しさから用件録音率
の低かった留守番電話機に対話形式を応用し、用件録音
率の向上を狙った装置も出現している(対話形留守番電
話装置:特願昭62−12052号)。
Taking advantage of this characteristic, devices have been developed that aim to improve the recording rate of messages by applying the dialog format to answering machines, which previously had a low rate of recording messages due to the difficulty of speaking (dialogue type answering machines). Apparatus: Patent Application No. 12052/1982).

この対話形留守番電話装置において、−旦利用者が発声
を開始した場合に機械の動作として要求されるのは、利
用者の発声が終了したことを検出した後に1次の応答メ
ツセージを送出することである。
In this interactive answering machine, when the user starts speaking, the required action of the machine is to send the first response message after detecting that the user has finished speaking. It is.

通常人間同士で会話を行う場合には、相手の発声内容を
理解し、内容的な句切れを′L@識することにより発声
が終了したことを判定しているが、この方法を実現する
には、実時間で利用者の音声を理解する能力を機械が備
えている必要であり、音声認識や自然−3語処理の現状
では実現は困難である。そこで、利用者の音声の有無を
監視し、無言状態がある一定時間(発声終了判定閾値:
Tgo)以上継続した時点で、利用者の発声が終了した
と判定している。
Normally, when humans have a conversation, they understand the content of the other person's utterance and determine when the utterance has ended by recognizing the phrase breaks in the content. This requires the machine to have the ability to understand the user's voice in real time, which is difficult to achieve with the current state of voice recognition and natural three-word processing. Therefore, the presence or absence of the user's voice is monitored, and the user remains silent for a certain period of time (utterance end determination threshold:
It is determined that the user's utterance has ended when the utterance continues for more than Tgo).

一方、機械からの応答メツセージに対して利用者が発声
を開始しない場合には、機械は別の表現の応答メツセー
ジを送出するか、あるいは次の話題へと応答メツセージ
の内容を切り換える等の動作が要求される。
On the other hand, if the user does not start speaking in response to a response message from the machine, the machine may send a response message with a different expression or take actions such as switching the content of the response message to the next topic. required.

人間同士の対話では互いの表情などから相手が発声をυ
i始するか否かを判定できるが、機械の動作としては、
無言判定閾値(TNA)を基に、応答メツセージ送出終
了後に利用者が発声を開始せずに無音状態がTNAより
も長く続いた時点で、相手が発声を開始しないと見なし
ている。
In dialogue between humans, the other person's utterances can be determined based on each other's facial expressions.
It can be determined whether or not to start, but as for the operation of the machine,
Based on the silence determination threshold (TNA), when the user does not start speaking after sending a response message and the silence continues longer than the TNA, it is assumed that the other party does not start speaking.

第6図は上述した従来の対話形留守番電話装置のフロー
チャートを示す。即ち、機械が応答メツセージ送出処理
を行ない(1)、応答メツセージを送出する(2)と同
時に利用者からのメツセージの録音開始を行なう(3)
、また、同時に計時カウントをリセットしく4)、利用
者からのメツセージ(音声)の検出判定を行なう(5)
FIG. 6 shows a flowchart of the conventional interactive answering machine described above. That is, the machine performs a response message sending process (1), and at the same time as sending the response message (2), starts recording the message from the user (3).
, At the same time, the timer count is reset (4), and a message (voice) from the user is detected and judged (5)
.

そして、利用者が発声を開始しない判定は、計時カウン
ト値Tと、無言判定閾値TNAとを比較させ(6)、T
≧TMAなら利用者が発声を開始しないと判断して(7
)、録音を停止する(8)。
Then, to determine that the user has not started speaking, the time count value T and the silence determination threshold TNA are compared (6).
If ≧TMA, it is determined that the user will not start speaking (7
), stop recording (8).

また、前記音声検出結果の判定(5)において、利用者
が発声を開始したときは、前記計時カラン1−はリセッ
トのままであり(4′)、その音声検出結果を判定しく
9)、この発声状態が続行(有音)されていれば、計時
カウントはリセットのままである。もし1発声が終了し
無音状態となり、その計時カウント値Tと発声終了判定
閾値1゛。とを判定しく10)、 ’I”≧T0なら利
用者の発声が終了したと判断して(11)、録音を停止
する(12)。
In addition, in the judgment of the voice detection result (5), when the user starts speaking, the clock ring 1- remains reset (4'), and the voice detection result is not judged (9). If the utterance state continues (sound is heard), the time count remains reset. If one utterance ends and the state becomes silent, the time count value T and the utterance end determination threshold 1. If 'I''≧T0, it is determined that the user's utterance has ended (11), and the recording is stopped (12).

以上のように利用者が発声を開始しないことの判定((
5)〜(8))と、−旦音声を開始した利用者の発声が
終了したことの判定((9)〜(12))を、それぞれ
の時間閾値T、lA、T0を用いて行なっている。
As described above, it is determined that the user does not start speaking ((
5) to (8)) and the determination ((9) to (12)) that the user who started the voice has finished speaking are performed using the respective time thresholds T, lA, and T0. There is.

従来、」二記’rgot TNAの値は各装置で固定の
値を用いていたが、実際には応答メツセージの内容によ
って異なるべきものである0例えば、応答メツセージ内
容が答え難いものであると、利用者は発声開始までに発
声内容を考える時間を長く必要とし、逆に質問内容が簡
単であれば4発声開始までの所用時間は短い。
Conventionally, a fixed TNA value was used for each device, but in reality it should vary depending on the content of the response message.For example, if the content of the response message is difficult to answer, The user needs a long time to think about what to say before starting to speak, and conversely, if the question is simple, the time required to start speaking is short.

利用者が一旦音声を開始した場合にも、送出された応答
メツセージの内容が答え難いものである時には、考えな
がら発声を行うために、発声中に比較的長い無音状態が
含まれる可能性が高い。
Even if the user starts speaking, if the content of the response message sent is difficult to answer, there is a high possibility that there will be a relatively long period of silence during the utterance because the user will think while speaking. .

一方、次に送出すべき応答メツセージが例えば「はい」
、「ええ」などの相槌である場合には、利用者発声中の
息継ぎなど短い無音状態でタイミングよく応答メツセー
ジを送出すべきであるが、次に送出すべき応答メツセー
ジが話題を切り換える作用を持つものである場合には、
利用者メツセージが完全に終了してから応答メツセージ
を送出すべきであるなどの問題点があった。
On the other hand, the next response message to be sent is, for example, "Yes".
, "Yes", etc., the response message should be sent in a well-timed manner during a short period of silence, such as a breather while the user is speaking, but the next response message to be sent has the effect of changing the topic. If it is,
There were problems such as the need to send a response message after the user's message was completely completed.

(発明の目的) 本発明は上述した従来装置の問題点を解消し。(Purpose of the invention) The present invention solves the problems of the conventional device described above.

送出された応答メツセージに対して、利用者が発声を開
始しないこと、あるいは、−旦音声を開始した利用者の
発声が終了したこと、を精度良く判定し、マンマレンイ
ンタフェースのよい対話式音声応答装置を提供すること
を目的とするものである。
To provide an interactive voice response with a good human interface by accurately determining whether the user has not started speaking in response to a sent response message, or whether the user who has started voice has finished speaking. The purpose is to provide a device.

(発明の構成) (発明の特徴と従来技術との差異) 本発明は上記目的を達成するため、閾値格納部に、応答
メツセージnを送出後、最適な無言状庵判定閾値’rN
A”(nは応答メツセージ番号;1≦m、但しmは応答
メツセージの総数)と、前記応答メツセージnを送出後
、最適な発声終了判定閾値T0゜を予め格納しておき、
応答メツセージに(1≦に5m)を送出後には、前記閾
値格納部から閾値’rNAKを選択し、これに基づいて
利用者の無菖状態の判定を行なうとともに、利用者がメ
ツセージの発声を開始した場合には、前記閾値格納部か
ら閾値T!10Kを選択し、これに基づいて発声終了の
判定を行なうことを特徴とする。
(Structure of the Invention) (Characteristics of the Invention and Differences from the Prior Art) In order to achieve the above-mentioned object, the present invention sets the optimum mutenjoan judgment threshold 'rN after sending the response message n to the threshold value storage section.
A” (n is the response message number; 1≦m, where m is the total number of response messages) and an optimal utterance end determination threshold T0° after sending the response message n are stored in advance,
After sending the response message (1≦5m), select the threshold 'rNAK from the threshold storage section, determine whether the user is in an irises state based on this, and the user starts uttering the message. In this case, the threshold value T! is stored in the threshold value storage section. 10K is selected and the end of utterance is determined based on this selection.

従来技術は、利用者の無音状態や発声を開始し終了した
時の判定基準となる閾値T、IA、T0の値を固定とし
たものを用いたため対話性が悪いのに対し、本発明は実
際の応答メツセージの内容に対応した閾値”rNAn、
 ’rgonを用意し、最良の閾値T、、’、、TED
Kを選択して精度よく対話性の良い点が異なる。
The conventional technology has poor interactivity because it uses fixed threshold values T, IA, and T0, which are the criteria for determining when the user is silent or when the user starts and ends speaking, but the present invention actually improves the interactivity. The threshold value "rNAn," corresponding to the content of the response message of
``Prepare rgon and find the best threshold T, ,'', TED
The difference is that K is selected and the accuracy and interactivity are good.

(実施例) 第1図は本発明の一実施例のブロック構成図を示す0図
において、1は局1iALユ、L2に接続される着信検
出部、2はマイクロコンピュータで構成される制御部、
3は電話回線と直流ループの開放/閉結を行うループ開
閉部、4はループ開閉部3を介して局線L□、L2に接
続される通話回路部。
(Embodiment) FIG. 1 is a block diagram of an embodiment of the present invention, in which 1 is an incoming call detection section connected to the station 1iAL unit and L2, 2 is a control section composed of a microcomputer,
Reference numeral 3 denotes a loop opening/closing unit for opening/closing the telephone line and the DC loop, and 4 denotes a communication circuit unit connected to the office lines L□ and L2 via the loop opening/closing unit 3.

5は通話回路部の送話端子”r、、T2に接続される応
答メツセージ送出部、6は応答メツセージ送出部5に接
続され複数の応答メツセージを送出される順に格納する
応答メツセージ格納部、7は通話rjFl路部4の受話
端子R□、R2に接続される利用者メツセージ録音部、
8は同じく通話回路部4の受話端子R□、R2に接続さ
れる音声検出部、9は利用者音声の無音状態の継続を測
定するための計時部、10は無−d状態判定あるいは発
声終了判定を行うための閾値(’rNA、Two)を格
納する閾値格納部である。
Reference numeral 5 denotes a response message sending section connected to the transmitting terminal "r", T2 of the communication circuit section; 6 a response message storage section connected to the response message sending section 5 and storing a plurality of response messages in the order in which they are sent; 7 is a user message recording unit connected to the receiving terminals R□ and R2 of the call rjFl path unit 4;
Reference numeral 8 denotes a voice detection unit which is also connected to the receiving terminals R□ and R2 of the communication circuit unit 4, 9 a timer unit for measuring the continuation of the silence state of the user's voice, and 10 a non-d state determination or the end of utterance. This is a threshold value storage unit that stores threshold values ('rNA, Two) for making a determination.

また、第2図は第1図における応答メツセージ格納部6
の内部構成の一例、第3図は第1図におれる閾値格納部
10の内部構成の一例を示す。
FIG. 2 also shows the response message storage section 6 in FIG.
FIG. 3 shows an example of the internal configuration of the threshold storage section 10 shown in FIG. 1.

次に本実施例の動作を第1図に基づいて説明する。まず
着信があると着信検出部1がこれを検知して制御部2に
着イδ信号を送出する。制御部2はこの着信信号がある
と、所定の時間経過後、ループ開閉部3を動作させてル
ープを開成し、自動着信動作を終了する。
Next, the operation of this embodiment will be explained based on FIG. First, when there is an incoming call, the incoming call detection unit 1 detects this and sends an incoming call δ signal to the control unit 2. When the control section 2 receives this incoming call signal, after a predetermined period of time has elapsed, the control section 2 operates the loop opening/closing section 3 to open a loop, and ends the automatic incoming call operation.

自動着信後の動作は第4図に示したフローチャートを用
いて説明する。
The operation after automatic call reception will be explained using the flowchart shown in FIG.

自動着信動作が終了すると、制御部2は応答メツセージ
格納部6からメツセージ番号n=1(第4 (1))の
応答メツセージ(第2図よりこのメツセージ内容は「は
い、oo商事です」)を選択しく第4図(2))、応答
メツセージ送出部5より通話回路部4を介して、局線り
、、 L2に送出する(第4図(3))。この時、利用
者メツセージ録音部7に起動をかけ利用者すなわち発呼
者のメツセージ録音を開始すると共に(第4図(4))
、閾値格納部10より無i゛状態判定閾値−rNA’を
選択する(第4図(5))。この後制御部2は、閾値T
NAにTNAiを代入し、該フローに従い計時カウント
をリセット(第4図(6)) L、無J状態判定を行う
(第4図(7))。
When the automatic call receiving operation is completed, the control unit 2 receives a response message with message number n=1 (No. 4 (1)) from the response message storage unit 6 (as shown in Figure 2, the content of this message is "Yes, this is oo Shoji"). Selectively (Fig. 4(2)), the response message sending section 5 sends the message to the central office line L2 via the telephone communication circuit section 4 (Fig. 4(3)). At this time, the user message recording section 7 is activated to start recording the message of the user, that is, the caller (Fig. 4 (4)).
, the i' state determination threshold -rNA' is selected from the threshold storage unit 10 (FIG. 4 (5)). After this, the control unit 2 controls the threshold value T
Substitute TNAi for NA and reset the clock count according to the flow ((6) in FIG. 4). L, determine the no-J state ((7) in FIG. 4).

ここで、T’llA”を過ぎても利用者の音声が検出さ
れず利用者が発声を開始しない、即ち利用者が!!冨状
態に陥ったと判定された場合には(第4図(8))、利
用者が電話機の応答メツセージを聞き取れなかったと推
定されるので、利用者メツセージ録音部7の動作を一旦
停止した後(第4図(9))、再度n=1の応答メツセ
ージ送出を行う(第4図(10))。
Here, if the user's voice is not detected and the user does not start speaking even after T'llA'', that is, it is determined that the user has fallen into the!! )), it is presumed that the user could not hear the response message from the telephone, so after temporarily stopping the operation of the user message recording unit 7 ((9) in FIG. 4), the response message of n=1 is transmitted again. (Figure 4 (10)).

また、同一の応答メツセージを2回送出しても(第4図
(lln、利用者が発声が開始しない場合は、その後何
回応答メツセージを送出しても利用者の発声開始は望め
ないと判断し、次の話題へと応答メツセージ内容を切り
換える(第4図(12))。
Furthermore, even if the same response message is sent twice (see Figure 4 (lln), if the user does not start speaking, it is determined that no matter how many times the response message is sent thereafter, the user cannot expect to start speaking. , the content of the response message is switched to the next topic ((12) in FIG. 4).

即ち、n=1の応答メツセージを2回送出しても利用者
の発声が開始されない場合には、n=3の応答メツセー
ジ(第2図よりこのメツセージは「失礼ですがどちら様
でしようか」)に話題を切り換え、n = 3の応答メ
ツセージを2回送出しても利用者の発声が開始されない
場合には、n=4の応答メツセージ(第2図よりこのメ
ツセージは[只今留守にしております。御用件をお結上
さい」)に話題を切り換える。
That is, if the user does not start speaking even after sending the n=1 response message twice, the n=3 response message (from Figure 2, this message is "Excuse me, but which way should I go?"). If the user does not start speaking even after sending the n = 3 response message twice, the n = 4 response message (from Figure 2, this message is ``I am away from home at the moment.''). Please change the topic to "Please complete your request."

但し、「はい」という相槌の応答メツセージ(n = 
2 )は、利用者が無音状態のときに2回繰り返して送
出しても意味がないので、該応答メツセージ送出後利用
者が発声を開始しない場合には、すぐに次の応答メツセ
ージ(n=3)を送出し、話題を切り換える。
However, the reply message “Yes” (n =
2), there is no point in repeating it twice when the user is silent, so if the user does not start speaking after sending the response message, the next response message (n= 3) and change the topic.

一方% T、A’経過以前に利用者音声が検出された場
合には、閾値格納部lOから出力された応答メツセージ
のメツセージ番号n=1に相当する発声終了判定閾値T
01を抽出しく第4図(13))、 Tl1lにT、◎
1を代入し該フローに従い計時カウントをリセット(第
4図(14)) L、、発声終了判定を行う(第4図(
15))。
On the other hand, if the user's voice is detected before the elapse of %T, A', the utterance end determination threshold T corresponding to the message number n=1 of the response message output from the threshold storage unit IO
To extract 01 (Figure 4 (13)), T to Tl1l, ◎
1 and reset the clock count according to the flow (Fig. 4 (14)) L., Determine the end of utterance (Fig. 4 (14)).
15)).

この状態で利用者音声の無音状態が10以上継続し利用
者のメツセージが終了したと判定された場合には(第4
図(16))、利用者メツセージ録音部7の動作を停止
した後に(第4図(17))、応答メツセージ格納部6
からn=2の応答メツセージ(第2図よりこのメツセー
ジ内容は「はい」)を選択し、応答メツセージ送出部5
より、通話回路部4を介して、局線り、、 L、に送出
し、閾値格納部10より無言状態判定閾値TNA2を取
り出す。
In this state, if the user's voice continues to be silent for 10 or more times and it is determined that the user's message has ended,
(16)), after stopping the operation of the user message recording section 7 (FIG. 4 (17)), the response message storage section 6
Select n=2 response messages (from FIG. 2, the content of this message is "Yes") from
Then, the signal is sent to the local office line RI, , , L via the communication circuit section 4, and the silent state determination threshold TNA2 is taken out from the threshold storage section 10.

以後、この動作を、応答メツセージが無くなるまで(第
2図より口=4まで)継続した後(第4図(18))、
回線を開放し動作を終了する。
After that, this operation is continued until there are no more response messages (from Figure 2 until mouth = 4) (Figure 4 (18)),
Release the line and end the operation.

以上の動作状態を利用者、機械間で交わされる音声に着
1=I L 、時系列的に整理した一例が第5図である
FIG. 5 shows an example in which the above operating states are arranged in chronological order based on voices exchanged between the user and the machine.

この時、閾値格納部IOに格納されている各閾値には以
下のような関係がある。
At this time, each threshold value stored in the threshold value storage unit IO has the following relationship.

(ア)無言状態判定閾値TIIA 第1〜3の応答メツセージ(n=1〜3)送出後の各場
面で、利用者はそれぞれ、「もしもし」、「利用者が用
事のある相手の名前」、「利用者名」を話すことになる
。これらは、利用者が電話を掛ける以前に決まっていた
内容あるいは習慣により自然に発声できる内容なので、
特に長い思考時間を必要とせずに発声を開始すると考え
られる。
(A) Silent state determination threshold TIIA In each scene after sending the first to third response messages (n = 1 to 3), the user responds with "Moshi Moshi", "The name of the person with whom the user has business", You will be asked to speak your "user name." These are content that the user has decided on before making the call, or content that can be uttered naturally based on habit.
It is thought that vocalization begins without requiring particularly long thinking time.

一方、第4応答メツセージは、用件を録音することを利
用者に要求しているので、利用者は、用件を短時間のう
ちに要領よくまとめる必要がある。
On the other hand, since the fourth response message requests the user to record the business, the user needs to summarize the business in a short time.

しかも、用件のある相手が留守であるという電話を掛け
る以前には知らなかった状況も加味して用件をまとめな
ければならないために、用件をまとめるには時間がかか
ることが予想される。
Moreover, it is expected that it will take time to compile the matter because the person has to take into account the situation that the person did not know before the call that the person with whom the person has the matter is away. .

以上のことからTNA” (n = 1〜4 )にはT
 MA” 師T IIA” 4 T NA” < T 
IIA’ −−−−−−−−−一−(1)を満たす必要
がある。
From the above, TNA'' (n = 1 to 4) has T
MA” Master T IIA” 4 T NA” < T
It is necessary to satisfy IIA'--(1).

(イ)発声終了判定閾値TgD 第2応答メツセージは相槌なので、利用者音声の短い無
音状態でタイミングよく送出することが望ましい、この
ことから、T、Dlは、短い値に設定するべきである。
(B) Utterance end determination threshold TgD Since the second response message is a response, it is desirable to send it out in a timely manner with a short silence of the user's voice. For this reason, T and Dl should be set to short values.

一方、第4応答メツセージ送出後は、上記のように利用
者は用件をまとめながら発声をしなければならないため
に、発声中に思考に起因する無音状態が含まれる可能性
が高い、すなわち、第4応答メツセージ送出後には、T
oを十分に長くしなければ、利用者の発声が終了したこ
とを確実に判定することはできない。
On the other hand, after the fourth response message is sent, since the user has to speak while summarizing the matter as described above, there is a high possibility that silence due to thoughts will be included while speaking. After sending the fourth response message, T
Unless o is made sufficiently long, it is not possible to reliably determine that the user has finished speaking.

以上のことからTwo” (n = 1〜4 )にはT
 go’ < T go” ’F T go3< ’r
 go’ −−−−−−−−−−−(2)を満たす必要
がある。
From the above, T
go'< T go” 'F T go3 <'r
go' ------------- (2) must be satisfied.

(発明の効果) 以上説明したように1本発明は構成されているので、対
話式音声応答装置において、送出された応答メツセージ
に対して利用者が発声を開始しないこと、あるいは、−
旦発声を開始した利用者の発声が終Yしたことを精度よ
く判定できるため、マンマシンインタフェースのよい対
話式音声応答装置の実現が可能になる。
(Effects of the Invention) Since the present invention is configured as described above, in the interactive voice response device, the user does not start speaking in response to the sent response message, or -
Since it is possible to accurately determine that the user who has started speaking has finished speaking, it is possible to realize an interactive voice response device with a good man-machine interface.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例のブロック構成図。 第2図は第1図の応答メツセージ格納部6の内部構成の
一例、第3図は第1図の閾値格納部10の内部構成の一
例、第4図は第1図の動作処理フローチャート、第5図
は機械と利用者との間で行なわれる対話の経時的な一例
、第6図は従来の対話形留守番電話装置の判定手順を示
すフローチャートである。 1・・・着信検出部、2・・・制御部、3・・・ループ
開閉部、4・・・通話回路部、 5 ・・・応答メツセージ送出部、6 ・・・応答メツ
セージ格納部、7 ・・・利用者メツセージ録音部、8
 リ・音声検出部、9 ・・・計時部、10・・・閾値
格納部。 特許出願人 日本電信電話株式会社 第3図 5図 AnQ−
FIG. 1 is a block diagram of an embodiment of the present invention. 2 is an example of the internal configuration of the response message storage section 6 of FIG. 1, FIG. 3 is an example of the internal configuration of the threshold value storage section 10 of FIG. 1, and FIG. FIG. 5 is an example of the dialogue that takes place between the machine and the user over time, and FIG. 6 is a flowchart showing the determination procedure of a conventional interactive answering machine. DESCRIPTION OF SYMBOLS 1... Incoming call detection unit, 2... Control unit, 3... Loop opening/closing unit, 4... Call circuit unit, 5... Response message sending unit, 6... Response message storage unit, 7 ...User message recording department, 8
- Voice detection unit, 9... Time measurement unit, 10... Threshold value storage unit. Patent applicant Nippon Telegraph and Telephone Corporation Figure 3 Figure 5 AnQ-

Claims (1)

【特許請求の範囲】[Claims] 閾値格納部に、応答メッセージnを送出後、最適な無言
状態判定閾値T_H_A^n(nは応答メッセージ番号
:1≦m、但しmは応答メッセージの総数)と、前記応
答メッセージnを送出後、最適な発声終了判定閾値T_
E_D^nを予め格納しておき、応答メッセージK(1
≦K≦m)を送出後には、前記閾値格納部から閾値T_
N_A^Kを選択し、これに基づいて利用者の無言状態
の判定を行なうとともに、利用者がメッセージの発声を
開始した場合には、前記閾値格納部から閾値T_E_D
^Kを選択し、これに基づいて発声終了の判定を行なう
ことを特徴とする対話形音声応答装置。
After sending the response message n to the threshold storage unit, set the optimum silent state determination threshold T_H_A^n (n is the response message number: 1≦m, where m is the total number of response messages) and after sending the response message n, Optimal utterance end determination threshold T_
E_D^n is stored in advance, and the response message K(1
≦K≦m), the threshold value T_
N_A^K is selected, and based on this, the user's silent state is determined. When the user starts uttering a message, the threshold value T_E_D is selected from the threshold value storage section.
An interactive voice response device characterized by selecting ^K and determining the end of utterance based on this.
JP63069767A 1988-03-25 1988-03-25 Interactive voice response device Expired - Lifetime JP2590193B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63069767A JP2590193B2 (en) 1988-03-25 1988-03-25 Interactive voice response device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63069767A JP2590193B2 (en) 1988-03-25 1988-03-25 Interactive voice response device

Publications (2)

Publication Number Publication Date
JPH01243761A true JPH01243761A (en) 1989-09-28
JP2590193B2 JP2590193B2 (en) 1997-03-12

Family

ID=13412278

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63069767A Expired - Lifetime JP2590193B2 (en) 1988-03-25 1988-03-25 Interactive voice response device

Country Status (1)

Country Link
JP (1) JP2590193B2 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61189057A (en) * 1985-02-16 1986-08-22 Nippon Telegr & Teleph Corp <Ntt> Interactive automatic answering telephone set
JPS6346040A (en) * 1986-08-13 1988-02-26 Matsushita Electric Ind Co Ltd Automatic answering telephone set
JPS6345950A (en) * 1986-04-11 1988-02-26 Nippon Telegr & Teleph Corp <Ntt> Conversation type voice response device
JPH01189265A (en) * 1988-01-25 1989-07-28 Pioneer Answerphone Mfg Corp Automatic answering telephone set

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61189057A (en) * 1985-02-16 1986-08-22 Nippon Telegr & Teleph Corp <Ntt> Interactive automatic answering telephone set
JPS6345950A (en) * 1986-04-11 1988-02-26 Nippon Telegr & Teleph Corp <Ntt> Conversation type voice response device
JPS6346040A (en) * 1986-08-13 1988-02-26 Matsushita Electric Ind Co Ltd Automatic answering telephone set
JPH01189265A (en) * 1988-01-25 1989-07-28 Pioneer Answerphone Mfg Corp Automatic answering telephone set

Also Published As

Publication number Publication date
JP2590193B2 (en) 1997-03-12

Similar Documents

Publication Publication Date Title
US7324636B2 (en) Multiple voice channel communications
US7665024B1 (en) Methods and apparatus for controlling a user interface based on the emotional state of a user
JPS6395532A (en) Control method for voice guidance output
EP2540133B1 (en) Switching off dtx for music
JP2001056696A (en) Method and device for voice storage and reproduction
JPH01243761A (en) Interacting type voice responding device
JPH01219893A (en) Adaptive voicing end detecting method
JPS61189057A (en) Interactive automatic answering telephone set
JPS6345950A (en) Conversation type voice response device
JPH0519734B2 (en)
JP2556978B2 (en) Interactive answering machine
JPH03276947A (en) Interactive automatic answering telephone set
JPS63257367A (en) Voice packet communication method
JPH01227557A (en) Automatic answering telephone system
KR100228916B1 (en) Method for providing audible alarm sound when recording message in vms
JP3561609B2 (en) Voice switch
JPH037119B2 (en)
JP2590119B2 (en) Silent phone repulsion device
JP2635970B2 (en) Answering machine
JPS62132467A (en) Voice message actuating system
Schmandt Understanding Speech Without Recognizing Words
JPS6253055A (en) Automatic answering telephone set
JPH03150950A (en) Automatic answering telephone set
JPH04120927A (en) Sound detector
JPH07312640A (en) Response message collecting and transmitting device

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20071205

Year of fee payment: 11

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081205

Year of fee payment: 12

EXPY Cancellation because of completion of term
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081205

Year of fee payment: 12