JPH0588480B2

JPH0588480B2 -

Info

Publication number: JPH0588480B2
Application number: JP59021115A
Authority: JP
Inventors: Masao Watari; Takao Watanabe
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1984-02-08
Filing date: 1984-02-08
Publication date: 1993-12-22
Also published as: JPS60165699A

Description

【発明の詳細な説明】＜産業上の利用分野＞本発明は、多数の利用者回線に対して音声認識
による音声入力と音声応答による音声出力を行う
音声入出力装置に関する。DETAILED DESCRIPTION OF THE INVENTION <Field of Industrial Application> The present invention relates to a voice input/output device that performs voice input by voice recognition and voice output by voice response to a large number of user lines.

＜従来技術＞音声認識による音声入力と音声応答による音声
出力を行う音声入出力装置は電話などによる利用
者からの問合せシステムに利用されている。この
問合せシステムでは第１表に示すように利用者か
らの音声入力により問合せ内容をシステムが自動
的に識別し、その問合せの解答を音声出力により
利用者へ通知している。このような問合せシステ
ムでは、多数の利用者へ同時に音声入力と音声出
力のサービスを行う必要がある。<Prior Art> A voice input/output device that performs voice input through voice recognition and voice output through voice response is used in a system for making inquiries from users via telephone or the like. In this inquiry system, as shown in Table 1, the system automatically identifies the content of the inquiry based on voice input from the user, and notifies the user of the answer to the inquiry through voice output. In such an inquiry system, it is necessary to simultaneously provide voice input and voice output services to a large number of users.

第１表利用者：システムへ電話するシステム：“こちらはテレホンサービスです。サ
ービスコードをどうぞ。ピー” 利用者：イチシステム：“ピー” 利用者：ニーシステム：“残高問合せですね。あなたの口座番
号をどうぞ。ピー” 利用者：イチシステム：“ピー” 利用者：ニーシステム：“ピー” 利用者：サンシステム：“あなたの口座番号は123ですね。ピ
ー” 利用者：はいシステム：“あなたの残高は12300円です。どうも
ありがとうございました。” 従来、多数の利用者回線に対する音声入出力装
置では、利用者回線数ｍと同数のｍ個の音声入力
を同時処理できる認識部と、ｍ個の音声出力を同
時処理できる応答部を使用していた。しかし、認
識部は応答部に比較し処理が複雑であり、かつ多
くのメモリを必要としているため、装置は大形と
なる欠点があつた。 Table 1 User: Calling the system System: “This is a telephone service. Please enter your service code. P” User: First system: “P” User: Nice System: “Inquiring about your balance. Your account number. User: Ichi System: “P” User: Ni System: “P” User: Sun System: “Your account number is 123. P” User: Yes System: “Your The balance is 12,300 yen. Thank you very much.” Conventionally, voice input/output devices for a large number of user lines have a recognition unit that can simultaneously process m voice inputs, the same number as the number of user lines, and a It used a response unit that could simultaneously process audio output. However, the recognition section has more complicated processing than the response section and requires a larger amount of memory, resulting in a larger device.

一方、特開昭56−131250号明細書に認識部を小
形にした音声入出力装置が記載されている。この
装置の概略は次の通りである。利用者は音声入力
と音声出力を使用するが、音声出力を行つている
時間は、音声入力を行なわず認識部にとつて空き
時間となる。この空き時間を他の利用者回線の音
声入力処理に利用するため、利用者回線と認識部
の間に切換スイツチを設け、音声入力要求のある
利用者回線のみを認識部に接続する。しかし、認
識部の同時処理回線はｍより小さなｎ個であるた
め、音声入力要求が同時にｎ個以上とならないよ
う制御する必要がある。このため、認識部の空き
回線がある時のみ、言いかえれば空き回線がない
時は空き回線ができるまで待つた後、利用者へ音
声入力の発声をうながす旨の音声出力（音声入力
促進音の出力）を行い、利用者の音声入力を受け
付けている。これによつて、ｍより小さなｎ個の
音声入力を同時処理行う認識部によりｍ個の利用
者に対して音声入力と音声出力を行うことが可能
となつた。 On the other hand, Japanese Patent Application Laid-open No. 131250/1983 describes a voice input/output device in which a recognition section is made small. The outline of this device is as follows. Although the user uses voice input and voice output, the time during which voice output is performed is idle time for the recognition unit without voice input. In order to utilize this free time for voice input processing on other user lines, a changeover switch is provided between the user line and the recognition unit, and only the user line with a voice input request is connected to the recognition unit. However, since the number of simultaneous processing lines of the recognition unit is n, which is smaller than m, it is necessary to control so that the number of voice input requests does not exceed n at the same time. Therefore, only when there is a free line for the recognition unit, or in other words, if there is no free line, the system waits until a free line becomes available, and then outputs a voice prompting the user to utter the voice input (voice input prompt sound). output) and accept voice input from the user. As a result, it has become possible to perform voice input and voice output for m users using a recognition unit that simultaneously processes n voice inputs smaller than m.

しかし、前記多回線音声入出力装置では、利用
者は音声入力促進音を聞いた後、音声入力を行な
わなければならない。このため未熟練者または音
声入出力装置に不慣れな者は、合図を待ちきれず
に発声することがおこりやすい。合図の前に発声
した音声は認識処理が開始されていないため、語
頭が脱落して誤認識を生じたり、全く処理されず
に入力未受理となる欠点があつた。 However, in the multi-line audio input/output device, the user must perform voice input after listening to the voice input prompting sound. For this reason, unskilled users or those who are unfamiliar with voice input/output devices are likely to impatiently wait for a signal and start speaking. Since the recognition process has not started for the voice uttered before the signal, the beginning of the word may be dropped, resulting in erroneous recognition, or the voice may not be processed at all and the input may not be accepted.

前記合図を待ちきれずに発声した音声の誤認識
または入力未受理を防ぐ方法が本願と同一出願人
による特願昭58−44741号明細書に記載されてい
る。この発明による音声入力装置は、音声検出信
号と音声入力要求を入力とし、認識スタート信号
と音声入力促進要求を出力する入力指令部を持
ち、音声入力促進要求時に音声検出信号が出力さ
れていた場合音声の終端検出時に再度音声入力促
進要求を出力し認識部を再スタートさせている。
すなわち、合図を待ちきれずに発声した場合、音
声入力促進要求時に音声検出信号が出力されてお
り、その音声の終端検出時に入力指令部より再び
音声入力促進要求と認識スタート信号が発せら
れ、待ちきれずに発声した音声は無視され、次に
発声された音声を認識する。これにより、待ちき
れずに発声した音声の誤認識または入力未受理を
防ぐことができる。 A method for preventing erroneous recognition of voices uttered without waiting for the signal or non-acceptance of input is described in Japanese Patent Application No. 1982-44741 filed by the same applicant as the present application. The voice input device according to the present invention has an input command unit that inputs a voice detection signal and a voice input request and outputs a recognition start signal and a voice input promotion request, and when the voice detection signal is output when the voice input promotion request is made. When the end of the voice is detected, a voice input promotion request is output again to restart the recognition unit.
In other words, if you can't wait for the cue and speak, the voice detection signal is output when the voice input promotion request is made, and when the end of the voice is detected, the input command unit issues the voice input promotion request and recognition start signal again. Voices that are uttered incorrectly are ignored, and the next voice that is uttered is recognized. This can prevent erroneous recognition of voice uttered without waiting or non-acceptance of input.

しかし、特願昭58−44741による音声入力装置
は１回線に対する音声入力装置であり、特開昭56
−131250による多回線音声入出力装置の認識部と
して使用した場合、切換スイツチにより利用者回
線が切換えられるため利用者回線の音声を常時監
視できず合図を待ちきれずに発声した音声の検出
が正確に行えないため誤認識または入力未受理を
防ぐことができない。このため、特願昭58−
44741による音声入力部を持つ多回線音声入出力
装置には、切換スイツチを用いることはできず、
やはりｍ個の音声を認識する認識部が必要となり
装置が大形となる欠点があつた。 However, the voice input device according to Japanese Patent Application No. 58-44741 is a voice input device for one line;
- When used as a recognition unit for a multi-line audio input/output device using the 131250, the user's line is switched by the switch, so the voice of the user's line cannot be constantly monitored, and the voice uttered by impatient waiting for a signal cannot be accurately detected. It is not possible to prevent erroneous recognition or non-acceptance of input. For this reason, a special request for
A switch cannot be used for a multi-line audio input/output device that has an audio input section based on 44741.
As expected, a recognition section for recognizing m voices is required, which results in a large device.

＜発明の目的＞本発明の目的は、音声認識処理部の前段に切換
えスイツチを設け音声認識処理部を時分割多重使
用しその使用するタイミングを音声出力により利
用者へ知らせる音声入出力装置に、利用者が合図
を待ちきれずに発声した音声を検出する検出部と
前記音声が検出された時再び音声入力促進要求を
出力する入力指令部を合せ持つことにより、合図
に対する同期にそれ程神経を使わなくて済む使用
し易すい音声入力が、多数の利用者に対して行う
ことのできる小形の音声入出力装置を提供するこ
とにある。<Objective of the Invention> The object of the present invention is to provide a voice input/output device which is provided with a changeover switch before the voice recognition processing section, uses the voice recognition processing section in a time division multiplex manner, and notifies the user of the timing of use by voice output. By having a detection unit that detects the voice uttered by the user impatiently waiting for a cue, and an input command unit that outputs a voice input promotion request again when the voice is detected, the user does not have to be so sensitive to synchronize with the cue. To provide a small-sized voice input/output device that can perform easy-to-use voice input for a large number of users.

＜発明の構成＞本発明による多回線音声入出力装置は、ｍ（正
の整数）個の利用者回線と、前記ｍ個の利用者回
線へ音声出力を行う応答部と、前記ｍ個の利用者
回線よりの音声入力を検出する検出部と、前記検
出部のｍ個の音声パタンの出力をｍより小さなｎ
（正の整数）個の認識部入力回線へ接続する切換
えスイツチと、前記切換えスイツチの出力を受け
るｎ個の入力回線の音声を認識する認識部と、認
識部の空き回線の存在を検知した時にのみ通話状
態にある利用者回線を前記認識部の空き回線に接
続し、音声入力要求を出力する制御部と、前記音
声入力要求を受信した時刻以前に前記検出部にて
音声が検出されていないとき利用者回線に発声を
うながす旨の音声出力を行い認識部をスタートさ
せるよう指令し、音声が検出されていたときその
音声の終端を検出した時刻に利用者回線に発声を
うながす旨の音声出力を行い認識部をスタートさ
せるように指令する入力指令部を有している。<Structure of the Invention> A multi-line audio input/output device according to the present invention includes m (positive integer) user lines, a response unit that outputs audio to the m user lines, and a response unit that outputs audio to the m user lines. a detection unit that detects audio input from a user line; and a detection unit that detects the audio input from the
(a positive integer) recognition unit; a recognition unit that recognizes the audio of the n input lines that receive the output of the changeover switch; a control unit that connects a user line that is only in a talking state to an idle line of the recognition unit and outputs a voice input request, and a voice is not detected by the detection unit before the time when the voice input request is received; When a voice is output to the user line to prompt the user to speak, the recognition unit is commanded to start, and when voice is detected, at the time when the end of the voice is detected, a voice is output to the user line to prompt the user to voice. It has an input command unit that instructs the recognition unit to perform the following steps and start the recognition unit.

＜本発明の作用・原理＞本発明による多回線音声入出力装置は、上位シ
ステムよりの音声入力コマンド，音声出力コマン
ドに従つて音声入力と音声出力をｍ個の利用者回
線に対して行う。具体的な動作を第１図を参照し
ながら説明する。始めに音声出力コマンドを受け
た制御部は応答部に対して“口座番号をどうぞ”
という音声出力を利用者回線ｍに行うよう指示す
る。音声出力が終了すると終了したことを制御部
を介して上位システムへ通知する。通知を受けた
上位システムは、音声入力コマンドを制御部に出
力する。制御部は認識部の空き入力回線を捜し、
見つけた時点で切換えスイツチにより要求のあつ
た利用者回線を前記空き認識部入力回線へ接続す
る。さらに入力指令部へ音声入力要求を通知す
る。入力指令部は音声入力要求を受取るとすでに
音声が検出されていたか判定し、検出されていな
い場合は音声入力促進音（例えば「ピー」）の出
力を応答部に指示し、前記接続された認識部入力
回線の認識処理開始を認識部へ指示する。もし、
音声入力要求を受けた時刻においてその時刻以前
に前記利用者回線からの音声が検出されていた場
合は、利用者が、音声入力促進音の前に音声を発
声していたことになるため、その音声の終端を検
出した時刻に再び音声入力促進音の出力を応答部
に指示し、認識処理の再スタートを認識部へ指示
する。<Operation/Principle of the Present Invention> The multi-line audio input/output device according to the present invention performs audio input and output to m user lines in accordance with audio input commands and audio output commands from a host system. The specific operation will be explained with reference to FIG. The control unit, which first receives the voice output command, tells the response unit, “Please give me your account number.”
This command instructs the user line m to output the voice. When the audio output ends, the host system is notified of the end via the control unit. The higher-level system that received the notification outputs a voice input command to the control unit. The control unit searches for a free input line of the recognition unit,
When found, a changeover switch connects the requested user line to the input line of the vacant recognition section. Furthermore, it notifies the input command section of the voice input request. Upon receiving the voice input request, the input command unit determines whether voice has already been detected, and if no voice has been detected, instructs the response unit to output a voice input prompting sound (for example, "beep"), and Instructs the recognition unit to start recognition processing for the input line. if,
If the voice from the user's line was detected before the time when the voice input request was received, it means that the user had uttered the voice before the voice input prompt sound, so At the time when the end of the voice is detected, the response unit is again instructed to output the voice input promotion sound, and the recognition unit is instructed to restart the recognition process.

＜実施例＞本発明について第２図に示した実施例を示すブ
ロツク図に基づきさらに詳細に説明する。第２図
に示すように本発明による多回線音声入出力装置
の実施例は、利用者回線１１，１２，…，１ｍ，
認識部入力回線２１，２２，…，２ｎ，音声検出
部３，切換えスイツチ４，認識部５，応答部６，
入力指令部７，制御部８より構成される。<Example> The present invention will be described in more detail based on a block diagram showing an example shown in FIG. As shown in FIG. 2, the embodiment of the multi-line audio input/output device according to the present invention has user lines 11, 12, ..., 1m,
Recognition unit input lines 21, 22, ..., 2n, voice detection unit 3, changeover switch 4, recognition unit 5, response unit 6,
It is composed of an input command section 7 and a control section 8.

切換スイツチ４は、例えば特開昭56−131250号
明細書の第４図または第５図または第７図に記載
された構成を取ることができる。すなわち、切換
スイツチ４は利用者回線１１，１２，…，１ｍを
入力とし、ｎ個の認識部入力回線２１，２２，
…，２ｎを出力としており、制御部８の制御によ
り利用者回線１１，１２，…，１ｍの中より音声
入力を要求している回線のみを認識部入力回線２
１，２２，…，２ｎへ接続する。応答部６はたと
えば刊行物アイ・イー・イー・イートランザクシ
ヨンズオンエー・エス・エス・ピー（IEEE
Transactions on ASSP）の1974年11月号の第
339頁より第352頁までの論文「アマルチライン
コンピユーターボイスレスポンスシステム
ユーテイライズイングエーデーピーシーエム
コーデイドスピーチ（ａ Multiline
Conputer Voice Response System Utilizing
ADPCM Coded Speech）」の第340頁の第２図
に記載された構成のものを用いることができる。
すなわち、応答部６は制御部８または入力指令部
７の指示に従つてｍ個の利用者回線へ「こちらは
テレホンセンタです。サービスコードをどうぞ」
という文章や「ピー」という音声入力促進音を出
力する。音声認識装置は、たとえば、刊行物プロ
シイーデイングズオブジアイ・イー・イ
ー・イー（PROCEEDINGS OF THE IEEE）
の1976年４月号の第405頁より第415頁までの論文
「プラクテイカルアプリケーシヨンズオブ
ボイスインプツトツーマシンズ
（Practical Applications of Voice Input to
Machines）」の第５図に示された構成のものを用
いることができる。すなわち、第３図に示すよう
に音声を分析して、例えばパワー情報と特徴ベク
トル（スペクトラム情報）を抽出する前処理部と
音声がどの時刻で発声されたかを検出する音声検
出部と、入力された音声の特徴を抽出する特徴抽
出部と、その特徴より発声された音声が何である
か識別する識別部より構成される。 The changeover switch 4 can take the configuration shown in, for example, FIG. 4, FIG. 5, or FIG. 7 of JP-A-56-131250. That is, the changeover switch 4 inputs the user lines 11, 12, .
..., 2n as outputs, and under the control of the control unit 8, only the line requesting voice input from among the user lines 11, 12,..., 1m is selected as the recognition unit input line 2.
Connect to 1, 22,..., 2n. The response section 6 is, for example, the publication IE Transactions on APS (IEEE
Transactions on ASSP), November 1974 issue.
The paper from pages 339 to 352, ``A Multiline Computer Voice Response System Utilizing ADM Coded Speech (a Multiline
Computer Voice Response System Utilizing
The configuration shown in FIG. 2 on page 340 of ``ADPCM Coded Speech'' can be used.
That is, the response unit 6 follows instructions from the control unit 8 or the input command unit 7 to send messages to the m user lines saying, ``This is the telephone center. Please give me the service code.''
It outputs the sentence ``Beep'' and the voice input promotion sound ``Beep''. Speech recognition devices can be used, for example, in the publication PROCEEDINGS OF THE IEEE.
From pages 405 to 415 of the April 1976 issue of
Practical Applications of Voice Input to Machines
The structure shown in FIG. 5 of ``Machines'' can be used. That is, as shown in FIG. 3, there is a pre-processing section that analyzes speech and extracts, for example, power information and feature vectors (spectrum information), a speech detection section that detects at what time the speech was uttered, and The system consists of a feature extracting section that extracts the features of the voice uttered, and an identifying section that identifies the type of voice that was uttered based on the features.

本発明では第２図に示した如く前処理部と音声
検出部を特徴抽出部と識別部より切りはなし、そ
の間に切換えスイツチ４を置いている。すなわ
ち、利用者回線ｉの音声は音声検出部３の入力回
線１ｉへ入力され音声の発声された時刻を検出し
その結果を入力指令部７へ送る。さらに利用者回
線ｉの音声は切換えスイツチ４を通過し、認識部
入力回線２ｊへ入力される。認識部ではｎ個の音
声の特徴抽出と識別の処理の実行が可能である。
認識部入力回線２ｊより入力された音声は特徴を
抽出し識別を行いその結果を制御部８へ出力す
る。 In the present invention, as shown in FIG. 2, the preprocessing section and the voice detection section are separated from the feature extraction section and the identification section, and a changeover switch 4 is placed between them. That is, the voice of the user line i is input to the input line 1i of the voice detection section 3, the time at which the voice is uttered is detected, and the result is sent to the input command section 7. Furthermore, the voice of the user line i passes through the changeover switch 4 and is input to the recognition unit input line 2j. The recognition unit can perform feature extraction and identification processing for n voices.
The voice input from the recognition unit input line 2j is extracted and identified, and the results are output to the control unit 8.

音声検出部３は、第４図に示すように前処理部
３１にてｍ個の利用者回線の音声を分析し、その
分析結果を切換えスイツチ４へ出力する。さらに
前処理部３１よりｍ個の利用者回線の音声パワー
情報が、マイクロプロセツサ部３２のパワー情報
入力部３２１へ送られる。マイクロプロセツサ部
３２は入力部３２１、検出情報出力部３２２、
CPU３２３、メモリ３２４より構成され、第５
図に示したフローチヤートに従つてｍ個の音声の
始端時刻，終端時刻を求める。すなわち、音声の
始端検出では、第５図ａに示すように子韻レベル
Th１を越えた時刻t_s以後さらに母音レベルTh２
を越えた場合、t_sを始端時刻としている。一方、
第５図ｂに示すように音声の終端検出では、子韻
レベルTh１を下がつた時刻t_e以後LE時間Th１以
下のレベルであつた時にt_eを終端時刻としてい
る。前記した始端時刻t_s，終端時刻t_eは検出情報
出力部３２２より入力指令部７へ通知される。こ
こで第５図のFLAGは一定時刻ごとに１にセツト
されるもので、一定時刻ごとのパワーPiより音声
検出を行つている。また、Diはパワーの状態で、
Di＝０，１，２は各々無音，始端候補が検出さ
れた、母音が検出され始端が、確定した状態であ
る。以上述べた音声検出部の動作例を図示する
と、第６図に示した如くとなる。第５図におい
て、Pi，Si，Eiは利用者回線ｉのパワー，始端時
刻，終端時刻を示す。 As shown in FIG. 4, the voice detecting section 3 analyzes the voices of m user lines using a preprocessing section 31, and outputs the analysis results to the changeover switch 4. Furthermore, the voice power information of m user lines is sent from the preprocessing section 31 to the power information input section 321 of the microprocessor section 32. The microprocessor section 32 includes an input section 321, a detection information output section 322,
Consisting of a CPU 323 and a memory 324, the fifth
The start and end times of m voices are determined according to the flowchart shown in the figure. In other words, when detecting the beginning of a speech, the assonance level is determined as shown in Figure 5a.
After the time t _s exceeding Th1, the vowel level Th2
If it exceeds , t _s is taken as the starting point time. on the other hand,
As shown in FIG. 5b, in detecting the end of speech, t _e is determined as the end time when the level is below LE time Th1 after the time t _e when the consonant level Th1 has been lowered. The above-mentioned starting end time t _s and ending end time _te are notified from the detection information output section 322 to the input command section 7 . Here, FLAG in FIG. 5 is set to 1 at regular time intervals, and audio is detected from the power Pi at regular time intervals. Also, Di is in a power state,
Di=0, 1, and 2 are states in which there is no sound, a starting edge candidate has been detected, and a vowel has been detected and the starting edge has been determined, respectively. An example of the operation of the voice detection section described above is shown in FIG. 6. In FIG. 5, Pi, Si, and Ei indicate the power, start time, and end time of user line i.

入力指令部７は第７図に示すようにマイクロプ
ロセツサ７１，メモリ７２，検出情報入力部７
３，入力要求情報入力部７４，応答情報出力部７
５，認識情報出力部７６より構成される。マイク
ロプロセツサ７１は第８図ａ，ｂ，ｃに示すフロ
ーチヤートに従つて動作する。すなわち、始めに
ブロツク７０１で利用者回線ｉでの使用状態Ri，
Wiをリセツトし、（Ri＝０認識スタートしていな
い状態、Wi＝０は認識スタート以前に音声検出
されていない状態），続いてブロツク７０２で音
声検出部３より検出情報（利用者回線ｉ，始端時
刻t_s，終端時刻t_e）を入力する。検出情報が始端
情報である場合、ブロツク７０３にて認識部５へ
認識部入力回線ji（利用者回線が切替スイツチ４
により接続された認識部入力回線）の始端時刻t_s
を通過するか、またはその利用者回線ｉが認識部
５へ接続される時刻以前に音声が発声されたと判
断しその状態（Wi＝１）を記憶する。検出情報
が終端情報である場合、第８図ｂにて認識部５へ
認識部入力回線jiの終端時刻t_eを通知するか、ま
たはWi＝１であつた場合応答部６へ応答情報
（利用者回線ｉ，応答内容として音声入力促進音）
を出力し認識部５へ認識部入力回線jiの認識をス
タートするよう指示する。さらに、第８図ｃにて
制御部８より入力要求情報（利用者回線ｉ，認識
部入力回線ji）を入力し、応答部６へ応答情報を
出力し、認識部５へ認識入力回線jiの認識をスタ
ートするように指示し、さらに認識入力回線jiで
認識がスタートした状態（Ri＝１）を記憶する。
以上説明したブロツク７０２，７０３，第８図
ｂ，ｃの処理を繰返し行うことにより、ｍ個の利
用者回線に対する処理が実行される。 As shown in FIG. 7, the input command section 7 includes a microprocessor 71, a memory 72, and a detection information input section 7.
3. Input request information input section 74, response information output section 7
5, recognition information output section 76. The microprocessor 71 operates according to the flowchart shown in FIGS. 8a, b, and c. That is, first, in block 701, the usage status Ri,
Wi is reset (Ri=0 is a state in which recognition has not started, Wi=0 is a state in which no voice has been detected before starting recognition), and then in block 702 the voice detecting section 3 detects the detection information (user line i, Input the start time t _s and end time _te ). If the detected information is start end information, block 703 transfers the recognition unit input line ji (the user line is switched to the changeover switch 4) to the recognition unit 5.
start time t _s of the recognition unit input line connected by
or before the time when the user line i is connected to the recognition unit 5, and stores that state (Wi=1). If the detected information is termination information, the termination time _te of the recognition unit input line ji is notified to the recognition unit 5 in FIG. 8b, or if Wi=1, the response information (use user line i, voice input prompt sound as response content)
is output to instruct the recognition unit 5 to start recognizing the recognition unit input line ji. Furthermore, as shown in FIG. An instruction is given to start recognition, and the state in which recognition has started (Ri=1) is stored on the recognition input line ji.
By repeating the processes of blocks 702 and 703 and FIGS. 8b and 8c described above, the processes for m user lines are executed.

制御部８は1983年３月発行のアメリカ特許
USP4385359「マルチプルチヤンネルボイス
インプツト／アウトプツトシステム
（Multiple−Channel Voice Input／Output
System）」の明細書の第２図に示す構成を取るこ
とができ、同明細書の第４図に示したフローチヤ
ートに従つて動作する。 Control unit 8 is a US patent issued in March 1983.
USP4385359 “Multiple-Channel Voice Input/Output System”
The system can be configured as shown in FIG. 2 of the specification of ``System'', and operates according to the flowchart shown in FIG. 4 of the same specification.

以上説明した実施例は電話回線および応用プロ
グラムが動作する上位システムと共使用される。
第２図に示したように利用者が電話によりシステ
ムを呼び出すと電話回線制御部（NCU）１０ｉ
が呼び出しを検出し上位システム２００へ通知す
る。上位システム２００通知後利用者回線１ｉに
対して音声入力コマンド、音声出力コマンドを制
御部８へ出力して音声入出力のサービスを行う。 The embodiment described above is used in common with a telephone line and an upper system in which an application program runs.
As shown in Figure 2, when a user calls the system by telephone, the telephone line control unit (NCU) 10i
detects the call and notifies the higher level system 200. After notification from the host system 200, voice input commands and voice output commands are output to the control unit 8 for the user line 1i to provide voice input/output services.

以上本発明を実施例に基づき説明したが、１回
線に対する音声検出部，認識部，応答部をｍ個ま
たはｎ個並べる構成を取ることもできることは明
白である。また、実施例では入力指令部，制御部
は共にマイクロプロセツサを用いた構成であるた
め、CPUの処理能力に余裕があればCPU，メモ
リを共用し、CPU，メモリを１つに減らすこと
が可能である。 Although the present invention has been described above based on the embodiments, it is clear that a configuration in which m or n voice detection units, recognition units, and response units for one line are arranged can also be adopted. In addition, in this example, both the input command section and the control section are configured using microprocessors, so if the CPU has sufficient processing capacity, the CPU and memory can be shared and the number of CPUs and memory can be reduced to one. It is possible.

＜発明の効果＞本発明によれば、認識処理部の前段に切換えス
イツチを設け、認識処理部を時分割多重使用し、
その音声入力を利用するタイミングを音声入力促
進音の出力により利用者へ通知することにより、
認識処理部の同時処理回線数を利用者回線数ｍよ
り小さなｎとすることができ、装置の小形化が実
現する。<Effects of the Invention> According to the present invention, a changeover switch is provided before the recognition processing section, and the recognition processing section is used in time division multiplexing.
By notifying the user of the timing to use the voice input by outputting a voice input promotion sound,
The number of lines simultaneously processed by the recognition processing section can be set to n, which is smaller than the number of user lines m, and the device can be made smaller.

さらに、切換スイツチの前段に音声検出部を設
け、常時利用者回線の音声入力を監視し、利用者
が音声入力促進音の合図に待ちきれず音声を発声
した場合、再度音声入力を行うよう制御する入力
指令部を設けることにより、利用者が合図にに待
ちきれず音声を発声した場合も誤認識や入力未受
理とはならず、利用者に対して使用しやすい音声
入力が提供できる。 In addition, a voice detection unit is installed in front of the switch to constantly monitor the voice input of the user's line, and if the user cannot wait to hear the voice input prompting sound and utters voice, the system is controlled so that voice input is performed again. By providing an input command section to do this, even if the user impatiently utters a voice as a signal, there will be no misrecognition or input not being accepted, and it is possible to provide the user with voice input that is easy to use.

[Brief explanation of the drawing]

第１図は本発明の動作を示す図、第２図は本発
明の実施例を示すブロツク図、第３図は音声認識
装置のブロツク図、第４図は本発明の検出部の実
施例を示すブロツク図、第５図ａ，ｂは本発明の
検出部の動作を示すフローチヤート、第６図は音
声検出動作の例を示す図、第７図は本発明の入力
指令部の実施例を示すブロツク図、第８図ａ，
ｂ，ｃは本発明の入力指令部の動作を示すフロー
チヤートである。図において、１１，１２，…，１ｍは利用者回
線、２１，２２，…，２ｎは認識部入力回線、３
は音声検出部、４は切換スイツチ、５認識部、６
は応答部、７は入力指令部、８は制御部、３１は
前処理部、３２はマイクロプロセツサ部、１０
１，１０２，１０ｉ，１０ｍはNCU、２００は
上位システム、である。 Fig. 1 is a diagram showing the operation of the present invention, Fig. 2 is a block diagram showing an embodiment of the invention, Fig. 3 is a block diagram of a speech recognition device, and Fig. 4 is a diagram showing an embodiment of the detection section of the invention. 5a and 5b are flowcharts showing the operation of the detection section of the present invention, FIG. 6 is a diagram showing an example of voice detection operation, and FIG. 7 is a diagram showing an embodiment of the input command section of the present invention. The block diagram shown in Fig. 8a,
b and c are flowcharts showing the operation of the input command section of the present invention. In the figure, 11, 12, ..., 1m are user lines, 21, 22, ..., 2n are recognition section input lines, and 3
is a voice detection section, 4 is a changeover switch, 5 is a recognition section, 6
1 is a response section, 7 is an input command section, 8 is a control section, 31 is a preprocessing section, 32 is a microprocessor section, 10
1, 102, 10i, and 10m are NCUs, and 200 is an upper system.

Claims

[Claims]

1 m (positive integer) user lines, a response unit that outputs audio to the m user lines, and the m
a detection unit that detects voice input from the number of user lines; a changeover switch that connects the output of the m voice patterns of the detection unit to n (positive integer) recognition unit input lines smaller than m; a recognition unit that recognizes the voices of n input lines that receive the output of the changeover switch; and a recognition unit that connects a user line that is in a talking state to the idle line of the recognition unit only when the presence of an idle line to the recognition unit is detected. a control unit that generates a control signal to generate a voice input request and outputs a voice input request; and a voice output that prompts the user line to speak when voice is not detected by the detection unit before the time when the voice input request is received. An input that commands the recognition unit to start by outputting a voice prompting the user to speak to the user line at the time when the end of the voice is detected when voice is being detected. A multi-line audio input/output device characterized by having a command section.