JPH04177299A

JPH04177299A - Sound responding device

Info

Publication number: JPH04177299A
Application number: JP2305297A
Authority: JP
Inventors: Yoshiyuki Hara; 義幸原
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1990-11-09
Filing date: 1990-11-09
Publication date: 1992-06-24
Anticipated expiration: 2016-03-19
Also published as: JP3147898B2

Abstract

PURPOSE:To enable a service to be easily understood by a user by a method wherein a fixed part (a guidance) and a non-fixed part (a male) part are outputted through their changed sounds. CONSTITUTION:As a call sound from a telephone set 4 is detected by a network control unit(NCU) 5, it is informed to a host computer 1 that a signal is received. Then, the host computer 1 gives a Chinese character and kana-character sentence coded into a fixed sentence to a sound restricting and synthesizing part 6. A female sound element selecting code for use in selecting a female sound element is given to a sound restriction synthesizing part 6 in advance. In this case, the sound restricting and synthesizing part 6 analyzes the sentence in view of language and converts it into a sound echo code and a melody information so as to generate a synthesized sound. In turn, when a part of the non- fixed sentence is replaced, a male voice element selection code is given to the sound restricting and synthesizing part 6, the sound element is set to the male voice, thereafter the synthesized sound is generated and given to a user. With such an arrangement, the user can listen and understand the sound of the guidance and the male sound easily.

Description

【発明の詳細な説明】［発明の目的コ（産業上の利用分野）本発明は、たとえば電話機や専用端末装置から入力され
る情報に対する情報を音声により応答出力する音声応答
装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Purpose of the Invention (Field of Industrial Application) The present invention relates to a voice response device that responds and outputs information in response to information input from, for example, a telephone or a dedicated terminal device.

（従来の技術）最近、入力文字コード列を解析して音韻系列および韻律
情報を求め、それらの情報から規則を用いて音韻パラメ
ータおよび韻律パラメータ列を生成し、それらのパラメ
ータ列に基づいて合成音声を生成する音声合成装置が種
々開発されている。(Prior art) Recently, input character code strings are analyzed to obtain phonological sequences and prosodic information, phonological parameters and prosodic parameter strings are generated using rules from that information, and synthesized speech is generated based on these parameter strings. Various speech synthesis devices have been developed that generate .

この種の規則による音声合成装置は、従来からの録音編
集方式の音声合成装置と比較して、任意の単語や文章を
表す合成音声を簡易に生成できるという利点を持つ。こ
れ故、音声認識技術と相俟って自然性の高いマンマシン
・インタフェイスを実現する上での重要な技術として注
目されている。This type of rule-based speech synthesis device has the advantage of being able to easily generate synthesized speech representing arbitrary words or sentences, compared to conventional speech synthesis devices using a recording/editing method. For this reason, it is attracting attention as an important technology for realizing highly natural man-machine interfaces in combination with voice recognition technology.

一方、現在、パーソナルコンピュータ（以後、単にパソ
コンと略称する）あるいはワードプロセッサ（以後、単
にワープロと略称する）を電話回線を介してネットワー
ク化し、メール通信や各種の情報サービスを行なうパソ
コンネットワークなるサービスが行なわれている。On the other hand, currently, there is a service called a PC network that connects personal computers (hereinafter simply referred to as personal computers) or word processors (hereinafter simply referred to as word processors) to networks via telephone lines to provide email communications and various information services. It is.

これら２つの技術を組合わせてパソコンネットワークの
利用者に送られてくるメールの内容を電話機を介して音
声で伝達するようなシステムが構築されつつある。この
種の装置は、定型部分（ガイダンス）と、非定型部分（
メール）の２種類の音声出力部分かあるが、そのうち、
ガイダンス部分は比較的音質の良い録音編集方式が用い
られている。しかし、ある程度のメモリか必要なことや
、装置の制御か複雑になるなとの不具合があった。By combining these two technologies, a system is being constructed in which the contents of mail sent to users of a personal computer network are transmitted by voice via a telephone. This type of device has a regular part (guidance) and an atypical part (guidance).
There are two types of audio output parts (email), but among them,
The guidance section uses a recording and editing method with relatively good sound quality. However, there were drawbacks such as the need for a certain amount of memory and the complexity of controlling the device.

そこで、規則合成方式の音質が向上したこととも相俟っ
てメール部分たけでなく、ガイダンス部分にも規則合成
方式が導入されつつある。Therefore, along with the improvement in the sound quality of the rule synthesis method, the rule synthesis method is being introduced not only to the email section but also to the guidance section.

また、ガイダンス部分に規則合成方式を用いることによ
って、サービスの形態を簡単に変更できるようになるが
、しかしメール部分との聞き分けがしすらいといった不
具合が生じていた。Furthermore, by using a rule synthesis method for the guidance section, it becomes possible to easily change the form of the service, but there is a problem in that it is difficult to distinguish between the guidance section and the email section.

さらに、従来は定型文に対しても漢字かな混じり文から
音声に変換していた。しかしなから、漢字かな混じり文
から韻律情報を得るためにアクセント辞書との照合、言
語解析の処理を行なう必要があり、文が入力されてから
音声を生成するまでに時間を要することや、多少不自然
な音声となる場合もあった。したがって、ガイダンス部
分の音声の応答が遅くなるといった不具合が生じていた
。Furthermore, in the past, standard sentences were converted from kanji/kana mixed sentences to audio. However, in order to obtain prosodic information from a sentence containing kanji and kana, it is necessary to check it with an accent dictionary and perform language analysis processing, and it may take some time to generate speech after the sentence is input. In some cases, the sound was unnatural. Therefore, there has been a problem that the response of the voice in the guidance part is delayed.

（発明が解決しようとする課題）上記したように、従来にあっては、利用者が電話機を介
してメールなどの内容を聞く場合、今しゃべっている内
容がガイダンス部分なのか、メール部分なのかを判断す
ることは困難であった。(Problem to be solved by the invention) As mentioned above, in the past, when a user listens to the contents of an e-mail etc. via a telephone, it is difficult to determine whether the content being spoken is the guidance part or the e-mail part. It was difficult to judge.

また、上記したように、従来にあっては、ガイダンス部
分の音声の応答か遅くなることや、多少不自然な音声と
なることがしばしば起こることなどの問題かあった。Furthermore, as described above, in the conventional system, there have been problems such as the voice response in the guidance portion being delayed and the voice often sounding somewhat unnatural.

そこで、本発明は、ガイダンス部分の音声とメール部分
の音声を利用者が簡単に聞き分けることかできる音声応
答装置を提供することを目的とする。SUMMARY OF THE INVENTION Therefore, an object of the present invention is to provide a voice response device that allows a user to easily distinguish between the voice of the guidance part and the voice of the mail part.

また、本発明は、ガイダンス部分の音声に対しては応答
時間の短縮および自然性の向上が計れる音声応答装置を
提供することを目的とする。Another object of the present invention is to provide a voice response device that can shorten the response time and improve the naturalness of the voice of the guidance portion.

Ｕ発明の構成］（課題を解決するための手段）第１の発明に係る音声応答装置は、電話機から入力され
る情報に対する情報を音声により応答出力するものであ
って、その応答部分に文字コード列から合成音声を生成
する規則合成方式を用いる音声応答装置において、音声
出力すべき文章が定型の場合と非定型の場合の合成音声
の声質を変える手段を具備している。U Structure of the Invention] (Means for Solving the Problems) A voice response device according to the first invention outputs a voice response to information input from a telephone, and includes a character code in the response part. A voice response device using a rule synthesis method for generating synthesized speech from sequences is provided with means for changing the voice quality of the synthesized speech when the text to be outputted is a fixed form or an unstructured text.

第２の発明に係る音声応答装置は、応答部分に文字コー
ド列から合成音声を生成する規則合成方式を用いる音声
応答装置において、音声出力すべき文章が定型の場合、
あらかじめ登録されている音韻を表すコード列と韻律を
表す情報とから合成音声を生成する手段と、音声出力す
べき文章が非定型の場合、漢字かな混じり文から合成音
声を生成する手段とを具備している。A voice response device according to a second aspect of the present invention is a voice response device that uses a rule synthesis method for generating synthesized voice from a character code string in a response part, and when a sentence to be outputted as a voice is a fixed form,
Equipped with means for generating synthesized speech from pre-registered code strings representing phonemes and information representing prosody, and means for generating synthetic speech from sentences containing Kanji and kana when the sentences to be outputted are atypical. are doing.

（作　用）第１の発明に係る音声応答装置によれば、定型部分（ガ
イダンス）と非定型部分（メール）の音声を声質を変え
て出力することにより、ガイダンス部分の音声とメール
部分の音声を利用者か簡単に聞き分けることができ、利
用者に分りやすいサービスが行なえる。(Function) According to the voice response device according to the first invention, by outputting the voice of the standard part (guidance) and the non-standard part (email) with different voice quality, the voice of the guidance part and the voice of the mail part are output. It is possible to easily tell whether a person is a user, and to provide services that are easy for the user to understand.

第２の発明に係る音声応答装置によれば、非定型文（メ
ール）に対しては漢字かな混じり文から音声を生成し、
定型文（ガイダンス）に対してはあらかじめ登録されて
いる音韻コードと韻律情報とから音声を生成することに
より、ガイダンス部分（定型文）の音声に対しては応答
時間の短縮および自然性の向上が計れる。According to the voice response device according to the second invention, for a non-standard sentence (email), a voice is generated from a sentence containing kanji and kana,
By generating speech for fixed phrases (guidance) from pre-registered phonetic codes and prosodic information, the response time and naturalness of the guidance portion (fixed phrase) can be shortened and naturalness improved. It can be measured.

（実施例）以下、本発明の実施例について図面を参照して説明する
。(Example) Hereinafter, an example of the present invention will be described with reference to the drawings.

まず、第１の実施例について説明する。第１図は、本発
明に係る音声応答装置を概略的に示す構成因である。す
なわち、ホスト計算機１は、電話回線を介してパソコン
２と接続されており、パソコン２から送られてくる利用
者番号、暗証番号などを認識し、ネットワークと接続す
るようになっている。ネットワークと接続されたパソコ
ン２は、ホスト計算機１に対して情報を送受信すること
が可能となる。また、ホスト計算機１を介して他のパソ
コン３とも情報交換することが可能となっている。音声
規則合成部６は、ホスト計算機１から送られてくる文字
コードを言語解析し、韻律情報を含む音韻コードに変換
する。その後、それらの情報に基づいて合成音声を生成
する。この合成音声は、アナログ信号としてＮＣＵ　（
ネットワーク・コントロール・ユニット）部５へ与えら
れる。First, a first example will be described. FIG. 1 schematically shows the components of a voice response device according to the present invention. That is, the host computer 1 is connected to a personal computer 2 via a telephone line, recognizes the user number, password, etc. sent from the personal computer 2, and connects to the network. The personal computer 2 connected to the network can send and receive information to and from the host computer 1. Furthermore, information can be exchanged with other personal computers 3 via the host computer 1. The speech rule synthesis unit 6 linguistically analyzes the character code sent from the host computer 1 and converts it into a phonetic code containing prosody information. Then, synthesized speech is generated based on that information. This synthesized voice is sent to the NCU (
network control unit) section 5.

一方、ＮＣＵ部５は、電話回線と接続されており、電話
の着信、切断、ＰＢ検出、ＢＴ検出をホスト計算機１に
通知したり、音声規則合成部６から与えられるアナログ
信号を電話回線に送出するようになっている。On the other hand, the NCU unit 5 is connected to the telephone line, and notifies the host computer 1 of incoming calls, disconnections, PB detection, and BT detection, and sends analog signals given from the voice rule synthesis unit 6 to the telephone line. It is supposed to be done.

このような構成において、第１の実施例の動作を、第２
図に示す要部のフローチャートを参照しつつ、利用者か
ら電話かかかってきた場合を例にとって説明する。ます
、電話機４からの呼ａ音をＮＣＵ部５が検出すると、ホ
スト計算機１に着信したことを通知する。すると、ホス
ト計算機１は、あらかしめ登録されている定型文［こち
らは、ネットワークサービスセンターです。」　「利用
者番号をどうぞ。」なるコード化された漢字かな混じり
文を音声規則合成部６に与えるが、その前に合成音声す
べき文章が定型文のため（第２図のステップＳ１）、女
声の音声素片を選択するための女声素片選択コードを音
声規則合成部１６に与える。In such a configuration, the operation of the first embodiment is similar to that of the second embodiment.
Referring to the main flowchart shown in the figure, explanation will be given by taking as an example a case in which a call is received from a user. First, when the NCU unit 5 detects a ringing sound from the telephone 4, it notifies the host computer 1 that the call has arrived. Then, host computer 1 prints the registered prefix message [This is the network service center]. ” ``Please give me your user number.'' A coded sentence containing kanji and kana is given to the speech rule synthesis unit 6, but since the sentence to be synthesized into speech is a fixed phrase (step S1 in Fig. 2), A female voice segment selection code for selecting a female voice segment is provided to the voice rule synthesis unit 16.

音声規則合成部６は、与えられた女声素片選択コードと
漢字かな混じり文を受取り、音声素片を女声に設定した
後（第２図のステップＳ２）、前記文を言語解析して音
韻コードと韻律情報に変換し、合成音声を生成する（第
２図のステップＳ３）。The voice rule synthesis unit 6 receives the given female voice segment selection code and a sentence containing Kanji and kana, sets the voice segment to a female voice (step S2 in FIG. 2), and then linguistically analyzes the sentence to generate a phonological code. is converted into prosody information, and a synthesized speech is generated (step S3 in FIG. 2).

なお、−度、音声素片ファイルが選択されると、新たに
素片選択コードが入力されない限り前の状態を維持する
。Note that once a speech segment file is selected, the previous state is maintained unless a new segment selection code is input.

こうして生成された合成音声は、ＮＣＵ部５へ与えられ
、電話回線を介して電話機４に出力される。ここで、利
用者が電話機４から利用者番号を入力すると、ＮＣＵ部
５でそのブツシュトーン信号をＰＢ検出し、コード化し
てホスト計算機１に転送する。ホスト計算機１は、暗証
番号の入力を促す定型文「暗証番号をどうぞ。」を音声
規則合成部６に転送し、そのメツセージを音声出力する
。The synthesized speech generated in this way is given to the NCU section 5 and output to the telephone set 4 via the telephone line. Here, when the user inputs a user number from the telephone 4, the NCU section 5 detects the PB of the tone signal, encodes it, and transfers it to the host computer 1. The host computer 1 transfers a fixed phrase "Please enter your password" to prompt the input of the password to the voice rule synthesis section 6, and outputs the message as a voice.

ここで、利用者が電話機４から暗証番号を人力すると、
ホスト計算機１は、先に入力された利用者番号に対して
その暗証番号が正当であるか否かをチエツクする。この
チエツクの結果、正当であるとき、その利用者に例えば
第３図に示すような内容のメールが２件、ホスト計算機
１に蓄えられている場合には、非定型文「メールがＯＯ
０件届ています。」の００部分を「２」に置換して、「
メールが２件届いています。」なる漢字かな混じり文を
音声規則合成部６に与えるが、このとき音声出力すべき
内容が非定型文のため（第２図のステップＳ１）、男声
素片選択コードを音声規則合成部６に与える。音声規則
合成部６ては、与えられた男声素片選択コードと漢字か
な混じり文を受取　　゛とり、音声素片を男声に設定し
た後（第２図のステップＳ２）、前記文を言語解析して
音韻コードと韻律情報に変換し、合成音声を生成して（
第２図のステップＳ３）、利用者に伝える。Here, when the user enters the PIN number manually from the telephone 4,
The host computer 1 checks whether the previously input user number is valid or not. If the result of this check is that the email is legitimate, and if the user has two emails with the content shown in Figure 3 stored in the host computer 1, the non-standard message ``The email is OO
0 items have been received. ”, replace the 00 part with “2” and make “
I have received 2 emails. ” is given to the speech rule synthesis unit 6. However, since the content to be outputted is an atypical sentence (step S1 in FIG. 2), a male voice unit selection code is given to the speech rule synthesis unit 6. give. The speech rule synthesis unit 6 receives the given male voice segment selection code and a sentence containing kanji and kana, sets the voice segment to a male voice (step S2 in Figure 2), and then linguistically analyzes the sentence. convert it into phonological code and prosodic information, generate synthesized speech (
Step S3 in FIG. 2), informs the user.

次に、利用者からのブツシュトーン信号が「１」の場合
は、第３図に示す１件目のメール「７月２１日午後３時
より、４Ａ会議室で特許に関する会議を行ないます。是
非御参加下さい。」を、「２」の場合は、２件目のメー
ル「７月３１日子定の旅行会は中止になりました。」を
音声規則合成部６に与え、音声出力する。また、利用者
番号に対して暗証番号が正当でなかったときは、女声素
片選択コードと「暗証番号が違います。」　「もう−度
利用者番号から入力して下さい。」を音声規則合成部６
に与えて音声出力し、再入力を促す。Next, if the buzz tone signal from the user is "1", the first email shown in Figure 3 will be sent to you saying, "There will be a patent conference in conference room 4A from 3:00 pm on July 21st. Please come and see us." Please join us.'', and in the case of ``2'', the second email ``July 31st Kosada's travel party has been cancelled.'' is given to the voice rule synthesis unit 6 and output as voice. In addition, if the PIN is not valid for the user number, the female voice segment selection code and ``The PIN is incorrect.'' ``Please enter the user number again.'' are synthesized using voice rules. Part 6
It outputs audio and prompts you to re-enter.

このように第１の実施例によれば、非定型文に対しては
男声素片で音声を生成し、定型文に対しては女声素片で
音声を生成することにより、非定型文（メール）と定型
文（ガイダンス）の区別が利用者に明確に分り、利用者
に分りやすいサービスか行なえるなどの実用上多大なる
効果が奏せられる。In this way, according to the first embodiment, by generating speech with male voice segments for non-standard sentences and using female voice segments for fixed sentences, ) and fixed phrases (guidance), and this has great practical effects, such as being able to provide services that are easy for users to understand.

なお、本発明は上記第１の実施例に限定されるものでは
ない。たとえば、第１の実施例におけるサービスの流れ
、第３図に示したメツセージの内容は上述した例に限定
されるものではない。また、第１の実施例では、非定型
文と定型文を区別するために男声の素片と女声の素片を
用いたか、女声の素片だけ用いて声の高さや発声速度を
変更することにより区別してもよい。Note that the present invention is not limited to the first embodiment described above. For example, the flow of the service in the first embodiment and the content of the message shown in FIG. 3 are not limited to the example described above. In addition, in the first embodiment, in order to distinguish between non-standard sentences and fixed sentences, male voice elements and female voice elements were used, or only female voice elements were used to change the pitch and speaking speed. It may be differentiated by

次に、第２の実施例について説明する。第２の実施例の
構成は第１図と同様であり、以下、第２の実施例の動作
を、利用者から電話かかかってきた場合を例にとって説
明する。まず、電話機４からの呼出音をＮＣ０部５が検
出すると、ホスト計算機１に着信したことを通知する。Next, a second example will be described. The configuration of the second embodiment is the same as that shown in FIG. 1, and the operation of the second embodiment will be explained below, taking as an example a case where a call is received from a user. First, when the NC0 unit 5 detects a ringing tone from the telephone 4, it notifies the host computer 1 that the call has arrived.

すると、ホスト計算機１は、「こちらは、ネットワーク
サービスセンターです。」　「利用者番号をどうぞ。」
なる音声を発声させるために、第４図に示すように、あ
らかじめホスト計算機１に登録されている音韻コードと
韻律情報「コチラハ９．／ネットワーク／サービス０セ
６ンターデス０」　「リョーシャバ′ンコ０−ヲ／ド６
−ゾ」　（メツセージ１）なるコードを音声規則合成部
６に与える。音声規則合成部６は、それらのコードにし
たかって合成音声を生成する。Then, host computer 1 says, ``This is the network service center.'' ``Please give me your user number.''
In order to utter the voice, as shown in FIG. wo/do 6
-zo” (message 1) is given to the voice rule synthesis unit 6. The speech rule synthesis unit 6 generates synthesized speech based on these codes.

こうして生成された合成音声は、ＮＣ０部５へ与えられ
、電話回線を介して電話機４に出力される。ここで、利
用者か電話８４から利用者番号を入力すると、ＮＣ０部
５でそのブツシュトーン信号をＰＢ検出し、コード化し
てホスト計算機１に転送する。ホスト計算機１は、暗証
番号の入力を促すコード「アンショーバ′ンコ°−ヲ／
ド°−ゾ」　（メツセージ２）を音声規則合成部６に転
送し、そのメツセージを音声出力する。ここで、利用者
が電話機４から暗証番号を久方すると、ホスト計算機１
は、先に入力された利用者番号に対してその暗証番号が
正当であるか否かをチエツクする。このチエツクの結果
、正当であるとき、その利用者に例えば第３図に示すよ
うな内容のメールが２件、ホスト計算機１に蓄えられて
いる場合には、００部分を「２」に置換して「メールが
２件届いています。」　（メツセージ３）なる漢字かな
混じり文を音声規則合成部６に与える。音声規則合成部
６では、与えられた漢字かな混じり文を受取とり、その
文を言語解析して音韻コードと韻律情報に変換し、合成
音声を生成して、利用者に伝える。The synthesized speech generated in this way is given to the NC0 unit 5 and output to the telephone 4 via the telephone line. Here, when the user inputs a user number from the telephone 84, the NC0 section 5 detects the PB of the tone signal, encodes it, and transfers it to the host computer 1. The host computer 1 enters the code "Anshoba'nko°-wo/
"Do°-zo" (message 2) is transferred to the voice rule synthesis section 6, and the message is output as voice. Here, when the user enters the PIN number from the telephone 4, the host computer 1
checks whether the password is valid for the previously input user number. If the result of this check is that the email is valid, and if the user has two emails with the content shown in Figure 3 stored in the host computer 1, the 00 part will be replaced with ``2''. ``Two emails have arrived.'' (Message 3) is given to the speech rule synthesis unit 6, which is a sentence containing kanji and kana. The speech rule synthesis unit 6 receives a given sentence containing kanji and kana, linguistically analyzes the sentence, converts it into a phonetic code and prosody information, generates synthesized speech, and conveys it to the user.

次に、利用者からのブツシュトーン信号が「１」の場合
は、第３図に示す１件目のメール「７月２１日午後３時
より、４Ａ会議室で特許に関する会議を行ないます。是
非御参加下さい。」を、「２」の場合は、２件目のメー
ル「７月３１日子定の旅行会は中止になりました。」を
音声規則合成部６に与え、音声出力する。また、利用者
番号に対して暗証番号が正当でなかったときは、「アン
ショー式６ンゴーカ０／チガイマヘス０」　「モーイチ
ト、／リョーシャノじノコ０−カラ、／ニューリョクシ
０テクダサ′イ」　（メツセージ４）を音声規則合成部
６に与えて音声出力し、再入力を促す。Next, if the buzz tone signal from the user is "1", the first email shown in Figure 3 will be sent to you saying, "There will be a patent conference in conference room 4A from 3:00 pm on July 21st. Please come and see us." Please join us.'', and in the case of ``2'', the second email ``July 31st Kosada's travel party has been cancelled.'' is given to the voice rule synthesis unit 6 and output as voice. Also, if the PIN is not valid for the user number, "Ansho style 6 ngoka 0 / Chigaimahesu 0""Moichito, / Ryoshanojinoko 0-kara, / New ryokushi 0 tekdasai" (message 4) is given to the voice rule synthesis unit 6 to output the voice and prompt for re-input.

このように第２の実施例によれば、非定型文に対しては
漢字かな混じり文から音声を生成し、定型文に対しては
あらかしめ登録されている音韻コードと韻律情報とから
音声を生成することにより、定型文（ガイダンス）に対
しては応答時間の短縮および自然性の向上が計れる。In this way, according to the second embodiment, for non-standard sentences, speech is generated from sentences containing kanji and kana, and for fixed sentences, speech is generated from phonological codes and prosody information that have been preliminarily registered. By generating this, it is possible to shorten response time and improve naturalness for fixed phrases (guidance).

なお、本発明は上記第２の実施例に限定されるものでは
ない。たとえば、第２の実施例におけるサービスの流れ
、第４図に示したメツセージの内容は上述した例に限定
されるものではない。その他、本発明はその要旨を逸脱
しない範囲で種々変形して実施することかできる。Note that the present invention is not limited to the second embodiment described above. For example, the flow of the service in the second embodiment and the content of the message shown in FIG. 4 are not limited to the example described above. In addition, the present invention can be implemented with various modifications without departing from the gist thereof.

［発明の効果コ以上詳述したように本発明の音声応答装置によれば、定
型部分（ガイダンス）と非定型部分（メール）の音声を
声質を変えて出力することにより、ガイダンス部分とメ
ール部分の音声を利用者が簡単に聞き分ける二とができ
、利用者に分りやすいサービスか行なえるなど、実用上
多大なる効果か奏せられる。[Effects of the Invention] As detailed above, according to the voice response device of the present invention, by outputting the voices of the standard part (guidance) and the non-standard part (email) with different voice quality, This has great practical effects, such as allowing the user to easily distinguish between the two voices and providing services that are easy for the user to understand.

また、本発明の音声応答装置によれば、応答部分に規則
合成方式を用いる場合、非定型文（メール）に対しては
、漢字かな混じり文から音声を生成し、定型文（ガイダ
ンス）に対しては、漢字かな混じり文から音声を生成す
るのではなく、音韻コードと韻律情報とから音声を生成
することにより、定型文（ガイダンス）の音声に対して
は、発声開始の時間短縮と自然な音声を提供できるなど
、実用上多大なる効果が奏される。Further, according to the voice response device of the present invention, when using the rule synthesis method for the response part, the voice is generated from a sentence mixed with Kanji and kana for a non-standard sentence (email), and the voice is generated from a sentence mixed with kanji and kana for a non-standard sentence (email), and for a standard sentence (guidance). By generating speech from phonological codes and prosodic information, rather than from sentences containing kanji and kana, it is possible to shorten the time to start speaking and create a natural sound for the speech of fixed sentences (guidance). It has great practical effects, such as being able to provide audio.

[Brief explanation of drawings]

第１図ないし第３図は本発明の第１の実施例を示すもの
で、第１図は概略構成図、第２図は定型文と非定型文の
ときの処理を示す要部のフローチャート、第３図はメー
ルの内容を示す図、第４図は本発明の第２の実施例にお
ける応答メツセージの内容を示す図である。１・・・ホスト計算機（ホスト装置）、２．３・・・バ
ラコン（端末装置）、４・・電話機、５・ＮＣＵ部、６
・・音声規則合成部。出願人代理人　弁理士　鈴江武彦第１図第２図但し、ＶＪ＋１、アクセントの位置をＨす記号「／」は
、アクセント旬の区切りを表す記号「」は、後続する７
りｔント句との間の無音区間を表す記号で、１つにつき
１００ｍ５を表す「１」　「コ°」は、「カー」「】−」の鼻扇責「ノ０
」「ス’Ｊは　「ノＪ　「スＪの無声化青第４図1 to 3 show a first embodiment of the present invention, in which FIG. 1 is a schematic configuration diagram, and FIG. 2 is a flowchart of main parts showing processing for fixed and non-fixed sentences. FIG. 3 is a diagram showing the contents of the mail, and FIG. 4 is a diagram showing the contents of the response message in the second embodiment of the present invention. 1...Host computer (host device), 2.3...Balacon (terminal device), 4...telephone, 5.NCU section, 6
...Speech rule synthesis section. Applicant's representative Patent attorney Takehiko Suzue Figure 1 Figure 2 However, VJ+1, the symbol "/" indicating the accent position, and the symbol "" indicating the accent break, the following 7
It is a symbol that represents the silent interval between the rest phrases, and each one represents 100m5.
""S'J is "ノJ""Su's voiceless blue figure 4

Claims

[Claims]

(1) In a voice response device that outputs a voice response to information input from a telephone, and uses a rule synthesis method that generates synthesized voice from a character code string in the response part, the text to be voice output 1. A voice response device comprising means for changing the voice quality of synthesized speech when the voice is a typical one and when the voice is an atypical voice.

(2) In a voice response device that uses a rule synthesis method that generates synthesized speech from a character code string in the response part, if the sentence to be output is a fixed form, a pre-registered code string representing phoneme and information representing prosody are used. What is claimed is: 1. A voice response device comprising: a means for generating a synthesized voice from a sentence; and a means for generating a synthesized voice from a sentence containing kanji and kana when the sentence to be outputted is an atypical sentence.