JPH0538700U

JPH0538700U - Voice response device

Info

Publication number: JPH0538700U
Application number: JP023927U
Authority: JP
Inventors: 傑易
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-04-11
Filing date: 1991-04-11
Publication date: 1993-05-25

Abstract

(57) [Summary]
[Purpose] To provide a concrete voice response device that can offer a speaker flexible and considerate service according to the speaker's gender.
[Configuration] A pitch extraction unit 103 obtains pitch parameters from the input voice signal. A comparison calculation unit 105 averages these pitch parameters and determines whether the average is larger or smaller than a predetermined threshold. If the average is larger, the speaker of the input voice is judged to be female; if it is smaller, the speaker is judged to be male. Based on this result, a rule-based speech synthesis unit 106 synthesizes the response message supplied from a response message storage unit 104 in a male or female voice and outputs it. That is, the device responds in a female voice when the speaker is male and in a male voice when the speaker is female.

Description

[Detailed description of the device]

[0001]

[Industrial applications]

The present device relates to a voice response device, and more particularly to a voice response device that takes in a voice uttered by a speaker, synthesizes a response message according to the result of recognizing that voice, and outputs the message.

[0002]

[Prior Art]

Conventional voice response devices of this type include those disclosed in, first, Japanese Patent Laid-Open No. 58-24199; second, Japanese Patent Laid-Open No. 59-153238; third, Japanese Patent Laid-Open No. 59-216242; fourth, Japanese Patent Laid-Open No. 60-159933; and fifth, Japanese Patent Laid-Open No. 62-145322.

[0003] The first voice response device adjusts the response output level according to the level of the voice input from the speaker, and the second controls the response output level according to a voice input signal or a key input signal. The third and fifth voice response devices control the speed of the synthesized speech to match the speed of the voice input signal, and the fourth controls the response output level according to the speech rate of the input voice.

[0004] As described above, various voice response devices that control the response output voice according to the input voice have been proposed.

[0005]

[Problems to be solved by the device]

However, no voice response device has so far been proposed that can offer a speaker flexible and considerate service according to the speaker's gender.

[0006] The present device has been made in view of this problem, and its object is to provide a concrete voice response device that can offer a speaker flexible and considerate service according to the speaker's gender.

[0007]

[Means for Solving the Problems]

To achieve the above object, the present device provides the following means in a voice response device that takes in a voice uttered by a speaker, synthesizes a response message according to the result of recognizing that voice, and outputs the message.

[0008] That is, the device is characterized by comprising: gender determination means that analyzes the input voice to determine the gender of the speaker; first speech synthesis means that synthesizes and outputs the response message in a first type of voice when the gender determination means determines that the speaker is male; and second speech synthesis means that synthesizes and outputs the response message in a second type of voice when the gender determination means determines that the speaker is female.

[0009]

[Action]

According to the present device, when the gender determination means determines that the speaker is male, a response message in the first type of voice is output to the speaker, and when it determines that the speaker is female, a response message in the second type of voice is output to the speaker.

[0010] For example, if the first type of voice is set to a female voice and the second type to a male voice, a male speaker receives the response message in a female voice and a female speaker receives it in a male voice.
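The assignment of voice types in this example can be sketched as a small lookup. This is only an illustration under assumptions: the names `Gender`, `FIRST_VOICE_TYPE`, `SECOND_VOICE_TYPE`, and `voice_type_for` are hypothetical and do not appear in the original text.

```python
from enum import Enum


class Gender(Enum):
    MALE = "male"
    FEMALE = "female"


# Hypothetical configuration mirroring the example in the text:
# the first type of voice is a female voice, the second a male voice.
FIRST_VOICE_TYPE = "female_voice"   # used when the speaker is judged male
SECOND_VOICE_TYPE = "male_voice"    # used when the speaker is judged female


def voice_type_for(speaker: Gender) -> str:
    """Return the voice type in which the response message is synthesized."""
    return FIRST_VOICE_TYPE if speaker is Gender.MALE else SECOND_VOICE_TYPE


print(voice_type_for(Gender.MALE))
print(voice_type_for(Gender.FEMALE))
```

Keeping the two voice types as configurable values reflects the fact that the text treats the opposite-gender assignment as one possible setting rather than the only one.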

[0011] The gender determination means can determine the gender of the speaker based on, for example, the fundamental frequency (pitch) extracted from the input voice.
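The text does not say how the fundamental frequency is extracted. As one possible illustration, the sketch below estimates the pitch of a single voiced frame by autocorrelation, a common textbook method that is assumed here rather than taken from the original. The function name `estimate_pitch_hz` and the search range of 60 to 400 Hz are likewise assumptions.

```python
import math


def estimate_pitch_hz(frame, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate the fundamental frequency of one frame by autocorrelation.

    Assumed method, not taken from the original text. Returns None when
    no lag in the search range has a positive autocorrelation.
    """
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)
    best_lag, best_score = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation of the frame at this lag.
        score = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    if best_lag is None:
        return None
    return sample_rate / best_lag


# Synthetic test tone: 200 Hz sine sampled at 8 kHz for 50 ms.
rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(400)]
print(estimate_pitch_hz(tone, rate))
```

A real front end would also need voiced/unvoiced detection and protection against octave errors, which this sketch omits.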

[0012]

[Example]

A preferred embodiment of the voice response device according to the present device will now be described with reference to the drawings. FIG. 1 is a functional block diagram showing one embodiment of the voice response device. In FIG. 1, the voice response device has a voice input unit 101 that captures the speaker's voice signal using a microphone or the like and performs predetermined pre-processing on the captured signal; a pitch extraction unit 103 that extracts the fundamental frequency (pitch) from the input voice signal; and a comparison calculation unit 105 that determines the gender of the speaker by comparing the pitch information extracted by the pitch extraction unit 103 with a reference pitch for discriminating between male and female voices. The voice response device further has a word recognition unit 102 for recognizing the input voice; a response message storage unit 104 in which response messages are stored in advance; a rule-based speech synthesis unit 106 that synthesizes the response message read from the response message storage unit 104 using a male voice dictionary or a female voice dictionary; a loudspeaker 107 that outputs the speech synthesized by the rule-based speech synthesis unit 106; and a control unit 108 that controls the above units. The portion made up of the pitch extraction unit 103 and the comparison calculation unit 105 is referred to as the gender determination unit 109.

[0013] Next, the operation of this voice response device will be described with reference to the operation flowchart shown in FIG. 2.

[0014] When the speaker speaks to the voice response device and requests a response by a predetermined operation, processing starts (step 201). The voice input unit 101 A/D-converts the voice signal from the microphone or the like (step 202) and performs predetermined pre-processing (step 203). This pre-processing includes the low-pass filtering needed for pitch extraction. Next, the word recognition unit 102 performs recognition processing on the pre-processed voice data (step 204). For this recognition processing, the processing disclosed in the specification and drawings attached to Japanese Patent Application No. Hei 1-224956, for example, can be used.
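The text states only that pre-processing includes low-pass filtering. As a rough illustration, the sketch below uses a causal moving average, which is about the simplest FIR low-pass filter. The filter type, the window length, and the name `moving_average_lowpass` are assumptions, not details from the original.

```python
def moving_average_lowpass(samples, window=5):
    """Smooth samples with a causal moving average (a crude FIR low-pass).

    Assumed filter for illustration only. The first few outputs average
    over however many samples are available so far.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    out = []
    running = 0.0
    for i, x in enumerate(samples):
        running += x
        if i >= window:
            # Drop the sample that just left the window.
            running -= samples[i - window]
        out.append(running / min(i + 1, window))
    return out


# A signal alternating at the highest representable frequency is
# suppressed by a two-sample average after the first output.
print(moving_average_lowpass([1.0, -1.0] * 4, window=2))
```

In practice a filter with a sharper cutoff placed a little above the expected pitch range would be a more typical choice.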

[0015] When recognition of the voice data is complete, the control unit 108 selects the corresponding response message from those in the response message storage unit 104 based on the recognition result (step 205).

[0016] Meanwhile, the voice data obtained in the pre-processing of step 203 is supplied to the pitch extraction unit 103, which extracts pitch parameters from the input voice data (step 209). The extracted pitch parameters are then averaged, and the average is compared with a predetermined threshold (step 210). The average pitch of a female voice is generally about twice that of a male voice (for example, 100 to 125 Hz for adult men and 250 to 300 Hz for adult women), and the threshold is set with this in mind so that it appropriately separates male from female voices. If the average pitch is greater than the threshold, the input is judged to be a female voice; if the average pitch is less than or equal to the threshold, it is judged to be a male voice (step 211). When the input voice is judged to be female, the male voice dictionary for generating a male voice is selected (step 212). When the input voice is judged in step 211 to be male, the female voice dictionary for generating a female voice is selected (step 213).
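Steps 210 to 213 amount to an average, a comparison, and a lookup, which can be sketched as follows. The threshold of 180 Hz is an assumed value chosen to lie between the typical ranges quoted in the text. The original gives no number, and the function and dictionary names are hypothetical.

```python
from statistics import mean

# Assumed threshold between the quoted ranges (100-125 Hz and 250-300 Hz).
PITCH_THRESHOLD_HZ = 180.0


def judge_speaker(pitch_values_hz, threshold_hz=PITCH_THRESHOLD_HZ):
    """Steps 210-211: average the pitch values and compare with the threshold."""
    if not pitch_values_hz:
        raise ValueError("at least one pitch value is required")
    average = mean(pitch_values_hz)
    # Greater than the threshold: female. Less than or equal: male.
    return "female" if average > threshold_hz else "male"


def select_dictionary(speaker):
    """Steps 212-213: choose the dictionary of the opposite gender."""
    return "male_voice_dictionary" if speaker == "female" else "female_voice_dictionary"


speaker = judge_speaker([110.0, 120.0, 115.0])
print(speaker, select_dictionary(speaker))
```

Note that an average exactly equal to the threshold falls on the male side, matching the "less than or equal to" wording of step 211.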

[0017] Once either the male voice dictionary or the female voice dictionary has been selected in this way, the rule-based speech synthesis unit 106 uses the selected dictionary to generate synthesized speech for the response message selected in step 205 (step 206). The synthesized speech signal of the response message generated by the rule-based speech synthesis unit 106 is supplied to the loudspeaker 107, and the synthesized speech is output (step 207). When output of the response message is complete, the whole process ends (step 208).
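The overall flow from step 204 to step 206 can be sketched with stand-in components. Everything here is hypothetical scaffolding: the class `VoiceResponseDevice`, its field names, and the stub callables only mimic the roles of units 102, 104, 106, and 109, and real recognition and synthesis are replaced by placeholders.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class VoiceResponseDevice:
    recognize: Callable[[List[float]], str]      # role of word recognition unit 102
    judge_gender: Callable[[List[float]], str]   # role of gender determination unit 109
    synthesize: Callable[[str, str], str]        # role of rule-based synthesis unit 106
    responses: Dict[str, str]                    # role of response message storage unit 104

    def respond(self, voice_data: List[float]) -> str:
        word = self.recognize(voice_data)            # step 204
        message = self.responses[word]               # step 205
        speaker = self.judge_gender(voice_data)      # steps 209-211
        if speaker == "female":                      # steps 212-213
            dictionary = "male_voice_dictionary"
        else:
            dictionary = "female_voice_dictionary"
        return self.synthesize(message, dictionary)  # step 206


# Stub components: a fixed recognition result and a text stand-in for audio.
device = VoiceResponseDevice(
    recognize=lambda data: "hello",
    judge_gender=lambda data: "male",
    synthesize=lambda message, dictionary: f"[{dictionary}] {message}",
    responses={"hello": "Welcome."},
)
print(device.respond([0.0]))
```

Passing the components in as callables keeps the recognition and gender-determination branches independent until synthesis, which is where the text has them meet.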

[0018] The processing in the rule-based speech synthesis unit 106 can be implemented using, for example, the processing disclosed in the specification and drawings attached to Japanese Patent Application No. Hei 2-14199.

[0019] As described above, according to this embodiment, when the input voice from the speaker is judged to be a male voice, the response message is output in a female voice. When the input voice is judged to be a female voice, the response message is output in a male voice. In other words, the speaker receives the response message in a voice of the opposite gender. The speaker can thus be offered flexible and considerate service according to the speaker's gender.

[0020] Although the above embodiment provides the response message in a voice of the opposite gender, the present device is not limited to this mode. For example, the tone, timbre, and other properties of the response message can also be varied in various ways according to the gender of the speaker.

[0021]

[Effect of the device]

As described above, the present device determines the gender of the speaker and changes the type of voice used for the response message according to the result. This makes a response service tailored to the speaker's gender possible and yields a voice response device with higher added value.

[Brief description of drawings]

FIG. 1 is a functional block diagram of the voice response device according to this embodiment.

FIG. 2 is an operation flowchart of the voice response device of FIG. 1.

[Explanation of symbols]

101: voice input unit, 102: word recognition unit, 103: pitch extraction unit, 104: response message storage unit, 105: comparison calculation unit, 106: rule-based speech synthesis unit, 107: loudspeaker, 108: control unit.

Claims

[Scope of utility model registration request]

1. A voice response device that takes in a voice uttered by a speaker, synthesizes a response message according to the result of recognizing the voice, and outputs the message, the device comprising: gender determination means for analyzing the input voice to determine the gender of the speaker; first speech synthesis means for synthesizing and outputting the response message in a first type of voice when the gender determination means determines that the speaker is male; and second speech synthesis means for synthesizing and outputting the response message in a second type of voice when the gender determination means determines that the speaker is female.