JPH0612520B2

JPH0612520B2 - Voice response sentence output method

Info

Publication number: JPH0612520B2
Application number: JP59072392A
Authority: JP
Inventors: 祥二山本; 豊安井; 修一橋元; 実大山; 尚純土田
Original assignee: Fujitsu Ltd; Nippon Telegraph and Telephone Corp
Current assignee: Fujitsu Ltd; Nippon Telegraph and Telephone Corp
Priority date: 1984-04-11
Filing date: 1984-04-11
Publication date: 1994-02-16
Anticipated expiration: 2009-02-16
Also published as: JPS60215235A

Description

Detailed Description of the Invention

Technical Field of the Invention

The present invention relates to a voice response device, and more particularly to a voice response sentence output method.

Background of the Technology

To cope with the increasing sophistication of communication networks that accompanies growing information activity, various new services are planned in addition to the digitization of the networks. Among these new services, the present invention concerns a voice response sentence output method for a voice response device that uses speech processing technology, which has developed rapidly in recent years.

Prior Art and Problems

Conventionally, a voice response sentence requested by a customer has been edited and then simply sent back once. Consequently, a customer who misses the information he or she really wants must make the inquiry again. This problem does depend on the customer's own attentiveness, but from the standpoint of improving service some countermeasure is desirable.

One conceivable way to repeat part of a voice response sentence is to write the group of voice units that must be repeated into the edit result area several times, so that reading the edit result in order at output time produces the repetition. However, when a voice response sentence is assembled by voice unit editing, the number of voice units generally has an upper limit. Adopting this approach as is increases the number of voice units consumed, which tightens the limit on the effective length of the voice response sentence. The same approach can also repeat the entire voice response sentence, but the length limit then becomes even more severe. For example, because of the memory capacity limit, a sentence that is output twice can contain only half as many voice units as a sentence that is output once.

Object of the Invention

An object of the present invention is to propose a voice response sentence output method that solves the above problems.

Constitution of the Invention

To achieve the above object, the present invention provides a voice response device that edits a sentence using a plurality of types of voice information stored in advance, based on an information sequence representing the structure of the sentence designated by an externally input sentence designation code and on an information sequence designating the detailed form of the sentence, and that outputs the edited sentence as speech. The device comprises means for entering, into the information sequence of the edited sentence, control information for repeatedly outputting one or more parts of the sentence, and means for holding the voice output state during voice output. Using the control information contained in the information sequence of the edited sentence together with the held voice output state, the device repeats the voice output the designated number of times.

That is, by providing a function that outputs part of a voice response sentence repeatedly, any phrase of the sentence (in particular a phrase containing important information) can be repeated when a given voice response sentence is edited and output, so that information is less likely to be missed. Repeated output of the entire sentence without any user operation is also made possible.

Embodiments of the Invention

FIG. 1 is a block diagram showing a basic configuration example of a voice response system to which the method of the present invention is applied. In the figure, 11 is a host device, specifically a digital exchange. The host device 11 comprises a digital speech path switch (SW) 13 and a central control unit (CC) 14. The voice response device (ARU) 12 of the present invention is connected to this host device 11.

The voice response device 12 stores voice data analyzed by, for example, the LSP (line spectrum pair) method, and synthesizes a sentence by feeding the data to a speech synthesizer piece by piece at appropriate timing. The voice response sentence is output via a digital speech path interface unit (HWI) 15, and the sentence designation code and the sentence detail designation code are received from an exchange control system interface unit (CCI) 16. A voice synthesis unit 17 is connected to the HWI 15, and a voice editing unit 18 is connected to the CCI 16.

The voice editing unit 18 operates mainly under a control device 181, which controls the blocks denoted by reference numerals 182 to 186. 182 is a control memory that stores the microprogram for editing and synthesizing the speech to be returned. 183 is a voice memory that stores LSP-analyzed voice data in the form of single sounds and words. 184 is a sentence pattern memory that stores information representing the structure of sentences; specifically, it stores map data (see FIG. 2A) indicating where the voice data is located in the voice memory 183. 185 is a buffer memory that stores, for each line (see line #0 to line #31 shown in the voice synthesis unit 17 in the figure), the edit result of the voice response sentence to be output (see FIG. 3A). 186 is a work area memory used by the control device 181 as a work area; it is broadly divided into a data buffer area and a line control word (CLW) area. The data buffer is a receive buffer that temporarily stores requests from the host device 11. The line control word (CLW) is an area that holds, for each line, the progress of synthesis while the voice response sentence for that line is being synthesized (see FIG. 4).

In the embodiment shown in FIG. 1, 32 speech synthesizers 17-0 to 17-31 can operate in parallel.

FIG. 2A shows an example of the sentence pattern map data stored in the sentence pattern memory 184 of FIG. 1, and is one of the characteristic features of the present invention. The sentence pattern maps cover, for example, M kinds of sentence patterns; an arbitrary N-th sentence pattern map N is shown in detail as an example. As illustrated, a sentence pattern map consists of, for example, fixed parts #1, #2, #3 and #4, variable parts #1 and #2, and an end mark indicating the end of the sentence pattern.

FIG. 2B shows the format of a fixed part of the sentence pattern map shown in FIG. 2A. It consists of, for example, two words (2W) of 16 bits each. The second word holds the lower 16 bits of the address (voice unit address) of a voice unit in the voice data stored in the voice memory 183, and bits 0 to 3 of the first word hold the upper 4 bits PA of that voice unit address.
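The two-word fixed-part layout can be illustrated with a short sketch. It assumes that bit 0 is the least significant bit of a word, which the text does not state, and the function names are hypothetical.

```python
from typing import Tuple

def pack_fixed_part(address: int) -> Tuple[int, int]:
    """Split a 20-bit voice unit address into (word1, word2).

    Assumption: bit 0 is the least significant bit, so PA (the upper
    4 address bits) sits in the low nibble of the first word.
    """
    if not 0 <= address < (1 << 20):
        raise ValueError("voice unit address must fit in 20 bits")
    word1 = (address >> 16) & 0xF   # upper 4 bits PA -> bits 0-3 of word 1
    word2 = address & 0xFFFF        # lower 16 bits   -> word 2
    return word1, word2

def unpack_fixed_part(word1: int, word2: int) -> int:
    """Rebuild the 20-bit voice unit address from the two words."""
    return ((word1 & 0xF) << 16) | (word2 & 0xFFFF)
```

For example, the address 0xABCDE would be stored as the words 0x000A and 0xBCDE under this assumption.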

FIG. 2C shows the format of a variable part of the sentence pattern map shown in FIG. 2A. It consists of, for example, one word (1W), in which bit 7 is assigned to a mark (M) field and bits 0 to 2 to a flag (FLG) field. The remaining bits are unused. A mark (M) of logic "1" indicates a variable part. The flag (FLG) indicates the processing to be performed for the variable part: the pattern "1" "1" "0" indicates the start of a partial repetition, and the pattern "1" "1" "1" indicates its end. Other patterns indicate various kinds of character processing.
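As a rough illustration of this format, the sketch below classifies a single map word. It assumes bit 0 is the least significant bit and reads the FLG patterns "1" "1" "0" and "1" "1" "1" as the 3-bit values 0b110 and 0b111; neither bit order is stated in the text, and the names are hypothetical.

```python
MARK_M = 1 << 7            # mark M: bit 7 (assumed LSB-0 numbering)
FLG_MASK = 0b111           # flag FLG: bits 0-2
FLG_REPEAT_START = 0b110   # assumed reading of the pattern "1" "1" "0"
FLG_REPEAT_END = 0b111     # assumed reading of the pattern "1" "1" "1"

def classify_map_word(word: int) -> str:
    """Return what the first word of a sentence pattern map entry means."""
    if not word & MARK_M:
        return "fixed"                 # M = 0: fixed part (a voice unit address)
    flg = word & FLG_MASK
    if flg == FLG_REPEAT_START:
        return "repeat_start"
    if flg == FLG_REPEAT_END:
        return "repeat_end"
    return "character_processing"      # any other FLG pattern
```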

FIG. 3A shows an example of the edit result stored in the buffer memory 185 of FIG. 1 as a result of the editing operation, and FIG. 3B shows an example of the voice unit address data in FIG. 3A. The configuration of FIG. 3B in particular is based on the present invention. In FIG. 3A, the buffer memory 185 is divided, for 32 lines for example, into areas from the line #0 buffer to the line #31 buffer; an arbitrary line #N buffer is shown in detail as an example. As described above, the buffer memory 185 stores the edit result of the voice response sentence to be output for each line and has, for example, 128 words per line. E in the line #N buffer denotes an unused word. The line #N buffer consists of a plurality of voice unit addresses #0, #1, ... and an end mark. The basic structure of each voice unit address is almost the same as that shown in FIG. 2B, except that, as shown in FIG. 3B, a field for partial repetition control information (denoted m in the figure) is assigned to each voice unit address after editing.
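The per-line partitioning of the buffer memory can be sketched as a simple offset calculation. The sketch assumes that the 32 line buffers are laid out contiguously in line order and that offsets are counted in words from the start of the buffer memory; the text gives only the example sizes, so this layout and the function name are assumptions.

```python
LINES = 32             # example number of lines from the text
WORDS_PER_LINE = 128   # example buffer size per line from the text

def line_buffer_span(line: int) -> range:
    """Word offsets occupied by the buffer of one line (assumed contiguous layout)."""
    if not 0 <= line < LINES:
        raise ValueError("line number out of range")
    start = line * WORDS_PER_LINE
    return range(start, start + WORDS_PER_LINE)
```

With two words per voice unit address, each 128-word line buffer has room for at most 64 entries before the end mark is accounted for, which is why storing repeated units several times quickly exhausts the buffer.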

FIG. 3B shows the format of the voice unit address shown in FIG. 3A. The partial repetition control information m is assigned to, for example, bit 4. A mark m of "1" indicates a voice unit that requires partial repetition, and "0" indicates that no repetition is needed. In this case, at synthesis time the voice units whose mark m is "1" are synthesized a predetermined number of times (for example, twice), and the other parts are synthesized only once.
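A buffer entry in this format differs from a fixed part only by the mark m. The following sketch builds and inspects such an entry, again assuming that bit 0 is the least significant bit; the helper names are hypothetical.

```python
from typing import Tuple

PA_MASK = 0xF           # bits 0-3 of word 1: upper 4 address bits
REPEAT_MARK = 1 << 4    # mark m: bit 4 of word 1 (assumed LSB-0 numbering)

def make_buffer_entry(address: int, repeat: bool) -> Tuple[int, int]:
    """Build the two-word buffer entry for one voice unit."""
    word1 = (address >> 16) & PA_MASK
    if repeat:
        word1 |= REPEAT_MARK    # m = 1: this unit belongs to a repeated part
    return word1, address & 0xFFFF

def needs_repeat(word1: int) -> bool:
    """True when mark m is set in the first word of a buffer entry."""
    return bool(word1 & REPEAT_MARK)
```

Because the repetition is recorded in one bit of a word that is stored anyway, marking a unit for repetition costs no additional buffer words.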

The editing operation of the control unit, from the sentence pattern map 51 shown in FIG. 6 (described later) to the edit result shown in the buffer memory 52 of FIG. 6, is described below following the flow of FIG. 4.

FIG. 4 is a flowchart showing the operation at the time of voice editing including partial repetition.

(1) The first word of the sentence pattern map (51 in FIG. 6) for the sentence pattern designated by the host device 11 is loaded from the sentence pattern memory 184 (step a).

(2) Because mark M (the mark position of FIG. 2C) is "0" (step b), and the partial repetition mark m1, which is held in the control unit and indicates whether the part being edited is inside or outside a partial repetition range, is "0" (step c), the corresponding route of the flowchart is taken and the voice unit #1 address is loaded into the buffer memory 185 (step e) (see the voice unit #1 address in 52 of FIG. 6).

(3) The third word of the sentence pattern map (51 in FIG. 6) is loaded from the sentence pattern memory 184.

(4) Because mark M of the variable part is "1" and flag FLG indicates "start of partial repetition" (FLG = "1" "1" "0" in the example above) (step f) (see the partial repetition start in 51 of FIG. 6), the corresponding route of the flowchart is taken and the partial repetition mark m1 is set to "1" (step g).

(5) The fourth word (51 in FIG. 6) is loaded from the sentence pattern memory 184.

(6) Because mark M is "0" and the partial repetition mark m1 is "1", the corresponding route is taken: the repetition mark m is set to "1" in the voice unit address taken from the sentence pattern map (step d), and the address is stored in the buffer memory 185 together with the fifth word (the voice unit #2 address in 51 of FIG. 6) (see the voice unit #2 address in 52 of FIG. 6).

(7) The sixth word (51 in FIG. 6) is loaded from the sentence pattern memory 184.

(8) The same processing as in (5) and (6) is performed, and the result is stored in the buffer memory 185 (see the voice unit #3 address in 52 of FIG. 6).

(9) The eighth word (51 in FIG. 6) is loaded from the sentence pattern memory 184.

(10) Because mark M of the variable part is "1" and flag FLG indicates "end of partial repetition" (see the partial repetition end in 51 of FIG. 6; FLG = "1" "1" "1" in the example above), the corresponding route is taken and the partial repetition mark m1 is set to "0" (step h).

(11) The ninth word is loaded from the sentence pattern memory 184.

(12) The same processing as in (1) and (2) is performed.

(13) Because the eleventh word is the end mark (the bottom row of 51 in FIG. 6), editing ends (step j) and the subsequent synthesis stage illustrated in FIG. 6 begins. Step i performs the various kinds of character processing when flag FLG (FIG. 2C) is neither "1" "1" "0" nor "1" "1" "1".
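The editing steps above can be condensed into a minimal sketch. It is an illustration under stated assumptions rather than the flow of FIG. 4 itself: bit 0 is taken as the least significant bit, the end mark is given the hypothetical value 0xFFFF (the text does not specify its encoding), character processing is left out, and the voice unit addresses in the usage example are made up.

```python
from typing import List

MARK_M = 1 << 7            # mark M of a variable part (bit 7)
FLG_MASK = 0b111           # flag FLG (bits 0-2)
FLG_REPEAT_START = 0b110   # assumed reading of "1" "1" "0"
FLG_REPEAT_END = 0b111     # assumed reading of "1" "1" "1"
REPEAT_MARK = 1 << 4       # mark m in a buffer entry (bit 4)
END_MARK = 0xFFFF          # hypothetical end mark encoding

def edit_sentence(pattern_map: List[int]) -> List[int]:
    """Turn sentence pattern map words into buffer memory words."""
    buffer: List[int] = []
    m1 = 0                 # 1 while inside a partial repetition range
    i = 0
    while i < len(pattern_map):
        word = pattern_map[i]
        if word == END_MARK:               # end of the sentence pattern
            buffer.append(END_MARK)
            return buffer
        if word & MARK_M:                  # variable part: one word
            flg = word & FLG_MASK
            if flg == FLG_REPEAT_START:
                m1 = 1
            elif flg == FLG_REPEAT_END:
                m1 = 0
            # other FLG patterns (character processing) are omitted here
            i += 1
        else:                              # fixed part: two words
            first = word | REPEAT_MARK if m1 else word
            buffer.extend([first, pattern_map[i + 1]])
            i += 2
    raise ValueError("sentence pattern map has no end mark")
```

Applied to an eleven-word map shaped like the FIG. 6 example (unit #1, repetition start, units #2 and #3, repetition end, unit #4, end mark), the sketch marks only the two middle units:

```python
fig6_map = [0x0, 0x0100, 0x86, 0x0, 0x0200, 0x0, 0x0300, 0x87, 0x0, 0x0400, 0xFFFF]
edited = edit_sentence(fig6_map)
# edited == [0x0, 0x0100, 0x10, 0x0200, 0x10, 0x0300, 0x0, 0x0400, 0xFFFF]
```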

FIG. 5 shows the structure of the line control word (CLW) for one line. As described above, the CLW is an area forming part of the work area memory 186 of FIG. 1. It holds the progress of synthesis while the voice response sentence for each line is being synthesized, and is referenced and updated by, for example, the microprogram. In the figure, the second word in principle holds the lower 16 bits of the address of the voice data to be input to the synthesizer next. The first word holds the upper 4 bits PA' of the same address, the partial repetition control information, and a pointer indicating the voice unit to be synthesized and output once synthesis of the voice unit currently being output has finished.

In this embodiment, at speech synthesis time, the partial repetition mark m1 (FIG. 5) and the partial-repetition second-pass mark m2 (FIG. 5) in the CLW are each set to "1" or "0" according to whether the partial repetition mark m (FIG. 3B above) of the voice unit address in the buffer memory 185 is "1" or "0". By referring to marks m1 and m2, any phrase of the voice response sentence is partially repeated in the output.

FIG. 6 schematically illustrates the process of editing a voice response sentence that includes partial repetition. Voice units #1 to #4 consist of the following speech; the response text of voice units #2 and #3 is particularly important and is to be repeated.

Voice unit #1: "The number you have dialed"
Voice unit #2: "123-4567"
Voice unit #3: "is not in service"
Voice unit #4: "Please dial again"

Because voice units #2 and #3 are within the partial repetition range, the final voice response sentence is "The number you have dialed, 123-4567, is not in service, 123-4567, is not in service, please dial again." In FIG. 6, part 51 belongs to the sentence pattern memory 184 (FIG. 2A), part 52 belongs to the buffer memory 185 (FIG. 3A), and part 53 is the final voice response output sentence.

The operation of the control unit in synthesizing the voice response sentence from the edit result shown in the buffer memory 52 of FIG. 6 is described below with reference to FIG. 7. FIG. 7 is a flowchart showing the voice unit concatenation operation during speech synthesis including partial repetition. The content of each step is as shown in the figure. For the specific response sentence given above, synthesis proceeds as follows.

(1) By the processing of the corresponding route of FIG. 7, "The number you have dialed" is synthesized. At this time m1 = "0" and m2 = "0". Here m1 and m2 are the partial repetition mark and the partial-repetition second-pass mark in the CLW (FIG. 5), respectively.

(2) By the processing of the corresponding routes of FIG. 7, "123-4567 is not in service" is synthesized. At this time m1 = "1" and m2 = "0".

(3) By the processing of the corresponding routes of FIG. 7, "123-4567 is not in service" is synthesized again. At this time m1 = "1" and m2 = "0".

(4) By the processing of the corresponding routes of FIG. 7, "Please dial again" is synthesized. At this time m1 = "0" and m2 = "0".

Synthesis is now complete, and the intended voice response sentence is sent to the customer via one of the speech synthesizers 17-0 to 17-31 of FIG. 1.
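The synthesis pass can be sketched as a small state machine over the edited entries. This is an illustrative reconstruction, not the flow of FIG. 7, whose individual steps are not reproduced in the text. It assumes that a marked group is output exactly twice, that m2 acts as a second-pass flag (following its name), and that each entry is simplified to a (label, m) pair instead of a two-word address.

```python
from typing import List, Tuple

def synthesis_order(entries: List[Tuple[str, int]]) -> List[str]:
    """Return voice unit labels in the order they would be synthesized.

    entries: (label, m) pairs in buffer order, where m is the partial
    repetition mark of the entry. A run of marked units is output twice.
    """
    out: List[str] = []
    m1 = 0            # 1 while inside a marked group
    m2 = 0            # 1 during the second pass over that group
    group_start = 0   # index to return to for the second pass
    i = 0
    while i < len(entries):
        label, m = entries[i]
        if m and not m1:             # entering a marked group
            m1, group_start = 1, i
        if not m and m1:             # the group ended just before entry i
            if not m2:               # first pass finished: go back once
                m2, i = 1, group_start
                continue
            m1 = m2 = 0              # second pass finished: carry on
        out.append(label)
        i += 1
    if m1 and not m2:                # marked group at the very end of the sentence
        out.extend(label for label, _ in entries[group_start:])
    return out
```

For the FIG. 6 example, `synthesis_order([("unit1", 0), ("unit2", 1), ("unit3", 1), ("unit4", 0)])` yields unit1, unit2, unit3, unit2, unit3, unit4. Note that with a single-bit mark two directly adjacent marked groups cannot be told apart and would be repeated as one group.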

For simplicity, the sentence pattern shown in FIG. 6 contains only fixed parts, but a sentence pattern may also contain variable parts inside or outside the repetition range. For example, the telephone number in the sentence above would normally be designated anew each time output of the voice response sentence is requested, and should therefore be a variable part of the sentence pattern.

The partial repetition information m of FIG. 3B can also be defined differently from the embodiment described so far. For example, bits 5 and 6 set to "01", "10" or "11" can indicate the head of a group of voice units to be repeated 2, 3 or 4 times respectively, and bit 4 set to "1" can indicate that the immediately preceding voice unit is the last of the partial repetition. In this case the number of repetitions can be varied for each group of voice units to be synthesized. For example, four repetitions can be designated for a part whose content is important and two for a less important part. The number of repetitions may be fixed when the sentence pattern map is defined, or it may be made specifiable when the host device instructs synthesis of the voice response sentence.

Needless to say, various other modifications are possible in carrying out the present invention, for example in the structure of the sentence pattern map, the format of the edit result, the structure of the line control word, the editing operation, the synthesis operation, and the device configuration.
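The alternative marking with per-group repetition counts could be expanded as in the sketch below. To avoid guessing the bit order of bits 5 and 6, each entry is modeled as a tuple (label, count_code, prev_is_tail), where count_code 1, 2 or 3 stands for the patterns "01", "10" and "11" (2, 3 or 4 repetitions) and prev_is_tail stands for bit 4. This tuple model, the function name, and the handling of a group that runs to the end of the sentence are assumptions.

```python
from typing import List, Optional, Tuple

def expand_with_counts(entries: List[Tuple[str, int, bool]]) -> List[str]:
    """Expand entries carrying per-group repetition counts into output order."""
    out: List[str] = []
    group: Optional[List[str]] = None   # units of the group being collected
    repeats = 1

    def flush() -> None:
        nonlocal group
        if group is not None:
            out.extend(group * repeats)  # emit the whole group 'repeats' times
            group = None

    for label, count_code, prev_is_tail in entries:
        if prev_is_tail or count_code:   # the previous group, if any, ends here
            flush()
        if count_code:                   # this unit heads a new group
            group, repeats = [], count_code + 1
        (group if group is not None else out).append(label)
    flush()                              # assumed: a trailing group ends with the sentence
    return out
```

For instance, a group headed with count code 3 is emitted four times, while a later group headed with count code 1 is emitted twice, so important passages can be repeated more often than minor ones within the same sentence.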

The above describes repeating only part of a voice response sentence. Repetition of the entire voice response sentence is described next. For whole-sentence repetition, information indicating whether repetition is required, or the number of repetitions, is included in the information sequence of the edit result during the editing operation. During synthesis, the designated number of repetitions is then performed based on this information and the synthesis progress information held in the CLW. The number of whole-sentence repetitions can be designated by the host device, in the same way as the other codes, when it instructs output of the voice response sentence, or it can be preset in the sentence pattern. Various modifications are likewise possible in how whole-sentence repetition is implemented.

Effects of the Invention

As described above, according to the present invention only the important parts can be repeated in the response, which improves the service provided by voice response.

[Brief description of drawings]

FIG. 1 is a block diagram showing a basic configuration example of a voice response system to which the method of the present invention is applied. FIG. 2A shows an example of the sentence pattern map data stored in the sentence pattern memory 184 of FIG. 1. FIG. 2B shows an example of a fixed part of the sentence pattern map shown in FIG. 2A. FIG. 2C shows the format of a variable part of the sentence pattern map shown in FIG. 2A. FIG. 3A shows a voice response sentence stored in the buffer memory 185 of FIG. 1. FIG. 3B shows an example of the voice unit address shown in FIG. 3A. FIG. 4 is a flowchart showing the operation when editing a voice response sentence including partial repetition. FIG. 5 shows an example of the line control word (CLW) for one line. FIG. 6 schematically illustrates the process of editing and synthesizing a voice response sentence including partial repetition. FIG. 7 is a flowchart showing the operation when synthesizing a voice response sentence including partial repetition.

11: host device; 12: voice response device; 17: voice synthesis unit; 18: voice editing unit; 181: control device; 182: control memory; 183: voice memory; 184: sentence pattern memory; 185: buffer memory; 186: work area memory.

Continuation of front page

(72) Inventor: Shuichi Hashimoto, c/o Fujitsu Limited, 1015 Kamiodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa
(72) Inventor: Minoru Oyama, c/o Musashino Electrical Communication Laboratory, Nippon Telegraph and Telephone Public Corporation, 3-9-11 Midoricho, Musashino-shi, Tokyo
(72) Inventor: Naozumi Tsuchida, c/o Musashino Electrical Communication Laboratory, Nippon Telegraph and Telephone Public Corporation, 3-9-11 Midoricho, Musashino-shi, Tokyo
(56) References: Japanese Unexamined Patent Publication No. S58-62738 (JP, A); Japanese Unexamined Utility Model Publication No. S56-64100 (JP, U)

Claims

1. A voice response sentence output method for a voice response device (12) that generates and outputs a voice response sentence in response to an instruction from a host device (11), wherein: the voice response sentence is received from the host device (11) as a sentence pattern map (51); the sentence pattern map data stored in the sentence pattern map (51) comprises a plurality of sentence pattern map fixed parts, each containing a voice unit address indicating the address of the corresponding voice unit in a voice memory (183) in which the various voice units constituting the voice response sentence are stored in advance, and sentence pattern map variable parts inserted among these fixed parts at the position where partial repetition is to start and at the position where it is to end; when the voice response sentence is edited with reference to the sentence pattern map (51), the voice unit addresses are written sequentially into a buffer memory (52), a partial repeat mark is written for each voice unit address for which repetition is designated by the sentence pattern map variable parts, and an end mark is written for the final voice unit address; and when the voice response sentence is output, the voice memory (183) is accessed sequentially according to the contents of the buffer memory (52), and accessed repeatedly where the partial repeat mark is set, so that speech is synthesized up to the end mark.