JPH01119822A

JPH01119822A - Sentence reader

Info

Publication number: JPH01119822A
Application number: JP62278542A
Authority: JP
Inventors: Mitsuko Kaseda; 加世田　光子
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1987-11-04
Filing date: 1987-11-04
Publication date: 1989-05-11

Abstract

PURPOSE:To synthesize and express a natural voice by designating a range and adopting different reading with respect to a KANJI (Chinese character) of its part, in a sentence reader for outputting a KANJI and KANA (Japanese syllabary) mixed sentence in a voice. CONSTITUTION:A sentence by a KANJI and KANA mixed sentence inputted from an input/output display device 1 is sent to a sentence accumulating part 4 and stored in the sentence accumulating part 4 by an input/output control part 3 and a main control part 2. A language processing part 5 reads out the sentence from the sentence accumulating part 4, divides it into analytic units, executes a language analysis at every analytic word and replaces the language with the information of a phonetic symbol and a meter symbol. An acoustic processing part 6 synthesizes a sound signal from the phonetic symbol and the meter symbol, and outputs a voice from a loudspeaker 7. Also, with regard to the sentence concerned, as for the sentence whose reading or meter is incorrect, or the sentence being different from a user's intention, the original sentence is displayed on a display screen from the input/output display device 1, the range is designated by a cursor, and the phonetic symbol and the meter symbol which are intended are inputted in advance.

Description

【発明の詳細な説明】［概　要］本発明は、漢字かな混じり文について、言語解析を行な
って、発音記号と韻律記号に変換し、これに基づき音声
を合成して出力する装置に関し、通常の言語解析の結果を用いた合成では不自然な音声と
なるような部分について自然な音声を出力することの可
能な装置を提供することを目的とし、音声合成すべき漢字かな混じり文の文字列の中の任意の
文字あるいは文字列の範囲を指定する手段と、該手段に
より指定された文字あるいは文字列については、前記言
語解析の結果を用いることなく、別途設定した記号列に
基づいて音声を合成し出力する手段とを設けることによ
り構成する。[Detailed Description of the Invention] [Summary] The present invention relates to a device that performs language analysis on a sentence containing kanji and kana, converts it into phonetic symbols and prosodic symbols, and synthesizes and outputs speech based on this. The aim is to provide a device that can output natural speech for parts that would be unnatural when synthesized using the results of language analysis. means for specifying a range of arbitrary characters or character strings in It is constructed by providing a means for synthesizing and outputting.

［産業上の利用分野］本発明は、漢字かな混じり文について、言語解析を行な
って、発音記号と韻律記号に変換し、これに基づき音声
を合成して出力する装置に関し、特に、通常の言語解析
の結果を用いた合成では不自然な音声となるような部分
について自然な音声を出力したり、あるいは、指定によ
り故意に通常と異なる韻律の音声などを出力することの
可能な文章読み上げ装置に係る。[Industrial Field of Application] The present invention relates to a device that performs linguistic analysis on a sentence containing kanji and kana, converts it into phonetic symbols and prosodic symbols, and synthesizes and outputs speech based on this. This is a text-to-speech device that can output natural speech for parts that would be unnatural when synthesized using the analysis results, or intentionally output speech with a different prosody than normal according to specifications. Related.

［従来の技術］第３図は従来の文章読み上げ装置の構成を示すブロック
図であって、５１は入力装置、５２は主制御部、５３は
入力制御部、５４は言語処理部、５５は音響処理部、５
６はスピーカを表している。[Prior Art] FIG. 3 is a block diagram showing the configuration of a conventional text reading device, in which 51 is an input device, 52 is a main control section, 53 is an input control section, 54 is a language processing section, and 55 is an audio device. Processing section, 5
6 represents a speaker.

同図において、入力装置５１から入力された漢字かな混
じり文は、言語処理部５４において、解析され、発音記
号と韻律記号（アクセント、ポーズ記号など）の列に変
換する。In the figure, a sentence containing kanji and kana input from an input device 51 is analyzed by a language processing unit 54 and converted into a string of phonetic symbols and prosodic symbols (accents, pause symbols, etc.).

そして、音響処理部５５が、これらの発音記号と韻律記
号を音声パラメータに変換した後、音声を合成し、これ
をスピーカ５６から出力する。After converting these phonetic symbols and prosodic symbols into audio parameters, the audio processing unit 55 synthesizes audio and outputs it from the speaker 56.

［発明が解決しようとする問題点］文章読み上げ装置においては、出力される音声によって
、元の文字が正しく発音されていて、文章の意味が正確
に判断できることが望ましい。[Problems to be Solved by the Invention] In a text reading device, it is desirable that the original characters are pronounced correctly and the meaning of the text can be accurately determined based on the output audio.

しかし、実際には、漢字かな混じり文中に同字異音語が
存在したり、多くの単語が結合した複合語が存在する場
合には、これらに対して、正しい発音記号や適切な韻律
記号を付与するのは容易ではない。However, in reality, when there are homographs and allophones in a sentence containing kanji and kana, or when there are compound words made up of many words, the correct phonetic symbols and appropriate prosodic symbols are used for these words. It is not easy to grant.

もし、これらを相当程度に実用に耐えるものにしようと
すると意味解析を導入したり、個々の単語側のルールを
設ける等の手段が必要となるから、前記第３図に示した
ような簡潔な構成の装置で、これを実現することは不可
能である。If we were to make these highly practical, we would need to introduce semantic analysis or set up rules for individual words, so we would need to use a simple method like the one shown in Figure 3 above. It is impossible to achieve this with the configuration of the device.

そのなめ、従来、同字異音語や多くの単語が結合した複
合語が存在する場合に、これらを正しく発音する文章読
み上げ装置を経済的に実現することが困難であるという
問題点があった。Therefore, in the past, when there were homographs and compound words made up of many words, it was difficult to economically realize a text-to-speech device that could correctly pronounce these words. .

本発明は、このような従来の問題点に鑑み、前述のよう
な特別な場合の発声を正しく行なうことが可能であると
共に簡潔な構成で経済的に実現することの可能な文章読
み上げ装置を提供することを目的としている。In view of these conventional problems, the present invention provides a text reading device that can correctly perform utterances in special cases such as those mentioned above, and that can be realized economically with a simple configuration. It is intended to.

［問題点を解決するための手段］本発明によれば、上述の目的は、前記特許請求の範囲に
記載した手段により達成される。すなわち、本発明は、
漢字かな混じり文について、言語解析を行なって、発音
記号と韻律記号に変換し、これに基づき音声を合成して
出力する装置において、音声合成すべき漢字かな混じり
文の文字列の中の任意の文字あるいは文字列の範囲を指
定する手段と、該手段により指定された文字あるいは文
字列については、前記言語解析の結果を用いることなく
、別途設定した記号列に基づいて音声を合成し出力する
手段とを設けた文章読み上げ装置である。[Means for Solving the Problems] According to the present invention, the above objects are achieved by the means described in the claims. That is, the present invention
A device that performs linguistic analysis on a sentence containing kanji and kana, converts it into phonetic symbols and prosodic symbols, and synthesizes and outputs speech based on this. A means for specifying a range of characters or character strings, and a means for synthesizing and outputting speech based on separately set symbol strings for the characters or character strings specified by the means, without using the results of the linguistic analysis. This is a text reading device equipped with

［作用］本発明による文章読み上げ装置においては、例えば「私
は五月が好きです」という文章があって、通常の言語解
析による方法では「私はゴガツが好きです」と音声合成
される所を、「私はサツキが好きです」と発声させたい
とき、゛五月″について範囲指定を成し、別途、“サツ
キパと音声合成するための発音記号と韻律記号を設定す
る。[Function] The text reading device according to the present invention can, for example, read a sentence such as ``I like Satsuki,'' which would be voice synthesized as ``I like Gogatsu'' using a normal language analysis method. , when you want to utter ``I love Satsuki'', specify the range for ``Satsuki'' and separately set the phonetic symbol and metrical symbol for voice synthesis with ``Satsukipa''.

文章読み上げ装置では、音声合成すべき文中に、上記範
囲指定された箇所（五月）を検出すると、該箇所につい
ては言語解析の結果を無視して用いることによりパサツ
キ″と音声合成して出力する。When the text-to-speech device detects the part specified in the above range (May) in the sentence to be synthesized, it ignores the results of the language analysis and outputs the synthesized speech as "Pasatsuki". .

［実施例］第１図は本発明の一実施例のブロック図であって、文章
読み上げ装置の構成の例を示しており、１は入出力表示
装置、２は主制御部、３は入出力制御部、４は文章蓄積
部、５は言語処理部、６は音響処理部、７はスピーカを
表している。[Embodiment] FIG. 1 is a block diagram of an embodiment of the present invention, showing an example of the configuration of a text reading device, in which 1 is an input/output display device, 2 is a main control unit, and 3 is an input/output device. A control section, 4 is a text storage section, 5 is a language processing section, 6 is a sound processing section, and 7 is a speaker.

同図において、入出力表示装置１から入力された漢字か
な混じり文による文章は、入出力制御部３と主制御部２
により、文章蓄積部４に送られて、該文章蓄積部４に格
納される。In the figure, a sentence containing kanji and kana input from the input/output display device 1 is sent to the input/output control unit 3 and the main control unit 2.
As a result, the text is sent to the text storage section 4 and stored therein.

言語処理部５は、文章蓄積部４から文章を読み出し、解
析単位に分割して、該解析単語ごとに言語解析を行なっ
て言語を発音記号と韻律記号との情報に変換する。The language processing unit 5 reads the text from the text storage unit 4, divides it into analysis units, performs language analysis on each analyzed word, and converts the language into information on phonetic symbols and prosodic symbols.

音響処理部６は、上記発音記号と韻律記号とから音声信
号を合成しスピーカ７から音声として出力する。The sound processing section 6 synthesizes a sound signal from the phonetic symbols and the prosodic symbols, and outputs the signal from the speaker 7 as sound.

このとき、該当する文章について、読みや韻律が正しく
ないものや、利用者の意図と異なるものについては、入
出力表示装置１から、元の文章を表示画面に表示し、カ
ーソルによって、該当する範囲を指定すると共に、該範
囲の音声合成に係る発音記号と韻律記号を入力する。以
降、該文章について音声合成を行なうとき範囲指定が成
されている箇所については言語解析を行なうことなく、
対応する発声記号と韻律記号が直接音響処理部に送られ
、これによって音声合成が成される。At this time, if the corresponding sentence is incorrect in pronunciation or prosody, or is different from the user's intention, the input/output display device 1 displays the original sentence on the display screen, and uses the cursor to select the appropriate range. is specified, and the phonetic symbols and prosodic symbols related to speech synthesis for the range are input. From now on, when performing speech synthesis on the text, language analysis will not be performed for the parts where the range has been specified.
Corresponding phonetic symbols and prosodic symbols are sent directly to the audio processing section, thereby performing speech synthesis.

第２図は上述の制御と操作について流れ図として示した
ものであって、（ａ）は言語解析による結果の合成音声
について、これと異なる発声をさせたい場合の指定の方
法について示しており、（ｂ）は文章中に言語解析の結
果によらない音声合成に係る表示のある場合の音声合成
方法について示している。FIG. 2 shows a flowchart of the above-mentioned control and operation, and (a) shows a method for specifying a different utterance for the synthesized speech resulting from language analysis. b) shows a speech synthesis method when there is an indication related to speech synthesis not based on the result of language analysis in the text.

［発明の効果］以上説明したように、本発明によれば、漢字かな混じり
文について、言語解析を行なって、発声記号と韻律記号
に変換し、これに基づき音声を合成して出力する装置に
、通常の言語解析の結果を用いた合成では不自然な音声
となるような部分について自然な音声を出力したり、あ
るいは、指定により故意に通常と異なる韻律の音声など
を出力することの可能な文章読み上げ装置を簡潔な構成
で容易に実現し得る利点がある。[Effects of the Invention] As explained above, according to the present invention, a device that performs linguistic analysis on sentences containing kanji and kana, converts them into phonetic symbols and prosodic symbols, and synthesizes and outputs speech based on these symbols. , it is possible to output natural speech for parts that would be unnatural when synthesized using the results of normal language analysis, or to output speech with a prosody that is intentionally different from normal by specifying it. There is an advantage that the text reading device can be easily realized with a simple configuration.

[Brief explanation of the drawing]

第１図は本発明の一実施例のブロック図、第２図は本発
明の一実施例の制御と操作を示す流れ図、第３図は従来
の文章読み上げ装置の構成を示すブロック図である。１・・・・・・入出力表示装置、２・・・・・・主制御
部、３・・・・・・入出力制御部、４・・・・・・文章
蓄積部、５・・・・・・言語処理部、６・・・・・・音
響処理部、７・・・・・・スピーカFIG. 1 is a block diagram of an embodiment of the present invention, FIG. 2 is a flowchart showing control and operation of an embodiment of the invention, and FIG. 3 is a block diagram showing the configuration of a conventional text reading device. 1... Input/output display device, 2... Main control section, 3... Input/output control section, 4... Text storage section, 5... ... Language processing section, 6 ... Sound processing section, 7 ... Speaker

Claims

[Claims] In a device that performs language analysis on a sentence containing kanji and kana, converts it into phonetic symbols and prosodic symbols, and synthesizes and outputs speech based on this, the characters of the sentence containing kanji and kana to be synthesized into speech. A means for specifying a range of arbitrary characters or character strings in a string; and a means for specifying a range of arbitrary characters or character strings in a string; A text reading device characterized by comprising: means for synthesizing and outputting.