JPH067341B2

JPH067341B2 - Speech synthesis method

Info

Publication number: JPH067341B2
Application number: JP58186733A
Authority: JP
Inventors: 熹市川
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1983-10-07
Filing date: 1983-10-07
Publication date: 1994-01-26
Anticipated expiration: 2009-01-26
Also published as: JPS6079400A

Description

【発明の詳細な説明】〔発明の利用分野〕本発明は音声合成、特に記号やアルファベット、特殊記
号等を含む任意内容の特殊文字列からなる入力より音声
を合成するに好適な音声合成方法に関する。The present invention relates to speech synthesis, and more particularly to a speech synthesis method suitable for synthesizing speech from an input composed of special character strings of arbitrary contents including symbols, alphabets, special symbols and the like. .

[Background of the Invention]

任意の文字列を入力として、音声を合成する方法は規則
合成、あるいは法則合成と呼ばれ、各種技術が開発され
て来た。しかしながら、これまでの技術では、入力には
カナ文字列か、ローマ字列、あるいは漢字・カナ混合文
までであり、日常使われている文章に現われるアルファ
ベットや記号の取扱い方法は開発されていない。さら
に、コンピュータ端末等と同様のフォーマットで合成部
へ出力するような場合、各種の端末制御用の記号も混在
しており、入力符号列をそのまま音声に変換することは
困難である。また新たに利用者が定義した略語等は辞書
を用意できないため、辞書を利用した方法では処理でき
ない。A method of synthesizing a voice by inputting an arbitrary character string is called rule synthesis or law synthesis, and various techniques have been developed. However, in the technology so far, the input method is only a kana character string, a roman character string, or a mixed kanji / kana sentence, and a method of handling alphabets and symbols appearing in everyday sentences has not been developed. Furthermore, when outputting to the synthesis unit in the same format as that of a computer terminal or the like, various terminal control symbols are mixed, and it is difficult to convert the input code string as it is to speech. Moreover, since a dictionary cannot prepare a new abbreviation newly defined by the user, it cannot be processed by the method using the dictionary.

[Object of the Invention]

本発明の目的は、カナ、漢字の他、アルファベットや各
種記号、制御用記号を含む符号列により表現される任意
内容の文章を音声に変換する方法を提供することにあ
る。It is an object of the present invention to provide a method for converting a sentence of arbitrary content represented by a code string including kana, kanji, alphabet, various symbols, and control symbols into voice.

[Outline of Invention]

上記の目的を達成するため本発明では、従来の規則合成
処理部の前段に、記号やアルファベット列を見出し語と
し、その読みを内容とする辞書を設け、入力の記号やア
ルファベット列を、その読みの文字列（たとえばカナ
列）に変換した後、規則合成処理部へ引き渡す。ただ
し、記号には、カンマなど、文章の表記記号として用い
るものもあり、これらはそのまま記号として規則合成処
理部に引き渡す必要があり、それらは変換から除外す
る。これらの記号自身を読みの対象とするためには、エ
スケープ記号を用意し、エスケープ記号で囲まれた場合
は、その読みに変換する。記号は原則的に一記号単位で
処理し、エスケープ記号のあらわれた場合のみ一連の列
として処理を行なう。アルファベットは、一連のアルフ
ァベットを単位として辞書処理を行なう。アルファベッ
ト列には、NATO／ナトウ／のように読みの習慣のある略
号や外国語単語は、その読み（カナ列等）に変換する
が、利用者の定義した略号等は、利用者がその読みを辞
書に登録しないかぎり、アルファベット列を構成する各
文字毎に、読みに変換する。ギリシャ文字等の特殊文字
や数字等も同様の取扱いが可能である。To achieve the above object, in the present invention, a symbol or alphabet string is used as a headword and a dictionary having its reading as a content is provided before the conventional rule synthesis processing unit, and the input symbol or alphabet string is read. Is converted to a character string (for example, a kana string) and then passed to the rule composition processing unit. However, some symbols, such as commas, are used as notation symbols in sentences, and these need to be passed as they are to the rule synthesis processing unit, and they are excluded from the conversion. In order to make these symbols themselves to be read, an escape symbol is prepared, and when surrounded by escape symbols, they are converted to the reading. In principle, symbols are processed in units of one symbol, and only when an escape symbol appears, a series of strings is processed. For the alphabet, dictionary processing is performed in units of a series of alphabets. Abbreviations and foreign words that have a habit of reading such as NATO / Nato / are converted to their readings (Kana string, etc.) in the alphabet string, but the abbreviations defined by the user are read by the user. Unless is registered in the dictionary, each character forming the alphabet string is converted into reading. Special characters such as Greek letters and numbers can be handled in the same way.

Example of Invention

以下、本発明の一実施例を図をもって説明する。第１図
は本実施例を説明するための装置のブロック図である。
第１図において、記号やアルファベットなどの特殊文字
を含むカナ漢字混合文が入力端１より制御部２に入力さ
れる。制御部はあらかじめ定められた手順に従い、音声
合成用制御情報メモリ３から合成に必要な情報を取り出
し、合成部４の制御情報を作り、順次合成部４に送り込
み音声波形に合成し出力端５より出力する。カナ漢字文
字列から合成制御情報に変換する手順については、たと
えば、文献(1)日本音響学会音声研究会資料のＳ８２−
０８「日本語テキスト変換の検討」等で明らかにされて
いる技術を用いることができる。また、合成制御情報か
ら音声を合成する技術は、たとえば、文献(2)電子通信
学会論文誌Vo．Ｊ６１−Ｄ，No.１１pp.８５８−８６
５（７８／１１）「PARCOR−VCVを用いた音声合成方
式」に開示されており、容易に実現することができる。
従って、ここでは記号、アルファベット等を含むカナ漢
字混合入力から、カナ漢字混合文への変換手順をのみ示
せば十分である。この変換は制御部２と、各種変換辞書
６を用いて実行される。以下簡単のため、入力をカナ、
漢字、記号とアルファベットからなるものとするが、ギ
リシャ文字や数字列等の処理はアルファベットの処理等
と同様の処理を付加することにより実現できるので省略
する。第２図は、この変換の手順を示すフローチャート
である。第２図において、各種記号やアルファベットを
含むカナ漢字混合文が入力端２１より入力すると、入力
コードを判定部２２で順次判定し、それぞれの処理に振
り分ける。記号は文章表記用記号判定部２３へ、カナと
漢字はパス２７を経て、そのままカナ・漢字入力文から
音声合成制御情報を作成する規則処理部３３へ、アルフ
ァベットはアルファベット文字列作成部２８に送られ
る。文章表記用記号判定部２３では、カンマ（，）、点
（．）、カッコ（「，」）等は、エスケープ処理判定部
２４に送り、その他の記号は、記号読み辞書処理部２５
に送る。エスケープ処理判定部２４は、エスケープ記号
に入力記号が囲まれていれば、記号読み辞書処理部２５
へ、その他は、そのまま規則処理部３３へ送る。記号読
み辞書処理部２５は第１図の各種変換辞書６中の記号読
み辞書を引き、記号に対応する読みをカナ表記として、
入力記号に置き換え、規則処理部３３へ送る。エスケー
プ記号は読みがないものとして処理する。An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram of an apparatus for explaining this embodiment.
In FIG. 1, a kana-kanji mixed sentence including special characters such as symbols and alphabets is input to the control unit 2 from the input terminal 1. According to a predetermined procedure, the control unit extracts information necessary for synthesis from the voice synthesis control information memory 3, creates control information for the synthesis unit 4, sequentially sends it to the synthesis unit 4, synthesizes it into a voice waveform, and outputs it from the output terminal 5. Output. For the procedure for converting the kana-kanji character string into the synthesis control information, see, for example, S82- in Reference (1) Material of Acoustical Society of Japan.
The technology disclosed in 08 "Study of Japanese text conversion" or the like can be used. A technique for synthesizing a voice from synthesis control information is described in, for example, the literature (2) IEICE Transactions Vo. J61-D, No.11 pp.858-86
5 (78/11) “Voice synthesis method using PARCOR-VCV” and can be easily realized.
Therefore, here, it is sufficient to show only the conversion procedure from the kana-kanji mixed input including symbols and alphabets to the kana-kanji mixed sentence. This conversion is executed using the control unit 2 and various conversion dictionaries 6. For simplicity, enter the
Although it is assumed to be composed of kanji, symbols and alphabets, the processing of Greek characters and numeral strings can be realized by adding the same processing as the processing of alphabets and so will be omitted. FIG. 2 is a flowchart showing the procedure of this conversion. In FIG. 2, when a kana-kanji mixed sentence including various symbols and alphabets is input from the input terminal 21, the input code is sequentially determined by the determination unit 22 and assigned to each process. The symbols are sent to the text notation symbol determination unit 23, the Kana and Kanji are sent to the rule processing unit 33 that creates the voice synthesis control information from the Kana / Kanji input sentence, and the alphabet is sent to the alphabet character string creation unit 28 via the path 27. To be In the sentence notation symbol determination unit 23, commas (,), dots (.), Parentheses (“,”), etc. are sent to the escape processing determination unit 24, and other symbols are symbol reading dictionary processing unit 25.
Send to. If the input symbol is surrounded by the escape symbol, the escape processing determination unit 24 determines the symbol reading dictionary processing unit 25.
, And others are sent to the rule processing unit 33 as they are. The symbol reading dictionary processing unit 25 looks up the symbol reading dictionary in the various conversion dictionaries 6 of FIG. 1 and uses the reading corresponding to the symbol as Kana notation.
It is replaced with the input symbol and sent to the rule processing unit 33. Escape symbols are treated as unreadable.

アルファベット文字列作成部２８は、入力コード列がア
ルファベットから他のコードに変った時点で、アルファ
ベット文字列が終了したものと判定し、アルファベット
文字列読み辞書処理部２９にて、第１図の各種変換辞書
６中のアルファベット文字列読み辞書を引き、その読み
のカナ文字列に変換する。辞書にない場合は、アルファ
ベット文字列の文字毎にアルファベットの読みに変換、
カナ文字列として処理部３２で処理し、規則処理部３３
へ送る。規則処理部３３は、上記の文献(2)のような処
理を行ない、合成部の制御コードに変換し出力３４す
る。The alphabetic character string creating unit 28 determines that the alphabetic character string has ended when the input code string changes from the alphabet to another code, and the alphabetic character string reading dictionary processing unit 29 causes the various characters shown in FIG. The dictionary for reading alphabetic character strings in the conversion dictionary 6 is drawn and converted into a kana character string for the reading. If it is not in the dictionary, it is converted into alphabet reading for each character in the alphabet string,
The processing unit 32 processes it as a Kana character string, and the rule processing unit 33
Send to. The rule processing unit 33 performs the process as in the above-mentioned document (2), converts it into the control code of the synthesizing unit, and outputs it.

なお、カナ以外のすべてのコード列を漢字と同様に取扱
い、漢字を含む一括辞書でカナへ変換し、辞書にないも
ののみを、アルファベット文字毎の読み変換処理３２と
同様の処理によりカナ文字に変換する手順も同様の効果
を上げることができる。All code strings other than kana are treated like kanji, converted to kana with a batch dictionary containing kanji, and only those that are not in the dictionary are converted to kana characters by the same process as the reading conversion process 32 for each alphabetic character. A similar effect can be obtained in the conversion procedure.

〔The invention's effect〕

以上説明したごとく、本発明によればカナと漢字及び文
章表記上の記号以外の記号を含む任意の文章から、音声
として聞きやすい形式で、音声を合成することが可能で
ある。As described above, according to the present invention, it is possible to synthesize a voice in an easily audible format from an arbitrary sentence including kana, kanji, and symbols other than the symbols on the sentence notation.

[Brief description of drawings]

第１図は本発明の一実施例を説明するブロック図、第２
図は、本発明の処理を説明するためのフローチャートで
ある。１…入力端、２…制御部、３…音声合成用制御情報メモ
リ、４…合成部、５…出力端、６…各種変換辞書。FIG. 1 is a block diagram for explaining an embodiment of the present invention, and FIG.
The figure is a flow chart for explaining the process of the present invention. 1 ... Input end, 2 ... Control part, 3 ... Voice synthesis control information memory, 4 ... Synthesis part, 5 ... Output end, 6 ... Various conversion dictionaries.

Claims

[Claims]

1. A kana character and a kanji character as well as a sentence including at least an alphabet and a symbol are inputted, and each input code in the inputted sentence is sequentially judged by the control means,
In a voice synthesizing method of giving a reading corresponding to an input code by using a dictionary, synthesizing the reading regularly by a synthesizing means to synthesize a voice, when the determined input code is kana or kanji, the dictionary Is used to give a reading corresponding to the kana or kanji character string, and the readings are combined in a rule. When the determined input code is a symbol, it is determined whether the symbol is a predetermined sentence notation symbol. However, (1) when the symbol is a predetermined text notation symbol, it is further determined whether or not the symbol is enclosed by an escape symbol. If the symbol is enclosed by the escape symbol, the dictionary is provided as the dictionary. Using the symbol reading dictionary provided, the escape symbol is given a reading corresponding to the symbol as having no reading, and the reading is rule-synthesized, and when not enclosed by the escape symbol, (2) When the symbol is not a predetermined text notation symbol, the symbol corresponding to the symbol is given using the symbol reading dictionary, and the reading is rule-synthesized. When the determined input code is an alphabet, an alphabetic character string is created with a series of the alphabetic characters as a unit, and whether or not there is a reading corresponding to the alphabetic character string in the alphabetic character string reading dictionary provided as the dictionary. If it is determined that the alphabetic character string is in the alphabetic character string reading dictionary,
The reading is subjected to rule synthesis, and when the alphabet character string is not in the alphabet string reading dictionary, it is converted into alphabet reading for each character of the alphabet, and the reading is rule-synthesized. Synthesis method.