JP2622834B2

JP2622834B2 - Text-to-speech converter

Info

Publication number: JP2622834B2
Application number: JP61205046A
Authority: JP
Inventors: 春樹関; 達也平原
Original assignee: Nippon Telegraph and Telephone Corp; Matsushita Communication Industrial Co Ltd
Current assignee: Nippon Telegraph and Telephone Corp; Panasonic Mobile Communications Co Ltd
Priority date: 1986-09-02
Filing date: 1986-09-02
Publication date: 1997-06-25
Anticipated expiration: 2012-06-25
Also published as: JPS6361297A

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、ワードプロセッサの入力文章を音声に自動
変換し、音声による原稿との照合、あるいはメッセージ
通信の音声出力に用いる文音声変換装置に関するもので
ある。Description: FIELD OF THE INVENTION The present invention relates to a sentence-to-speech conversion apparatus for automatically converting an input sentence of a word processor into a sound and comparing the sentence with a manuscript by sound or outputting a message communication sound. It is.

（従来の技術）第２図は、従来の文音声変換装置の構成を示してい
る。(Prior Art) FIG. 2 shows a configuration of a conventional sentence-to-speech conversion device.

１は本装置に漢字かな混じり文のコードを入力するた
めのインタフェース、２は漢字かに変換辞書、３は漢字
かな変換部、４は単音節ファイル、５は音声合成部、６
は音声出力スピーカである。1 is an interface for inputting kanji-kana mixed sentence codes to the apparatus, 2 is a kanji-kanji conversion dictionary, 3 is a kanji-kana conversion section, 4 is a monosyllabic file, 5 is a speech synthesis section, 6
Is an audio output speaker.

インタフェース１より入力された漢字かな混じりコー
ドは、漢字かな変換部３で漢字かな変換辞書２を検索し
てその情報を元に読みとアクセント型が付与され、カナ
コード列に変換される。The kanji / kana mixed code input from the interface 1 is searched by the kanji / kana conversion unit 2 and converted to a kana code string by reading and accenting based on the information.

音声合成部５では、カナコード列に対応する音節の合
成パラメータを単音節ファイル４より検索し、これを元
に音声を合成する。The speech synthesizing unit 5 searches the single syllable file 4 for a syllable synthesis parameter corresponding to the kana code string, and synthesizes a speech based on this.

漢字かな変換辞書２には、単語を構成する漢字コー
ド、読み、アクセント型、品詞等の情報が格納されてい
る。The kanji-kana conversion dictionary 2 stores information such as kanji codes, readings, accent types, parts of speech, etc., constituting words.

この辞書には、漢字の間にひらがなを挟まない単語
（例：枯尾花）は含まれているが、漢字の間にひらがな
を挟んだ単語（例：枯れ尾花、物の数）は含まれていな
かった。このため、ひらがなを挟んだ単語に与えられる
読み、アクセント型が不自然なものになっていた。This dictionary contains words that do not have hiragana between kanji characters (eg, Karaeoka), but do not include words that have hiragana between kanji characters (eg, withered flower, number of objects). For this reason, the reading and accent type given to the words sandwiching the hiragana became unnatural.

例えば（第１文例）のような文章が入力された場合の
「地の利があるかについて」に対する従来の文音声変換
の流れを第３図に示す。同図において、漢字とそれに続
くひらがなを一つのまとまりとして文節に区切る（／地
の／利があるかについて／）。次に、それぞれの区切り
に対して、前記漢字かな変換辞書２を検索して文法的な
チェックを加え、読み，アクセントを与える。具体的に
は、「地の」は「の」が名詞に続く助詞と判断され、という読みとアクセントが与えられる。また「利がある
かについて」はひらがな部分についてどの助詞と接合す
るかを調べ、「が」が名詞に続く助詞と判断され、という読みとアクセントが与えられる。For example, FIG. 3 shows a flow of conventional sentence-to-speech conversion for "whether there is an advantage" when a sentence such as (first sentence example) is input. In the figure, the kanji and the hiragana following it are grouped as a unit (// of the ground / whether there is an advantage /). Next, the Kanji-Kana conversion dictionary 2 is searched for each delimiter, grammatical checks are added, and reading and accent are given. Specifically, "no" is determined as a particle whose "no" follows a noun, Readings and accents. In addition, for "whether there is profit," the hiragana part is checked with which particle to join, and "ga" is determined to be a particle following the noun, Readings and accents.

（第１文例）の他の文章に対しても、同様にして（第
２文例）のように変換される。（第１文例）どちらに地の利があるかについて真っ二つに意見が分か
れた。The other sentences (first sentence example) are similarly converted as (second sentence example). (First sentence example) Two opinions were divided on which of the land is more advantageous.

（第２文例）しかし、人間が発声する場合には、のような読み、アクセント型が与えられるはずである。(Second sentence example) However, when humans utter, Readings and accent types should be given.

ここで、漢字の変換は漢字かな変換辞書中の単語の情
報（品詞、活用等）に基づいて最適なものが与えられる
が、これがない場合には音読みとＯ型アクセントが与え
られる。最終的に、単語とそれに続くひらがなは一つの
アクセント型を持つように纏められる。Here, the most suitable kanji conversion is given based on the word information (part of speech, utilization, etc.) in the kanji / kana conversion dictionary. If there is no kanji conversion, the on-speech reading and the O-type accent are given. Finally, the word and the following hiragana are combined to have one accent type.

従来例のように、漢字の間にひらがなを挟んだ単語の
情報を辞書に持たない文音声変換方法では、人間が発音
した場合と異なる読み、アクセントが与えられ、文章の
自然性、了解性が悪くなるという問題を有していた。As in the conventional example, in a sentence-to-speech conversion method that does not have information on words with hiragana between kanji in the dictionary, reading and accent different from those pronounced by humans are given, and the naturalness and intelligibility of sentences are Had the problem of getting worse.

（発明が解決しようとする問題点）本発明は、上記従来例の問題を解決するものであり、
変換された音声の不自然さをなくし、文章の了解度を向
上させることを目的としている。(Problems to be solved by the invention) The present invention is to solve the above-mentioned problems of the conventional example,
An object of the present invention is to eliminate the unnaturalness of the converted speech and improve the intelligibility of sentences.

（問題点を解決するための手段）本発明は上記目的を達成するために、単語を構成する
漢字コード，読み，アクセント型，品詞等の情報に、漢
字の間にひらがなを挟む単語の情報が追加された漢字か
な変換辞書と、この漢字かな変換辞書に漢字かな混じり
コードを出力するためのインタフェースと、前記漢字か
な変換辞書を検索して、その情報を元に読みとアクセン
ト型を付与してカナコード列に変換し、さらに、ひらが
な前後の漢字が共に前記漢字かな変換辞書の中における
二字以上の漢字の単語として同定されなかったときに、
漢字の間にひらがなを挟む単語との同定を行う漢字かな
変換部と、この漢字かな変換辞書からのカナコード列に
対応する音節の合成パラメータを単音節ファイルより検
索し、その合成パラメータを元に音声を合成する音声合
成部とを備えて、文章を自然で了解度の高い音声に変換
するようにしたものである。(Means for Solving the Problems) According to the present invention, in order to achieve the above-mentioned object, information of a word that includes a hiragana between kanji characters is included in information such as a kanji code, a reading, an accent type, and a part of speech that constitute the word. An additional Kanji-Kana conversion dictionary, an interface for outputting Kanji-Kana mixed codes to this Kanji-Kana conversion dictionary, and searching for the Kanji-Kana conversion dictionary, giving a reading and accent type based on the information Converted to a kana code string, further, when both the kanji before and after the hiragana were not identified as two or more kanji words in the kanji kana conversion dictionary,
A kanji-kana conversion unit that identifies hiragana words between kanji and a syllable synthesis parameter corresponding to a kana code string from this kanji-kana conversion dictionary is searched from a single syllable file and based on the synthesis parameters. A speech synthesizing unit for synthesizing speech is provided to convert sentences into natural and highly intelligible speech.

（実施例）以下に、本発明の一実施例について説明する。Example An example of the present invention will be described below.

入力文中の文字列が辞書中の単語により同定された場
合、文字数がより多い単語として同定された場合の方
が、正しく読み、アクセント型が与えられる可能性が高
い。When a character string in the input sentence is identified by a word in the dictionary, the case where the character string is identified as a word having more characters is more likely to be correctly read and given an accent type.

本実施例ではこの点を考慮して、次のように辞書の検
索を行う。In this embodiment, in consideration of this point, the dictionary is searched as follows.

従来の辞書、すなわち第２図に示す文音声変換装置に
おける漢字かな変換辞書２に、「地の利」、「世の中」
のように漢字一字により両側に挾まれた単語の読み，ア
クセント等の情報を追加し、ひらがなに先行する漢字、
および後続する漢字が、共に二字以上の漢字の単語とし
て同定されなかった場合、前記情報を追加した漢字かな
変換辞書２を漢字かな変換部３にて検索する。In the conventional dictionary, that is, the Kanji-Kana conversion dictionary 2 in the sentence-to-speech conversion device shown in FIG.
Information such as the reading of words and accents sandwiched on one side by one kanji character, such as
If the kanji and the succeeding kanji are not identified as two or more kanji words, the kanji / kana conversion unit 3 searches the kanji / kana conversion dictionary 2 to which the information is added.

例えば前記（第１文例）のような文章に対する実施例
における文音声変換の流れを第１図に示す。同図におい
て第３図の従来における文音声変換の流れと異なるのは
以下の点であって、「地の」と「利があるかについて
は」は共に漢字一字のみを含むので、「地の利があるか
について」のようにまとめ（連結）た上で前記情報を追
加した漢字かな変換辞書２内の単語と比較する。そし
て、「が」が名詞に続く助詞と判定され、また辞書内に
おいて「地の利」という名詞が検索できることから、と同定される。For example, FIG. 1 shows the flow of sentence-to-speech conversion in the embodiment for a sentence as described above (first sentence example). 3 differs from the conventional sentence-to-speech conversion flow shown in FIG. 3 in the following points. Since both “chi no no” and “do you have interest” contain only one kanji, And then compare it with a word in the Kanji-Kana conversion dictionary 2 to which the information has been added. Then, "ga" is determined to be a particle following the noun, and since the noun "geichi no ri" can be searched in the dictionary, Is identified.

「真っ二つ」についても同様である。この結果、（第
１文例）は（第３文例）のように読み、アクセントが与
えられる。The same is true for "the two". As a result, (first sentence example) is read and accented as (third sentence example).

（第３文例）同様に（第４文例）は（第５文例）のようになる。(3rd sentence example) Similarly, (fourth sentence example) becomes like (fifth sentence example).

（第４文例）空き地の利用についての反対は物の数ではない。(4th sentence) The opposite of using vacant land is not the number of objects.

（第４文例）第４文例では「空地」、「利用」が二字の単語として
同定されたから「地の利」の検索を行わない。(4th sentence example) In the fourth sentence example, since “vacant land” and “use” are identified as two-letter words, the search for “ground advantage” is not performed.

本実施例では、上記のように辞書の追加および辞書検
索の追加を行っているので、人間の発声に近い自然な音
声が出され、文章の了解度が向上するという利点があ
る。また、本実施例は、上記のように新しい辞書の追加
および辞書検索方法を採用するものであり、漢字の間に
ひらがなを挟む単語の辞書情報を追加し、この辞書情報
の検索を行うようにしているので、より自然な読み、ア
クセントが得られるようになる。In the present embodiment, since the addition of the dictionary and the addition of the dictionary search are performed as described above, there is an advantage that a natural voice close to human utterance is output and the intelligibility of the sentence is improved. Further, the present embodiment adopts a new dictionary addition and dictionary search method as described above, and adds dictionary information of a word sandwiching hiragana between kanji and searches for this dictionary information. So you can get more natural readings and accents.

（発明の効果）本発明は、上記のように辞書の追加および検索法の改
良をして、漢字の間にひらがなを挟む単語にも正しい読
みとアクセントを付与するようにしているので、自然な
アクセントが与えられ、合成した音声の了解度を向上で
きるという利点を有する。(Effects of the Invention) The present invention adds a dictionary and improves a search method as described above so that correct readings and accents are given to words sandwiching hiragana between kanji characters. This has the advantage that the accent is given and the intelligibility of the synthesized speech can be improved.

[Brief description of the drawings]

第１図は本発明の一実施例における文音声変換の流れを
示す図、第２図は従来の文音声変換装置の構成を示す
図、第３図は従来の文音声変換の流れを示す図である。１……インタフェース、２……漢字かな変換辞書、３…
…漢字かな変換部、４……単音節ファイル、５……音声
合成部、６……スピーカ。1 is a diagram showing a flow of a sentence-to-speech conversion in one embodiment of the present invention, FIG. 2 is a diagram showing a configuration of a conventional sentence-to-speech conversion device, and FIG. 3 is a diagram showing a flow of a conventional sentence-to-speech conversion. It is. 1 ... Interface 2 ... Kanji Kana conversion dictionary 3 ...
... Kanji-Kana conversion unit, 4 ... single syllable file, 5 ... Speech synthesis unit, 6 ... Speaker.

Claims

(57) [Claims]

1. A Kanji-Kana conversion dictionary in which information such as Kanji codes, readings, accent types, parts of speech, etc. constituting words are added to information on words sandwiching Hiragana between Kanji characters. An interface for outputting kanji-kana mixed codes and searching the kanji-kana conversion dictionary, converting the kana code string by adding reading and accent type based on the information, and furthermore, the kanji before and after the hiragana A kanji-kana conversion unit for identifying a word sandwiching hiragana between kanji when not identified as a word of two or more kanji in the kanji-kana conversion dictionary; Sentence-to-speech conversion characterized by comprising a speech synthesizing unit for retrieving a syllable synthesis parameter corresponding to a kana code sequence from a monosyllable file and synthesizing speech based on the synthesis parameter. Location.