JP6142632B2

JP6142632B2 - Word dictionary registration computer program, speech synthesizer, and word dictionary registration registration method

Info

Publication number: JP6142632B2
Application number: JP2013077124A
Authority: JP
Inventors: 英樹小島
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2013-04-02
Filing date: 2013-04-02
Publication date: 2017-06-07
Anticipated expiration: 2033-04-02
Also published as: JP2014202823A

Description

本発明は、例えば、テキストデータから音声信号を合成する音声合成装置及びその音声合成装置で利用される単語辞書の登録方法及び単語辞書登録用コンピュータプログラムに関する。 The present invention relates to a speech synthesizer that synthesizes a speech signal from text data, a word dictionary registration method and a word dictionary registration computer program used in the speech synthesizer, for example.

近年、音声を自動合成する音声合成技術が開発されている。音声合成技術は、短時間で所望の音声を作成できるというメリットを有するため、これまで予め録音されたプロのナレータによる音声を用いていたアプリケーションの中には、このような音声合成技術を採用したものもある。特に、商業施設における案内放送、ハイウェイラジオ、ハイウェイテレホンまたは天気予報の放送など、短い時間間隔で提供する情報が更新されるアプリケーションでは、上記のメリットを持つ音声合成技術が有用である。 In recent years, speech synthesis technology for automatically synthesizing speech has been developed. Since speech synthesis technology has the advantage that it can create desired speech in a short time, such speech synthesis technology has been adopted in applications that have used pre-recorded speech by professional narrators. There are also things. In particular, in an application in which information provided at a short time interval is updated, such as a guidance broadcast in a commercial facility, a highway radio, a highway telephone, or a weather forecast broadcast, the speech synthesis technology having the above-described advantages is useful.

合成したい音声信号を生成するために、音声合成装置には、例えば、キーボードなどを介して漢字仮名交じりのテキストデータが入力される。そして音声合成装置は、そのテキストデータに対して、単語を漢字と仮名で表した漢字仮名表記とその単語の発音を表す表音文字列などを登録した単語辞書を利用して、形態素解析または係り受け解析といった言語処理を行う。そして音声合成装置は、その言語処理によって、テキストデータの表音文字列と、その表音文字列にアクセント位置、アクセントの強弱あるいは抑揚の大小といった韻律を表す韻律記号を付した中間表記を生成する。そして音声合成装置は、その中間表記に基づいて、合成音声信号を生成する。 In order to generate a speech signal to be synthesized, text data mixed with kanji characters is input to the speech synthesizer via, for example, a keyboard. Then, the speech synthesizer uses a word dictionary in which kanji kana notation representing a word in kanji and kana and a phonetic character string representing pronunciation of the word are registered with respect to the text data, and morphological analysis or Performs language processing such as receiving analysis. The speech synthesizer then generates a phonetic character string of the text data and an intermediate notation in which the phonetic character string is added with a prosodic symbol representing the prosody such as accent position, accent strength, or inflection magnitude. . Then, the speech synthesizer generates a synthesized speech signal based on the intermediate notation.

日常で使用される単語は日々変化するので、全ての単語を予め単語辞書に登録することは事実上不可能である。そこで、入力テキストデータの終端に至った際にいままで抽出された未知語を一括してユーザに伝え、その未知語とユーザにより入力されたその未知語についての情報を単語辞書に登録する技術が提案されている（例えば、特許文献１を参照）。 Since words used in daily life change day by day, it is virtually impossible to register all words in the word dictionary in advance. Therefore, when the input text data reaches the end of the input text data, the unknown words extracted so far are collectively transmitted to the user, and the unknown word and the information about the unknown word input by the user are registered in the word dictionary. It has been proposed (see, for example, Patent Document 1).

しかし、漢字には複数の読み方があり、また、単語によってその単語の漢字仮名表記に含まれる漢字の読み方は異なる。場合によっては、登録しようとする単語の漢字仮名表記に含まれる漢字の読みが、その漢字が単独で用いられるときの読みとは異なることがある。そのため、上記の技術では、登録しようとする単語の表音文字列が正しく生成されないおそれがあった。
また、１文字毎の読み及び対応する表記の有無の情報を登録した文字情報記憶手段に、入力された文字列の正式住所データの読みに対応する表記がない場合の派生ルールを参照して異表記住所を派生する技術が提案されている（例えば、特許文献２を参照）。 However, there are multiple ways of reading kanji, and the way of reading kanji included in the kanji kana notation of the word differs depending on the word. In some cases, the reading of kanji included in the kanji kana notation of the word to be registered may be different from the reading when the kanji is used alone. Therefore, with the above technique, there is a possibility that the phonetic character string of the word to be registered is not correctly generated.
Also, refer to the derivation rule when there is no notation corresponding to the reading of the official address data of the input character string in the character information storage means in which the information on the reading for each character and the corresponding notation information is registered. A technique for deriving a written address has been proposed (see, for example, Patent Document 2).

特開平７−２４４４９１号公報JP 7-244491 A 特開２００６−４０６９号公報JP 2006-4069 A

しかしながら、特許文献２に開示された技術は、音声合成用の表音文字列を生成するものではないので、単語の読み自体は入力された読みそのものとなる。そのため、単語の読み仮名と適切な表音文字列が異なる場合には、適切な表音文字列が単語辞書に登録されない。 However, since the technique disclosed in Patent Document 2 does not generate a phonetic character string for speech synthesis, the word reading itself is the input reading itself. For this reason, when a word reading kana is different from an appropriate phonetic character string, the appropriate phonetic character string is not registered in the word dictionary.

そこで本明細書は、一つの側面として、音声合成装置で利用される単語辞書に単語の適切な表音文字列を登録できる単語辞書登録用コンピュータプログラムを提供することを目的とする。 Accordingly, an object of one aspect of the present specification is to provide a word dictionary registration computer program capable of registering an appropriate phonetic character string of a word in a word dictionary used in a speech synthesizer.

一つの実施形態によれば、テキストデータから合成音声信号を生成するために利用される、単語の漢字仮名表記とその単語の発音を表す表音文字列とが登録される単語辞書に単語を登録するための単語辞書登録用コンピュータプログラムが提供される。このコンピュータプログラムは、単語辞書に登録しようとする単語の読み仮名のうち、その単語の漢字仮名表記に含まれる何れの漢字の単漢字辞書に登録された読み仮名とも一致しない仮名に対応する漢字を、その仮名の前または後の仮名に対応する漢字に基づいて特定することで、単語の読み仮名に含まれる全ての仮名をその単語の漢字仮名表記に含まれる漢字の何れかに対応付ける単漢字不一致処理を実行し、登録しようとする単語の読み仮名と漢字の対応関係に基づいてその単語の読み仮名に含まれる、長音化可能な母音の並びが同じ漢字に対応と判断される場合に、その母音の並びを長音に変換することで、登録しようとする単語の表音文字列を生成し、登録しようとする単語の表音文字列をその単語の漢字仮名表記とともに単語辞書に登録することをコンピュータに実行させる命令を有する。 According to one embodiment, a word is registered in a word dictionary used to generate a synthesized speech signal from text data, in which a kanji kana notation of the word and a phonetic character string representing the pronunciation of the word are registered. A computer program for registering a word dictionary is provided. This computer program reads kanji corresponding to kana that does not match the kana registered in the single kanji dictionary of any kanji included in the kanji notation of the word among the kana of the word to be registered in the word dictionary. A single kanji mismatch that associates all kana characters in the reading kana of a word with one of the kanji characters in the kana notation of the word by specifying based on the kanji corresponding to the kana before or after the kana If it is determined that the sequence of vowels that can be made longer and included in the reading kana of the word based on the correspondence between the reading kana and kanji of the word to be registered corresponds to the same kanji By converting the sequence of vowels to a long sound, a phonetic character string of the word to be registered is generated, and the phonetic character string of the word to be registered is added to the word dictionary along with the kanji kana notation of the word. Having instructions thereon to perform the registering in the computer.

本発明の目的及び利点は、請求項において特に指摘されたエレメント及び組み合わせにより実現され、かつ達成される。
上記の一般的な記述及び下記の詳細な記述の何れも、例示的かつ説明的なものであり、請求項のように、本発明を限定するものではないことを理解されたい。 The objects and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims.
It should be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention as claimed.

本明細書に開示された単語辞書登録用コンピュータプログラムは、音声合成装置で利用される単語辞書に単語の適切な表音文字列を登録できる。 The word dictionary registration computer program disclosed in this specification can register an appropriate phonetic character string of a word in a word dictionary used in a speech synthesizer.

単語の漢字仮名表記及び読み仮名と、単語辞書に登録される表音文字列の関係の一例を示す図である。It is a figure which shows an example of the relationship between the kanji kana notation of a word, and a reading kana, and the phonetic character string registered into a word dictionary. 一つの実施形態による音声合成装置の概略構成図である。It is a schematic block diagram of the speech synthesizer by one Embodiment. 一つの実施形態による音声合成装置が有する処理部の機能ブロック図である。It is a functional block diagram of the process part which the speech synthesizer by one Embodiment has. 単漢字辞書の一例を示す図である。It is a figure which shows an example of a single Chinese character dictionary. 動的計画法を用いた単語の読み仮名と各漢字の読みとの対応付けの一例を示す図である。It is a figure which shows an example of matching with the reading kana of the word using dynamic programming, and the reading of each kanji. 動的計画法を用いた単語の読み仮名と各漢字の読みとの対応付けの他の一例を示す図である。It is a figure which shows another example of matching with the reading kana of the word using dynamic programming, and the reading of each kanji. 長音変換処理の動作フローチャートである。It is an operation | movement flowchart of a long sound conversion process. 単語登録処理の動作フローチャートである。It is an operation | movement flowchart of a word registration process.

以下、図を参照しつつ、一つの実施形態による音声合成装置について説明する。
最初に、図１を参照しつつ、単語の漢字仮名表記及び読み仮名と、単語辞書に登録される表音文字列の関係を説明する。図１において、表１００は、単語を漢字と仮名で表記した漢字仮名表記と読み仮名の関係を表しており、表１１０は、表１００に示された各単語の漢字仮名表記と単語の発音を表す表音文字列の関係を表している。なお、読み仮名と表音文字列との区別を分かり易くするために、本明細書では、読み仮名を平仮名で表し、表音文字列を片仮名で表す。また、以下では、漢字仮名表記を、単に表記と呼ぶ。 Hereinafter, a speech synthesizer according to an embodiment will be described with reference to the drawings.
First, with reference to FIG. 1, the relationship between the kanji kana notation and the reading kana of a word and the phonetic character string registered in the word dictionary will be described. In FIG. 1, a table 100 represents a relationship between kanji kana notation in which a word is written in kanji and kana and reading kana, and table 110 shows kanji kana notation of each word and pronunciation of the word shown in table 100. The relationship of the phonetic character string to represent is represented. In this specification, in order to make the distinction between the reading kana and the phonetic character string easy to understand, in this specification, the reading kana is expressed by hiragana and the phonetic character string is expressed by katakana. In the following, kanji kana notation is simply referred to as notation.

単語辞書に登録される単語の表音文字列は、合成音声における実際のその単語の発音を表している。そのため、同じ単語に対しても、読み仮名と表音文字列が異なることがある。特に、表音文字列では、特定の母音の並びが長音に変換される。例えば、図１では、単語「豆腐」の読み仮名が「とうふ」であるのに対し、表音文字列では、母音の並び「おう」が長音に変換されて「トーフ」となる。また、単語「映画」の読み仮名が「えいが」であるのに対し、表音文字列では、母音の並び「えい」が長音に変換されて「エーガ」となる。一般に、長音化が可能な母音の並びは、「ああ」、「いい」、「うう」、「ええ」、「おお」、「おう」及び「えい」の７種類である。 The phonetic character string of the word registered in the word dictionary represents the actual pronunciation of the word in the synthesized speech. Therefore, even for the same word, the reading kana and the phonetic character string may be different. In particular, in the phonetic character string, a specific vowel sequence is converted into a long sound. For example, in FIG. 1, the reading kana of the word “tofu” is “tofu”, whereas in the phonogram string, the vowel sequence “ou” is converted into a long sound and becomes “torf”. In addition, while the reading kana of the word “movie” is “Eiga”, in the phonetic character string, the sequence of “vowels” of the vowels is converted into long sounds and becomes “Aega”. In general, there are seven types of vowels that can be made longer: "Oh", "Good", "Uu", "Ee", "Oo", "Oo", and "Ei".

しかし、単語中にこれらの母音の並びが含まれていても、その並びを長音化すると不自然な発音となるために長音化されないことがある。例えば、図１の例では、単語「瀬戸内」には、長音化が可能な母音の並び「おう」が含まれているものの、その並びは長音化されず、表音文字列も読み仮名と同じ「セトウチ」となる。同様に、単語「要石」にも、長音化が可能な母音の並び「えい」が含まれているが、その並びは長音化されず、表音文字列も読み仮名と同じ「カナメイシ」となる。
長音化可能な母音の並びが実際に長音化されるか否かは、その母音の並びが一つの漢字の読みに含まれるか、あるいは、複数の漢字の読みに跨っているかに依存する。すなわち、上記の例の単語「瀬戸内」のように、長音化可能な母音の並びが、漢字「戸（と）」と漢字「内（うち）」に跨っている場合には、その母音の並びは長音化されない。一方、単語「豆腐」のように、長音化可能な母音の並びが、一つの漢字「豆（とう）」に対応する場合には、その母音の並びは長音化される。
したがって、単語を単語辞書に登録する際、長音化可能な母音の並びを実際に長音化するか否かを判定するためには、読み仮名に含まれる各モーラが、どの漢字に対応するかを調べることが重要である。 However, even if a sequence of these vowels is included in a word, if the sequence is lengthened, it may not be lengthened because of unnatural pronunciation. For example, in the example of FIG. 1, the word “Setouchi” includes a sequence of vowels “O” that can be made longer, but the sequence is not made longer and the phonetic character string is the same as the reading kana. It becomes "Setouchi". Similarly, the word “keystone” also includes a sequence of vowels that can be made longer, “Ei”, but that sequence is not made longer, and the phonetic string is “Kanameishi”, which is the same as the reading kana. Become.
Whether or not a sequence of vowels that can be made longer is actually made longer depends on whether the sequence of vowels is included in one kanji reading or straddles a plurality of kanji readings. That is, if the sequence of vowels that can be made longer, such as the word “Setouchi” in the above example, straddles the Chinese character “To” and the Chinese character “Uchi”, the vowel sequence Is not made longer. On the other hand, when a sequence of vowels that can be made longer like the word “tofu” corresponds to one kanji “tou”, the sequence of vowels is made longer.
Therefore, when registering a word in the word dictionary, in order to determine whether or not to actually lengthen a sequence of vowels that can be lengthened, it is necessary to determine which kanji each mora included in the reading kana corresponds to. It is important to investigate.

ここで、単語を登録するために、漢字ごとの表記と読み仮名の関係を表す単漢字辞書が利用される。例えば、単漢字辞書では、漢字「豆」について、その読み仮名として「まめ」と「とう」が対応付けられている。そのため、単語「豆腐」を単語辞書に登録する際、その単語の表記に含まれる漢字「豆」について単漢字辞書を参照すれば、単語「豆腐」の読み仮名「とうふ」のうちの「とう」が対応付けられているので、読み仮名「とう」は、漢字「豆」に対応することが分かる。このように、単漢字辞書を参照して、単語の表記に含まれる漢字ごとに、単語の読み仮名との対応を調べることで、各モーラがどの漢字の読みに含まれるかが分かる。 Here, in order to register words, a single kanji dictionary representing the relationship between the notation for each kanji and the reading kana is used. For example, in the single kanji dictionary, “Mame” and “Tou” are associated as kana for the kanji “bean”. Therefore, when registering the word “tofu” in the word dictionary, referring to the single kanji dictionary for the kanji “tofu” included in the word notation, “tou” of the reading kana “tofu” of the word “tofu” Are associated with each other, it can be seen that the reading kana “to” corresponds to the kanji “bean”. In this way, by referring to the single kanji dictionary and checking the correspondence with the reading kana for each kanji included in the word notation, it is possible to know which kanji reading each mora is included in.

しかし、単語によっては、漢字そのものの読みとは異なる読み方をする漢字が含まれたり、単語内のどの漢字にも含まれない読み仮名が含まれることがある。例えば、地名である単語「和歌浦」の読み仮名は「わかのうら」であり、その読み仮名には、単語「和歌浦」のどの漢字の読みでもない仮名「の」が含まれている。また、地名または船の種類などを表す単語「安宅」は、「あたけ」と読まれることがある。しかし、単語「安宅」に含まれる漢字「安」、「宅」そのものの読みをどのように組み合わせても、「あたけ」とはならない。 However, some words may contain kanji that is read differently from the reading of the kanji itself, or may contain kana characters that are not included in any kanji in the word. For example, the reading name of the word “Wakaura”, which is the place name, is “Wakaura”, and the reading name includes the kana “NO”, which is not the reading of any kanji in the word “Wakaura”. In addition, the word “Ataka” representing the place name or the type of ship may be read as “Atake”. However, no matter how you combine the readings of the kanji characters “an” and “home” contained in the word “Ataka”, it does not become “Atake”.

このように、単語の読み仮名が、その単語の表記に含まれる各漢字の読みの組み合わせで表現できない場合には、単漢字辞書を参照しても、その単語の各モーラがどの漢字の読みに含まれるのかを特定することは困難である。その結果として、そのような単語については、読み仮名から表音文字列に自動的に変換することは困難である。例えば、上記の例の単語「和歌浦」には、読み仮名の「のう」の部分に長音化可能な母音の並び「おう」が含まれる。しかし、仮名「の」がどの漢字の読みに含まれるのかが不明であるために、音声合成装置は、その母音の並びを誤って長音化することにより、表音文字列を「ワカノーラ」としてしまうおそれがある。 In this way, if a kana reading of a word cannot be expressed by a combination of readings of each kanji included in the notation of the word, even if the single kanji dictionary is referred to, each mora of the word reads which kanji. It is difficult to specify whether it is included. As a result, it is difficult to automatically convert such a word from a reading kana to a phonogram string. For example, the word “Wakaura” in the above example includes “o”, a sequence of vowels that can be made longer, in the “no” part of the reading kana. However, since it is unclear which kanji is included in the reading of the kana character "no", the speech synthesizer erroneously lengthens the vowel sequence to make the phonetic character string "wakanora" There is a fear.

そこで、本実施形態による音声合成装置は、単語を単語辞書に登録する際、その単語の読み仮名に含まれる仮名のうち、その単語の表記に含まれる何れの漢字の読み仮名とも一致しない仮名に対応する漢字を、その前後の仮名に対応する漢字から特定する。そしてこの音声合成装置は、読み仮名と漢字の対応関係に基づいて、長音化可能な母音の並びを実際に長音化するか否か判定する。 Therefore, when registering a word in the word dictionary, the speech synthesizer according to the present embodiment selects a kana that does not match any kanji reading kana included in the word notation among kana included in the reading kana of the word. The corresponding kanji is identified from the kanji corresponding to the preceding and succeeding kana. The speech synthesizer then determines whether or not the vowel sequence that can be made longer is actually made longer based on the correspondence between the reading kana and the kanji.

図２は、一つの実施形態による音声合成装置の概略構成図である。本実施形態では、音声合成装置１は、入力部２と、記憶部３と、処理部４と、出力部５とを有する。 FIG. 2 is a schematic configuration diagram of a speech synthesizer according to one embodiment. In the present embodiment, the speech synthesizer 1 includes an input unit 2, a storage unit 3, a processing unit 4, and an output unit 5.

入力部２は、合成音声の原文であり、漢字仮名交じり文であるテキストデータを取得する。また入力部２は、単語辞書に登録しようとする単語の表記及び読み仮名を取得する。そのために、入力部２は、例えば、キーボードを有する。また、入力部２は、マウスなどのポインティングデバイスとそのポインティングデバイスにより指示される入力すべき文字などを表示するディスプレイとを有する。あるいは、入力部２は、タッチパネルディスプレイを有してもよい。
さらにまた、入力部２は、テキストデータ、または単語の表記及び読み仮名を通信ネットワークを介して音声合成装置１と接続された他の機器から取得してもよい。この場合、入力部２は、音声合成装置１を通信ネットワークに接続するためのインターフェース回路を有する。
そして入力部２は、入力されたテキストデータ、または単語の表記及び読み仮名を処理部４へ渡す。 The input unit 2 obtains text data that is an original text of synthesized speech and is a kanji-kana mixed text. Further, the input unit 2 acquires the notation and reading kana of the word to be registered in the word dictionary. For this purpose, the input unit 2 includes, for example, a keyboard. The input unit 2 includes a pointing device such as a mouse and a display that displays characters to be input instructed by the pointing device. Alternatively, the input unit 2 may have a touch panel display.
Furthermore, the input unit 2 may acquire text data or word notation and reading kana from another device connected to the speech synthesizer 1 via a communication network. In this case, the input unit 2 includes an interface circuit for connecting the speech synthesizer 1 to a communication network.
The input unit 2 then passes the input text data or word notation and reading kana to the processing unit 4.

記憶部３は、例えば、半導体メモリ回路、磁気記憶装置または光記憶装置のうちの少なくとも一つを有する。そして記憶部３は、処理部４で用いられる各種コンピュータプログラム、音声合成処理または単語登録処理に用いられる各種のデータを記憶する。
記憶部３は、音声合成処理に用いられるデータとして、例えば、韻律モデルと、音声波形辞書を記憶する。さらに記憶部３は、言語処理に用いられるデータとして、テキストデータ中に出現すると想定される様々な単語について、その単語の表記及び表音文字列が登録された単語辞書を記憶する。なお、単語辞書には、登録された各単語の品詞情報及び活用形などがさらに登録されてもよい。さらに、記憶部３は、単語を単語辞書に登録する際に参照される、漢字ごとの表記と読み仮名との対応関係を表す単漢字辞書を記憶する。 The storage unit 3 includes, for example, at least one of a semiconductor memory circuit, a magnetic storage device, and an optical storage device. And the memory | storage part 3 memorize | stores the various data used for the various computer programs used by the process part 4, a speech synthesis process, or a word registration process.
The storage unit 3 stores, for example, a prosodic model and a speech waveform dictionary as data used for speech synthesis processing. Furthermore, the memory | storage part 3 memorize | stores the word dictionary where the notation of the word and the phonetic character string were registered about various words assumed to appear in text data as data used for language processing. Note that the word dictionary may further register part-of-speech information, a utilization form, and the like of each registered word. Furthermore, the memory | storage part 3 memorize | stores the single kanji dictionary showing the correspondence of the notation for every kanji and the kana which is referred when registering a word to a word dictionary.

出力部５は、処理部４から受け取った合成音声信号をスピーカ６へ出力する。そのために、出力部５は、例えば、スピーカ６を音声合成装置１と接続するためのオーディオインターフェース回路を有する。
また出力部５は、合成音声信号を、通信ネットワークを介して音声合成装置１と接続された他の装置へ出力してもよい。この場合、出力部５は、その通信ネットワークに音声合成装置１と接続するためのインターフェース回路を有する。なお、入力部２も通信ネットワークを介してテキストデータを取得する場合、入力部２と出力部５は一体化されていてもよい。 The output unit 5 outputs the synthesized voice signal received from the processing unit 4 to the speaker 6. For this purpose, the output unit 5 includes, for example, an audio interface circuit for connecting the speaker 6 to the speech synthesizer 1.
The output unit 5 may output the synthesized speech signal to another device connected to the speech synthesizer 1 via the communication network. In this case, the output unit 5 includes an interface circuit for connecting to the speech synthesizer 1 to the communication network. In addition, when the input part 2 also acquires text data via a communication network, the input part 2 and the output part 5 may be integrated.

処理部４は、一つまたは複数のプロセッサと、メモリ回路と、周辺回路とを有する。そして処理部４は、入力されたテキストデータに基づいて、合成音声信号を作成する。
図３は、処理部４の機能ブロック図である。処理部４は、言語処理部１０と、音声合成部１１と、辞書登録部１２とを有する。
処理部４が有するこれらの各部は、例えば、処理部４が有するプロセッサ上で動作するコンピュータプログラムにより実現される機能モジュールである。あるいは、処理部４が有するこれらの各部は、その各部の機能を実現する一つの集積回路として音声合成装置１に実装されてもよい。 The processing unit 4 includes one or a plurality of processors, a memory circuit, and a peripheral circuit. Then, the processing unit 4 creates a synthesized speech signal based on the input text data.
FIG. 3 is a functional block diagram of the processing unit 4. The processing unit 4 includes a language processing unit 10, a speech synthesis unit 11, and a dictionary registration unit 12.
Each of these units included in the processing unit 4 is, for example, a functional module realized by a computer program that operates on a processor included in the processing unit 4. Or these each part which the process part 4 has may be mounted in the speech synthesizer 1 as one integrated circuit which implement | achieves the function of each part.

言語処理部１０は、入力された、漢字仮名交じり文であるテキストデータに対応する表音文字列を生成し、さらにその表音文字列に基づいて中間表記を生成する。ここで、中間表記とは、表音文字列に、韻律を表す韻律記号が追加されたものである。韻律記号には、例えば、「アクセント位置」、「アクセント強弱」、「音程高低」、「抑揚大小」、「話速緩急」、「音量大小」及び「区切り」を表現する記号が含まれる。したがって、中間表記から韻律記号を除いたものは、表音文字列と一致する。 The language processing unit 10 generates a phonetic character string corresponding to the input text data that is a kanji kana mixed sentence, and further generates an intermediate notation based on the phonetic character string. Here, the intermediate notation is obtained by adding a prosodic symbol representing a prosody to a phonetic character string. The prosodic symbols include, for example, symbols that represent “accent position”, “accent strength”, “pitch pitch”, “inflection magnitude”, “speech speed”, “volume level”, and “separation”. Therefore, the intermediate notation excluding the prosodic symbols matches the phonetic character string.

言語処理部１０は、入力されたテキストデータから中間表記を生成するために、記憶部３に記憶されている単語辞書を読み込む。そして言語処理部１０は、例えば、その単語辞書を用いて、テキストデータに対して形態素解析及び係り受け解析を行って、テキストデータ中に出現する各単語の順序及び読み、アクセントの位置及び区切りの位置を決定する。 The language processing unit 10 reads a word dictionary stored in the storage unit 3 in order to generate an intermediate notation from the input text data. The language processing unit 10 performs, for example, the morphological analysis and dependency analysis on the text data using the word dictionary, and the order and reading of each word appearing in the text data, the position of the accent and the delimiter Determine the position.

言語処理部１０は、形態素解析として、例えば、動的計画法を用いる方法を利用できる。また言語処理部１０は、係り受け解析として、例えば、先読みＬＲパーザまたはＬＬ法といった構文解析の手法を利用できる。そして言語処理部１０は、各単語の順序、読み、アクセントの位置及び区切りの位置に応じて中間表記を作成する。
言語処理部１０は、生成した中間表記を記憶部３に一時的に記憶する。 The language processing unit 10 can use, for example, a method using dynamic programming as the morphological analysis. The language processing unit 10 can use a syntax analysis technique such as a prefetch LR parser or an LL method, for example, as dependency analysis. Then, the language processing unit 10 creates an intermediate notation according to the order of each word, reading, accent position, and break position.
The language processing unit 10 temporarily stores the generated intermediate notation in the storage unit 3.

音声合成部１１は、入力されたテキストデータの中間表記に基づいて合成音声信号を作成する。 The speech synthesizer 11 creates a synthesized speech signal based on the intermediate notation of the input text data.

音声合成部１１は、中間表記に基づいて、合成音声信号を生成する際の目標韻律を生成する。そのために、音声合成部１１は、記憶部３から複数の韻律モデルを読み込む。この韻律モデルは、声を高くする位置及び声を低くする位置などを時間順に表したものである。そして音声合成部１１は、複数の韻律モデルのうち、中間表記に示されたアクセントの位置などに最も一致する韻律モデルを選択する。そして音声合成部１１は、選択した韻律モデル及び合成パラメータに従って、中間表記に対して声が高くなる位置あるいは声が低くなる位置、声の抑揚、ピッチなどを設定することにより、目標韻律を作成する。目標韻律は、音声波形を決定する単位となる音素ごとに、音素の長さ及びピッチ周波数を含む。なお、音素は、例えば、一つの母音あるいは一つの子音とすることができる。 The speech synthesizer 11 generates a target prosody for generating a synthesized speech signal based on the intermediate notation. For this purpose, the speech synthesis unit 11 reads a plurality of prosodic models from the storage unit 3. This prosodic model represents a position in which the voice is raised and a position in which the voice is lowered in time order. Then, the speech synthesizer 11 selects a prosodic model that most closely matches the position of the accent indicated by the intermediate notation among the plural prosodic models. Then, the speech synthesizer 11 creates a target prosody by setting a position where the voice becomes high or low, a position where the voice becomes low, a voice inflection, a pitch, and the like according to the selected prosodic model and synthesis parameters. . The target prosody includes a phoneme length and a pitch frequency for each phoneme as a unit for determining a speech waveform. Note that the phoneme can be, for example, one vowel or one consonant.

音声合成部１１は、生成した目標韻律に従って、例えば、音素接続方式またはコーパスベース方式によって合成音声信号を作成する。
例えば、音声合成部１１は、音素ごとに、目標韻律の音素長及びピッチ周波数に最も近い音声波形を、例えばパターンマッチングにより音声波形辞書に登録されている複数の音声波形の中から選択する。そのために、音声合成部１１は、記憶部３から音声波形辞書を読み込む。音声波形辞書は、複数の音声波形及び各音声波形の識別番号を記録する。また音声波形は、例えば、一人以上のナレータが様々なテキストを読み上げた様々な音声を録音した音声信号から、音素単位で取り出された波形信号である。
さらに、音声合成部１１は、音素ごとに選択された音声波形を目標韻律に沿って接続できるようにするため、それら選択された音声波形と目標韻律に示された対応する音素の波形パターンとのずれ量を、波形変換情報として算出してもよい。
音声合成部１１は、音素ごとに選択された音声波形の識別番号を含む波形生成情報を作成する。波形生成情報は、波形変換情報をさらに含んでもよい。 The speech synthesizer 11 creates a synthesized speech signal by, for example, a phoneme connection method or a corpus-based method according to the generated target prosody.
For example, for each phoneme, the speech synthesizer 11 selects a speech waveform closest to the phoneme length and pitch frequency of the target prosody from a plurality of speech waveforms registered in the speech waveform dictionary by pattern matching, for example. For this purpose, the speech synthesis unit 11 reads a speech waveform dictionary from the storage unit 3. The speech waveform dictionary records a plurality of speech waveforms and an identification number of each speech waveform. The voice waveform is, for example, a waveform signal extracted in units of phonemes from voice signals obtained by recording various voices in which one or more narrators read various texts.
Furthermore, the speech synthesizer 11 connects the selected speech waveform and the waveform pattern of the corresponding phoneme indicated in the target prosody so that the speech waveform selected for each phoneme can be connected along the target prosody. The deviation amount may be calculated as waveform conversion information.
The speech synthesizer 11 creates waveform generation information including the identification number of the speech waveform selected for each phoneme. The waveform generation information may further include waveform conversion information.

音声合成部１１は、波形生成情報に含まれる各音素の音声波形の識別番号に対応する音声波形信号を記憶部３から読み込む。そして音声合成部１１は、各音声波形信号を連続的に接続することにより、合成音声信号を作成する。なお、波形生成情報に波形変換情報が含まれている場合、音声合成部１１は、各音声波形信号を、対応する音素について求められた波形変換情報に従って補正して音声波形信号を連続的に接続することにより、合成音声信号を作成する。
音声合成部１１は、合成音声信号を出力部５へ出力する。 The speech synthesizer 11 reads a speech waveform signal corresponding to the speech waveform identification number of each phoneme included in the waveform generation information from the storage unit 3. Then, the speech synthesizer 11 creates a synthesized speech signal by connecting each speech waveform signal continuously. When the waveform conversion information is included in the waveform generation information, the speech synthesizer 11 continuously connects the speech waveform signals by correcting each speech waveform signal according to the waveform conversion information obtained for the corresponding phoneme. By doing so, a synthesized speech signal is created.
The voice synthesizer 11 outputs the synthesized voice signal to the output unit 5.

辞書登録部１２は、例えば、入力部２を介して入力された登録対象の単語の表記と読み仮名に基づいて、その単語の読みに含まれる長音化可能な母音の並びを長音化するか否かを決定した上でその単語の表音文字列を生成する。そして辞書登録部１２は、その単語の表記を表音文字列とともに単語辞書に登録する。そのために、辞書登録部１２は、単漢字分割判定部２１と、単漢字不一致処理部２２と、長音変換部２３と、登録部２４とを有する。 For example, the dictionary registration unit 12 determines whether or not to lengthen the arrangement of vowels that can be made longer in the reading of the word based on the notation and the reading kana of the word to be registered input via the input unit 2. Is determined, and a phonetic character string of the word is generated. The dictionary registration unit 12 registers the notation of the word together with the phonetic character string in the word dictionary. For this purpose, the dictionary registration unit 12 includes a single Chinese character division determination unit 21, a single Chinese character mismatch processing unit 22, a long sound conversion unit 23, and a registration unit 24.

単漢字分割判定部２１は、登録対象の単語の読み仮名が、その単語の表記に含まれる漢字の単漢字辞書に登録された読み仮名の組み合わせとなっているか否かを判定する。そのために、単漢字分割判定部２１は、登録対象の単語の表記に含まれる先頭の漢字から順に、単漢字辞書に登録されたその漢字の読み仮名とその単語の読み仮名とを比較する。そして単漢字分割判定部２１は、その単語の読み仮名に含まれるそれぞれの仮名のうち、漢字の読み仮名と一致するものを、その漢字に対応付ける。単語の読み仮名に含まれる全ての仮名が、その単語の表記に含まれる漢字の何れかに対応付けられた場合、登録対象の単語の読み仮名は、その単語の表記に含まれる漢字の単漢字辞書に登録された読み仮名の組み合わせとなっている。この場合、単語の読みに含まれる各モーラがどの漢字に対応するかも明らかなので、長音変換部２３は、長音化可能な母音の並びを実際に長音化できるか否かの判定も直ちに行える。そこで、単漢字分割判定部２１は、単語の読み仮名に含まれる各仮名と漢字の対応関係を長音変換部２３へ通知する。 The single kanji division determination unit 21 determines whether the reading kana of the word to be registered is a combination of the reading kana registered in the single kanji dictionary of the kanji included in the notation of the word. For this purpose, the single-kanji division determination unit 21 compares the reading kana of the kanji registered in the single-kanji dictionary with the reading kana of the word in order from the first kanji included in the notation of the word to be registered. Then, the single kanji division determination unit 21 associates each kana included in the reading kana of the word with the kanji corresponding to the kana reading kana. If all kana included in the reading of a word is associated with one of the kanji included in the word notation, the reading kana of the word to be registered is the single kanji of the kanji included in the word notation It is a combination of reading kana registered in the dictionary. In this case, since it is clear which kanji each mora included in the word reading corresponds to, the long sound conversion unit 23 can immediately determine whether or not the arrangement of vowels that can be made long can actually be made long. Therefore, the single kanji division determination unit 21 notifies the long sound conversion unit 23 of the correspondence between each kana and kanji included in the reading kana of the word.

一方、単語の読み仮名に含まれる仮名の何れか一つでも、その単語の表記に含まれる漢字の何れにも対応付けられない場合、その単語の読み仮名では、その単語の表記に含まれる漢字の単漢字辞書に登録された読み仮名とは異なる読み仮名が使用されている。この場合、長音変換部２３は、そのままでは長音化可能な母音の並びを実際に長音化できるか否か判定できない。そこで単漢字分割判定部２１は、登録対象の単語の表記と読み仮名を単漢字不一致処理部２２へ通知する。 On the other hand, if any one of the kana included in the reading of the word is not associated with any of the kanji included in the notation of the word, the kanji included in the notation of the word in the reading of the word A reading kana different from the reading kana registered in the single kanji dictionary is used. In this case, the long sound conversion unit 23 cannot determine whether or not the arrangement of vowels that can be lengthened as they are can be actually lengthened. Therefore, the single-kanji division determination unit 21 notifies the single-kanji mismatch processing unit 22 of the notation of the registration target word and the reading kana.

図４は、単漢字辞書の一例を示す図である。単漢字辞書４００には、漢字ごとに、その漢字の１種類以上の一般的な読みが登録されている。例えば、漢字「安」については、その読みとして、「あん」と「やす」が登録されている。また、漢字「石」については、その読みとして、「いし」と「せき」が登録されている。 FIG. 4 is a diagram illustrating an example of a single kanji dictionary. In the single kanji dictionary 400, for each kanji, one or more general readings of the kanji are registered. For example, for the Chinese character “An”, “An” and “Yasu” are registered as readings. As for the kanji “stone”, “Ishi” and “seki” are registered as readings.

例えば、登録対象の単語として「瀬戸内」が入力された場合、単漢字分割判定部２１は、単漢字辞書４００を参照することにより、先頭の漢字「瀬」の読み仮名「せ」と単語の読み仮名「せとうち」の先頭の仮名「せ」が一致することが分かる。そのため、単漢字分割判定部２１は、単語の読み仮名のうちの先頭の仮名「せ」を、その単語の先頭の漢字「瀬」に対応付ける。
次に、単漢字分割判定部２１は、単語「瀬戸内」のうちの次の漢字「戸」について、単漢字辞書４００を参照する。すると、単漢字分割判定部２１は、その漢字の読み「と」と「こ」のうちの「と」が、その単語の読み仮名「せとうち」のうち、漢字との対応付けが済んでいない仮名の先頭の「と」と一致することが分かる。そのため、単漢字分割判定部２１は、単語の読み仮名のうちの２番目の仮名「と」を、その単語の２番目の漢字「戸」に対応付ける。同様に、単漢字分割判定部２１は、単語「瀬戸内」の読み仮名のうちの仮名「うち」を、その単語の最後の漢字「内」に対応付ける。このように、単語「瀬戸内」については、その読み仮名に含まれる全ての仮名がその単語の表記に含まれる何れかの漢字に対応付けられる。そのため、単漢字分割判定部２１は、単語の読み仮名「せとうち」に含まれる各仮名と漢字「瀬」「戸」「内」の対応関係を長音変換部２３へ通知する。 For example, when “Setouchi” is input as a word to be registered, the single-kanji division determination unit 21 refers to the single-kanji dictionary 400, thereby reading the first kanji “Set” reading kana “se” and the word reading. It can be seen that the first kana “se” of the kana “Setouchi” matches. Therefore, the single kanji division determination unit 21 associates the first kana “se” in the reading kana of the word with the first kanji “se” of the word.
Next, the single Chinese character division determination unit 21 refers to the single Chinese character dictionary 400 for the next Chinese character “door” in the word “Setouchi”. Then, the single kanji division determination unit 21 determines that the kanji readings “to” and “ko” “to” are not associated with kanji in the word reading kana “setouchi”. It can be seen that it matches with “to” at the beginning of. Therefore, the single kanji division determination unit 21 associates the second kana “to” of the word reading kana with the second kanji “door” of the word. Similarly, the single kanji division determination unit 21 associates the kana “Uchi” in the reading “Kana” of the word “Setouchi” with the last Kanji “in” of the word. Thus, for the word “Setouchi”, all kana characters included in the reading kana are associated with any kanji included in the notation of the word. For this reason, the single kanji division determination unit 21 notifies the long sound conversion unit 23 of the correspondence between each kana included in the word reading kana “Setouchi” and the kanji characters “se”, “door”, “in”.

一方、登録対象の単語として「和歌浦」が入力されたとする。この場合、単漢字分割判定部２１は、単漢字辞書４００を参照することにより、先頭及び２番目の漢字「和」「歌」の読み仮名「わ」「か」と単語の読み仮名「わかのうら」の仮名「わ」「か」がそれぞれ一致することが分かる。そのため、単漢字分割判定部２１は、単語の読み仮名のうちの仮名「わ」「か」を、その単語の表記に含まれる漢字「和」「歌」にそれぞれ対応付ける。しかし、その単語の読み仮名のうち、対応付けの済んでいない先頭の仮名「の」は、その単語の表記に含まれる対応付けの済んでいない漢字「浦」の読み仮名と一致しない。そのため、単漢字分割判定部２１は、単漢字辞書に登録されている漢字の読み仮名だけでは、単語「和歌浦」に含まれる読み仮名全てを、その単語の表記に含まれる漢字に対応付けられない。そこでこの例では、単漢字分割判定部２１は、単語の表記「和歌浦」と読み仮名「わかのうら」を単漢字不一致処理部２２へ通知する。 On the other hand, it is assumed that “Wakaura” is input as a word to be registered. In this case, the single-kanji division determination unit 21 refers to the single-kanji dictionary 400 to read the first and second kanji characters “Japanese” and “song” reading kana “wa” “ka” and the word reading kana “wakana”. It can be seen that kana "wa" and "ka" of "Ura" match each other. Therefore, the single kanji division determination unit 21 associates the kana “wa” and “ka” in the reading of the word with the kanji “sum” and “song” included in the word notation. However, of the reading kana of the word, the first kana “no” that is not associated does not match the kana “ura” kana that is not associated in the word notation. For this reason, the single kanji division determination unit 21 cannot associate all kana characters included in the word “Wakaura” with kanji characters included in the notation of the word only by using kanji reading kana registered in the single kanji dictionary. . Therefore, in this example, the single kanji division determination unit 21 notifies the single kanji mismatch processing unit 22 of the word notation “Wakaura” and the reading kana “Wakaura”.

また、表記が「安宅一丁目」で読み仮名が「あたけいっちょうめ」である単語が登録対象として入力されたとする。この場合、単語の表記に含まれる漢字「安」「宅」の読み仮名と、単語の読み仮名に含まれる「あたけ」が一致しない。そのため、単漢字分割判定部２１は、単漢字辞書に登録されている漢字の読み仮名だけでは、単語「安宅一丁目」に含まれる読み仮名全てを、その単語の表記に含まれる漢字に対応付けられない。そこでこの例でも、単漢字分割判定部２１は、単語の表記「安宅一丁目」と読み仮名「あたけいっちょうめ」を単漢字不一致処理部２２へ通知する。 Further, it is assumed that a word whose notation is “Ataka 1-chome” and whose reading is “Atake Ichichome” is input as a registration target. In this case, the readings of the kanji characters “A” and “Kaku” included in the word notation do not match “Atake” included in the reading of the word. Therefore, the single kanji division determination unit 21 associates all kana characters included in the word “Ataka 1-chome” with kanji characters included in the word notation only by the kana reading kana registered in the single kanji dictionary. I can't. Therefore, also in this example, the single kanji division determination unit 21 notifies the single kanji mismatch processing unit 22 of the word notation “Ataka 1-chome” and the reading kana “Atake Ichichome”.

単漢字不一致処理部２２は、登録対象の単語の読み仮名のうち、単漢字辞書に登録されている漢字の読み仮名と一致しない仮名に対応する漢字を、その前または後の仮名に対応する、その単語の表記に含まれる漢字に基づいて特定する。そのために、本実施形態では、単漢字不一致処理部２２は、動的計画法を利用する。 The single kanji mismatch processing unit 22 corresponds to the kana corresponding to the kana before or after the kana corresponding to the kana that does not match the kana of the kanji registered in the single kanji dictionary among the reading kana of the word to be registered. It is specified based on the kanji included in the word notation. Therefore, in this embodiment, the single kanji mismatch processing unit 22 uses dynamic programming.

図５は、動的計画法を用いた単語の読み仮名と各漢字の読みとの対応付けの一例を示す図である。この例では、登録対象の単語「安宅一丁目」の読み仮名「あたけいっちょうめ」が、横方向に、左から順に並べられる。また、その単語の表記に含まれる各漢字の読み仮名が、縦方向に、かつ、先頭の漢字が一番下に位置するように配列される。そして縦方向、横方向とも、読み仮名をモーラごとに区切ることで、単語の読み仮名のモーラと、単語の表記に含まれる漢字の読み仮名のモーラの組み合わせごとにマス目が形成される。そしてそれぞれのマス目には、そのマス目の縦方向のモーラと横方向のモーラが一致する場合、'0'が書き込まれ、一方、一致しない場合には、ペナルティー値として'1'が書き込まれる。 FIG. 5 is a diagram showing an example of correspondence between word reading kana and dynamic kanji reading using dynamic programming. In this example, the reading pseudonyms “Atake Ichichome” of the registration target word “Ataka 1-chome” are arranged in order from the left in the horizontal direction. The kana readings of each kanji included in the notation of the word are arranged in the vertical direction so that the first kanji is positioned at the bottom. In both the vertical and horizontal directions, a kana is formed for each combination of a reading kana of a word and a mora of a kanji reading kana included in the word notation. In each cell, if the vertical mora and the horizontal mora of the cell match, '0' is written. On the other hand, if they do not match, '1' is written as a penalty value. .

単漢字不一致処理部２２は、左下端のマス目から、右上端のマス目までの経路ごとに、その経路が通る各マス目の値の合計を、その経路のペナルティー値として算出する。そして単漢字不一致処理部２２は、ペナルティー値が最小となる経路を特定する。すなわち、単漢字不一致処理部２２は、単語の読み仮名と、単語の表記に含まれる漢字の順序に従った各漢字の単漢字辞書に登録された読み仮名の配列の差が最小となるように、各漢字の単漢字辞書に登録された読み仮名に対応しない仮名の漢字への対応付けを行う。例えば、図５に示された例では、矢印で示された経路５００のペナルティー値が最小となる。
そして単漢字不一致処理部２２は、ペナルティー値が最小となる経路に従って、単語の読み仮名に含まれる各仮名を、その単語の表記に含まれる漢字に対応付ける。図５の例では、経路５００に従うと、単語の読み仮名「あたけいっちょうめ」のうち、仮名「いっ」、「ちょう」、「め」が、それぞれ、漢字「一」、「丁」、「目」に対応付けられる。そして漢字の読み仮名と一致しない部分である「あたけ」について、対応付けの済んだ漢字「一丁目」よりも前の漢字「安」「宅」のうち、先頭の仮名「あ」が漢字「安」に対応付けられ、仮名「たけ」が漢字「宅」に対応付けられる。 The single kanji mismatch processing unit 22 calculates, for each route from the lower left cell to the upper right cell, the sum of the values of each cell through which the route passes as a penalty value for the route. Then, the single kanji mismatch processing unit 22 identifies the route having the minimum penalty value. In other words, the single kanji mismatch processing unit 22 minimizes the difference between the reading kana of the word and the arrangement of the reading kana registered in the single kanji dictionary of each kanji according to the order of the kanji included in the word notation. The kana is associated with kanji that does not correspond to the reading kana registered in the single kanji dictionary of each kanji. For example, in the example shown in FIG. 5, the penalty value of the route 500 indicated by the arrow is the minimum.
Then, the single kanji mismatch processing unit 22 associates each kana included in the word reading kana with the kanji included in the word notation in accordance with the path having the minimum penalty value. In the example of FIG. 5, according to the route 500, the kana characters “I”, “Cho”, and “Me” are the kanji characters “Ichi”, “Ding”, “ ". And for “Atake”, which is the part that does not match the kanji reading kana, the first kana “a” of the kanji “an” and “home” before the associated kanji “1 chome” is the kanji “an” And the kana “take” is associated with the Chinese character “home”.

図６は、動的計画法を用いた単語の読み仮名と各漢字の読みとの対応付けの他の一例を示す図である。この例では、登録対象の単語「和歌浦」について、実線の矢印で示された経路６００のペナルティー値と点線の矢印で示された経路６０１のペナルティー値が等しく、かつ、最小となる。経路６００と経路６０１とでは、仮名「の」が、漢字「歌」に対応するか、漢字「浦」に対応するかで異なっている。このように、単語の表記に含まれる連続する二つの漢字のそれぞれの読み仮名の間に、それらの漢字の読み仮名とは異なる仮名が挿入されている場合、二つの経路のペナルティー値が最小となる。このような場合、母音の並びを長音化するか否かの観点では、挿入されている仮名を前の漢字と対応付けた方が、その単語の実際の発音と整合する。そこでこの実施形態では、単漢字不一致処理部２２は、二つの漢字の読み仮名の間に挿入された仮名を前側の仮名に対応する漢字に対応付ける。すなわち、単漢字不一致処理部２２は、ペナルティー値が等しい複数の経路のうち、最も下側の経路を選択する。したがって、図６に示された例では、単漢字不一致処理部２２は、経路６００を選択し、単語の読み仮名「わ」「かの」「うら」を、それぞれ、漢字「和」「歌」「浦」に対応付ける。 FIG. 6 is a diagram illustrating another example of correspondence between the reading of a word using dynamic programming and the reading of each kanji. In this example, for the word “Wakaura” to be registered, the penalty value of the path 600 indicated by the solid line arrow and the penalty value of the path 601 indicated by the dotted line arrow are equal and minimum. The path 600 and the path 601 differ depending on whether the kana “no” corresponds to the kanji “song” or the kanji “ura”. In this way, when a kana different from the kana readings of two consecutive kanji characters included in the word notation is inserted, the penalty value of the two paths is the minimum. Become. In such a case, in terms of whether or not to make the vowel sequence longer, matching the inserted kana with the previous kanji matches the actual pronunciation of the word. Therefore, in this embodiment, the single kanji mismatch processing unit 22 associates the kana inserted between the readings of two kanji with the kanji corresponding to the front kana. That is, the single kanji mismatch processing unit 22 selects the lowermost path among a plurality of paths having the same penalty value. Therefore, in the example shown in FIG. 6, the single kanji mismatch processing unit 22 selects the path 600, and reads the kana words “wa”, “kan”, “ura”, and the kanji characters “Japanese”, “song”, respectively. Correspond to "Ura".

なお、対象となる単語の表記に含まれる漢字に複数の読み仮名がある場合、単漢字不一致処理部２２は、読み仮名ごとに並列に、上記のような経路を探索すればよい。例えば、図５の例では、漢字「安」には「あん」という読み仮名の他に、「やす」という読み仮名もある。そこで単漢字不一致処理部２２は、「あん」と並列に、「やす」についても、単語の読み仮名「あたけいっちょうめ」と対応させたマス目を設け、それらのマス目を通る経路のペナルティー値も計算すればよい。
また、読み仮名の配列は、上記の例に限られない。例えば、単漢字不一致処理部２２は、単語の表記に含まれる各漢字の読み仮名を上から順に並べ、左上端のマス目から右下端のマス目までの経路のペナルティー値を算出してもよい。 In addition, when there are a plurality of reading kana characters in the kanji included in the notation of the target word, the single kanji mismatch processing unit 22 may search for the above path in parallel for each reading kana. For example, in the example of FIG. 5, the kanji “A” has a reading Kana “Yasu” in addition to a reading Kana “An”. Accordingly, the single kanji mismatch processing unit 22 provides a square corresponding to the word reading pseudonym “Atake Ichichome” for “Yasu” in parallel with “An”, and the penalty value of the route passing through those squares. Can also be calculated.
Further, the arrangement of reading characters is not limited to the above example. For example, the single kanji mismatch processing unit 22 may arrange the reading kana of each kanji included in the word notation in order from the top, and calculate the penalty value of the route from the upper left cell to the lower right cell. .

変形例によれば、単漢字不一致処理部２２は、動的計画法を用いる代わりに、単語の読み仮名のうち、登録対象の単語の先頭の漢字から順に漢字の読み仮名と一致する部分を検出する。さらに単漢字不一致処理部２２は、単語の最後尾の漢字からも順番に漢字の読み仮名と単語の読み仮名を比較して、単語の読み仮名のうちの登録対象の表記に含まれる漢字の読み仮名と一致する部分を検出する。これにより、単漢字不一致処理部２２は、単語の読み仮名のうち、どの漢字の読み仮名とも一致しない仮名を特定する。そして単漢字不一致処理部２２は、一致しない仮名を、その直前の仮名が対応付けられている漢字に対応付ける。 According to the modified example, instead of using dynamic programming, the single kanji mismatch processing unit 22 detects a part of the reading kana of the word that matches the kanji reading kana in order from the first kanji of the registered word. To do. Further, the single kanji mismatch processing unit 22 compares the kanji reading kana and the word reading kana in order from the last kanji of the word, and reads the kanji included in the registered notation of the word reading kana. The part that matches the kana is detected. Thus, the single kanji mismatch processing unit 22 identifies kana characters that do not match any kanji reading kana among the kana readings of the words. Then, the single kanji mismatch processing unit 22 associates the kana that does not match with the kanji associated with the kana immediately before.

例えば、上記の例のように、登録対象となる単語の表記が「和歌浦」で読み仮名が「わかのうら」であるとする。この場合、単語の先頭の漢字から順に、その漢字の読み仮名と単語の読み仮名を比較して一致する仮名を特定することで、漢字「和」「歌」と仮名「わ」「か」が対応付けられる。また、単語の最後尾の漢字から順に、その漢字の読み仮名と単語の読み仮名を比較して一致する仮名を特定することで、漢字「浦」と仮名「うら」が対応付けられる。そして最後に、単語の読み仮名のうち、仮名「の」だけがどの漢字にも対応付けられずに残る。そこで単漢字不一致処理部２２は、その仮名「の」を、直前の仮名「か」が対応づけられている漢字「歌」に対応付ける。 For example, as in the above example, it is assumed that the notation of the word to be registered is “Wakaura” and the reading kana is “Wakaura”. In this case, by comparing the kana readings of the kanji and the kana readings of the words in order from the first kanji of the word, the matching kana is identified, so that the kanji “wa” “song” and kana “wa” “ka” It is associated. In addition, the kanji “Ura” and the kana “Ura” are associated with each other by comparing the kana readings of the kanji and the kana readings of the words in order from the last kanji of the word. Finally, of the word reading kana, only the kana “no” remains without being associated with any kanji. Therefore, the single kanji mismatch processing unit 22 associates the kana “no” with the kanji “song” to which the immediately preceding kana “ka” is associated.

この変形例によれば、単漢字不一致処理部２２は、動的マッチングを用いるよりも単語の読み仮名を漢字に対応付けるための演算量を削減できる。
あるいはまた、単漢字不一致処理部２２は、最初に、上記の変形例のように、単語の先頭と最後尾からそれぞれ順に漢字の読み仮名と単語の読み仮名を対応付ける処理を行ってもよい。そして、複数の仮名が漢字と対応付けられなかった場合に、単漢字不一致処理部２２は、動的マッチングにより読み仮名と漢字の対応付けを行ってもよい。これにより、単漢字不一致処理部２２は、「和歌浦」のように、読み仮名中で一つの仮名のみが漢字の読みと一致しない場合の演算量を削減しつつ、「安宅一丁目」のように、複数の仮名が漢字の読みと一致しない場合でも、読み仮名と漢字を適切に対応付けられる。 According to this modification, the single kanji mismatch processing unit 22 can reduce the amount of calculation for associating the reading kana of a word with a kanji rather than using dynamic matching.
Alternatively, the single kanji mismatch processing unit 22 may first perform a process of associating a kanji reading kana and a word reading kana sequentially from the beginning and the tail of the word, respectively, as in the above modification. When a plurality of kana characters are not associated with kanji, the single kanji mismatch processing unit 22 may associate the reading kana and kanji by dynamic matching. Thereby, the single kanji mismatch processing unit 22 reduces the amount of calculation when only one kana in the reading kana does not match the reading of the kanji like “Wakaura”, while reducing the calculation amount to “Ataka 1-chome”. Even when a plurality of kana characters do not coincide with kanji readings, kana and kanji characters can be appropriately associated.

単漢字不一致処理部２２は、単語の読み仮名に含まれる各仮名と漢字の対応関係を長音変換部２３へ通知する。 The single kanji mismatch processing unit 22 notifies the long sound conversion unit 23 of the correspondence between each kana and kanji included in the reading kana of the word.

長音変換部２３は、単語の読み仮名に含まれる、長音化可能な母音の並びを抽出する。そして長音変換部２３は、単語の読み仮名と漢字の対応関係に基づいて、その母音の並びが同じ漢字に対応付けられているか否か判定する。そしてその母音の並びが同じ漢字に対応付けられている場合に、長音変換部２３は、その母音の並びを長音化することで、単語の表音文字列を生成する。 The long sound conversion unit 23 extracts a sequence of vowels that can be made into long sound included in the reading of the word. Then, the long sound conversion unit 23 determines whether or not the sequence of the vowels is associated with the same kanji based on the correspondence between the reading of the word and the kanji. When the vowel sequence is associated with the same Chinese character, the long sound conversion unit 23 generates a phonetic character string of the word by making the vowel sequence longer.

図７は、長音化変換処理の動作フローチャートを示す図である。長音変換部２３は、単語の読み仮名に含まれるモーラの総数Nを求める（ステップＳ１０１）。長音変換部２３は、注目するモーラの先頭からの順序iを1に設定する（ステップＳ１０２）。なお、モーラの総数Nが1の場合、長音化される仮名は存在しないので、長音変換部２３は、直ちに長音変換処理を終了してもよい。 FIG. 7 is a diagram showing an operation flowchart of the long sound conversion processing. The long sound conversion unit 23 obtains the total number N of mora included in the word reading kana (step S101). The long sound conversion unit 23 sets the order i from the head of the noted mora to 1 (step S102). When the total number N of mora is 1, since there is no kana to be made into a long sound, the long sound conversion unit 23 may immediately end the long sound conversion process.

長音変換部２３は、単語の読み仮名のうちのi番目のモーラと(i+1)番目のモーラが長音化可能な母音の並びを含むか否か判定する（ステップＳ１０３）。i番目のモーラと(i+1)番目のモーラが長音化可能な母音の並びを含む場合（ステップＳ１０３−Ｙｅｓ）、長音変換部２３は、i番目のモーラと(i+1)番目のモーラが同じ漢字に対応するか否か判定する（ステップＳ１０４）。そしてi番目のモーラと(i+1)番目のモーラが同じ漢字に対応付けられている場合（ステップＳ１０４−Ｙｅｓ）、長音変換部２３は、(i+1)番目のモーラに相当する仮名を長音記号に変換する（ステップＳ１０５）。 The long sound conversion unit 23 determines whether or not the i-th mora and the (i + 1) -th mora of the word reading pseudonyms include a sequence of vowels that can be made long (step S103). When the i-th mora and the (i + 1) -th mora include a sequence of vowels that can be made longer (step S103-Yes), the long-tone conversion unit 23 uses the i-th mora and the (i + 1) -th mora. Are determined to correspond to the same kanji (step S104). When the i-th mora and the (i + 1) -th mora are associated with the same kanji (step S104-Yes), the long sound conversion unit 23 sets the kana corresponding to the (i + 1) -th mora. Conversion into a long sound symbol (step S105).

ステップＳ１０５の後、またはステップＳ１０４にてi番目のモーラと(i+1)番目のモーラがそれぞれ別の漢字に対応する場合（ステップＳ１０４−Ｎｏ）、長音変換部２３は、(i+1)がモーラの総数Nよりも小さいか否か判定する（ステップＳ１０６）。同様に、ステップＳ１０３にてi番目のモーラと(i+1)番目のモーラが長音化可能な母音の並びを含まない場合も（ステップＳ１０３−Ｎｏ）、長音変換部２３は、(i+1)がモーラの総数Nよりも小さいか否か判定する（ステップＳ１０６）。 After step S105 or when the i-th mora and the (i + 1) -th mora correspond to different kanji characters in step S104 (step S104-No), the long sound conversion unit 23 selects (i + 1) Is smaller than the total number N of mora (step S106). Similarly, when the i-th mora and the (i + 1) -th mora do not include a sequence of vowels that can be made longer in step S103 (step S103-No), the long sound conversion unit 23 (i + 1) ) Is smaller than the total number N of mora (step S106).

(i+1)がモーラの総数Nよりも小さければ（ステップＳ１０６−Ｙｅｓ）、長音変換部２３は、iを1インクリメントし（ステップＳ１０７）、ステップＳ１０３以降の処理を繰り返す。一方、(i+1)がモーラの総数N以上であれば（ステップＳ１０６−Ｎｏ）、長音変換部２３は、長音変換処理を終了する。 If (i + 1) is smaller than the total number N of mora (step S106—Yes), the long sound conversion unit 23 increments i by 1 (step S107), and repeats the processing after step S103. On the other hand, if (i + 1) is equal to or greater than the total number N of mora (No at Step S106), the long sound conversion unit 23 ends the long sound conversion process.

長音変換部２３は、単語の読み仮名に対する長音変換処理後に得られた文字列を、単語の表記とともに登録部２４に渡す。 The long sound conversion unit 23 passes the character string obtained after the long sound conversion process for the word reading kana together with the word notation to the registration unit 24.

登録部２４は、得られた文字列を表音文字列として、単語の表記ととともに単語辞書に登録する。 The registration unit 24 registers the obtained character string as a phonetic character string in the word dictionary together with the word notation.

図８は、音声合成装置１の辞書登録部１２により実行される、単語登録処理の動作フローチャートである。辞書登録部１２は、登録対象の単語ごとに、以下の動作フローチャートに従って単語登録処理を実行する。 FIG. 8 is an operation flowchart of word registration processing executed by the dictionary registration unit 12 of the speech synthesizer 1. The dictionary registration unit 12 executes word registration processing according to the following operation flowchart for each word to be registered.

辞書登録部１２の単漢字分割判定部２１は、単漢字辞書を参照して、登録対象の単語の読み仮名と漢字の対応付けを実行する（ステップＳ２０１）。そして単漢字分割判定部２１は、読み仮名に含まれる全ての仮名を、単語中の何れかの漢字に対応付けられたか否か判定する（ステップＳ２０２）。 The single-kanji division determination unit 21 of the dictionary registration unit 12 refers to the single-kanji dictionary and associates the kana and kanji of the registration target word (step S201). Then, the single kanji division determination unit 21 determines whether or not all kana included in the reading kana are associated with any kanji in the word (step S202).

読み仮名に含まれる仮名の何れかが単語中のどの漢字にも対応付けられなかった場合（ステップＳ２０２−Ｎｏ）、単漢字分割判定部２１は、その単語の表記と読み仮名を辞書登録部１２の単漢字不一致処理部２２へ通知する。そして単漢字不一致処理部２２は、登録対象の単語の読み仮名のうち、辞書登録された漢字の読み仮名と一致しない仮名に対応する漢字を、前後の仮名に対応する漢字に基づいて特定する（ステップＳ２０３）。そして単漢字不一致処理部２２は、読み仮名と漢字の対応関係を辞書登録部１２の長音変換部２３へ通知する。 If any of the kana included in the reading kana is not associated with any kanji in the word (step S202-No), the single kanji division determination unit 21 stores the notation of the word and the reading kana in the dictionary registration unit 12. The single kanji mismatch processing unit 22 is notified. Then, the single kanji mismatch processing unit 22 identifies kanji corresponding to kana characters that do not match the kana readings of the kanji registered in the dictionary, based on the kanji characters corresponding to the preceding and following kana characters. Step S203). Then, the single kanji mismatch processing unit 22 notifies the long sound conversion unit 23 of the dictionary registration unit 12 of the correspondence between the reading kana and the kanji.

一方、ステップＳ２０２にて、読み仮名に含まれる全ての仮名を単語中の何れかの漢字に対応付けられた場合（ステップＳ２０２−Ｙｅｓ）、単漢字分割判定部２１は、読み仮名と漢字の対応関係を長音変換部２３へ通知する。 On the other hand, in step S202, when all the kana characters included in the reading kana are associated with any kanji in the word (step S202-Yes), the single kanji division determination unit 21 associates the reading kana with the kanji. The relationship is notified to the long sound conversion unit 23.

長音変換部２３は、読み仮名と漢字の対応関係が通知されると、その対応関係に基づいて、登録対象の単語の読み仮名に対する長音変換処理を実行する（ステップＳ２０４）。そして長音変換部２３は、単語の読み仮名に対する長音変換処理後に得られた文字列を、単語の表記とともに辞書登録部１２の登録部２４に渡す。
登録部２４は、長音変換処理により得られた文字列を表音文字列として、単語の表記ととともに単語辞書に登録する（ステップＳ２０５）。その後、辞書登録部１２は、辞書登録処理を終了する。 When the correspondence between the reading kana and the kanji is notified, the long sound conversion unit 23 performs a long sound conversion process on the reading kana of the word to be registered based on the correspondence (step S204). Then, the long sound conversion unit 23 passes the character string obtained after the long sound conversion process for the word reading kana to the registration unit 24 of the dictionary registration unit 12 together with the word notation.
The registration unit 24 registers the character string obtained by the long sound conversion process as a phonetic character string in the word dictionary together with the word notation (step S205). Thereafter, the dictionary registration unit 12 ends the dictionary registration process.

以上に説明してきたように、この音声合成装置は、単語辞書に単語を登録する際、その単語の読み仮名から自動的に表音文字列を生成する。その際、この音声合成装置は、登録対象の単語の読み仮名に含まれる仮名のうち、その単語の表記に含まれる何れの漢字の読み仮名とも一致しない仮名に対応する漢字を、その前または後の仮名に対応する漢字に基づいて特定する。そのため、この音声合成装置は、長音変換可能な母音の並びが単語の読み仮名に含まれる場合に、その母音の並びが同じ漢字に対応付けられているか否かが分かるので、その母音の並びを実際に長音に変換するか否かを適切に判断できる。 As described above, when registering a word in the word dictionary, the speech synthesizer automatically generates a phonogram string from the reading kana of the word. At this time, the speech synthesizer converts a kanji character corresponding to a kana character that does not match any kanji reading kana character included in the word notation from among the kana characters included in the reading kana character of the registration target word. Based on the kanji corresponding to the kana. Therefore, this speech synthesizer knows whether or not the sequence of vowels that can be converted into long tones is included in the reading of a word, so that the sequence of vowels is associated with the same kanji. It can be appropriately determined whether or not the sound is actually converted into a long sound.

変形例によれば、単漢字分割判定部２１は省略されてもよい。この場合には、辞書登録部１２は、登録しようとする全ての単語に対して単漢字不一致処理部２２の処理を実行することで、その単語の読み仮名に含まれる各仮名をその単語の何れかの漢字に対応付ける。 According to the modification, the single Chinese character division determination unit 21 may be omitted. In this case, the dictionary registration unit 12 executes the process of the single kanji mismatch processing unit 22 for all the words to be registered, so that each kana included in the reading kana of the word is Correspond to kanji.

さらに、上記の各実施形態による音声合成装置の処理部が有する各機能をコンピュータに実現させるコンピュータプログラムは、コンピュータによって読み取り可能な媒体、例えば、磁気記録媒体、光記録媒体または半導体メモリに記録された形で提供されてもよい。 Furthermore, a computer program that causes a computer to realize each function of the processing unit of the speech synthesizer according to each of the above embodiments is recorded on a computer-readable medium, for example, a magnetic recording medium, an optical recording medium, or a semiconductor memory. It may be provided in the form.

ここに挙げられた全ての例及び特定の用語は、読者が、本発明及び当該技術の促進に対する本発明者により寄与された概念を理解することを助ける、教示的な目的において意図されたものであり、本発明の優位性及び劣等性を示すことに関する、本明細書の如何なる例の構成、そのような特定の挙げられた例及び条件に限定しないように解釈されるべきものである。本発明の実施形態は詳細に説明されているが、本発明の精神及び範囲から外れることなく、様々な変更、置換及び修正をこれに加えることが可能であることを理解されたい。 All examples and specific terms listed herein are intended for instructional purposes to help the reader understand the concepts contributed by the inventor to the present invention and the promotion of the technology. It should be construed that it is not limited to the construction of any example herein, such specific examples and conditions, with respect to showing the superiority and inferiority of the present invention. Although embodiments of the present invention have been described in detail, it should be understood that various changes, substitutions and modifications can be made thereto without departing from the spirit and scope of the present invention.

１音声合成装置
２入力部
３記憶部
４処理部
５出力部
６スピーカ
１０言語処理部
１１音声合成部
１２辞書登録部
２１単漢字分割判定部
２２単漢字不一致処理部
２３長音変換部
２４登録部 DESCRIPTION OF SYMBOLS 1 Speech synthesizer 2 Input part 3 Storage part 4 Processing part 5 Output part 6 Speaker 10 Language processing part 11 Speech synthesizer part 12 Dictionary registration part 21 Single Chinese character division | segmentation determination part 22 Single Chinese character mismatch processing part 23 Long sound conversion part 24 Registration part

Claims

A word dictionary registration computer for registering a word in a word dictionary in which kanji kana notation of a word and a phonetic character string representing pronunciation of the word are registered, which is used to generate a synthesized speech signal from text data A program,
Among the reading kana of the word to be registered in the word dictionary, the kanji corresponding to the kana that does not match the reading kana registered in the single kanji dictionary of any kanji included in the kanji kana notation of the word is By specifying based on the kanji corresponding to the preceding or succeeding kana, the single kanji mismatch processing is performed to associate all kana included in the reading kana of the word with one of the kanji included in the kana notation of the word And
Based on the correspondence between the reading kana and kanji of the word to be registered, when it is determined that the arrangement of vowels that can be made longer is included in the reading kana of the word and corresponds to the same kanji, the arrangement of the vowels Is converted into a long sound to generate the phonogram string of the word to be registered,
Registering the phonetic character string of the word to be registered in the word dictionary together with the kanji kana notation of the word;
A computer program for registering a word dictionary for causing a computer to execute the operation.

Of the kana included in the reading kana of the word to be registered in the word dictionary, the kana matching the kana registered in the single kanji dictionary of the kanji included in the kanji kana notation of the word is associated with the kanji And causing the computer to execute the single kanji mismatch process when any of the kana included in the reading kana of the word does not correspond to any kanji included in the kanji kana notation of the word. Item 6. A computer program for registering a word dictionary according to Item 1.

Performing the single kanji mismatch processing is registered in the single kanji dictionary of each kanji according to the kanji order included in the kana kana notation of the word and the kana kana notation of the word according to dynamic programming A kana that does not correspond to any kanji included in the kanji kana notation of the word among the reading kana of the word is set as a kana before or after the kana so that the difference from the arrangement of the kana is minimized. The word dictionary registration computer program according to claim 1 or 2, wherein the computer program is associated with a corresponding kanji.

Executing the single kanji mismatch processing detects, in order from the first kanji of the word, a matching part of the reading kana registered in the single kanji dictionary of the kanji and the reading kana of the word, and In order from the last kanji of the word, by detecting a matching part of the reading kana registered in the single kanji dictionary of the kanji and the reading kana of the word, The word according to claim 1, wherein a kana that does not correspond to any kanji included in the kanji kana notation of the word is specified, and the specified kana is associated with a kanji corresponding to a kana before the kana. Computer program for dictionary registration.

A speech synthesizer that generates a synthesized speech signal from text data using a word dictionary that registers a kanji kana notation of a word and a phonogram representing a pronunciation of the word,
A storage unit for storing a single kanji dictionary in which kanji notation and reading kana are registered for each kanji;
An input unit for acquiring kana kana notation and reading kana of a word to be registered in the word dictionary;
Among the reading kana of the word to be registered, the kanji corresponding to the kana that does not match the reading kana registered in the single kanji dictionary of any kanji included in the kanji kana notation of the word is displayed before the kana or A single kanji mismatch processing unit that associates all kana included in the reading kana of the word with any of the kanji included in the kana kana notation of the word by specifying the kana corresponding to the later kana,
Based on the correspondence between the reading kana and kanji of the word to be registered, when it is determined that the arrangement of vowels that can be made longer is included in the reading kana of the word and corresponds to the same kanji, the arrangement of the vowels A long sound converting unit that generates the phonetic character string of the word to be registered by converting the sound into a long sound;
A registration unit for registering the phonetic character string of the word to be registered in the word dictionary together with the kanji kana notation of the word;
A speech synthesizer.

A method of registering a word in a word dictionary in which kanji kana notation of a word and a phonetic character string representing the pronunciation of the word are used to generate a synthesized speech signal from text data,
Obtain kanji kana notation and kana for the word to be registered in the word dictionary,
Among the reading kana of the word to be registered, the processor selects kanji corresponding to the kana that does not match the reading kana registered in the single kanji dictionary of any kanji included in the kanji kana notation of the word. By specifying based on the kanji corresponding to the preceding or succeeding kana, all kana included in the reading kana of the word is associated with any kanji included in the kana kana notation of the word,
If the processor determines that the sequence of vowels that can be made longer in the reading kana of the word corresponds to the same kanji based on the correspondence between the reading kana and kanji of the word to be registered, By converting the sequence of vowels to a long sound, the phonogram string of the word to be registered is generated,
The processor registers the phonetic character string of the word to be registered in the word dictionary together with the kanji kana notation of the word;
Word dictionary registration method including