JP6197523B2

JP6197523B2 - Speech synthesizer, language dictionary correction method, and language dictionary correction computer program

Info

Publication number: JP6197523B2
Application number: JP2013192984A
Authority: JP
Inventors: 野田　拓也; 拓也野田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2013-09-18
Filing date: 2013-09-18
Publication date: 2017-09-20
Anticipated expiration: 2033-09-18
Also published as: JP2015060038A

Description

本発明は、例えば、テキストデータから音声信号を合成する音声合成装置及びその音声合成装置で利用される言語辞書の修正方法及び言語辞書修正用コンピュータプログラムに関する。 The present invention relates to a speech synthesizer that synthesizes a speech signal from text data, a language dictionary correction method and a language dictionary correction computer program used in the speech synthesizer, for example.

近年、音声を自動合成する音声合成技術が開発されている。音声合成技術は、短時間で所望の音声を作成できるというメリットを有するため、これまで予め録音されたプロのナレータによる音声を用いていたアプリケーションの中には、このような音声合成技術を採用したものもある。特に、商業施設における案内放送、ハイウェイラジオ、ハイウェイテレホンまたは天気予報の放送など、短い時間間隔で提供する情報が更新されるアプリケーションでは、上記のメリットを持つ音声合成技術が有用である。 In recent years, speech synthesis technology for automatically synthesizing speech has been developed. Since speech synthesis technology has the advantage that it can create desired speech in a short time, such speech synthesis technology has been adopted in applications that have used pre-recorded speech by professional narrators. There are also things. In particular, in an application in which information provided at a short time interval is updated, such as a guidance broadcast in a commercial facility, a highway radio, a highway telephone, or a weather forecast broadcast, the speech synthesis technology having the above-described advantages is useful.

合成したい音声信号を生成するために、音声合成装置には、例えば、キーボードなどを介して漢字仮名交じりのテキストデータが入力される。そして音声合成装置は、そのテキストデータに対して、単語の読みなどを登録した言語辞書を利用して、形態素解析または係り受け解析といった言語処理を行う。そして音声合成装置は、その言語処理によって、形態素単位の読み表記を表す形態素情報と、その形態素情報にアクセント位置、アクセントの強弱あるいは抑揚の大小といった韻律を表す韻律記号を付した中間表記を生成する。そして音声合成装置は、その中間表記に基づいて、合成音声信号を生成する。なお、漢字仮名表記の読み、アクセント位置及びその漢字仮名表記を含むアクセント句の範囲を表す情報を、本明細書では読み情報と呼ぶ。また、漢字仮名表記におけるアクセントの強弱、音程の高低、抑揚の大小、話速の緩急、音量の大小、及び発音中の区切り位置などを、本明細書では韻律情報と呼ぶ。 In order to generate a speech signal to be synthesized, text data mixed with kanji characters is input to the speech synthesizer via, for example, a keyboard. The speech synthesizer performs language processing such as morphological analysis or dependency analysis on the text data using a language dictionary in which word readings are registered. Then, the speech synthesizer generates morpheme information representing the morpheme-based reading notation and intermediate notation with the prosody symbol representing the prosody such as the accent position, the strength of the accent, or the size of the inflection on the morpheme information by the language processing. . Then, the speech synthesizer generates a synthesized speech signal based on the intermediate notation. Note that information representing the reading of the kanji kana notation, the accent position and the range of the accent phrase including the kanji kana notation is referred to as reading information in this specification. Also, in this specification, accent strength, kanji pitch, pitch, inflection, speed of speech, volume level, break position during pronunciation, etc. in kanji kana notation are referred to as prosodic information in this specification.

漢字には、複数の読み方があり、また、単語によってその単語に含まれる漢字の読み方は異なる。また、日常で使用される単語は日々変化するので、全ての単語を予め言語辞書に登録することは事実上不可能であり、入力されたテキストデータに、言語辞書に登録されていない単語が含まれていることもある。そのため、言語処理の結果得られる中間表記が正確でないこともある。このような場合、正しい合成音声信号を得るために、ユーザが手動で中間表記を修正する必要がある。このような修正作業は、ユーザにとって煩雑であるため、できるだけこのような修正作業が発生しないことが好ましい。そこで、入力テキストデータの終端に至った際にいままで抽出された未知語を一括してユーザに伝え、その未知語とユーザにより入力されたその未知語についての情報を単語辞書に登録する技術が提案されている（例えば、特許文献１を参照）。また、言語解析結果のうちの第１候補の言語解析結果の中に記憶された置換条件と一致する部分が存在する場合、その一致部分を置換条件に対応する置換情報に置き換えて新たな言語解析結果を生成する技術が提案されている（例えば、特許文献２を参照）。この技術では、新たな言語解析結果と同じものが第１候補以外の言語解析結果として存在している場合には、新たな言語解析結果に基づいて合成音声が生成される。 There are several ways to read kanji, and how to read kanji included in the word differs depending on the word. In addition, since words used in daily life change day by day, it is virtually impossible to register all words in the language dictionary in advance, and the input text data includes words that are not registered in the language dictionary. Sometimes it is. Therefore, the intermediate notation obtained as a result of language processing may not be accurate. In such a case, in order to obtain a correct synthesized speech signal, the user needs to manually correct the intermediate notation. Since such a correction work is complicated for the user, it is preferable that such a correction work does not occur as much as possible. Therefore, when the input text data reaches the end of the input text data, the unknown words extracted so far are collectively transmitted to the user, and the unknown word and the information about the unknown word input by the user are registered in the word dictionary. It has been proposed (see, for example, Patent Document 1). Further, if there is a part that matches the stored replacement condition in the first candidate language analysis result in the language analysis result, the matching part is replaced with replacement information corresponding to the replacement condition, and a new language analysis is performed. A technique for generating a result has been proposed (see, for example, Patent Document 2). In this technique, when the same language analysis result other than the first candidate exists as a language analysis result other than the first candidate, a synthesized speech is generated based on the new language analysis result.

特開平７−２４４４９１号公報JP 7-244491 A 特開平１０−３１２３７７号公報Japanese Patent Laid-Open No. 10-312377

言語辞書に、読み情報及び韻律情報が登録されている単語について、例えば、複数のユーザがその単語の合成音声のレビューを行ってより適切な合成音声を得ようとする場合など、その読み情報または韻律情報が修正されることがある。特に、単語の読み情報が修正されると、その単語またはその単語を含む文の韻律情報も、場合によっては修正が必要となる。韻律情報は、読み情報に影響されるためである。もし、読み情報が修正されても韻律情報が修正されなければ、韻律情報が付された単語等の読みと実際の読みとが異なるために、その韻律情報が使用されなくなるか、その韻律情報を無理に使用しても適切な韻律とならなくなる。一方、単語の読み情報の修正に伴って、その単語に関連する韻律情報を全て修正するのは煩雑な手間を要する。 For words whose reading information and prosodic information are registered in the language dictionary, for example, when a plurality of users review the synthesized speech of the word to obtain a more appropriate synthesized speech, Prosodic information may be modified. In particular, when the reading information of a word is corrected, the prosodic information of the word or a sentence including the word also needs to be corrected in some cases. This is because prosodic information is influenced by reading information. If the prosodic information is not corrected even if the reading information is corrected, the prosodic information is not used or the prosodic information is not used because the reading of the word or the like with the prosodic information is different from the actual reading. Even if it is forcibly used, it does not become an appropriate prosody. On the other hand, along with the correction of the word reading information, it is troublesome to correct all the prosodic information related to the word.

そこで本明細書は、一つの側面として、言語辞書に登録された単語の読み情報が修正されても、その単語を含む句に言語辞書に登録済みの韻律情報を反映させることが可能な音声合成装置を提供することを目的とする。 Therefore, as one aspect, this specification is a speech synthesis capable of reflecting prosodic information registered in a language dictionary in a phrase including the word even if the reading information of the word registered in the language dictionary is corrected. An object is to provide an apparatus.

一つの実施形態によれば、テキストデータから生成した、そのテキストデータの読み及び韻律を表す韻律情報を含む中間表記に基づいて合成音声信号を生成する音声合成装置が提供される。この音声合成装置は、少なくとも１以上の単語の漢字仮名表記及びその単語の読み情報と、少なくとも１以上の韻律修正対象句の漢字仮名表記と、韻律修正対象句の中間表記と、韻律修正対象句の中間表記に含まれる韻律情報が適用されるときに韻律修正対象句と連続する文字列の漢字仮名表記とが登録され、かつ、テキストデータから中間表記を生成するために利用される言語辞書を記憶し、読み情報は単語の読み及びアクセント位置に関する情報である記憶部と、少なくとも１以上の単語の何れかの読み情報が修正されると、少なくとも１以上の韻律修正対象句の中から、読み情報が修正された単語を含む韻律修正対象句を検出する検索部と、読み情報が修正された単語を含む韻律修正対象句と、その韻律修正対象句と連続する文字列とを結合することで得られる文字列を韻律修正対象句の韻律情報を継承するか否かを判定する対象範囲として設定する対象範囲決定部と、対象範囲をアクセント句単位に分割し、韻律修正対象句の少なくとも一部と読み情報が修正された単語を含むアクセント句を、韻律修正対象句の韻律情報を継承する韻律継承アクセント句と判定するアクセント句判定部と、修正された単語の読み情報を含む言語辞書を用いて対象範囲を言語解析することにより得られた中間表記のうちの韻律継承アクセント句に対応する部分の中間表記を、韻律継承アクセント句についての韻律情報で修正する修正部と、韻律修正対象句を韻律継承アクセント句で置換するとともに、置換後の韻律修正対象句に対応する中間表記を韻律継承アクセント句についての修正された中間表記とすることで言語辞書を更新する更新部とを有する。 According to one embodiment, there is provided a speech synthesizer that generates a synthesized speech signal based on an intermediate notation including prosody information that represents reading and prosody of the text data generated from the text data. This speech synthesizer includes at least one kanji kana notation of a word and reading information of the word, at least one kanji kana notation of a prosodic correction target phrase, an intermediate notation of a prosody correction target phrase, and a prosody correction target phrase. When the prosodic information included in the intermediate notation is applied, the language dictionary used for generating the intermediate notation from the text data is registered with the prosody correction target phrase and the kanji kana notation of the continuous character string The stored information is information related to the reading and accent position of the word, and when at least one of the reading information of one or more words is corrected, the reading is read from at least one of the prosody correction target phrases. A search unit for detecting a prosodic correction target phrase including a word whose information is corrected; a prosody correction target phrase including a word whose reading information is corrected; and a character string continuous with the prosody correction target phrase; A target range determination unit that sets a character string obtained by combining as a target range for determining whether or not to inherit the prosodic information of the prosodic correction target phrase, and divides the target range into accent phrase units, and the prosody correction target phrase And an accent phrase determination unit that determines an accent phrase including at least a part of the word and a word whose reading information has been corrected as a prosodic inheritance accent phrase that inherits the prosodic information of the prosody correction target phrase, and includes reading information of the corrected word A correction unit for correcting the intermediate notation of the part corresponding to the prosodic inherited accent phrase in the intermediate notation obtained by linguistic analysis of the target range using the language dictionary, with the prosodic information about the prosodic inherited accent phrase, and the prosody The revision target phrase is replaced with the prosodic inheritance accent phrase, and the intermediate notation corresponding to the replaced prosody revision phrase is corrected for the prosodic inheritance phrase. And a update unit for updating the language dictionary by the intermediate representation.

本発明の目的及び利点は、請求項において特に指摘されたエレメント及び組み合わせにより実現され、かつ達成される。
上記の一般的な記述及び下記の詳細な記述の何れも、例示的かつ説明的なものであり、請求項のように、本発明を限定するものではないことを理解されたい。 The objects and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims.
It should be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention as claimed.

本明細書に開示された音声合成装置は、言語辞書に登録された単語の読み情報が修正されても、その単語を含む句に言語辞書に登録済みの韻律情報を反映させることができる。 The speech synthesizer disclosed in this specification can reflect the prosodic information registered in the language dictionary in the phrase including the word even if the reading information of the word registered in the language dictionary is corrected.

修正された読み情報と韻律情報間に不整合が生じた中間表記の一例を示す図である。It is a figure which shows an example of the intermediate notation in which the mismatch produced between the corrected reading information and prosodic information. 一つの実施形態による音声合成装置の概略構成図である。It is a schematic block diagram of the speech synthesizer by one Embodiment. （ａ）は、ユーザ辞書の一例を示す図であり、（ｂ）は、中間表記辞書の一例を示す図である。(A) is a figure which shows an example of a user dictionary, (b) is a figure which shows an example of an intermediate notation dictionary. 一つの実施形態による音声合成装置が有する処理部の機能ブロック図である。It is a functional block diagram of the process part which the speech synthesizer by one Embodiment has. 中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係の一例を示す図である。It is a figure which shows an example of the relationship between an intermediate notation dictionary and a user dictionary, and a prosodic inheritance accent phrase. 中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係の他の一例を示す図である。It is a figure which shows another example of the relationship between an intermediate notation dictionary and a user dictionary, and a prosodic inheritance accent phrase. 中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係のさらに他の一例を示す図である。It is a figure which shows another example of the relationship between an intermediate notation dictionary and a user dictionary, and a prosodic inheritance accent phrase. 中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係のさらに他の一例を示す図である。It is a figure which shows another example of the relationship between an intermediate notation dictionary and a user dictionary, and a prosodic inheritance accent phrase. 複合名詞の一例を示す図である。It is a figure which shows an example of a compound noun. 複合名詞の他の一例を示す図である。It is a figure which shows another example of a compound noun. 修正単語が複合名詞に含まれない場合における、韻律情報継承範囲の一例を示す図である。It is a figure which shows an example of the prosodic information inheritance range in case a correction word is not contained in a compound noun. 修正単語が複合名詞に含まれない場合における、韻律情報継承範囲の他の一例を示す図である。It is a figure which shows another example of the prosodic information inheritance range in case a correction word is not contained in a compound noun. 修正単語が複合名詞に含まれない場合における、韻律情報継承範囲のさらに他の一例を示す図である。It is a figure which shows another example of the prosodic information inheritance range in case a correction word is not contained in a compound noun. 修正単語が複合名詞に含まれる場合における、韻律情報継承範囲の一例を示す図である。It is a figure which shows an example of the prosodic information inheritance range in case a correction word is contained in a compound noun. 修正単語が複合名詞に含まれる場合における、韻律情報継承範囲の一例を示す図である。It is a figure which shows an example of the prosodic information inheritance range in case a correction word is contained in a compound noun. 辞書修正処理の動作フローチャートである。It is an operation | movement flowchart of a dictionary correction process.

以下、図を参照しつつ、一つの実施形態による音声合成装置について説明する。
最初に、図１を参照しつつ、修正された読み情報と韻律情報間に不整合が生じた中間表記の例を説明する。 Hereinafter, a speech synthesizer according to an embodiment will be described with reference to the drawings.
First, an example of intermediate notation in which a mismatch occurs between the corrected reading information and prosodic information will be described with reference to FIG.

ユーザにより編集され、単語の漢字仮名表記、中間表記、品詞及び活用形などを格納した、言語辞書の一例であるユーザ辞書１００には、先ず、単語「Data」について、その読みが「データー」として登録されているとする。また、韻律情報を記憶する言語辞書の一例である中間表記辞書１１０には、韻律情報の修正対象となる、単語「Data」を含む句の漢字仮名表記「Dataです」が登録されている。また、中間表記辞書１１０には、その韻律情報修正対象となる句についての韻律情報を含む中間表記「デＭーターデス」が登録されている。さらに、中間表記辞書１１０には、韻律情報の修正対象となる句についての中間表記に含まれる韻律情報が適用される場合に、その句と連続する文字列、すなわち、その句に前置される漢字仮名表記「これは」及び後置される漢字仮名表記「。」が登録されている。なお、本明細書では、漢字仮名表記と中間表記の区別を容易にするために、中間表記では片仮名を使用する。また、記号「Ｍ」は、韻律情報を表す記号であり、例えば、記号「Ｍ」は、アクセントが中程度のレベルであることを表す。この場合、音声合成装置は、漢字仮名表記が「これはDataです。」という文が入力されると、言語辞書及び中間表記辞書を参照することにより、「コレハ｜デＭーターデス．」という中間表記を作成する。 In the user dictionary 100, which is an example of a language dictionary that is edited by the user and stores the kanji kana notation, intermediate notation, part of speech, inflection form, etc. of the word, first, the word “Data” is read as “data”. Suppose that it is registered. Also, in the intermediate notation dictionary 110 that is an example of a language dictionary that stores prosodic information, a kanji kana notation “Data” of a phrase including the word “Data”, which is a correction target of the prosodic information, is registered. In the intermediate notation dictionary 110, an intermediate notation “De Matterdes” including prosodic information about the phrase whose prosody information is to be corrected is registered. Further, in the intermediate notation dictionary 110, when the prosodic information included in the intermediate notation for the phrase whose prosodic information is to be corrected is applied, the intermediate notation dictionary 110 is preceded by a character string continuous with the phrase, that is, the phrase. A kanji kana notation “This is” and a postscript kanji kana notation “.” Are registered. In the present specification, katakana is used in the intermediate notation in order to facilitate the distinction between the kanji kana notation and the intermediate notation. The symbol “M” is a symbol representing prosodic information. For example, the symbol “M” represents that the accent is at a medium level. In this case, the speech synthesizer, when the sentence “This is Data.” Is input as the kanji kana notation, by referring to the language dictionary and the intermediate notation dictionary, the intermediate notation “Koreha | de Matterdes.” Create

ここで、ユーザ辞書１００に登録されている単語「Data」の読みが「データー」から「データ」に修正されたとする。この修正後に、「これはDataです。」という文が入力されると、ユーザ辞書１００に登録されている「Data」の読みが変更されているために、中間表記辞書１１０に登録されている韻律情報が反映されず、「コレハ｜データデＨス．」という中間表記が生成される。このように、ユーザが作成した韻律情報が、言語辞書の修正に伴って利用されなくなってしまう。 Here, it is assumed that the reading of the word “Data” registered in the user dictionary 100 is corrected from “data” to “data”. After this correction, when the sentence “This is Data.” Is input, the prosody registered in the intermediate notation dictionary 110 is changed because the reading of “Data” registered in the user dictionary 100 is changed. The information is not reflected, and an intermediate notation “Koreha | Data des H.” is generated. In this way, the prosodic information created by the user is not used when the language dictionary is corrected.

そこで、本実施形態による音声合成装置は、ユーザ辞書に登録された単語の読み情報が修正されると、中間表記辞書に登録されている、韻律情報の修正対象となる単語、句又は文節などの漢字仮名表記の中から、その修正された単語を含むものを検出する。さらに音声合成装置は、検出された句などの漢字仮名表記に、中間表記辞書に登録されている前後の単語などの漢字仮名表記を結合したものを、中間表記辞書に登録されている韻律情報を継承させる可能性がある対象範囲に設定する。そしてこの音声合成装置は、対象範囲を構成するアクセント句のうちで、読み情報が修正された単語を含むアクセント句を、韻律情報を継承する範囲として設定する。さらにこの音声合成装置は、読み情報が修正された単語が特定の複合名詞に含まれるか否かによって、中間表記辞書に登録されている韻律情報を継承するか否かを判定する。なお、アクセント句とは、アクセントを一つのみ含む句である。また、以下の実施形態では、理解を容易にするために、韻律情報としてアクセントの強弱のみが示されるが、上記のように、韻律情報には、音程の高低、抑揚の大小、話速の緩急、音量の大小、及び発音中の区切り位置も含まれる。 Therefore, when the reading information of the word registered in the user dictionary is corrected, the speech synthesizer according to the present embodiment, such as a word, a phrase, or a phrase that is registered in the intermediate notation dictionary and is subject to correction of the prosodic information. The kanji kana notation is detected including the corrected word. Furthermore, the speech synthesizer combines the kanji kana notation such as the detected phrase with the kanji kana notation such as the preceding and following words registered in the intermediate notation dictionary, and the prosodic information registered in the intermediate notation dictionary. Set the target range that may be inherited. Then, the speech synthesizer sets an accent phrase including a word whose reading information is corrected among the accent phrases constituting the target range as a range in which the prosodic information is inherited. Further, this speech synthesizer determines whether to inherit the prosodic information registered in the intermediate notation dictionary depending on whether or not the word whose reading information is corrected is included in a specific compound noun. An accent phrase is a phrase including only one accent. In the following embodiments, for easy understanding, only the strength of the accent is shown as the prosody information. As described above, the prosody information includes the pitch of the pitch, the level of inflection, and the slowness of the speech speed. , Volume level, and break position during pronunciation are also included.

図２は、一つの実施形態による音声合成装置の概略構成図である。本実施形態では、音声合成装置１は、入力部２と、記憶部３と、処理部４と、出力部５とを有する。 FIG. 2 is a schematic configuration diagram of a speech synthesizer according to one embodiment. In the present embodiment, the speech synthesizer 1 includes an input unit 2, a storage unit 3, a processing unit 4, and an output unit 5.

入力部２は、合成音声の原文であり、漢字仮名交じり文であるテキストデータを取得する。そのために、入力部２は、例えば、キーボードを有する。また、入力部２は、マウスなどのポインティングデバイスとそのポインティングデバイスにより指示される入力すべき文字または数値などを表示するディスプレイとを有する。あるいは、入力部２は、タッチパネルディスプレイを有してもよい。
さらにまた、入力部２は、テキストデータを通信ネットワークを介して音声合成装置１と接続された他の機器から取得してもよい。この場合、入力部２は、音声合成装置１を通信ネットワークに接続するためのインターフェース回路を有する。
そして入力部２は、入力されたテキストデータを処理部４へ渡す。 The input unit 2 obtains text data that is an original text of synthesized speech and is a kanji-kana mixed text. For this purpose, the input unit 2 includes, for example, a keyboard. The input unit 2 includes a pointing device such as a mouse and a display that displays characters or numerical values to be input, which are instructed by the pointing device. Alternatively, the input unit 2 may have a touch panel display.
Furthermore, the input unit 2 may acquire text data from another device connected to the speech synthesizer 1 via a communication network. In this case, the input unit 2 includes an interface circuit for connecting the speech synthesizer 1 to a communication network.
The input unit 2 passes the input text data to the processing unit 4.

記憶部３は、例えば、半導体メモリ回路、磁気記憶装置または光記憶装置のうちの少なくとも一つを有する。そして記憶部３は、処理部４で用いられる各種コンピュータプログラム及び音声合成処理に用いられる各種のデータを記憶する。
記憶部３は、音声合成処理に用いられるデータとして、例えば、韻律モデルと、音声波形辞書を記憶する。さらに記憶部３は、言語処理に用いられるデータとして、テキストデータ中に出現すると想定される様々な単語について、その単語の漢字仮名表記、中間表記、品詞及び活用形などを格納した言語辞書を記憶する。さらに記憶部３は、ユーザにより登録された単語について、その単語の漢字仮名表記、中間表記、品詞及び活用形などを格納したユーザ辞書を記憶する。さらに記憶部３は、韻律情報の修正対象となる単語、句または文節などの漢字仮名表記及びその韻律情報と、その単語、句または文節などの前後に位置する１以上の単語または句読点などの漢字仮名表記とを格納した中間表記辞書を記憶する。なお、以下では、便宜上、韻律情報の修正対象となる単語、句または文節などの文字列を、韻律修正対象句と呼ぶ。なお、ユーザ辞書及び中間表記辞書も、言語辞書の一例である。 The storage unit 3 includes, for example, at least one of a semiconductor memory circuit, a magnetic storage device, and an optical storage device. The storage unit 3 stores various computer programs used in the processing unit 4 and various data used for speech synthesis processing.
The storage unit 3 stores, for example, a prosodic model and a speech waveform dictionary as data used for speech synthesis processing. Further, the storage unit 3 stores, as data used for language processing, a language dictionary that stores kanji kana notation, intermediate notation, part of speech, and utilization form of the words that are supposed to appear in the text data. To do. Furthermore, the memory | storage part 3 memorize | stores the user dictionary which stored the kanji kana notation of the word, intermediate notation, a part of speech, a utilization form, etc. about the word registered by the user. Further, the storage unit 3 includes kanji kana notation such as a word, phrase or phrase to be corrected for prosodic information and its prosodic information and one or more words or punctuation marks such as one or more words or punctuation marks positioned before and after the word, phrase or phrase. An intermediate notation dictionary storing kana notation is stored. Hereinafter, for the sake of convenience, a character string such as a word, phrase, or phrase for which prosody information is to be corrected is referred to as a prosody correction target phrase. The user dictionary and the intermediate notation dictionary are also examples of language dictionaries.

図３（ａ）は、ユーザ辞書の一例を示す図である。この例では、単語「Data」がユーザ辞書３００に登録されている。そしてユーザ辞書３００には、その単語の漢字仮名表記「Data」と、中間表記「データ」と、単語「Data」の品詞（普通名詞）が登録されている。
図３（ｂ）は、中間表記辞書の一例を示す図である。この例では、アクセント句「Dataです」が韻律修正対象句として中間表記辞書３１０に登録されている。そして中間表記辞書３１０には、その韻律修正対象句の漢字仮名表記「Dataです」及び韻律情報を含む中間表記「デＭーターデス」が登録される。さらに、中間表記辞書３１０には、韻律修正対象句についての中間表記に含まれる韻律情報が適用されるときに韻律修正対象句に前置される単語等の文字列の漢字仮名表記「これは」及びアクセント句に後置される単語等の文字列の漢字仮名表記「。」が登録される。 FIG. 3A shows an example of a user dictionary. In this example, the word “Data” is registered in the user dictionary 300. In the user dictionary 300, kanji kana notation “Data”, intermediate notation “data”, and part of speech (common noun) of the word “Data” are registered.
FIG. 3B is a diagram illustrating an example of the intermediate notation dictionary. In this example, the accent phrase “Data” is registered in the intermediate notation dictionary 310 as a prosody modification target phrase. In the intermediate notation dictionary 310, the kanji kana notation “Data” of the prosody modification target phrase and the intermediate notation “De-Matter Death” including prosody information are registered. Further, in the intermediate notation dictionary 310, the kanji kana notation “this is” of a character string such as a word prefixed to the prosodic correction target phrase when the prosodic information included in the intermediate notation for the prosody correction target phrase is applied. In addition, kanji kana notation “.” Of a character string such as a word after the accent phrase is registered.

出力部５は、処理部４から受け取った合成音声信号をスピーカ６へ出力する。そのために、出力部５は、例えば、スピーカ６を音声合成装置１と接続するためのオーディオインターフェース回路を有する。
また出力部５は、合成音声信号を、通信ネットワークを介して音声合成装置１と接続された他の装置へ出力してもよい。この場合、出力部５は、その通信ネットワークに音声合成装置１と接続するためのインターフェース回路を有する。なお、入力部２も通信ネットワークを介してテキストデータを取得する場合、入力部２と出力部５は一体化されていてもよい。 The output unit 5 outputs the synthesized voice signal received from the processing unit 4 to the speaker 6. For this purpose, the output unit 5 includes, for example, an audio interface circuit for connecting the speaker 6 to the speech synthesizer 1.
The output unit 5 may output the synthesized speech signal to another device connected to the speech synthesizer 1 via the communication network. In this case, the output unit 5 includes an interface circuit for connecting to the speech synthesizer 1 to the communication network. In addition, when the input part 2 also acquires text data via a communication network, the input part 2 and the output part 5 may be integrated.

処理部４は、一つまたは複数のプロセッサと、メモリ回路と、周辺回路とを有する。そして処理部４は、入力されたテキストデータに基づいて、合成音声信号を作成する。
図４は、処理部４の機能ブロック図である。処理部４は、言語処理部１０と、音声合成部１１と、辞書登録部１２と、辞書修正部１３とを有する。
処理部４が有するこれらの各部は、例えば、処理部４が有するプロセッサ上で動作するコンピュータプログラムにより実現される機能モジュールである。あるいは、処理部４が有するこれらの各部は、その各部の機能を実現する一つの集積回路として音声合成装置１に実装されてもよい。 The processing unit 4 includes one or a plurality of processors, a memory circuit, and a peripheral circuit. Then, the processing unit 4 creates a synthesized speech signal based on the input text data.
FIG. 4 is a functional block diagram of the processing unit 4. The processing unit 4 includes a language processing unit 10, a speech synthesis unit 11, a dictionary registration unit 12, and a dictionary correction unit 13.
Each of these units included in the processing unit 4 is, for example, a functional module realized by a computer program that operates on a processor included in the processing unit 4. Or these each part which the process part 4 has may be mounted in the speech synthesizer 1 as one integrated circuit which implement | achieves the function of each part.

言語処理部１０は、入力された、漢字仮名交じり文であるテキストデータから形態素読み表記を生成するとともに、そのテキストデータに含まれる各形態素情報を特定する。さらに、言語処理部１０は、入力されたテキストデータから、中間表記及びそのテキストデータに含まれる各品詞の情報を特定する。ここで、形態素読み表記とは、形態素単位の読み表記を表し、例えば、カタカナで表される。また、中間表記とは、形態素読み表記に、韻律を表す韻律記号が追加されたものである。韻律記号には、例えば、「アクセント位置」、「アクセント強弱」、「音程高低」、「抑揚大小」、「話速緩急」、「音量大小」及び「区切り」を表現する記号が含まれる。したがって、中間表記から韻律記号を除いたものは、形態素読み表記と一致する。また、中間表記から韻律記号を除いたものにおける、各品詞の情報は、形態素情報と１対１に対応する。すなわち、中間表記と品詞情報から、形態素読み表記と形態素情報が抽出される。 The language processing unit 10 generates a morpheme reading notation from the input text data that is a kanji mixed sentence, and specifies each morpheme information included in the text data. Furthermore, the language processing unit 10 specifies intermediate notation and information on each part of speech included in the text data from the input text data. Here, the morpheme reading notation represents the reading notation in units of morphemes, and is expressed in katakana, for example. The intermediate notation is obtained by adding prosodic symbols representing prosody to the morpheme reading notation. The prosodic symbols include, for example, symbols that represent “accent position”, “accent strength”, “pitch pitch”, “inflection magnitude”, “speech speed”, “volume level”, and “separation”. Accordingly, the intermediate notation excluding the prosodic symbols matches the morpheme reading notation. In addition, the information of each part of speech in the intermediate notation excluding the prosodic symbols corresponds to the morpheme information on a one-to-one basis. That is, morpheme reading notation and morpheme information are extracted from the intermediate notation and part-of-speech information.

言語処理部１０は、入力されたテキストデータから形態素読み表記及び中間表記などを生成するために、記憶部３に記憶されている言語辞書、ユーザ辞書及び中間表記辞書を読み込む。そして言語処理部１０は、例えば、その言語辞書及びユーザ辞書を用いて、テキストデータに対して形態素解析及び係り受け解析を行って、テキストデータ中に出現する各単語の順序及び読み、アクセントの位置及び区切りの位置を決定する。その際、言語処理部１０は、テキストデータ中に言語辞書とユーザ辞書の両方に登録されている単語がある場合、ユーザ辞書に登録されている単語を優先的に利用してもよい。 The language processing unit 10 reads the language dictionary, user dictionary, and intermediate notation dictionary stored in the storage unit 3 in order to generate morpheme reading notation and intermediate notation from the input text data. Then, the language processing unit 10 performs morphological analysis and dependency analysis on the text data using the language dictionary and the user dictionary, for example, the order and reading of each word appearing in the text data, and the position of the accent And the position of the break is determined. At that time, when there are words registered in both the language dictionary and the user dictionary in the text data, the language processing unit 10 may preferentially use the words registered in the user dictionary.

言語処理部１０は、形態素解析として、例えば、動的計画法を用いる方法を利用できる。また言語処理部１０は、係り受け解析として、例えば、先読みＬＲパーザまたはＬＬ法といった構文解析の手法を利用できる。そして言語処理部１０は、各単語の順序、読み、アクセントの位置及び区切りの位置に応じて形態素読み表記及び中間表記を作成する。 The language processing unit 10 can use, for example, a method using dynamic programming as the morphological analysis. The language processing unit 10 can use a syntax analysis technique such as a prefetch LR parser or an LL method, for example, as dependency analysis. Then, the language processing unit 10 creates a morpheme reading notation and an intermediate notation according to the order of each word, reading, accent position, and break position.

さらに、言語処理部１０は、入力テキストデータ中で、中間表記辞書に登録されている韻律修正対象句の漢字仮名表記及びその前後の漢字仮名表記と一致する部分があれば、その一致する部分に対応する中間表記の韻律情報を中間表記辞書に従って修正する。
言語処理部１０は、生成した形態素読み表記及び中間表記などを記憶部３に一時的に記憶する。 Furthermore, if there is a part that matches the kanji kana notation of the prosody modification target phrase registered in the intermediate notation dictionary and the kanji kana notation before and after that in the input text data, the language processing unit 10 The corresponding intermediate notation prosodic information is corrected according to the intermediate notation dictionary.
The language processing unit 10 temporarily stores the generated morpheme reading notation and intermediate notation in the storage unit 3.

音声合成部１１は、入力されたテキストデータの中間表記に基づいて合成音声信号を作成する。 The speech synthesizer 11 creates a synthesized speech signal based on the intermediate notation of the input text data.

音声合成部１１は、中間表記に基づいて、合成音声信号を生成する際の目標韻律を生成する。そのために、音声合成部１１は、記憶部３から複数の韻律モデルを読み込む。この韻律モデルは、声を高くする位置及び声を低くする位置などを時間順に表したものである。そして音声合成部１１は、複数の韻律モデルのうち、中間表記に示されたアクセントの位置などに最も一致する韻律モデルを選択する。そして音声合成部１１は、選択した韻律モデル及び合成パラメータに従って、中間表記に対して声が高くなる位置あるいは声が低くなる位置、声の抑揚、ピッチなどを設定することにより、目標韻律を作成する。目標韻律は、音声波形を決定する単位となる音素ごとに、音素の長さ及びピッチ周波数を含む。なお、音素は、例えば、一つの母音あるいは一つの子音とすることができる。 The speech synthesizer 11 generates a target prosody for generating a synthesized speech signal based on the intermediate notation. For this purpose, the speech synthesis unit 11 reads a plurality of prosodic models from the storage unit 3. This prosodic model represents a position in which the voice is raised and a position in which the voice is lowered in time order. Then, the speech synthesizer 11 selects a prosodic model that most closely matches the position of the accent indicated by the intermediate notation among the plural prosodic models. Then, the speech synthesizer 11 creates a target prosody by setting a position where the voice becomes high or low, a position where the voice becomes low, a voice inflection, a pitch, and the like according to the selected prosodic model and synthesis parameters. . The target prosody includes a phoneme length and a pitch frequency for each phoneme as a unit for determining a speech waveform. Note that the phoneme can be, for example, one vowel or one consonant.

音声合成部１１は、生成した目標韻律に従って、例えば、HMM(Hidden Markov Model)合成方式、音素接続方式またはコーパスベース方式によって合成音声信号を作成する。
例えば、音声合成部１１は、音素ごとに、目標韻律の音素長及びピッチ周波数に最も近い音声波形を、例えばパターンマッチングにより音声波形辞書に登録されている複数の音声波形の中から選択する。そのために、音声合成部１１は、記憶部３から音声波形辞書を読み込む。音声波形辞書は、複数の音声波形及び各音声波形の識別番号を記録する。また音声波形は、例えば、一人以上のナレータが様々なテキストを読み上げた様々な音声を録音した音声信号から、音素単位で取り出された波形信号である。
さらに、音声合成部１１は、音素ごとに選択された音声波形を目標韻律に沿って接続できるようにするため、それら選択された音声波形と目標韻律に示された対応する音素の波形パターンとのずれ量を、波形変換情報として算出してもよい。
音声合成部１１は、音素ごとに選択された音声波形の識別番号を含む波形生成情報を作成する。波形生成情報は、波形変換情報をさらに含んでもよい。 The speech synthesizer 11 creates a synthesized speech signal according to the generated target prosody, for example, by an HMM (Hidden Markov Model) synthesis method, a phoneme connection method, or a corpus-based method.
For example, for each phoneme, the speech synthesizer 11 selects a speech waveform closest to the phoneme length and pitch frequency of the target prosody from a plurality of speech waveforms registered in the speech waveform dictionary by pattern matching, for example. For this purpose, the speech synthesis unit 11 reads a speech waveform dictionary from the storage unit 3. The speech waveform dictionary records a plurality of speech waveforms and an identification number of each speech waveform. The voice waveform is, for example, a waveform signal extracted in units of phonemes from voice signals obtained by recording various voices in which one or more narrators read various texts.
Furthermore, the speech synthesizer 11 connects the selected speech waveform and the waveform pattern of the corresponding phoneme indicated in the target prosody so that the speech waveform selected for each phoneme can be connected along the target prosody. The deviation amount may be calculated as waveform conversion information.
The speech synthesizer 11 creates waveform generation information including the identification number of the speech waveform selected for each phoneme. The waveform generation information may further include waveform conversion information.

音声合成部１１は、波形生成情報に含まれる各音素の音声波形の識別番号に対応する音声波形信号を記憶部３から読み込む。そして音声合成部１１は、各音声波形信号を連続的に接続することにより、合成音声信号を作成する。なお、波形生成情報に波形変換情報が含まれている場合、音声合成部１１は、各音声波形信号を、対応する音素について求められた波形変換情報に従って補正して音声波形信号を連続的に接続することにより、合成音声信号を作成する。
音声合成部１１は、合成音声信号を出力部５へ出力する。 The speech synthesizer 11 reads a speech waveform signal corresponding to the speech waveform identification number of each phoneme included in the waveform generation information from the storage unit 3. Then, the speech synthesizer 11 creates a synthesized speech signal by connecting each speech waveform signal continuously. When the waveform conversion information is included in the waveform generation information, the speech synthesizer 11 continuously connects the speech waveform signals by correcting each speech waveform signal according to the waveform conversion information obtained for the corresponding phoneme. By doing so, a synthesized speech signal is created.
The voice synthesizer 11 outputs the synthesized voice signal to the output unit 5.

辞書登録部１２は、言語処理部１０が生成した中間表記をユーザが修正したときに、その修正内容をユーザ辞書または中間表記辞書に登録する。 When the user corrects the intermediate notation generated by the language processing unit 10, the dictionary registration unit 12 registers the correction content in the user dictionary or the intermediate notation dictionary.

処理部４は、入力部２から中間表記の編集を行うことを示す操作信号を受け取ると、辞書登録部１２を起動する。そして辞書登録部１２は、例えば、編集対象となる中間表記を、対応するテキストデータとともに入力部２が有するディスプレイに表示させる。
そして辞書登録部１２は、入力部２のキーボード等から、編集対象の中間表記の一部、例えば、ある単語の読み情報を修正する操作信号を受け取ると、その操作信号に従って、ユーザ辞書に登録されているその単語の読み情報を修正する。また辞書登録部１２は、読み情報が修正された単語がユーザ辞書に登録されていなかった場合には、その単語の漢字仮名表記と中間表記とをユーザ辞書に登録する。 When the processing unit 4 receives an operation signal indicating that the intermediate notation editing is to be performed from the input unit 2, the processing unit 4 activates the dictionary registration unit 12. And the dictionary registration part 12 displays the intermediate notation used as editing object on the display which the input part 2 has with corresponding text data, for example.
When the dictionary registration unit 12 receives an operation signal for correcting a part of the intermediate notation to be edited, for example, reading information of a certain word, from the keyboard or the like of the input unit 2, it is registered in the user dictionary according to the operation signal. Correct the reading information for that word. If the word whose reading information has been corrected is not registered in the user dictionary, the dictionary registration unit 12 registers the kanji kana notation and the intermediate notation of the word in the user dictionary.

さらにまた、辞書登録部１２は、入力部２のキーボード等から、ある単語の韻律情報を修正する操作信号を受け取ると、中間表記辞書に登録されている韻律修正対象句の中から、修正された単語を含む韻律修正対象句及び前後の単語等と一致するものを抽出する。そして辞書登録部１２は、抽出した韻律修正対象句の韻律情報を、上記の操作信号に従って修正する。また辞書登録部１２は、韻律情報が修正された韻律修正対象句及びその前後の単語などと一致するものが中間表記辞書に登録されていなければ、その韻律修正対象句の漢字仮名表記と韻律情報を、前後の単語等の漢字仮名表記とともに中間表記辞書に登録する。 Furthermore, when the dictionary registration unit 12 receives an operation signal for correcting the prosodic information of a certain word from the keyboard or the like of the input unit 2, the dictionary registration unit 12 has corrected the prosody correction target phrase registered in the intermediate notation dictionary. A prosody correction target phrase including a word and words that match the preceding and following words are extracted. Then, the dictionary registration unit 12 corrects the prosody information of the extracted prosody modification target phrase according to the operation signal. Also, if the dictionary registration unit 12 does not register in the intermediate notation dictionary the prosody modification target phrase whose prosody information is modified and the words before and after it, the kanji kana notation and the prosody information of the prosody modification target phrase are registered. Are registered in the intermediate notation dictionary together with kanji kana notations such as preceding and following words.

なお、音声合成に対する専門知識が無いユーザでも、中間表記を修正できるように、辞書登録部１２は、入力部２が有するディスプレイに韻律記号を日本語で表示させたり、各形態素の形態素情報を表示させてもよい。またユーザが、例えば、入力部２が有するマウスなどを介して修正する韻律記号を選択したり、韻律記号を追加する位置を指定すると、辞書登録部１２は、例えば、プルダウンメニューなどで、選択可能な韻律記号を表す日本語表記をディスプレイに表示させる。そして辞書登録部１２は、マウスなどを介して選択された日本語表記に対応する韻律記号で、中間表記の指定された位置の韻律記号を置換したり、選択された日本語表記に対応する韻律記号をその指定された位置に自動的に追加する。
また辞書登録部１２は、ユーザが自分で修正した内容を把握できるようにするために、修正後の中間表記をディスプレイに表記させてもよい。さらに、辞書登録部１２は、修正後の中間表記を音声合成部１１に入力することにより、修正後の中間表記に対して実際に生成される合成音声をスピーカ６から出力させることで、ユーザに修正内容を確認させてもよい。 The dictionary registration unit 12 displays the prosodic symbols in Japanese on the display of the input unit 2 or displays the morpheme information of each morpheme so that even a user who has no expertise in speech synthesis can correct the intermediate notation. You may let them. Further, when the user selects a prosodic symbol to be corrected through the mouse of the input unit 2 or designates a position where a prosodic symbol is added, the dictionary registration unit 12 can be selected using, for example, a pull-down menu. A Japanese notation representing a prosodic symbol is displayed on the display. Then, the dictionary registration unit 12 replaces the prosodic symbol at the designated position of the intermediate notation with the prosodic symbol corresponding to the Japanese notation selected via the mouse or the like, or the prosodic corresponding to the selected Japanese notation. Automatically add a symbol at its specified position.
Further, the dictionary registration unit 12 may display the corrected intermediate notation on the display so that the user can grasp the contents corrected by himself / herself. Further, the dictionary registration unit 12 inputs the corrected intermediate notation to the speech synthesizer 11, thereby causing the user to output the synthesized speech actually generated for the corrected intermediate notation from the speaker 6. You may be asked to confirm the details of the correction.

辞書修正部１３は、ユーザ辞書に新たな単語が登録されるか、またはユーザ辞書に登録された単語の読み情報が修正された場合に、中間表記辞書に登録されている、その単語を含む韻律修正対象句の韻律情報を修正後の単語に適用するか否かを決定する。そのために、辞書修正部１３は、検索部２１と、対象範囲決定部２２と、アクセント句判定部２３と、品詞判定部２４と、修正部２５と、辞書更新部２６とを有する。 When the new word is registered in the user dictionary or the reading information of the word registered in the user dictionary is corrected, the dictionary correction unit 13 is registered in the intermediate notation dictionary and includes the prosody including the word. Whether to apply the prosodic information of the correction target phrase to the corrected word is determined. For this purpose, the dictionary correction unit 13 includes a search unit 21, a target range determination unit 22, an accent phrase determination unit 23, a part of speech determination unit 24, a correction unit 25, and a dictionary update unit 26.

検索部２１は、中間表記辞書に登録された韻律修正対象句の漢字仮名表記うち、ユーザ辞書に新たに登録された単語またはユーザ辞書に登録済みの単語のうちで読み情報が修正された単語を含む漢字仮名表記を検出する。なお、ユーザ辞書に登録がない単語については、言語辞書に登録されたその単語の読み情報を参照して中間表記が作成される。また上記のように、ユーザ辞書は、ユーザが読み情報を修正した単語のみが登録される、言語辞書の一例である。このことから、ユーザ辞書に新規に単語とその読み情報を登録することも、言語辞書に登録された単語の読み情報の修正の一形態である。また以下では、便宜上、ユーザ辞書に新たに登録された単語またはユーザ辞書に登録済みの単語のうちで読み情報が修正された単語を、修正単語と呼ぶ。 The search unit 21 searches for a word whose reading information has been corrected among words newly registered in the user dictionary or words already registered in the user dictionary out of the Kanji kana notations of the prosody correction target phrase registered in the intermediate notation dictionary. Detect kanji kana notation. For words that are not registered in the user dictionary, intermediate notation is created with reference to the reading information of the words registered in the language dictionary. Moreover, as described above, the user dictionary is an example of a language dictionary in which only words whose reading information has been corrected by the user are registered. Therefore, registering a new word and its reading information in the user dictionary is another form of correcting the reading information of the word registered in the language dictionary. Hereinafter, for the sake of convenience, a word newly registered in the user dictionary or a word whose reading information has been corrected among words registered in the user dictionary will be referred to as a corrected word.

再度図３（ａ）及び図３（ｂ）を参照しつつ、検索部２１により検出される漢字仮名表記の例について説明する。ユーザ辞書３００に登録された単語「Data」の読み情報が修正され、その結果としてその単語の中間表記が修正されたとする。この場合、検索部２１は、中間表記辞書３１０に登録されている韻律修正対象句の漢字仮名表記のなかから、単語「Data」を含む漢字仮名表記を検出する。この例では、中間表記辞書３１０に、「Data」を含む漢字仮名表記「Dataです」が登録されているので、検索部２１は、その漢字仮名表記「Dataです」を検出する。 With reference to FIG. 3A and FIG. 3B again, an example of kanji kana notation detected by the search unit 21 will be described. It is assumed that the reading information of the word “Data” registered in the user dictionary 300 is corrected, and as a result, the intermediate notation of the word is corrected. In this case, the search unit 21 detects a kanji kana notation including the word “Data” from the kanji kana notations of the prosody correction target phrase registered in the intermediate notation dictionary 310. In this example, since the kanji kana notation “Data is” including “Data” is registered in the intermediate notation dictionary 310, the search unit 21 detects the kanji kana notation “Data is”.

対象範囲決定部２２は、検索部２１により検出された漢字仮名表記に対応する、中間辞書に登録された前置漢字仮名表記と後置漢字仮名表記とを、その検出された漢字仮名表記と連結する。これにより、対象範囲決定部２２は、中間表記辞書に登録された韻律情報を継承するか否かを判定する漢字仮名表記の対象範囲を決定する。
再度図３（ｂ）を参照する。検索部２１により中間表記辞書３１０から漢字仮名表記「Dataです」が検出されたとすると、対象範囲決定部２２は、その前置漢字仮名表記「これは」と後置漢字仮名表記「。」とを、検出された漢字仮名表記「Dataです」と連結する。そして対象範囲決定部２２は、連結された漢字仮名表記「これはDataです。」を対象範囲とする。 The target range determination unit 22 concatenates the front kanji kana notation and the post kanji kana notation registered in the intermediate dictionary corresponding to the kanji kana notation detected by the search unit 21 with the detected kanji kana notation. To do. Thereby, the target range determination unit 22 determines the target range of kanji kana notation for determining whether to inherit the prosodic information registered in the intermediate notation dictionary.
Reference is again made to FIG. If the search unit 21 detects the kanji kana notation “Data” from the intermediate notation dictionary 310, the target range determination unit 22 uses the prefix kanji kana notation “this is” and the postfix kanji kana notation “.”. The kanji kana notation “Data” is connected. The target range determination unit 22 sets the concatenated kanji kana notation “This is Data.” As the target range.

アクセント句判定部２３は、対象範囲全体の漢字仮名表記をアクセント句単位で分割し、アクセント句単位で、中間表記辞書に登録されている韻律情報をそのアクセント句において継承するか否か判定する。 The accent phrase determination unit 23 divides the kanji kana notation of the entire target range in units of accent phrases, and determines whether the prosodic information registered in the intermediate notation dictionary is inherited in the accent phrases in units of accent phrases.

そのために、アクセント句判定部２３は、対象範囲全体の漢字仮名表記を言語処理部１０へ入力して、言語処理部１０に、修正後のユーザ辞書を用いてその対象範囲全体を言語解析することで対象範囲全体についての中間表記を生成させる。その際、言語処理部１０は、中間表記辞書を使用せずに中間表記を生成してもよい。中間表記辞書には、アクセントの位置に関する情報が含まれていないためである。そしてアクセント句判定部２３は、対象範囲全体の中間表記を言語処理部１０から受け取る。なお、以下では、便宜上、対象範囲全体の中間表記を修正後中間表記と呼ぶ。アクセント句判定部２３は、修正後中間表記を参照して、アクセントが一つのみ含まれるアクセント句単位で、その修正後中間表記に対応する対象範囲全体の漢字仮名表記を分割する。 For this purpose, the accent phrase determination unit 23 inputs the kanji kana notation of the entire target range to the language processing unit 10, and linguistically analyzes the entire target range using the corrected user dictionary. To generate an intermediate notation for the entire target range. At that time, the language processing unit 10 may generate the intermediate notation without using the intermediate notation dictionary. This is because the intermediate notation dictionary does not include information regarding the position of the accent. Then, the accent phrase determination unit 23 receives the intermediate notation of the entire target range from the language processing unit 10. Hereinafter, for convenience, the intermediate notation of the entire target range is referred to as a corrected intermediate notation. The accent phrase determination unit 23 refers to the corrected intermediate notation, and divides the kanji kana notation of the entire target range corresponding to the corrected intermediate notation in units of accent phrases including only one accent.

韻律修正対象句がアクセント句単位である場合、アクセント句判定部２３は、対象範囲全体を分割したアクセント句のうち、修正単語の漢字仮名表記及び中間表記辞書に登録された韻律修正対象句の漢字仮名表記の少なくとも一部を含むアクセント句を特定する。そしてアクセント句判定部２３は、その特定されたアクセント句を、中間表記辞書に登録されている韻律修正対象句の韻律情報を継承する韻律継承アクセント句とする。一方、アクセント句判定部２３は、韻律継承アクセント句以外のアクセント句を、中間表記辞書に登録された韻律情報を継承しない韻律不継承アクセント句とする。 When the prosodic correction target phrase is an accent phrase unit, the accent phrase determination unit 23 sets the kanji of the prosody correction target phrase registered in the Kanji kana notation and intermediate notation dictionary of the correction word among the accent phrases obtained by dividing the entire target range. Identify accent phrases that contain at least part of the kana notation. Then, the accent phrase determination unit 23 sets the specified accent phrase as a prosodic inherited accent phrase that inherits the prosodic information of the prosody modification target phrase registered in the intermediate notation dictionary. On the other hand, the accent phrase determination unit 23 sets accent phrases other than the prosodic inheritance accent phrase as prosodic non-inherited accent phrases that do not inherit the prosodic information registered in the intermediate notation dictionary.

図５は、中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係の一例を示す図である。この例では、中間表記辞書５００には、韻律修正対象句の漢字仮名表記「Dataです」と、その前後の漢字仮名表記「これは」、「。」と、韻律情報を含む中間表記「デＭーターデス」が登録されている。なお、韻律記号「Ｍ」は、アクセントが中程度であることを表す。ここで、漢字仮名表記「Data」の読みを「データー」から「データ」に変更するべく、ユーザ辞書５１０に、単語「Data」の漢字仮名表記とその中間表記「データ」及び品詞（普通名詞）が登録されたとする。この場合、「これはDataです。」が対象範囲となる。そこで対象範囲「これはDataです。」がユーザ辞書５１０を用いて言語処理される。その結果として、この対象範囲は、二つのアクセント句「これは」（中間表記「コレワ」）、「Dataです」（中間表記「データデＨス」）に分割される。なお、韻律記号「Ｈ」は、アクセントが強であることを表す。この場合、アクセント句「Dataです」に、ユーザ辞書に新規登録された単語「Data」及び中間表記辞書に登録された韻律修正対象句が含まれるので、アクセント句「Dataです」が韻律継承アクセント句となる。 FIG. 5 is a diagram illustrating an example of the relationship between the intermediate notation dictionary and the user dictionary and the prosodic inheritance accent phrase. In this example, the intermediate notation dictionary 500 includes the kanji kana notation “Data” of the prosody modification target phrase, the kanji kana notations “kore” and “.” Before and after the kanji kana notation “de M”. Is registered. The prosodic symbol “M” indicates that the accent is medium. Here, in order to change the reading of the kanji kana notation “Data” from “data” to “data”, the kanji kana notation of the word “Data” and its intermediate notation “data” and the part of speech (common noun) are stored in the user dictionary 510. Is registered. In this case, “This is Data.” Is the target range. Therefore, the target range “This is Data.” Is subjected to language processing using the user dictionary 510. As a result, this target range is divided into two accent phrases “This is” (intermediate notation “Colwa”) and “Data is” (intermediate notation “data des H”). The prosodic symbol “H” represents that the accent is strong. In this case, since the accent phrase “Data is” includes the word “Data” newly registered in the user dictionary and the prosody modification target phrase registered in the intermediate notation dictionary, the accent phrase “Data” is the prosodic inheritance accent phrase. It becomes.

図６は、中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係の他の一例を示す図である。この例では、中間表記辞書６００には、韻律修正対象句の漢字仮名表記「Dataも」と、その前後の漢字仮名表記「良い」、「なく、」と、韻律情報を含む中間表記「デＭーターモ」が登録されている。ここで、漢字仮名表記「Data」の読みを「データー」から「データ」に変更するべく、ユーザ辞書６１０に単語「Data」の漢字仮名表記とその中間表記「データ」及び品詞（普通名詞）が登録されたとする。この場合、「良いDataもなく、」が対象範囲となる。そこでこの対象範囲「良いDataもなく、」が、ユーザ辞書６１０を用いて言語処理される。その結果として、この対象範囲は、二つのアクセント句「良い」（中間表記「ヨＨイ」）「Dataもなく」（中間表記「データデナＨク」）に分割される。この場合、韻律修正対象句とその後置文字列の漢字仮名表記が結合されたアクセント句「Dataもなく」に、ユーザ辞書に新規登録された単語「Data」及び中間表記辞書に登録された韻律修正対象句が含まれる。そのため、アクセント句「Dataもなく」が韻律継承アクセント句となる。 FIG. 6 is a diagram illustrating another example of the relationship between the intermediate notation dictionary and the user dictionary and the prosodic inheritance accent phrase. In this example, in the intermediate notation dictionary 600, the kanji kana notation “Data” of the prosody modification target phrase, the kanji kana notations “good” and “none” before and after that, and the intermediate notation “de M” including prosody information are included. Is registered. Here, in order to change the reading of the kanji kana notation “Data” from “data” to “data”, the kanji kana notation of the word “Data” and its intermediate notation “data” and the part of speech (common noun) are stored in the user dictionary 610. Suppose you are registered. In this case, “There is no good data” is the target range. Therefore, the target range “no good data” is subjected to language processing using the user dictionary 610. As a result, this target range is divided into two accent phrases “good” (intermediate notation “yo hai”) “no data” (intermediate notation “data dena h ku”). In this case, the word “Data” newly registered in the user dictionary and the prosody correction registered in the intermediate notation dictionary are added to the accent phrase “No data” in which the prosodic correction target phrase and the kanji kana notation of the postscript string are combined. The target phrase is included. Therefore, the accent phrase “without data” is the prosodic inheritance accent phrase.

なお、中間表記辞書に登録されている漢字仮名表記がアクセント句単位でないこともある。この場合、アクセント句判定部２３は、対象範囲全体の漢字仮名表記を言語処理部１０へ入力して、言語処理部１０に、修正前のユーザ辞書を用いて言語解析させることでその対象範囲全体の中間表記を生成させてもよい。なお、以下では、便宜上、修正前のユーザ辞書を用いて作成された、対象範囲全体の漢字仮名表記に対応する中間表記を修正前中間表記と呼ぶ。そしてアクセント句判定部２３は、修正前中間表記と修正後中間表記とを比較する。アクセント句判定部２３は、修正前中間表記と修正後中間表記とで、韻律修正対象句の少なくとも一部を含むアクセント句が一致する場合には、ユーザ辞書において修正された単語を含まなくてもその一致するアクセント句を韻律継承アクセント句とする。 In addition, the kanji kana notation registered in the intermediate notation dictionary may not be an accent phrase unit. In this case, the accent phrase determination unit 23 inputs the kanji kana notation of the entire target range to the language processing unit 10, and causes the language processing unit 10 to perform a language analysis using the user dictionary before correction, thereby performing the entire target range. An intermediate notation may be generated. Hereinafter, for the sake of convenience, an intermediate notation created using a user dictionary before correction and corresponding to the kanji kana notation of the entire target range is referred to as an uncorrected intermediate notation. The accent phrase determination unit 23 compares the intermediate notation before correction with the intermediate notation after correction. If the accent phrase including at least a part of the prosody modification target phrase matches between the pre-correction intermediate expression and the corrected intermediate expression, the accent phrase determination unit 23 does not include the word corrected in the user dictionary. The matching accent phrase is taken as the prosodic inheritance phrase.

図７は、中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係のさらに他の一例を示す図である。この例では、中間表記辞書７００には、韻律修正対象句の漢字仮名表記「分別もなく」と、その前後の漢字仮名表記「何の」、「、」と、韻律情報を含む中間表記「ブンベツモナＭク」が登録されている。ここで、漢字仮名表記「分別」の読みを「ブンベツ」から「フンベツ」に変更し、アクセント位置を先頭に変更するべく、ユーザ辞書７１０に漢字仮名表記「分別」とその中間表記「フＡンベツ」及び品詞（普通名詞）が登録されたとする。なお、記号「Ａ」は、アクセント位置を表す。この場合、「何の分別もなく、」が対象範囲となる。そこでこの対象範囲「何の分別もなく、」が、ユーザ辞書７１０を用いて言語処理されることによって修正後中間表記７０１が得られる。その結果として、この対象範囲は、三つのアクセント句「何の」（中間表記「ナンノ」）「分別も」（中間表記「フＨンベツモ」）、「なく」（中間表記「ナＨク」）に分割される。
一方、対象範囲「何の分別もなく、」を、ユーザ辞書７１０を用いずに言語処理することによって修正前中間表記７０２が得られる。その結果として、この対象範囲は、二つのアクセント句「何の」（中間表記「ナンノ」）「分別もなく」（中間表記「ブンベツモナＨク」）に分割される。 FIG. 7 is a diagram showing still another example of the relationship between the intermediate notation dictionary and user dictionary and the prosodic inheritance accent phrase. In this example, in the intermediate notation dictionary 700, the kanji kana notation “no discrimination” of the prosody modification target phrase, the kana kana notations “what”, “,” before and after the kanji kana notation “Bumbetsumona” "M" is registered. Here, in order to change the reading of the kanji kana notation “classification” from “Bumbetsu” to “Humbetsu” and to change the accent position to the top, the user dictionary 710 stores the kana kana kana notation “separation” and its intermediate notation “Humbetsu”. ”And parts of speech (common nouns) are registered. The symbol “A” represents an accent position. In this case, “no discrimination” is the target range. Therefore, the corrected intermediate notation 701 is obtained by performing language processing on the target range “without any discrimination” using the user dictionary 710. As a result, this target range includes three accent phrases “what” (intermediate notation “Nanno”) “fractionation” (intermediate notation “Hu Humbetsumo”), “no” (intermediate notation “Na H Ku”) It is divided into.
On the other hand, the pre-correction intermediate notation 702 is obtained by performing language processing on the target range “without any discrimination” without using the user dictionary 710. As a result, this target range is divided into two accent phrases “what” (intermediate notation “Nanno”) and “without discrimination” (intermediate notation “Bumbetsumona H Ku”).

この場合、修正後中間表記７０１におけるアクセント句「分別も」に、ユーザ辞書に新規登録された単語「分別」及び中間表記辞書に登録された韻律修正対象句の一部が含まれるので、アクセント句「分別も」が韻律継承アクセント句となる。一方、アクセント句「なく」は、韻律修正対象句の一部ではあるが、ユーザ辞書に新規登録された単語「分別」を含まず、かつ、修正前中間表記７０２の何れのアクセント句とも一致しないので、韻律情報を継承しない。 In this case, the accent phrase “classification” in the modified intermediate notation 701 includes the word “classification” newly registered in the user dictionary and part of the prosody modification target phrase registered in the intermediate notation dictionary. "Separation is also a prosodic inheritance accent phrase. On the other hand, the accent phrase “None” is a part of the prosodic correction target phrase, but does not include the word “classification” newly registered in the user dictionary and does not match any accent phrase in the intermediate notation 702 before correction. So do not inherit prosodic information.

図８は、中間表記辞書及びユーザ辞書と韻律継承アクセント句の関係のさらに他の一例を示す図である。この例では、中間表記辞書８００には、韻律修正対象句の漢字仮名表記「機密Data保持で」と、その前後の漢字仮名表記「。」、「、」と、韻律情報を含む中間表記「キミツデＭーター｜ホＨジデ」が登録されている。なお、記号「｜」は、アクセント句の区切りを表す。ここで、漢字仮名表記「Data」の読みを「データー」から「データ」に変更するべく、ユーザ辞書８１０に漢字仮名表記「Data」とその中間表記「データ」及び品詞（普通名詞）が登録されたとする。この場合、「機密Data保持で、」が対象範囲となる。そこでこの対象範囲「機密Data保持で、」を、ユーザ辞書８１０を用いて言語処理することによって修正後中間表記８０１が得られる。そして修正後中間表記８０１は、二つのアクセント句「機密データ」（中間表記「キミツデＨータ」）「保持で」（中間表記「ホＭジデ」）に分割される。 FIG. 8 is a diagram showing still another example of the relationship between the intermediate notation dictionary and user dictionary and the prosodic inheritance accent phrase. In this example, the intermediate notation dictionary 800 includes an intermediate notation “Kimitsude” containing prosody information, including the kanji kana notation “with confidential data” of the prosody modification target phrase, the kanji kana notations “.”, “,” Before and after that. "Mota | Ho Hide" is registered. The symbol “|” represents an accent phrase delimiter. Here, in order to change the reading of kanji kana notation “Data” from “data” to “data”, kanji kana notation “Data”, its intermediate notation “data”, and part of speech (common noun) are registered in the user dictionary 810. Suppose. In this case, “With confidential data retention” is the target range. Therefore, the intermediate range 801 after correction is obtained by performing language processing on the target range “With confidential data retention” using the user dictionary 810. Then, the modified intermediate notation 801 is divided into two accent phrases “confidential data” (intermediate notation “Kimitsude Hota”) “holding” (intermediate notation “Home Mide”).

一方、対象範囲「機密Data保持で、」を、ユーザ辞書８１０を用いずに言語処理することによって修正前中間表記８０２が得られる。そして修正前中間表記８０２は、二つのアクセント句「機密データ」（中間表記「キミツデＨーター」）「保持で」（中間表記「ホＭジデ」）に分割される。この場合、修正前中間表記８０２の二つのアクセント句と修正後中間表記８０１の二つのアクセント句は一致し、かつ、何れのアクセント句も中間表記辞書８００に登録された韻律情報を持つ。そこでアクセント句判定部２３は、二つのアクセント句とも韻律継承アクセント句とする。 On the other hand, by performing language processing on the target range “With confidential data retention” without using the user dictionary 810, an intermediate notation 802 before correction is obtained. The pre-correction intermediate notation 802 is divided into two accent phrases “confidential data” (intermediate notation “Kimitsude Hota”) “holding” (intermediate notation “Home Mide”). In this case, the two accent phrases of the pre-correction intermediate notation 802 and the two accent phrases of the post-correction intermediate notation 801 match, and any accent phrase has prosodic information registered in the intermediate notation dictionary 800. Therefore, the accent phrase determination unit 23 determines that both accent phrases are prosodic inheritance accent phrases.

品詞判定部２４は、対象範囲決定部２２により決定された対象範囲内で修正単語の品詞を判定する。そのために、品詞判定部２４は、言語処理部１０に、修正後のユーザ辞書を用いて対象範囲を言語処理させ、形態素単位の品詞情報を特定する。この場合も、言語処理部１０は、中間表記辞書を使用しなくてよい。そして品詞判定部２４は、修正単語の品詞と修正単語の前後の形態素の品詞とに基づいて、修正単語が複合名詞の一部か否か判定する。すなわち、品詞判定部２４は、修正単語の品詞が名詞でないか、修正単語の前後の形態素の何れも名詞でなければ、修正単語は複合名詞の一部ではなく、ユーザ辞書において修正単語と関連付けられた品詞を、その修正単語の品詞とする。 The part of speech determination unit 24 determines the part of speech of the corrected word within the target range determined by the target range determination unit 22. For this purpose, the part-of-speech determination unit 24 causes the language processing unit 10 to perform language processing on the target range using the modified user dictionary, and specifies the part-of-speech information in morpheme units. Also in this case, the language processing unit 10 does not have to use the intermediate notation dictionary. Then, the part of speech determination unit 24 determines whether or not the correction word is a part of a compound noun based on the part of speech of the correction word and the part of speech of the morpheme before and after the correction word. That is, the part-of-speech determination unit 24 associates the corrected word with the corrected word in the user dictionary instead of being a part of the compound noun if the part of speech of the corrected word is not a noun or if none of the morphemes before and after the corrected word is a noun The part of speech is the part of speech of the corrected word.

また品詞判定部２４は、修正単語が名詞で、かつ、修正単語の前後の何れかの形態素が名詞であれば、修正単語を含む連続する名詞の集合を複合名詞と判定する。この場合、品詞判定部２４は、修正単語に対応する中間表記を参照して、さらに、修正単語のアクセント位置が修正単語中の何れの位置にも規定されていない、いわゆる「平板」であるか否か判定する。さらに品詞判定部２４は、修正単語が複合名詞の終端に位置するか否か判定する。修正単語のアクセントが「平板」であるか、あるいは、修正単語が複合名詞の終端以外に位置していれば、品詞判定部２４は、その複合名詞を、中間表記辞書に登録された韻律情報の継承対象となる韻律継承複合名詞と判定する。
一方、修正単語のアクセントが「平板」でなく、かつ、修正単語が複合名詞の終端に位置している場合には、品詞判定部２４は、その複合名詞を、中間表記辞書に登録された韻律情報の継承対象とならない韻律不継承複合名詞と判定する。この理由は、複合名詞では、複合名詞の終端の名詞のアクセントが「平板」以外の場合には、いわゆるアクセント結合が生じて、複合名詞終端の名詞のアクセントが維持されるので、中間表記辞書の韻律情報が無視されるためである。 Further, if the corrected word is a noun and any morpheme before and after the corrected word is a noun, the part of speech determining unit 24 determines a set of consecutive nouns including the corrected word as a compound noun. In this case, the part-of-speech determination unit 24 refers to the intermediate notation corresponding to the corrected word, and is the so-called “flat plate” in which the accent position of the corrected word is not defined at any position in the corrected word? Judge whether or not. Furthermore, the part-of-speech determination unit 24 determines whether the corrected word is located at the end of the compound noun. If the accent of the corrected word is “flat plate”, or if the corrected word is located at a position other than the end of the compound noun, the part-of-speech determination unit 24 selects the compound noun from the prosodic information registered in the intermediate notation dictionary. It is determined as a prosodic inherited compound noun to be inherited.
On the other hand, if the accent of the corrected word is not “flat plate” and the corrected word is located at the end of the compound noun, the part-of-speech determination unit 24 adds the compound noun to the prosody registered in the intermediate notation dictionary. It is determined as a prosodic non-inherited compound noun that is not subject to information inheritance. This is because in compound nouns, if the noun accent at the end of the compound noun is other than “flat plate”, so-called accent coupling occurs and the noun accent at the end of the compound noun is maintained. This is because prosodic information is ignored.

図９は、複合名詞の一例を示す図である。複合名詞９００は、３個の名詞「機密」、「Data」、「保持」が連結された名詞である。このうち、名詞「Data」が修正単語であるとする。この場合、修正単語「Data」は、複合名詞９００の終端に位置していないので、複合名詞９００は、韻律継承複合名詞である。 FIG. 9 is a diagram illustrating an example of a compound noun. The compound noun 900 is a noun in which three nouns “confidential”, “Data”, and “hold” are connected. Of these, the noun “Data” is a corrected word. In this case, since the corrected word “Data” is not located at the end of the compound noun 900, the compound noun 900 is a prosodic inherited compound noun.

図１０は、複合名詞の他の一例を示す図である。複合名詞１０００は、３個の名詞「次世代」、「オープン」、「クラウド」が連結された複合名詞である。このうち、終端の名詞「クラウド」が修正単語であり、アクセントの位置が「ク」に設定されていると、複合名詞１０００は、韻律不継承複合名詞である。 FIG. 10 is a diagram illustrating another example of a compound noun. The compound noun 1000 is a compound noun in which three nouns “next generation”, “open”, and “cloud” are connected. Among these, if the terminal noun “cloud” is a corrected word and the accent position is set to “ku”, the compound noun 1000 is a prosodic non-inherited compound noun.

修正部２５は、対象範囲内の各アクセント句について、韻律継承アクセント句か否か、及び、韻律継承複合名詞に含まれるか否かによって、対象範囲のうちで中間表記辞書の韻律情報を継承させる範囲を決定する。そして修正部２５は、韻律情報を継承させる範囲について、修正後中間表記のうち、韻律情報継承範囲についての中間表記を、中間表記辞書の韻律情報を用いて修正する。本実施形態では、修正部２５は、下記の規則に従って、韻律情報を継承させる範囲を決定する。
（１）修正単語が複合名詞に含まれない場合、または、修正単語を含む複合名詞が韻律継承複合名詞である場合
この場合、修正部２５は、韻律継承アクセント句について中間表記辞書の韻律情報を継承させ、韻律不継承アクセント句については中間表記辞書の韻律情報を継承させない。
（２）修正単語を含む複合名詞が韻律不継承複合名詞である場合
この場合、修正部２５は、韻律継承アクセント句であっても、韻律不継承複合名詞の一部を含むものについては中間表記辞書の韻律情報を継承させない。また、修正部２５は、その他のアクセント句については、（１）と同様の規則に従って韻律情報を継承するか否か決定する。 The correction unit 25 inherits the prosodic information of the intermediate notation dictionary in the target range depending on whether or not each accent phrase in the target range is a prosodic inheritance accent phrase and whether it is included in the prosodic inherited compound noun. Determine the range. Then, the correction unit 25 corrects the intermediate notation for the prosodic information inheritance range among the corrected intermediate notations using the prosodic information in the intermediate notation dictionary for the range in which the prosodic information is inherited. In the present embodiment, the correction unit 25 determines a range in which the prosodic information is inherited according to the following rules.
(1) When the corrected word is not included in the compound noun, or the compound noun including the corrected word is a prosodic inherited compound noun In this case, the correcting unit 25 uses the prosody information of the intermediate notation dictionary for the prosodic inherited accent phrase. Prosody information is inherited and prosodic information in the intermediate notation dictionary is not inherited for prosodic non-inherited accent phrases.
(2) In the case where the compound noun including the corrected word is a prosodic non-inherited compound noun In this case, the correcting unit 25 performs intermediate notation for a prosodic non-inherited compound noun even if it is a prosodic inherited accent phrase Don't inherit dictionary prosodic information. Further, the correction unit 25 determines whether or not to inherit prosodic information according to the same rules as in (1) for other accent phrases.

図１１は、修正単語が複合名詞に含まれない場合における、韻律情報継承範囲の一例を示す図である。図１１において、ユーザ辞書１１００には、単語「Data」の漢字仮名表記とその中間表記「データ」が登録されている。また、中間表記辞書１１０１には、韻律修正対象句の漢字仮名表記「Dataです」及びその前後の漢字仮名表記「これは」、「。」と、韻律情報を含む中間表記「デＭーターデス」が登録されている。なお、中間表記辞書１１０１は、ユーザ辞書１１００の修正がまだ反映されていないので、「Data」の形態素読み表記が「データー」となっている。この場合、対象範囲「これはDataです」は、二つのアクセント句「これは」（中間表記「コレワ」）、「Dataです」（中間表記「データデＨス」）に分割され、アクセント句「Dataです」が韻律継承アクセント句となる。そしてアクセント句「Dataです」の中間表記には、アクセントの強弱を表す韻律記号が含まれているので、このアクセント句では中間表記辞書に登録されている韻律情報が継承され、韻律記号が「Ｈ」から「Ｍ」に修正される。その結果として、最終的に得られる中間表記１１０２は「コレハ｜データデＭス」となる。 FIG. 11 is a diagram illustrating an example of the prosodic information inheritance range when the corrected word is not included in the compound noun. In FIG. 11, a kanji kana notation of the word “Data” and its intermediate notation “data” are registered in the user dictionary 1100. Also, the intermediate notation dictionary 1101 includes the kanji kana notation “Data” of the prosody modification target phrase and the kanji kana notations “this is” and “.” Before and after that, and the intermediate notation “deuterde” containing prosody information. It is registered. Note that the correction of the user dictionary 1100 has not yet been reflected in the intermediate notation dictionary 1101, so the morphological reading notation of “Data” is “data”. In this case, the target range “This is Data” is divided into two accent phrases “This is” (intermediate notation “Colwa”) and “Data is” (intermediate notation “Data des H”). Is a prosodic inheritance phrase. Since the intermediate notation of the accent phrase “Data” includes prosodic symbols representing the strength of the accent, the prosody information registered in the intermediate notation dictionary is inherited in this accent phrase, and the prosodic symbol is “H”. To "M". As a result, the intermediate notation 1102 that is finally obtained is “Koreha | data des MS”.

図１２は、修正単語が複合名詞に含まれない場合における、韻律情報継承範囲の他の一例を示す図である。図１２において、ユーザ辞書１２００には、単語「分別」の漢字仮名表記とその中間表記「フＡンベツ」が登録されている。また、中間表記辞書１２０１には、韻律修正対象句の漢字仮名表記「分別もなく」及びその前後の漢字仮名表記「何の」、「、」と、韻律情報を含む中間表記「ブンベツモナＭク」が登録されている。なお、中間表記辞書１２０１は、ユーザ辞書１２００の修正がまだ反映されていないので、「分別」の形態素読み表記が「ブンベツ」となっている。この場合、対象範囲「何の分別もなく」は、三つのアクセント句「何の」（中間表記「ナンノ」）、「分別も」（中間表記「フＨンベツモ」）、「なく」（中間表記「ナＨク」）に分割され、アクセント句「分別も」が韻律継承アクセント句となる。そしてアクセント句「分別も」の中間表記には、アクセントの強弱を表す韻律記号が含まれているので、このアクセント句では中間表記辞書に登録されている韻律情報が継承され、韻律記号が「Ｈ」から「Ｍ」に修正される。その結果として、最終的に得られる中間表記１２０２は「ナンノ｜フＭンベツモ｜ナＨク」となる。 FIG. 12 is a diagram illustrating another example of the prosodic information inheritance range when the corrected word is not included in the compound noun. In FIG. 12, a kanji kana notation of the word “classification” and an intermediate notation “Humbetsu” are registered in the user dictionary 1200. The intermediate notation dictionary 1201 also includes an intermediate notation “Bumbetsumona M ku” that includes prosodic information, including kanji notation “no discrimination” of the prosody modification target phrase and kanji notation “what”, “,” before and after that. Is registered. Since the intermediate notation dictionary 1201 does not yet reflect the correction of the user dictionary 1200, the morphological reading notation of “classification” is “Bumbetsu”. In this case, the target range “without any discrimination” has three accent phrases “what” (intermediate notation “Nanno”), “separation too” (intermediate notation “Humbetsumo”), “no” (intermediate notation) The accent phrase “fractionation” becomes the prosodic inheritance accent phrase. Since the intermediate notation of the accent phrase “separation also” includes prosodic symbols representing the strength of the accent, the prosodic information registered in the intermediate notation dictionary is inherited in this accent phrase, and the prosodic symbol is “H To "M". As a result, the finally obtained intermediate notation 1202 is “Nanno | Funbemmo | NaHku”.

図１３は、修正単語が複合名詞に含まれない場合における、韻律情報継承範囲のさらに他の一例を示す図である。図１３において、ユーザ辞書１３００には、単語「Data」の漢字仮名表記とその中間表記「データ」が登録されている。また、中間表記辞書１３０１には、韻律修正対象句の漢字仮名表記「機密Data保持で」及びその前後の漢字仮名表記「。」、「、」と、韻律情報を含む中間表記「キミツデＭーター｜ホＨジデ」が登録されている。なお、中間表記辞書１３０１は、ユーザ辞書１３００の修正がまだ反映されていないので、「Data」の形態素読み表記が「データー」となっている。この場合、対象範囲「機密Data保持で」は、二つのアクセント句「機密Data」（中間表記「キミツデＨータ」）、「ホジデ」（中間表記「ホＭジデ」）に分割され、何れのアクセント句も韻律継承アクセント句となる。そしてアクセント句「機密Data」、「保持で」の両方の中間表記には、それぞれ、アクセントの強弱を表す韻律記号が含まれているので、これらのアクセント句では中間表記辞書に登録されている韻律情報が継承される。その結果、前側のアクセント句では、韻律記号が「Ｈ」から「Ｍ」に修正され、後ろ側のアクセント句では、韻律記号が「Ｍ」から「Ｈ」に修正される。その結果として、最終的に得られる中間表記１３０２は「キミツデＭータ｜ホＨジデ」となる。 FIG. 13 is a diagram showing still another example of the prosodic information inheritance range when the corrected word is not included in the compound noun. In FIG. 13, the user dictionary 1300 registers a kanji kana notation of the word “Data” and its intermediate notation “data”. The intermediate notation dictionary 1301 also includes an intermediate notation “Kimitsude Mutter” containing prosodic information, including the kanji kana notation “with confidential data retention” and the kanji kana notations “.”, “,” Before and after that. "Hoshide" is registered. In the intermediate notation dictionary 1301, the correction of the user dictionary 1300 has not been reflected yet, so the morpheme reading notation of “Data” is “data”. In this case, the target range “Contains confidential data” is divided into two accent phrases “Confidential Data” (intermediate notation “Kimitsude Hota”) and “Hodede” (intermediate notation “HoMdede”). Accent phrases are also prosodic accent phrases. The intermediate notation for both the accent phrases “confidential data” and “holding” includes prosodic symbols that represent the strength of the accents, so these accent phrases have prosody registered in the intermediate notation dictionary. Information is inherited. As a result, the prosodic symbol is corrected from “H” to “M” in the front accent phrase, and the prosodic symbol is corrected from “M” to “H” in the rear accent phrase. As a result, the intermediate notation 1302 that is finally obtained is “Kimitsude Mta | Hoshide”.

図１４は、修正単語が複合名詞に含まれる場合における、韻律情報継承範囲の一例を示す図である。図１４において、ユーザ辞書１４００には、単語「クラウド」の漢字仮名表記とその中間表記「クラウド」が登録されている。また、中間表記辞書１４０１には、韻律修正対象句の漢字仮名表記「次世代オープンクラウドは」及びその前後の漢字仮名表記「。」、「、」と、韻律情報を含む中間表記「ジセＨダイ｜オープンクラＭウドハ」が登録されている。この場合、対象範囲「次世代オープンクラウドは」は、それ自体が一つのアクセント句（中間表記「ジセダイオープンクＨラウドワ」）となる。この場合、「次世代オープンクラウド」は、終端の名詞「クラウド」にアクセントがあるので、韻律不継承複合名詞である。そのため、アクセント句「次世代オープンクラウドは」では、中間表記辞書の韻律情報は継承されない。したがって、最終的に得られる中間表記１４０２は、ユーザ辞書１４００を用いて作成された中間表記と同一である。 FIG. 14 is a diagram illustrating an example of the prosodic information inheritance range when the corrected word is included in the compound noun. In FIG. 14, a kanji kana notation of the word “cloud” and its intermediate notation “cloud” are registered in the user dictionary 1400. Further, the intermediate notation dictionary 1401 includes a Chinese character kana notation “next-generation open cloud is” of the prosodic correction target phrase and kanji kana notations “.”, “,” Before and after that, and an intermediate notation “Gise H die” including prosody information. "Open Cura M Udoha" is registered. In this case, the target range “next generation open cloud” is itself an accent phrase (intermediate notation “jisedai open cloud H loudwa”). In this case, “next-generation open cloud” is a compound noun that is not prosodic because it has an accent on the terminal noun “cloud”. Therefore, the prosodic information of the intermediate notation dictionary is not inherited in the accent phrase “next generation open cloud is”. Therefore, the intermediate notation 1402 finally obtained is the same as the intermediate notation created using the user dictionary 1400.

図１５は、修正単語が複合名詞に含まれる場合における、韻律情報継承範囲の一例を示す図である。図１５において、ユーザ辞書１５００には、単語「クラウド」の漢字仮名表記とその中間表記「クラウド」が登録されている。また、中間表記辞書１５０１には、韻律修正対象句の漢字仮名表記「次世代クラウドサービスは」及びその前後の漢字仮名表記「。」、「、」と、韻律情報を含む中間表記「ジセダイ｜クラウドサＨービスハ」が登録されている。この場合、対象範囲「次世代クラウドサービスは」は、それ自体が一つのアクセント句となる。ここで、複合名詞「次世代クラウドサービス」では、ユーザ辞書において修正された単語「クラウド」は、複合名詞の終端ではないので、複合名詞「次世代クラウドサービス」は韻律継承複合名詞である。そのため、アクセント句「次世代クラウドサービスは」では、中間表記辞書の韻律情報が継承される。 FIG. 15 is a diagram illustrating an example of the prosodic information inheritance range when the corrected word is included in the compound noun. In FIG. 15, a kanji expression for the word “cloud” and its intermediate expression “cloud” are registered in the user dictionary 1500. Further, the intermediate notation dictionary 1501 includes a kanji notation “next generation cloud service is” for the prosody modification target phrase and kanji kana notations “.”, “,” Before and after that, and an intermediate notation “jisedai | cloud service” including prosody information. "Hibisha" is registered. In this case, the target range “next generation cloud service” itself becomes an accent phrase. Here, in the compound noun “next generation cloud service”, the word “cloud” corrected in the user dictionary is not the end of the compound noun, so the compound noun “next generation cloud service” is a prosodic inherited compound noun. Therefore, in the accent phrase “next generation cloud service”, the prosodic information of the intermediate notation dictionary is inherited.

辞書更新部２６は、修正後中間表記のうち、韻律情報継承範囲について中間表記辞書の韻律情報を継承させることで得られる中間表記である韻律継承後中間表記と整合するように、中間表記辞書を更新する。ここで、韻律継承後中間表記は、元の中間表記辞書に登録された韻律修正対象句の漢字仮名表記に、その前置漢字仮名表記と後置漢字仮名表記を連結して得られる対象範囲全体に対して言語解析を行って得られるものである。そのため、韻律継承後中間表記における韻律情報継承範囲と、元の韻律修正対象句の範囲が異なっていることもある。このような場合、辞書更新部２６は、韻律修正対象句を、韻律情報継承範囲に合わせるように修正する。本実施形態では、韻律情報継承範囲は、アクセント句単位で決定されるので、辞書更新部２６は、更新後の韻律修正対象句もアクセント句単位とすることが好ましい。また、韻律情報継承範囲内の読み情報が変更されている場合には、辞書更新部２６は、変更後の読み情報を中間表記辞書の中間表記に反映させる。
辞書更新部２６は、更新後の中間表記辞書を記憶部３に記憶する。 The dictionary update unit 26 updates the intermediate notation dictionary so that it matches the intermediate notation after the prosody inheritance, which is an intermediate notation obtained by inheriting the prosodic information of the intermediate notation dictionary for the prosodic information inheritance range among the corrected intermediate notations. Update. Here, the intermediate notation after prosody inheritance is the entire target range obtained by concatenating the kanji kana notation and the kanji kana notation in the kanji kana notation of the phrase subject to prosody correction registered in the original intermediate notation dictionary. Obtained by performing language analysis on For this reason, the prosodic information inheritance range in the intermediate notation after prosody inheritance may be different from the original prosody modification target phrase range. In such a case, the dictionary updating unit 26 corrects the prosody modification target phrase so that it matches the prosody information inheritance range. In the present embodiment, since the prosodic information inheritance range is determined in units of accent phrases, it is preferable that the dictionary update unit 26 also sets the updated prosody modification target phrase in units of accent phrases. When the reading information within the prosodic information inheritance range has been changed, the dictionary update unit 26 reflects the changed reading information in the intermediate notation of the intermediate notation dictionary.
The dictionary update unit 26 stores the updated intermediate notation dictionary in the storage unit 3.

例えば、再度図１１を参照すると、韻律情報継承範囲は「Dataです」となっており、元の韻律修正対象句と同じである。そのため、辞書更新部２６は、中間表記辞書の韻律修正対象句及びその前置漢字仮名表記及び後置漢字仮名表記を修正せず、中間表記のみを「デＭーターデス」から「データデＭス」に修正する。
また、図１２を参照すると、韻律情報継承範囲はアクセント句「分別も」であり、中間表記辞書に登録されている韻律修正対象句「分別もなく」より短くなっている。そこで辞書更新部２６は、中間表記辞書の韻律修正対象句を「分別もなく」から「分別も」に修正し、「なく」を後置漢字仮名表記に含める。また辞書更新部２６は、中間表記を「ブンベツモナＭク」から「フＭンベツモ」）に修正する。 For example, referring to FIG. 11 again, the prosodic information inheritance range is “Data”, which is the same as the original prosody modification target phrase. Therefore, the dictionary update unit 26 does not correct the prosody modification target phrase of the intermediate notation dictionary and its prefix kanji kana notation and postfix kanji kana notation, and changes only the intermediate notation from “de-Mutades” to “data de MS”. Correct it.
Referring to FIG. 12, the prosodic information inheritance range is the accent phrase “fractionation”, which is shorter than the prosody modification target phrase “no fractionation” registered in the intermediate notation dictionary. Therefore, the dictionary update unit 26 corrects the prosody modification target phrase in the intermediate notation dictionary from “no discrimination” to “fractionation”, and includes “no” in the postscript kanji kana notation. Further, the dictionary updating unit 26 corrects the intermediate notation from “Bumbetsumona M” to “Hunbetsumo”).

図１６は、辞書修正部１３により実行される、辞書修正処理の動作フローチャートである。
検索部２１は、中間表記辞書に登録された韻律修正対象句の漢字仮名表記うち、修正単語を含む漢字仮名表記を検出する（ステップＳ１０１）。対象範囲決定部２２は、検索部２１により検出された漢字仮名表記に対応する、中間辞書に登録された前置漢字仮名表記と後置漢字仮名表記とを、その検出された漢字仮名表記と連結することで対象範囲を決定する（ステップＳ１０２）。 FIG. 16 is an operation flowchart of dictionary correction processing executed by the dictionary correction unit 13.
The search unit 21 detects a kanji kana notation including a corrected word from the kanji kana notations of the prosody correction target phrase registered in the intermediate notation dictionary (step S101). The target range determination unit 22 concatenates the front kanji kana notation and the post kanji kana notation registered in the intermediate dictionary corresponding to the kanji kana notation detected by the search unit 21 with the detected kanji kana notation. Thus, the target range is determined (step S102).

アクセント句判定部２３は、対象範囲全体の漢字仮名表記をアクセント句単位で分割する。そしてアクセント句判定部２３は、そのアクセント句のうち、韻律修正対象句の少なくとも一部と修正単語とを含むアクセント句を、中間表記辞書に登録されている韻律情報を継承する韻律継承アクセント句とする（ステップＳ１０３）。一方、品詞判定部２４は、対象範囲内で修正単語の品詞を判定し、修正単語を終端とする韻律不継承複合名詞があるか否か判定する（ステップＳ１０４）。 The accent phrase determination unit 23 divides the kanji kana notation of the entire target range in units of accent phrases. Then, the accent phrase determination unit 23 selects an accent phrase including at least a part of the prosody modification target phrase and the modified word from the accent phrase and the prosodic inheritance accent phrase that inherits the prosodic information registered in the intermediate notation dictionary. (Step S103). On the other hand, the part-of-speech determination unit 24 determines the part-of-speech of the corrected word within the target range, and determines whether there is a prosodic non-inherited compound noun that ends with the corrected word (step S104).

修正部２５は、対象範囲内で着目するアクセント句に設定されていないアクセント句のうちの先頭を着目するアクセント句に設定する（ステップＳ１０５）。そして修正部２５は、着目するアクセント句が韻律不継承複合名詞に含まれるか否か判定する（ステップＳ１０６）。着目するアクセント句が韻律不継承複合名詞に含まれる場合（ステップＳ１０６−Ｙｅｓ）、修正部２５は、着目するアクセント句を韻律継承範囲から外す（ステップＳ１０７）。一方、着目するアクセント句が韻律不継承複合名詞に含まれない場合（ステップＳ１０６−Ｎｏ）、修正部２５は、着目するアクセント句が韻律継承アクセント句か否か判定する（ステップＳ１０８）。 The correcting unit 25 sets the beginning of the accent phrases that are not set as the focused accent phrase within the target range as the focused accent phrase (step S105). Then, the correcting unit 25 determines whether or not the focused accent phrase is included in the prosodic non-inherited compound noun (step S106). When the focused accent phrase is included in the prosodic non-inherited compound noun (step S106-Yes), the correcting unit 25 removes the focused accent phrase from the prosodic inheritance range (step S107). On the other hand, when the focused accent phrase is not included in the prosodic non-inherited compound noun (step S106-No), the correcting unit 25 determines whether the focused accent phrase is a prosodic inherited accent phrase (step S108).

着目するアクセント句が韻律不継承アクセント句であれば（ステップＳ１０８−Ｎｏ）、修正部２５は、着目するアクセント句を韻律継承範囲から外す（ステップＳ１０７）。一方、着目するアクセント句が韻律継承アクセント句であれば（ステップＳ１０８−Ｙｅｓ）、修正部２５は、着目するアクセント句を韻律継承範囲に含める（ステップＳ１０９）。 If the focused accent phrase is a prosodic non-inheritance accent phrase (step S108-No), the correcting unit 25 removes the focused accent phrase from the prosodic inheritance range (step S107). On the other hand, if the accent phrase of interest is a prosodic inheritance accent phrase (step S108—Yes), the correction unit 25 includes the accent phrase of interest in the prosodic inheritance range (step S109).

ステップＳ１０７またはＳ１０８の後、修正部２５は、着目済みでないアクセント句が残っているか否か判定する（ステップＳ１１０）。着目済みでないアクセント句が残っていれば（ステップＳ１１０−Ｙｅｓ）、修正部２５は、ステップＳ１０５以降の処理を繰り返す。 After step S107 or S108, the correcting unit 25 determines whether or not an accent phrase that has not been noticed remains (step S110). If an accent phrase that has not been noticed remains (step S110-Yes), the correction unit 25 repeats the processing from step S105.

一方、着目済みでないアクセント句が残っていなければ（ステップＳ１１０−Ｎｏ）、修正部２５は、韻律継承範囲の中間表記を、中間表記辞書に登録されている韻律情報を用いて修正することで、韻律継承後中間表記を作成する（ステップＳ１１１）。辞書更新部２６は、韻律継承後中間表記と整合するように、中間表記辞書を更新する（ステップＳ１１２）。そして辞書修正部１３は、辞書修正処理を終了する。なお、辞書修正部１３は、ステップＳ１０３とＳ１０４の処理の順序を入れ替えてもよい。また、ステップＳ１０４にて、韻律不継承複合名詞が無いと判定された場合には、ステップＳ１０６の処理は省略されてもよい。 On the other hand, if there is no remaining accent phrase (step S110-No), the correction unit 25 corrects the intermediate notation of the prosody inheritance range using the prosodic information registered in the intermediate notation dictionary, An intermediate notation after prosody inheritance is created (step S111). The dictionary update unit 26 updates the intermediate notation dictionary so as to be consistent with the intermediate notation after the prosody inheritance (step S112). Then, the dictionary correction unit 13 ends the dictionary correction process. Note that the dictionary correction unit 13 may change the order of the processes in steps S103 and S104. If it is determined in step S104 that there is no prosodic non-inheriting compound noun, the process in step S106 may be omitted.

以上に説明してきたように、この音声合成装置は、中間表記辞書に登録された韻律修正対象句の少なくとも一部に含まれる単語の読み情報が修正されると、その修正に合わせて、中間表記辞書も自動的に更新できる。そのため、この音声合成装置は、言語辞書に登録された単語の読み情報が変更されても、変更後の読み情報に対してその単語を含む句などの韻律情報を反映させることができる。 As described above, when the reading information of the word included in at least a part of the prosodic correction target phrase registered in the intermediate notation dictionary is corrected, the speech synthesizer performs intermediate notation in accordance with the correction. The dictionary can also be updated automatically. Therefore, even if the reading information of a word registered in the language dictionary is changed, this speech synthesizer can reflect prosodic information such as a phrase including the word on the changed reading information.

変形例によれば、中間表記辞書に登録される単語について、その単語の漢字仮名表記及び中間表記とともに、その単語に前置される１以上の単語またはその単語に後続する１以上の単語の何れかのみが登録されてもよい。また、中間表記辞書には、一つの単語が、その単語に前置または後置される１以上の異なる単語の組み合わせごとに、複数登録されてもよい。 According to the modified example, for a word registered in the intermediate notation dictionary, any one of one or more words preceding the word or one or more words following the word, together with kanji kana notation and intermediate notation of the word Only or may be registered. A plurality of one word may be registered in the intermediate notation dictionary for each combination of one or more different words that precede or follow the word.

上記の実施形態では、言語辞書、ユーザ辞書及び中間表記辞書は、それぞれ、別個の辞書としたが、言語辞書、ユーザ辞書及び中間表記辞書は、一つの辞書であってもよい。この場合には、ユーザが中間表記及び読み情報についてした修正内容は、全て言語辞書に反映される。そして言語処理部１０は、言語辞書のうち、ユーザ修正の結果により追加または修正された内容を優先して言語処理を行えばよい。 In the above embodiment, the language dictionary, the user dictionary, and the intermediate notation dictionary are separate dictionaries. However, the language dictionary, the user dictionary, and the intermediate notation dictionary may be a single dictionary. In this case, all corrections made by the user regarding the intermediate notation and reading information are reflected in the language dictionary. And the language processing part 10 should just perform a language process preferentially in the language dictionary, the content added or corrected by the result of user correction.

また、複合名詞のアクセント結合を考慮しなくてよい場合には、品詞判定部２４は省略されてもよい。この場合には、韻律継承アクセント句が韻律継承範囲そのものとなる。 In addition, the part-of-speech determination unit 24 may be omitted when it is not necessary to consider the accent combination of compound nouns. In this case, the prosodic inheritance accent phrase is the prosodic inheritance range itself.

さらに、上記の各実施形態による音声合成装置の処理部が有する各機能をコンピュータに実現させるコンピュータプログラムは、コンピュータによって読み取り可能な媒体、例えば、磁気記録媒体、光記録媒体または半導体メモリに記録された形で提供されてもよい。 Furthermore, a computer program that causes a computer to realize each function of the processing unit of the speech synthesizer according to each of the above embodiments is recorded on a computer-readable medium, for example, a magnetic recording medium, an optical recording medium, or a semiconductor memory. It may be provided in the form.

ここに挙げられた全ての例及び特定の用語は、読者が、本発明及び当該技術の促進に対する本発明者により寄与された概念を理解することを助ける、教示的な目的において意図されたものであり、本発明の優位性及び劣等性を示すことに関する、本明細書の如何なる例の構成、そのような特定の挙げられた例及び条件に限定しないように解釈されるべきものである。本発明の実施形態は詳細に説明されているが、本発明の精神及び範囲から外れることなく、様々な変更、置換及び修正をこれに加えることが可能であることを理解されたい。 All examples and specific terms listed herein are intended for instructional purposes to help the reader understand the concepts contributed by the inventor to the present invention and the promotion of the technology. It should be construed that it is not limited to the construction of any example herein, such specific examples and conditions, with respect to showing the superiority and inferiority of the present invention. Although embodiments of the present invention have been described in detail, it should be understood that various changes, substitutions and modifications can be made thereto without departing from the spirit and scope of the present invention.

以上説明した実施形態及びその変形例に関し、更に以下の付記を開示する。
（付記１）
テキストデータから生成した、該テキストデータの読み及び韻律を表す韻律情報を含む中間表記に基づいて合成音声信号を生成する音声合成装置であって、
少なくとも１以上の単語の漢字仮名表記及び該単語の読み情報と、少なくとも１以上の韻律修正対象句の漢字仮名表記と、前記韻律修正対象句の前記中間表記と、前記韻律修正対象句の前記中間表記に含まれる韻律情報が適用されるときに前記韻律修正対象句と連続する文字列の漢字仮名表記とが登録され、かつ、前記テキストデータから前記中間表記を生成するために利用される言語辞書を記憶し、前記読み情報は前記単語の読み及びアクセント位置に関する情報である記憶部と、
前記少なくとも１以上の単語の何れかの読み情報が修正されると、前記少なくとも１以上の韻律修正対象句の中から、当該読み情報が修正された単語を含む韻律修正対象句を検出する検索部と、
前記読み情報が修正された単語を含む韻律修正対象句と、当該韻律修正対象句と連続する前記文字列とを結合することで得られる文字列を当該韻律修正対象句の韻律情報を継承するか否かを判定する対象範囲として設定する対象範囲決定部と、
前記対象範囲をアクセント句単位に分割し、前記韻律修正対象句の少なくとも一部と前記読み情報が修正された単語を含むアクセント句を、前記韻律修正対象句の韻律情報を継承する韻律継承アクセント句と判定するアクセント句判定部と、
前記修正された単語の読み情報を含む前記言語辞書を用いて前記対象範囲を言語解析することにより得られた中間表記のうちの前記韻律継承アクセント句に対応する部分の中間表記を、前記韻律継承アクセント句についての前記韻律情報で修正する修正部と、
前記韻律修正対象句を前記韻律継承アクセント句で置換するとともに、置換後の前記韻律修正対象句に対応する中間表記を前記韻律継承アクセント句についての修正された中間表記とすることで前記言語辞書を更新する更新部と、
を有する音声合成装置。
（付記２）
前記言語辞書には、当該言語辞書に登録されている単語ごとに、該単語の品詞がさらに登録され、
前記言語辞書を参照して、前記対象範囲内の前記修正された単語の品詞及び前記修正された単語の前後の単語の品詞に基づいて、前記対象範囲内で前記修正された単語が複合名詞の終端に位置する名詞か否か判定し、かつ、前記修正された単語が複合名詞の終端に位置する名詞である場合、前記修正された単語の読み情報を含む前記言語辞書を用いて前記対象範囲を言語解析することにより得られた中間表記に基づいて前記修正された単語にアクセントがあるか否か判定し、前記修正された単語が複合名詞の終端に位置する名詞であり、かつ、前記修正された単語にアクセントがあれば、前記複合名詞を韻律不継承複合名詞とする品詞判定部をさらに有し、
前記修正部は、前記韻律継承アクセント句が前記韻律不継承複合名詞に含まれる場合、当該韻律継承アクセント句の中間表記を修正しない、付記１に記載の音声合成装置。
（付記３）
前記アクセント句判定部は、前記少なくとも１以上の単語の何れかの読み情報が修正される前の前記言語辞書を用いて前記対象範囲を言語解析することにより前記対象範囲をアクセント句単位に分割したときの前記韻律修正対象句に含まれる第１のアクセント句が、前記少なくとも１以上の単語の何れかの読み情報が修正された後の前記言語辞書を用いて前記対象範囲を言語解析することにより前記対象範囲をアクセント句単位に分割したときの前記韻律修正対象句に含まれるアクセント句の何れかと一致する場合、前記第１のアクセント句を前記韻律継承アクセント句とする、付記１または２に記載の音声合成装置。
（付記４）
前記言語辞書を参照して、文字列の漢字仮名表記を言語解析することにより、当該文字列の前記中間表記を作成する言語処理部をさらに有し、
前記アクセント句判定部は、前記対象範囲の漢字仮名表記を前記言語処理部に入力することで前記対象範囲の中間表記を作成させ、当該中間表記におけるアクセントの位置を参照することで、前記対象範囲をアクセント句単位に分割する、付記１に記載の音声合成装置。
（付記５）
テキストデータから合成音声信号を生成するための、該テキストデータの読み及び韻律を表す韻律情報を含む中間表記の作成に利用され、少なくとも１以上の単語の漢字仮名表記と、該単語の読み及びアクセント位置を表す読み情報と、少なくとも１以上の韻律修正対象句の漢字仮名表記と、前記韻律修正対象句の前記中間表記と、前記韻律修正対象句の前記中間表記に含まれる韻律情報が適用されるときに前記韻律修正対象句と連続する文字列の漢字仮名表記とが登録される言語辞書の修正方法であって、
プロセッサが、前記少なくとも１以上の単語の何れかの読み情報が修正されると、前記少なくとも１以上の韻律修正対象句の中から、当該読み情報が修正された単語を含む韻律修正対象句を検出し、
前記プロセッサが、前記読み情報が修正された単語を含む韻律修正対象句と、当該韻律修正対象句と連続する前記文字列とを結合することで得られる文字列を当該韻律修正対象句の韻律情報を継承するか否かを判定する対象範囲として設定し、
前記プロセッサが、前記対象範囲をアクセント句単位に分割し、前記韻律修正対象句の少なくとも一部と前記読み情報が修正された単語を含むアクセント句を、前記韻律修正対象句の韻律情報を継承する韻律継承アクセント句と判定し、
前記プロセッサが、前記修正された読み情報を含む前記言語辞書を用いて前記対象範囲を言語解析することにより得られた中間表記のうちの前記韻律継承アクセント句に対応する部分の中間表記を、前記韻律継承アクセント句についての前記韻律情報で修正し、
前記プロセッサが、前記韻律修正対象句を前記韻律継承アクセント句で置換するとともに、置換後の前記韻律修正対象句に対応する中間表記を前記韻律継承アクセント句についての修正された中間表記とすることで前記言語辞書を更新する、
言語辞書修正方法。
（付記６）
テキストデータから合成音声信号を生成するための、該テキストデータの読み及び韻律を表す韻律情報を含む中間表記の作成に利用され、少なくとも１以上の単語の漢字仮名表記と、該単語の読み及びアクセント位置を表す読み情報と、少なくとも１以上の韻律修正対象句の漢字仮名表記と、前記韻律修正対象句の前記中間表記と、前記韻律修正対象句の前記中間表記に含まれる韻律情報が適用されるときに前記韻律修正対象句と連続する文字列の漢字仮名表記とが登録される言語辞書をコンピュータに修正させるための言語辞書修正用コンピュータプログラムであって、
前記少なくとも１以上の単語の何れかの読み情報が修正されると、前記少なくとも１以上の韻律修正対象句の中から、当該読み情報が修正された単語を含む韻律修正対象句を検出し、
前記読み情報が修正された単語を含む韻律修正対象句と、当該韻律修正対象句と連続する前記文字列とを結合することで得られる文字列を当該韻律修正対象句の韻律情報を継承するか否かを判定する対象範囲として設定し、
前記対象範囲をアクセント句単位に分割し、前記韻律修正対象句の少なくとも一部と前記読み情報が修正された単語を含むアクセント句を、前記韻律修正対象句の韻律情報を継承する韻律継承アクセント句と判定し、
前記修正された読み情報を含む前記言語辞書を用いて前記対象範囲を言語解析することにより得られた中間表記のうちの前記韻律継承アクセント句に対応する部分の中間表記を、前記韻律継承アクセント句についての前記韻律情報で修正し、
前記韻律修正対象句を前記韻律継承アクセント句で置換するとともに、置換後の前記韻律修正対象句に対応する中間表記を前記韻律継承アクセント句についての修正された中間表記とすることで前記言語辞書を更新する、
ことをコンピュータに実行させるための言語辞書修正用コンピュータプログラム。 The following supplementary notes are further disclosed regarding the embodiment described above and its modifications.
(Appendix 1)
A speech synthesizer that generates a synthesized speech signal based on intermediate notation that includes prosody information representing prosody and reading of the text data, generated from text data,
Kanji kana notation of at least one word and reading information of the word, Kanji kana notation of at least one prosody modification target phrase, the intermediate representation of the prosody modification target phrase, and the intermediate of the prosody modification target phrase A language dictionary in which the prosodic correction target phrase and the kanji kana notation of a continuous character string are registered when the prosodic information included in the notation is applied, and used to generate the intermediate notation from the text data The reading information is information about the reading and accent position of the word;
When any reading information of the at least one or more words is corrected, a search unit that detects a prosody correction target phrase including the word whose reading information is corrected from the at least one or more prosody correction target phrases When,
Whether the prosody information of the prosodic correction target phrase is inherited from the prosody correction target phrase including the word whose reading information is corrected and the character string that is continuous with the prosody correction target phrase. A target range determination unit that is set as a target range for determining whether or not;
Prosody inheritance accent phrases that divide the target range into accent phrase units and inherit at least a part of the prosodic correction target phrases and words whose reading information has been corrected, and inherit the prosodic information of the prosodic correction target phrases An accent phrase determination unit for determining
The intermediate notation of the portion corresponding to the prosodic inheritance accent phrase in the intermediate notation obtained by linguistically analyzing the target range using the language dictionary including the corrected word reading information is used as the prosody inheritance. A correction unit for correcting the prosody information about the accent phrase;
The language dictionary is obtained by replacing the prosodic correction target phrase with the prosodic inheritance accent phrase and setting the intermediate notation corresponding to the prosodic correction target phrase after replacement as a corrected intermediate notation for the prosodic inheritance phrase An update section to update;
A speech synthesizer.
(Appendix 2)
For each word registered in the language dictionary, the part of speech of the word is further registered in the language dictionary,
Referring to the language dictionary, based on the part of speech of the corrected word in the target range and the part of speech of the word before and after the corrected word, the corrected word in the target range is a compound noun. It is determined whether or not the noun is located at the end, and when the corrected word is a noun located at the end of a compound noun, the target range using the language dictionary including the reading information of the corrected word Whether or not the corrected word has an accent based on an intermediate notation obtained by linguistic analysis of the word, the corrected word is a noun located at the end of a compound noun, and the correction If the word is accented, it further has a part-of-speech determination unit that uses the compound noun as a prosody-inherited compound noun,
The speech synthesizer according to attachment 1, wherein the correction unit does not correct an intermediate notation of the prosodic inheritance accent phrase when the prosodic inheritance accent phrase is included in the prosody non-inheritance compound noun.
(Appendix 3)
The accent phrase determination unit divides the target range into accent phrase units by performing linguistic analysis on the target range using the language dictionary before any reading information of the at least one word is corrected. When the first accent phrase included in the prosodic correction target phrase at the time is subjected to linguistic analysis of the target range using the language dictionary after the reading information of any of the at least one word is corrected The supplementary note 1 or 2, wherein the first accent phrase is used as the prosodic inheritance accent phrase when it matches any of the accent phrases included in the prosody modification target phrase when the target range is divided into accent phrase units. Voice synthesizer.
(Appendix 4)
A language processing unit that creates the intermediate representation of the character string by referring to the language dictionary and performing linguistic analysis of the kanji kana notation of the character string,
The accent phrase determination unit inputs the kanji kana notation of the target range to the language processing unit to create an intermediate notation of the target range, and refers to the position of the accent in the intermediate notation, thereby The speech synthesizer according to claim 1, wherein the speech synthesizer is divided into accent phrases.
(Appendix 5)
Used to create an intermediate notation including prosodic information representing the reading and prosody of the text data for generating a synthesized speech signal from the text data, and kanji kana notation of at least one word, and the reading and accent of the word The reading information representing the position, at least one kanji kana notation of the prosodic correction target phrase, the intermediate notation of the prosody correction target phrase, and the prosodic information included in the intermediate notation of the prosody correction target phrase are applied. A method for correcting a language dictionary in which sometimes the prosodic correction target phrase and a kanji kana notation of a continuous character string are registered,
When any reading information of the at least one or more words is corrected, the processor detects a prosody correction target phrase including the word whose reading information is corrected from the at least one or more prosody correction target phrases. And
The prosody information of the prosodic correction target phrase is obtained by combining the prosody correction target phrase including the word whose reading information has been corrected and the character string continuous with the prosody correction target phrase. Is set as the target range to determine whether to inherit
The processor divides the target range into accent phrase units, and inherits the prosodic information of the prosodic correction target phrase for an accent phrase including at least a part of the prosody correction target phrase and the word whose reading information is corrected Judge as prosodic inheritance phrase,
The intermediate notation of the portion corresponding to the prosodic inheritance accent phrase in the intermediate notation obtained by linguistically analyzing the target range using the language dictionary including the corrected reading information, Correct with the prosodic information about the prosodic inherited accent phrase,
The processor replaces the prosody modification target phrase with the prosody inheritance accent phrase, and sets the intermediate expression corresponding to the replaced prosody modification target phrase as a modified intermediate expression for the prosody inheritance accent phrase. Updating the language dictionary;
Language dictionary correction method.
(Appendix 6)
Used to create an intermediate notation including prosodic information representing the reading and prosody of the text data for generating a synthesized speech signal from the text data, and kanji kana notation of at least one word, and the reading and accent of the word The reading information representing the position, at least one kanji kana notation of the prosodic correction target phrase, the intermediate notation of the prosody correction target phrase, and the prosodic information included in the intermediate notation of the prosody correction target phrase are applied. A language dictionary correction computer program for causing a computer to correct a language dictionary in which sometimes the prosodic correction target phrase and a kanji kana notation of a continuous character string are registered,
When any reading information of the at least one or more words is corrected, a prosodic correction target phrase including the word whose reading information is corrected is detected from the at least one or more prosody correction target phrases;
Whether the prosody information of the prosodic correction target phrase is inherited from the prosody correction target phrase including the word whose reading information is corrected and the character string that is continuous with the prosody correction target phrase. Set as a target range to determine whether or not
Prosody inheritance accent phrases that divide the target range into accent phrase units and inherit at least a part of the prosodic correction target phrases and words whose reading information has been corrected, and inherit the prosodic information of the prosodic correction target phrases And
The intermediate notation of the portion corresponding to the prosodic inherited accent phrase in the intermediate notation obtained by performing language analysis on the target range using the language dictionary including the corrected reading information is the prosodic inherited accent phrase. Modified with the prosodic information about
The language dictionary is obtained by replacing the prosodic correction target phrase with the prosodic inheritance accent phrase and setting the intermediate notation corresponding to the prosodic correction target phrase after replacement as a corrected intermediate notation for the prosodic inheritance phrase Update,
A computer program for correcting a language dictionary for causing a computer to execute the operation.

１音声合成装置
２入力部
３記憶部
４処理部
５出力部
６スピーカ
１０言語処理部
１１音声合成部
１２辞書登録部
１３辞書修正部
２１検索部
２２対象範囲決定部
２３アクセント句判定部
２４品詞判定部
２５修正部
２６辞書更新部 DESCRIPTION OF SYMBOLS 1 Speech synthesizer 2 Input part 3 Storage | storage part 4 Processing part 5 Output part 6 Speaker 10 Language processing part 11 Speech synthesizer 12 Dictionary registration part 13 Dictionary correction part 21 Search part 22 Target range determination part 23 Accent phrase determination part 24 Part of speech determination Part 25 Correction part 26 Dictionary update part

Claims

A speech synthesizer that generates a synthesized speech signal based on intermediate notation that includes prosody information representing prosody and reading of the text data, generated from text data,
Kanji kana notation of at least one word and reading information of the word, Kanji kana notation of at least one prosody modification target phrase, the intermediate representation of the prosody modification target phrase, and the intermediate of the prosody modification target phrase A language dictionary in which the prosodic correction target phrase and the kanji kana notation of a continuous character string are registered when the prosodic information included in the notation is applied, and used to generate the intermediate notation from the text data The reading information is information about the reading and accent position of the word;
When any reading information of the at least one or more words is corrected, a search unit that detects a prosody correction target phrase including the word whose reading information is corrected from the at least one or more prosody correction target phrases When,
Whether the prosody information of the prosodic correction target phrase is inherited from the prosody correction target phrase including the word whose reading information is corrected and the character string that is continuous with the prosody correction target phrase. A target range determination unit that is set as a target range for determining whether or not;
Prosody inheritance accent phrases that divide the target range into accent phrase units and inherit at least a part of the prosodic correction target phrases and words whose reading information has been corrected, and inherit the prosodic information of the prosodic correction target phrases An accent phrase determination unit for determining
The intermediate notation of the portion corresponding to the prosodic inheritance accent phrase in the intermediate notation obtained by linguistically analyzing the target range using the language dictionary including the corrected word reading information is used as the prosody inheritance. A correction unit for correcting the prosody information about the accent phrase;
The language dictionary is obtained by replacing the prosodic correction target phrase with the prosodic inheritance accent phrase and setting the intermediate notation corresponding to the prosodic correction target phrase after replacement as a corrected intermediate notation for the prosodic inheritance phrase An update section to update;
A speech synthesizer.

For each word registered in the language dictionary, the part of speech of the word is further registered in the language dictionary,
Referring to the language dictionary, based on the part of speech of the corrected word in the target range and the part of speech of the word before and after the corrected word, the corrected word in the target range is a compound noun. It is determined whether or not the noun is located at the end, and when the corrected word is a noun located at the end of a compound noun, the target range using the language dictionary including the reading information of the corrected word Whether or not the corrected word has an accent based on an intermediate notation obtained by linguistic analysis of the word, the corrected word is a noun located at the end of a compound noun, and the correction If the word is accented, it further has a part-of-speech determination unit that uses the compound noun as a prosody-inherited compound noun,
The speech synthesizer according to claim 1, wherein when the prosodic inheritance accent phrase is included in the prosodic non-inheritance compound noun, the correction unit does not correct an intermediate representation of the prosodic inheritance accent phrase.

The accent phrase determination unit divides the target range into accent phrase units by performing linguistic analysis on the target range using the language dictionary before any reading information of the at least one word is corrected. When the first accent phrase included in the prosodic correction target phrase at the time is subjected to linguistic analysis of the target range using the language dictionary after the reading information of any of the at least one word is corrected The said 1st accent phrase is made into the said prosodic inherited accent phrase when it corresponds with either of the accent phrases contained in the said prosody modification object phrase when the said target range is divided | segmented into accent phrase units, The speech synthesizer described.

Used to create an intermediate notation including prosodic information representing the reading and prosody of the text data for generating a synthesized speech signal from the text data, and kanji kana notation of at least one word, and the reading and accent of the word The reading information representing the position, at least one kanji kana notation of the prosodic correction target phrase, the intermediate notation of the prosody correction target phrase, and the prosodic information included in the intermediate notation of the prosody correction target phrase are applied. A method for correcting a language dictionary in which sometimes the prosodic correction target phrase and a kanji kana notation of a continuous character string are registered,
When any reading information of the at least one or more words is corrected, the processor detects a prosody correction target phrase including the word whose reading information is corrected from the at least one or more prosody correction target phrases. And
The prosody information of the prosodic correction target phrase is obtained by combining the prosody correction target phrase including the word whose reading information has been corrected and the character string continuous with the prosody correction target phrase. Is set as the target range to determine whether to inherit
The processor divides the target range into accent phrase units, and inherits the prosodic information of the prosodic correction target phrase for an accent phrase including at least a part of the prosody correction target phrase and the word whose reading information is corrected Judge as prosodic inheritance phrase,
The intermediate notation of the portion corresponding to the prosodic inheritance accent phrase in the intermediate notation obtained by linguistically analyzing the target range using the language dictionary including the corrected reading information, Correct with the prosodic information about the prosodic inherited accent phrase,
The processor replaces the prosody modification target phrase with the prosody inheritance accent phrase, and sets the intermediate expression corresponding to the replaced prosody modification target phrase as a modified intermediate expression for the prosody inheritance accent phrase. Updating the language dictionary;
Language dictionary correction method.

Used to create an intermediate notation including prosodic information representing the reading and prosody of the text data for generating a synthesized speech signal from the text data, and kanji kana notation of at least one word, and the reading and accent of the word The reading information representing the position, at least one kanji kana notation of the prosodic correction target phrase, the intermediate notation of the prosody correction target phrase, and the prosodic information included in the intermediate notation of the prosody correction target phrase are applied. A language dictionary correction computer program for causing a computer to correct a language dictionary in which sometimes the prosodic correction target phrase and a kanji kana notation of a continuous character string are registered,
When any reading information of the at least one or more words is corrected, a prosodic correction target phrase including the word whose reading information is corrected is detected from the at least one or more prosody correction target phrases;
Whether the prosody information of the prosodic correction target phrase is inherited from the prosody correction target phrase including the word whose reading information is corrected and the character string that is continuous with the prosody correction target phrase. Set as a target range to determine whether or not
Prosody inheritance accent phrases that divide the target range into accent phrase units and inherit at least a part of the prosodic correction target phrases and words whose reading information has been corrected, and inherit the prosodic information of the prosodic correction target phrases And
The intermediate notation of the portion corresponding to the prosodic inherited accent phrase in the intermediate notation obtained by performing language analysis on the target range using the language dictionary including the corrected reading information is the prosodic inherited accent phrase. Modified with the prosodic information about
The language dictionary is obtained by replacing the prosodic correction target phrase with the prosodic inheritance accent phrase and setting the intermediate notation corresponding to the prosodic correction target phrase after replacement as a corrected intermediate notation for the prosodic inheritance phrase Update,
A computer program for correcting a language dictionary for causing a computer to execute the operation.