JP3446342B2

JP3446342B2 - Natural language processing method and speech synthesizer

Info

Publication number: JP3446342B2
Application number: JP26080994A
Authority: JP
Inventors: 徹也加賀美
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1994-10-26
Filing date: 1994-10-26
Publication date: 2003-09-16
Anticipated expiration: 2018-09-16
Also published as: JPH08123459A

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a natural language processing method, and to a speech synthesizer, suitable for use where, for example, Japanese text is subjected to natural language processing and speech is synthesized on the basis of the resulting information.

[0002]

[Prior Art] A conventional speech synthesizer obtains phonemic information and prosodic information by applying natural language processing to, for example, a sentence written in mixed kanji and kana, and synthesizes speech corresponding to the input sentence on the basis of that phonemic and prosodic information. In this case, to bring the synthesized speech closer to human speech, the pauses to be inserted into the mixed kanji-kana sentence are usually determined as one item of the prosodic information.

[0003] In a conventional speech synthesizer, pauses are set, for example, at the positions of punctuation marks in the mixed kanji-kana sentence.

[0004]

[Problems to Be Solved by the Invention] Consequently, pause insertion positions have conventionally been set without sufficient regard to grammatical rules, so the resulting synthesized speech sounds unnatural.

[0005] A pause setting method based on dependency relations has therefore also been proposed, as described, for example, in "Prosodic rules for the synthesis of Japanese text speech" (日本語文章音声の合成のための韻律規則; Kawai, Hirose, Fujisaki: Journal of the Acoustical Society of Japan, Vol. 50, No. 6, pp. 433-442, 1994). That method, however, relies mainly on semantic information, and no general-purpose technique for handling such semantic information in natural language processing has been established. Only restricted text can therefore be processed, and it has been difficult to apply natural language processing to unrestricted, general-purpose text. In other words, with that method it has been difficult to set pause insertion positions for every input sentence so that the synthesized speech sounds natural.

[0006] In addition, when the number of morae between inserted pauses was too large (a mora is a unit of pronunciation roughly corresponding to one kana character), a conventional approach was to set another pause at the position halfway through the total mora count, without taking grammatical information into account, so the pause position was sometimes unnatural.
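The grammar-blind re-splitting described above can be illustrated with a minimal sketch. It assumes the pause snaps to the morpheme boundary nearest the halfway point; the function name `midpoint_boundary` and the sample mora counts are hypothetical and do not come from the patent.

```python
from itertools import accumulate
from typing import List


def midpoint_boundary(mora_counts: List[int]) -> int:
    """Return i such that the extra pause is placed after morpheme i.

    The boundary chosen is the one whose cumulative mora count is closest
    to half of the total; no grammatical information is consulted.
    """
    if len(mora_counts) < 2:
        raise ValueError("need at least two morphemes to place a pause between them")
    totals = list(accumulate(mora_counts))
    half = totals[-1] / 2
    # Only boundaries between morphemes are candidates (not the end of the span).
    return min(range(len(mora_counts) - 1), key=lambda i: abs(totals[i] - half))


# Hypothetical span of four morphemes with 3, 2, 4 and 3 morae.
boundary = midpoint_boundary([3, 2, 4, 3])
```

Because the choice depends only on mora counts, the selected boundary may fall, for instance, between a noun and the particle attached to it, which is the kind of unnatural placement the paragraph describes.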

[0007] Conventionally, therefore, the synthesized speech could be hard to hear, or its content hard to understand.

[0008] The present invention has been made in view of these circumstances. It allows pauses to be set at appropriate positions in a sentence and thereby makes it possible, for example, to obtain natural synthesized speech.

[0009]

[Means for Solving the Problems] The natural language processing method of the present invention performs morphological analysis on an input sentence in Japanese; identifies compound words from the morphological analysis result of the input sentence, based on compound word rules that define the grammatical relations between the morphemes constituting a compound word; identifies bunsetsu (phrase units) from the compound word identification result, based on bunsetsu rules that define the grammatical relations between the morphemes constituting a bunsetsu; identifies bunsetsu sequences (renbunsetsu) from the bunsetsu identification result, based on bunsetsu sequence rules that define the grammatical relations between the morphemes constituting a bunsetsu sequence; and, to the resulting bunsetsu sequences, each consisting of one or more morphemes and together constituting the input sentence, applies statistically derived pause setting rules that define the positions of pauses to be inserted in a sentence, thereby setting the positions of the pauses to be inserted between bunsetsu sequences. The method is characterized in that the compound word rules, the bunsetsu rules, or the bunsetsu sequence rules include a joining condition under which morphemes are joined together, and additional information to be attached to the compound word, bunsetsu, or bunsetsu sequence obtained by joining morphemes that satisfy the joining condition, and in that the pause setting rules and the joining conditions are written, where necessary, using the additional information.
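The staged joining described above (morphemes, then compound words, then bunsetsu) can be sketched as follows. This is only an illustration under stated assumptions: the `Unit` and `JoinRule` classes, the part-of-speech labels, and the two sample rules are hypothetical, and the joining condition is simplified to a match on the parts of speech of the two morphemes that meet at the boundary.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Unit:
    """A morpheme, or a larger unit built by joining morphemes."""
    morphemes: List[str]        # surface strings, in order
    pos_first: str              # part of speech of the first morpheme
    pos_last: str               # part of speech of the last morpheme
    info: Optional[str] = None  # additional information attached by a rule


@dataclass
class JoinRule:
    left_pos: str   # joining condition on the last morpheme of the left unit
    right_pos: str  # joining condition on the first morpheme of the right unit
    info: str       # additional information given to the joined unit


def apply_rules(units: List[Unit], rules: List[JoinRule]) -> List[Unit]:
    """One left-to-right pass that joins adjacent units satisfying a rule."""
    out: List[Unit] = []
    for unit in units:
        if out:
            prev = out[-1]
            rule = next(
                (r for r in rules
                 if r.left_pos == prev.pos_last and r.right_pos == unit.pos_first),
                None,
            )
            if rule is not None:
                out[-1] = Unit(prev.morphemes + unit.morphemes,
                               prev.pos_first, unit.pos_last, rule.info)
                continue
        out.append(unit)
    return out


# Hypothetical rules and input; not taken from the patent's rule tables.
compound_rules = [JoinRule("noun", "noun", "compound noun")]
bunsetsu_rules = [JoinRule("noun", "case particle", "bunsetsu")]
morphemes = [
    Unit(["音声"], "noun", "noun"),
    Unit(["合成"], "noun", "noun"),
    Unit(["を"], "case particle", "case particle"),
    Unit(["行う"], "verb", "verb"),
]
compounds = apply_rules(morphemes, compound_rules)
bunsetsu = apply_rules(compounds, bunsetsu_rules)
```

In this sketch the two nouns are first joined into a compound, and the compound is then joined with the following particle because its last morpheme is a noun. A fuller version could also let later rules test the `info` field, in line with the statement that joining conditions may be written using the additional information.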

[0010] In this natural language processing method, when the bunsetsu sequence rules include adverbial modification rules, which define the relations between the morphemes constituting an adverbial-modifier bunsetsu sequence, and adnominal modification rules, which define the relations between the morphemes constituting an adnominal-modifier bunsetsu sequence, adverbial-modifier bunsetsu sequences and adnominal-modifier bunsetsu sequences can each be identified from the bunsetsu identification result on the basis of the adverbial modification rules or the adnominal modification rules, respectively.

[0011] When the morphological analysis result includes at least the morpheme character string and part-of-speech information, the compound word rules, bunsetsu rules, bunsetsu sequence rules, or pause setting rules can be written using the morphological analysis result. In addition to the part of speech itself, the part-of-speech information can include superordinate categories that group parts of speech together and subordinate categories that subdivide a part of speech. The subordinate categories can include at least the conjugation form.

[0012] The additional information can be a superordinate category that groups the parts of speech of morphemes, or a subordinate category that subdivides the part of speech of a morpheme.

[0013] When the compound word rules, bunsetsu rules, or bunsetsu sequence rules further describe a separation condition for separating two joined morphemes, joined morphemes can be separated in accordance with the separation condition.

[0014] When a compound word rule is applied between a compound word consisting of a plurality of morphemes and an adjacent morpheme, that is, a morpheme adjoining the compound word, the rule can be applied between the adjacent morpheme and whichever morpheme of the compound word adjoins it. Likewise, when a bunsetsu rule is applied between a bunsetsu consisting of a plurality of morphemes and an adjacent morpheme, the rule can be applied between the adjacent morpheme and whichever morpheme of the bunsetsu adjoins it. Further, when a bunsetsu sequence rule or a pause setting rule is applied between a bunsetsu sequence consisting of a plurality of morphemes and an adjacent morpheme, the rule can be applied between the adjacent morpheme and whichever morpheme of the bunsetsu sequence adjoins it.

[0015] The pause setting rules can be created by statistically determining, from actual readings of text aloud, how often a pause is inserted between the last morpheme of the preceding one of two adjacent bunsetsu sequences and the first morpheme of the following one. Further, when the pause setting rules include priority rules, which describe setting conditions for placing a pause between the last morpheme and the first morpheme, each with a priority indicating the order of precedence for placing a pause there, and narrowing rules, which describe narrowing conditions for reducing the set of pause positions, candidate pause positions can be determined on the basis of the priority rules, and the final pause positions can then be selected from those candidates on the basis of the narrowing rules.
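The two-step selection described above (priority rules produce candidates, narrowing rules pick the final positions) might be sketched as follows. Everything specific here is an assumption for illustration: the `Boundary` fields, the part-of-speech pairs and their priorities, and in particular the narrowing condition, which is taken to be a minimum mora distance between accepted pauses because this section does not spell out the actual narrowing conditions.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Boundary:
    index: int      # the boundary follows bunsetsu sequence number `index`
    last_pos: str   # part of speech of the last morpheme before the boundary
    first_pos: str  # part of speech of the first morpheme after the boundary
    mora_pos: int   # mora count from the start of the sentence to the boundary


# Hypothetical priority rules: (last_pos, first_pos) -> priority, 1 = highest.
PRIORITY_RULES: Dict[Tuple[str, str], int] = {
    ("conjunctive particle", "noun"): 1,
    ("case particle", "noun"): 2,
    ("case particle", "verb"): 3,
}


def pause_candidates(boundaries: List[Boundary],
                     rules: Dict[Tuple[str, str], int]) -> List[Tuple[int, Boundary]]:
    """Keep only boundaries matched by a priority rule, with their priority."""
    out = []
    for b in boundaries:
        priority = rules.get((b.last_pos, b.first_pos))
        if priority is not None:
            out.append((priority, b))
    return out


def narrow(candidates: List[Tuple[int, Boundary]], min_gap: int) -> List[Boundary]:
    """Assumed narrowing rule: accept candidates in priority order, skipping
    any that would lie fewer than `min_gap` morae from an accepted pause."""
    chosen: List[Boundary] = []
    for _, b in sorted(candidates, key=lambda c: (c[0], c[1].index)):
        if all(abs(b.mora_pos - c.mora_pos) >= min_gap for c in chosen):
            chosen.append(b)
    return sorted(chosen, key=lambda b: b.index)


boundaries = [
    Boundary(0, "case particle", "noun", 4),
    Boundary(1, "conjunctive particle", "noun", 7),
    Boundary(2, "case particle", "verb", 15),
    Boundary(3, "noun", "noun", 18),
]
final = narrow(pause_candidates(boundaries, PRIORITY_RULES), min_gap=5)
pause_indices = [b.index for b in final]
```

In this example boundary 3 is never a candidate because no priority rule matches it, and boundary 0 is dropped during narrowing because it lies only three morae from the higher-priority boundary 1.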

[0016] The speech synthesizer of the present invention comprises generating means (for example, the language processing unit 3 and the prosody processing unit 8 shown in FIG. 1) for obtaining phonemic information and prosodic information corresponding to an input sentence in Japanese by applying natural language processing to that input sentence, and synthesizing means (for example, the acoustic processing unit 11 shown in FIG. 1) for synthesizing speech corresponding to the input sentence on the basis of the phonemic information and the prosodic information. The prosodic information includes the positions of pauses to be inserted into the input sentence, and the generating means is characterized in that it obtains the pause positions by the natural language processing method according to any one of claims 1 to 12.

[0017]

[Operation] In the natural language processing method of the present invention, the input sentence is morphologically analyzed, and compound words, bunsetsu, and bunsetsu sequences are identified from the morphological analysis result on the basis of the compound word rules, bunsetsu rules, and bunsetsu sequence rules, which define the grammatical relations between morphemes. The statistically derived pause setting rules are then applied in turn to the identification result, thereby setting the positions of the pauses to be inserted between bunsetsu sequences. The compound word rules, bunsetsu rules, or bunsetsu sequence rules include a joining condition under which morphemes are joined together, and additional information to be attached to the compound word, bunsetsu, or bunsetsu sequence obtained by joining morphemes that satisfy the joining condition; the pause setting rules and the joining conditions are written, where necessary, using the additional information. Joining morphemes therefore makes it possible to give additional information to the joined morphemes as a whole, and because the pause setting rules and joining conditions are written using that additional information, pauses can be set according to the combination of morphemes being joined. In other words, pauses can be inserted at appropriate positions in an input sentence without placing any particular restriction on the input sentence.

[0018] In the speech synthesizer of the present invention, phonemic information and prosodic information corresponding to an input sentence in Japanese are obtained by applying natural language processing to that input sentence, and speech corresponding to the input sentence is synthesized on the basis of the phonemic and prosodic information. Here, the positions of the pauses to be inserted into the input sentence, which are one item of the prosodic information, are obtained by the natural language processing method according to any one of claims 1 to 12. Natural, easily understood synthesized speech can therefore be obtained without placing any particular restriction on the input sentence.

[0019]

[Embodiment] FIG. 1 is a block diagram showing the configuration of an embodiment of a speech synthesizer to which the present invention is applied. This speech synthesizer generates synthesized speech corresponding to Japanese input, for example a sentence in mixed kanji and kana supplied as text data (hereinafter simply called the input sentence).

[0020] The speech synthesizer consists broadly of an arithmetic unit 1 and a memory device 2. The arithmetic unit 1 consists of three basic processing units: a language processing unit 3, a prosody processing unit 8, and an acoustic processing unit 11. The memory device 2 stores the dictionaries (a kanji dictionary and others), rules (morphological analysis rules, joining rules, pause setting rules, and parameter generation rules), and data (prosody control models and phoneme segment data) used by the arithmetic unit 1.

[0021] The language processing unit 3 consists of a morphological analysis unit 4, a joining processing unit 5, a pause setting processing unit 6, and a phonetic symbol generation unit 7. It applies natural language processing to the input sentence and generates (extracts) the phonemic information of the speech to be synthesized (the reading, that is, the pronunciation, of the input sentence) and its prosodic information (for example, prosodic phrases, accent phrases (accent types), pause positions and lengths, and other information).

[0022] The prosody processing unit 8 consists of a prosody control model parameter generation unit 9 and a prosody data generation unit 10. On the basis of the phonemic and prosodic information generated by the language processing unit 3, the prosody control model parameter generation unit 9 (hereinafter called the parameter generation unit 9) generates the parameters (model control parameters) that drive the models controlling the prosodic features of the synthesized speech, in accordance with the parameter generation rules stored in the memory device 2.

[0023] The prosody data generation unit 10 drives the various models stored in the memory device 2 that control the prosodic features of the synthesized speech (the prosody control models), using the parameters generated by the parameter generation unit 9, and thereby generates concrete numerical values representing the prosodic features of the synthesized speech (the values computed from the models), that is, the prosody data.

[0024] The acoustic processing unit 11 synthesizes speech on the basis of the phonemic information generated by the language processing unit 3 and the prosody data generated by the prosody data generation unit 10. It sends the resulting synthesized speech to a synthesized speech unit (not shown), which outputs it from, for example, a built-in loudspeaker.

[0025] The information generated by the language processing unit 3 (the phonemic-prosodic information described later) is supplied not only to the parameter generation unit 9 in the following stage but also, via the parameter generation unit 9, to the prosody data generation unit 10 and the acoustic processing unit 11. This is because the processing in the prosody data generation unit 10 and the acoustic processing unit 11 uses the phonemic information contained in the phonemic-prosodic information.

[0026] The operation is described next. When an input sentence is supplied to the language processing unit 3, natural language processing is applied to it.

[0027] First, the morphological analysis unit 4 analyzes the input sentence using the dictionaries and the morphological analysis rules, breaking it down into morphemes and determining the pronunciation (reading), part-of-speech information, mora count, and so on of each.

[0028] The morphological analysis unit 4 also assigns an accent to each analyzed morpheme (the accent the word has when uttered on its own, hereinafter called the basic accent where appropriate) and, as necessary, applies processing such as vowel lengthening and vowel devoicing. The result is supplied to the joining processing unit 5.

[0029] The joining processing unit 5 joins the morphemes constituting the input sentence on the basis of the joining rules and, where necessary, separates the joined results. This identifies the bunsetsu sequences (phrases) of the input sentence (units consisting of one or more bunsetsu, and therefore also of one or more morphemes) and, at the same time, identifies the positions between bunsetsu sequences at which pauses are to be set.

[0030] The pause setting processing unit 6 then sets pauses between the identified bunsetsu sequences on the basis of the pause setting rules. Along with each pause, the pause setting processing unit 6 also sets a priority for that pause (described later), and then determines the final pause positions (hereinafter called pause information where appropriate) taking into account the number of morae between pauses.

[0031] After the pause information has been set, the phonetic symbol generation unit 7 derives the structure of the input sentence from the result and, on the basis of that structure, obtains the prosodic phrases and other information. The phonetic symbol generation unit 7 then generates, from the output of the pause setting processing unit 6 and the morphological analysis result, information that expresses the phonemic information and the prosodic information in symbols and characters (because it contains both phonemic and prosodic information, it is hereinafter called phonemic-prosodic information).

[0032] When morphemes (words) are connected to form a sentence, the accent position of a morpheme may differ between when it is uttered within the sentence and when it is uttered in isolation. When generating the phonemic-prosodic information, the phonetic symbol generation unit 7 therefore also determines the accent positions within each phrase, that is, the accent phrases, on the basis of, for example, predetermined accent shift rules.

[0033] In this way, the language processing unit 3 generates phonemic-prosodic information consisting of the reading of the input sentence (the phonemic information) and the prosodic information, which contains the prosodic phrases, accent phrases, and other prosody-related information needed to control the prosodic features of the synthesized speech corresponding to the input sentence, and outputs it to the parameter generation unit 9.

[0034] On the basis of the phonemic-prosodic information from the language processing unit 3 and the parameter generation rules stored in the memory device 2, the parameter generation unit 9 generates the parameters that drive the prosody control models stored in the memory device 2, which control prosodic features of the synthesized speech such as the fundamental frequency (pitch frequency), phoneme duration, and power (for example, power per phoneme).

[0035] Suppose, for example, that the memory device 2 stores the so-called Fujisaki model as the prosody control model for obtaining the fundamental frequency, and a model based on so-called quantification theory type I as the prosody control model for obtaining duration and power. The parameter generation unit 9 then sets, as the parameters driving the Fujisaki model, for example the magnitudes and positions (the times at which the commands are issued) of the phrase commands and accent commands, and sets, as the parameters driving the quantification theory type I model, the categories of the factors (for example, the phoneme being synthesized and the phonemes before and after it). The pause information set by the pause setting processing unit 6 is used by the parameter generation unit 9 to determine the positions of the phrase commands.
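As a rough illustration of how phrase and accent commands drive a fundamental frequency contour, the sketch below evaluates the commonly cited form of the Fujisaki model, ln F0(t) = ln Fb + Σ Ap·Gp(t − T0) + Σ Aa·(Ga(t − T1) − Ga(t − T2)). The constants `alpha`, `beta`, and `gamma`, the base frequency, and the command timings are illustrative assumptions, not values from the patent; the second phrase command is simply imagined as following a pause.

```python
import math
from typing import List, Tuple


def phrase_response(t: float, alpha: float = 3.0) -> float:
    """Impulse response of the phrase control mechanism (zero before onset)."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0 else 0.0


def accent_response(t: float, beta: float = 20.0, gamma: float = 0.9) -> float:
    """Step response of the accent control mechanism, capped at gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def log_f0(t: float,
           base_hz: float,
           phrase_cmds: List[Tuple[float, float]],
           accent_cmds: List[Tuple[float, float, float]]) -> float:
    """Natural log of F0 at time t (seconds).

    phrase_cmds: (magnitude, onset time) pairs.
    accent_cmds: (amplitude, onset time, offset time) triples.
    """
    value = math.log(base_hz)
    value += sum(ap * phrase_response(t - t0) for ap, t0 in phrase_cmds)
    value += sum(aa * (accent_response(t - t1) - accent_response(t - t2))
                 for aa, t1, t2 in accent_cmds)
    return value


# Hypothetical commands: a second phrase command is placed at 1.5 s,
# imagined here as the point right after a pause.
phrase_cmds = [(0.5, 0.0), (0.3, 1.5)]
accent_cmds = [(0.4, 0.2, 0.6)]
f0_at_300ms = math.exp(log_f0(0.3, 120.0, phrase_cmds, accent_cmds))
```

Before the first command takes effect the contour sits at the base frequency, and each phrase command restarts the slowly decaying phrase component, which is why the pause positions matter for where phrase commands are placed.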

[0036] Besides those described above, other conventionally known prosody control models can also be used. The parameter generation unit 9 must, however, generate parameters that match the prosody control models stored in the memory device 2.

[0037] In the prosody data generation unit 10, the prosody control models stored in the memory device 2, for example the Fujisaki model and the quantification theory type I model, are driven by the parameters generated (set) by the parameter generation unit 9, and prosody data is output as a result. That is, driving the Fujisaki model outputs the fundamental frequency and the like as prosody data, and driving the quantification theory type I model outputs the power, duration, and the like as prosody data.

[0038] Specifically, the prosody data output to the acoustic processing unit 11 includes the fundamental frequency value for each phrase of the synthesized speech, the number of frames indicating the duration of each phoneme, and the number of frames indicating the power value of each syllable.

[0039] The acoustic processing unit 11 consists of a synthesizer that performs speech synthesis by a conventional technique, for example a speech synthesizer based on concatenation of waveform segments or a so-called formant synthesizer. Suppose the acoustic processing unit 11 is a waveform-concatenation synthesizer and the memory device 2 stores the speech segment data needed for synthesis by rule (for example, digitized waveform data) in units such as CV or CVC/VCV. The acoustic processing unit 11 then connects the required speech segment data (phoneme segment data) into a continuous speech waveform on the basis of the phonemic information in the phonemic-prosodic information generated by the language processing unit 3 and the prosody data output from the prosody data generation unit 10 (concrete numerical values of the fundamental frequency, duration, power, and so on). The acoustic processing unit 11 also inserts pauses into the speech waveform on the basis of the pause information in the phonemic-prosodic information generated by the language processing unit 3. The synthesized speech unit then applies the necessary processing, such as D/A conversion, to this speech waveform and outputs it from its built-in loudspeaker.
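The concatenation step with pause insertion can be pictured with a deliberately simplified sketch. It assumes each stored segment is already a list of samples and that a pause is a run of zero-valued samples; the function name and arguments are hypothetical, and a real waveform-concatenation synthesizer would also adjust pitch, duration, and power and smooth the joins.

```python
from typing import List, Sequence, Set


def concatenate_with_pauses(segments: Sequence[Sequence[float]],
                            pause_after: Set[int],
                            pause_samples: int) -> List[float]:
    """Join segments in order, adding silence after the listed segment indices."""
    wave: List[float] = []
    for i, segment in enumerate(segments):
        wave.extend(segment)
        if i in pause_after:
            # Silence is modelled as zero-valued samples.
            wave.extend([0.0] * pause_samples)
    return wave


# Three toy segments with a three-sample pause after the second one.
wave = concatenate_with_pauses([[0.1, 0.2], [0.3], [0.4, 0.5]],
                               pause_after={1}, pause_samples=3)
```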

[0040] Synthesized speech corresponding to the input sentence is output in this way.

[0041] Next, the processing in the morphological analysis unit 4 of the language processing unit 3 is described in detail.

[0042] As described above, the morphological analysis unit 4 breaks the input sentence down into morphemes, which correspond to words, on the basis of the dictionaries and the morphological analysis rules, and determines the pronunciation (reading), accent, part-of-speech information, mora count, and so on of each. Of these, the part-of-speech information is assigned to each morpheme by referring to part-of-speech information (including conjugation forms) contained in the morphological analysis rules, for example as shown in FIGS. 2 to 4.

[0043] The part-of-speech information (including conjugation forms) shown in FIGS. 2 to 4 is also contained in the joining rules and the pause setting rules. The part-of-speech information referred to here covers not only parts of speech in the usual sense but also, for example, "digit", "alphabet", and other symbols (hereinafter these are also treated as parts of speech). It further includes superordinate categories that group parts of speech together, subordinate categories that subdivide a part of speech, and conjugation forms. In the following, the part-of-speech information (conjugation forms) shown in FIGS. 2 to 4 is called the part-of-speech information dictionary, to distinguish it from the part-of-speech information assigned to a morpheme.

[0044] In the part-of-speech information dictionary, part-of-speech information is registered in a hierarchical classification with "all parts of speech" as the top level (the topmost category).

[0045] For example, the subordinate category "ka-row godan verb" in FIG. 2 belongs to the part of speech "verb" one level up, and the part of speech "verb" belongs to the superordinate category "inflecting word" (yōgen). "Inflecting word" in turn belongs to the superordinate category "independent word" above it, and "independent word" belongs to the topmost category "all parts of speech".

【００４６】また、例えば図２の「固有名詞」は「名
詞」という上位の品詞（この場合、「名詞」は、「固有
名詞」の上位分類となる）に含まれ、「名詞」は、その
上位の「自立語」という上位分類に含まれる。「自立
語」は、上述したように最上位分類の「全品詞」に含ま
れる。Similarly, the "proper noun" in FIG. 2 belongs to the higher-level part of speech "noun" (here "noun" is the upper classification of "proper noun"), and "noun" belongs to the upper classification "independent word". As described above, "independent word" belongs to the highest classification, "all parts of speech".

【００４７】また、形態素解析部４から、形態素解析結
果として出力される品詞情報には、必要に応じて活用形
が含められるようになされている。即ち、形態素解析結
果は、例えば「助動詞／連用形」などように出力される
ようになされている。活用形は、図４に示すように、品
詞情報辞書の最後に、階層構造とは別に規定されてい
る。The part-of-speech information that the morphological analysis unit 4 outputs as its analysis result includes an inflected form where necessary. That is, the result is output as, for example, "auxiliary verb/continuative form". As shown in FIG. 4, the inflected forms are defined at the end of the part-of-speech information dictionary, separately from the hierarchy.

【００４８】なお、活用形は、品詞（但し、動詞や助動
詞などの活用がある品詞）の活用を表すから、活用形に
よって品詞を細分化することができる。従って、活用形
は、下位分類として取り扱う。An inflected form expresses how a part of speech inflects (this applies only to parts of speech that inflect, such as verbs and auxiliary verbs), so a part of speech can be subdivided by its inflected forms. Inflected forms are therefore treated as lower classifications.

【００４９】また、図２乃至図４には図示していない
が、必要に応じて、各品詞の最下位分類として、形態素
そのものを記述しておくことが可能である。即ち、例え
ば「名詞」の下位分類である「数詞」のさらに下位分類
として、具体的な数値を記述しておくことが可能であ
る。Although not shown in FIGS. 2 to 4, a morpheme itself can be written, where necessary, as the lowest classification of a part of speech. For example, specific numerical values can be written as a further lower classification of "numeral", which is itself a lower classification of "noun".

【００５０】品詞情報を、このような階層構造にしてお
くことにより、各種規則を記述する際に、効率的な記述
をすることができる。即ち、例えば、図３に示す「数
字」、「アルファベット」、「ひらがな」、「かたか
な」、「ギリシャ文字」、「ロシア文字」、および「単
漢字」のすべてを対象とする規則を記述する場合、これ
らのすべてを記述するのではなく、これらの上位の品詞
である「単漢字」を記述するだけで済むことになる。Organizing the part-of-speech information in such a hierarchy makes it possible to write the various rules efficiently. For example, a rule that applies to all of "digit", "alphabet", "hiragana", "katakana", "Greek letter", "Russian letter", and "single kanji" shown in FIG. 3 need not list every one of them; it is enough to write the higher-level part of speech, "single kanji".
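
The benefit of the hierarchy can be illustrated with a minimal Python sketch. The parent links below are taken only from the examples given in the description (the full contents of FIGS. 2 to 4 are not reproduced here), and the names PARENT and is_a are hypothetical, introduced purely for illustration.

```python
# Parent links copied from the examples in the description; the complete
# dictionary of FIGS. 2 to 4 is assumed to contain many more entries.
PARENT = {
    "ka-row godan verb": "verb",
    "verb": "inflecting word",
    "inflecting word": "independent word",
    "proper noun": "noun",
    "numeral": "noun",
    "noun": "independent word",
    "independent word": "all parts of speech",
}


def is_a(pos, category):
    """Return True if `pos` equals `category` or lies below it in the hierarchy."""
    while pos is not None:
        if pos == category:
            return True
        pos = PARENT.get(pos)  # None once the top of the hierarchy is passed
    return False


# A rule written against "independent word" covers verbs and nouns alike.
print(is_a("ka-row godan verb", "independent word"))
print(is_a("proper noun", "verb"))
```

With such a lookup, a rule only has to name one upper classification instead of listing every part of speech beneath it.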

【００５１】また、上述したように、最下位分類とし
て、形態素そのものである、例えば具体的な数値を記述
しておくことにより、その具体的数値を除く「数詞」に
対しては、ある原則的な規則を適用するとともに、その
具体的数値にのみ、例外的な規則を適用することを、容
易に行うことができる。この場合、規則を、いわば細か
に記述することが可能となるので、処理の精度を向上さ
せることができる（実際の発話に対応したポーズの設定
を行うことができる）。As described above, writing a morpheme itself, such as a specific numerical value, as the lowest classification makes it easy to apply a general rule to "numerals" other than that specific value while applying an exceptional rule to that value alone. Because rules can be written at this finer granularity, processing accuracy improves (pauses can be set to match actual speech).
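
One way to picture the general-rule-plus-exception arrangement is a lookup that walks from the morpheme itself up through its classifications and stops at the first level that has a rule. This is only a sketch under that assumption; the description does not say how the lookup is implemented, and the morphemes "3" and "7", the rule labels, and the function name lookup_rule are all hypothetical.

```python
def lookup_rule(morpheme, rules, parent):
    """Return the rule attached to the most specific classification of `morpheme`."""
    node = morpheme
    while node is not None:
        if node in rules:
            return rules[node]
        node = parent.get(node)  # move one level up the hierarchy
    return None


# Hypothetical data: two concrete numerals registered below "numeral".
parent = {"3": "numeral", "7": "numeral", "numeral": "noun"}
rules = {
    "numeral": "general numeral rule",
    "3": "exceptional rule for this value only",
}

print(lookup_rule("7", rules, parent))
print(lookup_rule("3", rules, parent))
```

The general rule stays in one place, and the exception is attached only to the value that needs it.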

【００５２】次に、図５は、例えば入力文「昨夜男は店
員に押えられ、丸の内警察に窃盗の疑いで逮捕され
た。」が入力された場合における形態素解析部４の形態
素解析結果を示している。なお、形態素解析部４では、
入力文が形態素に分析され、図２乃至図４に示した品詞
情報辞書その他（例えば、形態素解析規則など）を参照
して、各形態素に対し、品詞情報その他が付与される
が、図５では、本願発明に関係する項目だけを示してあ
る。Next, FIG. 5 shows the morphological analysis result produced by the morphological analysis unit 4 for the input sentence 「昨夜男は店員に押えられ、丸の内警察に窃盗の疑いで逮捕された。」 ("Last night the man was held down by a store clerk and arrested by the Marunouchi police on suspicion of theft."). The morphological analysis unit 4 divides the input sentence into morphemes and, referring to the part-of-speech information dictionary shown in FIGS. 2 to 4 and other resources (for example, morphological analysis rules), assigns part-of-speech information and other information to each morpheme; FIG. 5 shows only the items relevant to the present invention.

【００５３】即ち、図５においては、入力文「昨夜男は
店員に押えられ、丸の内警察に窃盗の疑いで逮捕され
た。」に含まれる形態素、その発音、アクセントの型、
品詞情報、拍数を、その左端の欄から右に順次示してあ
る。That is, FIG. 5 lists, from the leftmost column to the right, the morphemes contained in the input sentence above, their pronunciation, accent type, part-of-speech information, and number of beats (morae).

【００５４】ここで、形態素の発音（読み）は、原則と
してひらがなで示してあるが、長音は「ー」で、鼻濁音
は「゜」で、それぞれ示してある。また、アクセント
は、アクセント核（日本語は、アクセントのある拍の直
後、発話のレベルが高レベルから低レベルに落下する
が、この落下する部分がアクセント核）が、形態素の先
頭から何拍目にあるかを示すアクセント型によって示し
てある。但し、アクセント核のないものは、０型として
ある。また、付属語類（例えば助詞や句読点など）に
は、アクセントが付与されず、そのアクセントの欄に
は、「＊」印を示してある。Here, the pronunciation (reading) of each morpheme is given in hiragana as a rule, except that long vowels are written as 「ー」 and nasalized voiced sounds (bidakuon) as 「゜」. The accent is given as an accent type, which indicates on which beat, counted from the start of the morpheme, the accent nucleus falls (in Japanese, the level of the utterance drops from high to low immediately after the accented beat, and the point of this drop is the accent nucleus). Morphemes without an accent nucleus are type 0. Attached words (付属語; for example, particles and punctuation marks) are given no accent, and their accent column shows 「＊」.

【００５５】さらに、図５における品詞情報は、図２乃
至図４に示した品詞情報辞書に記述されている品詞情報
である。なお、図５における品詞情報のうち、（）内、
／以降、および・以降は、下位分類を示している。即
ち、品詞情報の、例えば１行目における「名詞（副詞用
法）」というのは、その形態素の品詞は「名詞」であ
り、その用法が副詞的であることを示している。但し、
形態素「昨夜」の品詞情報は、実際には、図２から「名
詞・普通名詞（副詞用法）」となるが、図５において
は、記述が煩雑になるので、「名詞（副詞用法）」とし
てある。The part-of-speech information in FIG. 5 is the part-of-speech information described in the part-of-speech information dictionary shown in FIGS. 2 to 4. Within the part-of-speech information in FIG. 5, the text inside parentheses （）, after a slash ／, and after a middle dot ・ indicates lower classifications. For example, "noun (adverbial usage)" in the first row indicates that the part of speech of the morpheme is "noun" and that it is used adverbially. Strictly, according to FIG. 2, the part-of-speech information of the morpheme 「昨夜」 ("last night") is "noun・common noun (adverbial usage)", but FIG. 5 abbreviates it to "noun (adverbial usage)" to keep the notation simple.

【００５６】さらに、品詞情報の、例えば１８行目にお
ける「助動詞／連用形」というのは、その形態その品詞
が「助動詞」であり、その活用が「連用形」であること
を示しいている。Similarly, "auxiliary verb/continuative form" in the 18th row of the part-of-speech information indicates that the part of speech of the morpheme is "auxiliary verb" and that its inflected form is the "continuative form".

【００５７】また、各形態素の拍数は、原則として数字
で示してあるが、句読点の拍数はないものとして、＊印
で示してある。The number of beats of each morpheme is given as a number as a rule, but punctuation marks are treated as having no beats and are marked with ＊.

【００５８】次に、以上のような形態素解析結果を処理
する結合処理部５の詳細、およびその後段のポーズ設定
処理部６の詳細について説明する。Next, the combination processing unit 5, which processes the morphological analysis result described above, and the pause setting processing unit 6 that follows it are described in detail.

【００５９】図６および図７は、結合処理部５で用いら
れる結合規則の例を示している。FIGS. 6 and 7 show examples of the combination rules used by the combination processing unit 5.

【００６０】この結合規則は、例えば（１）乃至（９）
の９個の項目を１単位として記述されている。なお、ポ
ーズ設定処理部６で参照するポーズ設定規則は、（１）
乃至（９）のうちの（９）を除く８個の項目でなる条件
を１単位として記述されている（ポーズ設定規則につい
ては後述する）。Each combination rule is written as one unit made up of, for example, nine items, (1) to (9). Each pause setting rule referred to by the pause setting processing unit 6 is written as one unit consisting of the eight items that remain when (9) is removed from (1) to (9) (the pause setting rules are described later).

【００６１】即ち、結合規則には、左端から右へ順番
に、（１）条件の番号、（２）前の形態素の見出し、
（３）前の形態素の品詞、（４）前の形態素の下位分
類、（５）前の形態素の活用、（６）後の形態素の見出
し、（７）後の形態素の品詞、（８）後の形態素の下位
分類、および（９）フラグ情報の９個の項目が１単位と
して記述されている。なお、９個の項目のすべてを記述
する必要は必ずしもなく、そのうちの必要な項目のみを
記述しておくようにすればよい。図６および図７（後述
する図８のポーズ設定規則においても同様）では、ドン
トケア（Don't Care）の項目は、NULLとしてある。That is, each combination rule is written as one unit of nine items, from left to right: (1) condition number, (2) headword of the preceding morpheme, (3) part of speech of the preceding morpheme, (4) lower classification of the preceding morpheme, (5) inflection of the preceding morpheme, (6) headword of the following morpheme, (7) part of speech of the following morpheme, (8) lower classification of the following morpheme, and (9) flag information. Not all nine items need to be written; only the necessary ones are. In FIGS. 6 and 7 (and likewise in the pause setting rules of FIG. 8, described later), items that are "don't care" are shown as NULL.

【００６２】以上の９項目のうち、（２）乃至（８）
は、入力文中の隣接する２つの形態素が、複合語、文
節、連文節、（連用修飾連文節および連体修飾連文節）
を構成すると文法的に認められるときに満たすべき条件
（結合条件）とされている（あるいは、入力文中の隣接
する２つの形態素が、複合語、文節、連文節、（連用修
飾連文節および連体修飾連文節）を構成すると文法的に
認められないときに満たすべき条件（分割条件）とされ
ている）。Of these nine items, (2) to (8) are the conditions (combination conditions) to be satisfied when two adjacent morphemes in the input sentence are grammatically recognized as forming a compound word, a bunsetsu (phrase), or a linked bunsetsu (a continuative-modification linked bunsetsu or an adnominal-modification linked bunsetsu). Alternatively, they are the conditions (division conditions) to be satisfied when two adjacent morphemes in the input sentence are not grammatically recognized as forming such a unit.

【００６３】また、以上のような条件の集合である結合
規則（ポーズ設定規則も同様）は、図５で説明した形態
素解析結果（形態素を表す文字列（形態素文字列）、ア
クセント、品詞情報（活用形を含む）、拍数など）を用
いて記述される。なお、図６および図７に示した結合規
則（図８に示すポーズ設定規則（ポーズ優先度規則）も
同様）では、形態素解析結果のうちの形態素文字列、品
詞情報（活用形を含む）が用いられている。また、結合
規則の１単位を構成する９項目のうちの（２）乃至
（８）（ポーズ設定規則も同様）は、必要に応じて、
（９）のフラグ情報も用いて記述されている。The combination rules (and likewise the pause setting rules), each a set of conditions as described above, are written in terms of the morphological analysis result described with FIG. 5 (the character string representing a morpheme (morpheme string), accent, part-of-speech information (including inflected form), number of beats, and so on). The combination rules shown in FIGS. 6 and 7 (and likewise the pause setting rules (pause priority rules) shown in FIG. 8) use the morpheme string and the part-of-speech information (including inflected form) from the analysis result. Items (2) to (8) of the nine items making up one combination rule (and likewise for the pause setting rules) are written using the flag information of (9) as well where necessary.

【００６４】（１）の条件の番号には、例えば、重複す
ることのない（ユニークな）数字が昇順に記述される。
なお、この条件の番号は、必ず記述する必要がある。ま
た、数字は、連続している必要はなく、不連続であって
も良い。The condition number of (1) holds, for example, non-repeating (unique) numbers in ascending order. The condition number must always be written. The numbers need not be consecutive; gaps are allowed.

【００６５】（２）の前の形態素の見出しには、隣接す
る２つの形態素のうち、前（先）の形態素を表す形態素
文字列が記述され、（６）の後の形態素の見出しには、
後の形態素を表す文字列が記述される。即ち、（２）お
よび（６）には、図５に示した形態素解析結果のうちの
左端の欄に示した形態素が記述される。従って、例え
ば、（２）は「丸の内」、（６）は「警察」のように、
それぞれ記述される。The headword of the preceding morpheme, (2), holds the morpheme string of the earlier of two adjacent morphemes, and the headword of the following morpheme, (6), holds the string of the later one. That is, (2) and (6) hold morphemes as shown in the leftmost column of the morphological analysis result of FIG. 5. For example, (2) may be written as 「丸の内」 ("Marunouchi") and (6) as 「警察」 ("police").

【００６６】（３）の前の形態素の品詞には、隣接する
２つの形態素のうち、前の形態素の品詞が記述され、
（７）の後の形態素の品詞には、後の形態素の品詞が記
述される。ここでいう形態素の品詞とは、図５に示した
形態素解析結果における品詞情報欄のうちの下位分類、
即ち丸括弧でくくられた、例えば「（副詞用法）」とい
う用法と、／以降の、例えば「未然形」などの活用形
（活用）を除く部分（品詞と・以降の下位分類）が記述
される。従って、例えば、（３）は「名詞・サ変」、
（７）は「一段動詞」のように、それぞれ記述される。The part of speech of the preceding morpheme, (3), holds the part of speech of the earlier of two adjacent morphemes, and the part of speech of the following morpheme, (7), holds that of the later one. The part of speech here means the part-of-speech information column of the morphological analysis result in FIG. 5 with two kinds of lower classification removed: the usage enclosed in parentheses, such as "(adverbial usage)", and the inflected form after the slash, such as "irrealis form" (未然形). What remains is the part of speech together with any lower classification after the middle dot. For example, (3) may be written as "noun・sa-hen" (名詞・サ変) and (7) as "ichidan verb" (一段動詞).

【００６７】（４）の前の形態素の下位分類には、隣接
する２つの形態素のうち、前の形態素の下位分類のうち
の用法（図５の品詞情報のうちの丸括弧でくくられた部
分）が記述され、（８）の後の形態素の下位分類には、
後の形態素の下位分類のうちの用法が記述される。な
お、名詞は、図２に示した「（副詞用法）」の他に、例
えば「（独立用法）」、「（形式名詞用法）」などの多
くの下位分類（用法）を有するが、図２ではその図示を
省略してある。従って、例えば、（４）は「（副詞用
法）」、（８）は「（独立用法）」のようにそれぞれ記
述される。The lower classification of the preceding morpheme, (4), holds the usage (the parenthesized part of the part-of-speech information in FIG. 5) among the lower classifications of the earlier of two adjacent morphemes, and the lower classification of the following morpheme, (8), holds the usage of the later one. Besides "(adverbial usage)" shown in FIG. 2, nouns have many other lower classifications (usages), such as "(independent usage)" and "(formal noun usage)", which are omitted from FIG. 2. For example, (4) may be written as "(adverbial usage)" and (8) as "(independent usage)".

【００６８】（５）の前の形態素の活用には、隣接する
２つの形態素のうち、前の形態素の下位分類のうちの活
用（活用形）が記述される。ここでいう形態素の活用と
は、図５に示した形態素解析結果における品詞情報のう
ち、／以降の、例えば「未然形」などのように「〜形」
とされている部分を意味する。従って、例えば、（５）
は「未然形」などのように記述される。The inflection of the preceding morpheme, (5), holds the inflected form among the lower classifications of the earlier of two adjacent morphemes. The inflection here means the part of the part-of-speech information in FIG. 5 that follows the slash and is written as "... form" (〜形), such as "irrealis form" (未然形). For example, (5) may be written as "irrealis form".

【００６９】（９）のフラグ情報には、（２）乃至
（８）の条件を満足する形態素どうしを結合して得られ
る複合語、文節、または連文節に付加する情報（付加情
報）が記述される（詳細は後述する）。The flag information, (9), holds information (additional information) to be attached to the compound word, bunsetsu, or linked bunsetsu obtained by joining morphemes that satisfy conditions (2) to (8) (details are given later).
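
The nine-item rule unit and its NULL ("don't care") items can be sketched as a small record type with a matching function. This is an illustrative sketch only: the class and field names are hypothetical, parts of speech are compared by exact string equality (the hierarchical dictionary is ignored for brevity), and the flag item (9) is stored but not used in matching.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Morpheme:
    surface: str                      # morpheme string (headword)
    pos: str                          # part of speech
    usage: Optional[str] = None       # parenthesized usage, if any
    inflection: Optional[str] = None  # inflected form, if any


@dataclass(frozen=True)
class Rule:
    number: int                            # (1) condition number, always required
    prev_surface: Optional[str] = None     # (2) None plays the role of NULL
    prev_pos: Optional[str] = None         # (3)
    prev_usage: Optional[str] = None       # (4)
    prev_inflection: Optional[str] = None  # (5)
    next_surface: Optional[str] = None     # (6)
    next_pos: Optional[str] = None         # (7)
    next_usage: Optional[str] = None       # (8)
    flag: Optional[str] = None             # (9) not consulted by matches()


def matches(rule: Rule, prev: Morpheme, nxt: Morpheme) -> bool:
    """True if every non-NULL item (2) to (8) equals the corresponding field."""
    pairs = [
        (rule.prev_surface, prev.surface),
        (rule.prev_pos, prev.pos),
        (rule.prev_usage, prev.usage),
        (rule.prev_inflection, prev.inflection),
        (rule.next_surface, nxt.surface),
        (rule.next_pos, nxt.pos),
        (rule.next_usage, nxt.usage),
    ]
    return all(want is None or want == got for want, got in pairs)


# Illustrative rule: a noun followed by a noun; every other item is NULL.
noun_noun = Rule(number=1001, prev_pos="noun", next_pos="noun")
print(matches(noun_noun, Morpheme("丸の内", "noun"), Morpheme("警察", "noun")))
```

Leaving unused items as None keeps each rule as short as the description suggests: only the items that matter for a given condition are filled in.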

【００７０】図６および図７に示した結合規則は、複合
語を構成する形態素間の文法的な関係を規定する複合語
規則（複合語を構成する２つの形態素が文法的に満足す
べき条件）、文節を構成する形態素間の文法的な関係を
規定する文節規則（文節を構成する２つの形態素が文法
的に満足すべき条件）、連用修飾連文節を構成する形態
素間の文法的な関係を規定する連用修飾連文節規則（連
用修飾連文節を構成する２つの形態素が文法的に満足す
べき条件）、および連体修飾連文節を構成する形態素間
の文法的な関係を規定する連体修飾連文節規則（連体修
飾連文節を構成する２つの形態素が文法的に満足すべき
条件）を含んでいる。The combination rules shown in FIGS. 6 and 7 comprise: compound word rules, which define the grammatical relationship between the morphemes forming a compound word (the conditions that two morphemes forming a compound word must satisfy grammatically); bunsetsu rules, which do the same for the morphemes forming a bunsetsu; continuative-modification linked bunsetsu rules, which do the same for the morphemes forming a continuative-modification linked bunsetsu; and adnominal-modification linked bunsetsu rules, which do the same for the morphemes forming an adnominal-modification linked bunsetsu.

【００７１】結合処理部５では、隣接する２つの形態素
が、複合語規則、文節規則、連用修飾連文節規則、また
は連体修飾連文節規則に合致するか否かが判定され、そ
れぞれの規則に合致すると判定された場合、それらの２
つの形態素が複合語、文節、連用修飾連文節、または連
体修飾連文節を構成するものとして結合される。The combination processing unit 5 determines whether two adjacent morphemes match a compound word rule, a bunsetsu rule, a continuative-modification linked bunsetsu rule, or an adnominal-modification linked bunsetsu rule. When they match one of these rules, the two morphemes are joined as forming a compound word, a bunsetsu, a continuative-modification linked bunsetsu, or an adnominal-modification linked bunsetsu, respectively.

【００７２】ここで、上述のように２つの形態素を結合
規則に基づいて結合したものが、例外的に、複合語、文
節、連用修飾連文節、または連体修飾連文節を構成しな
い場合がある。そこで、このような例外的な場合に対処
すべく、複合語規則、文節規則、連用修飾連文節規則、
または連体修飾連文節規則は、結合した２つの形態素を
分割（分離）させるための分割規則（分割条件）を含ん
でいる。Two morphemes joined under the combination rules as described above may, exceptionally, not actually form a compound word, a bunsetsu, a continuative-modification linked bunsetsu, or an adnominal-modification linked bunsetsu. To handle such exceptions, the compound word rules, bunsetsu rules, continuative-modification linked bunsetsu rules, and adnominal-modification linked bunsetsu rules each include division rules (division conditions) for splitting (separating) two joined morphemes.

【００７３】ここで、以下、適宜、複合語、文節、連用
修飾連文節、または連体修飾連文節として形態素を結合
させるための規則を、複合語結合規則、文節結合規則、
連用修飾結合規則、または連体修飾結合規則とそれぞれ
いう。また、複合語、文節、連用修飾連文節、または連
体修飾連文節として結合した形態素を分割させるための
規則を複合語分割規則、文節分割規則、連用修飾分割規
則、または連体修飾分割規則とそれぞれいう。In the following, the rules for joining morphemes into a compound word, a bunsetsu, a continuative-modification linked bunsetsu, or an adnominal-modification linked bunsetsu are called, as appropriate, the compound word combination rules, bunsetsu combination rules, continuative-modification combination rules, and adnominal-modification combination rules, respectively. The rules for splitting morphemes that were joined as such units are called the compound word division rules, bunsetsu division rules, continuative-modification division rules, and adnominal-modification division rules, respectively.

【００７４】従って、複合語規則、文節規則、連用修飾
連文節規則、または連体修飾連文節規則は、複合語結合
規則および複合語分割規則、文節結合規則および文節分
割規則、連用修飾結合規則および連用修飾分割規則、ま
たは連体修飾結合規則および連体修飾分割規則から、そ
れぞれ構成されているということができる。Accordingly, the compound word rules consist of the compound word combination rules and compound word division rules; the bunsetsu rules of the bunsetsu combination rules and bunsetsu division rules; the continuative-modification linked bunsetsu rules of the continuative-modification combination rules and division rules; and the adnominal-modification linked bunsetsu rules of the adnominal-modification combination rules and division rules.

【００７５】図６および図７に示した条件（規則）のう
ち、（１）の条件の番号が、1000,2000,3000,4000,500
0,6000,7000,8000番台のものは、それぞれ複合語結合規
則、複合語分割規則、文節結合規則、文節分割規則、連
用修飾結合規則、連用修飾分割規則、連体修飾結合規
則、または連体修飾分割規則に相当する。なお、図６お
よび図７において、4000および6000番台の条件（規
則）、即ち文節分割規則および連用修飾分割規則の図示
は省略してある。Among the conditions (rules) shown in FIGS. 6 and 7, those whose condition number (1) is in the 1000s, 2000s, 3000s, 4000s, 5000s, 6000s, 7000s, and 8000s correspond to the compound word combination rules, compound word division rules, bunsetsu combination rules, bunsetsu division rules, continuative-modification combination rules, continuative-modification division rules, adnominal-modification combination rules, and adnominal-modification division rules, respectively. The conditions (rules) in the 4000s and 6000s, that is, the bunsetsu division rules and continuative-modification division rules, are omitted from FIGS. 6 and 7.

【００７６】結合処理部５は、実際にはプログラムでな
るが、上述のように各規則をまとめておくことにより、
各規則にしたがって処理を行うプログラムのモジュール
それぞれに対し、用いる規則を簡単に指定することがで
きる。即ち、各モジュールに対しては、使用する規則
を、（１）の条件の番号（例えば1000番台や2000番台な
どのように）によって指定することができる。The combination processing unit 5 is in practice implemented as a program. Grouping the rules as described above makes it easy to tell each program module, which processes according to one kind of rule, which rules to use: the rules for each module can be specified by the condition number of (1) (for example, the 1000s or the 2000s).
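
Selecting rules by number series might look like the following sketch. The series-to-rule-kind table follows the description; the dictionary-based rule records, the sample numbers, and the function name rules_in_series are assumptions made for illustration.

```python
# Series base -> rule kind, as listed in the description.
SERIES = {
    1000: "compound word combination",
    2000: "compound word division",
    3000: "bunsetsu combination",
    4000: "bunsetsu division",
    5000: "continuative-modification combination",
    6000: "continuative-modification division",
    7000: "adnominal-modification combination",
    8000: "adnominal-modification division",
}


def rules_in_series(rules, base):
    """Return the rules whose condition number lies in [base, base + 1000)."""
    return [rule for rule in rules if base <= rule["number"] < base + 1000]


# Hypothetical rule records; only the condition number matters here.
rules = [{"number": 1001}, {"number": 1005}, {"number": 2001}, {"number": 3002}]

# A compound word combination module would ask only for the 1000s.
print([rule["number"] for rule in rules_in_series(rules, 1000)])
```

Because the numbers need not be consecutive, rules can be added to a series later without renumbering the others.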

【００７７】次に、図８は、ポーズ設定処理部６で用い
られるポーズ設定規則の例を示している。ポーズ設定規
則には、多くのサンプルの文章（例えば、新聞に記載さ
れた文章など）中に含まれる連文節間（従って形態素間
でもある）にポーズが挿入される場合の統計（頻度の統
計）をとり（これは、文章を、例えば実際のアナウンサ
などに朗読してもらうことにより行う）、その統計結果
から求めた、形態素間にポーズが挿入されるときに、そ
の形態素が満たす文法的な条件（ポーズの設定条件）が
記述されている。Next, FIG. 8 shows an example of the pause setting rules used by the pause setting processing unit 6. To build these rules, statistics (frequency counts) are gathered on where pauses are inserted between linked bunsetsu (and hence between morphemes) in a large number of sample sentences (for example, newspaper text); this is done by having the sentences read aloud by, for example, a professional announcer. The pause setting rules describe the grammatical conditions (pause setting conditions), derived from those statistics, that the morphemes satisfy when a pause is inserted between them.

【００７８】なお、ポーズ設定規則は、形態素間にポー
ズを設定する設定条件が、その間にポーズを設定する優
先順位を表す優先度が付されて記述されたポーズ優先度
規則と、ポーズの設定位置を絞り込むための絞り込み条
件が記述されたポーズ絞り込み規則とを含んでいる。ポ
ーズ優先度規則にしたがって、ポーズを設定する位置の
候補を決定し、その候補の中から、ポーズ絞り込み規則
に基づいて、ポーズを設定する最終的な位置が決定され
る。The pause setting rules comprise pause priority rules, in which each setting condition for placing a pause between morphemes is written with a priority indicating the order of precedence for placing a pause there, and pause narrowing rules, which describe narrowing conditions for narrowing down the pause positions. Candidate pause positions are determined according to the pause priority rules, and the final pause positions are selected from those candidates according to the pause narrowing rules.

【００７９】図８に示したものは、ポーズ設定規則を構
成するポーズ優先度規則であり、それは、上述したよう
に結合規則（図６および図７）を構成する（１）乃至
（９）の項目のうち、（９）を除いた項目を１単位とし
て記述されている。FIG. 8 shows the pause priority rules that form part of the pause setting rules. As described above, each is written as one unit consisting of items (1) to (9) of the combination rules (FIGS. 6 and 7) with (9) removed.

【００８０】このポーズ優先度規則においては、ある設
定条件を満たす形態素間にポーズが挿入される頻度が高
い場合、その設定条件には高い優先度が付され、またポ
ーズが挿入される頻度が低ければ、低い優先度が付され
ている。In the pause priority rules, a setting condition is given a high priority when pauses are inserted frequently between morphemes that satisfy it, and a low priority when pauses are inserted infrequently.

【００８１】このように、設定条件に優先度を付してお
くことにより、高い優先度の設定条件を満たす形態素間
に対して、優先的にポーズを設定するようにすることが
できる。従って、優先度は、ポーズを設定する優先順位
ということができる。Assigning priorities to the setting conditions in this way allows pauses to be set preferentially between morphemes that satisfy a high-priority setting condition. The priority can therefore be regarded as the order of precedence for setting pauses.

【００８２】ここで、優先度は、例えば１が最も高く、
以下、数字が大きくなるごとに低くなっていくものとす
る。Here, priority 1 is taken to be the highest, and the priority becomes lower as the number increases.

【００８３】図８においては、優先度１乃至６の設定条
件が、（１）の条件の番号が11000，12000，13000，140
00，15000，16000番台の部分に、それぞれまとめて記述
されている。In FIG. 8, the setting conditions with priorities 1 to 6 are grouped under condition numbers (1) in the 11000s, 12000s, 13000s, 14000s, 15000s, and 16000s, respectively.

【００８４】従って、ポーズ設定処理部６でどの優先度
の設定条件を用いるかも、上述した結合処理部５におけ
る場合と同様に、（１）の条件の番号によって、簡単に
指定することができる。Accordingly, as with the combination processing unit 5 described above, which priority's setting conditions the pause setting processing unit 6 uses can easily be specified by the condition number of (1).
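
Given the numbering described for FIG. 8 (11000s for priority 1 through 16000s for priority 6), the priority can be recovered from the condition number alone. The sketch below assumes exactly that mapping; the function name priority_of and the sample condition numbers are hypothetical.

```python
def priority_of(condition_number):
    """Map a pause priority rule number to its priority (1 = highest, 6 = lowest)."""
    if not 11000 <= condition_number < 17000:
        raise ValueError(f"{condition_number} is not a pause priority rule number")
    return condition_number // 1000 - 10


# Hypothetical matched conditions, ordered from highest to lowest priority.
matched = [16010, 11003, 13001]
print(sorted(matched, key=priority_of))
```

Sorting by this value places the conditions under which pauses occur most often first, in line with priority 1 being the highest.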

【００８５】なお、ポーズ設定規則を構成するポーズ絞
り込み条件については後述する。The pause narrowing conditions that form part of the pause setting rules are described later.

【００８６】次に、図９は、形態素解析結果（図５）を
結合処理部５で、結合規則に基づいて結合処理し、さら
にその処理結果をポーズ設定処理部６で、ポーズ優先度
規則に基づいてポーズ優先度処理した結果を示してい
る。なお、ポーズ設定処理部６では、ポーズ優先度処理
の他、後述するようにポーズ絞り込み規則に基づいてポ
ーズ絞り込み処理も行われる。また、図１０は、結合処
理部５およびポーズ設定処理部６の動作を説明するフロ
ーチャートを示している。結合処理部５では、ステップ
Ｓ１乃至Ｓ９の処理が行われ、ポーズ設定処理部６で
は、ステップＳ１０およびＳ１１の処理が行われる。Next, FIG. 9 shows the result obtained when the combination processing unit 5 applies combination processing to the morphological analysis result (FIG. 5) according to the combination rules, and the pause setting processing unit 6 then applies pause priority processing to that result according to the pause priority rules. In addition to pause priority processing, the pause setting processing unit 6 also performs pause narrowing processing based on the pause narrowing rules, as described later. FIG. 10 is a flowchart of the operation of the combination processing unit 5 and the pause setting processing unit 6. The combination processing unit 5 performs steps S1 to S9, and the pause setting processing unit 6 performs steps S10 and S11.

【００８７】即ち、結合処理部５では、ステップＳ１に
おいて、基準拍数が設定される。ここで、基準拍数と
は、ポーズ設定処理部６の処理で基準として用いられる
２つのポーズ間の拍数（但し、句読点は拍数には含めな
い）を意味する。ここでは、例えば基準拍数は、ＡとＢ
の２種類設定されるものとする。また、基準拍数Ａは、
例えば１１拍（モーラ）以上２６拍（モーラ）未満の範
囲とされ（この場合、基準拍数Ａは範囲を表すので、以
下、基準拍数範囲Ａという）、基準拍数Ｂは、例えば４
５拍（モーラ）とされるものとする。That is, in step S1 the combination processing unit 5 sets the reference beat counts. A reference beat count is the number of beats between two pauses (punctuation marks are not counted) that the pause setting processing unit 6 uses as a reference. Here, two reference beat counts, A and B, are set. Reference beat count A is a range of, for example, at least 11 and fewer than 26 beats (morae); since it represents a range, it is hereinafter called the reference beat range A. Reference beat count B is, for example, 45 beats (morae).

【００８８】その後、ステップＳ２に進み、複合語結合
規則に基づいて、形態素解析結果（図５）から複合語が
同定される（複合語結合処理が行われる）。The process then proceeds to step S2, where compound words are identified from the morphological analysis result (FIG. 5) according to the compound word combination rules (compound word combination processing).

【００８９】ここで、複合語と同定された形態素のう
ち、先頭の形態素または末尾の形態素は、それぞれ
「頭」または「尾」と表される。また、複合語として他
の形態素に結合されずに残された形態素は「孤」と表さ
れる。なお、句読点はそのままとされる。Here, among the morphemes identified as a compound word, the first morpheme is labeled "head" (頭) and the last is labeled "tail" (尾). A morpheme left without being joined to any other morpheme as a compound word is labeled "isolated" (孤). Punctuation marks are left as they are.

【００９０】即ち、ステップＳ２では、図６および図７
の結合規則のうちの条件の番号1001からの1000番台の条
件（規則）を、図５の形態素解析結果における隣接する
２つの形態素間に適用し、いずれかの条件を満足するか
否かが判定される。そして、２つの形態素がいずれかの
条件を満足する場合は、前の形態素が「頭」、後の形態
素が「尾」とされる。That is, in step S2, the conditions (rules) in the 1000s, starting from condition number 1001, among the combination rules of FIGS. 6 and 7 are applied to each pair of adjacent morphemes in the morphological analysis result of FIG. 5, and it is determined whether the pair satisfies any of the conditions. If it does, the earlier morpheme is labeled "head" and the later morpheme "tail".

【００９１】具体的には、図５において、例えば形態素
「昨夜」と「男」、および「丸の内」と「警察」は、い
ずれも図６に示した（１）条件の番号の1001の前の形態
素の品詞が「名詞」で、後の形態素の品詞が「名詞」と
いう条件を満足する。従って、この場合、「昨夜」＝
頭、「男」＝尾（以下、適宜、（「昨夜」「男」）のよ
うに表す）、「丸の内」＝頭、「警察」＝尾（（「丸の
内」「警察」））とされる。Specifically, in FIG. 5, the morpheme pairs 「昨夜」 ("last night") and 「男」 ("man"), and 「丸の内」 ("Marunouchi") and 「警察」 ("police"), both satisfy condition number 1001 shown in FIG. 6, namely that the part of speech of the preceding morpheme is "noun" and that of the following morpheme is "noun". Accordingly, 「昨夜」 = head and 「男」 = tail (hereinafter written, as appropriate, as （「昨夜」「男」）), and 「丸の内」 = head and 「警察」 = tail (（「丸の内」「警察」）).

【００９２】なお、３個以上の形態素が連続して条件を
満足する場合には、先頭の形態素が「頭」、末尾の形態
素が「尾」とされ、それらの中間に挟まれた形態素は全
て「胴」とされる。例えば、品詞が「名詞」である形態
素「丸の内」、「中央」および「警察」からなる複合語
「丸の内中央警察」は、形態素「丸の内」と「中央」、
および「中央」と「警察」が、1001の条件を満足するか
ら、それぞれ「丸の内」＝頭、「中央」＝胴、「警察」
＝尾とされる（この場合、まず「丸の内」と「中央」が
結合されることにより、（「丸の内」「中央」）とさ
れ、その後、「中央」と「警察」が結合されることによ
り、（（「丸の内」「中央」）「警察」）とされる）。When three or more consecutive morphemes satisfy the conditions, the first morpheme is labeled "head", the last "tail", and every morpheme between them "body" (胴). For example, in the compound word 「丸の内中央警察」 ("Marunouchi Central Police"), made up of the noun morphemes 「丸の内」, 「中央」, and 「警察」, the pairs 「丸の内」/「中央」 and 「中央」/「警察」 each satisfy condition 1001, so 「丸の内」 = head, 「中央」 = body, and 「警察」 = tail (in this case 「丸の内」 and 「中央」 are joined first, giving （「丸の内」「中央」）, and then 「中央」 and 「警察」 are joined, giving （（「丸の内」「中央」）「警察」）).
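
The head/body/tail/isolated labeling described above can be sketched as follows. This is a simplified illustration, not the implementation in the patent: morphemes are (surface, part of speech) pairs, the only joining condition is the noun-noun condition given for rule 1001, punctuation is not treated specially, and the names label_compounds and both_nouns are hypothetical.

```python
def label_compounds(morphemes, joins):
    """Label each morpheme as head, body, tail, or isolated.

    `joins(a, b)` says whether adjacent morphemes a and b satisfy some
    compound word combination condition.
    """
    n = len(morphemes)
    linked = [joins(morphemes[i], morphemes[i + 1]) for i in range(n - 1)]
    labels = []
    for i in range(n):
        joined_left = i > 0 and linked[i - 1]
        joined_right = i < n - 1 and linked[i]
        if joined_left and joined_right:
            labels.append("body")
        elif joined_right:
            labels.append("head")
        elif joined_left:
            labels.append("tail")
        else:
            labels.append("isolated")
    return labels


def both_nouns(a, b):
    """Stand-in for condition 1001: noun followed by noun."""
    return a[1] == "noun" and b[1] == "noun"


sentence = [
    ("丸の内", "noun"),
    ("中央", "noun"),
    ("警察", "noun"),
    ("に", "case particle"),
]
print(label_compounds(sentence, both_nouns))
```

A morpheme joined on both sides becomes "body", which reproduces the three-morpheme example of 「丸の内中央警察」.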

【００９３】以上の複合語結合処理の後、ステップＳ３
に進み、複合語分割規則に基づいて、複合語分割処理が
行なわれる。即ち、ステップＳ３では、図６および図７
の結合規則のうちの条件の番号2001からの2000番台の条
件を、上述の複合語結合処理結果における、隣接する２
つの形態素（但し、複合語を構成するとして結合された
もの）間に適用し、いずれかの条件を満足するか否かが
判定される。そして、２つの形態素がいずれかの条件を
満足する場合は、「頭」とされた前の形態素が「孤」
に、「尾」とされた後の形態素が「孤」に変更される。
この複合語分割処理は、複合語として結合した形態素の
組合せの中で、例外的に複合語と同定するのが好ましく
ないものを除外するために行われる。なお、後述するス
テップＳ５，Ｓ７，Ｓ９の各処理も、ステップＳ４，Ｓ
６，Ｓ８の各処理で結合された形態素を例外的に除外
（分割）するために行われる。After the compound word combination processing above, the process proceeds to step S3, where compound word division processing is performed according to the compound word division rules. That is, in step S3, the conditions in the 2000s, starting from condition number 2001, among the combination rules of FIGS. 6 and 7 are applied to each pair of adjacent morphemes in the result of the compound word combination processing (limited to pairs that were joined as forming a compound word), and it is determined whether the pair satisfies any of the conditions. If it does, the earlier morpheme, which had been labeled "head", is changed to "isolated", and the later morpheme, which had been labeled "tail", is likewise changed to "isolated". This division processing removes those combinations of joined morphemes that, exceptionally, should not be identified as compound words. Likewise, steps S5, S7, and S9, described later, are performed to exceptionally exclude (split) morphemes joined in steps S4, S6, and S8, respectively.

【００９４】具体的には、図５において、例えば、形態
素「昨夜」と「男」は、1001の条件を満足するから、上
述したように、「昨夜」＝頭、「男」＝尾とされる。し
かしながら、前の形態素である「昨夜」の下位分類は
「（副詞用法）」であり（図５）、また、後の形態素で
ある「男」の品詞が「名詞」であるから、この２つの形
態素は、図６の2001の条件を満たし、従って、「昨夜」
＝孤、「男」＝孤と変更される。なお、分割された前後
の形態素は「孤」と「弧」になる場合の他、「尾」と
「頭」になる場合がある。即ち、例えば、「頭尾」とさ
れた２つの形態素が分割されれば「孤」と「弧」となる
が、例えば、「頭胴胴胴胴尾」とされた６つの形態素が
中央で分割されれば「頭胴尾」と「頭胴尾」とされ、分
割された前後の形態素は、「尾」と「頭」になる。Specifically, in FIG. 5, the morphemes 「昨夜」 and 「男」 satisfy condition 1001, so, as described above, 「昨夜」 = head and 「男」 = tail. However, the lower classification of the preceding morpheme 「昨夜」 is "(adverbial usage)" (FIG. 5) and the part of speech of the following morpheme 「男」 is "noun", so the two morphemes satisfy condition 2001 in FIG. 6 and are changed to 「昨夜」 = isolated and 「男」 = isolated. The morphemes on either side of a split become "isolated" and "isolated" in some cases and "tail" and "head" in others. For example, if two morphemes labeled "head, tail" are split, they become "isolated" and "isolated"; but if six morphemes labeled "head, body, body, body, body, tail" are split in the middle, they become "head, body, tail" and "head, body, tail", so the morphemes on either side of the split become "tail" and "head".
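
The relabeling that follows a split can be sketched as a small function over the label sequence. This is a minimal illustration assuming the labels are held in a list and that positions i and i + 1 currently belong to the same joined group; the function name split_labels is hypothetical.

```python
def split_labels(labels, i):
    """Split a joined group between positions i and i + 1 and relabel both sides.

    Precondition: labels[i] is "head" or "body" and labels[i + 1] is
    "body" or "tail", i.e. the two positions are currently joined.
    """
    if labels[i] not in ("head", "body") or labels[i + 1] not in ("body", "tail"):
        raise ValueError("positions i and i + 1 are not joined")
    out = list(labels)
    # The left morpheme now ends its group; alone, it becomes isolated.
    out[i] = "isolated" if labels[i] == "head" else "tail"
    # The right morpheme now starts its group; alone, it becomes isolated.
    out[i + 1] = "isolated" if labels[i + 1] == "tail" else "head"
    return out


print(split_labels(["head", "tail"], 0))
print(split_labels(["head", "body", "body", "body", "body", "tail"], 2))
```

The two calls correspond to the two cases in the text: a two-morpheme group falls apart into two isolated morphemes, while a six-morpheme group split in the middle yields two three-morpheme groups.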

【００９５】以上の複合語結合処理、複合語分割処理の
処理結果を、図９の「語」の欄に示す。The results of the compound word combination processing and compound word division processing above are shown in the "word" (語) column of FIG. 9.

【００９６】ここで、結合処理部５およびポーズ設定処
理部６で採用している基本的な処理方法について説明す
る。即ち、結合処理部５およびポーズ設定処理部６で
は、２個以上の形態素が結合した際には、その先頭また
は末尾の形態素を、結合した形態素全体を代表する形態
素とするようになされている。従って、例えば、品詞に
着目して説明すれば、複合語結合処理によって結合され
た「丸の内警察」については、「丸の内」も、末尾の形
態素「警察」もその品詞は「名詞」であるから、いずれ
かが代表となっても全体の品詞は「名詞」であるが、後
述する文節結合処理（ステップＳ４）の結果得られる
「丸の内警察に」については、先頭の形態素「丸の内」
の品詞は「名詞」であり、末尾の形態素「に」の品詞は
「格助詞」であるから、先頭または末尾のいずれの形態
素が代表になるかによって、結合された形態素全体の品
詞が異なることになる。The basic processing approach used by the combination processing unit 5 and the pause setting processing unit 6 is as follows. When two or more morphemes have been joined, the first or the last of them is taken as the morpheme that represents the joined group as a whole. Taking part of speech as an example: for 「丸の内警察」, joined by compound word combination processing, both the first morpheme 「丸の内」 and the last morpheme 「警察」 are nouns, so the part of speech of the whole is "noun" whichever is the representative. For 「丸の内警察に」, obtained by the bunsetsu combination processing described later (step S4), however, the first morpheme 「丸の内」 is a "noun" and the last morpheme 「に」 is a "case particle", so the part of speech of the joined group as a whole depends on whether the first or the last morpheme is the representative.

【００９７】即ち、図５に示した、例えば「丸の内警察
に」と、その前にある（隣接する）形態素「られ」との
間に、結合規則またはポーズ設定規則が適用される場合
には、形態素「られ」と、「丸の内警察に」のうちの、
形態素「られ」に隣接する先頭の形態素「丸の内」との
間に結合規則またはポーズ設定規則が適用される。That is, when a combination rule or a pause setting rule is applied between "Marunouchi police ni" (FIG. 5) and the adjacent morpheme "rare" (られ) that precedes it, the rule is applied between "rare" and "Marunouchi", the leading morpheme of "Marunouchi police ni", which is the one adjacent to "rare".

【００９８】また、例えば「丸の内警察に」と、その後
にある（隣接する）形態素「窃盗」との間に、結合規則
またはポーズ設定規則が適用される場合には、「丸の内
警察に」のうちの、形態素「窃盗」に隣接する末尾の形
態素「に」と、形態素「窃盗」との間に、結合規則また
はポーズ設定規則が適用される。Likewise, when a combination rule or a pause setting rule is applied between "Marunouchi police ni" and the adjacent morpheme "theft" (窃盗) that follows it, the rule is applied between "ni", the trailing morpheme of "Marunouchi police ni", which is the one adjacent to "theft", and "theft" itself.

【００９９】さらに、２個以上の形態素が結合したもの
どうしに対しては、隣接する末尾と先頭の形態素の間
に、結合規則またはポーズ設定規則が適用される。具体
的には、以下のような処理が行われる。（上述したよう
に、１つの形態素自体をカギ括弧「」で囲み、２つ以
上の形態素が結合したものを、その外側を丸括弧（）
で囲んで表す）。Furthermore, between two units that each consist of two or more combined morphemes, the combination rule or pause setting rule is applied between the adjacent trailing and leading morphemes. Specifically, the processing is as follows. (As noted above, an individual morpheme is written in corner brackets 「」, and a unit of two or more combined morphemes is written with parentheses () around it.)

【０１００】即ち、例えば、（「店員」「に」）と
（「押さえ」「られ」）とが隣接する場合、（「店員」
「に」）を代表する末尾の形態素「に」と、（「押さ
え」「られ」）を代表する先頭の形態素「押さえ」が、
形態素を結合する条件を満たせば、（「店員」「に」）
と（「押さえ」「られ」）は一つに結合され、その結
果、（（「店員」「に」）（「押さえ」「られ」））と
なる。For example, suppose ("clerk" "ni") (「店員」「に」) and ("osae" "rare") (「押さえ」「られ」) are adjacent. If the trailing morpheme "ni", which represents ("clerk" "ni"), and the leading morpheme "osae", which represents ("osae" "rare"), satisfy a condition for combining morphemes, the two units are combined into one, giving (("clerk" "ni") ("osae" "rare")).
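The boundary convention described in the last few paragraphs can be sketched as follows. This is a minimal illustration, not the patent's implementation: the nested-tuple representation and the helper names `leading`, `trailing`, and `boundary_pair` are assumptions made here for the example.

```python
from typing import Tuple, Union

# A unit is either a single morpheme (str) or a tuple of sub-units,
# mirroring the notation in the text: 「店員」 vs. (「店員」「に」).
Unit = Union[str, Tuple["Unit", ...]]


def leading(unit: Unit) -> str:
    """Return the leading (first) morpheme of a possibly nested unit."""
    while not isinstance(unit, str):
        unit = unit[0]
    return unit


def trailing(unit: Unit) -> str:
    """Return the trailing (last) morpheme of a possibly nested unit."""
    while not isinstance(unit, str):
        unit = unit[-1]
    return unit


def boundary_pair(left: Unit, right: Unit) -> Tuple[str, str]:
    """Morphemes that a rule applied between two adjacent units is checked against."""
    return trailing(left), leading(right)


marunouchi = (("丸の内", "警察"), "に")
print(boundary_pair("られ", marunouchi))                    # rule sees られ / 丸の内
print(boundary_pair(marunouchi, "窃盗"))                    # rule sees に / 窃盗
print(boundary_pair(("店員", "に"), ("押さえ", "られ")))    # rule sees に / 押さえ
```

Only the outermost morphemes of each unit are ever inspected, which is why the "body" morphemes never take part in rule matching.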

【０１０１】従って、結合規則またはポーズ設定規則が
適用される形態素は、「頭」、「尾」、「孤」とされた
ものだけであり、「胴」とされたものには適用されな
い。Therefore, combination rules and pause setting rules are applied only to morphemes labeled "head", "tail", or "isolated" (孤), and never to morphemes labeled "body".

【０１０２】次に、ステップＳ３の処理後、ステップＳ
４に進み、文節結合処理が文節結合規則に基づいて行わ
れ、これにより複合語の同定結果から文節が同定され
る。即ち、複合語分割処理の結果に対し、図６に示した
3001から始まる3000番台の文節結合規則が適用される。
これにより、図９の「語」の欄に示した、例えば（「丸
の内」「警察」）と、それに続く「に」は、「尾」とさ
れた形態素「警察」と「孤」とされた形態素「に」が、
図５に示した形態素解析結果から、図６に示した3004の
条件を満足するので結合され、（（「丸の内」「警
察」）「に」）とされる。即ち、形態素「丸の内」、
「警察」、「に」は、それぞれ「頭」、「胴」、「尾」
とされる。Next, after step S3, the process proceeds to step S4, where bunsetsu combination processing is performed according to the bunsetsu combination rules, so that bunsetsu (phrases) are identified from the compound-word results. That is, the bunsetsu combination rules in the 3000 series, starting from 3001 in FIG. 6, are applied to the result of the compound-word division processing. For example, take ("Marunouchi" "police") in the "word" column of FIG. 9 and the "ni" that follows it. According to the morphological analysis result in FIG. 5, the morpheme "police", labeled "tail", and the morpheme "ni", labeled "isolated" (孤), satisfy condition 3004 in FIG. 6. They are therefore combined into (("Marunouchi" "police") "ni"), and the morphemes "Marunouchi", "police", and "ni" become "head", "body", and "tail", respectively.

【０１０３】以上の文節結合処理後、ステップＳ５に進
み、文節分割規則に基づいて、文節分割処理が行われ
る。即ち、文節を構成するとして結合された形態素間
に、文節分割規則が適用され、文節とは認められない結
合が解除される。After the bunsetsu combination processing, the process proceeds to step S5, where bunsetsu division processing is performed according to the bunsetsu division rules. That is, the bunsetsu division rules are applied between morphemes that were combined as constituting a bunsetsu, and any combination that is not accepted as a bunsetsu is undone.

【０１０４】具体的には、図５および図６には図示して
いないが、文節分割規則として、例えば 4001,”警察”,名詞,NULL,NULL,NULL,格助詞,NULL,NULL が記述されていた場合には、文節として同定された
（（「丸の内」「警察」）「に」）を構成する形態素
「警察」と「に」は、この4001番の条件を満たすので、
（「丸の内」「警察」）と「に」に分割されることにな
る。Specifically, although not shown in FIGS. 5 and 6, suppose the bunsetsu division rule 4001,”警察”,名詞,NULL,NULL,NULL,格助詞,NULL,NULL has been written. The morphemes "police" (警察) and "ni" in (("Marunouchi" "police") "ni"), which was identified as a bunsetsu, satisfy condition 4001, so the unit is split into ("Marunouchi" "police") and "ni".

【０１０５】文節結合処理および文節分割処理により得
られた結果を、図９の「文節」の欄に示す。なお、図９
は、結合規則に、上述の4001番の条件が記述されていな
い場合の処理結果を示している。The results of the bunsetsu combination and bunsetsu division processing are shown in the "bunsetsu" column of FIG. 9. Note that FIG. 9 shows the result for the case where condition 4001 above is not written in the rules.

【０１０６】以下、ステップＳ６乃至Ｓ９に順次進み、
連用修飾結合規則、連用修飾分割規則、連体修飾結合規
則、または連体修飾分割規則に基づいて、連用修飾結合
処理、連用修飾分割処理、連体修飾結合処理、または連
体修飾分割処理がそれぞれ行われる。The process then proceeds through steps S6 to S9 in order, in which adverbial-modification (連用修飾) combination, adverbial-modification division, adnominal-modification (連体修飾) combination, and adnominal-modification division are performed according to the adverbial-modification combination rules, adverbial-modification division rules, adnominal-modification combination rules, and adnominal-modification division rules, respectively.

【０１０７】即ち、文節の同定結果から、連用修飾連文
節と連体修飾連文節とが同定される。なお、ステップＳ
６およびＳ７の処理は、ステップＳ８およびＳ９の処理
の前ではなく、その後に行うようにしても良い。但し、
ステップＳ６およびＳ７の処理を、ステップＳ８および
Ｓ９の処理の前に行う方が、より自然な合成音が得られ
ることが、シミュレーションによりわかっている。That is, adverbial-modification bunsetsu sequences (連用修飾連文節) and adnominal-modification bunsetsu sequences (連体修飾連文節) are identified from the bunsetsu results. Steps S6 and S7 may be performed after steps S8 and S9 instead of before them. However, simulation has shown that performing steps S6 and S7 before steps S8 and S9 yields more natural synthesized speech.

【０１０８】具体的には、連用修飾結合処理では、図９
の「文節」の欄に示すように処理された、例えば（「店
員」「に」）と（「押さえ」「られ」）は、（「店員」
「に」）を代表する末尾の「に」と、（「押さえ」「ら
れ」）を代表する「押さえ」が、図５の形態素解析結果
から、図７の5004番の条件を満たすので、（（「店員」
「に」）（「押さえ」「られ」））のように結合され、
図９の句１の欄に示すように、「店員」、「に」、「押
さえ」、「られ」はそれぞれ「頭」、「胴」、「胴」、
「尾」とされる。Specifically, in the adverbial-modification combination processing, consider ("clerk" "ni") and ("osae" "rare"), processed as shown in the "bunsetsu" column of FIG. 9. According to the morphological analysis result in FIG. 5, the trailing "ni", which represents ("clerk" "ni"), and "osae", which represents ("osae" "rare"), satisfy condition 5004 in FIG. 7. The units are therefore combined into (("clerk" "ni") ("osae" "rare")), and, as shown in the "phrase 1" column of FIG. 9, "clerk", "ni", "osae", and "rare" become "head", "body", "body", and "tail", respectively.

【０１０９】次に、連用修飾分割処理では、図５および
図６には図示していないが、連用修飾分割規則として、
例えば 6001,NULL,格助詞,NULL,NULL,”押さえ”,一般動詞,NULL,NULL が記述されていた場合には、連用修飾連文節として同定
された（（「店員」「に」）（「押さえ」「られ」））
を構成する形態素「に」と「押さえ」は、この6001番の
条件を満たすので、（「店員」「に」）と（「押さえ」
「られ」）に分割されることになる。Next, in the adverbial-modification division processing, although not shown in FIGS. 5 and 6, suppose the adverbial-modification division rule 6001,NULL,格助詞,NULL,NULL,”押さえ”,一般動詞,NULL,NULL has been written. The morphemes "ni" and "osae" in (("clerk" "ni") ("osae" "rare")), which was identified as an adverbial-modification bunsetsu sequence, satisfy condition 6001, so the unit is split into ("clerk" "ni") and ("osae" "rare").
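The two example rules quoted in the text (4001 and 6001) can be parsed and matched along the following lines. This is a sketch under stated assumptions: the text confirms only that a rule has nine comma-separated fields and that field (9) holds flag information. Reading fields 2, 3, 6, and 7 as the surface form and part of speech of the preceding and following morphemes is an inference from the two examples, `NULL` is treated as a wildcard, and fields 4, 5, and 8 are kept but not interpreted. The names `Rule`, `parse_rule`, and `matches` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Rule:
    number: int
    prev_surface: Optional[str]  # field 2 (assumed meaning)
    prev_pos: Optional[str]      # field 3 (assumed meaning)
    next_surface: Optional[str]  # field 6 (assumed meaning)
    next_pos: Optional[str]      # field 7 (assumed meaning)
    flag: Optional[str]          # field 9: flag information
    raw: List[str]               # all nine fields as written


def _opt(value: str) -> Optional[str]:
    """Treat NULL as "no constraint"."""
    return None if value == "NULL" else value


def parse_rule(line: str) -> Rule:
    fields = [f.strip().strip('"“”') for f in line.split(",")]
    if len(fields) != 9:
        raise ValueError(f"expected 9 fields, got {len(fields)}")
    return Rule(int(fields[0]), _opt(fields[1]), _opt(fields[2]),
                _opt(fields[5]), _opt(fields[6]), _opt(fields[8]), fields)


def matches(rule: Rule, prev: Tuple[str, str], nxt: Tuple[str, str]) -> bool:
    """prev and nxt are (surface, part of speech) of the two boundary morphemes."""
    wanted = [rule.prev_surface, rule.prev_pos, rule.next_surface, rule.next_pos]
    actual = [prev[0], prev[1], nxt[0], nxt[1]]
    return all(w is None or w == a for w, a in zip(wanted, actual))


r4001 = parse_rule("4001,”警察”,名詞,NULL,NULL,NULL,格助詞,NULL,NULL")
r6001 = parse_rule("6001,NULL,格助詞,NULL,NULL,”押さえ”,一般動詞,NULL,NULL")
print(matches(r4001, ("警察", "名詞"), ("に", "格助詞")))        # split 警察 | に
print(matches(r6001, ("に", "格助詞"), ("押さえ", "一般動詞")))  # split に | 押さえ
```

The sketch compares parts of speech by exact string equality. A fuller matcher would presumably also accept subclasses and superclasses from the part-of-speech information dictionary.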

【０１１０】連用修飾結合処理および連用修飾分割処理
により得られた結果を、図９の「句１」の欄に示す。な
お、図９は、結合規則に、上述の6001番の条件が記述さ
れていない場合の処理結果を示している。The results of the adverbial-modification combination and division processing are shown in the "phrase 1" column of FIG. 9. Note that FIG. 9 shows the result for the case where condition 6001 above is not written in the rules.

【０１１１】連体修飾結合処理または連体修飾分割処理
においても、連用修飾結合処理または連用修飾分割処理
とそれぞれ同様の処理が、連体修飾結合規則または連体
修飾分割規則を参照して行われ、これにより図９の「句
２」の欄に示すような処理結果が得られる。The adnominal-modification combination and division processing proceed in the same way as the adverbial-modification combination and division processing, respectively, but with reference to the adnominal-modification combination rules and adnominal-modification division rules. This yields the result shown in the "phrase 2" column of FIG. 9.

【０１１２】ここで、形態素解析の結果出力される品詞
情報は、上述した図２乃至図４に示した品詞情報辞書に
記述されているものだけである。即ち、例えば品詞を細
分化した下位分類を記述しておけば、品詞情報としてそ
の下位分類を得ることができる。具体的には、図２に示
したように、例えば「普通名詞」の下位分類として、
「（副詞用法）」（副詞用法の普通名詞）を記述してお
けば、形態素がそれに該当する場合、その品詞情報とし
て「（副詞用法）」という下位分類が得られる。上位分
類についても同様で、図２に示したように、例えば「名
詞」の上位分類として、「自立語」を記述しておくこと
により、形態素が、その下位分類である、例えば「（副
詞用法）」に該当する場合、その品詞情報として「自立
語」という上位分類も得られる。Here, the part-of-speech information output by the morphological analysis is limited to what is written in the part-of-speech information dictionary shown in FIGS. 2 to 4. Thus, if a subclass that subdivides a part of speech is written in the dictionary, that subclass can be obtained as part-of-speech information. Specifically, as shown in FIG. 2, if "(adverbial usage)" (a common noun used adverbially) is written as a subclass of "common noun", a morpheme that falls under it receives the subclass "(adverbial usage)" as its part-of-speech information. The same holds for superclasses: as shown in FIG. 2, if "independent word" (自立語) is written as a superclass of "noun", a morpheme that falls under a subclass such as "(adverbial usage)" also receives the superclass "independent word" as part-of-speech information.
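One way to picture a dictionary that records subclasses and superclasses is a table of parent links, as in the sketch below. The table contents and the function name `part_of_speech_info` are illustrative assumptions. In particular, the link from 普通名詞 (common noun) to 名詞 (noun) is only implied by the text, and the real dictionary of FIGS. 2 to 4 is not reproduced here.

```python
from typing import Dict, List

# Parent links as they might be written in the dictionary (illustrative only).
SUPERCLASS: Dict[str, str] = {
    "（副詞用法）": "普通名詞",
    "普通名詞": "名詞",   # assumed link; the text only implies it
    "名詞": "自立語",
}


def part_of_speech_info(cls: str) -> List[str]:
    """Return the class itself plus every superclass recorded in the table."""
    chain = [cls]
    while chain[-1] in SUPERCLASS:
        chain.append(SUPERCLASS[chain[-1]])
    return chain


print(part_of_speech_info("（副詞用法）"))
```

A class that has no entry in the table yields only itself, which mirrors the statement that nothing beyond what is written in the dictionary can be obtained.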

【０１１３】このように品詞情報辞書に品詞情報を、直
接記述しておくことにより、形態素を分類する方法を、
以下、静的分類方法という。The method of classifying morphemes by writing the part-of-speech information directly in the part-of-speech information dictionary in this way is hereinafter called the static classification method.

【０１１４】ところで、例えば「逮捕」という名詞は、
「逮捕を」のように名詞としても用いられるし、「逮捕
する」のようにサ変化動詞としても用いられる。従っ
て、「逮捕」のみに注目すれば、その品詞は「名詞」と
いうことになるが、より自然な位置にポーズを挿入する
ためには、「逮捕」の後に、例えば「を」などの助詞が
続くか、あるいは例えば「する」などの活用語尾（サ変
活用語尾）が続くかによって、「逮捕」という形態素を
区別して分類する必要がある。Now, a noun such as "arrest" (逮捕) is used both as a noun, as in 逮捕を ("arrest" followed by the particle "o"), and as a sa-hen (sa-row irregular) verb, as in 逮捕する ("to arrest"). Looking at "arrest" alone, its part of speech is "noun". To insert pauses at more natural positions, however, the morpheme "arrest" must be classified differently depending on whether it is followed by a particle such as "o" or by an inflectional ending such as "suru" (a sa-hen inflectional ending).

【０１１５】そこで、静的分類方法により、その区別を
行う場合には、図２に示したように、上述したような
「逮捕」が、名詞であることを意味する品詞情報「名詞
・サ変」の他に、動詞であることを意味する品詞情報
「名詞・サ変［サ変化］」を、品詞情報辞書に記述して
おくようにすれば良い（［サ変化］は、「逮捕」をサ変
化動詞として認定したことを意味する）。To make this distinction with the static classification method, the part-of-speech information dictionary should contain, as shown in FIG. 2, not only the part-of-speech information "noun/sa-hen" (名詞・サ変), which means that "arrest" is a noun, but also the part-of-speech information "noun/sa-hen [sa change]" (名詞・サ変［サ変化］), which means that it is a verb. ([sa change] indicates that "arrest" has been recognized as a sa-hen verb.)

【０１１６】このようにした場合には、形態素解析部４
に、形態素「逮捕」の他、それに続く形態素に着目させ
ることにより、「サ変活用語尾」の形態素が続いている
ときに、その品詞情報として、「名詞」またはサ変化し
た「動詞」のうち、サ変化した「動詞」であることを意
味する品詞情報「名詞・サ変［サ変化］」を出力させる
ようにする。これにより、以降の処理では、「逮捕」
は、「名詞・サ変［サ変化］」の形態素として取り扱わ
れることになる。In this case, the morphological analysis unit 4 is made to look at the morpheme that follows "arrest" as well as at "arrest" itself. When a "sa-hen inflectional ending" morpheme follows, it outputs, of the two candidates "noun" and sa-hen-inflected "verb", the part-of-speech information "noun/sa-hen [sa change]", which denotes the sa-hen-inflected verb. In subsequent processing, "arrest" is then handled as a "noun/sa-hen [sa change]" morpheme.

【０１１７】次に、上述したような「逮捕」という形態
素の分類するには、それに続く形態素にも注目する必要
があることから、以下のようにして行うことも可能であ
る。即ち、形態素解析では、「逮捕」の品詞情報として
「名詞・サ変」が出力されるようにしておく（従って、
品詞情報辞書には、「名詞・サ変」は記述しておくが、
「名詞・サ変［サ変化］」は記述しておかないようにす
る）とともに、単独では「名詞」の「逮捕」の後に、
「サ変活用語尾」の形態素が続いていた場合には、それ
らを結合するように、結合規則を記述しておく。Because classifying the morpheme "arrest" in this way requires looking at the morpheme that follows it, the classification can also be done as follows. The morphological analysis outputs "noun/sa-hen" as the part-of-speech information of "arrest" (so the part-of-speech information dictionary contains "noun/sa-hen" but not "noun/sa-hen [sa change]"). In addition, a combination rule is written so that when "arrest", which is a "noun" on its own, is followed by a "sa-hen inflectional ending" morpheme, the two are combined.

【０１１８】そして、品詞情報「名詞・サ変」の形態素
「逮捕」と、それに続く「サ変活用語尾」の形態素と
が、結合規則（図６および図７）に基づいて結合された
ときに、「逮捕」という形態素が、サ変化する動詞であ
ることを示す付加情報（［サ変化］という付加情報）
を、結合結果（この場合は、結合結果を構成する「逮
捕」）に付加するようにする。Then, when the morpheme "arrest" with part-of-speech information "noun/sa-hen" and the following "sa-hen inflectional ending" morpheme are combined according to the combination rules (FIGS. 6 and 7), additional information indicating that "arrest" is a sa-hen-inflected verb (the additional information [sa change]) is attached to the combination result (in this case, to the "arrest" that is part of the result).

【０１１９】その後は、形態素解析の結果得られた品詞
情報である「名詞・サ変」と、上述したように付加され
た付加情報である［サ変化］とを加味し、「逮捕」の品
詞情報を、「名詞・サ変［サ変化］」として取り扱うよ
うにする。Thereafter, the part-of-speech information "noun/sa-hen" obtained from the morphological analysis and the additional information [sa change] attached as described above are taken together, and the part-of-speech information of "arrest" is handled as "noun/sa-hen [sa change]".

【０１２０】一方、品詞情報「名詞・サ変」の形態素
「逮捕」に続く形態素が、「サ変活用語尾」以外のもの
である場合には、たとえそれらが結合されても、特に付
加情報を付加しなければ、「逮捕」は、そのまま「名詞
・サ変」の形態素として取り扱われることになる。On the other hand, when the morpheme that follows "arrest" (part-of-speech information "noun/sa-hen") is anything other than a "sa-hen inflectional ending", then even if the two are combined, "arrest" continues to be handled as a "noun/sa-hen" morpheme unless additional information is attached.

【０１２１】この場合、「逮捕」は、それに後続する形
態素に対応して、付加情報が付加されたり、またはされ
なかったりすることにより、「名詞・サ変［サ変化］」
または「名詞・サ変」として取り扱われる。即ち、結合
する形態素の組み合わせにより、その結合結果の分類
は、動的に変化する。従って、以上の方法により形態素
を分類する方法は、上述の静的分類方法に対して、いわ
ば動的分類方法ということができる。In this case, "arrest" is handled as "noun/sa-hen [sa change]" or as "noun/sa-hen" depending on whether additional information is attached, which in turn depends on the morpheme that follows it. In other words, the classification of a combination result changes dynamically with the combination of morphemes being combined. In contrast to the static classification method described above, this way of classifying morphemes can therefore be called a dynamic classification method.

【０１２２】動的分類方法においては、最初は、名詞を
意味する「名詞・サ変」という分類は存在するが、動詞
を意味する「名詞・サ変［サ変化］」という分類は存在
せず、結合処理において［サ変化］という付加情報が付
加されることによって初めて、「名詞・サ変［サ変
化］」という分類が現れることになる。In the dynamic classification method, the class "noun/sa-hen", which denotes a noun, exists from the start, but the class "noun/sa-hen [sa change]", which denotes a verb, does not. It appears only when the additional information [sa change] is attached during combination processing.

【０１２３】図６および図７に示した結合規則の１単位
を構成する（９）のフラグ情報は、上述の付加情報に相
当し、例えば図９の「文節」の欄に示した、１つの文節
を構成するとして結合された形態素「逮捕」、「さ」、
「れ」、および「た」のうちの「逮捕」は、動的分類方
法により「名詞・サ変［サ変化］」に分類されている。The flag information in field (9) of each rule unit shown in FIGS. 6 and 7 corresponds to this additional information. For example, of the morphemes "arrest", "sa", "re", and "ta" (逮捕, さ, れ, た) that are combined into one bunsetsu in the "bunsetsu" column of FIG. 9, "arrest" is classified as "noun/sa-hen [sa change]" by the dynamic classification method.

【０１２４】即ち、まず、図５の形態素解析結果から、
「逮捕」または「さ」の品詞はそれぞれ、「名詞・サ
変」または「サ変活用語尾」であり、従って図６の3001
番の文節結合規則の条件を満たすので、文節を構成する
として（「逮捕」「さ」）とされる。即ち、「逮捕」＝
頭、「さ」＝尾とされる。That is, according to the morphological analysis result in FIG. 5, the parts of speech of "arrest" and "sa" are "noun/sa-hen" and "sa-hen inflectional ending", respectively. They therefore satisfy bunsetsu combination rule 3001 in FIG. 6 and are combined into ("arrest" "sa") as constituting a bunsetsu; that is, "arrest" = head and "sa" = tail.

【０１２５】この場合、3001番の（９）のフラグ情報
は、「頭［サ変化］」と記述されている。この「頭［サ
変化］」というのは、その条件（3001番の条件）を満足
する形態素どうしを結合して得られる文節のうちの、先
頭の形態素（「頭」とされる形態素）に、［サ変化］と
いう情報を付加することを意味する。なお、フラグ情報
が、「尾［サ変化］」と記述されていた場合、この「尾
［サ変化］」というのは、その条件を満足する形態素ど
うしの結合結果のうちの、末尾の形態素（「尾」とされ
る形態素）に、［サ変化］という情報を付加することを
意味する。In this case, the flag information in field (9) of rule 3001 is written as "head [sa change]" (頭［サ変化］). This means that the information [sa change] is attached to the leading morpheme (the morpheme labeled "head") of the bunsetsu obtained by combining the morphemes that satisfy the condition (condition 3001). If the flag information were written as "tail [sa change]" (尾［サ変化］), it would mean that [sa change] is attached to the trailing morpheme (the morpheme labeled "tail") of the combination result.
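The effect of a flag specification such as 頭［サ変化］ can be sketched as follows. This is an illustration under assumptions, not the patent's data structures: the `Morpheme` class, the regular expression for the specification, and the `combine` helper are hypothetical, and a unit is simplified to a flat list of morphemes.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Morpheme:
    surface: str
    pos: str
    flags: List[str] = field(default_factory=list)

    def pos_info(self) -> str:
        """Part of speech from morphological analysis plus any attached flags."""
        return self.pos + "".join(f"［{f}］" for f in self.flags)


_SPEC = re.compile(r"^(頭|尾)［(.+)］$")


def parse_flag_spec(spec: str) -> Tuple[str, str]:
    """Split e.g. '頭［サ変化］' into ('頭', 'サ変化')."""
    m = _SPEC.match(spec)
    if m is None:
        raise ValueError(f"unrecognized flag specification: {spec!r}")
    return m.group(1), m.group(2)


def combine(left: List[Morpheme], right: List[Morpheme],
            flag_spec: Optional[str] = None) -> List[Morpheme]:
    """Join two units; attach the flag to the head or tail of the result."""
    unit = left + right
    if flag_spec is not None:
        where, flag = parse_flag_spec(flag_spec)
        target = unit[0] if where == "頭" else unit[-1]
        target.flags.append(flag)
    return unit


taiho = Morpheme("逮捕", "名詞・サ変")
sa = Morpheme("さ", "サ変活用語尾")
unit = combine([taiho], [sa], "頭［サ変化］")   # flag field of rule 3001, per the text
print(unit[0].pos_info())
```

When a combination carries no flag specification, the part-of-speech information of "arrest" stays 名詞・サ変, which is the static result of the morphological analysis.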

【０１２６】従って、この場合、結合結果である（「逮
捕」「さ」）の先頭の形態素「逮捕」の品詞情報は、
「名詞・サ変化」から「名詞・サ変［サ変化］」に変更
される。Therefore, in this case, the part-of-speech information of the leading morpheme "arrest" of the combination result ("arrest" "sa") is changed from "noun/sa-hen" to "noun/sa-hen [sa change]".

【０１２７】なお、以下では、（「逮補」「さ」））を
代表する末尾の「さ」と隣接する「れ」が、図６の3006
番の条件を満たすので、（（「逮捕」「さ」）「れ」）
とされ、さらに（（「逮捕」「さ」）「れ」）を代表す
る末尾の「れ」とそれに隣接する「た」が、図６の3010
番の条件を満たすので（（（「逮捕」「さ」）「れ」）
「た」）と結合される。即ち、「逮捕」、「さ」、
「れ」、「た」は、それぞれ「頭」、「胴」、「胴」、
「尾」とされる。Subsequently, the trailing "sa", which represents ("arrest" "sa"), and the adjacent "re" satisfy condition 3006 in FIG. 6, giving (("arrest" "sa") "re"). The trailing "re", which represents (("arrest" "sa") "re"), and the adjacent "ta" then satisfy condition 3010 in FIG. 6, giving ((("arrest" "sa") "re") "ta"). That is, "arrest", "sa", "re", and "ta" become "head", "body", "body", and "tail", respectively.

【０１２８】ポーズ設定規則および結合規則は、必要に
応じて、フラグ情報を用いて記述されており、形態素
「逮捕」に付加されたフラグ情報は、その後、「逮捕」
に対し、ポーズ設定規則および結合規則を適用するにあ
たって加味される。即ち、文節を構成するとして同定さ
れた形態素「逮捕」の前に隣接する形態素が、例えば
「で」などの「助詞」である場合、形態素「で」または
「逮捕」の品詞は、それぞれ「助詞」または「名詞・サ
変［サ変化］」であるから、図７に示した5001番の連用
修飾結合条件を満足し、従って、これらは連用修飾連文
節を構成するとして結合される。The pause setting rules and combination rules are written using flag information where necessary, and the flag information attached to the morpheme "arrest" is taken into account when those rules are later applied to "arrest". For example, suppose the morpheme immediately preceding "arrest", which has been identified as part of a bunsetsu, is a "particle" such as "de" (で). The parts of speech of "de" and "arrest" are then "particle" and "noun/sa-hen [sa change]", respectively, so they satisfy adverbial-modification combination condition 5001 in FIG. 7 and are combined as constituting an adverbial-modification bunsetsu sequence.

【０１２９】なお、形態素「逮捕」に、［サ変化］とい
うフラグ情報が付加されていない場合には、形態素「逮
捕」の品詞は、「名詞・サ変」であるから、図７に示し
た5001番の連用修飾結合条件を満足せず、従って、これ
らは結合されないこととなる。If the flag information [sa change] has not been attached to "arrest", its part of speech is "noun/sa-hen", so adverbial-modification combination condition 5001 in FIG. 7 is not satisfied and the two are not combined.
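The contrast between the flagged and unflagged cases can be sketched as a rule that requires a particular flag on the following morpheme. The contents of `RULE_5001` below are a hypothetical stand-in, since the actual condition 5001 is defined in FIG. 7 and not reproduced in the text. `Morph` and `satisfies` are likewise names invented for this example.

```python
from typing import FrozenSet, NamedTuple


class Morph(NamedTuple):
    surface: str
    pos: str
    flags: FrozenSet[str] = frozenset()


# Hypothetical stand-in for condition 5001 (the real rule is in FIG. 7).
RULE_5001 = {
    "prev_pos": "助詞",
    "next_pos": "名詞・サ変",
    "next_flags": frozenset({"サ変化"}),
}


def satisfies(rule: dict, prev: Morph, nxt: Morph) -> bool:
    """True when both parts of speech match and every required flag is present."""
    return (prev.pos == rule["prev_pos"]
            and nxt.pos == rule["next_pos"]
            and rule["next_flags"] <= nxt.flags)


de = Morph("で", "助詞")
print(satisfies(RULE_5001, de, Morph("逮捕", "名詞・サ変", frozenset({"サ変化"}))))
print(satisfies(RULE_5001, de, Morph("逮捕", "名詞・サ変")))
```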

【０１３０】フラグ情報は、形態素に対し、［サ変化］
という品詞情報を追加する場合だけでなく、その他の品
詞情報（あるいは、品詞情報以外の情報）を追加する場
合にも利用することができる。Flag information can be used not only to add the part-of-speech information [sa change] to a morpheme but also to add other part-of-speech information (or information other than part-of-speech information).

【０１３１】即ち、例えば図１１（ａ）に示すような入
力文「決して逮捕されないような」からは、同図（ｂ）
に示すような形態素解析結果が得られる。なお、図１１
（ｂ）（後述する図１２（ｂ）も同様）における品詞情
報の（）内は、図２乃至図４には図示していないが、そ
の下位分類を示している。For example, the input sentence 決して逮捕されないような (roughly, "such as is never arrested") shown in FIG. 11(a) yields the morphological analysis result shown in FIG. 11(b). In FIG. 11(b) (and likewise in FIG. 12(b), described later), the text in parentheses in the part-of-speech information indicates a subclass that is not shown in FIGS. 2 to 4.

【０１３２】まず、「副詞（叙述副詞）」である「決し
て」と、「名詞・サ変」である「逮捕」とは、図６およ
び図７に示した結合条件のいずれも満たさず、結合され
ない。次に、形態素「逮捕」と「さ」は、上述したよう
に（「逮捕」「さ」）と結合され、「逮捕」には、［サ
変化］というフラグ情報が付加される。即ち、この段階
で、「逮捕」は、「名詞・サ変［サ変化］」とされる。
そして、（「逮捕」「さ」）と「れ」も、上述したよう
に結合され、（（「逮捕」「さ」）「れ」）とされる。First, "never" (決して), an "adverb (declarative adverb)", and "arrest", a "noun/sa-hen", satisfy none of the combination conditions in FIGS. 6 and 7 and are not combined. Next, the morphemes "arrest" and "sa" are combined into ("arrest" "sa") as described above, and the flag information [sa change] is attached to "arrest". At this stage, "arrest" becomes "noun/sa-hen [sa change]". ("arrest" "sa") and "re" are then combined as described above into (("arrest" "sa") "re").

【０１３３】次に、（（「逮捕」「さ」）「れ」）を代
表する「れ」と、それに続く「ない」は、図１１（ｂ）
の形態素解析結果から、図６の3009番の条件を満たし、
従って（（（「逮捕」「さ」）「れ」）「ない」）とさ
れる。さらに、この場合、3009番の（９）のフラグ情報
には、「頭［ない］」が記述されているので、結合結果
（（（「逮捕」「さ」）「れ」）「ない」）の先頭の形
態素「逮捕」には、［ない］というフラグ情報が付加さ
れる。Next, "re", which represents (("arrest" "sa") "re"), and the following "nai" (ない, "not") satisfy condition 3009 in FIG. 6 according to the morphological analysis result in FIG. 11(b), giving ((("arrest" "sa") "re") "nai"). In addition, the flag information in field (9) of rule 3009 is written as "head [nai]" (頭［ない］), so the flag information [nai] is attached to the leading morpheme "arrest" of the combination result ((("arrest" "sa") "re") "nai").

【０１３４】なお、フラグ情報は、結合結果の先頭（ま
たは末尾）の形態素が確定した後に、形態素を結合して
いく各段階で順次付加するようにしても良いし、また、
例えば次のようにして付加することも可能である。即
ち、結合結果の先頭（または末尾）の形態素が確定した
後、フラグ情報を、例えばスタックなどに記憶しておい
て、最終的な結合結果が得られた後に、記憶しておいた
フラグ情報をまとめて付加するようにしても良い。Flag information may be attached one item at a time, at each stage of combining morphemes, once the leading (or trailing) morpheme of the combination result has been determined. Alternatively, once the leading (or trailing) morpheme has been determined, the flag information may be stored, for example on a stack, and attached all together after the final combination result has been obtained.

【０１３５】従って、（（（「逮捕」「さ」）「れ」）
「ない」）を構成する「逮捕」の品詞情報は、「名詞・
サ変［サ変化］［ない］」となる。Therefore, the part-of-speech information of "arrest" in ((("arrest" "sa") "re") "nai") becomes "noun/sa-hen [sa change] [nai]" (名詞・サ変［サ変化］［ない］).
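The two attachment strategies mentioned above (attach at each step, or store the flags and attach them together at the end) can be compared with a small sketch. The step list encodes the 逮捕 / さ / れ / ない example from the text. The function names and the use of a plain list as the store are assumptions for illustration.

```python
from typing import List, Optional, Tuple

# (morpheme appended at this step, flag destined for the head morpheme or None)
Step = Tuple[str, Optional[str]]

STEPS: List[Step] = [("さ", "サ変化"), ("れ", None), ("ない", "ない")]


def render(pos: str, flags: List[str]) -> str:
    return pos + "".join(f"［{f}］" for f in flags)


def attach_immediately(pos: str, steps: List[Step]) -> List[str]:
    """Head's part-of-speech information after every combination step."""
    flags: List[str] = []
    history = []
    for _surface, flag in steps:
        if flag is not None:
            flags.append(flag)
        history.append(render(pos, flags))
    return history


def attach_deferred(pos: str, steps: List[Step]) -> List[str]:
    """Same steps, but flags are stored and attached only at the very end."""
    pending: List[str] = []
    history = []
    for _surface, flag in steps:
        if flag is not None:
            pending.append(flag)           # remember, do not attach yet
        history.append(render(pos, []))
    history.append(render(pos, pending))   # attach everything together
    return history


print(attach_immediately("名詞・サ変", STEPS)[-1])
print(attach_deferred("名詞・サ変", STEPS)[-1])
```

Both strategies end with the same part-of-speech information for the head. They differ only in what is visible at intermediate steps, which matters if a rule applied in between depends on a flag.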

【０１３６】次に、（（（「逮捕」「さ」）「れ」）
「ない」）を代表する「ない」と、それに続く「よう
な」は、図１１（ｂ）の形態素解析結果から、図６の30
10番の条件を満たし、従って（（（（「逮捕」「さ」）
「れ」）「ない」）「ような」）とされる。Next, "nai", which represents ((("arrest" "sa") "re") "nai"), and the following "yōna" (ような) satisfy condition 3010 in FIG. 6 according to the morphological analysis result in FIG. 11(b), giving (((("arrest" "sa") "re") "nai") "yōna").

【０１３７】図１１（ａ）に示した入力文は、以上のよ
うにして文節「決して」と、（（（（「逮捕」「さ」）
「れ」）「ない」）「ような」）とに同定される。そし
て、この文節（形態素）「決して」と、（（（（「逮
捕」「さ」）「れ」）「ない」）「ような」）を代表す
る（形態素）「逮捕」とは、「逮捕」の品詞情報が「名
詞・サ変［サ変化］［ない］」とされているから、図７
の5003番の連用修飾結合規則を満し、従って、この段階
で連用修飾連文節を構成するとして結合され、図１１
（ｃ）に示すように、（「決して」（（（（「逮捕」
「さ」）「れ」）「ない」）「ような」））とされる。In this way, the input sentence in FIG. 11(a) is identified as the bunsetsu "never" and (((("arrest" "sa") "re") "nai") "yōna"). Because the part-of-speech information of "arrest" is "noun/sa-hen [sa change] [nai]", the bunsetsu (morpheme) "never" and the morpheme "arrest", which represents (((("arrest" "sa") "re") "nai") "yōna"), satisfy adverbial-modification combination rule 5003 in FIG. 7. They are therefore combined at this stage as constituting an adverbial-modification bunsetsu sequence, giving ("never" (((("arrest" "sa") "re") "nai") "yōna")) as shown in FIG. 11(c).

【０１３８】入力文「決して逮捕されないような」にお
いて、「決して」と係り受け関係にあるのは、否定を表
す「ない」であり、この「ない」の文法的性質が、［な
い］というフラグ情報によって、「逮捕」に反映される
ことにより、以上のように、「決して」と、
（（（（「逮捕」「さ」）「れ」）「ない」）「よう
な」）とが連用修飾連文節を構成するとして結合される
ことになる。In the input sentence 決して逮捕されないような, the word that "never" depends on is the negative "nai". Because the grammatical property of "nai" is reflected in "arrest" through the flag information [nai], "never" and (((("arrest" "sa") "re") "nai") "yōna") are combined as constituting an adverbial-modification bunsetsu sequence, as described above.

【０１３９】次に、例えば図１２（ａ）に示すような入
力文「３キロメートル先の箱崎インターを」からは、同
図（ｂ）に示すような形態素解析結果が得られる。この
形態素解析結果から、「３」と「キロメール」は、図６
の1002番の複合語結合規則を満たし、（「３」「キロメ
ートル」）とされる。そして、（「３」「キロメート
ル」）を代表する「キロメートル」と、それに隣接する
「先」は、図６の1003番の複合語結合規則を満たし、
（（「３」「キロメートル」）「先」）とされる。さら
に、この場合、1003番の複合語結合規則のフラグ情報
は、「尾［距離方向］」とされているから、結合の結果
得られた複合語（（「３」「キロメートル」）「先」）
の末尾の形態素「先」に［距離方向］というフラグ情報
が付加される。Next, the input sentence ３キロメートル先の箱崎インターを ("the Hakozaki interchange 3 kilometers ahead", followed by the particle "o") shown in FIG. 12(a) yields the morphological analysis result shown in FIG. 12(b). According to this result, "3" and "kilometers" satisfy compound-word combination rule 1002 in FIG. 6, giving ("3" "kilometers"). "Kilometers", which represents ("3" "kilometers"), and the adjacent "ahead" (先) then satisfy compound-word combination rule 1003 in FIG. 6, giving (("3" "kilometers") "ahead"). Moreover, the flag information of rule 1003 is "tail [distance direction]" (尾［距離方向］), so the flag information [distance direction] is attached to the trailing morpheme "ahead" of the resulting compound word (("3" "kilometers") "ahead").

【０１４０】その結果、（（「３」「キロメートル」）
「先」）を代表する「先」と、それに続く「の」は、図
６の3003番の文節結合規則を満たし、従って文節を構成
するとして結合され、（（（「３」「キロメートル」）
「先」）「の」）とされる。さらに、この場合、3003番
の文節結合規則のフラグ情報は、「尾［距離方向］」と
されているから、結合の結果得られた文節（（（「３」
「キロメートル」）「先」）「の」）の末尾の形態素
「の」に［距離方向］というフラグ情報が付加される。
即ち、この場合、形態素「先」に付加されたフラグ情報
が、形態素「の」に、いわば継承付加され、「の」の品
詞情報は、「格助詞［距離方向］」となる。As a result, "ahead", which represents (("3" "kilometers") "ahead"), and the following "no" (の) satisfy bunsetsu combination rule 3003 in FIG. 6 and are combined as constituting a bunsetsu, giving ((("3" "kilometers") "ahead") "no"). The flag information of rule 3003 is also "tail [distance direction]", so the flag information [distance direction] is attached to the trailing morpheme "no" of the resulting bunsetsu. In effect, the flag information attached to "ahead" is inherited by "no", and the part-of-speech information of "no" becomes "case particle [distance direction]".

【０１４１】一方、「箱崎インター」と「を」は、図１
２（ｂ）の形態素解析結果から、図６の3004番の文節結
合規則を満たし、従って文節を構成するとして結合さ
れ、（「箱崎インター」「を」）とされる。Meanwhile, "Hakozaki interchange" (箱崎インター) and "o" (を) satisfy bunsetsu combination rule 3004 in FIG. 6 according to the morphological analysis result in FIG. 12(b), and are combined as constituting a bunsetsu, giving ("Hakozaki interchange" "o").

【０１４２】文節（（（「３」「キロメートル」）
「先」）「の」）と、（「箱崎インター」「を」）との
関係を考えた場合、（（（「３」「キロメートル」）
「先」）「の」）または（「箱崎インター」「を」）を
代表する形態素「の」または「箱崎インター」の品詞情
報は、それぞれ「格助詞［距離方向］」または「固有名
詞（地名）」であるから、これらは、図７の7003番の連
体修飾結合規則を満たす。その結果、（（（「３」「キ
ロメートル」）「先」）「の」）と、（「箱崎インタ
ー」「を」）とは、連体修飾連文節を構成するとして結
合され、（（（（「３」「キロメートル」）「先」）
「の」）（「箱崎インター」「を」））とされる。Consider the relationship between the bunsetsu ((("3" "kilometers") "ahead") "no") and ("Hakozaki interchange" "o"). The part-of-speech information of the morphemes that represent them, "no" and "Hakozaki interchange", is "case particle [distance direction]" and "proper noun (place name)", respectively, so they satisfy adnominal-modification combination rule 7003 in FIG. 7. As a result, the two bunsetsu are combined as constituting an adnominal-modification bunsetsu sequence, giving (((("3" "kilometers") "ahead") "no") ("Hakozaki interchange" "o")).

【０１４３】しかしながら、形態素「の」と「箱崎イン
ター」は、図７の8001番の連体修飾分割規則も満たすか
ら、連体修飾連文節を構成するとして結合された
（（（（「３」「キロメートル」）「先」）「の」）
（「箱崎インター」「を」））は、元の文節
（（（「３」「キロメートル」）「先」）「の」）と、
（「箱崎インター」「を」）とに分割される。However, the morphemes "no" and "Hakozaki interchange" also satisfy adnominal-modification division rule 8001 in FIG. 7. The unit (((("3" "kilometers") "ahead") "no") ("Hakozaki interchange" "o")), which was combined as an adnominal-modification bunsetsu sequence, is therefore split back into the original bunsetsu ((("3" "kilometers") "ahead") "no") and ("Hakozaki interchange" "o").

【０１４４】従って、その結合処理結果は、図１２
（ｃ）に示すようになる。The result of the combination processing is therefore as shown in FIG. 12(c).

【０１４５】即ち、この場合、形態素「の」には、［距
離方向］というフラグ情報が付加されているため、通常
の格助詞と名詞の関係とは異なり、（（（「３」「キロ
メートル」）「先」）「の」）と、（「箱崎インター」
「を」）とは連体修飾連文節を構成するとして結合され
ない。このように、フラグ情報を設けておくことによ
り、特定の形態素の前または後には、その他の特定の形
態素を結合しないという、いわば例外処理が可能とな
る。In this case, because the flag information [distance direction] is attached to the morpheme "no", ((("3" "kilometers") "ahead") "no") and ("Hakozaki interchange" "o") are not combined as an adnominal-modification bunsetsu sequence, unlike the usual relationship between a case particle and a noun. Providing flag information thus makes a kind of exception handling possible, in which a particular morpheme is not combined with certain other morphemes before or after it.

【０１４６】後述するように、連体修飾連文節を構成す
るとして結合されなかった（（（「３」「キロメート
ル」）「先」）「の」）と、（「箱崎インター」
「を」）との間、即ち連体修飾連文節間は、ポーズを設
定する候補とされる。即ち、「３キロメートル先の」
と、「箱崎インターを」との間にはポーズが挿入される
可能性が高くなる。As described later, the boundary between ((("3" "kilometers") "ahead") "no") and ("Hakozaki interchange" "o"), which were not combined into an adnominal-modification bunsetsu sequence, that is, a boundary between adnominal-modification bunsetsu sequences, becomes a candidate position for a pause. A pause is therefore more likely to be inserted between ３キロメートル先の ("3 kilometers ahead") and 箱崎インターを ("the Hakozaki interchange").
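The combine-then-split exception for the "3 kilometers ahead" example can be sketched as two predicates, where a boundary that does not stay combined becomes a pause candidate. The rule bodies below are hypothetical stand-ins written only to reproduce the behavior described in the text. The real rules 7003 and 8001 are defined in FIG. 7, and the names `Rep`, `combine_7003`, `split_8001`, and `stays_combined` are invented here.

```python
from typing import FrozenSet, NamedTuple


class Rep(NamedTuple):
    """Representative morpheme of a bunsetsu at a boundary."""
    surface: str
    pos: str
    flags: FrozenSet[str] = frozenset()


def combine_7003(prev: Rep, nxt: Rep) -> bool:
    # Hypothetical: a case particle followed by a place name may be combined.
    return prev.pos == "格助詞" and nxt.pos == "固有名詞（地名）"


def split_8001(prev: Rep, nxt: Rep) -> bool:
    # Hypothetical: undo the combination when the particle carries ［距離方向］.
    return "距離方向" in prev.flags and nxt.pos == "固有名詞（地名）"


def stays_combined(prev: Rep, nxt: Rep) -> bool:
    """Combination survives only if the combine rule fires and no split rule does."""
    return combine_7003(prev, nxt) and not split_8001(prev, nxt)


hakozaki = Rep("箱崎インター", "固有名詞（地名）")
no_distance = Rep("の", "格助詞", frozenset({"距離方向"}))
no_plain = Rep("の", "格助詞")

print(stays_combined(no_distance, hakozaki))  # boundary remains a pause candidate
print(stays_combined(no_plain, hakozaki))     # ordinary particle + noun stays joined
```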

【０１４７】従って、本発明を、例えばカーナビゲーシ
ョンシステムにおける目的地案内のための音声合成処理
に適用した場合には、「３キロメートル先の」などのよ
うな距離を表す文節と、「箱崎インターを」などのよう
な目的地に対応する地名を表す文節との間に、ポーズが
挿入され易くなる。本願発明者が行った実験によれば、
距離を表す文節と、目的地に対応する地名を表す文節と
の間にポーズが挿入される場合には、その合成音が聞き
取り易いものになることがわかっている。Therefore, when the present invention is applied to speech synthesis for destination guidance in, for example, a car navigation system, a pause is readily inserted between a bunsetsu expressing a distance, such as "3 kilometers ahead", and a bunsetsu expressing the place name of a destination, such as "the Hakozaki interchange". According to experiments conducted by the inventor, synthesized speech is easier to understand when a pause is inserted between the bunsetsu expressing the distance and the bunsetsu expressing the place name of the destination.

【０１４８】結合処理部５における処理、即ち図１０の
ステップＳ１乃至Ｓ９の処理が終了し、これにより入力
文の連文節の同定がなされた後、ステップＳ１０に進
み、ポーズ設定処理部６において、まずポーズ優先度処
理が行われる。When the processing in the combination processing unit 5, that is, steps S1 to S9 in FIG. 10, is complete and the bunsetsu sequences of the input sentence have been identified, the process proceeds to step S10, where the pause setting processing unit 6 first performs pause priority processing.

【０１４９】即ち、ポーズ設定処理部６では、図９の
「句２」の欄が「孤」または「尾」とされた形態素と、
それに続く（その次の）形態素との間（連体修飾連文節
間）、および形態素が「の」で、図９の「句１」の欄が
「尾」とされたものと、その直後の形態素との間に、図
８に示したポーズ優先度規則が適用される。That is, the pause setting processing unit 6 applies the pause priority rules shown in FIG. 8 at two kinds of boundary. The first is between a morpheme whose "phrase 2" column in FIG. 9 is "isolated" (孤) or "tail" and the morpheme that follows it (a boundary between adnominal-modification bunsetsu sequences). The second is between a morpheme "no" (の) whose "phrase 1" column in FIG. 9 is "tail" and the morpheme immediately after it.

【０１５０】なお、ポーズ優先度規則の適用対象は、図
９の「句２」の欄が「孤」または「尾」とされた形態素
と、それに続く（その次の）形態素との間（連体修飾連
文節間）だけでも良いが、特に入力文中の１文の拍数が
多い場合には、形態素が「の」で、図９の「句１」の欄
が「尾」とされたものと、その直後の形態素との間に
も、ポーズ優先度規則を適用した方が、より自然で聞き
取り易い合成音が得られることが、シミュレーションに
よりわかっている。The pause priority rules may be applied only between a morpheme whose "phrase 2" column in FIG. 9 is "isolated" or "tail" and the morpheme that follows it (between adnominal-modification bunsetsu sequences). However, simulation has shown that, particularly when a sentence in the input has many beats, also applying the pause priority rules between a morpheme "no" whose "phrase 1" column in FIG. 9 is "tail" and the morpheme immediately after it yields synthesized speech that is more natural and easier to understand.
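The choice of boundaries at which the pause priority rules are applied can be sketched as below. The dictionary keys (`surface`, `phrase1`, `phrase2`) and the sample labels are made up for illustration and are not taken from FIG. 9. The `include_no` switch corresponds to the optional second kind of boundary described above.

```python
from typing import Dict, List


def priority_rule_sites(morphemes: List[Dict[str, str]],
                        include_no: bool = True) -> List[int]:
    """Indices i such that the rules are applied between morpheme i and i + 1."""
    sites = []
    for i, m in enumerate(morphemes[:-1]):   # the last morpheme has no successor
        ends_phrase2 = m["phrase2"] in ("孤", "尾")
        no_ends_phrase1 = (include_no and m["surface"] == "の"
                           and m["phrase1"] == "尾")
        if ends_phrase2 or no_ends_phrase1:
            sites.append(i)
    return sites


# Hypothetical labels, chosen only to exercise both conditions.
sample = [
    {"surface": "先", "phrase1": "胴", "phrase2": "胴"},
    {"surface": "の", "phrase1": "尾", "phrase2": "胴"},
    {"surface": "箱崎インター", "phrase1": "頭", "phrase2": "胴"},
    {"surface": "を", "phrase1": "尾", "phrase2": "尾"},
    {"surface": "右", "phrase1": "孤", "phrase2": "孤"},
]
print(priority_rule_sites(sample))
print(priority_rule_sites(sample, include_no=False))
```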

【０１５１】ポーズ優先度規則の適用の結果、例えば、
「孤」とされた形態素「昨夜」と、その次の「頭」とさ
れた形態素「男」は、図５の形態素解析結果から、図８
の12001番の優先度２の条件を満たすので、ポーズを挿
入する位置の候補であることを示す優先度２のフラグが
「昨夜」にたてられる。As a result of applying the pause priority rules, for example, the morpheme 昨夜 ("last night"), marked "isolated", and the following morpheme 男 ("man"), marked "head", satisfy condition No. 12001 of priority 2 in FIG. 8 according to the morphological analysis result of FIG. 5, so a priority-2 flag, indicating a candidate position for pause insertion, is set on 昨夜.

【０１５２】また、例えば「尾」とされた形態素「は」
と、その次の「頭」とされた形態素「店員」は、図５の
形態素解析結果から、図８の優先度３の13001番の条件
を満たすので、優先度３のフラグが「は」にたてられ
る。以下同様にして、図９の「間」の欄に示すようにフ
ラグ（優先度フラグ）がたてられる。なお、図９の
「間」の欄には、フラグがたてられた位置に、優先度を
示す数字を示してある。Similarly, the morpheme は (topic particle "wa"), marked "tail", and the following morpheme 店員 ("clerk"), marked "head", satisfy condition No. 13001 of priority 3 in FIG. 8 according to the morphological analysis result of FIG. 5, so a priority-3 flag is set on は. Flags (priority flags) are set in the same manner, as shown in the 間 ("interval") column of FIG. 9, where the number at each flagged position indicates its priority.

【０１５３】さらに、ポーズ優先度処理では、優先度フ
ラグが立てられた位置の間の拍数の小計が求められる。
但し、入力文中の最初と最後の優先度フラグについて
は、入力文の先頭から最初の優先度フラグまでの拍数の
小計が、また最後の優先度フラグから入力文の最後まで
の拍数の小計が、それぞれ求められる。Further, the pause priority processing computes subtotals of the number of beats between the positions at which priority flags are set. For the first and last priority flags in the input sentence, it computes the subtotal from the beginning of the input sentence to the first priority flag and the subtotal from the last priority flag to the end of the input sentence, respectively.

【０１５４】図９の右端の「拍」の欄の数字は、この小
計値を表している。例えば、「昨夜」（３拍）について
は、その拍数である３拍となり、例えば「は」（１拍）
については、「男」（３拍）との小計で４拍となり、ま
た、例えば「られ」（２拍）については、「店員」（４
拍）からの小計で１０拍となる。The numbers in the 拍 ("beats") column at the right end of FIG. 9 are these subtotals. For example, 昨夜 ("last night", 3 beats) has a subtotal of 3 beats, its own beat count; は ("wa", 1 beat) has a subtotal of 4 beats, counted together with 男 ("man", 3 beats); and られ ("rare", 2 beats) has a subtotal of 10 beats, counted from 店員 ("clerk", 4 beats).
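The subtotal calculation described above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the implementation of the embodiment: the `Morpheme` type, the `beat_subtotals` function, the placeholder entry standing in for the morphemes between 店員 and られ, and the priority value given to られ are all hypothetical. Only the beat counts of 昨夜, 男, は, 店員, and られ and the subtotals 3, 4, and 10 are taken from the text.

```python
from typing import List, NamedTuple, Optional, Tuple


class Morpheme(NamedTuple):
    surface: str                    # morpheme character string
    beats: int                      # number of beats (morae)
    priority: Optional[int] = None  # pause priority flag, None if unflagged


def beat_subtotals(morphemes: List[Morpheme]) -> Tuple[List[int], int]:
    """Return the beat subtotal at each flagged morpheme, in order,
    plus the beats remaining after the last flag up to the sentence end."""
    subtotals = []
    running = 0
    for m in morphemes:
        running += m.beats
        if m.priority is not None:
            subtotals.append(running)  # beats since the previous flag (or sentence start)
            running = 0
    return subtotals, running


# Opening part of the FIG. 9 example. The placeholder (4 beats) stands in for
# the morphemes between 店員 and られ, which this section does not list.
sentence_head = [
    Morpheme("昨夜", 3, priority=2),
    Morpheme("男", 3),
    Morpheme("は", 1, priority=3),
    Morpheme("店員", 4),
    Morpheme("(placeholder)", 4),
    Morpheme("られ", 2, priority=1),  # priority value assumed for illustration
]

print(beat_subtotals(sentence_head))  # ([3, 4, 10], 0)
```

Resetting the running count at each flagged morpheme is what makes every subtotal span exactly one stretch between adjacent candidate positions.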

【０１５５】ポーズ優先度処理の後、ステップＳ１１に
進み、ポーズ絞り込み規則にしたがってポーズ絞り込み
処理が行われ、優先度フラグがたてられた、ポーズを挿
入する位置（フラグがたてられた形態素と、その次の形
態素との間）の候補の中から、最終的なポーズの設定位
置が決定される。After the pause priority processing, the process proceeds to step S11, where pause narrowing processing is performed according to the pause narrowing rules, and the final pause positions are selected from the candidate insertion positions at which priority flags were set (each lying between a flagged morpheme and the morpheme that follows it).

【０１５６】ここで、ポーズ絞り込み規則は、実際に、
アナウンサに新聞を朗読させたときに、そのアナウンサ
が、文中の意味の切れ目にポーズをおく朗読方法をモデ
ル化したものである。なお、このポーズ絞り込み規則
も、図８に示したポーズ優先度規則と同様に、いわばテ
ーブル形式で記述されている。The pause narrowing rules model the way an announcer, when actually asked to read a newspaper aloud, places pauses at breaks in meaning within a sentence. Like the pause priority rules shown in FIG. 8, the pause narrowing rules are also described in what may be called table form.

【０１５７】図１３および図１４は、ポーズ絞り込み処
理の詳細を説明するフローチャートを示している。な
お、このフローチャートは、ポーズ絞り込み規則を反映
したものとしてある。即ち、このフローチャートには、
ポーズ絞り込み規則が含まれている（但し、実際には、
上述したように、テーブル形式で記述された条件が用意
されており、その条件を満たすか否かがポーズ絞り込み
処理（ステップＳ１１）において判断される）。FIGS. 13 and 14 are flowcharts explaining the pause narrowing processing in detail. These flowcharts reflect the pause narrowing rules; that is, the rules are embodied in the flowcharts. (In practice, as described above, conditions described in table form are prepared, and the pause narrowing processing of step S11 determines whether those conditions are satisfied.)

【０１５８】ポーズ絞り込み処理では、まず最初に、図
１３のステップＳ２１において、例えば優先度１および
２の優先度フラグがたてられた形態素の直後にポーズが
設定される。なお、優先度１または２の優先度フラグが
たてられた形態素（の直後）は、節境界または句境界
に、それぞれ相当する。また、節境界に設定されたポー
ズの長さは、比較的長くされ、句境界に設定されたポー
ズの長さは、節境界の場合よりも幾分短くされる。In the pause narrowing processing, first, in step S21 of FIG. 13, a pause is set immediately after each morpheme on which a priority flag of, for example, priority 1 or 2 is set. The position immediately after a morpheme flagged with priority 1 corresponds to a clause boundary, and that after a morpheme flagged with priority 2 corresponds to a phrase boundary. A pause set at a clause boundary is made relatively long, and a pause set at a phrase boundary is made somewhat shorter than one at a clause boundary.

【０１５９】その後、ステップＳ２２に進み、所定の判
定処理が行われる。即ち、ステップＳ２２では、その判
定処理の対象とされていない（未判定の）、例えば優先
度３または４の優先度フラグがたてられた形態素が存在
するか否かが判定される。ステップＳ２２において、未
判定の優先度３または４の優先度フラグがたてられた形
態素が存在すると判定された場合、ステップＳ２３に進
み、その形態素（以下、適宜、注目形態素という）の位
置（その形態素の直後）にポーズを設定した結果、ポー
ズ間の拍数が、基準拍数範囲Ａ内に収まるか否かが判定
される。ステップＳ２３において、ポーズ間の拍数が、
基準拍数範囲Ａ内に収まらないと判定された場合、ステ
ップＳ２４をスキップして、ステップＳ２２に戻る。即
ち、この場合、ポーズの設定（追加設定）は行われな
い。The process then proceeds to step S22, which determines whether there remains a morpheme flagged with, for example, priority 3 or 4 that has not yet been examined (an undetermined morpheme). If step S22 finds such a morpheme, the process proceeds to step S23, which determines whether setting a pause at the position of that morpheme (hereinafter referred to as the morpheme of interest where appropriate), that is, immediately after it, would make the number of beats between pauses fall within reference beat range A. If step S23 determines that the number of beats between pauses would not fall within reference beat range A, step S24 is skipped and the process returns to step S22. In this case, no pause is additionally set.

【０１６０】一方、ステップＳ２３において、ポーズ間
の拍数が、基準拍数範囲Ａ内に収まると判定された場
合、ステップＳ２４に進み、注目形態素の位置にポーズ
が設定され、ステップＳ２２に戻る。On the other hand, if step S23 determines that the number of beats between pauses would fall within reference beat range A, the process proceeds to step S24, where a pause is set at the position of the morpheme of interest, and then returns to step S22.

【０１６１】なお、ステップＳ２２乃至Ｓ２４の処理
は、まず、入力文の文頭から文末方向へ、優先度３の優
先度フラグがたてられた形態素を対象として行われ、そ
の後、再び入力文の文頭から文末方向へ、優先度４の優
先度フラグがたてられた形態素を対象として行われる。Steps S22 to S24 are performed first on the morphemes flagged with priority 3, scanning from the beginning of the input sentence toward its end, and then on the morphemes flagged with priority 4, again scanning from the beginning of the input sentence toward its end.

【０１６２】ここで、図９に示した場合においては、ま
ず「昨夜」、「られ」、および「た」の直後にポーズが
設定される（ステップＳ２１）。次に、「は」にたてら
れた優先度フラグの優先度が３であるから、そこにポー
ズを設定した場合に、ポーズ間の拍数が、基準拍数範囲
Ａ内に収まるか否かが判定される。「は」の位置にポー
ズを設定した場合、図９の「拍」の欄から、既にポーズ
が設定されている「昨夜」の直後から「は」までの拍数
は４拍であり、また、「は」の直後から、既にポーズが
設定されている「られ」までの拍数は１０拍であるか
ら、いずれも基準拍数範囲Ａ（上述したように、１１拍
以上２６拍未満）に収まらず、従って「は」の位置には
ポーズは設定されない。In the case shown in FIG. 9, pauses are first set immediately after 昨夜 ("last night"), られ ("rare"), and た ("ta") (step S21). Next, because the priority flag set on は ("wa") has priority 3, it is determined whether setting a pause there would make the numbers of beats between pauses fall within reference beat range A. According to the 拍 ("beats") column of FIG. 9, if a pause were set at は, there would be 4 beats from immediately after 昨夜, where a pause is already set, through は, and 10 beats from immediately after は through られ, where a pause is already set. Neither value falls within reference beat range A (11 beats or more and less than 26 beats, as described above), so no pause is set at は.

【０１６３】なお、仮に、「は」の位置にポーズを設定
した結果、その前後のポーズ間の拍数が、基準拍数範囲
Ａ内に収まる場合には、その位置にポーズが設定される
（ステップＳ２４）。また、このようにしてポーズが追
加設定された場合、次のポーズを設定するかどうかは
（ステップＳ２２乃至Ｓ２４の処理は）、その追加設定
されたポーズを加味して行われる。If setting a pause at は did bring the numbers of beats between it and the pauses before and after it within reference beat range A, a pause would be set at that position (step S24). When a pause is additionally set in this way, the decision on whether to set the next pause (steps S22 to S24) takes the added pause into account.
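Steps S21 to S24 can be sketched in Python as follows. This is a minimal sketch under stated assumptions rather than the embodiment's table-driven implementation: the `Morpheme` type and the `first_pass` function are hypothetical names, the sentence start and end are treated as boundaries when no pause lies on that side, and a candidate is accepted only when the stretches both before and after it fall within reference beat range A (the text says that neither does in the FIG. 9 example, but does not spell out the mixed case). The placeholder morpheme and the priority of られ are also assumed.

```python
from itertools import accumulate
from typing import List, NamedTuple, Optional, Tuple


class Morpheme(NamedTuple):
    surface: str
    beats: int
    priority: Optional[int] = None  # pause priority flag, None if unflagged


def first_pass(morphemes: List[Morpheme],
               range_a: Tuple[int, int] = (11, 26)) -> List[int]:
    """Return indices of morphemes after which a pause is set (S21 to S24)."""
    ends = list(accumulate(m.beats for m in morphemes))  # beat offset at each morpheme end
    total = ends[-1] if ends else 0
    low, high = range_a  # range A: low <= beats < high

    # S21: pauses go unconditionally after priority 1 and 2 morphemes.
    pauses = {i for i, m in enumerate(morphemes) if m.priority in (1, 2)}

    # S22 to S24: scan priority 3 from the sentence head, then priority 4.
    for level in (3, 4):
        for i, m in enumerate(morphemes):
            if m.priority != level:
                continue
            # Nearest pauses already set on each side (earlier additions included).
            left = max((ends[p] for p in pauses if p < i), default=0)
            right = min((ends[p] for p in pauses if p > i), default=total)
            before, after = ends[i] - left, right - ends[i]
            if low <= before < high and low <= after < high:  # S23
                pauses.add(i)                                 # S24
    return sorted(pauses)


sentence_head = [
    Morpheme("昨夜", 3, priority=2),
    Morpheme("男", 3),
    Morpheme("は", 1, priority=3),
    Morpheme("店員", 4),
    Morpheme("(placeholder)", 4),     # stands in for morphemes not listed in the text
    Morpheme("られ", 2, priority=1),  # priority value assumed for illustration
]

print(first_pass(sentence_head))  # [0, 5]: は is rejected (4 and 10 beats)
```

Because `pauses` is updated inside the loop, each later candidate is measured against pauses added earlier in the same scan, which corresponds to the statement that an added pause is taken into account for the next decision.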

【０１６４】さらに、ポーズ間の拍数が基準拍数範囲Ａ
内に収まるかどうかでポーズの設定を行うようにしてい
るのは、アナウンサの朗読時におけるポーズ間の拍数
が、極端に少なかったり、また多くなったりしないこと
に対応する。また、本実施例では、新聞等の比較的長文
の文章を、アナウンサに朗読させて得たモデル（朗読モ
デル）を規則化（ポーズ絞り込み規則化）したため、基
準拍数範囲Ａを上述したように１１拍以上２６拍未満と
したが、入力文とする文章が短文である場合には、基準
拍数範囲Ａ（基準拍数Ｂも同様）を、例えば３モーラ以
上１１モーラ未満などと小さな範囲に設定することが可
能である。さらに、入力文における１文の拍数に対応し
て、基準拍数範囲Ａ（基準拍数Ｂも同様）を変更（動的
に変更）するようにすることも可能である。Deciding whether to set a pause according to whether the number of beats between pauses falls within reference beat range A reflects the fact that, when an announcer reads aloud, the number of beats between pauses is neither extremely small nor extremely large. In this embodiment, the rules (pause narrowing rules) were derived from a model (reading model) obtained by having an announcer read relatively long texts such as newspaper articles, so reference beat range A was set to 11 beats or more and less than 26 beats, as described above. When the input consists of short sentences, reference beat range A (and likewise reference beat count B) can be set to a smaller range, for example 3 morae or more and less than 11 morae. Reference beat range A (and likewise reference beat count B) can also be changed dynamically according to the number of beats in each sentence of the input.

【０１６５】一方、ステップＳ２２において、未判定の
優先度３または４の優先度フラグがたてられた形態素が
存在しないと判定された場合、図１４のステップＳ３１
に進み、現時点で設定済みのポーズ間における拍数が、
基準拍数Ｂ（上述したように、４６拍）以上である部分
が存在するか否かが判定される。On the other hand, if step S22 determines that no undetermined morpheme flagged with priority 3 or 4 remains, the process proceeds to step S31 of FIG. 14, which determines whether any interval between the pauses set so far contains reference beat count B (46 beats, as described above) or more.

【０１６６】ここで、基準拍数Ｂは、アナウンサが一回
の呼吸で（一呼吸で）朗読可能で、かつ聞き手が聴きと
りにくくない限界と想定される拍数に対応する。ステッ
プＳ３１以下では、基準拍数Ｂを超える拍数のポーズ間
が存在する場合は、アナウンサの朗読結果を統計処理し
て得られた結果（ポーズ絞り込み規則）に基づいて、原
則的には、より優先度の高い優先度フラグがたてられた
位置からポーズを設定（追加設定）し、ポーズ間の拍数
が基準拍数Ｂ以上となっている部分が存在しなくなった
時点で、ポーズの設定（追加設定）（再設定）を終了す
る。Reference beat count B corresponds to the number of beats assumed to be the limit that an announcer can read aloud in a single breath without making the speech hard for the listener to follow. From step S31 onward, when an interval between pauses exceeds reference beat count B, pauses are additionally set, in principle starting from positions flagged with higher priority, based on the results of statistically processing the announcer's readings (the pause narrowing rules). The additional setting ends once no interval between pauses contains reference beat count B or more.

【０１６７】基準拍数範囲Ａおよび基準拍数Ｂは、いず
れも、アナウンサが、聴き手に対し朗読を不自然と感じ
させないための基準である点で共通している。しかしな
がら、基準拍数範囲Ａは、アナウンサが、可能ならば
（意味的に不自然でなければ）、ポーズをおくことが望
ましいとする、ポーズのおかれる優先順位の高い範囲で
あるのに対し、基準拍数Ｂは、アナウンサが一呼吸で朗
読するのが困難で（アナウンサの身体的制約により拘束
され）、また聴き手が聞き取りにくくなるという理由か
ら決まる拍数である点で相違する。従って、ポーズ間の
拍数が基準拍数Ｂ以上である部分は、可能ならば（アナ
ウンサが一呼吸で朗読することができ、かつ聴き手が聞
き取りにくくなければ）ポーズをおかない方が望ましい
部分、即ちポーズのおかれる優先順位としては低い部分
である。Reference beat range A and reference beat count B are alike in that both are criteria for keeping the announcer's reading from sounding unnatural to the listener. They differ, however, in that reference beat range A is a range in which the announcer would prefer to place a pause if possible (that is, unless doing so is semantically unnatural), and thus a range where pauses have high priority, whereas reference beat count B is a count determined by the fact that longer intervals are difficult for the announcer to read in one breath (a physical constraint on the announcer) and hard for the listener to follow. Accordingly, an interval in which the number of beats between pauses is reference beat count B or more is one where it would be preferable not to place a pause if that were possible (that is, if the announcer could read it in one breath and the listener could still follow it), in other words, one where pauses have low priority.

【０１６８】ステップＳ３１において、現時点で設定済
みのポーズ間における拍数が、基準拍数Ｂ以上である部
分が存在しないと判定された場合、処理を終了する。If step S31 determines that no interval between the pauses set so far contains reference beat count B or more, the processing ends.

【０１６９】ここで、図９に示した場合においては、ス
テップＳ３１の処理前までにポーズが設定された位置
は、「昨夜」、「られ」、および「た」の直後である。
文頭から「昨夜」まで、「昨夜」から「られ」まで、
「られ」から「た」まで、または「た」から文末までの
拍数は、それぞれ３、１４、２５、または０拍であるか
ら、基準拍数Ｂ以上である部分は存在せず、処理を終了
する。従って、図９に示した場合には、「昨夜」、「ら
れ」、および「た」の直後が最終的なポーズの設定位置
となる。In the case shown in FIG. 9, the pauses set before step S31 are those immediately after 昨夜 ("last night"), られ ("rare"), and た ("ta"). The numbers of beats from the beginning of the sentence to 昨夜, from 昨夜 to られ, from られ to た, and from た to the end of the sentence are 3, 14, 25, and 0, respectively, so no interval contains reference beat count B or more and the processing ends. In the case of FIG. 9, therefore, the final pause positions are immediately after 昨夜, られ, and た.

【０１７０】一方、ステップＳ３１において、現時点で
設定済みのポーズ間における拍数が基準拍数Ｂ以上であ
る部分が存在すると判定された場合、ステップＳ３２に
進み、そのポーズ間（以下、適宜、注目ポーズ間とい
う）に、優先度３のフラグがたてられた係助詞「は」で
あって、それに続く形態素の品詞情報が「用言」以外の
「自立語」（非用言自立語）であるものが存在するか否
かが判定される。On the other hand, if step S31 determines that some interval between the pauses set so far contains reference beat count B or more, the process proceeds to step S32, which determines whether that interval (hereinafter referred to as the pause interval of interest where appropriate) contains a binding particle は ("wa") flagged with priority 3 whose following morpheme has part-of-speech information of an independent word other than a declinable word (a non-declinable independent word).

【０１７１】ステップＳ３２において、注目ポーズ間
に、優先度３のフラグがたてられた係助詞「は」であっ
て、それに続く形態素の品詞情報が非用言自立語である
ものが存在すると判定された場合、ステップＳ３３に進
み、その係助詞「は」（の直後）にポーズが設定され、
ステップＳ３１に戻る。If step S32 determines that the pause interval of interest contains such a binding particle は ("wa"), that is, one flagged with priority 3 and followed by a morpheme whose part-of-speech information is a non-declinable independent word, the process proceeds to step S33, where a pause is set immediately after that particle, and then returns to step S31.

【０１７２】ここで、図９に示した場合において、仮
に、「昨夜」の直後から「られ」の直後までの拍数が基
準拍数Ｂ以上であれば、優先度３のフラグがたてられた
係助詞「は」（この「は」とそれに続く「店員」は、図
８の13001番の条件を満たすものである）は、ステップ
Ｓ３２の条件を満たすので、ステップＳ３３において、
その位置にポーズが設定されることになる。In the case shown in FIG. 9, if the number of beats from immediately after 昨夜 ("last night") to immediately after られ ("rare") were reference beat count B or more, the binding particle は ("wa") flagged with priority 3 would satisfy the condition of step S32 (this は and the following 店員 ("clerk") satisfy condition No. 13001 of FIG. 8), so a pause would be set at that position in step S33.

【０１７３】一方、注目ポーズ間に、優先度３のフラグ
がたてられた係助詞「は」であって、それに続く形態素
の品詞情報が非用言自立語であるものが存在しないと判
定された場合、ステップＳ３４に進み、注目ポーズ間
に、優先度５の優先度フラグたてられた形態素が存在す
るか否かが判定される。ステップＳ３４において、注目
ポーズ間に、優先度５の優先度フラグたてられた形態素
が存在すると判定された場合、ステップＳ３５に進み、
その形態素の直後にポーズが設定され、ステップＳ３１
に戻る。なお、注目ポーズ間に、優先度５の優先度フラ
グたてられた形態素が複数存在する場合には、そのうち
の、注目ポーズ間の中心（真ん中）に最も近い形態素の
位置にポーズが設定される。On the other hand, if it is determined that the pause interval of interest contains no binding particle は ("wa") flagged with priority 3 and followed by a non-declinable independent word, the process proceeds to step S34, which determines whether the interval contains a morpheme flagged with priority 5. If step S34 finds such a morpheme, the process proceeds to step S35, where a pause is set immediately after it, and then returns to step S31. When the interval contains several morphemes flagged with priority 5, the pause is set at the one closest to the center (middle) of the interval.

【０１７４】また、ステップＳ３５において、注目ポー
ズ間に、優先度５の優先度フラグたてられた形態素が存
在しないと判定された場合、ステップＳ３６に進み、注
目ポーズ間に、優先度６の優先度フラグたてられた形態
素が存在するか否かが判定される。ステップＳ３６にお
いて、注目ポーズ間に、優先度６の優先度フラグたてら
れた形態素が存在すると判定された場合、ステップＳ３
７に進み、その形態素の直後にポーズが設定され、ステ
ップＳ３１に戻る。なお、注目ポーズ間に、優先度６の
優先度フラグたてられた形態素が複数存在する場合に
は、上述したステップＳ３５における場合と同様に、そ
のうちの、注目ポーズ間の中心（真ん中）に最も近い形
態素の位置にポーズが設定される。If step S34 determines that the pause interval of interest contains no morpheme flagged with priority 5, the process proceeds to step S36, which determines whether the interval contains a morpheme flagged with priority 6. If step S36 finds such a morpheme, the process proceeds to step S37, where a pause is set immediately after it, and then returns to step S31. When the interval contains several morphemes flagged with priority 6, the pause is set, as in step S35, at the one closest to the center (middle) of the interval.

【０１７５】一方、ステップＳ３６において、注目ポー
ズ間に、優先度６の優先度フラグたてられた形態素が存
在しないと判定された場合、処理を終了する。On the other hand, if step S36 determines that the pause interval of interest contains no morpheme flagged with priority 6, the processing ends.

【０１７６】なお、ステップＳ３６で、注目ポーズ間
に、優先度６の優先度フラグたてられた形態素が存在し
ないと判定されて処理を終了する場合というのは、意味
の切れ目として不自然でない位置が存在せず、どうして
も一呼吸で朗読すべき場合に対応する。The case in which step S36 finds no morpheme flagged with priority 6 in the pause interval of interest and the processing ends corresponds to the case in which no position would be natural as a break in meaning, so that the interval must be read in a single breath.
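The loop of steps S31 to S37 can be sketched in Python as follows. This is a hedged sketch, not the embodiment's implementation: the names `Morpheme`, `nondeclinable_independent`, and `second_pass` are hypothetical, the over-long interval nearest the sentence head is examined first, the first qualifying は in the interval is chosen when there are several, and closeness to the center of the interval is measured in beats. None of these three choices is specified in the text.

```python
from itertools import accumulate
from typing import Iterable, List, NamedTuple, Optional


class Morpheme(NamedTuple):
    surface: str
    beats: int
    priority: Optional[int] = None           # pause priority flag
    nondeclinable_independent: bool = False   # hypothetical part-of-speech marker


def second_pass(morphemes: List[Morpheme], pauses: Iterable[int],
                limit_b: int = 46) -> List[int]:
    """Add pauses inside intervals of limit_b beats or more (S31 to S37)."""
    ends = list(accumulate(m.beats for m in morphemes))
    pauses = set(pauses)
    last = len(morphemes) - 1

    def offset(i: int) -> int:
        return ends[i] if i >= 0 else 0  # index -1 denotes the sentence start

    while True:
        # S31: look for an interval between set pauses with >= limit_b beats.
        bounds = [-1] + sorted(pauses | {last})
        span = next(((a, b) for a, b in zip(bounds, bounds[1:])
                     if offset(b) - offset(a) >= limit_b), None)
        if span is None:
            return sorted(pauses)
        a, b = span
        inside = range(a + 1, b)  # candidate morphemes strictly inside the interval

        # S32/S33: priority-3 は followed by a non-declinable independent word.
        wa = [i for i in inside
              if morphemes[i].priority == 3 and morphemes[i].surface == "は"
              and morphemes[i + 1].nondeclinable_independent]
        if wa:
            pauses.add(wa[0])
            continue

        # S34 to S37: priority 5, then priority 6, nearest the interval center.
        center = (offset(a) + offset(b)) / 2
        for level in (5, 6):
            candidates = [i for i in inside if morphemes[i].priority == level]
            if candidates:
                pauses.add(min(candidates, key=lambda i: abs(ends[i] - center)))
                break
        else:
            return sorted(pauses)  # no natural break: read in one breath
```

For instance, with hypothetical morphemes of 20, 1 (は, priority 3), 20 (a non-declinable independent word), 10, and 10 beats and no pauses set beforehand, the single 61-beat interval is split after は, leaving intervals of 21 and 40 beats, and the loop stops. Every pass through the loop either adds a pause strictly inside an interval or returns, so the loop terminates.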

【０１７７】また、図１４においては、優先度４の優先
度フラグがたてられた形態素についての判定処理が抜け
ているが、これは、図８に示したポーズ優先度規則を用
いてシミュレーションを行ったところ、優先度４の優先
度フラグがたてられた形態素の判定処理を行っても、最
終的なポーズの設定位置にほとんど影響がなかったこと
による。従って、優先度４の優先度フラグがたてられた
形態素についての判定処理は、特に行わない方が良いと
いうことではない。Note that FIG. 14 contains no determination step for morphemes flagged with priority 4. This is because simulations using the pause priority rules of FIG. 8 showed that performing such a determination had almost no effect on the final pause positions. It does not mean that the determination for priority-4 morphemes is better left out.

【０１７８】こうして設定されたポーズ（ポーズ情報）
をもとに、図１の発音記号生成部７において音韻韻律情
報が生成され、さらに韻律処理部８と音響処理部１１を
経て、ポーズの挿入された合成音声が出力される。Based on the pauses set in this way (the pause information), the phonetic symbol generation unit 7 of FIG. 1 generates phonemic and prosodic information, which then passes through the prosody processing unit 8 and the acoustic processing unit 11, and synthesized speech with the pauses inserted is output.

【０１７９】以上のように、複合語規則、文節規則、ま
たは連文節規則は、形態素どうしを結合させる結合条件
と、その結合条件を満足する形態素どうしを結合して得
られる複合語、文節、または連文節に付加するフラグ情
報とを含み、ポーズ設定規則は、必要に応じて、フラグ
情報を用いて記述されているので、形態素を結合するこ
とにより、その結合した形態素全体に対し、フラグ情報
を与えることができ、さらにそのフラグ情報を用いてポ
ーズ設定規則および結合条件が記述されているので、結
合する形態素の組み合わせに応じて、ポーズを設定する
ことが可能となる。即ち、入力文を特に制限することな
く、その文中の適切な位置にポーズを挿入することが可
能となる。As described above, the compound word rules, bunsetsu rules, and consecutive-bunsetsu rules each include a combining condition for combining morphemes and flag information to be attached to the compound word, bunsetsu, or consecutive bunsetsu obtained by combining morphemes that satisfy the condition, and the pause setting rules are described, as necessary, using that flag information. Combining morphemes therefore makes it possible to attach flag information to the combined morphemes as a whole, and because the pause setting rules and combining conditions are described using that flag information, pauses can be set according to the combination of morphemes that are combined. In other words, pauses can be inserted at appropriate positions in an input sentence without placing any particular restriction on the input.

【０１８０】また、入力文が形態素解析され、その形態
素解析結果から、複合語、文節、連用修飾連文節、連体
修飾連文節が同定されるので、処理の各段階で形成され
る単位（同定結果）を利用して、ポーズ設定処理と併せ
て、その他の様々な処理も行うことが可能となる。即
ち、例えば同定された複合語単位で、アクセントを付与
する処理を行ったり、また同定された文節単位でアクセ
ントやピッチなどを付与する処理を行うことが可能とな
る。Furthermore, the input sentence is morphologically analyzed, and compound words, bunsetsu, continuative-modifier consecutive bunsetsu, and adnominal-modifier consecutive bunsetsu are identified from the analysis result, so the units formed at each stage of processing (the identification results) can be used for various other processing alongside the pause setting. For example, accents can be assigned per identified compound word, and accents, pitch, and the like can be assigned per identified bunsetsu.

【０１８１】また、形態素解析の結果出力される品詞情
報は、いわゆる品詞だけでなく、それを統合した上位分
類や、あるいは細分化した下位分類を含むので、結合規
則およびポーズ設定規則の記述が容易になるとともに、
例えば形態素の品詞のみに基づいてポーズの設定を行う
場合に比較して、例外的な場合にも対処可能な精度の高
いポーズの設定を行う（より自然な位置にポーズの設定
を行う）ことが可能となる。The part-of-speech information output by the morphological analysis includes not only the part of speech in the usual sense but also higher-level classifications that group parts of speech and lower-level classifications that subdivide them. This makes the combining rules and pause setting rules easier to write and, compared with setting pauses based only on the parts of speech of morphemes, allows pauses to be set with high accuracy (at more natural positions), even in exceptional cases.

【０１８２】さらに、結合規則およびポーズ設定規則
を、形態素解析の結果得られる形態素文字列、品詞情報
などを用いて記述するようにしたので、それ以外の情報
の付与や入力が不要であり、従って簡単に処理系を構成
するモジュールを構築することができ、また汎用的な日
本語テキストを処理対象とすることができる。Moreover, the combining rules and pause setting rules are described using the morpheme character strings, part-of-speech information, and other items obtained from the morphological analysis, so no other information needs to be supplied or entered. The modules making up the processing system can therefore be built easily, and general Japanese text can be processed.

【０１８３】また、連文節間に、ポーズが出現する頻度
を統計的に求めて作成したポーズ優先度規則と、実際の
人間の朗読結果から作成したポーズ絞り込み規則とに基
づいて、最終的なポーズの設定位置を決定するようにし
たので、自然な位置にポーズを設定することができる。In addition, the final pause positions are determined based on the pause priority rules, created by statistically determining how often pauses occur between consecutive bunsetsu, and the pause narrowing rules, created from actual human readings, so pauses can be set at natural positions.

【０１８４】また、結合規則およびポーズ設定規則は、
各処理を行うモジュールにプログラムとして記述するの
ではなく、いわばテーブル形式で、プログラムとは別に
用意するようにしたので、その適用順序や増補改訂を容
易に行うことができる。The combining rules and pause setting rules are not written as program code in the modules that perform each process but are provided separately from the program, in what may be called table form, so their order of application can be changed and the rules can be extended or revised easily.

【０１８５】さらに、上述したように基準拍数範囲Ａお
よび基準拍数Ｂは変更可能となされているので、処理対
象となる入力文に応じた自然で理解のしやすい合成音を
得ることが可能となる。Furthermore, since reference beat range A and reference beat count B can be changed as described above, natural, easy-to-understand synthesized speech suited to the input sentences being processed can be obtained.

【０１８６】以上、本発明を音声合成装置に適用した場
合について説明したが、本発明は、音声合成装置の他、
例えば日本語の入力文を、他の言語に翻訳し、または逆
に他の言語を日本語に翻訳して音声で出力する音声翻訳
装置などに適用することが可能である。日本語のポーズ
の挿入される位置は日本語構文の意味的な切れ目と一致
するので、構文解析精度が向上し、他の言語への変換精
度を向上させることが可能となり、あるいはまた、日本
語に翻訳して音声で出力する際に、自然で聴きとりやす
い音声（合成音）を出力することが可能となる。The present invention has been described above as applied to a speech synthesizer, but it can also be applied, for example, to a speech translation apparatus that translates Japanese input into another language or, conversely, translates another language into Japanese and outputs it as speech. Because the positions where pauses are inserted in Japanese coincide with semantic breaks in Japanese syntax, parsing accuracy improves and the accuracy of conversion into other languages can be improved; likewise, natural, easy-to-hear speech (synthesized speech) can be output when translating into Japanese and outputting speech.

【０１８７】また、本発明は、例えば音声認識装置や、
音声入力が可能なワードプロセッサ、文書作成装置など
に適用することができる。音声認識装置に適用した場合
には、ポーズが挿入される前後の発話を認識対象をして
絞り込むことができ、またポーズ前後の形態素の組合せ
パターンは規則化されている（ポーズ設定規則に記述さ
れている）ので、認識精度を向上させることができる。The present invention can also be applied to, for example, speech recognition apparatuses, word processors capable of speech input, and document creation apparatuses. When applied to a speech recognition apparatus, the recognition targets can be narrowed down to the utterances before and after each pause, and because the combination patterns of morphemes before and after a pause are codified (described in the pause setting rules), recognition accuracy can be improved.

【０１８８】また、ワードプロセッサや文書作成装置に
適用した場合には、音声で入力された文章などの「仮名
漢字変換」を行うにあたって、ポーズにまたがる部分の
漢字変換処理を行なわないようにすることができるの
で、変換効率を向上させることができる。When applied to a word processor or document creation apparatus, kana-kanji conversion of text entered by speech can skip conversion across a pause, so conversion efficiency can be improved.

【０１８９】さらに、本実施例では、結合規則に、結合
した形態素を分割する規則（分割規則）（複合語分割規
則、文節分割規則、連用修飾分割規則、連体修飾分割規
則）を含めるようにしたが、この分割規則は、必ずしも
結合規則に記述しておく必要はない。In this embodiment, the combining rules include rules for dividing combined morphemes (division rules: compound word division rules, bunsetsu division rules, continuative-modifier division rules, and adnominal-modifier division rules), but these division rules need not necessarily be described in the combining rules.

【０１９０】また、本実施例においては、結合規則およ
びポーズ設定規則を、形態素解析結果のうちの形態素文
字列および品詞情報を用いて記述するようにしたが、そ
の他、例えばアクセントや拍数、発音などの形態素解析
結果を用いて記述するようにすることも可能である。In this embodiment, the combining rules and pause setting rules are described using the morpheme character strings and part-of-speech information from the morphological analysis result, but they may also be described using other items of that result, such as accent, beat count, and pronunciation.

【０１９１】さらに、本実施例では、文法的な情報のみ
を、結合規則およびポーズ設定規則（ポーズ優先度規
則）に記述するようにしたが、意味的な情報も併せて記
述するようにしても良い。In this embodiment, only grammatical information is described in the combining rules and pause setting rules (pause priority rules), but semantic information may be described as well.

【０１９２】また、本実施例においては、句読点を、結
合規則およびポーズ設定規則の適用対象とせず無視する
ようにしたが、結合規則およびポーズ設定規則の適用対
象とすることも可能である。即ち、本実施例では、例え
ば図５に示したような形態素解析結果が得られた場合に
おいて、例えば「丸の内」の前にある句点「、」を無視
し、「られ」と「丸の内」との間に、結合規則（ポーズ
設定規則）を適用するようにしたが、「、」と「丸の
内」との間にも、結合規則（ポーズ設定規則）を適用す
るようにすることが可能である。In this embodiment, punctuation marks are ignored rather than subjected to the combining rules and pause setting rules, but they may also be subjected to those rules. That is, when a morphological analysis result such as that of FIG. 5 is obtained, this embodiment ignores the 「、」 preceding 丸の内 ("Marunouchi") and applies the combining rules (pause setting rules) between られ ("rare") and 丸の内; however, the rules can also be applied between 「、」 and 丸の内.

【０１９３】[0193]

【発明の効果】以上の如く、本発明の自然言語処理方法
によれば、形態素を結合することにより、その結合した
形態素全体に対し、付加情報を与えることができ、さら
にその付加情報を用いてポーズ設定規則および結合条件
が記述されているので、結合する形態素の組み合わせに
応じて、ポーズを設定することが可能となる。即ち、入
力文を特に制限することなく、その文中の適切な位置に
ポーズを挿入することが可能となる。As described above, according to the natural language processing method of the present invention, combining morphemes makes it possible to attach additional information to the combined morphemes as a whole, and because the pause setting rules and combining conditions are described using that additional information, pauses can be set according to the combination of morphemes that are combined. In other words, pauses can be inserted at appropriate positions in an input sentence without placing any particular restriction on the input.

【０１９４】また、本発明の音声合成装置によれば、入
力文を特に制限することなく、自然で理解のし易い合成
音を得ることができる。Further, according to the speech synthesizer of the present invention, a natural and easy-to-understand synthesized speech can be obtained without particularly limiting the input sentence.

[Brief description of drawings]

【図１】本発明を適用した音声合成装置の一実施例の構
成を示すブロック図である。FIG. 1 is a block diagram showing the configuration of an embodiment of a speech synthesizer to which the present invention is applied.

【図２】品詞情報辞書を示す図である。FIG. 2 is a diagram showing a part-of-speech information dictionary.

【図３】品詞情報辞書を示す図（図２に続く図）であ
る。FIG. 3 is a diagram showing a part-of-speech information dictionary (a diagram following FIG. 2).

【図４】品詞情報辞書を示す図（図３に続く図）であ
る。FIG. 4 is a diagram showing a part-of-speech information dictionary (a diagram following FIG. 3).

【図５】形態素解析結果を示す図である。FIG. 5 is a diagram showing a result of morphological analysis.

【図６】結合規則を示す図である。FIG. 6 is a diagram showing a combination rule.

【図７】結合規則を示す図（図６に続く図）である。FIG. 7 is a diagram showing a joining rule (a diagram following FIG. 6).

【図８】ポーズ優先度規則を示す図である。FIG. 8 is a diagram showing a pause priority rule.

【図９】結合処理およびポーズ優先度処理結果を示す図
である。FIG. 9 is a diagram showing a result of a combination process and a pause priority process.

【図１０】図１の結合処理部５およびポーズ設定処理部
６の動作を説明するフローチャートである。FIG. 10 is a flowchart illustrating the operation of the combination processing unit 5 and the pause setting processing unit 6 of FIG. 1.

【図１１】動的分類方法を説明するための図である。FIG. 11 is a diagram for explaining a dynamic classification method.

【図１２】動的分類方法を説明するための図である。FIG. 12 is a diagram for explaining a dynamic classification method.

【図１３】図１０のステップＳ１１の処理のより詳細を
説明するフローチャートである。FIG. 13 is a flowchart illustrating the processing of step S11 of FIG. 10 in more detail.

【図１４】図１０のステップＳ１１の処理のより詳細を
説明するフローチャート（図１３のフローチャートに続
くフローチャート）である。FIG. 14 is a flowchart illustrating the processing of step S11 of FIG. 10 in more detail (a continuation of the flowchart of FIG. 13).

[Explanation of symbols]

1 演算装置 (arithmetic unit)
2 メモリ装置 (memory device)
3 言語処理部 (language processing unit)
4 形態素解析部 (morphological analysis unit)
5 結合処理部 (combination processing unit)
6 ポーズ設定処理部 (pause setting processing unit)
7 発音記号生成部 (phonetic symbol generation unit)
8 韻律処理部 (prosody processing unit)
9 韻律制御モデル用パラメータ生成部 (prosody control model parameter generation unit)
10 韻律データ生成部 (prosody data generation unit)
11 音響処理部 (acoustic processing unit)

Claims

(57) [Claims]

1. A natural language processing method comprising: morphologically analyzing a Japanese input sentence; identifying compound words from the morphological analysis result of the input sentence based on a compound word rule that defines the grammatical relationship between morphemes forming a compound word; identifying bunsetsu from the compound word identification result based on a bunsetsu rule that defines the grammatical relationship between morphemes forming a bunsetsu; identifying consecutive bunsetsu from the bunsetsu identification result based on a consecutive-bunsetsu rule that defines the grammatical relationship between morphemes forming consecutive bunsetsu; and applying, to the consecutive bunsetsu thus obtained, each composed of one or more of the morphemes forming the input sentence, a statistically derived pause setting rule that defines positions of pauses to be inserted in a sentence, thereby setting the positions of pauses to be inserted between the consecutive bunsetsu, wherein the compound word rule, the bunsetsu rule, or the consecutive-bunsetsu rule includes a combining condition for combining morphemes and additional information to be added to the compound word, bunsetsu, or consecutive bunsetsu obtained by combining morphemes that satisfy the combining condition, and the pause setting rule and the combining condition are described, as necessary, using the additional information.

2. The natural language processing method according to claim 1, wherein the consecutive bunsetsu rule includes an adnominal modification rule that defines the relationship between the morphemes forming an adnominally modifying consecutive bunsetsu, and an adverbial modification rule that defines the relationship between the morphemes forming an adverbially modifying consecutive bunsetsu, and an adnominally modifying consecutive bunsetsu or an adverbially modifying consecutive bunsetsu is identified from the bunsetsu identification result based on the adnominal modification rule or the adverbial modification rule.

3. The natural language processing method according to claim 1 or 2, wherein the morphological analysis result includes at least a morpheme character string and part-of-speech information, and the compound word rule, bunsetsu rule, consecutive bunsetsu rule, or pause setting rule is described using the morphological analysis result.

4. The natural language processing method according to claim 3, wherein the part-of-speech information includes, in addition to the part-of-speech, a higher-level classification that integrates the part-of-speech and a lower-level classification that subdivides the part-of-speech.

5. The natural language processing method according to claim 4, wherein the lower-level classification includes at least an inflectional form.

6. The natural language processing method according to claim 1, wherein the additional information is a higher-level classification that integrates the parts of speech of morphemes or a lower-level classification that subdivides the part of speech of a morpheme.

7. The natural language processing method according to any one of claims 1 to 6, wherein the compound word rule, bunsetsu rule, or consecutive bunsetsu rule further describes a separation condition for separating two combined morphemes, and the combined morphemes are separated according to the separation condition.

8. The natural language processing method according to any one of claims 1 to 7, wherein, when the compound word rule is applied between a compound word consisting of a plurality of the morphemes and an adjacent morpheme, which is a morpheme adjacent to that compound word, the compound word rule is applied between the adjacent morpheme and the morpheme, among the plurality of morphemes forming the compound word, that adjoins the adjacent morpheme.

9. The natural language processing method according to any one of claims 1 to 7, wherein, when the bunsetsu rule is applied between a bunsetsu consisting of a plurality of the morphemes and an adjacent morpheme, which is a morpheme adjacent to that bunsetsu, the bunsetsu rule is applied between the adjacent morpheme and the morpheme, among the plurality of morphemes forming the bunsetsu, that adjoins the adjacent morpheme.

10. The natural language processing method according to any one of claims 1 to 7, wherein, when the consecutive bunsetsu rule or the pause setting rule is applied between a consecutive bunsetsu consisting of a plurality of the morphemes and an adjacent morpheme, which is a morpheme adjacent to that consecutive bunsetsu, the consecutive bunsetsu rule or the pause setting rule is applied between the adjacent morpheme and the morpheme, among the plurality of morphemes forming the consecutive bunsetsu, that adjoins the adjacent morpheme.

11. The natural language processing method according to any one of claims 1 to 10, wherein the pause setting rule is created by statistically obtaining, from the results of reading actual sentences aloud, the frequency with which a pause is inserted, in two adjacent bunsetsu, between the last of the morphemes forming the preceding bunsetsu and the first of the morphemes forming the following bunsetsu.

12. The natural language processing method according to claim 11, wherein the pause setting rule includes a priority rule, in which a setting condition for setting a pause between the last morpheme and the first morpheme is described together with a priority representing the order of precedence for setting a pause between them, and a narrowing-down rule, in which a narrowing-down condition for narrowing down the positions at which pauses are set is described; candidate positions for setting pauses are determined based on the priority rule; and the final positions for setting pauses are determined from among the candidates based on the narrowing-down rule.

13. A speech synthesizer comprising: generation means for obtaining phonological information and prosodic information corresponding to an input sentence in Japanese by performing natural language processing on the input sentence; and synthesis means for synthesizing speech corresponding to the input sentence based on the phonological information and the prosodic information, wherein the prosodic information includes the positions of pauses to be inserted into the input sentence, and the generation means obtains the positions of the pauses by the natural language processing method according to any one of 2.
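
For illustration only, the flow recited in claims 1 and 8 to 12 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the implementation disclosed in this specification: the names `Unit`, `CombineRule`, `apply_rules`, and `set_pauses`, the part-of-speech labels, the priority values, and the use of a simple upper limit on the number of pauses as the narrowing-down condition are all hypothetical. In the claimed method the priorities would be derived from pause insertion frequencies measured on read-aloud text.

```python
from dataclasses import dataclass

@dataclass
class Unit:
    """A morpheme, or a unit obtained by combining morphemes."""
    morphemes: list   # list of (surface string, part of speech) tuples
    info: str = ""    # additional information attached by a rule (hypothetical field)

@dataclass
class CombineRule:
    left_pos: str     # condition on the last morpheme of the left unit
    right_pos: str    # condition on the first morpheme of the right unit
    info: str         # additional information given to the combined unit

def apply_rules(units, rules):
    """Merge adjacent units left to right. A rule is tested only on the two
    morphemes that face each other across the boundary (cf. claims 8 to 10)."""
    out = []
    for unit in units:
        if out:
            prev = out[-1]
            for rule in rules:
                if (prev.morphemes[-1][1] == rule.left_pos
                        and unit.morphemes[0][1] == rule.right_pos):
                    out[-1] = Unit(prev.morphemes + unit.morphemes, rule.info)
                    break
            else:
                out.append(unit)   # no rule matched: keep the unit separate
        else:
            out.append(unit)
    return out

def set_pauses(units, priority_rules, max_pauses):
    """Return boundary indices i (pause between units[i] and units[i + 1]).
    priority_rules maps (last POS, first POS) to a priority; smaller is higher."""
    candidates = []
    for i in range(len(units) - 1):
        key = (units[i].morphemes[-1][1], units[i + 1].morphemes[0][1])
        if key in priority_rules:
            candidates.append((priority_rules[key], i))
    # Assumed narrowing-down condition: keep only the max_pauses best candidates.
    chosen = sorted(candidates)[:max_pauses]
    return sorted(i for _, i in chosen)

# Hypothetical morphological analysis result for "私は東京へ行く".
morphemes = [Unit([("私", "noun")]), Unit([("は", "particle")]),
             Unit([("東京", "noun")]), Unit([("へ", "particle")]),
             Unit([("行く", "verb")])]
rules = [CombineRule("noun", "particle", "noun_phrase")]
merged = apply_rules(morphemes, rules)

# Illustrative priorities only; not values taken from this specification.
priorities = {("particle", "noun"): 1, ("particle", "verb"): 2}
pauses = set_pauses(merged, priorities, max_pauses=1)
```

With these assumed rules, the five morphemes are grouped into three units and a single pause position is selected at the boundary whose rule has the highest priority. Testing only the boundary morphemes keeps each rule independent of how long the already combined unit has become.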