JP2016009428A

JP2016009428A - Morphological analysis turning apparatus, voice synthesis system, and morphological analysis tuning method

Info

Publication number: JP2016009428A
Application number: JP2014131243A
Authority: JP
Inventors: 慶華孫; Keika Son
Original assignee: Hitachi ULSI Systems Co Ltd
Current assignee: Hitachi Solutions Technology Ltd
Priority date: 2014-06-26
Filing date: 2014-06-26
Publication date: 2016-01-18
Anticipated expiration: 2034-06-26
Also published as: JP6318024B2

Abstract

PROBLEM TO BE SOLVED: To provide a technique for reducing a burden of a user applied to correction work of a morpheme by optimizing a morphological dictionary while retaining the morpheme to a minimum unit.SOLUTION: A morphological analysis tuning apparatus performs morphological analysis of an input text, and compares an optimum morphological analysis result as a morphological analysis result in which a cost of the morphological analysis result takes a minimum value, with a correct morphological analysis result that a user inputs related to the input text as correct data. As the result of the comparison processing, when there is a different part between the correct morphological analysis result and the optimum morphological analysis result, the morphological analysis tuning apparatus creates a cost adjustment rule such that the cost of the correct morphological analysis result becomes smaller than that of the optimum morphological analysis result.

Description

本発明は、形態素解析チューニング装置、音声合成システム、及び形態素解析チューニング方法に関し、例えば、入力されるテキストの音声を合成する際に、テキスト解析時に生じる形態素の誤解析を修正（チューニング）するための技術に関する。 The present invention relates to a morpheme analysis tuning apparatus, a speech synthesis system, and a morpheme analysis tuning method, for example, for correcting (tuning) a morpheme misanalysis that occurs during text analysis when synthesizing input text speech. Regarding technology.

近年、生活の多くの場面で音声合成処理された音声を耳にする機会が増えている。波形接続方式の導入などにより、合成音は音質的にもかなり改善が進んでいる。従って、車載用ナビゲーション装置、公共施設における自動放送装置、メール読み上げ装置、自動通訳システムなど、音声を用いて自動的に情報を提供するサービスが広く普及している。 In recent years, there are increasing opportunities to hear voices that have undergone voice synthesis processing in many scenes of daily life. With the introduction of the waveform connection method, the synthesized sound has been improved considerably in terms of sound quality. Accordingly, services that automatically provide information using voice, such as an in-vehicle navigation device, an automatic broadcasting device in a public facility, a mail reading device, and an automatic interpretation system, are widely used.

テキスト音声合成技術は、テキスト解析と音声生成という２つのステップで構成されている。テキスト解析では、入力された自然言語を、意味の持つ最小単位（形態素）に分解したうえ、読み付与、アクセント決定、韻律階層解析などを行う（形態素解析）。この形態素解析は、後続する全てのテキスト解析モジュールの動作を左右するため、最も音質に影響するプロセスである。形態素解析については、例えば、特許文献１に開示されている。 Text-to-speech synthesis technology consists of two steps: text analysis and speech generation. In text analysis, the input natural language is decomposed into the smallest meaningful units (morphemes), and then reading, accent determination, prosodic hierarchy analysis, etc. are performed (morphological analysis). This morphological analysis is the process that most affects the sound quality because it affects the operation of all subsequent text analysis modules. About morphological analysis, it is indicated by patent documents 1, for example.

現在、日本語形態素解析においてよく使用されている形態素解析ツールとしては、Yahoo!（登録商標）、google（登録商標）などが開発したビジネス用のツールもあれば、JUMAN、ChaSenとMeCabをはじめとするオープンソースのツールも挙げられる。前者は有料となっているか、あるいは使用に当たって厳しい制限が課せられているため、言語研究においては、後者のほうが幅広く利用されている。 Currently, there are business tools developed by Yahoo! (registered trademark), google (registered trademark), etc. as morpheme analysis tools often used in Japanese morpheme analysis, including JUMAN, ChaSen and MeCab. Open source tools to do this. The latter is more widely used in linguistic research because the former is charged or has severe restrictions on its use.

形態素解析手法は、大きく分けて、ルールベース形態素解析手法と統計ベース形態素解析手法に分類される。統計ベース形態素解析手法は、データに基づいて統計的にパラメータ学習を行うものであり、最近よく用いられている。一方、ルールベース形態素解析手法は、人手による規則(ヒューリスティックス)に基づいて解析を行うものであり、80年代以前は主流であった。具体的に、ルールベース形態素解析手法は、単語、あるいは、品詞が連接して文中に現れる可能性を規則として列挙（連接可能性規則表）し、可能な品詞列だけを解として求めるようにした方法である。そして、連接可能性規則表は、品詞（あるいは単語）による２次元の行列として表される。この連接可能性表を用いることにより文法的に不自然な品詞の連接を制限することができるので、ある程度の曖昧性を解消することができる一方で、複数の解が存在する可能性が残るという課題がある。この課題を解決するために、ルールベース形態素解析手法においては、単語、あるいは、品詞が連接して文中に現れる可能性の強さをコストとして数値化し、最もコストの低い品詞列を解として求めるようにしている。接続コスト表は、表１に示されるように、品詞（あるいは単語）を行と列に持ち、コストを要素とする２次元の行列によって表される。よって、形態素解析時に生じる複数の解の間の優劣をコストによって区別することができるようになる。なお、このようなコストは、人手によって決めるのが一般的である。また、ここで、コストの他に、スコアという表現もあるが、両者の違いは、コストは小さければ小さいほど接続の強さが強くなる概念であるのに対して、スコアは大きければ大きいほど接続の強さが強くなる概念である。両者は相補的に用いることができる概念である。なお、形態素解析では、形態素の数が多くならないように、スコアの最大化ではなく、コストの最小化を行うのが一般的である。
しかるに、人手によってコストを決める場合、ユーザの負担が大きく、コストの決め方に一貫性を保つのが難しいなどの課題がある。 Morphological analysis methods are roughly classified into rule-based morpheme analysis methods and statistical-based morpheme analysis methods. The statistical-based morphological analysis method is a method that statistically performs parameter learning based on data, and is frequently used recently. On the other hand, the rule-based morphological analysis method performs analysis based on manual rules (heuristics), and was prevalent before the 1980s. Specifically, the rule-based morphological analysis method enumerates the possibility that words or parts of speech are concatenated and appear in a sentence as rules (jointability rule table), and obtains only possible part of speech as a solution. Is the method. The connectability rule table is represented as a two-dimensional matrix with parts of speech (or words). By using this connection possibility table, it is possible to limit the connection of parts of speech that are grammatically unnatural, so that some ambiguity can be resolved, but there is still the possibility that multiple solutions exist. There are challenges. In order to solve this problem, in the rule-based morphological analysis method, the strength of the possibility that words or parts of speech are connected and appear in a sentence is quantified as a cost, and the part of speech sequence with the lowest cost is obtained as a solution. I have to. As shown in Table 1, the connection cost table is represented by a two-dimensional matrix having parts of speech (or words) in rows and columns and cost as an element. Therefore, superiority or inferiority among a plurality of solutions generated during morphological analysis can be distinguished by cost. Such costs are generally determined manually. In addition to the cost, there is also the expression of the score, but the difference between the two is the concept that the strength of the connection increases as the cost decreases, whereas the connection increases as the score increases. It is a concept that increases the strength of. Both are concepts that can be used complementarily. In morphological analysis, it is common to minimize the cost rather than maximizing the score so that the number of morphemes does not increase.
However, when the cost is determined manually, there is a problem that the burden on the user is large and it is difficult to maintain consistency in the cost determination method.

この点、近年、パソコンの処理能力の向上を背景に、大規模テキストデータ(タグ付コーパス)からのコスト(確率値)学習が可能になった。このため、品詞が付与された解析済みコーパスから、単語の出現のしやすさ、品詞(単語)のつながりやすさを確率値として求める手法（コーパスベース形態素解析手法）が提案されている。これは、文を構成する単語の出現確率が直前のｎ−１語にのみ依存すると仮定して言語の文をモデル化する手法である（ｎ＝３のときはtri-gram model、ｎ＝２のときはbi-gram modelという）。このモデル化の手法として代表的なものとして、前出の、bi-gram とtri-gram の混合を用いる「ChaSen」、HMMによる自動学習機能が実現した「JUMAN」、パラメータの推定にConditional Random Fields (CRF) を用いた「MeCab」等が挙げられる。これにより、ユーザの負担を軽減することはできるようになった。 In recent years, it has become possible to learn cost (probability value) from large-scale text data (tagged corpus) against the backdrop of improvements in the processing power of personal computers. For this reason, a technique (corpus-based morphological analysis technique) has been proposed in which the ease of appearance of words and the ease of connection of parts of speech (words) are obtained as probability values from an analyzed corpus assigned parts of speech. This is a technique for modeling a sentence in a language on the assumption that the appearance probability of a word constituting the sentence depends only on the immediately preceding n-1 word (tri-gram model when n = 3, n = 2). Is called a bi-gram model). Typical modeling methods include “ChaSen” that uses a mixture of bi-gram and tri-gram, “JUMAN” that realizes an automatic learning function using HMM, and Conditional Random Fields for parameter estimation. Examples include “MeCab” using (CRF). As a result, the burden on the user can be reduced.

特開昭５６−１７４６７号公報JP-A-56-17467

しかし、このようなコーパスベース形態素解析手法では、大規模テキストコーパスが必要となるのが一般的であり、この大規模テキストコーパスの整備は非常に難しい。そして、テキスト解析にミスがある場合、テキストコーパスをチューニングすることになるが、手作業によって統計モデルを修正することはできない。このため、テキストコーパスに正解データを追加し、再学習することで対応することになる。ただし、この方法を用いたとしても解析ミスが是正可能であることを確実に保証することはできない。 However, such a corpus-based morphological analysis method generally requires a large-scale text corpus, and it is very difficult to maintain this large-scale text corpus. If there is a mistake in text analysis, the text corpus will be tuned, but the statistical model cannot be corrected manually. For this reason, the correct data is added to the text corpus and re-learning is performed. However, even if this method is used, it cannot be surely guaranteed that an analysis error can be corrected.

ところで、近年、合成音声の音質が格段に向上し、いろんな場面で応用されている。例えば、カーナビの道案内や鉄道の構内放送の場面である。これらの場面においては、読み間違いを極力減らしたいというお客さんの要望がある。この要望に対応し、読み上げ精度を上げるためには、日々辞書整備や、読み間違いの修正などのチューニング作業が必須となっている。 By the way, in recent years, the sound quality of synthesized speech has been remarkably improved and applied in various scenes. For example, it is a scene of car navigation route guidance or railway premises broadcasting. In these situations, there is a customer's desire to reduce reading errors as much as possible. In order to meet this demand and improve the reading accuracy, tuning work such as daily dictionary maintenance and correction of reading mistakes is indispensable.

しかし、統計ベース形態素解析手法では、大規模データベース（学習データ）から、統計した確率モデルを用いるため、手作業によるチューニングができない。形態素解析ミスの発生した文書については、正解ラベルを付与してから、学習用データに追加し、モデルを再学習することで対応するので、非常に時間がかかるうえ、解析ミスを確実に改善する保証ができない。 However, since the statistical-based morphological analysis method uses a statistical probability model from a large-scale database (learning data), manual tuning cannot be performed. For documents with morphological analysis errors, correct labels are assigned, and then added to the training data, and the model is re-learned. I cannot guarantee it.

一方、ルールベース形態素解析手法では、品詞情報のみを用いたコストテーブルのチューニングが非常に難しく、個別の特殊ケースに対応できない。このため、より長い形態素を登録することで、形態素解析に対するチューニングを行ってきた。 On the other hand, with the rule-based morphological analysis method, it is very difficult to tune the cost table using only part-of-speech information, and it is not possible to deal with individual special cases. For this reason, tuning for morphological analysis has been performed by registering longer morphemes.

しかし、このようなチューニング手法では、形態素（汎用性がない個別の形態素）の数が膨大となり、システム肥大化やパフォーマンス低下の原因となる。そして、複数品詞にわたった単語登録が必要な場合や複数アクセントが存在する場合は、その登録が難しくなり、ユーザの負担が過大となるとともに、ユーザの主観が入り易くなってしまう。また、形態素はできるだけ意味のある最小単位で構成されることが望ましいが、特殊な文言を形態素として出現の都度に登録することになると、今までの形態素解析手法と処理動作上のマッチングが悪化する可能性がある。 However, in such a tuning method, the number of morphemes (individual morphemes having no versatility) becomes enormous, causing system enlargement and performance degradation. When word registration over a plurality of parts of speech is necessary or there are a plurality of accents, the registration becomes difficult, the burden on the user becomes excessive, and the user's subjectivity easily enters. In addition, it is desirable that morphemes are composed of the smallest units that are meaningful as much as possible. However, if special words are registered as morphemes each time they appear, the matching between the morphological analysis methods used so far and the processing operations deteriorates. there is a possibility.

本発明はこのような状況に鑑みてなされたものであり、形態素を最小単位に保ちつつ、形態素辞書の最適化を図り、ユーザの形態素の修正作業に掛かる負担を軽減するための技術を提供する。 The present invention has been made in view of such a situation, and provides a technique for optimizing a morpheme dictionary and reducing a burden on a user's morpheme correction work while keeping the morpheme in a minimum unit. .

以上課題を解決するために、本発明は、テキスト文章の形態素解析結果に基づいて形態素解析に用いるコスト調整ルールを生成する形態素解析チューニング装置を提供する。当該形態素解析チューニング装置は、少なくとも、ユーザが正しいと判定して入力された正解形態素解析結果を格納する正解形態素解析結果データベースと、正解形態素解析結果と、形態素辞書から読み出された形態素と、を用いてコスト調整ルールを最適化しながら作成するプロセッサと、を有している。さらに、プロセッサは、テキスト文章の形態素解析結果のうちコストが最小の値を取る形態素解析結果である最適形態素解析結果と、テキスト文章に関してユーザが正解データとして入力した正解形態素解析結果と、を比較する比較処理と、比較処理の結果、正解形態素解析結果と最適形態素解析結果とが異なる部分を有する場合、正解形態素解析結果が最適形態素解析結果よりもコストが小さくなるように、コスト調整ルールを生成する調整処理と、を実行する。 In order to solve the above problems, the present invention provides a morphological analysis tuning apparatus that generates a cost adjustment rule used for morphological analysis based on a morphological analysis result of a text sentence. The morpheme analysis tuning apparatus includes at least a correct morpheme analysis result database that stores correct morpheme analysis results that have been determined by the user to be correct, a correct morpheme analysis result, and a morpheme read from the morpheme dictionary. And a processor that creates a cost adjustment rule while optimizing it. Further, the processor compares the optimal morpheme analysis result, which is the morpheme analysis result having the lowest cost among the morpheme analysis results of the text sentence, with the correct morpheme analysis result input by the user as the correct data regarding the text sentence. If the comparison process and the result of the comparison process show that the correct morpheme analysis result and the optimal morpheme analysis result have different parts, a cost adjustment rule is generated so that the correct morpheme analysis result is less expensive than the optimal morpheme analysis result. The adjustment process is executed.

本発明に関連する更なる特徴は、本明細書の記述、添付図面から明らかになるものである。また、本発明の態様は、要素及び多様な要素の組み合わせ及び以降の詳細な記述と添付される特許請求の範囲の様態により達成され実現される。 Further features related to the present invention will become apparent from the description of the present specification and the accompanying drawings. The embodiments of the present invention can be achieved and realized by elements and combinations of various elements and the following detailed description and appended claims.

本明細書の記述は典型的な例示に過ぎず、本発明の特許請求の範囲又は適用例を如何なる意味に於いても限定するものではないことを理解する必要がある。 It should be understood that the description herein is merely exemplary and is not intended to limit the scope of the claims or the application of the invention in any way.

本発明によれば、ユーザが文全体の正しい形態素解析結果をシステムに与えるだけで、システムが自動的に最適な修正対象形態素列と、最適な修正範囲（汎用性の大きさ）、最適なスコア情報調整値などを決定できる。このため、ユーザに対する負担が小さく、異なったユーザに対しても、修正の一貫性を保つことができる。 According to the present invention, the user automatically gives the correct morpheme analysis result of the entire sentence to the system, and the system automatically optimizes the correction target morpheme sequence, the optimal correction range (the degree of versatility), and the optimal score. Information adjustment values can be determined. For this reason, the burden on the user is small, and consistency of correction can be maintained for different users.

本発明の実施形態による音声合成システムの概略構成を示す図である。It is a figure which shows schematic structure of the speech synthesis system by embodiment of this invention. 一般的なテキスト音声合成処理の構成と手順の概要を説明するための図である。It is a figure for demonstrating the structure and procedure outline | summary of a general text-to-speech synthesis process. 自然言語処理の内容を示す図である。It is a figure which shows the content of the natural language process. テキスト解析装置及び形態素解析チューニング装置を１つのコンピュータシステムで構成した場合のハードウェア構成を示す図である。It is a figure which shows the hardware constitutions at the time of comprising a text analysis device and a morphological analysis tuning device by one computer system. ユーザインターフェースを端末側のデバイスに設け、テキスト解析機能及び形態素解析チューニング機能の全部もしくは一部を実現するハードウェアをサーバに設けたシステム構成例を示す図である。It is a figure which shows the system configuration example which provided the user interface in the device of the terminal side, and provided the hardware which implement | achieves all or one part of a text analysis function and a morphological analysis tuning function in the server. テキスト解析装置と形態素解析チューニング装置を別々のコンピュータで構成した場合の構成例を示す図である。It is a figure which shows the structural example at the time of comprising a text analysis apparatus and a morphological analysis tuning apparatus with a separate computer. テキスト解析及び形態素解析チューニングの機能的構成をブロック図である。It is a block diagram about the functional structure of text analysis and morphological analysis tuning. 解析結果提示部７０３の詳細構成（図８Ａ）及び提示例（図８Ｂ）を示す図である。It is a figure which shows the detailed structure (FIG. 8A) and the example of presentation (FIG. 8B) of the analysis result presentation part 703. FIG. 正解形態素解析結果の提示画面例を示す図である。It is a figure which shows the example of a presentation screen of a correct morphological analysis result. ターゲット形態素自動抽出部で実行される処理を説明するための図である。It is a figure for demonstrating the process performed by the target morpheme automatic extraction part. 形態素解析結果比較モジュールによって抽出される、正解形態素解析結果と最適形態素解析結果の異なる部分の例を示す図である。It is a figure which shows the example of the part from which the correct morphological analysis result and the optimal morphological analysis result extracted by the morphological analysis result comparison module. 修正手法判定部で実行される処理を説明するための図である。It is a figure for demonstrating the process performed in the correction method determination part. 単語登録部で実行される処理を説明するための図である。It is a figure for demonstrating the process performed in a word registration part. コスト調整ルール作成部で実行される処理を説明するための図である。It is a figure for demonstrating the process performed by the cost adjustment rule preparation part.

以下、添付図面を参照して本発明の実施形態について説明する。添付図面では、機能的に同じ要素は同じ番号で表示される場合もある。なお、添付図面は本発明の原理に則った具体的な実施形態と実装例を示しているが、これらは本発明の理解のためのものであり、決して本発明を限定的に解釈するために用いられるものではない。 Hereinafter, embodiments of the present invention will be described with reference to the accompanying drawings. In the accompanying drawings, functionally identical elements may be denoted by the same numbers. The attached drawings show specific embodiments and implementation examples based on the principle of the present invention, but these are for understanding the present invention and are not intended to limit the present invention. Not used.

本実施形態では、当業者が本発明を実施するのに十分詳細にその説明がなされているが、他の実装・形態も可能で、本発明の技術的思想の範囲と精神を逸脱することなく構成・構造の変更や多様な要素の置き換えが可能であることを理解する必要がある。従って、以降の記述をこれに限定して解釈してはならない。 This embodiment has been described in sufficient detail for those skilled in the art to practice the present invention, but other implementations and configurations are possible without departing from the scope and spirit of the technical idea of the present invention. It is necessary to understand that the configuration and structure can be changed and various elements can be replaced. Therefore, the following description should not be interpreted as being limited to this.

更に、本発明の実施形態は、後述されるように、汎用コンピュータ上で稼動するソフトウェアで実装しても良いし専用ハードウェア又はソフトウェアとハードウェアの組み合わせで実装しても良い。 Furthermore, as will be described later, the embodiment of the present invention may be implemented by software running on a general-purpose computer, or may be implemented by dedicated hardware or a combination of software and hardware.

以下では、各処理「部」（例えば、コスト調整ルール作成部７０９）を主語（動作主体）として本発明の実施形態における各処理について説明を行うが、各処理部による処理はプロセッサによって実行されることになるため、プロセッサを主語とした説明としてもよい。 Hereinafter, each process in the embodiment of the present invention will be described with each process “unit” (for example, the cost adjustment rule creating unit 709) as the subject (operation subject), but the process by each process unit is executed by the processor. Therefore, the description may be made with the processor as the subject.

＜音声合成システム＞
図１は、本発明の実施形態による音声合成システム１００の概略構成を示す図である。音声合成システム１００は、テキスト解析装置１０１と、形態素解析チューニング装置１０２と、音声合成装置１０３と、を有している。 <Speech synthesis system>
FIG. 1 is a diagram showing a schematic configuration of a speech synthesis system 100 according to an embodiment of the present invention. The speech synthesis system 100 includes a text analysis device 101, a morphological analysis tuning device 102, and a speech synthesis device 103.

テキスト解析装置１０１は、ユーザによって入力された（キーボード入力、スキャナ入力等を含む）テキストデータを受け付け、形態素辞書を参照して形態素解析を実行することにより入力されたテキストを分解・解析する。 The text analysis apparatus 101 receives text data (including keyboard input, scanner input, etc.) input by the user, and decomposes and analyzes the input text by referring to the morpheme dictionary and executing morpheme analysis.

形態素解析チューニング装置１０２は、テキスト解析結果を取得し、当該テキスト解析結果が正しいか否かのユーザによる判定入力を受け付ける。そして、形態素解析チューニング装置１０２は、ユーザによる判定入力に基づいて形態素解析結果のコスト（スコア）値を修正するか判定し、修正する場合にはそのためのルールを決定する。新たなルールが決定された場合、テキスト解析装置１０１は、新ルールに従って、再度形態素解析を行い、正しい解析結果を音声合成装置１０３に出力する。 The morphological analysis tuning apparatus 102 acquires a text analysis result and accepts a determination input by the user as to whether or not the text analysis result is correct. Then, the morphological analysis tuning apparatus 102 determines whether or not to correct the cost (score) value of the morphological analysis result based on the determination input by the user, and determines a rule for the correction. When a new rule is determined, the text analysis apparatus 101 performs morphological analysis again according to the new rule, and outputs a correct analysis result to the speech synthesizer 103.

音声合成装置１０３は、テキスト解析結果をテキスト解析装置１０１から取得し、テキスト解析結果に含まれる発音記号列を用いて、韻律を予測し、音声を生成する。 The speech synthesizer 103 acquires a text analysis result from the text analysis device 101, predicts a prosody using a phonetic symbol string included in the text analysis result, and generates speech.

＜テキスト音声合成処理の構成及び手順＞
図２は、一般的なテキスト音声合成処理の構成と手順の概要を説明するための図である。テキスト音声合成（ＴＴＳともいわれる）技術は、テキストから音声を合成する一連のプロセスであり、大きく分けて、「自然言語処理」と「音声生成」で構成されている。図３は、自然言語処理の詳細を示す図である。 <Structure and procedure of text-to-speech synthesis>
FIG. 2 is a diagram for explaining an outline of a configuration and procedure of a general text-to-speech synthesis process. Text-to-speech synthesis (also referred to as TTS) technology is a series of processes for synthesizing speech from text, and is roughly composed of “natural language processing” and “speech generation”. FIG. 3 is a diagram showing details of natural language processing.

自然言語処理では、図３に示されるように、入力された自然言語を、意味の持つ最小単位（形態素）に分解し、読みを付与し、アクセントを決定し、韻律階層解析などを行って、発音記号列を出力する。
音声生成処理では、入力された発音記号列を用いて、韻律を予測し、音声を生成する。 In natural language processing, as shown in FIG. 3, the input natural language is decomposed into the smallest meaningful units (morphemes), readings are added, accents are determined, and prosodic hierarchy analysis is performed. Output phonetic symbol string.
In the speech generation process, the prosody is predicted using the input phonetic symbol string, and speech is generated.

本発明の実施形態は、テキスト音声合成処理において、形態素解析部分をチューニングすることで、合成音声処理における読み間違いを改善することを目的とする。このため、上述したように、テキスト解析装置１０１及び音声合成装置１０３の他に、形態素解析チューニング装置１０２が音声合成システム１００に含まれている。 An object of the embodiment of the present invention is to improve reading errors in synthesized speech processing by tuning a morphological analysis part in text speech synthesis processing. Therefore, as described above, the morphological analysis tuning apparatus 102 is included in the speech synthesis system 100 in addition to the text analysis apparatus 101 and the speech synthesis apparatus 103.

まず、形態素解析チューニング装置１０２は、テキスト解析結果（もしくは、合成した音声）をチューニング作業者（ユーザ）に提示する。ユーザは、提示されたテキスト解析結果を検証し、解析間違い（音声の場合は、読み間違い）があるかどうかを判断する。そして、間違いがあった場合、当該形態素解析チューニング装置は、ユーザが入力する正解形態素解析結果（ユーザが正解と思われるもの）を受け付ける。また、当該形態素解析チューニング装置は、テキスト解析装置が解析した形態素解析結果（コストが最小値を取る最適形態素解析結果）と照らし合わせ、最適なターゲット形態素系列を抽出する。ターゲット形態素系列について、当該装置は、自動的に単語登録が必要なケースと、コスト調整が必要なケースと分別し、それぞれ単語登録とコスト調整ルール作成を行う。コスト調整ルールを作成する場合、当該装置は、ユーザが指定する正解形態素解析結果のコスト最小になるように、抽出した形態素系列のみに対して、コスト調整を行い、コスト調整ルールを形態素解析用データに追加する。なお、コスト調整ルールの作成に際しては、事前に用意した正解形態素解析データベースを用いて、最適なコスト調整値及びルール適用範囲を設定することが望ましい。 First, the morphological analysis tuning apparatus 102 presents a text analysis result (or synthesized speech) to a tuning operator (user). The user verifies the presented text analysis result and determines whether there is an analysis error (in the case of speech, reading error). And when there is a mistake, the said morphological analysis tuning apparatus receives the correct morphological analysis result (what a user thinks is a correct answer) which a user inputs. Further, the morpheme analysis tuning apparatus extracts an optimal target morpheme series in comparison with a morpheme analysis result (optimal morpheme analysis result having a minimum cost) analyzed by the text analysis apparatus. For the target morpheme sequence, the apparatus automatically separates the case where word registration is necessary and the case where cost adjustment is necessary, and performs word registration and cost adjustment rule creation, respectively. When creating a cost adjustment rule, the device performs cost adjustment only on the extracted morpheme sequence so that the cost of the correct morphological analysis result specified by the user is minimized, and the cost adjustment rule is converted into morphological analysis data. Add to In creating the cost adjustment rule, it is desirable to set an optimal cost adjustment value and rule application range using a correct morphological analysis database prepared in advance.

＜ハードウェア構成例＞
以下、本発明の実施形態に係る音声合成システム１００に含まれるテキスト解析装置１０１及び形態素解析チューニング装置１０２のハードウェア構成について説明する。 <Hardware configuration example>
Hereinafter, the hardware configuration of the text analysis device 101 and the morphological analysis tuning device 102 included in the speech synthesis system 100 according to the embodiment of the present invention will be described.

（１）１つのコンピュータで構成した場合の構成例
図４は、テキスト解析装置１０１及び形態素解析チューニング装置１０２を１つのコンピュータシステムで構成した場合のハードウェア構成を示す図である。テキスト解析／形態素解析チューニング装置４００は、ＣＰＵ４０１と、主記憶装置であるメモリ４０２と、記憶装置４０３と、テキスト入力Ｉ／Ｆ（インタフェース）４０４と、音声入力Ｉ／Ｆ４０５と、スピーカーに接続する音声出力Ｉ／Ｆ４０６と、テキスト提示Ｉ／Ｆ４０７と、を含み、これらの各構成部はバス４０８によって相互に接続されている。テキスト解析／形態素解析チューニング装置４００は、例えば、カーナビゲーション装置、携帯電話機、パーソナルコンピュータ等のデバイスに、テキスト解析／形態素解析チューニングユニットとして組み込まれている。そのため、図４に示した各ハードウェアは、テキスト解析／形態素解析チューニング装置が組み込まれたデバイスの構成を用いて実現しても良いし、テキスト解析／形態素解析チューニング装置が組み込まれたデバイスとは別個に設けられていても良い。 (1) Configuration Example when Constructed with One Computer FIG. 4 is a diagram showing a hardware configuration when the text analysis device 101 and the morphological analysis tuning device 102 are configured with one computer system. The text analysis / morpheme analysis tuning apparatus 400 includes a CPU 401, a memory 402 as a main storage device, a storage device 403, a text input I / F (interface) 404, a voice input I / F 405, and a voice connected to a speaker. An output I / F 406 and a text presentation I / F 407 are included, and these components are connected to each other by a bus 408. The text analysis / morpheme analysis tuning apparatus 400 is incorporated as a text analysis / morpheme analysis tuning unit in a device such as a car navigation apparatus, a mobile phone, or a personal computer. Therefore, each hardware shown in FIG. 4 may be realized by using a device configuration in which the text analysis / morpheme analysis tuning apparatus is incorporated, or a device in which the text analysis / morpheme analysis tuning apparatus is incorporated. It may be provided separately.

（２）端末とサーバで構成した場合の構成例
図４のように全ての機能を１つのコンピュータデバイスだけで実現しても良いが、図５に示すように、ユーザインターフェース（テキスト入力Ｉ／Ｆ、音声出力Ｉ／Ｆ、テキスト提示Ｉ／Ｆ）は端末側のデバイスに備えて、テキスト解析機能、及び形態素解析チューニング機能の全部もしくは一部を実現するハードウェアをサーバに備え、その間に通信Ｉ／Ｆによって相互に接続されている場合も考えられる。 (2) Configuration example when configured with terminals and servers Although all functions may be realized by only one computer device as shown in FIG. 4, a user interface (text input I / F as shown in FIG. 5). , Voice output I / F, text presentation I / F) are provided in the device on the terminal side, and the server is provided with hardware for realizing all or part of the text analysis function and the morphological analysis tuning function. It is also conceivable that they are connected to each other by / F.

（３）テキスト解析装置と形態素解析チューニング装置を別々に構成した場合の構成例
図６は、テキスト解析装置１０１と形態素解析チューニング装置１０２を別々のコンピュータで構成した場合の構成例を示す図である。形態素解析チューニング装置１０２で作成した形態素解析用データ（形態素辞書やコスト調整ルールなど）を使ってテキスト解析を行うために、形態素解析の機能を実現するハードウェア（形態素解析チューニング装置１０２）から、テキスト解析装置１０１の記憶装置であって、形態素解析用データを格納する記憶装置にアクセスできなければならない。そこで、テキスト解析装置１０１と形態素解析チューニング装置１０２は、ネットワーク６００を介して接続されている。なお、ネットワークを介さなくてもバス等によって直接接続するようにしても良いし、両装置間のデータの受け渡しを、記録媒体（ＤＶＤ、ＣＤ、ＵＳＢメモリなど）を介して行うようにしても良い。 (3) Configuration example when the text analysis device and the morphological analysis tuning device are configured separately FIG. 6 is a diagram illustrating a configuration example when the text analysis device 101 and the morphological analysis tuning device 102 are configured by separate computers. . In order to perform text analysis using morpheme analysis data (morpheme dictionary, cost adjustment rule, etc.) created by the morpheme analysis tuning device 102, text from the hardware (morpheme analysis tuning device 102) that implements the function of morpheme analysis It must be accessible to the storage device of the analysis device 101 that stores morphological analysis data. Therefore, the text analysis device 101 and the morphological analysis tuning device 102 are connected via the network 600. It should be noted that the connection may be made directly by a bus or the like without going through a network, or data may be exchanged between both devices via a recording medium (DVD, CD, USB memory, etc.). .

図６に示されるように、テキスト解析装置１０１は、ＣＰＵ１０１１と、メモリ１０１２と、記憶装置１０１３と、音声入力Ｉ／Ｆ１０１４と、音声出力Ｉ／Ｆ１０１５と、通信Ｉ／Ｆ１０１６と、を有し、これらがバス１０１７を介して相互に接続されている。また、形態素解析チューニング装置１０２は、ＣＰＵ１０２１と、メモリ１０２２と、記憶装置１０２３と、音声入力Ｉ／Ｆ１０２４と、音声出力Ｉ／Ｆ１０２５と、テキスト提示Ｉ／Ｆ１０２７と、通信Ｉ／Ｆ１０２６と、を有し、これらがバス１０２８を介して相互に接続されている。 As shown in FIG. 6, the text analysis apparatus 101 includes a CPU 1011, a memory 1012, a storage device 1013, a voice input I / F 1014, a voice output I / F 1015, and a communication I / F 1016. These are connected to each other via a bus 1017. The morphological analysis tuning apparatus 102 includes a CPU 1021, a memory 1022, a storage device 1023, a voice input I / F 1024, a voice output I / F 1025, a text presentation I / F 1027, and a communication I / F 1026. These are connected to each other via a bus 1028.

以上のように、テキスト解析装置１０１と形態素解析チューニング装置１０２は相互にデータの受け渡しをすることが可能であるので、テキスト解析装置１０１は、直接に記憶装置１０１３に格納されている、形態素解析チューニング装置１０２によって作成されたコスト調整ルールにアクセスすることができる。 As described above, since the text analysis device 101 and the morphological analysis tuning device 102 can exchange data with each other, the text analysis device 101 directly stores the morphological analysis tuning stored in the storage device 1013. Cost adjustment rules created by the device 102 can be accessed.

ＣＰＵ１０１１及び１０２１は、ユーザ音声ＤＢ作成装置の全体の制御を司る。メモリ１０１２及び１０２２は、ＣＰＵのワークエリアとして使用される。記憶装置１０１３及び１０２３は、不揮発性の記憶媒体であり、具体的には、例えば、ＨＤＤ（ハードディスク）、ＦＤ（フレキシブルディスク）、フラッシュメモリ等を用いることができる。記憶装置１０１３及び１０２３には、例えば、後記する音質評価プログラムや素片置き換えプログラム等の各種プログラム、音声合成用既存音声データベース等の各種データが記録される。音声入力Ｉ／Ｆ１０１４及び１０２４は、マクロフォンなどの音声入力装置（不図示）を接続するインタフェースであり、音声入力装置から音声の入力を受け付ける。音声出力Ｉ／Ｆ１０１５及び１０２５は、スピーカーなどの音声出力装置（不図示）を接続するインタフェースである。通信Ｉ／Ｆ１０１６及び１０２６は、装置の間にデータを交換する（通信する）ためのハードウェアである。例えば、有線ＬＡＮカード、無線ＬＡＮカード、モデムなどが考えられる。テキスト提示Ｉ／Ｆ１０２７は、パソコンモニターや携帯画面など、テキストを表示できる装置である。音声でテキストをユーザに提示する場合、テキスト提示Ｉ／Ｆ１０２７は、音声出力Ｉ／Ｆで代用（併用）することも可能である。 The CPUs 1011 and 1021 govern overall control of the user voice DB creation device. The memories 1012 and 1022 are used as a work area for the CPU. The storage devices 1013 and 1023 are nonvolatile storage media. Specifically, for example, an HDD (hard disk), an FD (flexible disk), a flash memory, or the like can be used. In the storage devices 1013 and 1023, for example, various programs such as a sound quality evaluation program and a segment replacement program described later, and various data such as an existing speech database for speech synthesis are recorded. The voice input I / Fs 1014 and 1024 are interfaces for connecting a voice input device (not shown) such as a macrophone, and receive voice input from the voice input device. Audio output I / Fs 1015 and 1025 are interfaces for connecting an audio output device (not shown) such as a speaker. The communication I / Fs 1016 and 1026 are hardware for exchanging data (communication) between apparatuses. For example, a wired LAN card, a wireless LAN card, a modem, etc. can be considered. The text presentation I / F 1027 is a device that can display text, such as a personal computer monitor or a mobile screen. When the text is presented to the user by voice, the text presentation I / F 1027 can be substituted (combined) with the voice output I / F.

＜テキスト解析及び形態素解析チューニングの機能的構成）
図７は、テキスト解析及び形態素解析チューニングの機能的構成をブロック図である。図７に示すように、テキスト解析装置を実現するための機能としては、テキスト入力部７０１と、形態素解析部７０２と、解析結果提示部７０３と、形態素辞書７０４と、が含まれる。形態素解析チューニング機能としては、解析結果判定部７０５と、正解形態素解析結果入力部７０６と、ターゲット形態素自動抽出部７０７と、修正手法判定部７０８と、コスト調整ルール作成部７０９と、正解形態素解析結果データベース７１０と、コスト調整ルール群７１１と、が含まれる。以下、それぞれについて、その詳細を説明する。 <Functional configuration of text analysis and morphological analysis tuning>
FIG. 7 is a block diagram showing the functional configuration of text analysis and morphological analysis tuning. As shown in FIG. 7, the functions for realizing the text analysis device include a text input unit 701, a morpheme analysis unit 702, an analysis result presentation unit 703, and a morpheme dictionary 704. The morphological analysis tuning function includes an analysis result determination unit 705, a correct morpheme analysis result input unit 706, a target morpheme automatic extraction unit 707, a correction method determination unit 708, a cost adjustment rule creation unit 709, and a correct morpheme analysis result. A database 710 and a cost adjustment rule group 711 are included. Details of each will be described below.

（i）テキスト入力部
テキスト入力部７０１は、ユーザが入力するテキストを受け付ける機能を有する。ここで、キーボードによる手入力や、事前に用意したテキストデータベースからの自動入力や、ウェブからテキストの自動入力などが考えられる。また、スキャナで文章を取り込んでテキストデータとしても良い。 (I) Text Input Unit The text input unit 701 has a function of accepting text input by the user. Here, manual input with a keyboard, automatic input from a text database prepared in advance, automatic input of text from the web, and the like can be considered. Alternatively, text data may be obtained by taking a sentence with a scanner.

（ii）形態素解析部
形態素解析部７０２は、テキスト入力部７０１で受け付けたテキストについて形態素解析を行う。ここで、形態素は、表記だけではなく、品詞、読み、アクセントなどにも考慮した形態素となる。例えば、同じ表記の「東京」についても、品詞=地名の場合と、品詞=人名の場合とがあり、これらを異なる形態素として定義している。形態素辞書７０４は、少なくとも、「形態素表記」、「形態素品詞」、及び「形態素読み（アクセント付き）」の３つの情報を構成項目として含むが必要がある。本発明の実施形態は音声合成に特化しているため、「形態素読み（アクセント付き）」の情報が必要となっている。テキスト解析のみであれば、当該情報は必要ない。 (Ii) Morphological Analysis Unit The morphological analysis unit 702 performs morphological analysis on the text received by the text input unit 701. Here, the morpheme is a morpheme that takes into account not only the notation but also the part of speech, reading, and accent. For example, for the same notation “Tokyo”, there are cases where the part of speech = name of place and the case of part of speech = person name, which are defined as different morphemes. The morpheme dictionary 704 needs to include at least three pieces of information of “morpheme notation”, “morpheme part of speech”, and “morpheme reading (with accent)” as constituent items. Since the embodiment of the present invention specializes in speech synthesis, information on “morpheme reading (with accent)” is required. If it is only text analysis, this information is not necessary.

（iii）解析結果提示部
解析結果提示部７０３は、形態素解析部７０２で解析された形態素解析結果をユーザに提示する機能を有する。提示方法については、テキストのみの提示と、音声のみの提示と、テキストと音声と同時に提示するなどが考えられる。ただし、音声合成の読み間違いを改善することを目的とした場合は、形態素解析結果が間違っていても、合成した音声に読み間違い（アクセント間違いや韻律境界間違いなども含める）がなければ、チューニングする必要がないため、音声とテキストとの同時提示を望ましい。 (Iii) Analysis Result Presentation Unit The analysis result presentation unit 703 has a function of presenting the morpheme analysis result analyzed by the morpheme analysis unit 702 to the user. As the presentation method, it is possible to present only text, present only voice, or present simultaneously with text and voice. However, if the purpose is to improve reading errors in speech synthesis, even if the morphological analysis result is incorrect, if the synthesized speech has no reading errors (including accent errors and prosodic boundary errors), tune it. Since there is no need, simultaneous presentation of voice and text is desirable.

図８は、解析結果提示部７０３の詳細構成（図８Ａ）及び提示例（図８Ｂ）を示す図である。図８では、入力されたテキストが「東京大学」の場合を例としている。テキスト生成モジュール７０３１は、入力された形態素解析結果を解析する。そして、テキスト表示デバイス７０３２は、生成したテキスト（「形態素１：表記=東京、品詞=地名、読み=トウキョウ; 形態素２：表記=大学、品詞=一般名詞、読み=タ゛イカ゛ク」）のみをユーザに提示する。一方、音声合成無ジュール７０３３は、解析した形態素系列を受け取り、音声を合成処理する。そして、音声合成再生デバイス７０３４は、音声合成モジュール７０３３で生成された合成音声を出力する。 FIG. 8 is a diagram illustrating a detailed configuration (FIG. 8A) and a presentation example (FIG. 8B) of the analysis result presentation unit 703. FIG. 8 shows an example in which the input text is “University of Tokyo”. The text generation module 7031 analyzes the input morpheme analysis result. Then, the text display device 7032 presents only the generated text (“morpheme 1: notation = Tokyo, part of speech = place name, reading = Tokyo; morpheme 2: notation = university, part of speech = general noun, reading = dialog”) to the user. To do. On the other hand, the speech synthesis non-joule 7033 receives the analyzed morpheme sequence and synthesizes speech. The speech synthesis / playback device 7034 outputs the synthesized speech generated by the speech synthesis module 7033.

図８Ｂは、テキスト表示と合成音声の両方をユーザに提示する例を示す図である。このようテキストと合成音声の両方を提示しても良いし、どちらか一方のみの提示でも良い。ただし、合成音声を提示した場合、ユーザに特殊な技能や専門知識が必要なく、形態素解析結果の正しさを判定できると考えられる。また、音声合成のため形態素解析であることを前提にした場合、より実際の応用状況に近い提示手法として、合成音声による提示が望ましい。 FIG. 8B is a diagram illustrating an example in which both text display and synthesized speech are presented to the user. In this way, both text and synthesized speech may be presented, or only one of them may be presented. However, when the synthesized speech is presented, it is considered that the user does not need special skills or expertise, and the correctness of the morphological analysis result can be determined. In addition, when it is assumed that morphological analysis is performed for speech synthesis, it is desirable to present synthesized speech as a presentation method that is closer to the actual application situation.

（iv）解析結果判定部
解析結果判定部７０５は、提示された形態素解析結果について、ユーザによる当該結果が正しいか否かの判定を受け付ける機能を有する。解析結果判定部７０５は、例えばボタンなどのインタフェース（例えば、「ＯＫ」と「ＮＧ」のボタン）で構成される。解析結果判定部７０５は、ユーザが当該インタフェースを用いて解析結果の可否の判定を入力し、その入力を受け付ける。ここで、ユーザが「解析結果が正しい」と判断した場合（「ＯＫ」ボタンが押下された場合）、テキスト解析チューニングの全プロセスが終了する。そして、テキスト入力部７０１は、次テキストが入力するまで待機状態に入る。これと同時に、後述するコスト調整ルール作成部７０９でコスト調整値を決定するために、この解析結果（正解形態素解析結果）を正解形態素解析結果データベース７１０に蓄積しておくことが望ましい。一方、ユーザが「解析結果が正しくない」と判断した場合（「ＮＧ」ボタンが押下された場合）、次のプロセス（正解形態素解析結果入力部７０６）に進む。 (Iv) Analysis Result Determination Unit The analysis result determination unit 705 has a function of receiving a determination as to whether or not the result of the presented morpheme analysis result is correct by the user. The analysis result determination unit 705 includes an interface such as a button (for example, “OK” and “NG” buttons). The analysis result determination unit 705 inputs a determination as to whether the analysis result is acceptable using the interface, and accepts the input. Here, when the user determines that the “analysis result is correct” (when the “OK” button is pressed), the entire process of text analysis tuning ends. Then, the text input unit 701 enters a standby state until the next text is input. At the same time, it is desirable to accumulate this analysis result (correct morpheme analysis result) in the correct morpheme analysis result database 710 in order to determine a cost adjustment value by the cost adjustment rule creation unit 709 described later. On the other hand, when the user determines that “the analysis result is incorrect” (when the “NG” button is pressed), the process proceeds to the next process (correct morphological analysis result input unit 706).

例えば、「東京大学院長」のテキストの解析結果「形態素１：表記=東京、品詞=地名、読み=トウキョウ; 形態素２：表記=大学、品詞=一般名詞、読み=タ゛イカ゛ク; 形態素３：表記=院長、品詞=一般名詞、読み=インチョウ」が提示された場合、ユーザが正しいと判断し、「ＯＫ」をクリックして、チューニングプロセスを終了し、解析結果を正解形態素解析コーパスに蓄積する。（ここで、「正解形態素解析結果」とは、ユーザが正しいと思われたものであり、必ず正しい結果であるとは限らない。例えば、「東京大学院長」の場合は、「形態素１：表記=東京、品詞=地名、読み=トウキョウ; 形態素２：表記=大学院、品詞=一般名詞、読み=タ゛イカ゛クイン; 形態素３：表記=長、品詞=接尾語、読み=チョウ」が正しいとする専門家もいるかもしれない）。 For example, the analysis result of the text of “Tokyo Graduate School Director” “morpheme 1: notation = Tokyo, part of speech = place name, reading = Tokyo; morpheme 2: notation = university, part of speech = general noun, reading = diak; morpheme 3: notation = director If the part of speech = general noun, reading = incho ”is presented, the user determines that it is correct, clicks“ OK ”, ends the tuning process, and stores the analysis result in the correct morphological analysis corpus. (Here, the “correct morphological analysis result” is what the user thinks is correct and is not necessarily the correct result. For example, in the case of “Tokyo Graduate School Director”, “morpheme 1: notation” = Tokyo, part of speech = place name, reading = Tokyo; morpheme 2: notation = graduate school, part of speech = general noun, reading = digaquin; morpheme 3: notation = long, part of speech = suffix, reading = butterfly Maybe)

一方、「東京大学院長」のテキストの解析結果「形態素１：表記=東、品詞=一般名詞、読み=ヒカ゛シ; 形態素２：表記=京大、品詞=固有名詞、読み=キョウタ゛イ; 形態素３：表記=学院、品詞=一般名詞、読み=カ゛クイン；形態素４：表記=長、品詞=接尾語、読み=チョウ」が提示された場合、ユーザが形態素解析ミスであると判断し、「ＮＧ」をクリックして、次のプロセスに進む。 On the other hand, the analysis result of the text of "Tokyo Graduate School Director" "morpheme 1: notation = east, part of speech = general noun, reading = hibashi; morpheme 2: notation = Kyoto University, part of speech = proper noun, reading = Kyoto; morpheme 3: notation = Gakuin, part of speech = general noun, reading = cuckin; morpheme 4: notation = long, part of speech = suffix, reading = butterfly ”is presented, the user determines that the morphological analysis error and clicks“ NG ” Then proceed to the next process.

（v）正解形態素解析結果入力部
正解形態素解析結果入力部７０６は、形態素解析部７０２で最適であると判断した形態素解析結果が「正しくない」とユーザが判断した場合のみに動作し、ユーザからの正解形態素解析結果の入力を求める。 (V) Correct morpheme analysis result input unit The correct morpheme analysis result input unit 706 operates only when the user determines that the morpheme analysis result determined to be optimal by the morpheme analysis unit 702 is “incorrect”. The input of the correct morphological analysis result is obtained.

この正解形態素結果入力部７０６は、インタフェースを有している。例えば、キーボードによって、ユーザが直接正解形態素解析結果を入力しても良い。ただし、ユーザの負担を考えると、入力された文書を構成できるすべての形態素の組み合わせについて、コスト小さい順（スコアが大きい順）からリストしたものをユーザに提示し、選択させるようにすることがより望ましい。この場合、「リストには正解が存在しない」という選択肢も用意する必要がある。ここで、正解形態素解析結果を構成するすべての形態素が、形態素辞書に存在するかどうか（もしくは、「リストには正解が存在しない」が選択されたかどうか）によって、ターゲット形態素自動抽出部７０７における処理が異なってくる。正解形態素解析結果を構成する形態素の１つ以上が形態素辞書７０４に存在しない（もしくは、「リストには正解が存在しない」が選択された）場合、ターゲット形態素自動抽出部７０７は、ユーザが入力した正解形態素を取得する。一方、正解形態素解析結果を構成するすべての形態素が形態素辞書に存在する（もしくは、「リストには正解が存在しない」以外の選択肢が選択された）場合は、ターゲット形態素自動抽出部７０７は、リストの中からユーザが選択した正解パスを取得する。 The correct morpheme result input unit 706 has an interface. For example, the user may directly input the correct morphological analysis result using a keyboard. However, considering the burden on the user, it is better to present the user with a list of all morpheme combinations that can compose the input document, starting from the lowest cost (in descending score) desirable. In this case, it is also necessary to prepare an option “There is no correct answer in the list”. Here, processing in the target morpheme automatic extraction unit 707 depends on whether or not all morphemes constituting the correct morpheme analysis result exist in the morpheme dictionary (or whether or not “No correct answer exists in the list” is selected). Will be different. When one or more of the morphemes constituting the correct morphological analysis result does not exist in the morpheme dictionary 704 (or “No correct answer exists in the list” is selected), the target morpheme automatic extraction unit 707 inputs the user Get the correct morpheme. On the other hand, when all the morphemes constituting the correct morpheme analysis result exist in the morpheme dictionary (or an option other than “No correct answer exists in the list” is selected), the target morpheme automatic extraction unit 707 The correct path selected by the user from among the above is acquired.

図９は、正解形態素解析結果の提示画面例を示す図である。図９Ａ及びＢで示す例では、テキストとして「彼は東京大学院長です。」が入力された場合、形態素解析部７０２は、全てのパス（ここで、テキストを構成可能な形態素組み合わせをパスと呼ぶ）を取得し、それぞれパスのコスト値（コスト値とは、事前に定義した品詞コストリストと品詞接続コストテーブルから、構成するすべての形態素の品詞コストとすべての隣接形態素の品詞接続コストを足し合わせた総和である）を算出する。そして、解析結果提示部７０３は、それぞれのパスをユーザに提示する。テキストのみで提示する場合は図９Ａのようになり、テキストと音声と同時に提示する場合は図９Ｂのようになる。ユーザは、用意されているユーザインターフェースを使って、提示された１〜Ｎのパスから正解パスを選択することになる。ただし、正解パスが存在しない場合、「リストに正解パスが存在しない」を選び、正解形態素解析結果入力部７０６に含まれるキーボード等を用いて、正解形態素解析結果を入力する。 FIG. 9 is a diagram illustrating an example of a screen for presenting a correct morphological analysis result. In the example shown in FIGS. 9A and B, when “He is a Tokyo Graduate School Director” is input as the text, the morpheme analysis unit 702 calls all paths (here, morpheme combinations that can constitute the text as paths). ), And the cost value of each path (the cost value is the sum of the part-of-speech cost of all morphemes and the part-of-speech connection cost of all neighboring morphemes from the pre-defined part-of-speech cost list and part-of-speech connection cost table) Is the total sum). Then, the analysis result presentation unit 703 presents each path to the user. When presenting only text, it is as shown in FIG. 9A, and when presenting simultaneously with text and voice, it is as shown in FIG. 9B. The user selects the correct answer path from the presented 1 to N paths using the prepared user interface. However, if the correct path does not exist, “No correct path exists in the list” is selected, and the correct morphological analysis result is input using the keyboard or the like included in the correct morphological analysis result input unit 706.

（vi）ターゲット形態素自動抽出部
図１０は、ターゲット形態素自動抽出部で実行される処理を説明するための図である。ターゲット形態素自動抽出部７０７は、形態素解析結果比較モジュール７０７１を有し、形態素解析部７０２によって得られた最適形態素解析結果（テキストを構成可能なすべての形態素組み合わせの中に、最もコストの低いもの（最もスコアが高いもの））と、正解形態素解析結果入力部７０６で得られた正解形態素解析結果（ユーザが正解だと思ったもの）とを比較する。そして、形態素解析結果比較モジュール７０７１は、両者の異なる部分（ターゲット形態素列）を抽出し、修正手法判定部７０８に出力する。異なる部分が複数に存在する場合は、ターゲット形態素列が複数個に抽出される。一方、異なる部分が存在しない場合（比較ステップでＹｅｓの場合）、全プロセスを終了する。 (Vi) Target Morphological Automatic Extraction Unit FIG. 10 is a diagram for explaining processing executed by the target morpheme automatic extraction unit. The target morpheme automatic extraction unit 707 has a morpheme analysis result comparison module 7071, and the optimal morpheme analysis result obtained by the morpheme analysis unit 702 (the lowest cost among all morpheme combinations that can constitute text ( The one with the highest score)) is compared with the correct morphological analysis result obtained by the correct morphological analysis result input unit 706 (what the user thinks is the correct answer). Then, the morpheme analysis result comparison module 7071 extracts both different parts (target morpheme string) and outputs them to the correction method determination unit 708. When there are a plurality of different portions, a plurality of target morpheme strings are extracted. On the other hand, if there are no different parts (Yes in the comparison step), the entire process is terminated.

例えば、図１１に示されるように、最適形態素解析結果が「彼/は/東/京大/学院/長/です」（これ以降、記述を簡潔するために、単に形態素と表した場合でも、表記、品詞、読みなどの情報を含まれている形態素を指す。つまり、単に「東京/大学」と記述した場合でも、「形態素１：表記=東京、品詞=地名、読み=トウキョウ; 形態素２：表記=大学、品詞=一般名詞、読み=タ゛イカ゛ク」同意義である）であり、正解形態素解析結果が「彼/は/東京/大学/院長/です」の場合、形態素解析結果比較モジュール７０７１は、異なる部分１１０１を抽出し、正解形態素解析結果（「東京/大学/院長」）を正解ターゲット形態素とする。一方、異なる部分の最適形態素解析結果（「東/京大/学院/長」）を最適ターゲット形態素とする。つまり、ターゲット形態素は、正解ターゲット形態素と最適ターゲット形態素で構成されている。 For example, as shown in FIG. 11, the optimal morpheme analysis result is “He / Higashi / Kyoto University / Gakuin / Director /” (hereinafter, in order to simplify the description, even if it is simply expressed as morpheme, It refers to a morpheme that contains information such as notation, part of speech, reading, etc. In other words, even if “Tokyo / University” is simply described, “morpheme 1: notation = Tokyo, part of speech = place name, reading = Tokyo; Notation = university, part of speech = general noun, reading = diagram ”, and the correct morphological analysis result is“ He / Ha / Tokyo / University / Director / ”is the morphological analysis result comparison module 7071 A different portion 1101 is extracted, and the correct morphological analysis result (“Tokyo / University / Director”) is set as the correct target morpheme. On the other hand, the optimum morpheme analysis results (“East / Kyoto University / Gakuin / Head”) of different parts are set as the optimum target morpheme. That is, the target morpheme is composed of a correct target morpheme and an optimal target morpheme.

（vii）修正手法判定部
図１２は、修正手法判定部で実行される処理を説明するための図である。修正手法判定部７０８は、形態素検索モジュール７０８１を有し、これを用いて、ターゲット形態素自動抽出部７０７から得たターゲット形態素を構成するすべての形態素について、形態素辞書７０４に存在するか否か調べる。１つでも形態素辞書に載っていない形態素（未登録形態素とする）が存在すれば（検索判定ステップでＮｏの場合）、未登録形態素を単語登録部（本明細書において「単語」と書いた場合、形態素と同意義である）７１２に出力し、形態素辞書７０４に登録する。一方、すべての形態素が形態素辞書に載っている場合（検索判定ステップでＹｅｓの場合）は、修正手法判定部７０８は、正解ターゲット形態素と最適ターゲット形態素を、コスト調整ルール作成部７０９に出力する。 (Vii) Correction Method Determination Unit FIG. 12 is a diagram for explaining processing executed by the correction method determination unit. The correction method determination unit 708 has a morpheme search module 7081, and checks whether or not all morphemes constituting the target morpheme obtained from the target morpheme automatic extraction unit 707 exist in the morpheme dictionary 704. If there is at least one morpheme (unregistered morpheme) that is not listed in the morpheme dictionary (No in the search determination step), the unregistered morpheme is written as a word registration unit (in this specification, “word”) , Which is synonymous with morpheme) and is registered in the morpheme dictionary 704. On the other hand, when all the morphemes are listed in the morpheme dictionary (Yes in the search determination step), the correction method determination unit 708 outputs the correct target morpheme and the optimal target morpheme to the cost adjustment rule creation unit 709.

例えば、正解形態素解析結果が「東京/大学/院長」の場合、形態素検索モジュール７０８１は、３つの形態素「東京」、「大学」、「院長」それぞれについて、形態素辞書７０４に存在するかどうか検索する。全てが形態素辞書７０４に存在すれば、処理は、コスト調整ルール作成部７０９に進む。例えば、「東京」、「大学」が辞書に載っているが、「院長」が辞書に載っていない場合は、形態素「院長」が単語登録部７１２に出力される。 For example, when the correct morpheme analysis result is “Tokyo / University / Director”, the morpheme search module 7081 searches whether each of the three morphemes “Tokyo”, “University”, and “Director” exists in the morpheme dictionary 704. . If everything exists in the morpheme dictionary 704, the process proceeds to the cost adjustment rule creation unit 709. For example, when “Tokyo” and “University” are listed in the dictionary but “Director” is not listed in the dictionary, the morpheme “Director” is output to the word registration unit 712.

（viii）単語登録部
図１３は、単語登録部で実行される処理を説明するための図である。単語登録部７１２は、形態素辞書７０４に登録されていない正解形態素を登録する機能を有する。単語登録部７１２は、形態素登録モジュール７１２１を有し、これを用いて未登録の単語を形態素辞書７０４に登録し、形態素辞書７０４を更新する。また、形態素登録モジュール７１２１は、形態素辞書７０４に登録した形態素をターゲット形態素自動抽出部７０７に出力する。 (Viii) Word Registration Unit FIG. 13 is a diagram for explaining processing executed by the word registration unit. The word registration unit 712 has a function of registering correct morphemes that are not registered in the morpheme dictionary 704. The word registration unit 712 has a morpheme registration module 7121, registers unregistered words in the morpheme dictionary 704 using this, and updates the morpheme dictionary 704. Also, the morpheme registration module 7121 outputs the morpheme registered in the morpheme dictionary 704 to the target morpheme automatic extraction unit 707.

例えば、形態素「院長」が入力された場合、単語登録部７１２は、「院長」を形態素辞書７０４に登録した後、ターゲット形態素自動抽出部７０７に出力する。 For example, when the morpheme “Director” is input, the word registering unit 712 registers the “Director” in the morpheme dictionary 704 and then outputs it to the target morpheme automatic extraction unit 707.

辞書を更新したことによって、ターゲット形態素自動抽出部７０７で行う最適形態素解析結果が、更新する前の辞書を用いた最適形態素解析結果と異なることがある。例えば、辞書更新する前は、最適形態素解析結果が「彼/は/東/京大/学院/長/です」だったのに対して、辞書更新後は、最適形態素解析結果が「彼/は/東京/大学/院長/です」となる場合が考えられる。この場合、ユーザの入力した正解形態素解析結果と同じであるため、全プロセスが終了する。一方、辞書更新した後でも、最適形態素解析結果が「彼/は/東/京大/学院/長/です」となった場合、さらに、修正手法判定部７０８に処理は進むが、未登録形態素が存在しない（既に、未登録形態素が登録した）ため、処理はさらにコスト調整ルール作成部７０９に進むことになる。 By updating the dictionary, the optimum morpheme analysis result performed by the target morpheme automatic extraction unit 707 may be different from the optimum morpheme analysis result using the dictionary before being updated. For example, before updating the dictionary, the optimal morphological analysis result was "He / Higashi / Kyoto University / Gakuin / Chief /", but after updating the dictionary, the optimal morphological analysis result was "Hi / ha / Tokyo / University / Director / ". In this case, since it is the same as the correct morphological analysis result input by the user, the entire process is completed. On the other hand, even after updating the dictionary, if the optimal morphological analysis result is “His / Higashi / Kyoto University / Gakuin / Chief /”, the process further proceeds to the correction method determination unit 708, but the unregistered morpheme Does not exist (the unregistered morpheme has already been registered), and the process further proceeds to the cost adjustment rule creation unit 709.

（ix）コスト調整ルール作成部
コスト調整ルール作成部７０９は、修正手法判定部７０８から入力されたターゲット形態素列について、コスト調整ルールを自動的に作成する機能を有する。当該コスト調整ルールは、ターゲット形態素列を構成する形態素の表記、品詞、読み、アクセント位置などの言語特徴量（表２）を用いて、記載されるルールである。 (Ix) Cost Adjustment Rule Creation Unit The cost adjustment rule creation unit 709 has a function of automatically creating a cost adjustment rule for the target morpheme sequence input from the correction method determination unit 708. The said cost adjustment rule is a rule described using language feature-values (Table 2), such as description of a morpheme which comprises a target morpheme string, a part of speech, a reading, and an accent position.

図１４は、コスト調整ルール作成部で実行される処理を説明するための図である。ここで、記載に用いる特徴量の違いによって、そのルールの汎用性と正確性が異なる。ここで、すべての言語特徴について、汎用性（の大きさ）を定義する。特に、音声合成の読み間違いを改善することを目的とした場合は、解体素の特徴として、アクセント型、アクセント位置など、合成音声に影響する特徴を用いることが望ましい。 FIG. 14 is a diagram for explaining processing executed by the cost adjustment rule creation unit. Here, the versatility and accuracy of the rule differ depending on the difference in the feature values used for the description. Here, versatility (size) is defined for all language features. In particular, when it is intended to improve reading errors in speech synthesis, it is desirable to use features that affect the synthesized speech, such as accent type and accent position, as the characteristics of the dismantling elements.

例えば、「東京/大学/院長」に対して、表記を用いて記述した場合、「ルール：形態素の表記は「東京」、「大学」、「院長」の３つの形態素が順番に並んでいる場合、コストを-30とする。」となる。この場合、「彼は東京大学院長です」に対して適用されるが、「彼は南京大学院長です」に対して、適用できない。（ここで、「南京」は中国の地名である。）つまり、このルールの正確性は高いが、汎用性は低い。 For example, when “Tokyo / University / Director” is described using notation, “Rule: Morphological notation is“ Tokyo ”,“ University ”, and“ Director ”are arranged in order. The cost is -30. " In this case, it applies to “He is a Tokyo Graduate School Director” but cannot apply to “He is a Nanjing Graduate School President”. (Here, "Nanjing" is a place name in China.) In other words, this rule has high accuracy but low versatility.

例えば、同じ「東京/大学/院長」に対して、品詞を用いて記述した場合、「ルール：形態素の品詞は「地名名詞」、「一般名詞」、「一般名詞」の三つの形態素が順番に並んでいる場合、コストを-30とする。」となる。この場合、「彼は東京大学院長です」に対して適用される。一方、「ここは東京王百貨店です」に対して、「東京：地名名詞」「王：一般名詞（王様の意味）」「百貨店：一般名詞」の形態素組み合わせを適用すると、「ここ/は/東京/王/百貨店/です」が最適パスと解析される。しかし、「ここ/は/東/京王/百貨店/です」が正解であるため、このルールの登録により、誤解析を招いたことになってしまう。このように、汎用性の高いルールでは、正確性が欠けている。一般的に、汎用性と正確性がトレートオフの関係となり、両立することが非常に難しい。 For example, if the same “Tokyo / University / Director” is described using parts of speech, then “rule: the part of speech of the morpheme is“ genoun noun ”,“ general noun ”, and“ general noun ”. If they are lined up, the cost is -30. " In this case, it applies to “He is the Tokyo Graduate School Director”. On the other hand, when applying the morphological combination of “Tokyo: place name noun”, “king: general noun (meaning of king)” and “department store: general noun” to “here is Tokyo Oo department store”, “here / ha / tokyo / Oh / Department Store / ”is analyzed as the optimal path. However, since “Here / Higashi / Keio / Department Store /” is the correct answer, the registration of this rule would have resulted in misanalysis. Thus, the highly versatile rule lacks accuracy. In general, versatility and accuracy are in a trait-off relationship, and it is very difficult to achieve both.

さらに、コストの値も正確性を大きく左右する。例えば、「ルール：形態素の品詞は「地名名詞」、「一般名詞」、「一般名詞」の３つの形態素が順番に並んでいる場合、コストを-30とする。」を登録した場合は、「ここは東京王百貨店です」に対して、「ここ/は/東/京王/百貨店/です」のコストは最も低く、正しく形態素解析できるが、「ルール：形態素の品詞は「地名名詞」、「一般名詞」、「一般名詞」の３つの形態素が順番に並んでいる場合、コストを-130とする。」を登録した場合は、「ここは東京王百貨店です」に対して、「ここ/は/東京/王/百貨店/です」のコストが最も低くなり、形態素解析ミスが生じる。 In addition, the cost value greatly affects the accuracy. For example, “rule: the part of speech of a morpheme is“ place name noun ”,“ general noun ”,“ general noun ”, and the cost is set to −30 when three morphemes are arranged in order. ”Is the lowest cost of“ here / is / East / Keio / department store / ”compared to“ here is Tokyo Oo Department Store ”, but it can be analyzed correctly, but“ rule: morpheme part of speech ” If the three morphemes “place name noun”, “general noun”, and “general noun” are arranged in order, the cost is −130. ”Is registered, the cost of“ here / is / Tokyo / king / department store / is ”is the lowest with respect to“ here is Tokyo King department store ”, and a morphological analysis error occurs.

これらの問題を解決するために、コスト調整ルール作成部７０９は、図１４に示されるように、ルール作成モジュール７０９１と、コスト最適化モジュール７０９２と、ルール登録モジュール７０９３と、を有している。ルール作成モジュール７０９１では、少なくとも、正解ターゲット形態素のコスト値に調整コスト値を加算（コスト値が複数の場合は、事実上減算となる）した後、最適ターゲット形態素のコスト値より、小さくなることがルール作成の条件となる。つまり、「彼は東京大学院長です」について、例えば、最適形態素解析結果の「彼/は/東/京大/学院/長/です」のコスト値が５００であり、正解形態素解析結果の「彼/は/東京/大学/院長/です」のコスト値が５２５であった場合、調整コスト値は少なくとも“−２６”より小さい値に設定しなくてはならない。 In order to solve these problems, the cost adjustment rule creation unit 709 includes a rule creation module 7091, a cost optimization module 7092, and a rule registration module 7093 as shown in FIG. In the rule creation module 7091, at least the adjustment cost value is added to the cost value of the correct target morpheme (if there are a plurality of cost values, it is effectively subtracted), and then it may become smaller than the cost value of the optimal target morpheme. This is a rule creation condition. In other words, for "He is a graduate student of Tokyo", for example, the cost value of "He / has / Higashi / Kyoto University / Gakuin / Head /" of the optimal morphological analysis result is 500, and the correct morphological analysis result of "He" If the cost value of “/ is / Tokyo / University / Director /” is 525, the adjustment cost value must be set to a value smaller than at least “−26”.

しかし、このようなコスト決定（コスト調整ルール作成）方法は、現在処理中の入力テキストに対して、正解形態素系列が、コスト最小（つまり、最適形態素解析結果）になることを保証できるが、他の文章に対する影響を評価できない。特に、より汎用性の高いルールを作成した場合、前述「ここは東京王百貨店です」の例のように、作成したコスト調整ルールを用いたのでは形態解析システムの解析精度を悪くしてしまう可能性がある。 However, such a cost determination (cost adjustment rule creation) method can guarantee that the correct morpheme sequence has the minimum cost (that is, the optimal morpheme analysis result) for the input text currently being processed. Cannot evaluate the effect on the sentence. In particular, if a more versatile rule is created, using the created cost adjustment rule as in the example of “Here is Tokyo Oo Department Store” may degrade the analysis accuracy of the morphological analysis system. There is sex.

そこで、汎用性と正確性とのバランスを最も良くするために、コスト最適化モジュール７０９２は、事前に用意した正解形態素解析結果データベース７１０を用いて、ルールの適用によって従来正解としていた形態素解析結果への影響（正しい形態素と異なる結果になること）の有無を確認する（影響判定ステップ）。影響が出てしまう場合（影響判定ステップでＹｅｓの場合）、ルール作成モジュール７０９１は、汎用性を少し下げたルールを作成し直して、再度コスト最適化モジュール７０９２の処理が実行されることになる。一方、影響がない場合（影響判定ステップでＮｏの場合）、ルール登録モジュール７０９３は、当該コスト調整ルールをコスト寮生ルール群７１１に登録する。このように、コスト最適化モジュール７０９２は、全ての正解形態素解析結果に影響しない（もしくは、最も影響が小さい）ようにコスト値の調整を行う。なお、ルール作成モジュール７０９１は、より汎用性の高いルールから作成していくことが望ましい。 Therefore, in order to achieve the best balance between versatility and accuracy, the cost optimization module 7092 uses the correct morphological analysis result database 710 prepared in advance to convert the morphological analysis results that have conventionally been made correct by applying the rules. The presence or absence of an effect (being a result different from the correct morpheme) is confirmed (impact determination step). In the case where there is an influence (in the case where the influence determination step is Yes), the rule creation module 7091 recreates a rule with slightly reduced versatility, and the process of the cost optimization module 7092 is executed again. . On the other hand, when there is no influence (No in the influence determination step), the rule registration module 7093 registers the cost adjustment rule in the cost dormitory rule group 711. In this way, the cost optimization module 7092 adjusts the cost value so as not to affect all the correct morphological analysis results (or to have the least effect). Note that the rule creation module 7091 is preferably created from rules with higher versatility.

例えば、「彼は東京大学院長です」のコスト調整をするために、「ルール：形態素の品詞は「地名名詞」、「一般名詞」、「一般名詞」の３つの形態素が順番に並んでいる場合、コストを“−Ｎ”とする。」のようなルールを作成し、−Ｎの値は、正解形態素解析結果データベースを用いて最適化する。最適化する手法については、ＥＭアルゴリズムなど、様々な高速に最適解を求める数学アルゴリズムが提案されているが、最もシンプルなものとしては、−Ｎの値を０から十分に小さい値まで変動させ、最も正解データベースに影響しない（もしくは、影響が小さい）ものの範囲を特定し、その範囲から、最も大きい値を設定する手法が考えられる。ここで、「最も影響が小さいルールを採用する」とは、汎用性が高いルールから低いルールを正解形態素解析結果データベース７１０に登録されている複数の正解形態素解析結果に順次適用していき、不正解となる形態素の数（正しい形態素と異なる結果となる数）が最小となるルールを採用することを意味するものである。例えば、−Ｎの値を０〜１０００まで、１刻みで変化させた結果、−３０〜−１０００の範囲では、正解形態素解析結果データベースに影響しないことが確認できた場合、−Ｎを−３０と設定する。一方、−Ｎの値を０〜１０００まで、１刻みで変化させた結果、すべての範囲において、正解形態素解析結果に影響を及ぶと確認した場合、さらに、例えば、汎用性の小さいルールである「ルール：形態素の品詞は「地名名詞」、「一般名詞」、「一般名詞」の３つの形態素が順番に並んでいる場合、コストを−３０とする。」を作成し、前述と同じように、−Ｎの値を最適化する。このように、汎用性が高いルールから低いルールを順々に適用していき、正解形態素解析結果データベース７１０に影響しないことを確認できたときのルールをコスト調整ルールとして採用する。ただし、最も汎用性の低い（すべての特徴を用いた場合）ルールにおいても、データベースに影響してしまう場合は、最も影響の小さいルールを採用するか、ルール作成をしないかのどちらかを選択できる。これは、その都度ユーザに判断してもらってもいいし、事前に定義しておいても良い。 For example, in order to adjust the cost of “He is a graduate student at Tokyo,” “rule: morpheme part of speech is“ place name noun ”,“ general noun ”, and“ general noun ”. The cost is assumed to be “−N”. And the value of −N is optimized using the correct morphological analysis result database. As for the optimization method, various mathematical algorithms such as the EM algorithm for obtaining an optimum solution have been proposed. As the simplest method, the value of −N is varied from 0 to a sufficiently small value, A method may be considered in which the range of the one that does not affect the correct answer database (or has the least influence) is specified, and the largest value is set from the range. Here, “adopting the rule with the least influence” means that the rule having the highest versatility is applied to the plurality of correct morphological analysis results registered in the correct morphological analysis result database 710 sequentially. This means that a rule that minimizes the number of correct morphemes (a number that results in a difference from the correct morpheme) is adopted. For example, as a result of changing the value of −N from 0 to 1000 in increments of 1 in the range of −30 to −1000, it can be confirmed that the correct morphological analysis result database is not affected. Set. On the other hand, when the value of -N is changed from 0 to 1000 in increments of 1 and it is confirmed that the correct morphological analysis result is affected in all ranges, for example, a rule with small versatility is “ Rule: The part of speech of a morpheme has a cost of −30 when three morphemes of “place name noun”, “general noun”, and “general noun” are arranged in order. ”And optimize the value of −N in the same manner as described above. In this way, rules that have high versatility are applied in order, and rules that are confirmed to have no effect on the correct morphological analysis result database 710 are adopted as cost adjustment rules. However, even if the rule is the least versatile (when all features are used), if it affects the database, you can choose between adopting the rule with the least impact or not creating a rule. . This may be determined by the user each time or may be defined in advance.

ここで、正解形態素解析結果データベース７１０がなくても、入力文章（入力テキスト）において形態素解析ミスの改善ができる。しかし、正解形態素解析結果データベース７１０が大きければ大きいほど、より品質の良い（最も形態素解析システムの解析精度に向上する）ルール作成が可能であると考えられる。また、正解形態素解析結果データベース７１０には、元のテキストと正解とされた形態素解析結果（パスＮ）がセットで登録されるようになっている。 Here, even if there is no correct morphological analysis result database 710, morphological analysis mistakes can be improved in the input sentence (input text). However, it is considered that the larger the correct morphological analysis result database 710 is, the higher quality the rule can be created (which improves the analysis accuracy of the morphological analysis system most). In the correct morphological analysis result database 710, the original text and the correct morphological analysis result (path N) are registered as a set.

＜まとめ＞
（i）本発明の実施形態によれば、形態素解析チューニング装置は、入力テキストを形態素解析し、その形態素解析結果のうちコストが最小の値を取る形態素解析結果である最適形態素解析結果と、当該入力テキストに関してユーザが正解データとして入力した正解形態素解析結果と、を比較する。そして、当該装置は、比較処理の結果、正解形態素解析結果と最適形態素解析結果とが異なる部分を有する場合、正解形態素解析結果が最適形態素解析結果よりもコストが小さくなるように、コスト調整ルールを生成する。例えば、最適形態素解析結果のコストが２０で、正解形態素解析結果のコストが３０である場合、コスト調整ルールを「正解形態素解析結果の形態素の品詞並び（「地名名詞」＋「一般名詞」＋「一般名詞」（例：「東京」「大学」「院長」））の場合、コストを-11する」とすることができる。このようにすることにより、正解形態素解析結果のコストを１９とすることができる（この場合、最適形態素解析結果のコストに変動がないことが条件となる）ので、正解形態素解析結果の最適性（コストが最小となること（スコアが最大となること））を担保することができる。 <Summary>
(I) According to the embodiment of the present invention, the morphological analysis tuning apparatus performs morphological analysis on the input text, and among the morphological analysis results, the optimal morphological analysis result that is the morphological analysis result that takes the minimum value, and The correct morphological analysis result input as correct data by the user with respect to the input text is compared. Then, if the correct morpheme analysis result and the optimal morpheme analysis result have different parts as a result of the comparison process, the apparatus sets the cost adjustment rule so that the correct morpheme analysis result is less expensive than the optimal morpheme analysis result. Generate. For example, when the cost of the optimal morpheme analysis result is 20 and the cost of the correct morpheme analysis result is 30, the cost adjustment rule is “the morphological part of morpheme of the correct morpheme analysis result (“ place name noun ”+“ general noun ”+“ In the case of “general nouns” (eg, “Tokyo”, “university”, “director”), the cost can be “-11”. By doing so, the cost of the correct morphological analysis result can be set to 19 (in this case, the cost of the optimal morphological analysis result is not changed), so the optimality of the correct morphological analysis result ( Costs can be minimized (scores can be maximized).

上記比較の結果、正解形態素解析結果と最適形態素解析結果に異なる部分があれば、この異なる部分の形態素を形態素辞書に登録する。また、形態素解析チューニング装置は、この更新された形態素辞書を用いて、同じ入力テキストに対して形態素解析を実行し、更新された最適形態素解析結果を取得する。そして、当該装置は、この更新された最適形態素解析結果と正解形態素解析結果を再度比較し、異なる部分がある場合にコスト調整ルールを生成する処理を実行する。このようにすることにより、単に形態素辞書になかったために正解形態素解析結果が最適形態素解析結果と異なってしまった状態を即座に是正するとともに、不必要にコスト調整ルールを生成しなくても良い。また、形態素辞書を最適な状態に保つことができるようになる。 If there is a difference between the correct morpheme analysis result and the optimum morpheme analysis result as a result of the comparison, the morpheme of the different part is registered in the morpheme dictionary. Further, the morpheme analysis tuning apparatus uses this updated morpheme dictionary to perform morpheme analysis on the same input text and obtain an updated optimum morpheme analysis result. Then, the apparatus again compares the updated optimal morpheme analysis result and the correct answer morpheme analysis result, and executes a process of generating a cost adjustment rule when there is a different part. By doing so, it is possible to immediately correct a state where the correct morpheme analysis result is different from the optimal morpheme analysis result because it is simply not in the morpheme dictionary, and it is not necessary to generate a cost adjustment rule unnecessarily. In addition, the morpheme dictionary can be maintained in an optimum state.

より具体的には、形態素解析チューニング装置は、コスト調整ルールを適用した場合に、正解形態素解析結果データベースに登録されている正解形態素解析結果の正解が維持されるようにコスト調整ルールを生成する、或いは、正解形態素解析結果データベースに登録されている正解形態素解析結果のコストが最小となるコスト調整ルールを採用する。さらに詳細には、当該装置は、正解形態素解析結果データベースに登録されている正解形態素解析結果に、最も汎用性の高いコスト調整ルールから汎用性の低いコスト調整ルールを順次適用して正解が維持されるコスト調整ルールを採用する、或いは、正解形態素解析結果のコストが最小となるコスト調整ルールを採用する。このようにすることにより、既にデータベースに登録されている正解形態素解析結果に影響を与えずに（正解であったものを不正解にすることなくする）、或いは与える影響を最小限にとどめることができるようになる。 More specifically, the morphological analysis tuning device generates the cost adjustment rule so that the correct answer of the correct morphological analysis result registered in the correct morphological analysis result database is maintained when the cost adjustment rule is applied. Alternatively, a cost adjustment rule that minimizes the cost of the correct morphological analysis result registered in the correct morphological analysis result database is adopted. More specifically, the device applies the cost adjustment rule with the highest versatility to the correct morphological analysis result registered in the correct morpheme analysis result database in order to maintain the correct answer. The cost adjustment rule that minimizes the cost of the correct morphological analysis result is adopted. By doing in this way, the correct morphological analysis result already registered in the database is not affected (the correct answer is not changed to an incorrect answer) or the influence is minimized. become able to.

（ii）本発明は、実施形態の機能を実現するソフトウェアのプログラムコードによっても実現できる。この場合、プログラムコードを記録した記憶媒体をシステム或は装置に提供し、そのシステム或は装置のコンピュータ（又はＣＰＵやＭＰＵ）が記憶媒体に格納されたプログラムコードを読み出す。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施形態の機能を実現することになり、そのプログラムコード自体、及びそれを記憶した記憶媒体は本発明を構成することになる。このようなプログラムコードを供給するための記憶媒体としては、例えば、フレキシブルディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、ハードディスク、光ディスク、光磁気ディスク、ＣＤ−Ｒ、磁気テープ、不揮発性のメモリカード、ＲＯＭなどが用いられる。 (Ii) The present invention can also be realized by software program codes that implement the functions of the embodiments. In this case, a storage medium in which the program code is recorded is provided to the system or apparatus, and the computer (or CPU or MPU) of the system or apparatus reads the program code stored in the storage medium. In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiments, and the program code itself and the storage medium storing the program code constitute the present invention. As a storage medium for supplying such program code, for example, a flexible disk, CD-ROM, DVD-ROM, hard disk, optical disk, magneto-optical disk, CD-R, magnetic tape, nonvolatile memory card, ROM Etc. are used.

また、プログラムコードの指示に基づき、コンピュータ上で稼動しているＯＳ（オペレーティングシステム）などが実際の処理の一部又は全部を行い、その処理によって前述した実施の形態の機能が実現されるようにしてもよい。さらに、記憶媒体から読み出されたプログラムコードが、コンピュータ上のメモリに書きこまれた後、そのプログラムコードの指示に基づき、コンピュータのＣＰＵなどが実際の処理の一部又は全部を行い、その処理によって前述した実施の形態の機能が実現されるようにしてもよい。 Also, based on the instruction of the program code, an OS (operating system) running on the computer performs part or all of the actual processing, and the functions of the above-described embodiments are realized by the processing. May be. Further, after the program code read from the storage medium is written in the memory on the computer, the computer CPU or the like performs part or all of the actual processing based on the instruction of the program code. Thus, the functions of the above-described embodiments may be realized.

さらに、実施の形態の機能を実現するソフトウェアのプログラムコードを、ネットワークを介して配信することにより、それをシステム又は装置のハードディスクやメモリ等の記憶手段又はＣＤ−ＲＷ、ＣＤ−Ｒ等の記憶媒体に格納し、使用時にそのシステム又は装置のコンピュータ（又はＣＰＵやＭＰＵ）が当該記憶手段や当該記憶媒体に格納されたプログラムコードを読み出して実行するようにしても良い。 Further, by distributing the program code of the software that realizes the functions of the embodiment via a network, it is stored in a storage means such as a hard disk or memory of a system or apparatus, or a storage medium such as a CD-RW or CD-R And the computer (or CPU or MPU) of the system or apparatus may read and execute the program code stored in the storage means or the storage medium when used.

最後に、ここで述べたプロセス及び技術は本質的に如何なる特定の装置に関連することはなく、コンポーネントの如何なる相応しい組み合わせによってでも実装できることを理解する必要がある。更に、汎用目的の多様なタイプのデバイスがここで記述した教授に従って使用可能である。ここで述べた方法のステップを実行するのに、専用の装置を構築するのが有益であることが判るかもしれない。また、実施形態に開示されている複数の構成要素の適宜な組み合わせにより、種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態にわたる構成要素を適宜組み合わせてもよい。本発明は、具体例に関連して記述したが、これらは、すべての観点に於いて限定の為ではなく説明の為である。本分野にスキルのある者には、本発明を実施するのに相応しいハードウェア、ソフトウェア、及びファームウエアの多数の組み合わせがあることが解るであろう。例えば、記述したソフトウェアは、アセンブラ、Ｃ／Ｃ＋＋、ｐｅｒｌ、Ｓｈｅｌｌ、ＰＨＰ、Ｊａｖａ（登録商標）等の広範囲のプログラム又はスクリプト言語で実装できる。 Finally, it should be understood that the processes and techniques described herein are not inherently related to any particular apparatus, and can be implemented by any suitable combination of components. In addition, various types of devices for general purpose can be used in accordance with the teachings described herein. It may prove useful to build a dedicated device to perform the method steps described herein. Various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiments. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, constituent elements over different embodiments may be appropriately combined. Although the present invention has been described with reference to specific examples, these are in all respects illustrative rather than restrictive. Those skilled in the art will appreciate that there are numerous combinations of hardware, software, and firmware that are suitable for implementing the present invention. For example, the described software can be implemented in a wide range of programs or script languages such as assembler, C / C ++, perl, shell, PHP, Java (registered trademark).

さらに、上述の実施形態において、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。全ての構成が相互に接続されていても良い。 Furthermore, in the above-described embodiment, control lines and information lines are those that are considered necessary for explanation, and not all control lines and information lines on the product are necessarily shown. All the components may be connected to each other.

加えて、本技術分野の通常の知識を有する者には、本発明のその他の実装がここに開示された本発明の明細書及び実施形態の考察から明らかになる。記述された実施形態の多様な態様及び／又はコンポーネントは、単独又は如何なる組み合わせでも使用することが出来る。明細書と具体例は典型的なものに過ぎず、本発明の範囲と精神は後続する請求範囲で示される。 In addition, other implementations of the invention will be apparent to those skilled in the art from consideration of the specification and embodiments of the invention disclosed herein. Various aspects and / or components of the described embodiments can be used singly or in any combination. The specification and specific examples are merely exemplary, and the scope and spirit of the invention are indicated in the following claims.

１００・・・音声合成システム
１０１・・・テキスト解析装置
１０２・・・形態素解析チューニング装置
１０３・・・音声合成装置
７０１・・・テキスト入力部
７０２・・・形態素解析部
７０３・・・解析結果提示部
７０４・・・形態素辞書
７０５・・・解析結果判定部
７０６・・・正解形態素解析結果入力部
７０７・・・ターゲット形態素自動抽出部
７０８・・・修正手法判定部
７０９・・・コスト調整ルール作成部
７１０・・・正解形態素解析結果データベース
７１１・・・コスト調整ルール群
７１２・・・単語登録部 DESCRIPTION OF SYMBOLS 100 ... Speech synthesis system 101 ... Text analysis device 102 ... Morphological analysis tuning device 103 ... Speech synthesis device 701 ... Text input unit 702 ... Morphological analysis unit 703 ... Presentation of analysis result Unit 704 ... morpheme dictionary 705 ... analysis result determination unit 706 ... correct morpheme analysis result input unit 707 ... target morpheme automatic extraction unit 708 ... correction method determination unit 709 ... cost adjustment rule creation Unit 710 ... correct morphological analysis result database 711 ... cost adjustment rule group 712 ... word registration unit

Claims

A morphological analysis tuning device for generating a cost adjustment rule used for morphological analysis based on a morphological analysis result of a text sentence,
A correct morphological analysis result database for storing correct morphological analysis results that are input by the user as determined to be correct;
A processor that creates the cost adjustment rule while optimizing using the correct morphological analysis result and the morpheme read from the morpheme dictionary,
The processor is
A comparison process for comparing the optimal morpheme analysis result, which is the morpheme analysis result having the lowest cost among the morpheme analysis results of the text sentence, and the correct morpheme analysis result input by the user as correct data regarding the text sentence When,
As a result of the comparison process, when the correct morpheme analysis result and the optimal morpheme analysis result have different portions, the cost adjustment rule is set so that the correct morpheme analysis result has a lower cost than the optimal morpheme analysis result. Adjustment process to generate,
A morphological analysis tuning apparatus characterized by executing

In claim 1,
The processor registers the correct morpheme analysis result corresponding to the different part in the morpheme dictionary and updates the morpheme dictionary, and then obtains the updated optimal morpheme analysis result obtained using the updated morpheme dictionary. The correct morpheme analysis result and the updated optimal morpheme analysis result are compared again, and the adjustment process is performed when the correct morpheme analysis result and the updated optimal morpheme analysis result have different parts. A morphological analysis tuning device characterized by

In claim 1,
In the adjustment process, the processor generates the cost adjustment rule so that the correct answer of the correct morphological analysis result registered in the correct morphological analysis result database is maintained when the cost adjustment rule is applied. A morphological analysis tuning device characterized by that.

In claim 3,
In the adjustment process, the processor sequentially applies the most versatile cost adjustment rule to the low versatility cost adjustment rule to the correct morphological analysis result registered in the correct morphological analysis result database to maintain the correct answer. A morphological analysis tuning device characterized by adopting a cost adjustment rule.

In claim 1,
In the adjustment process, the processor employs a cost adjustment rule that minimizes the cost of the correct morphological analysis result registered in the correct morphological analysis result database when the cost adjustment rule is applied. A morphological analysis tuning device.

In claim 5,
In the adjustment process, the processor sequentially applies the most versatile cost adjustment rule to the correct morpheme analysis result registered in the correct morphological analysis result database, from the most versatile cost adjustment rule to the correct morpheme analysis result. A morphological analysis tuning apparatus that employs a cost adjustment rule that minimizes the cost of an analysis result.

In claim 1,
The processor presents a plurality of morphological analysis results of the text sentence to a user in ascending order of cost, and allows the user to select the correct morphological analysis result from the plurality of morphological analysis results. A morphological analysis tuning device.

A speech synthesis system that generates and outputs synthesized speech of an input text sentence,
A text analysis device that refers to the morpheme dictionary, executes morpheme analysis, gives a reading of the text sentence based on the result of the morpheme analysis, and determines an accent;
A morphological analysis tuning device according to claim 1;
A voice synthesizer that generates a synthesized voice of the text sentence using the reading of the text sentence from the text analysis device and the accent;
The speech analysis system, wherein the text analysis device executes the morpheme analysis using the cost adjustment rule generated by the morpheme analysis tuning device.

A morphological analysis tuning method for generating a cost adjustment rule used for morphological analysis based on a morphological analysis result of a text sentence,
A processor accessing a correct morphological analysis result database that stores a correct morphological analysis result inputted by determining that the user is correct, and obtaining the correct morphological analysis result;
Using the correct morpheme analysis result and the morpheme read from the morpheme dictionary to create the processor while optimizing the cost adjustment rule,
Creating the cost adjustment rule while optimizing,
The processor includes an optimal morpheme analysis result that is a morpheme analysis result that takes a minimum value among morpheme analysis results of the text sentence, and a correct morpheme analysis result that the user has input as correct data regarding the text sentence. Executing a comparison process to be compared;
When the processor has a portion where the correct morpheme analysis result and the optimal morpheme analysis result are different as a result of the comparison process, the correct morpheme analysis result has a lower cost than the optimal morpheme analysis result. And a step of executing an adjustment process for generating a cost adjustment rule.

In claim 9,
The processor registers the correct morpheme analysis result corresponding to the different part in the morpheme dictionary and updates the morpheme dictionary, and then obtains the updated optimal morpheme analysis result obtained using the updated morpheme dictionary. The correct morpheme analysis result and the updated optimal morpheme analysis result are compared again, and the adjustment process is performed when the correct morpheme analysis result and the updated optimal morpheme analysis result have different parts. A morphological analysis tuning method characterized by

In claim 9,
In the adjustment process, the processor generates the cost adjustment rule so that the correct answer of the correct morphological analysis result registered in the correct morphological analysis result database is maintained when the cost adjustment rule is applied. A morphological analysis tuning method characterized by the above.

In claim 11,
In the adjustment process, the processor sequentially applies the most versatile cost adjustment rule to the low versatility cost adjustment rule to the correct morphological analysis result registered in the correct morphological analysis result database to maintain the correct answer. A morphological analysis tuning method characterized by adopting a cost adjustment rule.

In claim 9,
In the adjustment process, the processor employs a cost adjustment rule that minimizes the cost of the correct morphological analysis result registered in the correct morphological analysis result database when the cost adjustment rule is applied. To tune morphological analysis.

In claim 13,
In the adjustment process, the processor sequentially applies the most versatile cost adjustment rule to the correct morpheme analysis result registered in the correct morphological analysis result database, from the most versatile cost adjustment rule to the correct morpheme analysis result. A morphological analysis tuning method that employs a cost adjustment rule that minimizes the cost of an analysis result.

In claim 9,
The processor presents a plurality of morphological analysis results of the text sentence to a user in ascending order of cost, and allows the user to select the correct morphological analysis result from the plurality of morphological analysis results. To tune morphological analysis.