JPH10187185A

JPH10187185A - Device and method for language processing

Info

Publication number: JPH10187185A
Application number: JP8344377A
Authority: JP
Inventors: Mitsuhisa Kamei; 光久亀井
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1996-12-25
Filing date: 1996-12-25
Publication date: 1998-07-14

Abstract

PROBLEM TO BE SOLVED: To eliminate the need for unnecessary computations, to precisely predict the language score of unretrieved portions and to greatly reduce the retrieval space during a language evaluation. SOLUTION: An LR table storage section 20 records the information, which makes the correspondence between an inputted character string and the result of a syntax analysis, the score of the language evaluation corresponding to applicable rewriting rules and the predicted value of the evaluation score generated after the state for each state. A syntax analysis section 12 refers to the LR table and conducts a syntax analysis. A language score computing section 14 refers to the LR table and computes the score generated by the syntax analysis. A candidate recording section 30 records the information of the candidate being processed, the evaluation score outputted by the section 14 and the predicted score read from the section 20. A syntax analysis control section 13 refers to the data of the section 30 and determines the candidates against whom analyses are conducted.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文節や文など一つ
以上の単語から構成される言語単位（以下では文）を、
確からしさのスコアを利用しながら高速に構文解析およ
び言語的な評価をする技術に関する。The present invention relates to a language unit (hereinafter, sentence) composed of one or more words, such as a phrase or a sentence,
The present invention relates to a technique for performing high-speed parsing and linguistic evaluation using a certainty score.

【０００２】[0002]

【従来の技術】音声認識や文字認識、自動翻訳、意味解
析などにおいては、言語解析を行ない、構文解析や言語
的な確からしさなどの評価を必要とする。構文解析の結
果は、その言語の文として正しいかどうかを判定する。
また必要に応じて、文として正しいと判定したならば、
その言語構造を解析する。言語的な確からしさは、その
言語に属する文としてどの程度確からしいかという評価
であり、一般には数値スコアで評価することが多い。音
声認識や文字認識など複数の解析を同時に進める場合
や、自然言語における曖昧性を多分に含んだ言語を処理
する場合には、解析の結果として複数の候補が生じる。
特に、音声認識や文字認識などのように多数の認識結果
が生じるような場合には、すべての認識結果について言
語的評価をするのは現実的ではない。このような場合で
は、途中のスコアに応じて結果として見込みの少ないも
のを消去したり、スコアの良いものを優先して解析を進
めるなどの方法を取ることで、高速に処理を進めるよう
に目指している。2. Description of the Related Art Speech recognition, character recognition, automatic translation, semantic analysis, and the like require linguistic analysis to evaluate syntax analysis and linguistic certainty. It is determined whether the result of the parsing is correct as a sentence of the language.
If necessary, if it is determined that the sentence is correct,
Analyze the language structure. Linguistic certainty is an evaluation of the likelihood of a sentence belonging to the language, and is generally evaluated by a numerical score. When a plurality of analyzes, such as voice recognition and character recognition, are simultaneously advanced, or when a language that includes ambiguity in a natural language is processed, a plurality of candidates are generated as a result of the analysis.
In particular, when a large number of recognition results occur such as voice recognition and character recognition, it is not realistic to perform linguistic evaluation on all the recognition results. In such a case, we aim to speed up the processing by taking measures such as erasing those that are unlikely as a result according to the score in the middle or proceeding with the analysis by giving priority to the one with a good score ing.

【０００３】とくに、音声認識の分野においては、認識
の過程で多数の解析すべき言語候補が発生するので、高
速に言語処理をする方法が求められている。In particular, in the field of speech recognition, a large number of language candidates to be analyzed are generated in the course of recognition, so that a method for high-speed language processing is required.

【０００４】音響学会平成８年春季研究発表会講演論文
「音節グラフの文脈自由構文解析におけるＡ＊探索の検
討」において、阿部芳春・中島邦男は、音響処理の結果
出力された音節の有向グラフを文脈自由構文解析をする
場合に、Ａ＊アルゴリズムを利用する方法を開示してい
る。Ａ＊アルゴリズムとは、最良優先探索を実現するア
ルゴリズムである。最も評価の良い候補から順番に探索
を進める際に、探索が終了した部分のスコアに未探索部
分のスコアの推定スコアを加えたスコアによって最も評
価の良いスコアを決定する。この時に推定されたスコア
が、最終的に決定するスコアよりも常に良い方向に推定
されていれば、認容可能条件を満たしているといい、最
も良いスコアの解が見つかることが保証される。Ａ＊ア
ルゴリズムにおいて、予測スコアの算出に多大な時間を
必要とせずに、最終的なスコアを精度良く予測できる方
法があれば、効率的な探索が実現できる。[0004] In a lecture paper "A study on A * search in context-free parsing of syllable graphs" in the spring meeting of the Acoustical Society of Japan in 1996, Yoshiharu Abe and Kunio Nakajima described a directed graph of syllables output as a result of acoustic processing as context. It discloses a method of using the A * algorithm when performing free syntax analysis. The A * algorithm is an algorithm that implements the best priority search. When the search proceeds in order from the candidate with the highest evaluation, the score with the highest evaluation is determined by adding the estimated score of the score of the unsearched part to the score of the part where the search has been completed. If the score estimated at this time is always estimated in a better direction than the finally determined score, it is said that the acceptable condition is satisfied, and it is guaranteed that a solution with the best score is found. In the A * algorithm, if there is a method capable of accurately predicting the final score without requiring much time for calculating the prediction score, efficient search can be realized.

【０００５】別な技術として、音声認識において音響的
な照合による音響スコアと、言語的な評価による言語ス
コアを用いて、候補の評価を行うための評価スコアを算
出する場合に、評価を行っていない部分の一部の言語ス
コアを利用する技術が、特開平７−２９５５８７号公報
において開示されている。単語単位の基準信号を木構造
で構築しておき、その木構造の各ノードに、そのノード
よりさらに照合を進めた結果として確定する可能性のあ
る単語が受理された時に発生する言語スコアを計算して
おき、あらかじめそのノードに付加して記録しておく。
検査信号と基準信号の照合を行なう時に、ノードに記録
されたそのスコアを利用すると、候補の比較をする評価
スコアが正確なスコアに近くなるので、枝刈りの基準を
厳しくできる。しかも、ノードに記録する言語スコアの
計算はあらかじめ行うので、認識処理時にはそのスコア
を参照するだけであり、認識処理時の計算の負荷をそれ
ほど増加させることがない。このように、探索が済んで
いない部分のスコアの推定値である予測スコアをあらか
じめ計算をしておくことで、枝刈りの基準をより厳しく
して、有効な候補の数を削減できる。[0005] As another technique, evaluation is performed when an evaluation score for evaluating a candidate is calculated using an acoustic score obtained by acoustic verification and a linguistic score obtained by linguistic evaluation in speech recognition. Japanese Patent Application Laid-Open No. Hei 7-295587 discloses a technique that uses a part of the language score of the missing part. A word-based reference signal is constructed in a tree structure, and the language score generated when each node of the tree structure receives a word that may be determined as a result of further matching than that node is calculated. Then, it is added to the node and recorded in advance.
When the check signal is compared with the reference signal, if the score recorded in the node is used, the evaluation score for comparing the candidates becomes close to an accurate score, so that the pruning standard can be strict. Moreover, since the calculation of the language score to be recorded in the node is performed in advance, only the score is referred to at the time of the recognition processing, and the calculation load at the time of the recognition processing does not increase so much. As described above, by calculating in advance the prediction score, which is an estimated value of the score of the portion that has not been searched, the pruning criterion can be made more strict and the number of valid candidates can be reduced.

【０００６】[0006]

【発明が解決しようとする課題】以上のように、言語処
理を進める時にその膨大な探索空間を削減する方法とし
て、ビーム探索や最良優先探索がよく利用されている。
ビーム探索や最良優先探索において、その枝刈り基準や
最良候補の決定をするためのスコアの精度は重要であ
る。音声認識の中でも最初に音響的な照合を行い、その
結果に対して次に言語的な評価を行なう場合には、言語
的な評価の中で、それまでに評価の済んだ部分の評価ス
コアとともに、音響処理の結果を予測スコアとして利用
してスコアの精度をより高める技術は存在する。しか
し、未探索部分の言語スコアの推定値（予測言語スコ
ア）は予め用意されていないので、そのままでは、予測
言語スコアを利用することはできない。したがって評価
スコアの精度は十分でなく、探索空間を大きく削減でき
ない。予測言語スコアを解析中に算出すれば、予測言語
スコアを利用できるけれども、その処理に余分な時間を
必要としてしまい、探索空間を削減する効果を減らして
しまう。As described above, beam search and best-priority search are often used as methods for reducing the huge search space when proceeding with language processing.
In the beam search and the best priority search, the accuracy of the pruning criterion and the score for determining the best candidate are important. When performing acoustic matching first in speech recognition and then performing linguistic evaluation on the results, the linguistic evaluation is performed together with the evaluation score of the previously evaluated part. There is a technique for further improving the accuracy of the score by using the result of the acoustic processing as a prediction score. However, since the estimated value (predicted language score) of the language score of the unsearched portion is not prepared in advance, the predicted language score cannot be used as it is. Therefore, the accuracy of the evaluation score is not sufficient, and the search space cannot be significantly reduced. If the predicted language score is calculated during the analysis, the predicted language score can be used, but extra time is required for the processing, and the effect of reducing the search space is reduced.

【０００７】一方、前に示した特開平７−２９５５８７
号公報において開示された技術によると、その時照合し
ている単語が仮りに確定したならば発生するであろう言
語スコアを、確定する前にあらかじめ算出しておく方法
が開示されている。しかし、この技術では、その時に照
合している単語が確定した時に新たに発生するスコアの
みを予備計算できるにとどまっている。複数の単語が連
なって文などを形成している場合には、照合している単
語以後の単語に対して言語スコアを予測することはでき
ない。これでは十分な精度を期待できないので、探索空
間を大幅に減らすことは困難である。On the other hand, Japanese Unexamined Patent Application Publication No. 7-295587
According to the technique disclosed in Japanese Patent Laid-Open Publication No. H10-210, a method is disclosed in which a language score that would be generated if the word being collated at that time is temporarily determined is calculated before the word is determined. However, in this technique, only a score newly generated when a word to be collated at that time is determined can be preliminarily calculated. When a plurality of words are linked to form a sentence or the like, a language score cannot be predicted for words after the word being collated. Since sufficient accuracy cannot be expected with this, it is difficult to significantly reduce the search space.

【０００８】上述したように、構文解析およびそれに伴
う言語処理を進めるにあたって、解析が進んだときに発
生する予測言語スコアを処理中に時間をとられずに獲得
する方法は開示されていない。したがって、処理の精度
を落とすことなしに、言語処理の探索空間を大幅に削減
することは困難であった。As described above, in proceeding with the syntax analysis and the accompanying language processing, there is no disclosure of a method for acquiring a predicted language score generated when the analysis proceeds without taking a long time during the processing. Therefore, it has been difficult to significantly reduce the search space for language processing without lowering the processing accuracy.

【０００９】本発明は、このような事情を考慮してなさ
れたものであり、処理に余分な時間をかけることなく予
測言語スコアを得ることができるようにすることを目的
としている。The present invention has been made in view of such circumstances, and has as its object to obtain a predicted language score without taking extra time for processing.

【００１０】[0010]

【課題を解決するための手段】本発明によれば、チャー
ト方式や、（一般）ＬＲ方式や、有限状態オートマトン
などを利用して構文規則を行いながら言語評価を進める
時に、すべての構文規則の解析状態、あるいは特に必要
とする一部の解析状態に対して、後続できる言語列をあ
らかじめ予測しておく。その予測された言語列に伴う言
語スコアを算出しておき、予測言語スコアとして最も適
切な値を解析状態に対応させて記憶させておく。解析状
態とは、チャート方式ではエッジ、ＬＲ方式ではＬＲ表
の行もしくはスタックの状態、有限状態オートマトンで
は状態などである。そして言語評価を行う際に、その時
の構文解析の状態から対応する予測言語スコアを参照す
ることで、新たに計算することなく予測言語スコアを得
られる。この予測言語スコアを、ビーム探索の枝刈り評
価に利用したり、最良優先探索のヒューリスティックス
コアに利用することで、探索を効率よく進められる。According to the present invention, when performing language evaluation while performing syntax rules using a chart system, a (general) LR system, a finite state automaton, or the like, all syntax rules are used. For the analysis state or a part of the analysis state particularly required, a language sequence that can be succeeded is predicted in advance. A language score associated with the predicted language sequence is calculated, and the most appropriate value as the predicted language score is stored in association with the analysis state. The analysis state is an edge in the chart system, a row or stack state of the LR table in the LR system, a state in the finite state automaton, and the like. Then, when performing language evaluation, a predicted language score can be obtained without newly calculating by referring to the corresponding predicted language score from the state of the syntax analysis at that time. By using the predicted language score for the pruning evaluation of the beam search or the heuristic score of the best priority search, the search can be efficiently advanced.

【００１１】さらに、あらかじめ予測する言語スコア
が、実際に評価を進めたときに得られるスコアよりも、
同じ、もしくはより良い評価になることが保証されてい
る場合には、その予測スコアを利用した最良優先探索が
認容可能条件を満たす探索になる。この時に、最も良い
評価の解が見つかることが保証されるという利点があ
る。言語スコアの絶対値が少ないほど良いスコアである
時に、接続する語彙項目が増えれば増えるほどスコアの
絶対値が増大するような体系がよく利用される。このよ
うに、語彙項目の接続が長くなるほどスコアが悪くなる
体系においては、あらかじめ予測スコアを計算するとき
に全探索をすることで、最も良い予測スコアをあらかじ
め決定することができる。この最も良いスコアを予測ス
コアとして、解析状態に対応させて記憶させておき評価
時に予測スコアとして利用することで、評価が終わった
実際のスコアと同じかもしくはそれよりも良い評価にな
る。Further, the language score predicted in advance is smaller than the score obtained when the evaluation is actually advanced.
If the same or better evaluation is guaranteed, the best priority search using the predicted score is a search that satisfies the acceptable condition. At this time, there is an advantage that it is guaranteed that the best evaluation solution is found. When the absolute value of the linguistic score is smaller and the score is better, a system is often used in which the absolute value of the score increases as the number of connected vocabulary items increases. As described above, in a system in which the score worsens as the connection of the vocabulary items becomes longer, the best prediction score can be determined in advance by performing a full search when calculating the prediction score in advance. By storing the best score as a prediction score in association with the analysis state and using it as a prediction score at the time of evaluation, the evaluation becomes the same as or better than the actual score after the evaluation.

【００１２】ここで利用できる言語スコアには様々なも
のがある。音節などの音のつながりに関するｎ−ｇｒａ
ｍなどの統計情報や、単語や形態素などの語彙項目のつ
ながりに関するｎ−ｇｒａｍなどの統計情報や、長い語
彙項目ほど有利にする経験的に与えるスコアや、語彙項
目や品詞などに固有な経験的に与えるスコアや、書き換
え規則などの表現による構文規則に統計的もしくは経験
的に与えるスコアなど、が利用できる。これらの知識源
はそれぞれ単独で利用しても良いし、複数を組み合わせ
ることも可能である。複数を組み合わせるときには、各
知識源のスコアを各々別々のある定数を乗じた後に、和
をとる方法を利用すると計算が容易であり、最良優先探
索にも適合しやすい。また、ここであげた知識源のスコ
ア体系は、いずれも接続する語彙項目が増えるほどスコ
アが悪くなるような体系にできる。すなわち、認容可能
な最良優先探索を実現できる。There are various language scores that can be used here. N-gra for connection of sounds such as syllables
statistical information such as m, statistical information such as n-gram relating to the connection of vocabulary items such as words and morphemes, empirical scores given to longer vocabulary items, and empirical information unique to vocabulary items and parts of speech Or a score that is statistically or empirically assigned to a syntax rule based on an expression such as a rewriting rule. Each of these knowledge sources may be used alone, or a plurality of them may be combined. When a plurality of knowledge sources are combined, a method of multiplying the score of each knowledge source by a certain constant and then taking the sum is easy to calculate, and is suitable for the best priority search. In addition, any of the score systems of the knowledge sources mentioned here can be a system in which the score becomes worse as the number of connected vocabulary items increases. That is, an acceptable best priority search can be realized.

【００１３】[0013]

【発明の実施の形態】以下、本発明の実施例について説
明する。［実施例１］図１は、本発明の実施例１の言語処理装置
の構成を示しており、図１において、言語処理装置は言
語処理制御部１０、ＬＲ表記憶部２０および候補記憶部
３０等からなっている。また言語処理制御部１０は構文
解析部１２、構文解析制御部１３および言語スコア計算
部１４等を具備し、ＬＲ表記憶部２０のＬＲ表を参照し
ながら、入力された文字列を解析する処理の流れを制御
する。図２は本実施例におけるＬＲ表の内容を示す。Ｌ
Ｒ表は、入力された文字列と構文を解析した結果とを対
応させるための情報と、適用する書き換え規則に対応す
る言語評価のスコアと、各状態に対してその状態以降に
発生する評価スコアの予測値とを記録する。ＬＲ表を作
成するためには、あらかじめ書き換え規則の形で与えら
れた構文規則を用意しておき、この規則をコンパイルす
ればよい。また、各書き換え規則には、規則ごとにその
規則を適用する頻度や重要性、文としての自然さなどの
条件に応じてスコアを割り当てる。ここでは、適用しや
すい規則ほど低い値を持ち、あまり適用したくない規則
には高い値を割り振るようにしておく。一般には、規則
が適用される確率を求めて、その対数の絶対値をとるこ
とで上記の条件を満たすスコアを算出することができ
る。規則が適用される確率は、サンプルとなる文章の集
まりをそれらの規則で解析して、解析に利用した各規則
の回数に応じて算出できる。算出されたスコアは、ＬＲ
表の中の各書き換え規則に対応するリデュース番号とと
もにスコアの表に記入しておく。また、ＬＲ表の各状態
における評価スコアの予測値は、その状態から文の最後
に到達するまでに適用するリデュース操作によって発生
するスコアの予測値である。この予測スコアを算出する
処理の流れについては後述する。Embodiments of the present invention will be described below. Embodiment 1 FIG. 1 shows the configuration of a language processing apparatus according to Embodiment 1 of the present invention. In FIG. 1, the language processing apparatus includes a language processing control unit 10, an LR table storage unit 20, and a candidate storage unit 30. And so on. The language processing control unit 10 includes a syntax analysis unit 12, a syntax analysis control unit 13, a language score calculation unit 14, and the like, and performs a process of analyzing an input character string while referring to the LR table in the LR table storage unit 20. Control the flow of FIG. 2 shows the contents of the LR table in this embodiment. L
The R table includes information for associating an input character string with a result of parsing a syntax, a score of a language evaluation corresponding to a rewriting rule to be applied, and an evaluation score generated for each state after that state. And the predicted value of. In order to create the LR table, a syntax rule given in the form of a rewrite rule may be prepared in advance, and the rule may be compiled. In addition, a score is assigned to each rewriting rule according to conditions such as frequency and importance of applying the rule and naturalness as a sentence. Here, rules that are easier to apply have lower values, and rules that are less likely to be applied are assigned higher values. In general, a score that satisfies the above condition can be calculated by determining the probability that the rule is applied and taking the absolute value of the logarithm. The probability that a rule is applied can be calculated according to the number of rules used in the analysis by analyzing a sample set of sentences using those rules. The calculated score is LR
Write in the score table along with the reduce numbers corresponding to each rewrite rule in the table. The predicted value of the evaluation score in each state of the LR table is a predicted value of a score generated by a reduce operation applied from that state to the end of the sentence. The flow of the process of calculating the prediction score will be described later.

【００１４】構文解析部１２はＬＲ表を参照して構文解
析を行う。言語スコア計算部１４はＬＲ表を参照して構
文解析により生じるスコアを算出する。候補記録部３０
では、処理中の候補の情報、言語スコア計算部１４が出
力する評価スコアおよびＬＲ表記憶部２０から読み出し
た予測スコアを記録する。構文解析制御部１３は候補記
録部３０のデータを参照して解析を進める候補を決定す
る。The syntax analyzer 12 performs syntax analysis with reference to the LR table. The language score calculator 14 calculates a score generated by the syntax analysis with reference to the LR table. Candidate recording unit 30
Then, information on the candidate being processed, the evaluation score output by the language score calculation unit 14, and the prediction score read from the LR table storage unit 20 are recorded. The syntax analysis control unit 13 refers to the data in the candidate recording unit 30 to determine a candidate for which analysis is to be performed.

【００１５】実施例１における言語処理のフローチャー
トを図３に示す。まず、処理をする文字列に対して、構
文解析の初期状態の候補を一つ用意する（ステップＳ１
０１）。次に、最初の入力文字を先読み記号として読み
込む（ステップＳ１０２）。初期状態の候補と読み込ん
だ先読み記号に対して、ＬＲ表を参照する（ステップＳ
１０３）。ＬＲ表を参照した結果として、各候補におけ
る先読み記号に対する操作が可能ならば、その操作を行
って新しい候補を作成する（ステップＳ１０５）。この
結果として新しい候補が作成されなかったらば、構文解
析に失敗したことを示す（ステップＳ１０４）。構文解
析に失敗したら、結果を出力して終了する（ステップＳ
１０９）。失敗せずに新しい候補が残ったならば、その
候補に対してスコア評価を行う（ステップＳ１０６）。
スコアの評価は、ＬＲ表を参照したときにリデュース操
作を行った候補に対して、それまでの候補の評価スコア
に、リデュース操作に対応するスコアを加算する。リデ
ュース操作に対応するスコアは、ＬＲ表の中のリデュー
スの部分にスコアを記録してあるので、それを読み込
む。そして、新たなＬＲ表における状態に対してその予
測スコアを読み込む。予測スコアは、ＬＲ表の中のアク
ション表の部分に記録してあるので、それを読み込む。
評価スコアと予測スコアが新たに計算された後に、それ
らの合計スコアが、最もスコアの良い候補のスコアよ
り、あらかじめ決められた値以上の差を持っている候補
は削除する（ステップＳ１０７）。削除された候補は以
後、処理されることはない。ステップＳ１０２からステ
ップＳ１０７までの操作を、最後の入力文字まで到達す
るか、もしくは解析に失敗するまで行う。最後の入力文
字に対して処理を終了した時点で、最も評価スコアが良
い候補を認識結果として出力する（ステップＳ１０
９）。FIG. 3 shows a flowchart of the language processing in the first embodiment. First, one candidate for the initial state of syntax analysis is prepared for the character string to be processed (step S1).
01). Next, the first input character is read as a look-ahead symbol (step S102). The LR table is referred to for the initial state candidate and the read-ahead symbol (step S).
103). If the operation on the look-ahead symbol in each candidate is possible as a result of referring to the LR table, the operation is performed to create a new candidate (step S105). If a new candidate is not created as a result, it indicates that the syntax analysis has failed (step S104). If the syntax analysis fails, the result is output and the processing is terminated (step S
109). If a new candidate remains without failure, score evaluation is performed on the candidate (step S106).
In the evaluation of the score, for the candidate who performed the reduce operation when referring to the LR table, the score corresponding to the reduce operation is added to the evaluation score of the candidate up to that time. Since the score corresponding to the reduce operation is recorded in the reduce portion in the LR table, the score is read. Then, the prediction score is read for the state in the new LR table. Since the prediction score is recorded in the action table portion in the LR table, it is read.
After the evaluation score and the prediction score are newly calculated, the candidate whose total score has a difference greater than or equal to a predetermined value from the score of the candidate with the highest score is deleted (step S107). The deleted candidate is no longer processed. The operations from step S102 to step S107 are performed until the last input character is reached or the analysis fails. When the processing for the last input character is completed, the candidate having the highest evaluation score is output as a recognition result (step S10).
9).

【００１６】ＬＲ表に記録しておく各状態からの予測ス
コアを算出する処理のフローチャートを図４に示す。ま
ず、構文解析するための状態を一つ取り出す（ステップ
Ｓ２０１）。本実施例では、対象とする状態はＬＲ表の
行に対応する状態である。取り出した状態から、次に遷
移可能な状態をすべてリストアップする（ステップＳ２
０２）。遷移可能なすべての状態に対して、その遷移の
際に発生する言語的スコアを加えて、元の状態からその
状態へ遷移するのに必要なスコアを計算する（ステップ
Ｓ２０３）。ステップＳ２０２とステップＳ２０３の操
作を、適切な予測スコアが見つかったと判断されるまで
繰り返す（ステップＳ２０４）。本実施例ではスコアの
絶対値が小さければ小さいほど良いスコアであり、か
つ、構文解析の状態をたどるほどスコアが悪化するよう
に設定されているので、適切な予測スコアが見つかった
と判断される場合には、以下の三つが存在する。第１の
場合は、構文解析終了として受理できる状態に達した解
析の候補ができて、その予測スコアが他の解析を継続し
ている候補のスコアとそれまでに受理に到達した候補の
予測スコアのいずれもより小さい値となっている場合で
ある。第２の場合は、解析を継続している候補のスコア
が増大して、すでに受理できる状態に達した予測スコア
の候補よりも悪くなった場合である。第３の場合は、解
析している候補が無くなった場合である。ステップＳ２
０４において予測スコアが見つかったと判断されたとき
は、見つかった予測スコアを対応する記録すべき場所へ
記録する（ステップＳ２０５）。また、適切な予測スコ
アが見つけられていないと判断した場合には、ステップ
Ｓ２０２へ戻って探索を進める。ステップＳ２０１から
ステップＳ２０５までの操作を、すべての状態に対して
行ったら処理は終了する。上記の処理の中で、ある状態
に対する予測スコアを算出するときに、受理に至るまで
に経由する別の状態の予測スコアがすでに算出されてい
ていれば、その予測スコアを利用することで計算量を減
せらせる。FIG. 4 shows a flowchart of a process for calculating a prediction score from each state recorded in the LR table. First, one state for parsing is extracted (step S201). In this embodiment, the target state is a state corresponding to a row of the LR table. From the extracted state, all states that can be changed next are listed (step S2).
02). A linguistic score generated at the time of the transition is added to all the transitionable states, and a score required to transition from the original state to the state is calculated (step S203). The operations of step S202 and step S203 are repeated until it is determined that an appropriate prediction score has been found (step S204). In the present embodiment, the smaller the absolute value of the score is, the better the score is, and the score is set to be worse as the state of the parsing is traced, so that it is determined that an appropriate prediction score is found. The following three exist. In the first case, a candidate for an analysis that has reached an acceptable state as the completion of parsing is created, and its predicted score is the score of a candidate that continues to perform another analysis and the predicted score of a candidate that has reached acceptance until then. Are smaller values. The second case is a case where the score of the candidate for which the analysis is continued increases and becomes worse than the candidate of the prediction score which has already reached the acceptable state. The third case is when there are no more candidates to be analyzed. Step S2
When it is determined in step 04 that the prediction score has been found, the found prediction score is recorded in the corresponding recording location (step S205). If it is determined that an appropriate prediction score has not been found, the process returns to step S202 to advance the search. When the operations from step S201 to step S205 have been performed for all the states, the process ends. In the above process, when calculating the prediction score for a certain state, if the prediction score of another state that has been passed before the acceptance is already calculated, the calculation score is calculated by using the prediction score. Can be reduced.

【００１７】［実施例２］次に最良優先探索を適用した
本発明を音声認識の言語処理に利用した実施例２につい
て説明する。図５は実施例２の構成図であり、図５にお
いて、音韻認識部６０は、入力された音声を信号処理し
て、ＨＭＭ（隠れマルコフモデル）などで表現された音
節レベルの音声モデルと照合して、音節やモーラおよび
音素を単位として音声を認識する。本実施例では認識さ
れた結果は、モーラをアークとするネットワーク表現
（以下モーラグラフ）によって表現する。モーラグラフ
による音韻認識部の出力は図６に示すようなものであ
る。言語処理制御部１１は、モーラグラフを入力とし
て、辞書記憶部２２、ＬＲ表記憶部２１、候補記録部３
１、スコア計算部４０、言語知識源５０とともに言語的
評価を行い、評価の結果にしたがって音声認識の認識結
果を出力する。辞書記憶部２２は、モーラの並びがどの
ような形態素を構成しうるのかという情報を持つ。ＬＲ
表記憶部２１は、どのような形態素のならびが文として
受理できるかの情報をＬＲ表の形式にコンパイルした情
報を記憶している。候補記録部３１は、探索中の候補の
情報を保持する。言語知識源５０は、言語的にどの程度
もっともらしいかの情報を保持している。本実施例では
言語知識源として、モーラのモノグラム（ｍｏｎｏ−
ｇｒａｍ）、バイグラム（ｂｉ−ｇｒａｍ）、トライグ
ラム（ｔｒｉ−ｇｒａｍ）と、形態素のモノグラム、バ
イグラム、トライグラム、および品詞の最小コスト法を
利用する。モーラおよび形態素のモノグラム、バイグラ
ム、トライグラムは、あらかじめテキストデータに含ま
れるモーラおよび形態素のならびの数を計数しておき、
統計的に与えておく。品詞の最小コストは、認識結果が
良くなるように品詞ごとにコストを経験的にあらかじ与
えておく。スコア計算部４０は、言語知識源５０を参照
して探索中もしくは探索終了した候補の評価スコアを計
算する。[Second Embodiment] Next, a description will be given of a second embodiment in which the present invention to which the best priority search is applied is used for speech recognition language processing. FIG. 5 is a configuration diagram of the second embodiment. In FIG. 5, a phoneme recognition unit 60 performs signal processing on an input voice and checks it against a syllable-level voice model represented by an HMM (Hidden Markov Model) or the like. Then, speech is recognized in units of syllables, mora, and phonemes. In the present embodiment, the recognized result is expressed by a network expression (hereinafter, a mora graph) using a mora as an arc. The output of the phoneme recognition unit based on the mora graph is as shown in FIG. The language processing control unit 11 receives a mora graph as input, and stores the dictionary storage unit 22, the LR table storage unit 21, the candidate recording unit 3
1. A linguistic evaluation is performed together with the score calculation unit 40 and the linguistic knowledge source 50, and a speech recognition result is output according to the evaluation result. The dictionary storage unit 22 has information on what morphemes can be formed by the arrangement of mora. LR
The table storage unit 21 stores information obtained by compiling information on what morpheme sequence can be accepted as a sentence into an LR table format. The candidate recording unit 31 holds information on the candidate under search. The linguistic knowledge source 50 holds information on the degree of linguistic likelihood. In the present embodiment, a monogram of mora (mono-
gram, bi-gram, tri-gram, and morpheme monogram, bigram, trigram, and part of speech minimum cost methods. Monograms, bigrams, and trigrams of mora and morphemes count the number of mora and morphemes contained in text data in advance,
Give it statistically. The minimum cost of the part of speech is empirically given in advance for each part of speech so that the recognition result is improved. The score calculation unit 40 refers to the linguistic knowledge source 50 and calculates an evaluation score of a candidate that is being searched or has been searched.

【００１８】実施例２の言語処理制御部１１の処理の流
れを図７にフローチャートで示す。まず、音韻認識部６
０の出力するモーラグラフを受け取る（ステップＳ３０
１）。次に、初期状態の候補を生成して候補記録部３１
で記録させる（ステップＳ３０２）。本実施例の初期状
態の候補とは、形態素が一つも受理されていなくて、到
達ノードをモーラグラフの開始ノードとし、評価スコア
が０の候補である。そして、候補記憶部３１に記憶され
ている候補の数を調べて、０個ならば処理は終了する
（ステップＳ３０３）。ただし、最初はステップＳ３０
２で候補を生成しているので候補が０個ということはな
い。次に、最も良いスコアの候補を一つ取り出す（ステ
ップＳ３０４）。ここで最も良いスコアとは、それまで
の評価スコアと予測スコアの和が最も小さいスコアのこ
とである。取り出した候補が解として成立していれば、
その候補を認識結果として出力する（ステップＳ３０
５、ステップＳ３０６）。候補が解として成立している
とは、到達ノードがモーラグラフの終了ノードまで達し
ていて、かつ構文解析が受理可能であることである。出
力した認識結果は、出力する度にディスプレイに表示し
ても良いし、まとめて記憶しておき言語処理制御部１１
の一つの発話に対する処理が終了した時点でまとめて表
示しても良い。出力した解の数が１個あるいは５個など
のようにあらかじめ決められた数に達したら、そこで処
理は終了する（ステップＳ３０７）。解の数が決められ
た数に達していない場合は、ステップＳ３０３へ行き次
の候補へ処理が移る。ステップＳ３０５において候補が
解でないと判定された場合は、その候補の到達ノードか
らモーラグラフの上で辞書照合を行う（ステップＳ３０
８）。ステップＳ３０８において複数の形態素が照合に
成功したら、それぞれの結果に対して照合に成功した形
態素、およびその品詞、モーラグラフ上の到達ノード、
および到達ノードに付属している音韻予測スコアを記憶
する。次に、ステップＳ３０８で辞書検索に成功した形
態素に対して、ＬＲ表を参照して構文解析を行う（ステ
ップＳ３０９）。ＬＲ表を参照した結果として、解析に
失敗した候補は削除する。解析に成功した候補は、その
構文解析状態とともにその状態における予測言語スコア
を読み出す。構文解析に成功した候補に対して、その候
補のモーラのならび形態素のならびに応じて、スコア計
算部４０にスコアを計算させる（ステップＳ３１０）。
スコア計算部４０は、言語知識源５０を参照して評価ス
コアを計算する。候補記録部３１の中の候補の数と、新
たに作成された候補の評価スコアに応じて、新しく作成
された候補は候補記録部３１に記録される（ステップＳ
３１１）。すでに、候補記録部３１に記憶されている候
補の中で最も良いスコアからある一定の差以上に悪いス
コアの候補、および、スコアの良い順番に候補を並べ
て、あらかじめ決められたある数より順位の悪い候補
を、候補記録部３１から削除する。ステップＳ３０３か
らステップＳ３１１までの処理を、候補記録部３１から
候補がなくなるか、解として出力した候補の数が決めら
れた数より多くなるまで繰り返す。FIG. 7 is a flow chart showing the flow of processing of the language processing control section 11 of the second embodiment. First, the phoneme recognition unit 6
The mora graph output from 0 is received (step S30).
1). Next, a candidate in the initial state is generated, and the candidate recording unit 31
(Step S302). The candidates for the initial state in the present embodiment are candidates for which no morpheme has been received, the reaching node is the start node of the mora graph, and the evaluation score is 0. Then, the number of candidates stored in the candidate storage unit 31 is checked, and if the number is 0, the process ends (step S303). However, initially, step S30
Since the candidate is generated in step 2, the number of candidates is not zero. Next, one candidate with the best score is extracted (step S304). Here, the best score is a score having the smallest sum of the evaluation score and the prediction score up to that point. If the extracted candidate is established as a solution,
The candidate is output as a recognition result (step S30)
5, step S306). The fact that the candidate is established as a solution means that the reaching node has reached the end node of the mora graph, and the parsing is acceptable. The output recognition result may be displayed on the display each time it is output, or may be stored together and stored in the language processing control unit 11.
May be displayed collectively when the processing for one utterance is completed. When the number of output solutions reaches a predetermined number such as one or five, the process ends there (step S307). If the number of solutions has not reached the predetermined number, the process proceeds to step S303 and the process proceeds to the next candidate. If it is determined in step S305 that the candidate is not a solution, dictionary matching is performed on the mora graph from the node reached by the candidate (step S30).
8). If the plurality of morphemes are successfully matched in step S308, the morphemes that have been successfully matched for each result, their parts of speech, the reached nodes on the mora graph,
And a phoneme prediction score attached to the reaching node. Next, syntax analysis is performed on the morpheme for which the dictionary search was successful in step S308 with reference to the LR table (step S309). As a result of referring to the LR table, candidates that failed in analysis are deleted. A candidate that has been successfully analyzed reads its parsing state and the predicted language score in that state. For the candidate for which the syntax analysis was successful, the score calculating unit 40 calculates a score according to the mora and morpheme sequence of the candidate (step S310).
The score calculation unit 40 calculates an evaluation score with reference to the language knowledge source 50. The newly created candidate is recorded in the candidate recording unit 31 according to the number of candidates in the candidate recording unit 31 and the evaluation score of the newly created candidate (Step S).
311). Among the candidates already stored in the candidate recording unit 31, candidates with a score that is worse than a certain difference from the best score and candidates are arranged in the order of the score, and the candidates are ranked in a predetermined number. The bad candidate is deleted from the candidate recording unit 31. The processing from step S303 to step S311 is repeated until there are no more candidates in the candidate recording unit 31 or the number of candidates output as solutions exceeds a predetermined number.

【００１９】予測スコア算出の処理の流れは、上述実施
例１に準じる。ただし、各種の知識源のスコアをまとめ
た点を評価するのが実施例１とは異なる。The flow of the process of calculating the prediction score is in accordance with the first embodiment. However, it is different from the first embodiment in that the points obtained by summarizing the scores of various knowledge sources are evaluated.

【００２０】[0020]

【発明の効果】以上説明したように、本発明によれば、
言語評価を行う際に、余分な計算を必要とせずに未探索
部分の言語スコアも精度良く予測できるので、その予測
スコアを利用して探索空間を大幅に削減する。したがっ
て、言語評価の処理時間を大幅に短い時間で行うことを
可能とする。特に、音声認識や文字認識の結果などのよ
うにあいまいさがあり、探索空間が広い時において、処
理を高速化するのに高い効果を示す。As described above, according to the present invention,
When performing language evaluation, the language score of the unsearched portion can be accurately predicted without requiring extra calculation, and the prediction space is significantly reduced by using the predicted score. Therefore, the processing time for language evaluation can be performed in a significantly short time. In particular, when there is ambiguity such as the result of voice recognition or character recognition, and the search space is wide, it is highly effective in speeding up the processing.

[Brief description of the drawings]

【図１】本発明の実施例１の構成を示すブロック図で
ある。FIG. 1 is a block diagram illustrating a configuration of a first exemplary embodiment of the present invention.

【図２】実施例１における予測スコアを含むＬＲ表の
例を説明する図である。FIG. 2 is a diagram illustrating an example of an LR table including a prediction score according to the first embodiment.

【図３】実施例１の動作を説明するフローチャートで
ある。FIG. 3 is a flowchart illustrating an operation according to the first exemplary embodiment.

【図４】実施例１における予測スコアの算出動作を説
明するフローチャートである。FIG. 4 is a flowchart illustrating an operation of calculating a prediction score according to the first embodiment.

【図５】本発明の実施例２の構成を説明するブロック
図である。FIG. 5 is a block diagram illustrating a configuration of a second exemplary embodiment of the present invention.

【図６】実施例２におけるモーラグラフの１例を説明
する図である。FIG. 6 is a diagram illustrating an example of a mora graph according to the second embodiment.

【図７】実施例２の動作を説明するフローチャートで
ある。FIG. 7 is a flowchart illustrating the operation of the second embodiment.

[Explanation of symbols]

１０、１１言語処理制御部１２構文解析部１３構文解析制御部１４、４０言語スコア計算部２０、２１ＬＲ表記憶部３０、３１候補記憶部６０音声認識部 10, 11 language processing control unit 12 syntax analysis unit 13 syntax analysis control unit 14, 40 language score calculation unit 20, 21 LR table storage unit 30, 31 candidate storage unit 60 voice recognition unit

Claims

[Claims]

1. A language processing apparatus for performing language evaluation together with syntax analysis, comprising: a syntax analysis means for performing syntax analysis; and an analysis state that can be taken at the time of syntax analysis. A means for recording the predicted value of the language evaluation score to be generated; a means for generating a language evaluation score for each processing during the syntax analysis; and an analysis state that has been reached before the analysis state is reached. A language processing apparatus comprising: a syntax analysis control unit that controls the syntax analysis unit based on a score of a language evaluation and a predicted value of the score of the analysis state.

2. The parsing control means performs a parsing process after the parsing state when the sum of the score of the language evaluation and the predicted value of the score is poorly evaluated for the reached parsing state. 2. The language processing apparatus according to claim 1, wherein said language processing apparatus is not provided.

3. The language according to claim 1, wherein the syntax analysis control means continues the syntax analysis processing after the analysis state in the order in which the sum of the score of the language evaluation and the predicted value of the score is evaluated well. Processing equipment.

4. The language processing apparatus according to claim 3, wherein a best predicted value that can be predicted from a predetermined analysis state of the syntax analysis is a predicted value recorded in correspondence with the analysis state.

5. The language processing apparatus according to claim 1, further comprising a speech recognition unit, wherein the syntax analysis and the language evaluation are performed on a speech recognition result by the speech recognition unit.

6. A language processing method for performing a language evaluation together with a syntax analysis, comprising the steps of: performing a syntax analysis; Recording a predicted value of the evaluation score; generating a language evaluation score for each process during the syntax analysis; and determining the analytic state that has been reached before reaching the analytic state. A language processing method, comprising: controlling a step of performing the syntax analysis based on a score and a predicted value of the score of the analysis state.