JPH03180899A

JPH03180899A - Voice recognizing method

Info

Publication number: JPH03180899A
Application number: JP1321685A
Authority: JP
Inventors: Tatsuya Kimura; 達也木村
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1989-12-11
Filing date: 1989-12-11
Publication date: 1991-08-06

Abstract

PURPOSE:To analyze a sentence efficiently for sentence voice recognition by providing a clause sequence storage part, a clause detection start condition generation part, etc., and carrying out processes for clause detection and meaning analysis simultaneously, progressively, and integrally. CONSTITUTION:The clause detection part 12 of a meaning expression output part 11 detects a clause candidate from the phoneme sequence of an input sentence voice recognized by a phoneme recognition part 10 and a meaning analysis part 13 checks the possibility of linguistic and position-concerned connections between the clause candidate and a clause sequence group which is already stored in a clause sequence storage part 14, so that clauses which can be connected are stored in the storage part 14 together with the meaning expression of modification analytic results. At the same time, the clause detection start condition generation part 15 generates clause start conditions according to the storage contents of the storage part 14 and supplies them to a clause detection part 12, and clause candidates which do not match the conditions are inhibited from being detected. Those clause detecting process and meaning analyzing process are carried out simultaneously, progressively, and integrally to inhibit unnecessary clause detection from being performed, thereby analyzing sentences efficiently.

Description

【発明の詳細な説明】産業上の利用分野本発明は、連続発声された文音声を主な認識対象とする
場合の音声認識方法に関する。DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to a speech recognition method when the main object of recognition is continuously uttered sentence speech.

従来の技術従来、文音声を認識する手法に関しては、種々の方法が
提案されている。BACKGROUND OF THE INVENTION Conventionally, various methods have been proposed for recognizing sentence speech.

日本語文を認識対象とした場合のその中の１つの典型的
な方法は、まず入力音声のどの部分区間がどの文節に該
当するかを全音声区間について推定して文節候補の集合
を作成しておき、その文節候補の集合より、時間的な位
置関係と文法や意味などの言語的な制約（一般に文節間
の掛り受は規則を指す場合が多い）を満足する一連の系
列をなす文節からなる部分集合を、認識結果として推定
する方法である。One typical method when recognizing Japanese sentences is to first estimate which subsections of the input speech correspond to which clauses for all speech intervals, and then create a set of clause candidates. The set of phrase candidates consists of a series of phrases that satisfy the temporal positional relationship and linguistic constraints such as grammar and meaning (in general, the overlap between phrases often refers to rules). This is a method of estimating a subset as a recognition result.

時間的な位置情報を伴った文節の候補の集合のことを、
「文節ラティス」と呼ぶことにすると、「文節ラティス
」から得られる上記諸制約を同時に満たす文節の系列は
、一般には複数個存在するが、推定の妥当性を評価する
尺度（例えば確率論的解釈によって導かれる尺度（後述
する）をあらかじめ用意しておき、その評価値にしたが
って、文節候補の順位付けをし、１位の候補を認識結果
として採択することがよく行なわれる。A set of clause candidates with temporal position information is
If we call it the "Bunsetsu lattice," there are generally multiple sets of clauses that simultaneously satisfy the above constraints obtained from the "Bunsetsu lattice." It is common practice to prepare in advance a scale (described later) derived from the above, rank the phrase candidates according to the evaluation value, and select the first-ranked candidate as the recognition result.

ところで、以上述べた方法を実現するためには・以下の
具体的課題を明確にする必要がある。By the way, in order to realize the method described above, it is necessary to clarify the following specific issues.

（１）文節ラティスから最終結果を得る具体的な方法。(1) A specific method to obtain the final result from the bunsetsu lattice.

（２）音声信号から文節ラティスを作成する方法。(2) A method of creating a bunsetsu lattice from audio signals.

（３）利用する文法の種類。(3) Type of grammar used.

本発明で取り扱う内容である（１）としては、従来、「
文節ラティス」を構成する全ての文節について総当たり
的に時間的位置関係及び言語的制約のチエツクを行なう
方法や、文頭から文末の方向へグラフ理論の分野におけ
る木探索又は経路探索の方法を用いて、文頭より文末の
方向にしたがって上記言語的制約のチエツクを文節候補
に適用する手法などが知られている（例えば１９７９年
刊、新美著「音声認識」共立出版発行）。As for (1), which is the subject of the present invention, conventionally, “
This method uses a brute force method to check the temporal positional relationships and linguistic constraints of all the clauses that make up the clause lattice, and a tree search or path search method in the field of graph theory from the beginning of the sentence to the end. , a method is known in which the above-mentioned linguistic constraint check is applied to clause candidates from the beginning to the end of the sentence (for example, "Speech Recognition" by Niimi, published by Kyoritsu Shuppan, published in 1979).

しかし、前者は演算効率の点で、後者は日本語の言語構
造、即ち「係り受け」における「係り」に相当する文節
が「受け」に相当する文節より前に位置するという日本
語特有の言語構造に対する整合性の面でそれぞれ問題が
あり、日本語文音声認識の実現に際し、必ずしも満足の
いく結果を得るには至っていなかった。However, the former is due to computational efficiency, and the latter is due to the linguistic structure of Japanese, in which the clause corresponding to ``kari'' in ``modariuke'' is placed before the clause corresponding to ``uke'', a language unique to Japanese. Each method has problems in terms of consistency with respect to its structure, and it has not always been possible to obtain satisfactory results when realizing Japanese sentence speech recognition.

これを解決するため、本発明者により、上記探索の方法
に関し、文末から文頭の方向で、文節間の係り受は構造
規則の制約条件のもとて探索を行ない文節候補の系列を
得る方法が提案されている。In order to solve this problem, the present inventor proposed a method to obtain a sequence of clause candidates by searching from the end of the sentence to the beginning of the sentence, subjecting the dependencies between clauses to the constraints of the structure rules. Proposed.

つまり、この方法は、日本語の言語構造に適しかつ効率
が良いという特長を有している。In other words, this method has the advantage of being suitable for the language structure of Japanese and being efficient.

即ち、第３図はこの方法の文音声認識システムを示し、
音素認識部１は入力された音声信号より音素認識を行な
い音素系列を出力する手段を有している。得られた音素
系列は完全に正しいことは保証されず、話者や周囲雑音
あるいは音素認識装置自体の性能に起因するある確率で
誤りを含む。That is, FIG. 3 shows a sentence speech recognition system using this method,
The phoneme recognition unit 1 has means for performing phoneme recognition from an input audio signal and outputting a phoneme sequence. The obtained phoneme sequence is not guaranteed to be completely correct, and has a certain probability of containing errors due to the speaker, ambient noise, or the performance of the phoneme recognition device itself.

そのため、従来の文字入力を対象とした自然言語処理技
術による構文解析や意味解析の手法をそのまま適用する
ことはできず、文音声認識特有の処理方法が必要となる
。Therefore, it is not possible to directly apply conventional syntactic analysis and semantic analysis techniques based on natural language processing technology for character input, and a processing method specific to sentence speech recognition is required.

文節ラティス作成部２は誤りを含む認識音素系列の中の
どの部分がどの文節に該当する可能性が高いかを推定し
て文節候補を得る機能を持つ。推定された文節は始端及
び終端に関する時間的位置情報並びに推定の妥当性に関
する情報を伴う。文節推定の妥当性は、例えば音素の置
換、付加、脱落の各誤り率から得られる音素間のコンツ
ー−ジョンマトリクスから導かれる尤度などで与えられ
る。ここでは、文節候補は文頭から文末に至る全範囲に
わたって推定の妥当性が所定の値を越えるものが、全て
得られるものとする。The clause lattice creation unit 2 has a function of obtaining clause candidates by estimating which part of the recognized phoneme sequence containing errors is likely to correspond to which clause. The estimated phrase is accompanied by temporal position information regarding the beginning and end, as well as information regarding the validity of the estimation. The validity of phrase estimation is given, for example, by the likelihood derived from a concussion matrix between phonemes obtained from each error rate of phoneme substitution, addition, and omission. Here, it is assumed that all phrase candidates whose estimated validity exceeds a predetermined value over the entire range from the beginning of the sentence to the end of the sentence are obtained.

意味解析部３では、位置関係による制約及び言語的な制
約にしたがって「文節ラティス」の中の文節候補を連接
していく操作により、最終的な結果である文節の系列を
得る。言語的制約を決定する文法（ルール）としては、
例えばＦｉｌｌｍｏｒｅによる格文法（後述）や、５ｃ
ｈａｎｋによる概念依存文法などが考えられる。The semantic analysis unit 3 obtains a series of clauses as the final result by concatenating clause candidates in the "clause lattice" according to positional relationship constraints and linguistic constraints. The grammar (rules) that determine linguistic constraints are:
For example, Fillmore's case grammar (described later), 5c
A concept-dependent grammar using hank can be considered.

第４図に連続発声による例文「新大阪まで切符を３枚下
さい。」を入力した時の音素認識結果例及び「文節ラテ
ィス」の例を示す。図中、「文節ラティス」においてア
／ダーラインが施されている文節が正解の文節である。FIG. 4 shows an example of a phoneme recognition result and an example of a ``bunsetsu lattice'' when an example sentence ``Please give me three tickets to Shin-Osaka'' is input by continuous utterance. In the figure, the phrases with a/dare lines in the "Phrase lattice" are the correct phrases.

また、この例では言語的制約のチエツクに、格文法を用
いているので、格文法について簡単ｔｇ説明をしておく
。Also, in this example, a case grammar is used to check linguistic constraints, so a brief explanation of the case grammar will be given below.

格文法は述語と他の語句とがどのような関係で共存しつ
るかを記述する文法である。即ち、文における主語、述
語、目的語、補語といった役割で考えるのではなく、述
語にとって意味の上から各単語がどのような立場に立つ
かを考えろ方法による文の解析手段である。格文法は上
述のように、構文解析よりも意味解析に重点をおいてい
るため、（日本語）文音声認識への応用を考えた場合、
以下に示すような利点がある。Case grammar is a grammar that describes how predicates and other words coexist. In other words, it is a method of analyzing sentences that does not consider the roles of subjects, predicates, objects, and complements in a sentence, but rather considers the position of each word in terms of meaning for the predicate. As mentioned above, case grammar emphasizes semantic analysis rather than syntactic analysis, so when considering its application to (Japanese) sentence speech recognition,
It has the following advantages.

（１）単語間の意味的な共起関係を利用した単語候補の
絞り込みを導入しやすい。(1) It is easy to narrow down word candidates using semantic co-occurrence relationships between words.

（２）語順に対する自由度が大きいため、日本語を取り
扱い易い。(2) It is easy to handle Japanese because there is a large degree of freedom regarding word order.

（３）結果が意味の形で得られるので対話システムなど
への組み込みが容易である。(3) Since the results are obtained in the form of meaning, it is easy to incorporate into dialogue systems, etc.

第６図には、この格文法による意味解析結果の−例を示
してあり、−行目の（ＭＯＤＥ　ＭＥＩＩＩＥＩ）はこ
の文が命令文であることを表わしている。また２行目（
７）（ＡＣＴ　ＫＩＪＤＡＳＡＲＵ）は述語が「下さる
」という行為を意味していることを表わしている。また
、３行目の（ＯＢＪＥＣＴ　ＫＩＰＰＵ）以下は、述語
「下さる」という行為の内容の詳細を説明するための機
能を持つ句または節に関する記述である。即ち、行為「
下さる」の対象物は「切符」であり、その枚数は「３枚
」であり、行き先は「新大阪」であることを表わしてい
る。FIG. 6 shows an example of the result of semantic analysis using this case grammar, where (MODE MEIIIEI) in the -th line indicates that this sentence is an imperative sentence. Also, the second line (
7) (ACT KIJDASARU) indicates that the predicate means the act of ``giving.'' Furthermore, the third line (OBJECT KIPPU) and the following are descriptions of phrases or clauses that have the function of explaining the details of the action of the predicate "Gaseru". That is, the act ``
The object of ``Give me'' is ``tickets'', the number of which is ``3'', and the destination is ``Shin-Osaka''.

格文法では、各動詞について、それを意味的な詳細を説
明することが可能な項目を用意しておき、文中の各単語
を該当する項目に順次にあてはめていく操作により意味
解析が進められる。この各動詞ごとに存在する項目のこ
とを、「充填のための溝」という意味で「格スロット」
と呼ぶ。また文中の各単語を踪当する項目に順次当ては
めていく操作のことを、「スロットを埋める操作」とい
う意味で「スロットフィリング」と呼ぶ。本明細書でも
、以後、これらの用語を使用することにする。In case grammar, for each verb, items are prepared that can explain the meaning in detail, and semantic analysis proceeds by sequentially applying each word in a sentence to the corresponding item. These items that exist for each verb are called ``case slots,'' meaning ``grooves for filling.''
It is called. Also, the operation of sequentially applying each word in a sentence to the corresponding item is called ``slot filling'', which means ``slot filling operation.'' These terms will also be used hereinafter.

上記言語的制約のチエツクとは、後述する本発明の実施
例においては、「スロットフィリング」が取立するか否
かのチエツクのことを示す。In the embodiment of the present invention described later, the above-mentioned linguistic restriction check refers to a check as to whether "slot filling" is accepted.

なお、本発明で取り扱う言語処理系においては、以上に
説明した文節間の格関係のみならず、「赤い花」の例の
ような連体修飾関係や、「美しく咲く」のような連用修
飾関係、あるいは「切符を買って、京都へ行く」のよう
な接続関係も格文法における「スロットフィリング」と
同じ概念で取り扱う。いい換えると、本発明では、活用
語のみならず、被修飾語にもスロットを持たせて係り受
は関係を解析する枠組みとなっている。Note that the language processing system handled by the present invention not only handles case relations between clauses as explained above, but also adjunctive modification relations such as the example of "red flower", conjunctive modification relations such as "blooms beautifully", Also, conjunctions such as ``buy a ticket and go to Kyoto'' are treated with the same concept as ``slot filling'' in case grammar. In other words, in the present invention, not only conjugated words but also modified words have slots, and modification is a framework for analyzing relationships.

「文節ラティス」の文節候補の連結処理は、文末に位置
するあらかじめ定められた個数の文節候補群を起点にし
て、文頭方向に向けて段階的に行なう。The process of concatenating clause candidates in the "Bunsetsu lattice" is performed in stages from a predetermined number of clause candidates located at the end of the sentence toward the beginning of the sentence.

発明が解決しようとする課題ところで、上述の従来の方法の場合は、音声信号から「
文節ラティス」を作成する段階で、意味解析に不要な文
節候補も検出されるため、認識処理の効率低下を招くと
いう課題があった。Problems to be Solved by the Invention However, in the case of the above-mentioned conventional method, it is difficult to
At the stage of creating a bunsetsu lattice, clause candidates that are unnecessary for semantic analysis are also detected, which poses the problem of reducing the efficiency of recognition processing.

本発明は、このような従来の課題に鑑み、なされたもの
で・その目的とするところは、従来別個の処理として行
なわれていた「文節ラティス」作成の処理と意味解析の
処理を統合し、新たに文節の検出と意味解析の処理と同
時進行的に行なう手段を設けることにより無駄な文節候
補の検出を回避し、効率の良い分解析機構の実現を図る
ことが可能に音声認識方法を得ることにある。The present invention has been made in view of these conventional problems, and its purpose is to integrate the process of creating a "bunsetsu lattice" and the process of semantic analysis, which were conventionally performed as separate processes, To obtain a speech recognition method that avoids unnecessary detection of phrase candidates and realizes an efficient segmentation analysis mechanism by providing a new means for simultaneously performing phrase detection and semantic analysis processing. There is a particular thing.

課題を解決するための手段上記目的を達成するため、本発明の音声認識方法は、音
声信号から文節候補の区間情報と信頼度情報を検出する
文節検出手段と、文節検出手段より得られた文節候補の
中の文節候補間の位置関係に関する制約条件及び文法あ
るいは意味などの言語的な制約条件を調べる意味解析手
段と、文節間の制約条件を満たす文節の系列として文節
候補を接続することにより作成される文節系列を格納す
る文節系列格納手段とを備え、文節系列格納手段に格納
されている解析途中の文節系列の端点の情報をもとに、
必要な区間のみの文節候補を上記文節検出手段により検
出することにより文解析処理を進めることを提案するも
のである。Means for Solving the Problems In order to achieve the above object, the speech recognition method of the present invention includes a phrase detection means for detecting section information and reliability information of phrase candidates from an audio signal, and a phrase detection means for detecting interval information and reliability information of phrase candidates from a speech signal. Created by connecting phrase candidates as a series of phrases that satisfy the constraint conditions between phrases with a semantic analysis means that examines constraints on the positional relationship between phrase candidates in the candidates and linguistic constraints such as grammar or meaning. and a clause series storage means for storing a clause series to be analyzed, and based on the information of the end point of the clause series that is being analyzed, which is stored in the clause series storage means,
It is proposed that the sentence analysis process be carried out by detecting clause candidates in only the necessary sections using the clause detecting means.

作用上記音声認識方法により、以下の作用にしたがって効率
良く、更に認識精度の高い日本語文音声の認識を実現す
るようになる。Effects The above-mentioned speech recognition method realizes efficient recognition of Japanese sentence speech with higher recognition accuracy according to the following effects.

（１）文節候補を段階的に採択することにより複数個の
文節系列を同時進行的に作成しながら解析処理を行なう
ことができる。(1) By selecting clause candidates in stages, it is possible to perform analysis processing while simultaneously creating a plurality of clause sequences.

（２）　　（１）の処理の段階で文節間の「係り受け］
解析を完了できる。(2) At the processing stage of (1), “dependency” between bunsetsu clauses
Analysis can be completed.

（３）解析の途中過程で文節系列の推定妥当性の値が一
定の評価基準に達しない文節系列については以降の処理
を打ち切ることができる。(3) For phrase sequences whose estimated validity value does not reach a certain evaluation standard during the course of analysis, subsequent processing can be discontinued.

（４）文末側から文頭側への方向に解析処理を進め、文
節推定妥当性及び言語的な制約条件による制御によって
探索を行なうことができる。(4) The analysis process proceeds from the end of the sentence to the beginning of the sentence, and the search can be performed under control based on clause estimation validity and linguistic constraints.

実施例以下、第１図について本発明の実施例の詳細な説明する
。EXAMPLE Hereinafter, an example of the present invention will be described in detail with reference to FIG.

第１図は本発明による文音声認識システムの概念であり
、音素認識部１０は入力された音声信号より音素認識を
行ない音素系列を出力する手段であり、音素系列は誤認
識によりある確率で誤りを含んでいる。FIG. 1 shows the concept of a sentence speech recognition system according to the present invention. The phoneme recognition unit 10 is a means for performing phoneme recognition from an input speech signal and outputting a phoneme sequence. The phoneme sequence has a certain probability of being incorrect due to misrecognition. Contains.

図中点線の意味表現出力部１１は、文節検出部１２、意
味解析部１３、文節系列格納部１４、文節検出開始条件
作成部１６から構成され、誤りを含む音素系列から認識
結果である意味表現を得る機能を有する。そして、上記
意味表現出力部１１の各構成要素により従来の「文節ラ
ティス」を求めろ際に生じていた、意味解析に不必要へ
文節候補の検出を回避し、効率の良い文解析処理を実現
するようになっている。The semantic expression output unit 11 indicated by the dotted line in the figure is composed of a phrase detection unit 12, a meaning analysis unit 13, a phrase sequence storage unit 14, and a phrase detection start condition creation unit 16, and is a semantic expression that is a recognition result from a phoneme sequence containing errors. It has the function of obtaining Then, each component of the semantic expression output unit 11 avoids the detection of clause candidates that are unnecessary for semantic analysis, which occurs when calculating the conventional "clause lattice", and realizes efficient sentence analysis processing. It is supposed to be done.

次いで、同構成の文音声認識システムの動作について説
明する。文節検出部１２では、後述の文節系列格納部１
４及び文節検出開始条件作成部１６とから作成される文
節系列開始条件の情報の中の、検出開始位置の情報と検
出開始時点近傍の音素情報を参照することにより、必要
な区間のみについての文節候補の検出を行なう。次の意
味解析部１３では、得られた文節候補と、文節系列格納
部１４に格納されている現解析時点までに蓄積されてい
る文節系列群の要素との、言語的及び位置的な連接可能
性を調べ、もし連結が可能ならば、上記文節系列群の要
素に上記文節候補を接続し、新たな文節系列として文節
系列格納部１４に格納する。これと同虹、「係り受け」
解析結果である意味表現も文節系列格納部１４に格納す
る。また、文節検出開始条件作成部１６は、上記文節検
出部１２が必要としている文節検出開始位置の情報と、
文節検出位置近傍の音素の情報とを文節系列格納部１４
かも作成する機能を持つ。Next, the operation of the sentence speech recognition system having the same configuration will be explained. In the phrase detection unit 12, the phrase series storage unit 1, which will be described later,
By referring to the information on the detection start position and the phoneme information in the vicinity of the detection start point in the information on the bunsetsu sequence start conditions created by the bunsetsu detection start condition creation unit 16 and the bunsetsu detection start condition creation unit 16, Detect candidates. Next, in the semantic analysis section 13, it is possible to link the obtained clause candidates linguistically and positionally with the elements of the clause series group stored in the clause series storage section 14 and accumulated up to the time of the current analysis. If connection is possible, the clause candidate is connected to the element of the clause series group and stored in the clause series storage unit 14 as a new clause series. The same rainbow as this, "Kariuke"
The semantic expression that is the analysis result is also stored in the phrase series storage unit 14. In addition, the phrase detection start condition creation unit 16 includes information on the phrase detection start position required by the phrase detection unit 12,
Information on phonemes near the phrase detection position is stored in the phrase sequence storage unit 14.
It also has the ability to create.

以上の操作は、入力音声の始端もしくは終端のいずれか
の端点位置に端点を持つ文節候補から開始され、他方の
端点位置に端点を持つ文節候補に関する解析が終了する
まで続けられる。処理が終了した時点で、上記文節系列
の妥当性の評価値が第１位の文節系列を認識結果とし、
併せて第４図において既に示した形の意味表現を意味解
析結果として得られることになる。The above operations are started from a clause candidate having an end point at either the start or end position of the input speech, and are continued until the analysis of clause candidates having an end point at the other end position is completed. When the processing is completed, the phrase series with the highest validity evaluation value of the phrase series is set as the recognition result,
At the same time, the semantic expression of the form already shown in FIG. 4 can be obtained as a result of semantic analysis.

第１図の構成で実現される文認識機能を実現するアルゴ
リズムの例を第２図に示す。第２図に示す例では、解析
処理を文末から文頭方向へ進める場合について説明して
いるが、用語「文末」を「文頭」に、「文頭」を「文末
」に読み替えることにより、文頭から文末方向へ解析処
理を進めることももちろん可能である。説明の準備のた
め、第２図で用いる用語及び記号の定義を行なう。FIG. 2 shows an example of an algorithm that implements the sentence recognition function realized with the configuration shown in FIG. The example shown in Figure 2 explains the case where the analysis process proceeds from the end of the sentence to the beginning of the sentence. Of course, it is also possible to proceed with the analysis process in this direction. In preparation for the explanation, terms and symbols used in FIG. 2 will be defined.

用語の定義；［定義１］「文節候補すが文末条件を満たす」とは、文節すが文法
的にも位置的にも文末に存在可能であることを言う。Definition of Terms; [Definition 1] "The clause candidate satisfies the sentence-final condition" means that the clause can exist at the end of the sentence both grammatically and positionally.

［定義２］「文節候補すが文頭条件を満たす」とは、文節すが文法
的にも位置的にも文頭に存在可能であることを言う。[Definition 2] "The phrase candidate satisfies the sentence-initial condition" means that the phrase can exist at the beginning of a sentence both grammatically and positionally.

［定義３］「文節候補すが文節系列Ｂに左連接可能である」とは、
文節系列Ｂの左端に文節すを文法的に接続することが可
能であることを言う。[Definition 3] “The bunsetsu candidate can be left connected to bunsetsu series B” means:
This means that it is possible to grammatically connect the bunsetsu s to the left end of the bunsetsu series B.

記号の定義；（Ｂｎ）：文節数ｎの・文節系列の集合（Ｆ）　：文候
補の集合（ｂ）　二文節候補の集合［］：［］の中の演算を優先するｂｌｅｉｌ　　：文末条件［定義１］を満たす文節候補
ｂｅａｍ　（Ｂ、　　）　　　：　（Ｂ、　　ｌの要素
で枝刈りの対象になら々かった（残った）文節系列の集合ｈｅａｄ　（Ｂ、　　ｌ　　　：　（Ｂ、　　）の要素
で文頭条件［定義２コを満たす文節系列の集合ａｄｊＩｅ（ｔ　（Ｂ、　　）　　：少なくとも（Ｂ、
）の１個の要素に左連接可能［定義３］な文節候補の集合（ｂｌ［有］（Ｂ、ｌ：（ｂｌと（Ｂ、）の直積集合の
中の連接条件を満たす要素について文節候補と文部系列を連接して作成した文節系列の候補の集合ｌＪ：和集合演算第２図に示したアルゴリズムでは、文節数がｎ−１（ｎ
≧１）の文節系列に１個の文節を接続して文節数Ｎの文
節系列を作成する処理を１つの単位としている。この１
つの単位の処理を図中では「第ｎ文節の処理」と表現し
、ｎ　＝　１から図中に示しである終了条件が満たされ
るまでこの処理を繰り返す。Definition of symbols; (Bn): Set of clause sequences with n clauses (F): Set of sentence candidates (b) Set of two clause candidates []: bleil that gives priority to operations in []: Sentence-final condition [ Clause candidate beam (B, ) that satisfies [Definition 1]: (B, l: An element of (B, ) which is a set of clause series that were not (remained) subject to pruning (B, l: An element of (B, ) Sentence-initial condition [Set of clause sequences that satisfy definition 2 adjIe(t (B, ): at least (B,
A set of phrase candidates that can be left connected to one element of A set of phrase sequence candidates created by concatenating sentence sequence and sentence sequence lJ: Union set operation In the algorithm shown in Figure 2, the number of phrases is n-1 (n
One unit is the process of connecting one clause to the clause series of ≧1) to create a clause series of N clauses. This one
Processing for one unit is expressed as "n-th clause processing" in the figure, and this process is repeated from n = 1 until the end condition shown in the figure is satisfied.

次に、第１図に示した本発明の実施例と第２図に示した
本発明によるアルゴリズムとの関連について説明すると
、第２図中の文節の集合（Ｎは第１図中の「文節検出」
によって行なわれる。文節の集合［ｂｌの検出は第２図
の文末の文節候補ｂ　ｊａｉｌおよび文節系列と文節候
補の隣接条件を表現する記号ａｄｊｌｅｆｔの機能は、
ｉ１１図中の「文節検出開始条件作成」の出力する文節
検出開始条件及び「意味解析」によって実現される。即
ち、文節系列と文節候補の間の位置的な隣接関係は「文
節検出開始条件」により、また意味的な隣接関係は「意
味解析」により、文節系列と文節候補との境界に関する
条件として決定される。また第２図中の記号１１ｎで表
現される第ｎ文節を採択した直後の時点における文節系
列は第１図中の「文節系列格納」の中に格納され、次の
段階の処理即ち第ｎ　−１文節採択の処理を行なうため
の文節候補採択条件として利用される。Next, to explain the relationship between the embodiment of the present invention shown in FIG. 1 and the algorithm according to the present invention shown in FIG. detection"
It is carried out by Detection of a set of clauses [bl is the clause candidate b jail at the end of the sentence in Figure 2. The function of the symbol adjleft, which expresses the adjacency condition between the clause sequence and the clause candidate, is as follows.
This is realized by the clause detection start conditions and the "semantic analysis" output in "Create Clause Detection Start Conditions" in Figure i11. In other words, the positional adjacency relationship between the bunsetsu sequence and the bunsetsu candidate is determined by the ``bunsetsu detection start condition'', and the semantic adjacency relationship is determined by the ``semantic analysis'' as a condition regarding the boundary between the bunsetsu sequence and the bunsetsu candidate. Ru. Furthermore, the phrase sequence at the time immediately after adopting the n-th phrase represented by the symbol 11n in FIG. 2 is stored in the "Phrase series storage" in FIG. It is used as a phrase candidate selection condition for the process of selecting one phrase.

以上、本発明の実施例について述べたが、本発明はこれ
に必ずしも限定されるものではない。即ち、例えば、対
話のよった語句の省略を伴う音声や、文節ごとに区切ら
れて発声された文音声なども、本発明の認識対象に含ま
れる。Although the embodiments of the present invention have been described above, the present invention is not necessarily limited thereto. That is, for example, speech that involves the omission of words or phrases that occur in dialogue, or sentence speech that is uttered divided into phrases are also included in the recognition targets of the present invention.

発明の効果以上述べてきたように、本発明によれば、従来別個の処
理として行なわれていた「文節ラティス」作成の処理と
意味解析の処理を統合し、新たに文節の検出と意味解析
の処理と同時進行的に行なうことにより、従来の方式に
おける無駄な文節候補の検出を回避し、効率の良い文解
析機構を実現することが可能となる。Effects of the Invention As described above, according to the present invention, the process of creating a "bunsetsu lattice" and the process of semantic analysis, which were conventionally performed as separate processes, are integrated, and a new process of detecting and analyzing the clauses is performed. By performing this simultaneously with the processing, it is possible to avoid unnecessary detection of phrase candidates in conventional methods and realize an efficient sentence analysis mechanism.

[Brief explanation of drawings]

第１図は本発明の一実施例における文音声認識システム
の機能ブロック図、第２図は、本発明を文音声認識シス
テムに適用した場合のアルゴリズムの一例を示す図、第
３図は従来の文音声認識システムの典型的な構成例を示
した機能ブロック図、第４図は文音声に対する音素認識
結果と「文節ラティス」の例を示した図、第６図はその
意味解析結果を示す図である。１２・・・文節検出部、１３・・・意味解析部、１４・
・・文節系列格納部。FIG. 1 is a functional block diagram of a sentence speech recognition system according to an embodiment of the present invention, FIG. 2 is a diagram showing an example of an algorithm when the present invention is applied to a sentence speech recognition system, and FIG. 3 is a block diagram of a conventional sentence speech recognition system. A functional block diagram showing a typical configuration example of a sentence speech recognition system. Figure 4 is a diagram showing the phoneme recognition results for sentence sounds and an example of a "bunsetsu lattice." Figure 6 is a diagram showing the semantic analysis results. It is. 12... Clause detection section, 13... Semantic analysis section, 14.
...Bunsetsu series storage section.

Claims

[Claims]

A clause detection means detects interval information and reliability information of clause candidates from the audio signal, and constraints regarding the positional relationship between clause candidates among the clause candidates obtained by the clause detection means and linguistic information such as grammar or meaning. The system includes a semantic analysis means for examining constraint conditions, and a clause series storage means for storing a clause series created by connecting clause candidates as a series of clauses satisfying the constraint conditions between clauses. A speech recognition method characterized in that the sentence analysis process is proceeded by detecting phrase candidates in only necessary sections by the phrase detecting means based on information on the end points of a phrase series that is being analyzed.