JPH03180899A - Voice recognizing method - Google Patents

Voice recognizing method

Info

Publication number
JPH03180899A
JPH03180899A JP1321685A JP32168589A JPH03180899A JP H03180899 A JPH03180899 A JP H03180899A JP 1321685 A JP1321685 A JP 1321685A JP 32168589 A JP32168589 A JP 32168589A JP H03180899 A JPH03180899 A JP H03180899A
Authority
JP
Japan
Prior art keywords
clause
sentence
candidates
phrase
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1321685A
Other languages
Japanese (ja)
Inventor
Tatsuya Kimura
達也 木村
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP1321685A priority Critical patent/JPH03180899A/en
Publication of JPH03180899A publication Critical patent/JPH03180899A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To analyze a sentence efficiently for sentence voice recognition by providing a clause sequence storage part, a clause detection start condition generation part, etc., and carrying out processes for clause detection and meaning analysis simultaneously, progressively, and integrally. CONSTITUTION:The clause detection part 12 of a meaning expression output part 11 detects a clause candidate from the phoneme sequence of an input sentence voice recognized by a phoneme recognition part 10 and a meaning analysis part 13 checks the possibility of linguistic and position-concerned connections between the clause candidate and a clause sequence group which is already stored in a clause sequence storage part 14, so that clauses which can be connected are stored in the storage part 14 together with the meaning expression of modification analytic results. At the same time, the clause detection start condition generation part 15 generates clause start conditions according to the storage contents of the storage part 14 and supplies them to a clause detection part 12, and clause candidates which do not match the conditions are inhibited from being detected. Those clause detecting process and meaning analyzing process are carried out simultaneously, progressively, and integrally to inhibit unnecessary clause detection from being performed, thereby analyzing sentences efficiently.

Description

【発明の詳細な説明】 産業上の利用分野 本発明は、連続発声された文音声を主な認識対象とする
場合の音声認識方法に関する。
DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to a speech recognition method when the main object of recognition is continuously uttered sentence speech.

従来の技術 従来、文音声を認識する手法に関しては、種々の方法が
提案されている。
BACKGROUND OF THE INVENTION Conventionally, various methods have been proposed for recognizing sentence speech.

日本語文を認識対象とした場合のその中の1つの典型的
な方法は、まず入力音声のどの部分区間がどの文節に該
当するかを全音声区間について推定して文節候補の集合
を作成しておき、その文節候補の集合より、時間的な位
置関係と文法や意味などの言語的な制約(一般に文節間
の掛り受は規則を指す場合が多い)を満足する一連の系
列をなす文節からなる部分集合を、認識結果として推定
する方法である。
One typical method when recognizing Japanese sentences is to first estimate which subsections of the input speech correspond to which clauses for all speech intervals, and then create a set of clause candidates. The set of phrase candidates consists of a series of phrases that satisfy the temporal positional relationship and linguistic constraints such as grammar and meaning (in general, the overlap between phrases often refers to rules). This is a method of estimating a subset as a recognition result.

時間的な位置情報を伴った文節の候補の集合のことを、
「文節ラティス」と呼ぶことにすると、「文節ラティス
」から得られる上記諸制約を同時に満たす文節の系列は
、一般には複数個存在するが、推定の妥当性を評価する
尺度(例えば確率論的解釈によって導かれる尺度(後述
する)をあらかじめ用意しておき、その評価値にしたが
って、文節候補の順位付けをし、1位の候補を認識結果
として採択することがよく行なわれる。
A set of clause candidates with temporal position information is
If we call it the "Bunsetsu lattice," there are generally multiple sets of clauses that simultaneously satisfy the above constraints obtained from the "Bunsetsu lattice." It is common practice to prepare in advance a scale (described later) derived from the above, rank the phrase candidates according to the evaluation value, and select the first-ranked candidate as the recognition result.

ところで、以上述べた方法を実現するためには・以下の
具体的課題を明確にする必要がある。
By the way, in order to realize the method described above, it is necessary to clarify the following specific issues.

(1)文節ラティスから最終結果を得る具体的な方法。(1) A specific method to obtain the final result from the bunsetsu lattice.

(2)音声信号から文節ラティスを作成する方法。(2) A method of creating a bunsetsu lattice from audio signals.

(3)利用する文法の種類。(3) Type of grammar used.

本発明で取り扱う内容である(1)としては、従来、「
文節ラティス」を構成する全ての文節について総当たり
的に時間的位置関係及び言語的制約のチエツクを行なう
方法や、文頭から文末の方向へグラフ理論の分野におけ
る木探索又は経路探索の方法を用いて、文頭より文末の
方向にしたがって上記言語的制約のチエツクを文節候補
に適用する手法などが知られている(例えば1979年
刊、新美著「音声認識」共立出版発行)。
As for (1), which is the subject of the present invention, conventionally, “
This method uses a brute force method to check the temporal positional relationships and linguistic constraints of all the clauses that make up the clause lattice, and a tree search or path search method in the field of graph theory from the beginning of the sentence to the end. , a method is known in which the above-mentioned linguistic constraint check is applied to clause candidates from the beginning to the end of the sentence (for example, "Speech Recognition" by Niimi, published by Kyoritsu Shuppan, published in 1979).

しかし、前者は演算効率の点で、後者は日本語の言語構
造、即ち「係り受け」における「係り」に相当する文節
が「受け」に相当する文節より前に位置するという日本
語特有の言語構造に対する整合性の面でそれぞれ問題が
あり、日本語文音声認識の実現に際し、必ずしも満足の
いく結果を得るには至っていなかった。
However, the former is due to computational efficiency, and the latter is due to the linguistic structure of Japanese, in which the clause corresponding to ``kari'' in ``modariuke'' is placed before the clause corresponding to ``uke'', a language unique to Japanese. Each method has problems in terms of consistency with respect to its structure, and it has not always been possible to obtain satisfactory results when realizing Japanese sentence speech recognition.

これを解決するため、本発明者により、上記探索の方法
に関し、文末から文頭の方向で、文節間の係り受は構造
規則の制約条件のもとて探索を行ない文節候補の系列を
得る方法が提案されている。
In order to solve this problem, the present inventor proposed a method to obtain a sequence of clause candidates by searching from the end of the sentence to the beginning of the sentence, subjecting the dependencies between clauses to the constraints of the structure rules. Proposed.

つまり、この方法は、日本語の言語構造に適しかつ効率
が良いという特長を有している。
In other words, this method has the advantage of being suitable for the language structure of Japanese and being efficient.

即ち、第3図はこの方法の文音声認識システムを示し、
音素認識部1は入力された音声信号より音素認識を行な
い音素系列を出力する手段を有している。得られた音素
系列は完全に正しいことは保証されず、話者や周囲雑音
あるいは音素認識装置自体の性能に起因するある確率で
誤りを含む。
That is, FIG. 3 shows a sentence speech recognition system using this method,
The phoneme recognition unit 1 has means for performing phoneme recognition from an input audio signal and outputting a phoneme sequence. The obtained phoneme sequence is not guaranteed to be completely correct, and has a certain probability of containing errors due to the speaker, ambient noise, or the performance of the phoneme recognition device itself.

そのため、従来の文字入力を対象とした自然言語処理技
術による構文解析や意味解析の手法をそのまま適用する
ことはできず、文音声認識特有の処理方法が必要となる
Therefore, it is not possible to directly apply conventional syntactic analysis and semantic analysis techniques based on natural language processing technology for character input, and a processing method specific to sentence speech recognition is required.

文節ラティス作成部2は誤りを含む認識音素系列の中の
どの部分がどの文節に該当する可能性が高いかを推定し
て文節候補を得る機能を持つ。推定された文節は始端及
び終端に関する時間的位置情報並びに推定の妥当性に関
する情報を伴う。文節推定の妥当性は、例えば音素の置
換、付加、脱落の各誤り率から得られる音素間のコンツ
ー−ジョンマトリクスから導かれる尤度などで与えられ
る。ここでは、文節候補は文頭から文末に至る全範囲に
わたって推定の妥当性が所定の値を越えるものが、全て
得られるものとする。
The clause lattice creation unit 2 has a function of obtaining clause candidates by estimating which part of the recognized phoneme sequence containing errors is likely to correspond to which clause. The estimated phrase is accompanied by temporal position information regarding the beginning and end, as well as information regarding the validity of the estimation. The validity of phrase estimation is given, for example, by the likelihood derived from a concussion matrix between phonemes obtained from each error rate of phoneme substitution, addition, and omission. Here, it is assumed that all phrase candidates whose estimated validity exceeds a predetermined value over the entire range from the beginning of the sentence to the end of the sentence are obtained.

意味解析部3では、位置関係による制約及び言語的な制
約にしたがって「文節ラティス」の中の文節候補を連接
していく操作により、最終的な結果である文節の系列を
得る。言語的制約を決定する文法(ルール)としては、
例えばFillmoreによる格文法(後述)や、5c
hankによる概念依存文法などが考えられる。
The semantic analysis unit 3 obtains a series of clauses as the final result by concatenating clause candidates in the "clause lattice" according to positional relationship constraints and linguistic constraints. The grammar (rules) that determine linguistic constraints are:
For example, Fillmore's case grammar (described later), 5c
A concept-dependent grammar using hank can be considered.

第4図に連続発声による例文「新大阪まで切符を3枚下
さい。」を入力した時の音素認識結果例及び「文節ラテ
ィス」の例を示す。図中、「文節ラティス」においてア
/ダーラインが施されている文節が正解の文節である。
FIG. 4 shows an example of a phoneme recognition result and an example of a ``bunsetsu lattice'' when an example sentence ``Please give me three tickets to Shin-Osaka'' is input by continuous utterance. In the figure, the phrases with a/dare lines in the "Phrase lattice" are the correct phrases.

また、この例では言語的制約のチエツクに、格文法を用
いているので、格文法について簡単tg説明をしておく
Also, in this example, a case grammar is used to check linguistic constraints, so a brief explanation of the case grammar will be given below.

格文法は述語と他の語句とがどのような関係で共存しつ
るかを記述する文法である。即ち、文における主語、述
語、目的語、補語といった役割で考えるのではなく、述
語にとって意味の上から各単語がどのような立場に立つ
かを考えろ方法による文の解析手段である。格文法は上
述のように、構文解析よりも意味解析に重点をおいてい
るため、(日本語)文音声認識への応用を考えた場合、
以下に示すような利点がある。
Case grammar is a grammar that describes how predicates and other words coexist. In other words, it is a method of analyzing sentences that does not consider the roles of subjects, predicates, objects, and complements in a sentence, but rather considers the position of each word in terms of meaning for the predicate. As mentioned above, case grammar emphasizes semantic analysis rather than syntactic analysis, so when considering its application to (Japanese) sentence speech recognition,
It has the following advantages.

(1)単語間の意味的な共起関係を利用した単語候補の
絞り込みを導入しやすい。
(1) It is easy to narrow down word candidates using semantic co-occurrence relationships between words.

(2)語順に対する自由度が大きいため、日本語を取り
扱い易い。
(2) It is easy to handle Japanese because there is a large degree of freedom regarding word order.

(3)結果が意味の形で得られるので対話システムなど
への組み込みが容易である。
(3) Since the results are obtained in the form of meaning, it is easy to incorporate into dialogue systems, etc.

第6図には、この格文法による意味解析結果の−例を示
してあり、−行目の(MODE MEIIIEI)はこ
の文が命令文であることを表わしている。また2行目(
7)(ACT KIJDASARU)は述語が「下さる
」という行為を意味していることを表わしている。また
、3行目の(OBJECT KIPPU)以下は、述語
「下さる」という行為の内容の詳細を説明するための機
能を持つ句または節に関する記述である。即ち、行為「
下さる」の対象物は「切符」であり、その枚数は「3枚
」であり、行き先は「新大阪」であることを表わしてい
る。
FIG. 6 shows an example of the result of semantic analysis using this case grammar, where (MODE MEIIIEI) in the -th line indicates that this sentence is an imperative sentence. Also, the second line (
7) (ACT KIJDASARU) indicates that the predicate means the act of ``giving.'' Furthermore, the third line (OBJECT KIPPU) and the following are descriptions of phrases or clauses that have the function of explaining the details of the action of the predicate "Gaseru". That is, the act ``
The object of ``Give me'' is ``tickets'', the number of which is ``3'', and the destination is ``Shin-Osaka''.

格文法では、各動詞について、それを意味的な詳細を説
明することが可能な項目を用意しておき、文中の各単語
を該当する項目に順次にあてはめていく操作により意味
解析が進められる。この各動詞ごとに存在する項目のこ
とを、「充填のための溝」という意味で「格スロット」
と呼ぶ。また文中の各単語を踪当する項目に順次当ては
めていく操作のことを、「スロットを埋める操作」とい
う意味で「スロットフィリング」と呼ぶ。本明細書でも
、以後、これらの用語を使用することにする。
In case grammar, for each verb, items are prepared that can explain the meaning in detail, and semantic analysis proceeds by sequentially applying each word in a sentence to the corresponding item. These items that exist for each verb are called ``case slots,'' meaning ``grooves for filling.''
It is called. Also, the operation of sequentially applying each word in a sentence to the corresponding item is called ``slot filling'', which means ``slot filling operation.'' These terms will also be used hereinafter.

上記言語的制約のチエツクとは、後述する本発明の実施
例においては、「スロットフィリング」が取立するか否
かのチエツクのことを示す。
In the embodiment of the present invention described later, the above-mentioned linguistic restriction check refers to a check as to whether "slot filling" is accepted.

なお、本発明で取り扱う言語処理系においては、以上に
説明した文節間の格関係のみならず、「赤い花」の例の
ような連体修飾関係や、「美しく咲く」のような連用修
飾関係、あるいは「切符を買って、京都へ行く」のよう
な接続関係も格文法における「スロットフィリング」と
同じ概念で取り扱う。いい換えると、本発明では、活用
語のみならず、被修飾語にもスロットを持たせて係り受
は関係を解析する枠組みとなっている。
Note that the language processing system handled by the present invention not only handles case relations between clauses as explained above, but also adjunctive modification relations such as the example of "red flower", conjunctive modification relations such as "blooms beautifully", Also, conjunctions such as ``buy a ticket and go to Kyoto'' are treated with the same concept as ``slot filling'' in case grammar. In other words, in the present invention, not only conjugated words but also modified words have slots, and modification is a framework for analyzing relationships.

「文節ラティス」の文節候補の連結処理は、文末に位置
するあらかじめ定められた個数の文節候補群を起点にし
て、文頭方向に向けて段階的に行なう。
The process of concatenating clause candidates in the "Bunsetsu lattice" is performed in stages from a predetermined number of clause candidates located at the end of the sentence toward the beginning of the sentence.

発明が解決しようとする課題 ところで、上述の従来の方法の場合は、音声信号から「
文節ラティス」を作成する段階で、意味解析に不要な文
節候補も検出されるため、認識処理の効率低下を招くと
いう課題があった。
Problems to be Solved by the Invention However, in the case of the above-mentioned conventional method, it is difficult to
At the stage of creating a bunsetsu lattice, clause candidates that are unnecessary for semantic analysis are also detected, which poses the problem of reducing the efficiency of recognition processing.

本発明は、このような従来の課題に鑑み、なされたもの
で・その目的とするところは、従来別個の処理として行
なわれていた「文節ラティス」作成の処理と意味解析の
処理を統合し、新たに文節の検出と意味解析の処理と同
時進行的に行なう手段を設けることにより無駄な文節候
補の検出を回避し、効率の良い分解析機構の実現を図る
ことが可能に音声認識方法を得ることにある。
The present invention has been made in view of these conventional problems, and its purpose is to integrate the process of creating a "bunsetsu lattice" and the process of semantic analysis, which were conventionally performed as separate processes, To obtain a speech recognition method that avoids unnecessary detection of phrase candidates and realizes an efficient segmentation analysis mechanism by providing a new means for simultaneously performing phrase detection and semantic analysis processing. There is a particular thing.

課題を解決するための手段 上記目的を達成するため、本発明の音声認識方法は、音
声信号から文節候補の区間情報と信頼度情報を検出する
文節検出手段と、文節検出手段より得られた文節候補の
中の文節候補間の位置関係に関する制約条件及び文法あ
るいは意味などの言語的な制約条件を調べる意味解析手
段と、文節間の制約条件を満たす文節の系列として文節
候補を接続することにより作成される文節系列を格納す
る文節系列格納手段とを備え、文節系列格納手段に格納
されている解析途中の文節系列の端点の情報をもとに、
必要な区間のみの文節候補を上記文節検出手段により検
出することにより文解析処理を進めることを提案するも
のである。
Means for Solving the Problems In order to achieve the above object, the speech recognition method of the present invention includes a phrase detection means for detecting section information and reliability information of phrase candidates from an audio signal, and a phrase detection means for detecting interval information and reliability information of phrase candidates from a speech signal. Created by connecting phrase candidates as a series of phrases that satisfy the constraint conditions between phrases with a semantic analysis means that examines constraints on the positional relationship between phrase candidates in the candidates and linguistic constraints such as grammar or meaning. and a clause series storage means for storing a clause series to be analyzed, and based on the information of the end point of the clause series that is being analyzed, which is stored in the clause series storage means,
It is proposed that the sentence analysis process be carried out by detecting clause candidates in only the necessary sections using the clause detecting means.

作用 上記音声認識方法により、以下の作用にしたがって効率
良く、更に認識精度の高い日本語文音声の認識を実現す
るようになる。
Effects The above-mentioned speech recognition method realizes efficient recognition of Japanese sentence speech with higher recognition accuracy according to the following effects.

(1)文節候補を段階的に採択することにより複数個の
文節系列を同時進行的に作成しながら解析処理を行なう
ことができる。
(1) By selecting clause candidates in stages, it is possible to perform analysis processing while simultaneously creating a plurality of clause sequences.

(2)  (1)の処理の段階で文節間の「係り受け]
解析を完了できる。
(2) At the processing stage of (1), “dependency” between bunsetsu clauses
Analysis can be completed.

(3)解析の途中過程で文節系列の推定妥当性の値が一
定の評価基準に達しない文節系列については以降の処理
を打ち切ることができる。
(3) For phrase sequences whose estimated validity value does not reach a certain evaluation standard during the course of analysis, subsequent processing can be discontinued.

(4)文末側から文頭側への方向に解析処理を進め、文
節推定妥当性及び言語的な制約条件による制御によって
探索を行なうことができる。
(4) The analysis process proceeds from the end of the sentence to the beginning of the sentence, and the search can be performed under control based on clause estimation validity and linguistic constraints.

実施例 以下、第1図について本発明の実施例の詳細な説明する
EXAMPLE Hereinafter, an example of the present invention will be described in detail with reference to FIG.

第1図は本発明による文音声認識システムの概念であり
、音素認識部10は入力された音声信号より音素認識を
行ない音素系列を出力する手段であり、音素系列は誤認
識によりある確率で誤りを含んでいる。
FIG. 1 shows the concept of a sentence speech recognition system according to the present invention. The phoneme recognition unit 10 is a means for performing phoneme recognition from an input speech signal and outputting a phoneme sequence. The phoneme sequence has a certain probability of being incorrect due to misrecognition. Contains.

図中点線の意味表現出力部11は、文節検出部12、意
味解析部13、文節系列格納部14、文節検出開始条件
作成部16から構成され、誤りを含む音素系列から認識
結果である意味表現を得る機能を有する。そして、上記
意味表現出力部11の各構成要素により従来の「文節ラ
ティス」を求めろ際に生じていた、意味解析に不必要へ
文節候補の検出を回避し、効率の良い文解析処理を実現
するようになっている。
The semantic expression output unit 11 indicated by the dotted line in the figure is composed of a phrase detection unit 12, a meaning analysis unit 13, a phrase sequence storage unit 14, and a phrase detection start condition creation unit 16, and is a semantic expression that is a recognition result from a phoneme sequence containing errors. It has the function of obtaining Then, each component of the semantic expression output unit 11 avoids the detection of clause candidates that are unnecessary for semantic analysis, which occurs when calculating the conventional "clause lattice", and realizes efficient sentence analysis processing. It is supposed to be done.

次いで、同構成の文音声認識システムの動作について説
明する。文節検出部12では、後述の文節系列格納部1
4及び文節検出開始条件作成部16とから作成される文
節系列開始条件の情報の中の、検出開始位置の情報と検
出開始時点近傍の音素情報を参照することにより、必要
な区間のみについての文節候補の検出を行なう。次の意
味解析部13では、得られた文節候補と、文節系列格納
部14に格納されている現解析時点までに蓄積されてい
る文節系列群の要素との、言語的及び位置的な連接可能
性を調べ、もし連結が可能ならば、上記文節系列群の要
素に上記文節候補を接続し、新たな文節系列として文節
系列格納部14に格納する。これと同虹、「係り受け」
解析結果である意味表現も文節系列格納部14に格納す
る。また、文節検出開始条件作成部16は、上記文節検
出部12が必要としている文節検出開始位置の情報と、
文節検出位置近傍の音素の情報とを文節系列格納部14
かも作成する機能を持つ。
Next, the operation of the sentence speech recognition system having the same configuration will be explained. In the phrase detection unit 12, the phrase series storage unit 1, which will be described later,
By referring to the information on the detection start position and the phoneme information in the vicinity of the detection start point in the information on the bunsetsu sequence start conditions created by the bunsetsu detection start condition creation unit 16 and the bunsetsu detection start condition creation unit 16, Detect candidates. Next, in the semantic analysis section 13, it is possible to link the obtained clause candidates linguistically and positionally with the elements of the clause series group stored in the clause series storage section 14 and accumulated up to the time of the current analysis. If connection is possible, the clause candidate is connected to the element of the clause series group and stored in the clause series storage unit 14 as a new clause series. The same rainbow as this, "Kariuke"
The semantic expression that is the analysis result is also stored in the phrase series storage unit 14. In addition, the phrase detection start condition creation unit 16 includes information on the phrase detection start position required by the phrase detection unit 12,
Information on phonemes near the phrase detection position is stored in the phrase sequence storage unit 14.
It also has the ability to create.

以上の操作は、入力音声の始端もしくは終端のいずれか
の端点位置に端点を持つ文節候補から開始され、他方の
端点位置に端点を持つ文節候補に関する解析が終了する
まで続けられる。処理が終了した時点で、上記文節系列
の妥当性の評価値が第1位の文節系列を認識結果とし、
併せて第4図において既に示した形の意味表現を意味解
析結果として得られることになる。
The above operations are started from a clause candidate having an end point at either the start or end position of the input speech, and are continued until the analysis of clause candidates having an end point at the other end position is completed. When the processing is completed, the phrase series with the highest validity evaluation value of the phrase series is set as the recognition result,
At the same time, the semantic expression of the form already shown in FIG. 4 can be obtained as a result of semantic analysis.

第1図の構成で実現される文認識機能を実現するアルゴ
リズムの例を第2図に示す。第2図に示す例では、解析
処理を文末から文頭方向へ進める場合について説明して
いるが、用語「文末」を「文頭」に、「文頭」を「文末
」に読み替えることにより、文頭から文末方向へ解析処
理を進めることももちろん可能である。説明の準備のた
め、第2図で用いる用語及び記号の定義を行なう。
FIG. 2 shows an example of an algorithm that implements the sentence recognition function realized with the configuration shown in FIG. The example shown in Figure 2 explains the case where the analysis process proceeds from the end of the sentence to the beginning of the sentence. Of course, it is also possible to proceed with the analysis process in this direction. In preparation for the explanation, terms and symbols used in FIG. 2 will be defined.

用語の定義; [定義1] 「文節候補すが文末条件を満たす」とは、文節すが文法
的にも位置的にも文末に存在可能であることを言う。
Definition of Terms; [Definition 1] "The clause candidate satisfies the sentence-final condition" means that the clause can exist at the end of the sentence both grammatically and positionally.

[定義2] 「文節候補すが文頭条件を満たす」とは、文節すが文法
的にも位置的にも文頭に存在可能であることを言う。
[Definition 2] "The phrase candidate satisfies the sentence-initial condition" means that the phrase can exist at the beginning of a sentence both grammatically and positionally.

[定義3] 「文節候補すが文節系列Bに左連接可能である」とは、
文節系列Bの左端に文節すを文法的に接続することが可
能であることを言う。
[Definition 3] “The bunsetsu candidate can be left connected to bunsetsu series B” means:
This means that it is possible to grammatically connect the bunsetsu s to the left end of the bunsetsu series B.

記号の定義; (Bn):文節数nの・文節系列の集合(F) :文候
補の集合 (b) 二文節候補の集合 []:[]の中の演算を優先する bleil  :文末条件[定義1]を満たす文節候補
beam (B、  )   : (B、  lの要素
で枝刈りの対象になら々かった(残っ た)文節系列の集合 head (B、  l   : (B、  )の要素
で文頭条件[定義2コを満たす文節系 列の集合 adjIe(t (B、  )  :少なくとも(B、
)の1個の要素に左連接可能[定義 3]な文節候補の集合 (bl[有](B、l:(blと(B、)の直積集合の
中の連接条件を満たす 要素について文節候補と文 部系列を連接して作成した 文節系列の候補の集合 lJ:和集合演算 第2図に示したアルゴリズムでは、文節数がn−1(n
≧1)の文節系列に1個の文節を接続して文節数Nの文
節系列を作成する処理を1つの単位としている。この1
つの単位の処理を図中では「第n文節の処理」と表現し
、n = 1から図中に示しである終了条件が満たされ
るまでこの処理を繰り返す。
Definition of symbols; (Bn): Set of clause sequences with n clauses (F): Set of sentence candidates (b) Set of two clause candidates []: bleil that gives priority to operations in []: Sentence-final condition [ Clause candidate beam (B, ) that satisfies [Definition 1]: (B, l: An element of (B, ) which is a set of clause series that were not (remained) subject to pruning (B, l: An element of (B, ) Sentence-initial condition [Set of clause sequences that satisfy definition 2 adjIe(t (B, ): at least (B,
A set of phrase candidates that can be left connected to one element of A set of phrase sequence candidates created by concatenating sentence sequence and sentence sequence lJ: Union set operation In the algorithm shown in Figure 2, the number of phrases is n-1 (n
One unit is the process of connecting one clause to the clause series of ≧1) to create a clause series of N clauses. This one
Processing for one unit is expressed as "n-th clause processing" in the figure, and this process is repeated from n = 1 until the end condition shown in the figure is satisfied.

次に、第1図に示した本発明の実施例と第2図に示した
本発明によるアルゴリズムとの関連について説明すると
、第2図中の文節の集合(Nは第1図中の「文節検出」
によって行なわれる。文節の集合[blの検出は第2図
の文末の文節候補b jailおよび文節系列と文節候
補の隣接条件を表現する記号adjleftの機能は、
i11図中の「文節検出開始条件作成」の出力する文節
検出開始条件及び「意味解析」によって実現される。即
ち、文節系列と文節候補の間の位置的な隣接関係は「文
節検出開始条件」により、また意味的な隣接関係は「意
味解析」により、文節系列と文節候補との境界に関する
条件として決定される。また第2図中の記号11nで表
現される第n文節を採択した直後の時点における文節系
列は第1図中の「文節系列格納」の中に格納され、次の
段階の処理即ち第n −1文節採択の処理を行なうため
の文節候補採択条件として利用される。
Next, to explain the relationship between the embodiment of the present invention shown in FIG. 1 and the algorithm according to the present invention shown in FIG. detection"
It is carried out by Detection of a set of clauses [bl is the clause candidate b jail at the end of the sentence in Figure 2. The function of the symbol adjleft, which expresses the adjacency condition between the clause sequence and the clause candidate, is as follows.
This is realized by the clause detection start conditions and the "semantic analysis" output in "Create Clause Detection Start Conditions" in Figure i11. In other words, the positional adjacency relationship between the bunsetsu sequence and the bunsetsu candidate is determined by the ``bunsetsu detection start condition'', and the semantic adjacency relationship is determined by the ``semantic analysis'' as a condition regarding the boundary between the bunsetsu sequence and the bunsetsu candidate. Ru. Furthermore, the phrase sequence at the time immediately after adopting the n-th phrase represented by the symbol 11n in FIG. 2 is stored in the "Phrase series storage" in FIG. It is used as a phrase candidate selection condition for the process of selecting one phrase.

以上、本発明の実施例について述べたが、本発明はこれ
に必ずしも限定されるものではない。即ち、例えば、対
話のよった語句の省略を伴う音声や、文節ごとに区切ら
れて発声された文音声なども、本発明の認識対象に含ま
れる。
Although the embodiments of the present invention have been described above, the present invention is not necessarily limited thereto. That is, for example, speech that involves the omission of words or phrases that occur in dialogue, or sentence speech that is uttered divided into phrases are also included in the recognition targets of the present invention.

発明の効果 以上述べてきたように、本発明によれば、従来別個の処
理として行なわれていた「文節ラティス」作成の処理と
意味解析の処理を統合し、新たに文節の検出と意味解析
の処理と同時進行的に行なうことにより、従来の方式に
おける無駄な文節候補の検出を回避し、効率の良い文解
析機構を実現することが可能となる。
Effects of the Invention As described above, according to the present invention, the process of creating a "bunsetsu lattice" and the process of semantic analysis, which were conventionally performed as separate processes, are integrated, and a new process of detecting and analyzing the clauses is performed. By performing this simultaneously with the processing, it is possible to avoid unnecessary detection of phrase candidates in conventional methods and realize an efficient sentence analysis mechanism.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例における文音声認識システム
の機能ブロック図、第2図は、本発明を文音声認識シス
テムに適用した場合のアルゴリズムの一例を示す図、第
3図は従来の文音声認識システムの典型的な構成例を示
した機能ブロック図、第4図は文音声に対する音素認識
結果と「文節ラティス」の例を示した図、第6図はその
意味解析結果を示す図である。 12・・・文節検出部、13・・・意味解析部、14・
・・文節系列格納部。
FIG. 1 is a functional block diagram of a sentence speech recognition system according to an embodiment of the present invention, FIG. 2 is a diagram showing an example of an algorithm when the present invention is applied to a sentence speech recognition system, and FIG. 3 is a block diagram of a conventional sentence speech recognition system. A functional block diagram showing a typical configuration example of a sentence speech recognition system. Figure 4 is a diagram showing the phoneme recognition results for sentence sounds and an example of a "bunsetsu lattice." Figure 6 is a diagram showing the semantic analysis results. It is. 12... Clause detection section, 13... Semantic analysis section, 14.
...Bunsetsu series storage section.

Claims (1)

【特許請求の範囲】[Claims] 音声信号から文節候補の区間情報と信頼度情報を検出す
る文節検出手段と、文節検出手段より得られた文節候補
の中の文節候補間の位置関係に関する制約条件及び文法
あるいは意味などの言語的な制約条件を調べる意味解析
手段と、文節間の制約条件を満たす文節の系列として文
節候補を接続することにより作成される文節系列を格納
する文節系列格納手段とを備え、文節系列格納手段に格
納されている解析途中の文節系列の端点の情報をもとに
、必要な区間のみの文節候補を上記文節検出手段により
検出することにより文解析処理を進めることを特徴とす
る音声認識方法。
A clause detection means detects interval information and reliability information of clause candidates from the audio signal, and constraints regarding the positional relationship between clause candidates among the clause candidates obtained by the clause detection means and linguistic information such as grammar or meaning. The system includes a semantic analysis means for examining constraint conditions, and a clause series storage means for storing a clause series created by connecting clause candidates as a series of clauses satisfying the constraint conditions between clauses. A speech recognition method characterized in that the sentence analysis process is proceeded by detecting phrase candidates in only necessary sections by the phrase detecting means based on information on the end points of a phrase series that is being analyzed.
JP1321685A 1989-12-11 1989-12-11 Voice recognizing method Pending JPH03180899A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1321685A JPH03180899A (en) 1989-12-11 1989-12-11 Voice recognizing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1321685A JPH03180899A (en) 1989-12-11 1989-12-11 Voice recognizing method

Publications (1)

Publication Number Publication Date
JPH03180899A true JPH03180899A (en) 1991-08-06

Family

ID=18135281

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1321685A Pending JPH03180899A (en) 1989-12-11 1989-12-11 Voice recognizing method

Country Status (1)

Country Link
JP (1) JPH03180899A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20190079578A (en) * 2017-12-27 2019-07-05 사운드하운드, 인코포레이티드 Parse prefix-detection in a human-machine interface

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20190079578A (en) * 2017-12-27 2019-07-05 사운드하운드, 인코포레이티드 Parse prefix-detection in a human-machine interface
US11308960B2 (en) 2017-12-27 2022-04-19 Soundhound, Inc. Adapting an utterance cut-off period based on parse prefix detection
US11862162B2 (en) 2017-12-27 2024-01-02 Soundhound, Inc. Adapting an utterance cut-off period based on parse prefix detection

Similar Documents

Publication Publication Date Title
EP0977174B1 (en) Search optimization system and method for continuous speech recognition
US9330660B2 (en) Grammar fragment acquisition using syntactic and semantic clustering
US20040220809A1 (en) System with composite statistical and rules-based grammar model for speech recognition and natural language understanding
US8566076B2 (en) System and method for applying bridging models for robust and efficient speech to speech translation
US5875426A (en) Recognizing speech having word liaisons by adding a phoneme to reference word models
JPH05197389A (en) Voice recognition device
US7401019B2 (en) Phonetic fragment search in speech data
EP1475779B1 (en) System with composite statistical and rules-based grammar model for speech recognition and natural language understanding
Chow et al. Speech understanding using a unification grammar
US20040199374A1 (en) Automatic resolution of segmentation ambiguities in grammar authoring
CN112084791A (en) Dialog process intention extraction and utterance prompting method and system and electronic equipment thereof
JP4220151B2 (en) Spoken dialogue device
JP3088364B2 (en) Spoken language understanding device and spoken language understanding system
JPH03180899A (en) Voice recognizing method
Wang Mandarin spoken document retrieval based on syllable lattice matching
JP2000267693A (en) Voice processor and index preparation device
JP3577725B2 (en) Voice recognition device
KR100366703B1 (en) Human interactive speech recognition apparatus and method thereof
JPH10232693A (en) Voice recognition device
JP3766111B2 (en) Voice recognition device
JPH01260494A (en) Voice recognizing method
Jurafsky et al. Integrating experimental models of syntax, phonology, and accent/dialect in a speech recognizer
KR0136426B1 (en) Voice recognition method for hidden markov modeling method system
KR20000026814A (en) Method for separating word clause for successive voice recognition and voice recognition method using the method
JPH10254485A (en) Speaker normalizing device, speaker adaptive device and speech recognizer